Xidong Feng

Cited by

	All	Since 2019
Citations	333	333
h-index	10	10
i10-index	10	10

140

105

202020212022202320248 32 70 133 90

Public access

View all

6 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Yaodong YangBOYA (博雅) Assistant Professor at Peking UniversityVerified email at pku.edu.cn
Jun WangProfessor, Computer Science, University College LondonVerified email at cs.ucl.ac.uk
Stephen McAleerPostdoc, CMUVerified email at uci.edu
Ying WenAssociate Professor, Shanghai Jiao Tong UniversityVerified email at sjtu.edu.cn
Oliver SlumbersUniversity College LondonVerified email at ucl.ac.uk
Bo Liu (Benjamin Liu)Ph.D. student, National University of SingaporeVerified email at comp.nus.edu.sg
Jianming HUAssociate Professor of Department of Automation, Tsinghua UniversityVerified email at mail.tsinghua.edu.cn
Jakub Grudzien KubaUC BerkeleyVerified email at berkeley.edu
Hongyao TangMila/UdeMVerified email at mila.quebec
Haotian FuBrown UniversityVerified email at brown.edu
Mengyue YangUniversity College LondonVerified email at ucl.ac.uk

Xidong Feng

University College London

Verified email at ucl.ac.uk - Homepage

Large Language Model Reinforcement Learning Meta Learning Multi-agent Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning Y Chen, Y Yang, T Wu, S Wang, X Feng, J Jiang, SM McAleer, H Dong, ... NeurIPS 2022, 2022	60	2022
Vehicle trajectory prediction using intention-based conditional variational autoencoder X Feng, Z Cen, J Hu, Y Zhang 2019 IEEE Intelligent Transportation Systems Conference (ITSC), 3514-3519, 2019	46	2019
Neural Auto-Curricula X Feng, O Slumbers, Y Yang, Z Wan, B Liu, S McAleer, Y Wen, J Wang NeurIPS 2021, 2021	45*	2021
Towards effective context for meta-reinforcement learning: an approach based on contrastive learning H Fu, H Tang, J Hao, C Chen, X Feng, D Li, W Liu Proceedings of the AAAI Conference on Artificial Intelligence 35 (8), 7457-7465, 2021	43	2021
Mri reconstruction with interpretable pixel-wise operations using reinforcement learning W Li, X Feng, H An, XY Ng, YJ Zhang Proceedings of the AAAI conference on artificial intelligence 34 (01), 792-799, 2020	28	2020
Heterogeneous-agent mirror learning: A continuum of solutions to cooperative marl JG Kuba, X Feng, S Ding, H Dong, J Wang, Y Yang JMLR, 2022	21*	2022
Cmml: Contextual modulation meta learning for cold-start recommendation X Feng, C Chen, D Li, M Zhao, J Hao, J Wang Proceedings of the 30th ACM International Conference on Information …, 2021	21	2021
Alphazero-like tree-search can guide large language model decoding and training X Feng, Z Wan, M Wen, Y Wen, W Zhang, J Wang NeurIPS2023 FMDM workshop, 2023	15	2023
A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning X Feng, B Liu, J Ren, L Mai, R Zhu, J Wang, Y Yang NeurIPS 2022, 2021	12*	2021
ChessGPT: Bridging Policy Learning and Language Modeling X Feng, Y Luo, Z Wang, H Tang, M Yang, K Shao, D Mguni, Y Du, J Wang Advances in Neural Information Processing Systems 36, 2024	11	2024
Autonomous lane change decision making using different deep reinforcement learning methods X Feng, J Hu, Y Huo, Y Zhang CICTP 2019, 5563-5575, 2019	9	2019
Pangu-agent: A fine-tunable generalist agent with structured reasoning F Christianos, G Papoudakis, M Zimmer, T Coste, Z Wu, J Chen, ... arXiv preprint arXiv:2312.14878, 2023	7	2023
Torchopt: An efficient library for differentiable optimization J Ren, X Feng, B Liu, X Pan, Y Fu, L Mai, Y Yang JMLR Open Source Software, 2022	7	2022
Contextual Transformer for Offline Meta Reinforcement Learning R Lin, Y Li, X Feng, Z Zhang, XHW Fung, H Zhang, J Wang, Y Du, Y Yang NeurIPS2022 FMDM workshop, 2022	6*	2022
MANSA: learning fast and slow in multi-agent systems DH Mguni, H Chen, T Jafferjee, J Wang, L Yue, X Feng, SM Mcaleer, ... International Conference on Machine Learning, 24631-24658, 2023	2	2023
Natural Language Reinforcement Learning X Feng, Z Wan, M Yang, Z Wang, GA Koushiks, Y Du, Y Wen, J Wang arXiv preprint arXiv:2402.07157, 2024		2024
Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models Z Hu, C Liu, X Feng, Y Zhao, SK Ng, AT Luu, J He, PW Koh, B Hooi arXiv preprint arXiv:2402.03271, 2024		2024

The system can't perform the operation now. Try again later.

Articles 1–17

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors