Language agents with reinforcement learning for strategic play in the werewolf game Z Xu, C Yu, F Fang, Y Wang, Y Wu arXiv preprint arXiv:2310.18940, 2023 | 48 | 2023 |
Revisiting some common practices in cooperative multi-agent reinforcement learning W Fu, C Yu, Z Xu, J Yang, Y Wu arXiv preprint arXiv:2206.07505, 2022 | 39 | 2022 |
Fictitious cross-play: Learning global nash equilibrium in mixed cooperative-competitive games Z Xu, Y Liang, C Yu, Y Wang, Y Wu arXiv preprint arXiv:2310.03354, 2023 | 8 | 2023 |
Multi-agent vulnerability discovery for autonomous driving with hazard arbitration reward W Liu, Y Mu, C Yu, X Ning, Z Cao, Y Wu, S Liang, H Yang, Y Wang arXiv preprint arXiv:2112.06185, 2021 | 4 | 2021 |
A Survey on Self-play Methods in Reinforcement Learning R Zhang, Z Xu, C Ma, C Yu, WW Tu, S Huang, D Ye, W Ding, Y Yang, ... arXiv preprint arXiv:2408.01072, 2024 | 2 | 2024 |
Texture BERT for cross-modal texture image retrieval Z Xu, T Yu, P Li Proceedings of the 31st ACM International Conference on Information …, 2022 | 2 | 2022 |
Accelerate Multi-Agent Reinforcement Learning in Zero-Sum Games with Subgame Curriculum Learning J Chen, Z Xu, Y Li, C Yu, J Song, H Yang, F Fang, Y Wang, Y Wu Proceedings of the AAAI Conference on Artificial Intelligence 38 (10), 11320 …, 2024 | 1 | 2024 |
Towards efficient evaluation of risk via herding Z Xu, T Yu, S Sra Negative Dependence: Theory and Applications in Machine Learning, 2019 | 1 | 2019 |