Damai Dai

Cited by

	All	Since 2019
Citations	1589	1584
h-index	15	15
i10-index	18	18

Citations per year: 2019: 11, 2020: 29, 2021: 64, 2022: 130, 2023: 768, 2024: 575

Public access (based on funding mandates): 7 articles available, 0 articles not available

Co-authors

Baobao CHANG, Peking University (verified email at pku.edu.cn)
Qingxiu Dong, Peking University (verified email at stu.pku.edu.cn)
Xu Sun, Associate Professor, Peking University (verified email at pku.edu.cn)
Li Dong, Microsoft Research (verified email at microsoft.com)
Furu Wei, Partner Research Manager, Microsoft Research (verified email at microsoft.com)
Tianyu Liu, Alibaba (verified email at pku.edu.cn)
Shuming Ma, Microsoft Research Asia (verified email at microsoft.com)
Peiyi Wang, Peking University (verified email at stu.pku.edu.cn)
Fuli Luo, DeepSeek (verified email at pku.edu.cn)
Wei Li, Beijing Language and Culture University (verified email at blcu.edu.cn)

Damai Dai

Other names: 代达劢

MOE Key Lab of Computational Linguistics, School of EECS, Peking University

Verified email at pku.edu.cn

Deep Learning, Natural Language Processing, Large Language Model, Mixture-of-Experts


Title	Authors	Venue	Cited by	Year
A survey on in-context learning	Q Dong, L Li, D Dai, C Zheng, Z Wu, B Chang, X Sun, J Xu, Z Sui	arXiv preprint arXiv:2301.00234, 2022	588	2022
Knowledge neurons in pretrained transformers	D Dai, L Dong, Y Hao, Z Sui, B Chang, F Wei	Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022	273	2022
Why can GPT learn in-context? language models implicitly perform gradient descent as meta-optimizers	D Dai, Y Sun, L Dong, Y Hao, S Ma, Z Sui, F Wei	Findings of the Association for Computational Linguistics: ACL 2023, 4005-4019, 2023	210	2023
Calibrating Factual Knowledge in Pretrained Language Models	Q Dong, D Dai, Y Song, J Xu, Z Sui, L Li	Findings of the Association for Computational Linguistics: EMNLP 2022, 2022	61	2022
Preliminary study on the construction of Chinese medical knowledge graph	O Byambasuren, Y Yang, Z Sui, D Dai, B Chang, S Li, H Zan	Journal of Chinese Information Processing 33 (10), 1-9, 2019	61*	2019
Learning to control the fine-grained sentiment for story ending generation	F Luo, D Dai, P Yang, T Liu, B Chang, Z Sui, X Sun	Proceedings of the 57th Annual Meeting of the Association for Computational …, 2019	59	2019
LiveBot: Generating live video comments based on visual and textual contexts	S Ma, L Cui, D Dai, F Wei, X Sun	Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 6810-6817, 2019	54	2019
On the representation collapse of sparse mixture of experts	Z Chi, L Dong, S Huang, D Dai, S Ma, B Patra, S Singhal, P Bajaj, X Song, ...	Advances in Neural Information Processing Systems 35, 34600-34613, 2022	40	2022
Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning	L Wang, L Li, D Dai, D Chen, H Zhou, F Meng, J Zhou, X Sun	Proceedings of the 2023 Conference on Empirical …, 2023 (EMNLP 2023 Best Long Paper)	34	2023
DeepSeek LLM: Scaling open-source language models with longtermism	X Bi, D Chen, G Chen, S Chen, D Dai, C Deng, H Ding, K Dong, Q Du, ...	arXiv preprint arXiv:2401.02954, 2024	33	2024
StableMoE: Stable Routing Strategy for Mixture of Experts	D Dai, L Dong, S Ma, B Zheng, Z Sui, B Chang, F Wei	Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022	33	2022
Sememe prediction: Learning semantic knowledge from unstructured textual wiki descriptions	W Li, X Ren, D Dai, Y Wu, H Wang, X Sun	arXiv preprint arXiv:1808.05437, 2018	19	2018
Math-Shepherd: Verify and reinforce LLMs step-by-step without human annotations	P Wang, L Li, Z Shao, RX Xu, D Dai, Y Li, D Chen, Y Wu, Z Sui	CoRR, abs/2312.08935, 2023	17*	2023
Inductively Representing Out-of-Knowledge-Graph Entities by Optimal Estimation Under Translational Assumptions	D Dai, H Zheng, F Luo, P Yang, T Liu, Z Sui, B Chang	Proceedings of the 6th ACL Workshop on Representation Learning for NLP …, 2021	17	2021
DeepSeekMoE: Towards ultimate expert specialization in mixture-of-experts language models	D Dai, C Deng, C Zhao, RX Xu, H Gao, D Chen, J Li, W Zeng, X Yu, Y Wu, ...	arXiv preprint arXiv:2401.06066, 2024	16	2024
Behind the scenes: An exploration of trigger biases problem in few-shot event classification	P Wang, R Xun, T Liu, D Dai, B Chang, Z Sui	Proceedings of the 30th ACM International Conference on Information …, 2021	14	2021
Hierarchical Curriculum Learning for AMR Parsing	P Wang, L Chen, T Liu, D Dai, Y Cao, B Chang, Z Sui	Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022	12	2022
Neural knowledge bank for pretrained transformers	D Dai, W Jiang, Q Dong, Y Lyu, Z Sui	CCF International Conference on Natural Language Processing and Chinese …, 2023	10	2023
Decompose, fuse and generate: A formation-informed method for Chinese definition generation	H Zheng, D Dai, L Li, T Liu, Z Sui, B Chang, Y Liu	Proceedings of the 2021 Conference of the North American Chapter of the …, 2021	8	2021
Coarse-to-fine entity representations for document-level relation extraction	D Dai, J Ren, S Zeng, B Chang, Z Sui	CCF International Conference on Natural Language Processing and Chinese …, 2023	7	2023
