Double graph based reasoning for document-level relation extraction S Zeng, R Xu, B Chang, L Li arXiv preprint arXiv:2009.13752, 2020 | 192 | 2020 |
Raise a child in large language model: Towards effective and generalizable fine-tuning R Xu, F Luo, Z Zhang, C Tan, B Chang, S Huang, F Huang arXiv preprint arXiv:2109.05687, 2021 | 129 | 2021 |
Document-level event extraction via heterogeneous graph-based interaction model with a tracker R Xu, T Liu, L Li, B Chang arXiv preprint arXiv:2105.14924, 2021 | 79 | 2021 |
An enhanced span-based decomposition method for few-shot sequence labeling P Wang, R Xu, T Liu, Q Zhou, Y Cao, B Chang, Z Sui arXiv preprint arXiv:2109.13023, 2021 | 40 | 2021 |
A two-stream AMR-enhanced model for document-level event argument extraction R Xu, P Wang, T Liu, S Zeng, B Chang, Z Sui arXiv preprint arXiv:2205.00241, 2022 | 27 | 2022 |
Making pre-trained language models end-to-end few-shot learners with contrastive prompt tuning Z Xu, C Wang, M Qiu, F Luo, R Xu, S Huang, J Huang Proceedings of the Sixteenth ACM International Conference on Web Search and …, 2023 | 20 | 2023 |
Deepseekmath: Pushing the limits of mathematical reasoning in open language models Z Shao, P Wang, Q Zhu, R Xu, J Song, M Zhang, YK Li, Y Wu, D Guo arXiv preprint arXiv:2402.03300, 2024 | 18 | 2024 |
From dense to sparse: Contrastive pruning for better pre-trained language model compression R Xu, F Luo, C Wang, B Chang, J Huang, S Huang, F Huang Proceedings of the AAAI Conference on Artificial Intelligence 36 (10), 11547 …, 2022 | 18 | 2022 |
Deepseekmoe: Towards ultimate expert specialization in mixture-of-experts language models D Dai, C Deng, C Zhao, RX Xu, H Gao, D Chen, J Li, W Zeng, X Yu, Y Wu, ... arXiv preprint arXiv:2401.06066, 2024 | 17 | 2024 |
Deepseek llm: Scaling open-source language models with longtermism X Bi, D Chen, G Chen, S Chen, D Dai, C Deng, H Ding, K Dong, Q Du, ... arXiv preprint arXiv:2401.02954, 2024 | 16 | 2024 |
Behind the scenes: An exploration of trigger biases problem in few-shot event classification P Wang, R Xun, T Liu, D Dai, B Chang, Z Sui Proceedings of the 30th ACM International Conference on Information …, 2021 | 14 | 2021 |
Xiaomingbot: A Multilingual Robot News Reporter R Xu, J Cao, M Wang, J Chen, H Zhou, Y Zeng, Y Wang, L Chen, X Yin, ... The 58th Annual Meeting of the Association for Computational Linguistics, 2020 | 12 | 2020 |
ATP: AMRize then parse! enhancing AMR parsing with PseudoAMRs L Chen, P Wang, R Xu, T Liu, Z Sui, B Chang arXiv preprint arXiv:2204.08875, 2022 | 11 | 2022 |
Math-shepherd: A label-free step-by-step verifier for llms in mathematical reasoning P Wang, L Li, Z Shao, RX Xu, D Dai, Y Li, D Chen, Y Wu, Z Sui arXiv preprint arXiv:2312.08935, 2023 | 10 | 2023 |
Math-shepherd: Verify and reinforce llms step-by-step without human annotations P Wang, L Li, Z Shao, RX Xu, D Dai, Y Li, D Chen, Y Wu, Z Sui CoRR, abs/2312.08935, 2023 | 10 | 2023 |
A double-graph based framework for frame semantic parsing C Zheng, X Chen, R Xu, B Chang arXiv preprint arXiv:2206.09158, 2022 | 9 | 2022 |
S4-Tuning: A simple cross-lingual sub-network tuning method R Xu, F Luo, B Chang, S Huang, F Huang Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022 | 8 | 2022 |
Volctrans parallel corpus filtering system for WMT 2020 R Xu, Z Zhi, J Cao, M Wang, L Li arXiv preprint arXiv:2010.14029, 2020 | 5 | 2020 |
On effectively learning of knowledge in continual pre-training C Wang, F Luo, Y Li, R Xu, F Huang, Y Zhang arXiv preprint arXiv:2204.07994, 2022 | 4 | 2022 |
Focus on the Target's Vocabulary: Masked Label Smoothing for Machine Translation L Chen, R Xu, B Chang arXiv preprint arXiv:2203.02889, 2022 | 4 | 2022 |