Annotators with attitudes: How annotator beliefs and identities bias toxic language detection M Sap, S Swayamdipta, L Vianna, X Zhou, Y Choi, NA Smith Proceedings of the 2022 Conference of the North American Chapter of the …, 2021 | 265 | 2021 |
Webarena: A realistic web environment for building autonomous agents S Zhou, FF Xu, H Zhu, X Zhou, R Lo, A Sridhar, X Cheng, T Ou, Y Bisk, ... arXiv preprint arXiv:2307.13854, 2023 | 264 | 2023 |
Evaluating commonsense in pre-trained language models X Zhou, Y Zhang, L Cui, D Huang Proceedings of the AAAI conference on artificial intelligence 34 (05), 9733-9740, 2020 | 215 | 2020 |
Challenges in automated debiasing for toxic language detection X Zhou Proceedings of the 16th Conference of the European Chapter of the …, 2021 | 166 | 2021 |
Clever hans or neural theory of mind? stress testing social reasoning in large language models N Shapira, M Levy, SH Alavi, X Zhou, Y Choi, Y Goldberg, M Sap, ... arXiv preprint arXiv:2305.14763, 2023 | 114 | 2023 |
Sotopia: Interactive evaluation for social intelligence in language agents X Zhou, H Zhu, L Mathur, R Zhang, H Yu, Z Qi, LP Morency, Y Bisk, ... arXiv preprint arXiv:2310.11667, 2023 | 103 | 2023 |
Can llms keep a secret? testing privacy implications of language models via contextual integrity theory N Mireshghallah, H Kim, X Zhou, Y Tsvetkov, M Sap, R Shokri, Y Choi arXiv preprint arXiv:2310.17884, 2023 | 67 | 2023 |
FANToM: A benchmark for stress-testing machine theory of mind in interactions H Kim, M Sclar, X Zhou, RL Bras, G Kim, Y Choi, M Sap arXiv preprint arXiv:2310.15421, 2023 | 64 | 2023 |
Linguistically-informed transformations (LIT): A method for automatically generating contrast sets C Li, L Shengshuo, LZ Liu, X Wu, X Zhou, S Steinert-Threlkeld Proceedings of the Third BlackboxNLP Workshop on Analyzing and Interpreting …, 2020 | 35 | 2020 |
Tianyue Ou, Yonatan Bisk, Daniel Fried, Uri Alon, and Graham Neubig S Zhou, FF Xu, H Zhu, X Zhou, R Lo, A Sridhar, X Cheng Webarena: A realistic web environment for building autonomous agents 2 (3), 4, 2023 | 34 | 2023 |
Cobra frames: Contextual reasoning about effects and harms of offensive statements X Zhou, H Zhu, A Yerukola, T Davidson, JD Hwang, S Swayamdipta, ... Proceedings of the Association for Computational Linguistics (ACL), 2023 | 26 | 2023 |
Consent in crisis: The rapid decline of the ai data commons S Longpre, R Mahari, A Lee, C Lund, H Oderinwale, W Brannon, ... NEURIPS, 2024 | 25 | 2024 |
Tianyue Ou, Yonatan Bisk, Daniel Fried, Uri Alon, and Graham Neubig. 2023. WebArena: A Realistic Web Environment for Building Autonomous Agents S Zhou, FF Xu, H Zhu, X Zhou, R Lo, A Sridhar, X Cheng arXiv preprint arXiv:2307.13854, 0 | 24 | |
Is this the real life? is this just fantasy? the misleading success of simulating social interactions with llms X Zhou, Z Su, T Eisape, H Kim, M Sap arXiv preprint arXiv:2403.05020, 2024 | 22 | 2024 |
Multilevel text alignment with cross-document attention X Zhou, N Pappas, NA Smith Proceedings of the 2020 Conference on Empirical Methods in Natural Language …, 2020 | 22 | 2020 |
Tianyue Ou, Yonatan Bisk, Daniel Fried, et al. 2023. Webarena: A realistic web environment for building autonomous agents S Zhou, FF Xu, H Zhu, X Zhou, R Lo, A Sridhar, X Cheng arXiv preprint arXiv:2307.13854, 2023 | 18 | 2023 |
Extracting and inferring personal attributes from dialogue Z Wang Proceedings of the 4th Workshop on NLP for Conversational AI, 2021 | 17 | 2021 |
PolygloToxicityPrompts: Multilingual Evaluation of Neural Toxic Degeneration in Large Language Models D Jain, P Kumar, S Gehman, X Zhou, T Hartvigsen, M Sap arXiv preprint arXiv:2405.09373, 2024 | 16 | 2024 |
Emergent Communication Fine-tuning (EC-FT) for Pretrained Language Models S Steinert-Threlkeld, X Zhou, Z Liu, CM Downey Emergent Communication Workshop at ICLR 2022, 2022 | 14 | 2022 |
Clever hans or neural theory of mind N Shapira, M Levy, SH Alavi, X Zhou, Y Choi, Y Goldberg, M Sap, ... Stress testing social reasoning in large language models, 2023 | 13 | 2023 |