Xinyue Shen
CISPA Helmholtz Center for Information Security
Verified email at cispa.de
Title
Cited by
Year
"Do Anything Now": Characterizing and Evaluating In-The-Wild Jailbreak Prompts on Large Language Models
X Shen, Z Chen, M Backes, Y Shen, Y Zhang
Proceedings of the 2024 ACM SIGSAC Conference on Computer and Communications …, 2024
Cited by: 253, Year: 2024
In ChatGPT We Trust? Measuring and Characterizing the Reliability of ChatGPT
X Shen, Z Chen, M Backes, Y Zhang
arXiv preprint arXiv:2304.08979, 2023
Cited by: 102, Year: 2023
MGTBench: Benchmarking Machine-Generated Text Detection
X He, X Shen, Z Chen, M Backes, Y Zhang
arXiv preprint arXiv:2303.14822, 2023
Cited by: 71, Year: 2023
Unsafe Diffusion: On the Generation of Unsafe Images and Hateful Memes From Text-To-Image Models
Y Qu, X Shen, X He, M Backes, S Zannettou, Y Zhang
Proceedings of the 2023 ACM SIGSAC Conference on Computer and Communications …, 2023
Cited by: 67, Year: 2023
Evil Under the Sun: Understanding and Discovering Attacks on Ethereum Decentralized Applications
L Su, X Shen, X Du, X Liao, XF Wang, L Xing, B Liu
Cited by: 63, Year: 2021
Comprehensive Assessment of Jailbreak Attacks Against LLMs
J Chu, Y Liu, Z Yang, X Shen, M Backes, Y Zhang
arXiv preprint arXiv:2402.05668, 2024
Cited by: 31, Year: 2024
Prompt Stealing Attacks Against Text-to-Image Generation Models
X Shen, Y Qu, M Backes, Y Zhang
33rd USENIX Security Symposium (USENIX Security 24), 5823-5840, 2024
Cited by: 18, Year: 2024
On Xing Tian and the Perseverance of Anti-China Sentiment Online
X Shen, X He, M Backes, J Blackburn, S Zannettou, Y Zhang
Proceedings of the International AAAI Conference on Web and Social Media 16 …, 2022
Cited by: 17, Year: 2022
Backdoor Attacks in the Supply Chain of Masked Image Modeling
X Shen, X He, Z Li, Y Shen, M Backes, Y Zhang
arXiv preprint arXiv:2210.01632, 2022
Cited by: 8, Year: 2022
Comprehensive Assessment of Toxicity in ChatGPT
B Zhang, X Shen, WM Si, Z Sha, Z Chen, A Salem, Y Shen, M Backes, ...
arXiv preprint arXiv:2311.14685, 2023
Cited by: 3, Year: 2023
Voice Jailbreak Attacks Against GPT-4o
X Shen, Y Wu, M Backes, Y Zhang
arXiv preprint arXiv:2405.19103, 2024
Cited by: 1, Year: 2024
UnsafeBench: Benchmarking Image Safety Classifiers on Real-World and AI-Generated Images
Y Qu, X Shen, Y Wu, M Backes, S Zannettou, Y Zhang
arXiv preprint arXiv:2405.03486, 2024
Cited by: 1, Year: 2024
Games and Beyond: Analyzing the Bullet Chats of Esports Livestreaming
Y Jiang, X Shen, R Wen, Z Sha, J Chu, Y Liu, M Backes, Y Zhang
Proceedings of the International AAAI Conference on Web and Social Media 18 …, 2024
Year: 2024