Siyuan Zhuang
PhD Student, UC Berkeley
Verified email at berkeley.edu
Title
Cited by
Year
Vicuna: An open-source chatbot impressing gpt-4 with 90%* chatgpt quality
WL Chiang, Z Li, Z Lin, Y Sheng, Z Wu, H Zhang, L Zheng, S Zhuang, ...
See https://vicuna.lmsys.org (accessed 14 April 2023), 2023
Cited by: 470; Year: 2023
Judging LLM-as-a-judge with MT-Bench and Chatbot Arena
L Zheng, WL Chiang, Y Sheng, S Zhuang, Z Wu, Y Zhuang, Z Lin, Z Li, ...
arXiv preprint arXiv:2306.05685, 2023
Cited by: 184; Year: 2023
Terapipe: Token-level pipeline parallelism for training large-scale language models
Z Li, S Zhuang, S Guo, D Zhuo, H Zhang, D Song, I Stoica
International Conference on Machine Learning, 6543-6552, 2021
Cited by: 57; Year: 2021
Efficient memory management for large language model serving with pagedattention
W Kwon, Z Li, S Zhuang, Y Sheng, L Zheng, CH Yu, J Gonzalez, H Zhang, ...
Proceedings of the 29th Symposium on Operating Systems Principles, 611-626, 2023
Cited by: 38; Year: 2023
Hoplite: efficient and fault-tolerant collective communication for task-based distributed systems
S Zhuang, Z Li, D Zhuo, S Wang, E Liang, R Nishihara, P Moritz, I Stoica
Proceedings of the 2021 ACM SIGCOMM 2021 Conference, 641-656, 2021
Cited by: 19; Year: 2021
SkyPilot: An Intercloud Broker for Sky Computing
Z Yang, Z Wu, M Luo, WL Chiang, R Bhardwaj, W Kwon, S Zhuang, ...
20th USENIX Symposium on Networked Systems Design and Implementation (NSDI …, 2023
Cited by: 13; Year: 2023
Eric. P Xing, Hao Zhang, Joseph E. Gonzalez, and Ion Stoica. 2023. Judging LLM-as-a-judge with MT-Bench and Chatbot Arena
L Zheng, WL Chiang, Y Sheng, S Zhuang, Z Wu, Y Zhuang, Z Lin, Z Li, ...
arXiv preprint arXiv:2306.05685, 0
Cited by: 12
sensai: Convnets decomposition via class parallelism for fast inference on live data
G Wang, Z Liu, B Hsieh, S Zhuang, J Gonzalez, T Darrell, I Stoica
Proceedings of Machine Learning and Systems 3, 664-679, 2021
Cited by: 10; Year: 2021
Vicuna: An Open-Source Chatbot Impressing GPT-4 with 90%* ChatGPT Quality. 2023
WL Chiang, Z Li, Z Lin, Y Sheng, Z Wu, H Zhang, L Zheng, S Zhuang, ...
URL https://lmsys.org/blog/2023-03-30-vicuna 1 (2), 3, 0
Cited by: 10
Judging llm-as-a-judge with mt-bench and chatbot arena. CoRR, abs/2306.05685, 2023. doi: 10.48550
L Zheng, WL Chiang, Y Sheng, S Zhuang, Z Wu, Y Zhuang, Z Lin, Z Li, ...
arXiv preprint arXiv.2306.05685, 0
Cited by: 8
Lmsys-chat-1m: A large-scale real-world llm conversation dataset
L Zheng, WL Chiang, Y Sheng, T Li, S Zhuang, Z Wu, Y Zhuang, Z Li, ...
arXiv preprint arXiv:2309.11998, 2023
Cited by: 6; Year: 2023
Sensai: Fast convnets serving on live data via class parallelism
G Wang, Z Liu, S Zhuang, B Hsieh, J Gonzalez, I Stoica
MLOps Systems workshop in MLSys, 2020
Cited by: 5; Year: 2020
Composing MPC with LQR and neural networks for efficient and stable control
F Wu, G Wang, S Zhuang, K Wang, A Keimer, I Stoica, A Bayen
arXiv preprint arXiv:2112.07238 3, 2021
Cited by: 4; Year: 2021
Rearchitecting in-memory object stores for low latency
D Zhuo, K Zhang, Z Li, S Zhuang, S Wang, A Chen, I Stoica
Proceedings of the VLDB Endowment 15 (3), 555-568, 2021
Cited by: 1; Year: 2021
Composing MPC With LQR and Neural Network for Amortized Efficiency and Stable Control
F Wu, G Wang, S Zhuang, K Wang, A Keimer, I Stoica, A Bayen
IEEE Transactions on Automation Science and Engineering, 2023
Year: 2023
ExoFlow: A Universal Workflow System for Exactly-Once DAGs
S Zhuang, S Wang, E Liang, Y Cheng, I Stoica
17th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2023
Year: 2023
Hoplite: Efficient Collective Communication for Task-Based Distributed Systems.
S Zhuang, Z Li, D Zhuo, S Wang, E Liang, R Nishihara, P Moritz, I Stoica
CoRR, 2020
Year: 2020
AVOIDING GPU OOM FOR DYNAMIC COMPUTATIONAL GRAPHS TRAINING
S Zhuang