Personal llm agents: Insights and survey about the capability, efficiency and security Y Li, H Wen, W Wang, X Li, Y Yuan, G Liu, J Liu, W Xu, X Wang, Y Sun, ... arXiv preprint arXiv:2401.05459, 2024 | 11 | 2024 |
Convrelu++: Reference-based lossless acceleration of conv-relu operations on mobile cpu R Kong, Y Li, Y Yuan, L Kong Proceedings of the 21st Annual International Conference on Mobile Systems …, 2023 | 4 | 2023 |
Patchbackdoor: Backdoor attack against deep neural networks without model modification Y Yuan, R Kong, S Xie, Y Li, Y Liu Proceedings of the 31st ACM International Conference on Multimedia, 9134-9142, 2023 | 1 | 2023 |
Serving MoE Models on Resource-constrained Edge Devices via Dynamic Expert Swapping R Kong, Y Li, Q Feng, W Wang, L Kong, Y Liu arXiv preprint arXiv:2308.15030, 2023 | 1 | 2023 |
SwapMoE: Efficient Memory-Constrained Serving of Large Sparse MoE Models via Dynamic Expert Pruning and Swapping R Kong, Y Li, Q Feng, W Wang, L Kong, Y Liu arXiv preprint arXiv:2308.15030v2, 2023 | | 2023 |