Distributed deep learning in open collaborations M Diskin, A Bukhtiyarov, M Ryabinin, L Saulnier, A Sinitsin, D Popov, ... Advances in Neural Information Processing Systems 34, 7879-7897, 2021 | 59 | 2021 |
Fast inference of mixture-of-experts language models with offloading A Eliseev, D Mazur arXiv preprint arXiv:2312.17238, 2023 | 34 | 2023 |
Pv-tuning: Beyond straight-through estimation for extreme llm compression V Malinovskii, D Mazur, I Ilin, D Kuznedelev, K Burlachenko, K Yi, ... Advances in Neural Information Processing Systems 37, 5074-5121, 2024 | 9 | 2024 |
Beyond vector spaces: Compact data representation as differentiable weighted graphs D Mazur, V Egiazarian, S Morozov, A Babenko Advances in Neural Information Processing Systems 32, 2019 | 6 | 2019 |
TQCompressor: improving tensor decomposition methods in neural networks via permutations V Abronin, A Naumov, D Mazur, D Bystrov, K Tsarova, A Melnikov, ... 2024 IEEE 7th International Conference on Multimedia Information Processing …, 2024 | 5 | 2024 |
Cache Me If You Must: Adaptive Key-Value Quantization for Large Language Models A Shutova, V Malinovskii, V Egiazarian, D Kuznedelev, D Mazur, ... arXiv preprint arXiv:2501.19392, 2025 | | 2025 |