Learning to run challenge solutions: Adapting reinforcement learning methods for neuromusculoskeletal environments
Ł Kidziński, SP Mohanty, CF Ong, Z Huang, S Zhou, A Pechenko, ...
The NIPS'17 Competition: Building Intelligent Systems, 121-153, 2018
Artificial intelligence for prosthetics: Challenge solutions
Ł Kidziński, C Ong, SP Mohanty, J Hicks, S Carroll, B Zhou, H Zeng, ...
The NeurIPS'18 Competition: From Machine Learning to Intelligent …, 2020
Run, skeleton, run: skeletal model in a physics-based simulation
M Pavlov, S Kolesnikov, SM Plis
arXiv preprint arXiv:1711.06922, 2017
Showing your offline reinforcement learning work: Online evaluation budget matters
V Kurenkov, S Kolesnikov
International Conference on Machine Learning, 11729-11752, 2022
Catalyst.RL: a distributed framework for reproducible RL research
S Kolesnikov, O Hrinchuk
arXiv preprint arXiv:1903.00027, 2019
Sample efficient ensemble learning with Catalyst.RL
S Kolesnikov, V Khrulkov
arXiv preprint arXiv:2003.14210, 2020
Probabilistic embeddings revisited
I Karpukhin, S Dereka, S Kolesnikov
arXiv preprint arXiv:2202.06768, 2022
CVTT: Cross-Validation Through Time
S Kolesnikov, M Andronov
arXiv preprint arXiv:2205.05393, 2022
TTRS: Tinkoff Transactions Recommender System benchmark
S Kolesnikov, O Lashinin, M Pechatov, A Kosov
arXiv preprint arXiv:2110.05589, 2021
LRWR: large-scale benchmark for lip reading in Russian language
E Egorov, V Kostyumov, M Konyk, S Kolesnikov
arXiv preprint arXiv:2109.06692, 2021
Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows
D Akimov, V Kurenkov, A Nikulin, D Tarasov, S Kolesnikov
arXiv preprint arXiv:2211.11096, 2022
Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size
A Nikulin, V Kurenkov, D Tarasov, D Akimov, S Kolesnikov
arXiv preprint arXiv:2211.11092, 2022
CORL: Research-oriented Deep Offline Reinforcement Learning Library
D Tarasov, A Nikulin, D Akimov, V Kurenkov, S Kolesnikov
arXiv preprint arXiv:2210.07105, 2022
EXACT: How to Train Your Accuracy
I Karpukhin, S Dereka, S Kolesnikov
arXiv preprint arXiv:2205.09615, 2022
Towards Interaction-based User Embeddings in Sequential Recommender Models
M Ananyeva, O Lashinin, V Ivanova, S Kolesnikov, DI Ignatov
Deep Image Retrieval is not Robust to Label Noise
S Dereka, I Karpukhin, S Kolesnikov
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Prompts and Pre-Trained Language Models for Offline Reinforcement Learning
D Tarasov, V Kurenkov, S Kolesnikov
ICLR 2022 Workshop on Generalizable Policy Learning in Physical World
