Decision transformer: Reinforcement learning via sequence modeling L Chen, K Lu, A Rajeswaran, K Lee, A Grover, M Laskin, P Abbeel, ... Neural Information Processing Systems (NeurIPS), 2021, 2021 | 163 | 2021 |
State entropy maximization with random encoders for efficient exploration Y Seo, L Chen, J Shin, H Lee, P Abbeel, K Lee International Conference on Machine Learning (ICML), 2021, 2021 | 39 | 2021 |