Designing rewards for fast learning H Sowerby, Z Zhou, ML Littman arXiv preprint arXiv:2205.15400, 2022 | 19 | 2022 |
Autonomous improvement of instruction following skills via foundation models Z Zhou, P Atreya, A Lee, H Walke, O Mees, S Levine arXiv preprint arXiv:2407.20635, 2024 | 7 | 2024 |
Efficient online reinforcement learning fine-tuning need not retain offline data Z Zhou, A Peng, Q Li, S Levine, A Kumar arXiv preprint arXiv:2412.07762, 2024 | 3 | 2024 |
Characterizing the Action-Generalization Gap in Deep Q-Learning Z Zhou, C Allen, K Asadi, G Konidaris arXiv preprint arXiv:2205.05588, 2022 | 2 | 2022 |
Learning Transferable Sub-Goals by Hypothesizing Generalizing Features A de Mello Koch, A Bagaria, B Huo, Z Zhou, C Allen, G Konidaris | | 2025 |
Tiered Reward: Designing Rewards for Specification and Fast Learning of Desired Behavior Z Zhou, S Raman, H Sowerby, ML Littman Reinforcement Learning Journal 1 (1), 2024 | | 2024 |
Policy Transfer in Lifelong Reinforcement Learning through Learning Generalizing Features Z Zhou Brown University Providence, Rhode Island, 2023 | | 2023 |
Improving Post-Processing Methods on Video Object Recognition Using Inertial Measurement Unit Z Zhou, M Paradiso, S Boyum | | 2020 |
Learning Transferable Sub-goals by Hypothesizing Generalizing Features ADM Koch, A Bagaria, B Huo, C Allen, Z Zhou, G Konidaris | | |
Learning Portable Skills by Identifying Generalizing Features with an Attention-Based Ensemble ATDM Koch, Z Zhou, A Bagaria, H Fu, C Allen, G Konidaris | | |