Подписаться
Ruo Yu Tao
Ruo Yu Tao
Подтвержден адрес электронной почты в домене brown.edu - Главная страница
Название
Процитировано
Процитировано
Год
Textworld: A learning environment for text-based games
MA Côté, Á Kádár, X Yuan, B Kybartas, T Barnes, E Fine, J Moore, ...
arXiv preprint arXiv:1806.11532, 2018
3612018
Novelty Search in representational space for sample efficient exploration
RY Tao, V François-Lavet, J Pineau
Advances in Neural Information Processing Systems 33, 2020
522020
Layla El Asri, Mahmoud Adada, Wendy Tay, and Adam Trischler
MA Côté, Á Kádár, X Yuan, B Kybartas, T Barnes, E Fine, J Moore, ...
Textworld: A learning environment for text-based games. CoRR, abs/1806.11532 2, 2018
382018
Layla El Asri, Mahmoud Adada, Wendy Tay, and Adam Trischler. 2018
MA Côté, Á Kádár, X Yuan, B Kybartas, T Barnes, E Fine, J Moore, ...
Textworld: A learning environment for textbased games. CoRR, abs, 1806
271806
Towards solving text-based games by producing adaptive action spaces
RY Tao, MA Côté, X Yuan, LE Asri
arXiv preprint arXiv:1812.00855, 2018
152018
Measuring and mitigating interference in reinforcement learning
V Liu, H Wang, RY Tao, K Javed, A White, M White
Conference on Lifelong Learning Agents, 781-795, 2023
52023
Agent-state construction with auxiliary inputs
RY Tao, A White, MC Machado
arXiv preprint arXiv:2211.07805, 2022
52022
Mitigating partial observability in sequential decision processes via the lambda discrepancy
C Allen, A Kirtland, RY Tao, S Lobel, D Scott, N Petrocelli, O Gottesman, ...
Advances in Neural Information Processing Systems 37, 62988-63028, 2024
12024
RL: Generic reinforcement learning codebase in TensorFlow
BM Li, A Cowen-Rivers, P Kozakowski, D Tao, SR Kamalakara, ...
Journal of Open Source Software 4 (42), 1524, 2019
2019
Robust Linear Reinforcement Learning
S Lobel, RY Tao, T Akbulut
Resolving Partial Observability in Decision Processes via the Lambda Discrepancy
C Allen, AT Kirtland, RY Tao, D Scott, S Lobel, N Petrocelli, O Gottesman, ...
В данный момент система не может выполнить эту операцию. Повторите попытку позднее.
Статьи 1–11