Sam Lobel
PhD Student, Brown University
Verified email at brown.edu
Title
Cited by
Year
Towards amortized ranking-critical training for collaborative filtering
S Lobel, C Li, J Gao, L Carin
arXiv preprint arXiv:1906.04281, 2019
Cited by: 36, Year: 2019
Flipping coins to estimate pseudocounts for exploration in reinforcement learning
S Lobel, A Bagaria, G Konidaris
International Conference on Machine Learning, 22594-22613, 2023
Cited by: 22, Year: 2023
Tunable depletion potentials driven by shape variation of surfactant micelles
MD Gratale, T Still, C Matyas, ZS Davidson, S Lobel, PJ Collings, ...
Physical Review E 93 (5), 050601, 2016
Cited by: 20, Year: 2016
Optimistic initialization for exploration in continuous control
S Lobel, O Gottesman, C Allen, A Bagaria, G Konidaris
Proceedings of the AAAI Conference on Artificial Intelligence 36 (7), 7612-7619, 2022
Cited by: 14, Year: 2022
Q-functionals for value-based continuous control
S Lobel, S Rammohan, B He, S Yu, G Konidaris
Proceedings of the AAAI Conference on Artificial Intelligence 37 (7), 8932-8939, 2023
Cited by: 8, Year: 2023
An optimal tightness bound for the simulation lemma
S Lobel, R Parr
arXiv preprint arXiv:2406.16249, 2024
Cited by: 2, Year: 2024
Coarse-grained smoothness for RL in metric spaces
O Gottesman, K Asadi, C Allen, S Lobel, G Konidaris, M Littman
arXiv preprint arXiv:2110.12276, 2021
Cited by: 2, Year: 2021
Mitigating partial observability in sequential decision processes via the lambda discrepancy
C Allen, A Kirtland, RY Tao, S Lobel, D Scott, N Petrocelli, O Gottesman, ...
Advances in Neural Information Processing Systems 37, 62988-63028, 2024
Cited by: 1, Year: 2024
Coarse-Grained Smoothness for Reinforcement Learning in Metric Spaces
O Gottesman, K Asadi, CS Allen, S Lobel, G Konidaris, M Littman
International Conference on Artificial Intelligence and Statistics, 1390-1410, 2023
Cited by: 1, Year: 2023
Reproducing “Towards Interpretable Reinforcement Learning Using Attention Augmented Agents”
C Lovering, S Lobel, D Goktas, K Kwegyir-Aggrey, A Webson
Cited by: 1, Year: 2019
Resolving Partial Observability in Decision Processes via the Lambda Discrepancy
C Allen, AT Kirtland, RY Tao, D Scott, S Lobel, N Petrocelli, O Gottesman, ...
Robust Linear Reinforcement Learning
S Lobel, RY Tao, T Akbulut
Q-Functionals for Efficient Value-Based Continuous Control
S Rammohan, B He, S Yu, S Lobel, G Konidaris
Articles 1–13