Towards amortized ranking-critical training for collaborative filtering S Lobel, C Li, J Gao, L Carin arXiv preprint arXiv:1906.04281, 2019 | 36 | 2019 |
Flipping coins to estimate pseudocounts for exploration in reinforcement learning S Lobel, A Bagaria, G Konidaris International Conference on Machine Learning, 22594-22613, 2023 | 22 | 2023 |
Tunable depletion potentials driven by shape variation of surfactant micelles MD Gratale, T Still, C Matyas, ZS Davidson, S Lobel, PJ Collings, ... Physical Review E 93 (5), 050601, 2016 | 20 | 2016 |
Optimistic initialization for exploration in continuous control S Lobel, O Gottesman, C Allen, A Bagaria, G Konidaris Proceedings of the AAAI Conference on Artificial Intelligence 36 (7), 7612-7619, 2022 | 14 | 2022 |
Q-functionals for value-based continuous control S Lobel, S Rammohan, B He, S Yu, G Konidaris Proceedings of the AAAI Conference on Artificial Intelligence 37 (7), 8932-8939, 2023 | 8 | 2023 |
An optimal tightness bound for the simulation lemma S Lobel, R Parr arXiv preprint arXiv:2406.16249, 2024 | 2 | 2024 |
Coarse-grained smoothness for rl in metric spaces O Gottesman, K Asadi, C Allen, S Lobel, G Konidaris, M Littman arXiv preprint arXiv:2110.12276, 2021 | 2 | 2021 |
Mitigating partial observability in sequential decision processes via the lambda discrepancy C Allen, A Kirtland, RY Tao, S Lobel, D Scott, N Petrocelli, O Gottesman, ... Advances in Neural Information Processing Systems 37, 62988-63028, 2024 | 1 | 2024 |
Coarse-Grained Smoothness for Reinforcement Learning in Metric Spaces O Gottesman, K Asadi, CS Allen, S Lobel, G Konidaris, M Littman International Conference on Artificial Intelligence and Statistics, 1390-1410, 2023 | 1 | 2023 |
Reproducing “Towards Interpretable ReinforcementLearning Using Attention Augmented Agents” C Lovering, S Lobel, D Goktas, K Kwegyir-Aggrey, A Webson | 1 | 2019 |
Resolving Partial Observability in Decision Processes via the Lambda Discrepancy C Allen, AT Kirtland, RY Tao, D Scott, S Lobel, N Petrocelli, O Gottesman, ... | | |
Robust Linear Reinforcement Learning S Lobel, RY Tao, T Akbulut | | |
Q-Functionals for Efficient Value-Based Continuous Control S Rammohan, B He, S Yu, S Lobel, G Konidaris | | |