Botao Hao

Процитировано

	Все	Начиная с 2019 г.
Статистика цитирования	728	716
h-индекс	16	16
i10-индекс	23	21

220

110

165

20182019202020212022202320244 16 40 107 160 220 167

Общий доступ

Просмотреть все

11 статей

0 статей

доступно

недоступно

На основе финансирования

Соавторы

Csaba SzepesvariDeepMind & University of AlbertaПодтвержден адрес электронной почты в домене cs.ualberta.ca
Tor LattimoreDeepMindПодтвержден адрес электронной почты в домене google.com
Zheng WenGoogle DeepMindПодтвержден адрес электронной почты в домене google.com
Yasin Abbasi YadkoriGoogle DeepMindПодтвержден адрес электронной почты в домене google.com
Mengdi WangCenter for Statistics & Machine Learning, ECE, Princeton UniversityПодтвержден адрес электронной почты в домене princeton.edu
Will Wei SunAssociate Professor, Daniels School of Business, Purdue UniversityПодтвержден адрес электронной почты в домене purdue.edu
Nevena LazicDeepMindПодтвержден адрес электронной почты в домене google.com
Benjamin Van RoyStanford UniversityПодтвержден адрес электронной почты в домене stanford.edu
Jingfei ZhangEmory UniveristyПодтвержден адрес электронной почты в домене emory.edu
Anru ZhangDuke UniversityПодтвержден адрес электронной почты в домене duke.edu
尚作峰 (Zuofeng Shang)New Jersey Institute of TechnologyПодтвержден адрес электронной почты в домене njit.edu
Yufeng LiuUniversity of North Carolina at Chapel HillПодтвержден адрес электронной почты в домене email.unc.edu

Botao Hao

OpenAI

Подтвержден адрес электронной почты в домене openai.com - Главная страница

reinforcement learning multi-armed bandits RLHF


Название По числу цитат По году По названию	Процитировано Процитировано	Год
Simultaneous clustering and estimation of heterogeneous graphical models B Hao, WW Sun, Y Liu, G Cheng Journal of Machine Learning Research 18 (217), 1-58, 2018	75	2018
Adaptive exploration in linear contextual bandit B Hao, T Lattimore, C Szepesvari International Conference on Artificial Intelligence and Statistics, 3536-3545, 2020	66	2020
Sparse and low-rank tensor estimation via cubic sketchings B Hao, AR Zhang, G Cheng International conference on artificial intelligence and statistics, 1319-1330, 2020	61	2020
Bootstrapping upper confidence bound B Hao, Y Abbasi-Yadkori, Z Wen, G Cheng 33rd Conference on Neural Information Processing Systems, 2019	61	2019
High-dimensional sparse linear bandits B Hao, T Lattimore, M Wang 34th Conference on Neural Information Processing Systems, 2020	60	2020
Bootstrapping fitted q-evaluation for off-policy inference B Hao, X Ji, Y Duan, H Lu, C Szepesvari, M Wang International Conference on Machine Learning, 4074-4084, 2021	38	2021
Sparse feature selection makes batch reinforcement learning more sample efficient B Hao, Y Duan, T Lattimore, C Szepesvári, M Wang International Conference on Machine Learning, 4063-4073, 2021	36	2021
Online sparse reinforcement learning B Hao, T Lattimore, C Szepesvári, M Wang International Conference on Artificial Intelligence and Statistics, 316-324, 2021	30	2021
Sparse tensor additive regression B Hao, B Wang, P Wang, J Zhang, J Yang, WW Sun Journal of machine learning research 22 (64), 1-43, 2021	28	2021
Adaptive approximate policy iteration B Hao, N Lazic, Y Abbasi-Yadkori, P Joulani, C Szepesvari Proceedings of the 24th International Conference on Artificial Intelligence …, 2020	27*	2020
Efficient local planning with linear function approximation D Yin, B Hao, Y Abbasi-Yadkori, N Lazić, C Szepesvári International Conference on Algorithmic Learning Theory, 1165-1192, 2022	25	2022
Residual bootstrap exploration for bandit algorithms CH Wang, Y Yu, B Hao, G Cheng arXiv preprint arXiv:2002.08436, 2020	20	2020
Information directed sampling for sparse linear bandits B Hao, T Lattimore, W Deng Advances in Neural Information Processing Systems 34, 16738-16750, 2021	19	2021
The neural testbed: Evaluating joint predictions I Osband, Z Wen, SM Asghari, V Dwaracherla, X Lu, M Ibrahimi, ... Advances in Neural Information Processing Systems 35, 12554-12565, 2022	17	2022
Regret Bounds for Information-Directed Reinforcement Learning B Hao, T Lattimore Advances in Neural Information Processing Systems, 2022	17	2022
Contextual information-directed sampling B Hao, T Lattimore, C Qin International Conference on Machine Learning, 8446-8464, 2022	16	2022
Bootstrapping Statistical Inference for Off-Policy Evaluation B Hao, X Ji, Y Duan, H Lu, C Szepesvári, M Wang arXiv preprint arXiv:2102.03607, 2021	16	2021
Interacting Contour Stochastic Gradient Langevin Dynamics W Deng, S Liang, B Hao, G Lin, F Liang The Tenth International Conference on Learning Representations, 2022	13	2022
Bandit phase retrieval T Lattimore, B Hao Advances in Neural Information Processing Systems 34, 18801-18811, 2021	11	2021
Low-rank tensor bandits B Hao, J Zhou, Z Wen, WW Sun arXiv e-prints, arXiv: 2007.15788, 2020	11	2020

В данный момент система не может выполнить эту операцию. Повторите попытку позднее.

Статьи 1–20

Ссылок за год

Повторяющиеся цитирования

Объединенные цитирования

СоавторыСоавторы

Подписаться

Процитировано

Соавторы