Tiancheng Jin - Google Scholar

Cited by

	All	Since 2019
Citations	406	406
h-index	8	8
i10-index	8	8

Year	2020	2021	2022	2023	2024
Citations	32	96	100	122	56

Public access

Available	5 articles
Not available	0 articles

Based on funding mandates

Tiancheng Jin

Ph.D. student, University of Southern California

Verified email at usc.edu

Machine Learning Theory, Online Learning Theory, RL Theory


Title	Cited by	Year
Learning adversarial Markov decision processes with bandit feedback and unknown transition. C Jin, T Jin, H Luo, S Sra, T Yu. International Conference on Machine Learning, 4860-4869, 2020	121*	2020
Deep reinforcement learning for multi-driver vehicle dispatching and repositioning problem. J Holler, R Vuorio, Z Qin, X Tang, Y Jiao, T Jin, S Singh, C Wang, J Ye. 2019 IEEE International Conference on Data Mining (ICDM), 1090-1095, 2019	114	2019
Simultaneously learning stochastic and adversarial episodic MDPs with known transition. T Jin, H Luo. Advances in Neural Information Processing Systems 33, 16557-16566, 2020	58	2020
The best of both worlds: stochastic and adversarial episodic MDPs with unknown transition. T Jin, L Huang, H Luo. Advances in Neural Information Processing Systems 34, 20491-20502, 2021	38	2021
Boosting dynamic programming with neural networks for solving NP-hard problems. F Yang, T Jin, TY Liu, X Sun, J Zhang. Asian Conference on Machine Learning, 726-739, 2018	23	2018
Learning adversarial MDPs with bandit feedback and unknown transition. C Jin, T Jin, H Luo, S Sra, T Yu. arXiv preprint arXiv:1912.01192, 2019	19	2019
Near-optimal regret for adversarial MDP with delayed bandit feedback. T Jin, T Lancewicki, H Luo, Y Mansour, A Rosenberg. Advances in Neural Information Processing Systems 35, 33469-33481, 2022	16	2022
Improved best-of-both-worlds guarantees for multi-armed bandits: FTRL with general regularizers and multiple optimal arms. T Jin, J Liu, H Luo. Advances in Neural Information Processing Systems 36, 2024	11	2024
Learning adversarial MDPs with bandit feedback and unknown transition. C Jin, T Jin, H Luo, S Sra, T Yu. 2019	5	2019
No-Regret Online Reinforcement Learning with Adversarial Losses and Transitions. T Jin, J Liu, C Rouyer, W Chang, CY Wei, H Luo. Advances in Neural Information Processing Systems 36, 2024	1	2024
Near-Optimal Regret for Adversarial MDP with Delayed Bandit Feedback. A Rosenberg, H Luo, T Jin, Y Mansour		2022
