Rethinking the theoretical foundation of reinforcement learning

Instructor:Nan Jiang (University of Illinois at Urbana-Champaign)
Schedule:Thur., 16:00-17:00, July 23, 2026
Venue:C548, Shuangqing Complex Building A; Zoom Meeting ID: 271 534 5558 Passcode: YMSC
Date:2026-07-23

Abstract: Given two candidate functions, can we identify which one is the true value function of a large Markov decision process (MDP), given a "benign" dataset? Trivial as it might seem, a version of the question was open for 20+ years in reinforcement learning (RL), and the core difficulties are intimately related to the training instability of modern deep RL. In this talk, I will argue that by rethinking fundamental questions like this, RL theory can provide unique perspectives and solutions to practically relevant problems that are critical to the deployment of RL in real-world scenarios.

Bio: Nan Jiang is an Associate Professor of Computer Science at University of Illinois at Urbana-Champaign. Prior to joining UIUC, he was a postdoc researcher at Microsoft Research NYC. He received his PhD in Computer Science and Engineering at University of Michigan. His research focuses on the theory of reinforcement learning. He coauthors a monograph on RL theory and holds editorial positions in the research community, including Action Editor for JMLR, Editor for FnT in ML, and Senior Area Chairs for ICML and ICLR. His contributions are recognized by Best Paper Award in AAMAS 2015, Outstanding Paper Runner-up in ICML 2022, Adobe Data Science Award in 2021, NSF CAREER Award in 2022, Google Research Scholarship in 2024, and Sloan Research Fellowship in 2024.