Department of Industrial Engineering & decision Analytics [Joint IEDA/ISOM] seminar - Taming "data-hungry" reinforcement learning? Stability in continuous state-action spaces

Name: Department of Industrial Engineering & decision Analytics [Joint IEDA/ISOM] seminar - Taming "data-hungry" reinforcement learning? Stability in continuous state-action spaces
Start: 2024-12-19
End: 2024-12-19
Location: Room 5583 (lift 29-30)

2024 年 12 月 19 日

10:30am - 11:30am

Room 5583 (lift 29-30)

We introduce a novel framework for analyzing reinforcement learning (RL) in continuous state-action spaces and use it to prove fast rates of convergence in both off-line and on-line settings. Our analysis highlights two key stability properties that relate to how changes in value functions and/or policies affect the Bellman operator and occupation measures. We argue that these properties are satisfied in many continuous state-action Markov decision processes. In this talk, we illustrate the intuition behind these properties through a simple inventory management example. Our framework also sheds new light on the roles of pessimism and optimism in RL. Contrary to earlier beliefs about their necessity for managing uncertainty, our work shows that while these approaches can be useful for obtaining a pilot estimate, they are not essential in later stages of training.

活动形式

研讨会, 演讲, 讲座

讲者/ 表演者:

Prof. Yaqi DUAN

New York University, Stern School of Business

Yaqi Duan joined New York University Stern School of Business as an Assistant Professor of Technology, Operations and Statistics in August 2023. She focuses on new statistical methodologies and theories to address challenges in data-driven decision-making problems. Her works find applications in business analytics, healthcare, and transportations. Prior to joining NYU Stern, she was a postdoctoral researcher at Massachusetts Institute of Technology. She obtained her PhD degree from Princeton University, Department of ORFE.

语言

英文

适合对象

教职员

研究生

主办单位

Department of Industrial Engineering & Decision Analytics

信息，商业统计及营运学系