Department of Industrial Engineering & decision Analytics [Joint IEDA/ISOM] seminar - Taming "data-hungry" reinforcement learning? Stability in continuous state-action spaces

10:30am - 11:30am
Room 5583 (lift 29-30)

We introduce a novel framework for analyzing reinforcement learning (RL) in continuous state-action spaces and use it to prove fast rates of convergence in both off-line and on-line settings. Our analysis highlights two key stability properties that relate to how changes in value functions and/or policies affect the Bellman operator and occupation measures. We argue that these properties are satisfied in many continuous state-action Markov decision processes. In this talk, we illustrate the intuition behind these properties through a simple inventory management example. Our framework also sheds new light on the roles of pessimism and optimism in RL. Contrary to earlier beliefs about their necessity for managing uncertainty, our work shows that while these approaches can be useful for obtaining a pilot estimate, they are not essential in later stages of training.

講者/ 表演者:
Prof. Yaqi DUAN
New York University, Stern School of Business

Yaqi Duan joined New York University Stern School of Business as an Assistant Professor of Technology, Operations and Statistics in August 2023. She focuses on new statistical methodologies and theories to address challenges in data-driven decision-making problems. Her works find applications in business analytics, healthcare, and transportations. Prior to joining NYU Stern, she was a postdoctoral researcher at Massachusetts Institute of Technology. She obtained her PhD degree from Princeton University, Department of ORFE.

語言
英文
適合對象
教職員
研究生
主辦單位
Department of Industrial Engineering & Decision Analytics
資訊,商業統計及營運學系
新增活動
請各校內團體將活動發布至大學活動日曆。