Department of Industrial Engineering & decision Analytics [Joint IEDA/ISOM] seminar - Taming "data-hungry" reinforcement learning? Stability in continuous state-action spaces

Name: Department of Industrial Engineering & decision Analytics [Joint IEDA/ISOM] seminar - Taming "data-hungry" reinforcement learning? Stability in continuous state-action spaces
Start: 2024-12-19
End: 2024-12-19
Location: Room 5583 (lift 29-30)

2024 年 12 月 19 日

10:30am - 11:30am

Room 5583 (lift 29-30)

Supporting the below United Nations Sustainable Development Goals:支持以下聯合國可持續發展目標：支持以下联合国可持续发展目标：

We introduce a novel framework for analyzing reinforcement learning (RL) in continuous state-action spaces and use it to prove fast rates of convergence in both off-line and on-line settings. Our analysis highlights two key stability properties that relate to how changes in value functions and/or policies affect the Bellman operator and occupation measures. We argue that these properties are satisfied in many continuous state-action Markov decision processes. In this talk, we illustrate the intuition behind these properties through a simple inventory management example. Our framework also sheds new light on the roles of pessimism and optimism in RL. Contrary to earlier beliefs about their necessity for managing uncertainty, our work shows that while these approaches can be useful for obtaining a pilot estimate, they are not essential in later stages of training.

活動形式

研討會, 演講, 講座

講者/ 表演者:

Prof. Yaqi DUAN

New York University, Stern School of Business

Yaqi Duan joined New York University Stern School of Business as an Assistant Professor of Technology, Operations and Statistics in August 2023. She focuses on new statistical methodologies and theories to address challenges in data-driven decision-making problems. Her works find applications in business analytics, healthcare, and transportations. Prior to joining NYU Stern, she was a postdoctoral researcher at Massachusetts Institute of Technology. She obtained her PhD degree from Princeton University, Department of ORFE.

語言

英文

適合對象

教職員

研究生

主辦單位

Department of Industrial Engineering & Decision Analytics

資訊，商業統計及營運學系