Department of Mathematics - Seminar on Statistics - Online Estimation and Inference for Robust Policy Evaluation in Reinforcement Learning

Name: Department of Mathematics - Seminar on Statistics - Online Estimation and Inference for Robust Policy Evaluation in Reinforcement Learning
Start: 2023-07-18
End: 2023-07-18
Location: Room 2302 (Lifts 17/18)

2023 年 7 月 18 日

4:00pm - 5:00pm

Room 2302 (Lifts 17/18)

Supporting the below United Nations Sustainable Development Goals:支持以下聯合國可持續發展目標：支持以下联合国可持续发展目标：

In this paper, we propose a robust policy evaluation algorithm in reinforcement learning, to feature outlier contamination and heavy-tailed reward distributions. We further develop a fully-online method to conduct statistical inference for the modeling parameters. Our method converges faster to the minimum asymptotic variance than the classical temporal difference (TD) learning and avoids the selection of the step sizes. Numerical experiments are provided on the effectiveness of the proposed algorithm in real-world reinforcement learning experiments, which highlight the efficiency and robustness of our approach when compared to the existing online bootstrap method. This work is joint with Jiyuan Tu (SUFE), Xi Chen (NYU), and Weidong Liu (SJTU).

活动形式

研讨会, 演讲, 讲座

讲者/ 表演者:

Prof. Yichen ZHANG

Purdue University

语言

英文

适合对象

校友

教职员

研究生

本科生

主办单位

数学系

科学及科技