MATH - Seminar on Statistics and Machine Learning - ROOT-SGD: Sharp Nonasymptotics and Asymptotic Efficiency in a Single Algorithm
11:00am - 12:00pm
https://hkust.zoom.us/j/5616960008

Supporting the below United Nations Sustainable Development Goals:支持以下聯合國可持續發展目標:支持以下联合国可持续发展目标:

The theory and practice of stochastic optimization has focused on stochastic gradient descent (SGD) in recent years, retaining the basic first-order stochastic nature of SGD while aiming to improve it via mechanisms such as averaging, momentum, and variance reduction. Improvement can be measured along various dimensions, however, and it has proved difficult to achieve improvements both in terms of nonasymptotic measures of convergence rate and asymptotic measures of distributional tightness. In this work, we consider first-order stochastic optimization from a general statistical point of view, motivating a specific form of recursive averaging of past stochastic gradients. The resulting algorithm, which we refer to as Recursive One-Over-T SGD (ROOT-SGD), matches the state-of-the-art convergence rate among online variance-reduced stochastic approximation methods. Moreover, under slightly stronger distributional assumptions, the rescaled last-iterate of ROOT-SGD converges to a zero-mean Gaussian distribution that achieves near-optimal covariance. This is a joint work with Wenlong Mou, Martin Wainwright, and Michael Jordan.

講者/ 表演者:
Dr. Junchi LI
University of California, Berkeley

Dr. Junchi Li "Chris" is currently a research scientist at University of California, Berkeley. He obtained his B.S. from Peking University and his Ph.D. from Duke University, both in mathematics. He has since held several research positions, including the roles of visiting postdoctoral research associate in the Department of Operations Research and Financial Engineering, Princeton University and of visiting researcher in Tencent Technology. His research interests include statistical machine learning and optimization, scalable online algorithms for big data analytics, and stochastic dynamics on graphs and social networks. He has published original research articles in both top optimization journals and top machine learning conferences, including an oral presentation paper (1.23%) in NeurIPS 2017 and a spotlight presentation paper (4.08%) in NeurIPS 2018.

語言
英文
適合對象
校友
教職員
研究生
本科生
主辦單位
數學系
聯絡方法
新增活動
請各校內團體將活動發布至大學活動日曆。