Department of Mathematics - Seminar on Statistics - Heteroskedastic Sparse PCA in High Dimensions

11:00am - 12:00pm
Room 4475 (Lifts 25/26)

Principal component analysis (PCA) is one of the most commonly used techniques for dimension reduction and feature extraction. Although sparse PCA has been well studied in high dimensions, little is known about the case where the noise is heteroskedastic, which turns out to be ubiquitous in many scenarios, such as biological sequencing data and information network data. We propose an iterative algorithm for sparse PCA in the presence of heteroskedastic noise that alternates between two steps: one updates the estimates of the sparse eigenvectors using the power method with adaptive thresholding, and the other imputes the diagonal values of the sample covariance matrix to reduce the estimation bias due to heteroskedasticity. Our procedure is computationally fast and provably optimal under the generalized spiked covariance model, assuming the leading eigenvectors are sparse. A comprehensive simulation study demonstrates its robustness and effectiveness in various settings.
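
The alternating scheme described in the abstract can be illustrated with a minimal rank-one sketch. This is not the speaker's algorithm: the function names, the fixed hard-threshold level `tau` (a simple stand-in for the adaptive thresholding rule, which the abstract does not specify), the diagonal-deleted initialization, and the simulated data are all assumptions made for illustration.

```python
import numpy as np


def simulate_spiked(n=2000, p=50, k=5, lam=20.0, seed=0):
    """Hypothetical helper: rank-one spiked model with unequal noise variances."""
    rng = np.random.default_rng(seed)
    v = np.zeros(p)
    v[:k] = 1.0 / np.sqrt(k)                           # sparse, unit-norm eigenvector
    noise_sd = np.sqrt(rng.uniform(0.5, 2.0, size=p))  # heteroskedastic noise levels
    scores = rng.standard_normal(n) * np.sqrt(lam)     # latent factor with variance lam
    X = np.outer(scores, v) + rng.standard_normal((n, p)) * noise_sd
    return X, v


def hetero_sparse_pca_rank1(X, tau, n_outer=20, n_power=5):
    """Illustrative sketch: thresholded power steps alternated with diagonal imputation."""
    n, _ = X.shape
    Xc = X - X.mean(axis=0)
    S = Xc.T @ Xc / n                                  # sample covariance
    M = S.copy()
    np.fill_diagonal(M, 0.0)                           # assumed start: drop the biased diagonal
    v = np.linalg.eigh(M)[1][:, -1]                    # eigenvector of the largest eigenvalue
    lam_hat = v @ M @ v
    for _ in range(n_outer):
        # Step 1: power iterations with hard thresholding at level tau.
        for _ in range(n_power):
            w = M @ v
            w_thr = np.where(np.abs(w) >= tau, w, 0.0)
            if not np.any(w_thr):                      # guard: threshold removed everything
                w_thr = w
            v = w_thr / np.linalg.norm(w_thr)
        # Step 2: impute the diagonal from the current rank-one fit;
        # off-diagonal entries stay equal to the sample covariance.
        lam_hat = v @ M @ v
        np.fill_diagonal(M, lam_hat * v**2)
    return v, lam_hat
```

A call such as `v_hat, lam_hat = hetero_sparse_pca_rank1(X, tau=1.0)` on data from `simulate_spiked()` returns a unit-norm sparse vector and an eigenvalue estimate. The sign of `v_hat` is arbitrary, so compare it with the true eigenvector through the absolute inner product.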

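Speaker / Performer:
Prof. Zhao REN
University of Pittsburgh
Language
English
Recommended For
Alumni
Faculty and staff
Postgraduate students
Undergraduate students
Organizer
Department of Mathematics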