Department of Mathematics - Seminar on statistics - HeteroJIVE: Joint Subspace Estimation for Heterogeneous Multi-View Data

10:00am - 11:00am

Supporting the below United Nations Sustainable Development Goals:支持以下聯合國可持續發展目標:支持以下联合国可持续发展目标:

Many modern datasets consist of multiple related matrices measured on a common set of units, where the goal is to recover the shared low-dimensional subspace. While the Angle-based Joint and Individual Variation Explained (AJIVE) framework provides a solution, it relies on equal-weight aggregation, which can be strictly suboptimal when views exhibit significant statistical heterogeneity (arising from varying SNR and dimensions) and structural heterogeneity (arising from individual components). In this paper, we propose HeteroJIVE, a weighted two-stage spectral algorithm tailored to such heterogeneity. Theoretically, we first revisit the “non-diminishing” error barrier with respect to the number of views $K$ identified in recent literature for the equal-weight case. We demonstrate that this barrier is not universal: under generic geometric conditions, the bias term vanishes and our estimator achieves the $O(K^{-1/2})$ rate without the need for iterative refinement. Extending this to the general-weight case, we establish error bounds that explicitly disentangle the two layers of heterogeneity. Based on this, we derive an oracle-optimal weighting scheme implemented via a data-driven procedure. Extensive simulations corroborate our theoretical findings, and an application to TCGA-BRCA multi-omics data validates the superiority of HeteroJIVE in practice.

講者/ 表演者:
Dr. Jingyang LI
University of Michigan

Jingyang Li is a Postdoc at the University of Michigan and will soon join Fudan University as an assistant professor. His research interests lie in high-dimensional statistics, with a focus on matrix and tensor learning, nonconvex optimization, and robust statistics.
Jingyang works on developing scalable algorithms and understanding their theoretical properties for large-scale data problems. He has contributed to methods involving Riemannian optimization for tensor estimation, as well as robust frameworks for high-dimensional regression and low-rank recovery. Currently, he is also exploring questions in online learning and federated learning, seeking to design adaptive algorithms that balance computational trade-offs with privacy constraints.

語言
英文
適合對象
教職員
公眾
研究生
本科生
主辦單位
數學系
新增活動
請各校內團體將活動發布至大學活動日曆。