Information Distinguished Lecture Series - A Complex Spectral Mapping Approach to Spectrospatial Filtering

10:30am - 12:00pm
E4-1F-147

Supporting the below United Nations Sustainable Development Goals:支持以下聯合國可持續發展目標:支持以下联合国可持续发展目标:

We present a complex spectral mapping approach to multi-channel speech separation, which trains a deep neural network (DNN) to directly estimate the real and imaginary spectrograms of the target signal from those of the multi-channel noisy mixture. In this approach, the trained DNN itself becomes a nonlinear, time-varying spectrospatial filter. How does this conceptually simple approach perform relative to commonly-used beamforming techniques on different array configurations and in different acoustic environments? We examine this issue systematically on speech dereverberation, speech enhancement, and speaker separation tasks. Comprehensive evaluations show that multi-channel complex spectral mapping achieves speech separation performance superior to beamforming-based spatial filtering for different array geometries, and reduces to monaural speech separation with single-channel recordings, demonstrating its versatility for multi-channel and single-channel speech separation. In addition, such an approach is computationally more efficient than mask-based beamforming. Recent developments in DNN architecture employ global self-attention, and leverage cross-band and narrow-band correlations. These developments elevate speech separation performance by a considerable margin.

讲者/ 表演者:
DeLiang Wang
Ohio State University

DeLiang Wang received the B.S. degree and the M.S. degree from Peking (Beijing) University and the Ph.D. degree in 1991 from the University of Southern California all in computer science. Since 1991, he has been with the Department of Computer Science & Engineering and the Center for Cognitive and Brain Sciences at The Ohio State University, where he is a Professor and University Distinguished Scholar. He received the U.S. Office of Naval Research Young Investigator Award in 1996, the 2008 Helmholtz Award from the International Neural Network Society, the 2007 Outstanding Paper Award of the IEEE Computational Intelligence Society and the 2019 Best Paper Award of the IEEE Signal Processing Society. He is a Fellow of IEEE, ISCA, and AAIA, and currently serves as Editor-in-Chief of Neural Networks.

语言
英文
适合对象
校友
长者
教职员
公众
科大家庭
研究生
本科生
主办单位
Information Hub, HKUST(GZ)
新增活动
请各校内团体将活动发布至大学活动日历。