Public Research Seminar by Microelectronics Thrust, Function Hub, HKUST (GZ) - Silicon Photonic Connectivity for Efficient Neural Network Acceleration
Supporting the below United Nations Sustainable Development Goals:支持以下聯合國可持續發展目標:支持以下联合国可持续发展目标:
Deep neural networks (DNNs) have achieved unprecedented success in a variety of applications. This success is from considerable demands for data and computations, which have spurred a pronounced interest in enhancing DNN acceleration through parallelism and specialization. At the core of any DNN accelerator lies the communication network, a pivotal component that connects the numerous processing units and orchestrates the intricate data movements resulting from the strategic arrangement of computations. As contemporary metallic-based interconnects encounter escalating limitations with the progression of system scaling, we consider silicon photonic interconnects a compelling alternative and investigate the consequent paradigm shift in communication network design and dataflow optimization.
This talk describes our reevaluation of DNN characteristics within the context of silicon photonics and three accelerator architectures that we have developed. The first architecture serves as a versatile platform for easy integration of existing chip-scale DNN accelerators while the inter-chiplet communication is supported by the adaptable silicon photonic interconnects. In the second architecture, silicon photonic interconnects are utilized to encompass both inter-chiplet and intra-chiplet communications, accompanied by a complementary dataflow that spatially distributes independent multiplications while iteratively performing accumulations. The third architecture facilitates multi-DNN execution through astute optimization of hardware resource allocation and one-hop communication support between arbitrarily partitioned hardware resources.
Yuan Li is a postdoctoral associate at George Washington University. Yuan received B.S. in physics from University of Science and Technology of China in 2010, M.S. in microelectronics from University of Newcastle upon Tyne in 2011, and Ph.D. in computer engineering from George Washington University in 2022. Yuan’s research interests span hardware acceleration, machine learning, emerging technologies for computing and communication, and their intersection, including neural network accelerator, silicon photonics-based chiplet integration, and accelerator-level parallelism.
For enquiries, please contact Ms. Lina ZHOU at linalnzhou@hkust-gz.edu.cn.