Data Science and Analytics Seminar | Automatic Partition-based Operator Fusion through Layer by Layer Optimization

9:00am - 9:45am
Zoom ID: 987 0930 6507 Passcode: dsat

Supporting the below United Nations Sustainable Development Goals:支持以下聯合國可持續發展目標:支持以下联合国可持续发展目标:

This presentation studies fusion for deep neural networks in a just-in-time compilation framework. The framework considers both memory- and compute-bound tensor operators for fusion, and integrates graph-level node grouping and operator-level loop fusion closely, widening the fusion search space. The framework also enables the upward feedback from the downstream loop optimizer, enforcing the graph engine to regenerate partition patterns amenable to the downstream pass and thus resolving the scalability issue. Besides data locality, the framework also exploits the parallelism between independent tensor operators, further improving the performance of deep neural networks. Experimental results on training workloads show that the proposed framework can (1) outperform TensorFlow and XLA on GPUs, (2) and improve the performance of a vendor-provided deep learning framework on a domain-specific accelerator.

講者/ 表演者:
Jie ZHAO
State Key Laboratory of Mathematical Engineering and Advanced Computing (SKL-MEAC)

Jie Zhao obtained two PhD degrees, one in computer sciences from the PLA Information Engineering University in 2016, and the other in mathematics from PARKAS, a research group affiliated to the Département d’Informatique of École Normale Supérieure and INRIA Paris in 2018. He was a Lecturer (Assistant Professor) at the State Key Laboratory of Mathematical Engineering and Advanced Computing (SKL-MEAC) between July 2016 and December 2022, but he has quit this position and is looking for a new faculty position in international universities. His research interests include (1) code generation and optimization, (2) system software for deep learning, and (3) floating-point error analysis and repair. Jie published several papers as the first author in some premier compiler-related conferences and journals including PLDI, OSDI (conditionally accepted), MICRO, MLSys, PACT, CC, TACO, TOCS (accepted with minor revision). In particular, his MICRO-53 publication was nominated as one of the four best paper candidates in 2020. Jie Zhao also established good connections with the industry by having served or serving as (senior) consultants and visiting scholars for some China tech giants including Huawei Technologies, Alibaba Group and startups like Streaming Computing Co., Ltd.

語言
英文
適合對象
校友
長者
教職員
公眾
科大家庭
研究生
本科生
主辦單位
Data Science and Analytics Thrust, HKUST(GZ)
聯絡方法
新增活動
請各校內團體將活動發布至大學活動日曆。