ABOUT ME

I am a second-year Ph.D. student at the University of Maryland, College Park, under the supervision of Prof. Abhinav Bhatele in the Parallel Software and Systems Group. My current research focuses on parallel optimization for large-scale GNNs, performance variability and collective communication.

Prior to joining UMD, I completed my Master’s degree at the Institute of Computing Technology (ICT), University of Chinese Academy of Sciences, Beijing, China, in 2023, where I was advised by Prof. Haipeng Jia. During my time at ICT, I focused on irregular matrix multiplication, resulting in first-author publications in TPDS, ICPP, and HPCC.

My research interests include high-performance computing, parallel algorithms, and optimizing AI workloads for modern hardware. I have been honored with awards such as the National Scholarship of China and Outstanding Graduate of Beijing.

For additional details, please refer to his Full CV.

EDUCATION BACKGROUND

  • Ph.D. in Computer Science, University of Maryland, College Park, 09/2024 - Present
  • M. S. in Computer Technology, Institute of Computing Technology (ICT), Chinese Academy of Sciences, 09/2020 - 06/2023
  • B. S. in Mathematics and Applied Mathematics, Zhengzhou University, 09/2016 - 06/2020

PUBLICATIONS

  • Cunyang Wei, Rishi Keshav Pradeep, Abhinav Bhatele. Unmasking Performance Variability in GPU Codes on Production Supercomputers. Poster at 2025 International Conference for High Performance Computing, Networking, Storage, and Analysis (SC), 2025.
  • Aditya K. Ranjan, Siddharth Singh, Cunyang Wei, Abhinav Bhatele. Plexus: Taming Billion-edge Graphs with 3D Parallel GNN Training. Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis (SC), 2025.
  • Cunyang Wei, Haipeng Jia, Yunquan Zhang, Jianyu Yao, Chendi Li, Wenxuan Cao. 2024. IrGEMM: An Input-Aware Tuning Framework for Irregular GEMM on ARM and X86 CPUs. IEEE Transactions on Parallel and Distributed Systems (TPDS).
  • Luhan Wang, Haipeng Jia, Lei Xu, Cunyang Wei, Kun Li, Xianmeng Jiang, Yunquan Zhang. 2024. VNEC: A Vectorized Non-Empty Column Format for SpMV on CPUs. 2024 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2024.
  • Rongyuan Guo, Haipeng Jia, Yuanquan Zhang, Mingsen Deng, Cunyang Wei, et al. SA_TRSM: A Shape-Aware Auto-Tuning Framework for Small-Scale Irregular-Shaped TRSM. IEEE 29th International Conference on Parallel and Distributed Systems (ICPADS), 2023.
  • Cunyang Wei, Haipeng Jia, Yunquan Zhang, Liusha Xu, and Ji Qi. 2022. IATF: An Input-Aware Tuning Framework for Compact BLAS Based on ARMv8 CPUs. In 51st International Conference on Parallel Processing (ICPP), 2022.
  • Cunyang Wei, Haipeng Jia, Yunquan Zhang, Kun Li, Luhan Wang. 2022. LBBGEMM: A Load-Balanced Batch GEMM Framework on ARM CPUs. The 24th IEEE International Conference on High Performance Computing & Communications (HPCC), 2022.
  • Luhan Wang, Haipeng Jia, Yunquan Zhang, Kun Li, Cunyang Wei. 2022. EgpuIP: An Embedded GPU Accelerated Library for Image Processing. The 24th IEEE International Conference on High Performance Computing & Communications (HPCC), 2022.

HONORS AND AWARDS

  • 2025 MVAPICH User Group Conference Travel Grant
  • 2024 Dean’s Fellowship, University of Maryland, College Park
  • 2023 Outstanding Graduate of Beijing, Beijing Municipal Education Commission
  • 2023 Outstanding Graduate, University of Chinese Academy of Sciences (Top 3%)
  • 2022 National Scholarship for Postgraduates, Ministry of Education of the People’s Republic of China
  • 2022 First Prize Scholarship, University of Chinese Academy of Sciences
  • 2022 Merit Student, University of Chinese Academy of Sciences
  • 2017 First Prize Scholarship, Zhengzhou University
  • 2017 Merit Student, Zhengzhou University

PROFESSIONAL SERVICE

  • TPC reviewer for IEEE ISPA’25
  • Session Chair for IEEE HPCC’22