πŸ‘€ Biography

Final-year PhD candidate in Computer Science at the University of Toronto (UofT), advised by Prof. Gennady Pekhimenko. I build GPU and ML systems for irregular workloads, distributed inference, and compiler/runtime efficiency, with prior experience at AWS and ByteDance.

🏷️ Research Keywords

  • GPU algorithms
  • parallel algorithms
  • irregular workloads
  • systems for deep learning
  • ML systems
  • ML compilers

🧭 Research / Work / Education

πŸ’Ό Work Experience

Amazon Web Services (AWS)

Applied Scientist Intern

2024.09 - 2025.01

Santa Clara, CA, US

  • Optimized video generation workloads on AWS P4d machines by overlapping communication with computation, achieving up to 1.58Γ— speedup.

ByteDance Inc.

Research Intern

2019.02 - 2019.06

Shanghai, China

  • Explored how to integrate pretrained language models into neural machine translation by designing and evaluating multiple BERT knowledge distillation strategies.
  • Optimized the training data pipeline.

πŸ§ͺ Research Experience

University of Toronto

Research Assistant

2019.10 - 2026.04

Toronto, Canada

  • Conducted research in machine learning systems and high-performance systems under Prof. Gennady Pekhimenko.
  • Representative work includes Minuet, ScaleFusion, and StreamFusion.

2017.06 - 2018.06

Shanghai, China

  • Conducted early-stage research with Prof. Weinan Zhang and Prof. Yong Yu on reinforcement learning and learning-augmented systems.
  • Representative work includes the many-agent reinforcement learning platform MAgent.

πŸ“š Publications

Showing first-author papers by default. Switch to view the full list.

πŸ… Academic Service & Honors

  • ICML 2025, 2026 Reviewer
  • NeurIPS 2024, 2026 Reviewer
  • ACM CAIS 2026 Programming Committee
  • MLSys Artifact Evaluation 2026 Reviewer
  • EuroSys Artifact Evaluation 2025 Reviewer
  • ICLR 2025 Reviewer