I am a Ph.D. student at the National University of Singapore and a member of the Alibaba Platform for AI (PAI) team, jointly advised by Prof. Jialin Li and Wei Lin. My primary research interest is developing highly efficient distributed infrastructure for machine learning.
During my undergraduate studies, I had the privilege (and fun) of spending four years with the NTU HPC club, where we won the Overall Championship at SC'17 and set the LINPACK World Record.
Outside of my academic pursuits, I enjoy cooking, jogging, and skateboarding. I have even developed a menu. My Erdős number is 5.
Download my resumé.
Doctor of Philosophy, Computer Science, 2021 - Current
National University of Singapore
Bachelor of Engineering in Computer Science, 2015 - 2019
Nanyang Technological University
Visiting Student, Fall 2016
New York University
We propose an efficient parameter sharing strategy for Transformer architecture by replacing FFN with MoE layer and sharing the trainable parameters except the normalization layers. Competitive performance across CV and NLP tasks were achieved with up to 6x reduction in the numbers of unique parameters.