Ph.D. Student @ SNU VLSI Lab
Electrical and Computer Engineering, Seoul National University
Building efficient AI systems across models, hardware, and deployment.
Efficient AI systems across model optimization, hardware-aware acceleration, and practical deployment.
Quantization, pruning, sparsity, and token-efficient computation for scalable AI.
Hardware-aware methods that turn algorithmic efficiency into practical speedups.
Serving and deployment paths that reduce inference cost in real workloads.
Recent updates.
Token Sparse Attention is accepted to ICML 2026.
Research on efficient inference, reasoning acceleration, and practical system optimization for large language models.
arXiv preprint arXiv:2510.14211
Electrical and Computer Engineering, Seoul National University
Advisor: Jae-Joon Kim
Electrical and Computer Engineering, Seoul National University
Summa Cum Laude 3.9/4.3
Seoul National University
Dept. of Electrical and Computer Engineering, Seoul National University
Republic of Korea Army