Janghwan Lee

IMG_9869.jpeg

Ph.D student at Hanyang University, Seoul, Korea.

I am currently in my sixth year of the integrated Ph.D. program in Electronic Engineering at Hanyang University. My research primarily focuses on developing deep learning algorithms for efficient hardware, with an emphasis on the areas of Quantization on Transformer Model and Reduced-Precision Numerical Formats. Furthermore, I am also interested in sparsity for lightweight AI inference. I conduct my research in the Artificial Intelligence Hardware and Algorithm Lab under the guidance of Prof. Jungwook Choi.

selected publications

  1. Preprint
    ReSET: Accurate Latency-Critical NVFP4 Reasoning via Step-Aware Temperature Scaling
    Sihwa Lee*, Janghwan Lee*, Donghoon Yoo, Jae Gon Kim, Hanyul Ryu, Soojung Ryu, and Jungwook Choi
    In Preprint, 2026
  2. ICML 2026 Oral
    ReQAT: Achieving Full-Precision Reasoning Accuracy with 4-bit Floating-Point Quantization-Aware Training
    Janghwan Lee, Sihwa Lee, Jinseok Kim, Yongjik Kim, Jieun Lim, Jinwook Oh, and Jungwook Choi
    In Forty-third International Conference on Machine Learning (ICML, Oral), 2026
  3. ACL 2025 Findings
    AMXFP4: Taming Activation Outliers with Asymmetric Microscaling Floating-Point for 4-bit LLM Inference
    Janghwan Lee, Jiwoong Park, Jinseok Kim, Yongjik Kim, Jungju Oh, Jinwook Oh, and Jungwook Choi
    In Findings of the Association for Computational Linguistics (ACL Findings), 2025
  4. AAAI 2025
    RILQ: Rank-Insensitive LoRA-based Quantization Error Compensation for Boosting 2-bit Large Language Model Accuracy
    Geonho Lee*, Janghwan Lee*, Sukjin Hong*, Minsoo Kim, Euijai Ahn, Du-Seong Chang, and Jungwook Choi
    In The 39th Annual AAAI Conference on Artificial Intelligence (AAAI), 2025
  5. ACL 2024 Oral
    Improving Conversational Abilities of Quantized Large Language Models via Direct Preference Alignment
    Janghwan Lee*, Seongmin Park*, Sukjin Hong, Minsoo Kim, Du-Seong Chang, and  Jungwook Choi
    In The 62nd Annual Meeting of the Association for Computational Linguistics (ACL, Oral), 2024
  6. EMNLP 2023
    Enhancing Computation Efficiency in Large Language Models through Weight and Activation Quantization
    Janghwan Lee*, Minsoo Kim*, Seungcheol Baek, Seokjoong Hwang, Wonyong Sung, and Jungwook Choi
    In The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
  7. ICASSP 2023
    Finding Optimal Numerical Format for Sub-8-Bit Post-Training Quantization of Vision Transformers
    Janghwan Lee, Youngdeok Hwang, and Jungwook Choi
    In 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023