| DoRA: Weight-Decomposed Low-Rank Adaptation SY Liu, CY Wang, H Yin, P Molchanov, YCF Wang, KT Cheng, MH Chen ICML 2024 (Oral), 2024 | 1458 | 2024 |
| LLM-FP4: 4-Bit Floating-Point Quantized Transformers SY Liu, Z Liu, X Huang, P Dong, KT Cheng EMNLP 2023 Main Conference, 2023 | 147 | 2023 |
| Hymba: A hybrid-head architecture for small language models X Dong, Y Fu, S Diao, W Byeon, Z Chen, AS Mahabaleshwarkar, SY Liu, ... ICLR 2025, 2024 | 127* | 2024 |
| Gdpo: Group reward-decoupled normalization policy optimization for multi-reward rl optimization SY Liu, X Dong, X Lu, S Diao, P Belcak, M Liu, MH Chen, H Yin, ... ICML 2026, 2026 | 69* | 2026 |
| Oscillation-free quantization for low-bit vision transformers SY Liu, Z Liu, KT Cheng ICML 2023, 21813-21824, 2023 | 68 | 2023 |
| Robust and Efficient Quantization-aware Training via Coreset Selection X Huang, Z Liu, SY Liu, KT Cheng Transactions on Machine Learning Research, 2024 | 30* | 2024 |
| A 28nm 0.22 μj/token memory-compute-intensity-aware cnn-transformer accelerator with hybrid-attention-based layer-fusion and cascaded pruning for semantic-segmentation P Dong, Y Tan, X Liu, P Luo, Y Liu, L Liang, Y Zhou, D Pang, MT Yung, ... 2025 IEEE International Solid-State Circuits Conference (ISSCC) 68, 01-03, 2025 | 17 | 2025 |
| CMOSE: Comprehensive Multi-Modality Online Student Engagement Dataset with High-Quality Labels CH Wu, SY Liu, X Huang, X Wang, R Zhang, L Minciullo, WK Yiu, K Kwan, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 17 | 2024 |
| Dler: Doing length penalty right-incentivizing more intelligence per token via reinforcement learning SY Liu, X Dong, X Lu, S Diao, M Liu, MH Chen, H Yin, YCF Wang, ... arXiv preprint arXiv:2510.15110, 2025 | 16 | 2025 |
| Genetic quantization-aware approximation for non-linear operations in transformers P Dong, Y Tan, D Zhang, T Ni, X Liu, Y Liu, P Luo, L Liang, SY Liu, ... Proceedings of the 61st ACM/IEEE Design Automation Conference, 1-6, 2024 | 16 | 2024 |
| EoRA: Fine-tuning-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation SY Liu, M Khadkevich, NC Fung, C Sakr, CHH Yang, CY Wang, ... arXiv preprint arXiv:2410.21271, 2024 | 15* | 2024 |
| RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization X Huang, Z Liu, SY Liu, KT Cheng EMNLP 2024 Findings, 2024 | 15 | 2024 |
| Ipr: Interaction-level preference ranking for explicit feedback SY Liu, HH Chen, CM Chen, MF Tsai, CJ Wang Proceedings of the 45th International ACM SIGIR Conference on Research and …, 2022 | 7 | 2022 |
| Apsq: Additive partial sum quantization with algorithm-hardware co-design Y Tan, P Dong, Y Wu, Y Liu, X Liu, P Luo, SY Liu, X Huang, D Zhang, ... 2025 62nd ACM/IEEE Design Automation Conference (DAC), 1-7, 2025 | 1 | 2025 |
| System and method for fine-tuning rotated outlier-free large language models for effective weight-activation quantization X Huang, KT CHENG, Z Liu, SY LIU US Patent App. 19/352,500, 2026 | | 2026 |