Follow
Shih-Yang Liu
Shih-Yang Liu
PhD Student @ HKUST, NVIDIA Research
Verified email at connect.ust.hk - Homepage
Title
Cited by
Cited by
Year
DoRA: Weight-Decomposed Low-Rank Adaptation
SY Liu, CY Wang, H Yin, P Molchanov, YCF Wang, KT Cheng, MH Chen
ICML 2024 (Oral), 2024
14582024
LLM-FP4: 4-Bit Floating-Point Quantized Transformers
SY Liu, Z Liu, X Huang, P Dong, KT Cheng
EMNLP 2023 Main Conference, 2023
1472023
Hymba: A hybrid-head architecture for small language models
X Dong, Y Fu, S Diao, W Byeon, Z Chen, AS Mahabaleshwarkar, SY Liu, ...
ICLR 2025, 2024
127*2024
Gdpo: Group reward-decoupled normalization policy optimization for multi-reward rl optimization
SY Liu, X Dong, X Lu, S Diao, P Belcak, M Liu, MH Chen, H Yin, ...
ICML 2026, 2026
69*2026
Oscillation-free quantization for low-bit vision transformers
SY Liu, Z Liu, KT Cheng
ICML 2023, 21813-21824, 2023
682023
Robust and Efficient Quantization-aware Training via Coreset Selection
X Huang, Z Liu, SY Liu, KT Cheng
Transactions on Machine Learning Research, 2024
30*2024
A 28nm 0.22 μj/token memory-compute-intensity-aware cnn-transformer accelerator with hybrid-attention-based layer-fusion and cascaded pruning for semantic-segmentation
P Dong, Y Tan, X Liu, P Luo, Y Liu, L Liang, Y Zhou, D Pang, MT Yung, ...
2025 IEEE International Solid-State Circuits Conference (ISSCC) 68, 01-03, 2025
172025
CMOSE: Comprehensive Multi-Modality Online Student Engagement Dataset with High-Quality Labels
CH Wu, SY Liu, X Huang, X Wang, R Zhang, L Minciullo, WK Yiu, K Kwan, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
172024
Dler: Doing length penalty right-incentivizing more intelligence per token via reinforcement learning
SY Liu, X Dong, X Lu, S Diao, M Liu, MH Chen, H Yin, YCF Wang, ...
arXiv preprint arXiv:2510.15110, 2025
162025
Genetic quantization-aware approximation for non-linear operations in transformers
P Dong, Y Tan, D Zhang, T Ni, X Liu, Y Liu, P Luo, L Liang, SY Liu, ...
Proceedings of the 61st ACM/IEEE Design Automation Conference, 1-6, 2024
162024
EoRA: Fine-tuning-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation
SY Liu, M Khadkevich, NC Fung, C Sakr, CHH Yang, CY Wang, ...
arXiv preprint arXiv:2410.21271, 2024
15*2024
RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization
X Huang, Z Liu, SY Liu, KT Cheng
EMNLP 2024 Findings, 2024
152024
Ipr: Interaction-level preference ranking for explicit feedback
SY Liu, HH Chen, CM Chen, MF Tsai, CJ Wang
Proceedings of the 45th International ACM SIGIR Conference on Research and …, 2022
72022
Apsq: Additive partial sum quantization with algorithm-hardware co-design
Y Tan, P Dong, Y Wu, Y Liu, X Liu, P Luo, SY Liu, X Huang, D Zhang, ...
2025 62nd ACM/IEEE Design Automation Conference (DAC), 1-7, 2025
12025
System and method for fine-tuning rotated outlier-free large language models for effective weight-activation quantization
X Huang, KT CHENG, Z Liu, SY LIU
US Patent App. 19/352,500, 2026
2026
The system can't perform the operation now. Try again later.
Articles 1–15