Official repo for the paper "Squat: Quant Small Language Models on the Edge".
Follow the BabyLLaMA instructions to set up the training environment, and the BabyLM Challenge instructions to set up the evaluation environment.
- Download the dataset from the BabyLM Challenge.
- Clean the dataset following BabyLLaMA.
- Pretrain the teacher model.
- Download the FP16 LLaMA-58M model from BabyLLaMA (a loading sketch follows this list).
- Run QAT with the scripts in `distill_train/scripts/`.
- Run evaluation with the scripts in `evaluation_pipeline/`.
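
Before QAT, the FP16 teacher must be loaded so it can supervise the quantized student. A minimal sketch, not this repo's actual code: it assumes the teacher is the BabyLLaMA checkpoint on Hugging Face (the model id `timinar/baby-llama-58m` is an assumption; substitute your downloaded checkpoint path).

```python
# Minimal teacher-loading sketch (assumption: the BabyLLaMA FP16 checkpoint
# is available as "timinar/baby-llama-58m" on Hugging Face, or locally).
import torch
from transformers import AutoTokenizer, LlamaForCausalLM

MODEL_ID = "timinar/baby-llama-58m"  # assumed id; a local path also works

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
teacher = LlamaForCausalLM.from_pretrained(
    MODEL_ID,
    torch_dtype=torch.float16,  # load the FP16 teacher weights
)
teacher.eval()  # the teacher stays frozen while the quantized student trains
```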
```bibtex
@article{shen2025squat,
  title={Squat: Quant Small Language Models on the Edge},
  author={Shen, Xuan and Dong, Peiyan and Kong, Zhenglun and Gong, Yifan and Yang, Changdi and Han, Zhaoyang and Xie, Yanyue and Lu, Lei and others},
  journal={arXiv preprint arXiv:2402.10787},
  year={2025}
}
```