PLUM: Improving Inference Efficiency By Leveraging Repetition-Sparsity Trade-Off

Project Page | Paper | Video

PyTorch implementation of PLUM, a quantization-system co-design framework aimed to improve inference efficiency of deep neural networks.

PLUM: Improving Inference Efficiency By Leveraging Repetition-Sparsity Trade-Off
Sachit Kuhar, Yash Jain, Alexey Tumanov
Georgia Institute of Technology

BibTeX

@article{
kuhar2024plum,
title={{PLUM}: Improving Inference Efficiency By Leveraging Repetition-Sparsity Trade-Off},
author={Sachit Kuhar and Yash Jain and Alexey Tumanov},
journal={Transactions on Machine Learning Research},
issn={2835-8856},
year={2024},
url={https://openreview.net/forum?id=IEKtMMSblm},
note={}
}

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
data		data
examples/imagenet		examples/imagenet
quant		quant
LICENSE		LICENSE
README.md		README.md
ffcv_env.yaml		ffcv_env.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PLUM: Improving Inference Efficiency By Leveraging Repetition-Sparsity Trade-Off

Project Page | Paper | Video

BibTeX

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

PLUM: Improving Inference Efficiency By Leveraging Repetition-Sparsity Trade-Off

Project Page | Paper | Video

BibTeX

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages