Checklist
Motivation
as titled
https://github.com/NVIDIA/TensorRT-Model-Optimizer (ModelOpt) is the de facto LLM quantization library for FP8 and FP4, and its checkpoints are supported in both TensorRT LLM and SGLang. We will consider changing all current FP8 and FP4 docs, CI, unit tests, etc. to default to ModelOpt checkpoints.
Ref: https://huggingface.co/nvidia
Related resources
No response