TensorRT-LLMs/tensorrt_llm/quantization/utils
Necofish 03cdf5804f
[None][fix] impl fused triton kernel for e8m0 resmooth to reduce memory footprint (#10327)
Signed-off-by: Nekofish-L <liuxiangyang@mail.ustc.edu.cn>
Co-authored-by: Kanghwan <861393+karljang@users.noreply.github.com>
2026-01-15 22:13:18 -08:00
..
__init__.py Deepseek R1 FP8 Support on Blackwell (#6486) 2025-08-01 10:26:28 +08:00
fp4_utils.py [None] [feat] Add model gpt-oss (#6645) 2025-08-07 03:04:18 -04:00
fp8_utils.py [None][fix] impl fused triton kernel for e8m0 resmooth to reduce memory footprint (#10327) 2026-01-15 22:13:18 -08:00