TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-02-04 18:21:52 +08:00

History

Necofish 03cdf5804f [None][fix] impl fused triton kernel for e8m0 resmooth to reduce memory footprint (#10327 ) Signed-off-by: Nekofish-L <liuxiangyang@mail.ustc.edu.cn> Co-authored-by: Kanghwan <861393+karljang@users.noreply.github.com>		2026-01-15 22:13:18 -08:00
..
__init__.py	Deepseek R1 FP8 Support on Blackwell (#6486 )	2025-08-01 10:26:28 +08:00
fp4_utils.py	[None] [feat] Add model gpt-oss (#6645 )	2025-08-07 03:04:18 -04:00
fp8_utils.py	[None][fix] impl fused triton kernel for e8m0 resmooth to reduce memory footprint (#10327 )	2026-01-15 22:13:18 -08:00