kanshan/TensorRT-LLMs
Mirror of https://github.com/NVIDIA/TensorRT-LLM.git, synced 2026-01-14 06:27:45 +08:00
9587f099ac
TensorRT-LLMs/tensorrt_llm/quantization/utils
Fanrong Li, e12868bc00: [None][fix] Remove and fuse some element-wise ops in the ds-r1-fp8 model (#7238)
Signed-off-by: Fanrong Li <23290157+lfr-0531@users.noreply.github.com>
2025-08-27 10:35:38 +08:00
__init__.py    Deepseek R1 FP8 Support on Blackwell (#6486)    2025-08-01 10:26:28 +08:00
fp4_utils.py    [None] [feat] Add model gpt-oss (#6645)    2025-08-07 03:04:18 -04:00
fp8_utils.py    [None][fix] Remove and fuse some element-wise ops in the ds-r1-fp8 model (#7238)    2025-08-27 10:35:38 +08:00