TensorRT-LLMs/tests/unittest/trt
Barry Kang 20b42912ce
[TRTLLM-3330][feat] Support DeepSeek-R1 W4A8 on Hopper (#4123)
Support DeepSeek-R1 W4A8 on Hopper

Co-authored-by: Barry Kang <43644113+Barry-Delaney@users.noreply.github.com>
Co-authored-by: Jiang Shao <91270701+StudyingShao@users.noreply.github.com>
Signed-off-by: Barry Kang <43644113+Barry-Delaney@users.noreply.github.com>
2025-05-14 15:48:07 +08:00
..
attention feat: Add FP8 support for SM 120 (#3248) 2025-04-14 16:05:41 -07:00
functional Unify two versions of AllReduce custom op (#3032) 2025-04-22 21:58:42 +08:00
model move the reset models into examples/models/core directory (#3555) 2025-04-19 20:48:59 -07:00
model_api test: reorganize tests folder hierarchy (#2996) 2025-03-27 12:07:53 +08:00
python_plugin test: reorganize tests folder hierarchy (#2996) 2025-03-27 12:07:53 +08:00
quantization [TRTLLM-3330][feat] Support DeepSeek-R1 W4A8 on Hopper (#4123) 2025-05-14 15:48:07 +08:00
__init__.py test: reorganize tests folder hierarchy (#2996) 2025-03-27 12:07:53 +08:00