TensorRT-LLMs/cpp/tests/unit_tests
Pamela Peng 6cdfc54883
feat: Add FP8 support for SM 120 (#3248)
* Allow FP8 on SM120

Signed-off-by: Pamela Peng <179191831+pamelap-nvidia@users.noreply.github.com>

* fix sm121

Signed-off-by: Pamela Peng <179191831+pamelap-nvidia@users.noreply.github.com>

* fix

Signed-off-by: Pamela Peng <179191831+pamelap-nvidia@users.noreply.github.com>

* fix pre-commit

Signed-off-by: Pamela Peng <179191831+pamelap-nvidia@users.noreply.github.com>

* review update

Signed-off-by: Pamela Peng <179191831+pamelap-nvidia@users.noreply.github.com>

---------

Signed-off-by: Pamela Peng <179191831+pamelap-nvidia@users.noreply.github.com>
Co-authored-by: Sharan Chetlur <116769508+schetlur-nv@users.noreply.github.com>
2025-04-14 16:05:41 -07:00
..
batch_manager feat: Allow individual gatherContext for each additional output (#3374) 2025-04-12 17:00:36 +08:00
common chore: remove usernames from comments (#3291) 2025-04-05 13:44:28 +08:00
executor feat: Allow individual gatherContext for each additional output (#3374) 2025-04-12 17:00:36 +08:00
kernels feat: Add FP8 support for SM 120 (#3248) 2025-04-14 16:05:41 -07:00
layers Feat: Variable-Beam-Width-Search (VBWS) part3 (#3338) 2025-04-08 23:51:27 +08:00
runtime feat: Introduce UB allocator for pytorch flow (#3257) 2025-04-08 18:39:49 +08:00
thop Update TensorRT-LLM (#2755) 2025-02-11 03:01:00 +00:00
utils Update TensorRT-LLM (#2755) 2025-02-11 03:01:00 +00:00
CMakeLists.txt chore: Stabilize ABI boundary for internal kernel library (#3117) 2025-04-11 15:07:50 +08:00