TensorRT-LLMs/cpp/tests/unit_tests
Patrice Castonguay 9b0f45298f
[None][feat] Have ability to cancel disagg request if KV cache resource are exhausted (#9155)
Signed-off-by: Patrice Castonguay <55748270+pcastonguay@users.noreply.github.com>
2025-11-18 20:59:17 -05:00
..
batch_manager [None][fix] Fix KV cache manager test warnings (#9103) 2025-11-13 07:23:04 -08:00
common chore: remove usernames from comments (#3291) 2025-04-05 13:44:28 +08:00
executor [None][feat] Have ability to cancel disagg request if KV cache resource are exhausted (#9155) 2025-11-18 20:59:17 -05:00
kernels [None][feat] Update TRTLLM MoE cubins; reduce mxfp4 weight padding requirement; tighten TMA bound (#9025) 2025-11-17 10:04:29 +08:00
layers [None][feat] Support ignored prompt length for penalties via new sampling config parameter (#8127) 2025-10-27 13:12:31 -04:00
multi_gpu [TRTLLM-8540][feat] Add support for disagg in DSv3.2 (#8735) 2025-11-12 08:21:11 -08:00
runtime [None][refactor] decoding inputs, part 2 (#5799) 2025-11-18 14:38:51 +01:00
thop Update TensorRT-LLM (#2755) 2025-02-11 03:01:00 +00:00
utils Update TensorRT-LLM (#2755) 2025-02-11 03:01:00 +00:00
CMakeLists.txt [None] [ci] Reorganize CMake and Python integration test infrastructure for C++ tests (#6754) 2025-08-24 20:53:17 +02:00