TensorRT-LLMs/cpp/tensorrt_llm/pybind
Wangjue Yao 9f283f330b
[None][feat] Support Mooncake transfer engine as a cache transceiver backend (#8309)
Signed-off-by: wjueyao <wyao123@terpmail.umd.edu>
Signed-off-by: Shunkang <182541032+Shunkangz@users.noreply.github.co>
Co-authored-by: Shunkang <182541032+Shunkangz@users.noreply.github.co>
2025-12-19 10:09:51 +08:00
batch_manager [https://nvbugs/5601682][fix] Fix cacheTransceiver hang (#9311) 2025-12-05 17:50:12 -05:00
common [None][feat] Add Request specific exception (#6931) 2025-09-04 18:43:42 -04:00
executor [None][feat] Support Mooncake transfer engine as a cache transceiver backend (#8309) 2025-12-19 10:09:51 +08:00
process_group [TRTLLM-7349][feat] Adding new orchestrator type -- ray (#7520) 2025-10-04 08:12:24 +08:00
runtime [None][refactor] decoding inputs, part 2 (#5799) 2025-11-18 14:38:51 +01:00
testing fix: Improve chunking test and skip empty kernel calls (#5710) 2025-07-04 09:08:15 +02:00
thop [TRTLLM-9389][chore] Rename AlltoAll backend names (#9329) 2025-11-23 13:52:57 -08:00
userbuffers [None][fix] Introduce inline namespace to avoid symbol collision (#9541) 2025-12-12 23:32:15 +08:00
bindings.cpp [None][refactor] decoding inputs, part 2 (#5799) 2025-11-18 14:38:51 +01:00
CMakeLists.txt [None][refactor] decoding inputs, part 2 (#5799) 2025-11-18 14:38:51 +01:00