TensorRT-LLMs/cpp/tensorrt_llm
Xiwen Yu 8b532363ce Merge remote-tracking branch 'gitlab/main' into user/xiweny/merge_main_0819
Signed-off-by: Xiwen Yu <13230610+VALLIS-NERIA@users.noreply.github.com>
2025-08-19 17:02:34 +08:00
..
batch_manager [https://nvbugs/5394392][fix] Enlarge scheduler capacity under disagg bs == 1 (#6537) 2025-08-15 09:52:06 -07:00
common Merge remote-tracking branch 'gitlab/main' into user/xiweny/merge_main_0819 2025-08-19 17:02:34 +08:00
cutlass_extensions/include/cutlass_extensions Merge remote-tracking branch 'gitlab/main' into user/xiweny/merge_main_0819 2025-08-19 17:02:34 +08:00
deep_ep Merge remote-tracking branch 'gitlab/main' into user/xiweny/merge_main_0819 2025-08-19 17:02:34 +08:00
deep_gemm [https://nvbugs/5433581][fix] DeepGEMM installation on SBSA (#6588) 2025-08-06 16:44:21 +08:00
executor Merge remote-tracking branch 'gitlab/main' into user/xiweny/merge_main_0819 2025-08-19 17:02:34 +08:00
executor_worker Update TensorRT-LLM (#2792) 2025-02-18 21:27:39 +08:00
kernels Merge remote-tracking branch 'gitlab/main' into user/xiweny/merge_main_0819 2025-08-19 17:02:34 +08:00
layers refactor: Remove enforced sorted order of batch slots (#3502) 2025-07-14 17:23:02 +02:00
nanobind [None][fix] Clean up linking to CUDA stub libraries in build_wheel.py (#6823) 2025-08-18 11:20:51 -04:00
plugins [https://nvbugs/5302040][feat] Add whisper support (Bert Attention on SM100 and GPTAttention for cross attention on SM100) (#5527) 2025-08-13 11:19:13 -07:00
pybind [None][fix] Clean up linking to CUDA stub libraries in build_wheel.py (#6823) 2025-08-18 11:20:51 -04:00
runtime Merge remote-tracking branch 'gitlab/main' into user/xiweny/merge_main_0819 2025-08-19 17:02:34 +08:00
testing fix: Improve chunking test and skip empty kernel calls (#5710) 2025-07-04 09:08:15 +02:00
thop Merge remote-tracking branch 'gitlab/main' into user/xiweny/merge_main_0819 2025-08-19 17:02:34 +08:00
CMakeLists.txt [https://nvbugs/5433581][fix] DeepGEMM installation on SBSA (#6588) 2025-08-06 16:44:21 +08:00