TensorRT-LLMs/tensorrt_llm
jthomson04 2450188808
[None][fix] Better error message for mismatched MPI world size (#11294)
Signed-off-by: jthomson04 <jwillthomson19@gmail.com>
2026-02-16 15:37:49 -08:00
..
_tensorrt_engine
_torch [None][fix] Better error message for mismatched MPI world size (#11294) 2026-02-16 15:37:49 -08:00
bench
commands [TRTLLM-10612][feat] Initial support of AIGV models in TRTLLM (#11462) 2026-02-14 06:11:11 +08:00
evaluate [https://nvbugs/5810940][fix] Update lm_eval to 4.9.10 and re-enable Skip Softmax Attention tests on CI. (#11176) 2026-02-11 00:54:40 -05:00
executor [TRTLLM-10612][feat] Initial support of AIGV models in TRTLLM (#11462) 2026-02-14 06:11:11 +08:00
grpc [#11037][fix] Fix proto-to-SamplingParams conversion bugs and add gRPC tests (#11292) 2026-02-05 05:00:29 -05:00
inputs [#11170][fix] Fix for mm placeholder counts (#11461) 2026-02-14 09:12:03 +08:00
layers
llmapi [None][chore] Add warning about 2-model MTP deprecation (#11043) 2026-02-15 19:57:03 +08:00
metrics
models
plugin
quantization
runtime [None][feat] Use new index api, add block scale support, fix max_seq_len esitmation, add flash mla support (#11334) 2026-02-15 21:40:54 +08:00
scaffolding
serve [#11170][fix] Fix for mm placeholder counts (#11461) 2026-02-14 09:12:03 +08:00
tokenizer
tools [TRTLLM-10851][feat] Add line_profiler tool for host overhead analysis. (#11232) 2026-02-15 16:18:10 +08:00
__init__.py [https://nvbugs/5761391][fix] Include triton-kernels as a packaged dependency (#10471) 2026-01-28 19:56:32 -08:00
_common.py
_dlpack_utils.py
_ipc_utils.py
_mnnvl_utils.py
_ray_utils.py
_utils.py [TRTLLM-10487][feat] Add user-provided UUID support for multimodal KV cache identification. (#11075) 2026-02-12 00:48:47 -05:00
builder.py
disaggregated_params.py [TRTLLM-8921][feat] implement gen-first disagg_service (#11020) 2026-02-03 15:46:11 -05:00
functional.py
graph_rewriting.py
logger.py
lora_helper.py
lora_manager.py
mapping.py
math_utils.py
module.py
network.py
parameter.py
profiler.py
prompt_adapter_manager.py
python_plugin.py
ray_stub.py [TRTLLM-10612][feat] Initial support of AIGV models in TRTLLM (#11462) 2026-02-14 06:11:11 +08:00
sampling_params.py
scheduling_params.py
serialization.py [https://nvbugs/5775021] [fix] Replace pickle.load with restricted Unpickler (#10622) 2026-01-21 11:42:54 +08:00
top_model_mixin.py
version.py [None][chore] Bump version to 1.3.0rc4 (#11485) 2026-02-12 16:55:23 -05:00