TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-02-19 01:05:12 +08:00

jthomson04 2450188808 [None][fix] Better error message for mismatched MPI world size (#11294) Signed-off-by: jthomson04 <jwillthomson19@gmail.com>		2026-02-16 15:37:49 -08:00
..
_tensorrt_engine
_torch	[None][fix] Better error message for mismatched MPI world size (#11294)	2026-02-16 15:37:49 -08:00
bench
commands	[TRTLLM-10612][feat] Initial support of AIGV models in TRTLLM (#11462)	2026-02-14 06:11:11 +08:00
evaluate	[https://nvbugs/5810940][fix] Update lm_eval to 4.9.10 and re-enable Skip Softmax Attention tests on CI. (#11176)	2026-02-11 00:54:40 -05:00
executor	[TRTLLM-10612][feat] Initial support of AIGV models in TRTLLM (#11462)	2026-02-14 06:11:11 +08:00
grpc	[#11037][fix] Fix proto-to-SamplingParams conversion bugs and add gRPC tests (#11292)	2026-02-05 05:00:29 -05:00
inputs	[#11170][fix] Fix for mm placeholder counts (#11461)	2026-02-14 09:12:03 +08:00
layers
llmapi	[None][chore] Add warning about 2-model MTP deprecation (#11043)	2026-02-15 19:57:03 +08:00
metrics
models
plugin
quantization
runtime	[None][feat] Use new index api, add block scale support, fix max_seq_len estimation, add flash mla support (#11334)	2026-02-15 21:40:54 +08:00
scaffolding
serve	[#11170][fix] Fix for mm placeholder counts (#11461)	2026-02-14 09:12:03 +08:00
tokenizer
tools	[TRTLLM-10851][feat] Add line_profiler tool for host overhead analysis. (#11232)	2026-02-15 16:18:10 +08:00
__init__.py	[https://nvbugs/5761391][fix] Include triton-kernels as a packaged dependency (#10471)	2026-01-28 19:56:32 -08:00
_common.py
_dlpack_utils.py
_ipc_utils.py
_mnnvl_utils.py
_ray_utils.py
_utils.py	[TRTLLM-10487][feat] Add user-provided UUID support for multimodal KV cache identification. (#11075)	2026-02-12 00:48:47 -05:00
builder.py
disaggregated_params.py	[TRTLLM-8921][feat] implement gen-first disagg_service (#11020)	2026-02-03 15:46:11 -05:00
functional.py
graph_rewriting.py
logger.py
lora_helper.py
lora_manager.py
mapping.py
math_utils.py
module.py
network.py
parameter.py
profiler.py
prompt_adapter_manager.py
python_plugin.py
ray_stub.py	[TRTLLM-10612][feat] Initial support of AIGV models in TRTLLM (#11462)	2026-02-14 06:11:11 +08:00
sampling_params.py
scheduling_params.py
serialization.py	[https://nvbugs/5775021] [fix] Replace pickle.load with restricted Unpickler (#10622)	2026-01-21 11:42:54 +08:00
top_model_mixin.py
version.py	[None][chore] Bump version to 1.3.0rc4 (#11485)	2026-02-12 16:55:23 -05:00