This website requires JavaScript.
Explore
Help
Sign In
kanshan
/
TensorRT-LLMs
Watch
1
Star
0
Fork
0
You've already forked TensorRT-LLMs
mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced
2026-02-19 01:05:12 +08:00
Code
Issues
Actions
1
Packages
Projects
Releases
Wiki
Activity
fedd7178d1
TensorRT-LLMs
/
tensorrt_llm
History
jthomson04
2450188808
[None][fix] Better error message for mismatched MPI world size (
#11294
)
...
Signed-off-by: jthomson04 <jwillthomson19@gmail.com>
2026-02-16 15:37:49 -08:00
..
_tensorrt_engine
_torch
[None][fix] Better error message for mismatched MPI world size (
#11294
)
2026-02-16 15:37:49 -08:00
bench
commands
[TRTLLM-10612][feat] Initial support of AIGV models in TRTLLM (
#11462
)
2026-02-14 06:11:11 +08:00
evaluate
[
https://nvbugs/5810940
][fix] Update lm_eval to 4.9.10 and re-enable Skip Softmax Attention tests on CI. (
#11176
)
2026-02-11 00:54:40 -05:00
executor
[TRTLLM-10612][feat] Initial support of AIGV models in TRTLLM (
#11462
)
2026-02-14 06:11:11 +08:00
grpc
[
#11037
][fix] Fix proto-to-SamplingParams conversion bugs and add gRPC tests (
#11292
)
2026-02-05 05:00:29 -05:00
inputs
[
#11170
][fix] Fix for mm placeholder counts (
#11461
)
2026-02-14 09:12:03 +08:00
layers
llmapi
[None][chore] Add warning about 2-model MTP deprecation (
#11043
)
2026-02-15 19:57:03 +08:00
metrics
models
plugin
quantization
runtime
[None][feat] Use new index api, add block scale support, fix max_seq_len esitmation, add flash mla support (
#11334
)
2026-02-15 21:40:54 +08:00
scaffolding
serve
[
#11170
][fix] Fix for mm placeholder counts (
#11461
)
2026-02-14 09:12:03 +08:00
tokenizer
tools
[TRTLLM-10851][feat] Add line_profiler tool for host overhead analysis. (
#11232
)
2026-02-15 16:18:10 +08:00
__init__.py
[
https://nvbugs/5761391
][fix] Include triton-kernels as a packaged dependency (
#10471
)
2026-01-28 19:56:32 -08:00
_common.py
_dlpack_utils.py
_ipc_utils.py
_mnnvl_utils.py
_ray_utils.py
_utils.py
[TRTLLM-10487][feat] Add user-provided UUID support for multimodal KV cache identification. (
#11075
)
2026-02-12 00:48:47 -05:00
builder.py
disaggregated_params.py
[TRTLLM-8921][feat] implement gen-first disagg_service (
#11020
)
2026-02-03 15:46:11 -05:00
functional.py
graph_rewriting.py
logger.py
lora_helper.py
lora_manager.py
mapping.py
math_utils.py
module.py
network.py
parameter.py
profiler.py
prompt_adapter_manager.py
python_plugin.py
ray_stub.py
[TRTLLM-10612][feat] Initial support of AIGV models in TRTLLM (
#11462
)
2026-02-14 06:11:11 +08:00
sampling_params.py
scheduling_params.py
serialization.py
[
https://nvbugs/5775021
] [fix] Replace pickle.load with restricted Unpickler (
#10622
)
2026-01-21 11:42:54 +08:00
top_model_mixin.py
version.py
[None][chore] Bump version to 1.3.0rc4 (
#11485
)
2026-02-12 16:55:23 -05:00