TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-02-08 20:21:48 +08:00

History

Yiqing Yan dd908ae753 [None][chore] Bump version to 1.2.0rc4.post1 (#9826 ) Signed-off-by: Yiqing Yan <yiqingy@nvidia.com>		2025-12-09 15:06:12 +08:00
..
_tensorrt_engine
_torch	[TRTLLM-9660][feat] Convert cuteDSL GEMM to opt-in feature (#9682 )	2025-12-06 02:24:51 -08:00
bench
commands
evaluate
executor	[None][fix] enable hmac in RPC (#9745 )	2025-12-07 08:24:46 +08:00
inputs	[TRTLLM-9522][chore] implement default `attach_multimodal_embeddings` (#9664 )	2025-12-05 22:12:16 -08:00
layers
llmapi	[TRTLLM-9660][feat] Convert cuteDSL GEMM to opt-in feature (#9682 )	2025-12-06 02:24:51 -08:00
metrics
models
plugin
quantization
runtime
scaffolding
serve	[TRTLLM-8920][feat] decouple disagg service from fastapi (#8714 )	2025-12-05 10:44:16 +08:00
tools
__init__.py
_common.py
_dlpack_utils.py
_ipc_utils.py
_mnnvl_utils.py
_ray_utils.py
_utils.py
builder.py
disaggregated_params.py
functional.py	[TRTLLM-9086][doc] Clean up TODOs in documentation (#9292 )	2025-12-05 17:50:12 -05:00
graph_rewriting.py
logger.py
lora_helper.py
lora_manager.py
mapping.py
math_utils.py
module.py
network.py
parameter.py
profiler.py
prompt_adapter_manager.py
python_plugin.py
ray_stub.py
sampling_params.py
scheduling_params.py
serialization.py
top_model_mixin.py
version.py	[None][chore] Bump version to 1.2.0rc4.post1 (#9826 )	2025-12-09 15:06:12 +08:00