TensorRT-LLMs/tensorrt_llm
Yiqing Yan dd908ae753
[None][chore] Bump version to 1.2.0rc4.post1 (#9826)
Signed-off-by: Yiqing Yan <yiqingy@nvidia.com>
2025-12-09 15:06:12 +08:00
_tensorrt_engine
_torch [TRTLLM-9660][feat] Convert cuteDSL GEMM to opt-in feature (#9682) 2025-12-06 02:24:51 -08:00
bench
commands
evaluate
executor [None][fix] enable hmac in RPC (#9745) 2025-12-07 08:24:46 +08:00
inputs [TRTLLM-9522][chore] implement default attach_multimodal_embeddings (#9664) 2025-12-05 22:12:16 -08:00
layers
llmapi [TRTLLM-9660][feat] Convert cuteDSL GEMM to opt-in feature (#9682) 2025-12-06 02:24:51 -08:00
metrics
models
plugin
quantization
runtime
scaffolding
serve [TRTLLM-8920][feat] decouple disagg service from fastapi (#8714) 2025-12-05 10:44:16 +08:00
tools
__init__.py
_common.py
_dlpack_utils.py
_ipc_utils.py
_mnnvl_utils.py
_ray_utils.py
_utils.py
builder.py
disaggregated_params.py
functional.py [TRTLLM-9086][doc] Clean up TODOs in documentation (#9292) 2025-12-05 17:50:12 -05:00
graph_rewriting.py
logger.py
lora_helper.py
lora_manager.py
mapping.py
math_utils.py
module.py
network.py
parameter.py
profiler.py
prompt_adapter_manager.py
python_plugin.py
ray_stub.py
sampling_params.py
scheduling_params.py
serialization.py
top_model_mixin.py
version.py [None][chore] Bump version to 1.2.0rc4.post1 (#9826) 2025-12-09 15:06:12 +08:00