TensorRT-LLMs/tensorrt_llm
jthomson04 2db3d7eeba
[None][chore] Async Transfer Manager (#9891)
Signed-off-by: jthomson04 <jwillthomson19@gmail.com>
2026-01-20 12:12:47 -05:00
..
_tensorrt_engine
_torch [None][chore] Async Transfer Manager (#9891) 2026-01-20 12:12:47 -05:00
bench
commands
evaluate [None][feat] Support to export data in trtllm-eval (#10075) 2026-01-15 23:27:08 +08:00
executor [TRTLLM-9735][feat] Add processed logprobs functionality to TorchSampler (#9675) 2026-01-16 10:52:41 -08:00
inputs
layers
llmapi [TRTLLM-9735][feat] Add processed logprobs functionality to TorchSampler (#9675) 2026-01-16 10:52:41 -08:00
metrics
models
plugin
quantization [None][chore] docs: clarify LoRA is not supported with --use_fp8_rowwise in Fp8RowwiseAttention (see #2603) (#10320) 2026-01-19 04:38:00 -05:00
runtime
scaffolding
serve
tokenizer
tools
__init__.py
_common.py
_dlpack_utils.py
_ipc_utils.py
_mnnvl_utils.py
_ray_utils.py
_utils.py
builder.py
disaggregated_params.py
functional.py
graph_rewriting.py
logger.py
lora_helper.py
lora_manager.py
mapping.py
math_utils.py
module.py
network.py
parameter.py
profiler.py
prompt_adapter_manager.py
python_plugin.py
ray_stub.py
sampling_params.py [TRTLLM-9735][feat] Add processed logprobs functionality to TorchSampler (#9675) 2026-01-16 10:52:41 -08:00
scheduling_params.py
serialization.py
top_model_mixin.py
version.py [None][chore] Bump version to 1.3.0rc0 (#10681) 2026-01-15 13:55:44 +08:00