TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-02-05 02:31:33 +08:00

History

jthomson04 2db3d7eeba [None][chore] Async Transfer Manager (#9891 ) Signed-off-by: jthomson04 <jwillthomson19@gmail.com>		2026-01-20 12:12:47 -05:00
..
_tensorrt_engine
_torch	[None][chore] Async Transfer Manager (#9891 )	2026-01-20 12:12:47 -05:00
bench
commands
evaluate	[None][feat] Support to export data in trtllm-eval (#10075 )	2026-01-15 23:27:08 +08:00
executor	[TRTLLM-9735][feat] Add processed logprobs functionality to TorchSampler (#9675 )	2026-01-16 10:52:41 -08:00
inputs
layers
llmapi	[TRTLLM-9735][feat] Add processed logprobs functionality to TorchSampler (#9675 )	2026-01-16 10:52:41 -08:00
metrics
models
plugin
quantization	[None][chore] docs: clarify LoRA is not supported with --use_fp8_rowwise in Fp8RowwiseAttention (see #2603 ) (#10320 )	2026-01-19 04:38:00 -05:00
runtime
scaffolding
serve
tokenizer
tools
__init__.py
_common.py
_dlpack_utils.py
_ipc_utils.py
_mnnvl_utils.py
_ray_utils.py
_utils.py
builder.py
disaggregated_params.py
functional.py
graph_rewriting.py
logger.py
lora_helper.py
lora_manager.py
mapping.py
math_utils.py
module.py
network.py
parameter.py
profiler.py
prompt_adapter_manager.py
python_plugin.py
ray_stub.py
sampling_params.py	[TRTLLM-9735][feat] Add processed logprobs functionality to TorchSampler (#9675 )	2026-01-16 10:52:41 -08:00
scheduling_params.py
serialization.py
top_model_mixin.py
version.py	[None][chore] Bump version to 1.3.0rc0 (#10681 )	2026-01-15 13:55:44 +08:00