TensorRT-LLMs/tensorrt_llm/llmapi
Shunkangz dda7354d1a
Refactor return of first gen token in PD (#2986)
Signed-off-by: Shunkang <182541032+Shunkangz@users.noreply.github.co>
2025-04-01 12:28:27 +08:00
..
__init__.py Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
_perf_evaluator.py Update TensorRT-LLM (#2849) 2025-03-04 18:44:00 +08:00
build_cache.py Update TensorRT-LLM (#2413) 2024-11-05 16:27:06 +08:00
disagg_utils.py Update (#2978) 2025-03-23 16:39:35 +08:00
llm_args.py Add initial EAGLE-3 implementation (#3035) 2025-03-29 22:31:24 +08:00
llm_utils.py Update (#2978) 2025-03-23 16:39:35 +08:00
llm.py Refactor return of first gen token in PD (#2986) 2025-04-01 12:28:27 +08:00
mgmn_leader_node.py Update (#2978) 2025-03-23 16:39:35 +08:00
mgmn_worker_node.py Update TensorRT-LLM (#2333) 2024-10-15 15:28:40 +08:00
mpi_session.py Update (#2978) 2025-03-23 16:39:35 +08:00
tokenizer.py Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
tracer.py Update TensorRT-LLM (#2413) 2024-11-05 16:27:06 +08:00
trtllm-llmapi-launch Update (#2978) 2025-03-23 16:39:35 +08:00
utils.py Update TensorRT-LLM (#2755) 2025-02-11 03:01:00 +00:00