| .. |
|
apps
|
feat: add health_generate route to openai serving (Cherry-pick https://github.com/NVIDIA/TensorRT-LLM/pull/3856) (#4349)
|
2025-05-22 11:46:06 +08:00 |
|
__init__.py
|
test: reorganize tests folder hierarchy (#2996)
|
2025-03-27 12:07:53 +08:00 |
|
_run_mpi_comm_task.py
|
fix: trtllm-bench build trt engine on slurm (#3825)
|
2025-04-27 22:26:23 +08:00 |
|
fake.sh
|
doc: fix path after examples migration (#3814)
|
2025-04-24 02:36:45 +08:00 |
|
run_llm_exit.py
|
Update TensorRT-LLM (#2936)
|
2025-03-18 21:25:19 +08:00 |
|
run_llm_with_postproc.py
|
chore: cleanup perf_evaluator code (#3833)
|
2025-05-19 13:21:36 +08:00 |
|
run_llm.py
|
Update (#2978)
|
2025-03-23 16:39:35 +08:00 |
|
test_build_cache.py
|
Update TensorRT-LLM (#2936)
|
2025-03-18 21:25:19 +08:00 |
|
test_executor.py
|
fix: [nvbugs/5066257] serialization improvments (#3869)
|
2025-05-23 13:06:29 +08:00 |
|
test_llm_args.py
|
test: reorganize tests folder hierarchy (#2996)
|
2025-03-27 12:07:53 +08:00 |
|
test_llm_download.py
|
test: reorganize tests folder hierarchy (#2996)
|
2025-03-27 12:07:53 +08:00 |
|
test_llm_kv_cache_events.py
|
test: add kv cache event tests for disagg workers (#3602)
|
2025-04-18 18:30:19 +08:00 |
|
test_llm_models.py
|
move the reset models into examples/models/core directory (#3555)
|
2025-04-19 20:48:59 -07:00 |
|
test_llm_multi_gpu_pytorch.py
|
feat: support multi lora adapters and TP (#3885)
|
2025-05-08 23:45:45 +08:00 |
|
test_llm_multi_gpu.py
|
Waive L0 tests (#4645)
|
2025-05-26 11:05:01 +08:00 |
|
test_llm_pytorch.py
|
add changes for fp8, nemotron-nas, API (#4180)
|
2025-05-18 23:27:25 +08:00 |
|
test_llm_quant.py
|
test: reorganize tests folder hierarchy (#2996)
|
2025-03-27 12:07:53 +08:00 |
|
test_llm_utils.py
|
chore: Partition LlmArgs into TorchLlmArgs and TrtLlmArgs (#3823)
|
2025-05-22 09:40:56 +08:00 |
|
test_llm.py
|
chore: fix bug of llama lora test (#4566)
|
2025-05-23 14:06:40 +08:00 |
|
test_mpi_session.py
|
fix: trtllm-bench build trt engine on slurm (#3825)
|
2025-04-27 22:26:23 +08:00 |
|
test_reasoning_parser.py
|
feat: add deepseek-r1 reasoning parser to trtllm-serve (#3354)
|
2025-05-06 08:13:04 +08:00 |
|
test_serialization.py
|
fix: [nvbugs/5066257] serialization improvments (#3869)
|
2025-05-23 13:06:29 +08:00 |