| Name | Last commit | Date |
| --- | --- | --- |
| apps | [TRTLLM-5831][feat] Add LoRA support for pytorch backend in trtllm-serve (#5376) | 2025-06-29 12:46:30 +00:00 |
| __init__.py | test: reorganize tests folder hierarchy (#2996) | 2025-03-27 12:07:53 +08:00 |
| _run_mpi_comm_task.py | fix[nvbug5298640]: trtllm-llmapi-launch multiple LLM instances (#4727) | 2025-06-19 06:13:53 +08:00 |
| fake.sh | doc: fix path after examples migration (#3814) | 2025-04-24 02:36:45 +08:00 |
| run_llm_exit.py | Update TensorRT-LLM (#2936) | 2025-03-18 21:25:19 +08:00 |
| run_llm_with_postproc.py | [TRTLLM-5208][BREAKING CHANGE] chore: make pytorch LLM the default (#5312) | 2025-06-20 03:01:10 +08:00 |
| run_llm.py | [TRTLLM-5208][BREAKING CHANGE] chore: make pytorch LLM the default (#5312) | 2025-06-20 03:01:10 +08:00 |
| test_build_cache.py | Update TensorRT-LLM (#2936) | 2025-03-18 21:25:19 +08:00 |
| test_executor.py | [TRTLLM-5208][BREAKING CHANGE] chore: make pytorch LLM the default (#5312) | 2025-06-20 03:01:10 +08:00 |
| test_llm_args.py | chore: remove cuda_graph_ prefix from cuda_graph_config filed members. (#5585) | 2025-06-30 12:23:14 -04:00 |
| test_llm_download.py | [TRTLLM-5208][BREAKING CHANGE] chore: make pytorch LLM the default (#5312) | 2025-06-20 03:01:10 +08:00 |
| test_llm_kv_cache_events.py | [Infra] - Add import pytest (#5565) | 2025-06-29 11:06:14 +08:00 |
| test_llm_models.py | move the reset models into examples/models/core directory (#3555) | 2025-04-19 20:48:59 -07:00 |
| test_llm_multi_gpu_pytorch.py | test: Add json_mode_eval for guided decoding evaluation (#5179) | 2025-06-16 10:03:55 +08:00 |
| test_llm_multi_gpu.py | [Infra] - Waive failed case in post-merge (#5536) | 2025-06-27 13:55:49 +08:00 |
| test_llm_pytorch.py | [TRTLLM-6104] feat: add request_perf_metrics to LLMAPI (#5497) | 2025-06-27 17:03:05 +02:00 |
| test_llm_quant.py | [TRTLLM-5208][BREAKING CHANGE] chore: make pytorch LLM the default (#5312) | 2025-06-20 03:01:10 +08:00 |
| test_llm_utils.py | [TRTLLM-5208][BREAKING CHANGE] chore: make pytorch LLM the default (#5312) | 2025-06-20 03:01:10 +08:00 |
| test_llm.py | [Infra] - Add some timeout and unwaive a test which dev fixed (#5631) | 2025-07-01 05:01:32 -04:00 |
| test_mpi_session.py | [Infra] - Waive failed tests in post-merge and increase some timeout setting (#5424) | 2025-06-24 17:19:31 +08:00 |
| test_reasoning_parser.py | feat: add deepseek-r1 reasoning parser to trtllm-serve (#3354) | 2025-05-06 08:13:04 +08:00 |
| test_serialization.py | [TRTLLM-4971]: Use safe deserialization in ParallelConfig (#4630) | 2025-06-27 09:58:41 +08:00 |