Guoming Zhang
|
202bed4574
|
[None][chroe] Rename TensorRT-LLM to TensorRT LLM for source code. (#7851)
Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>
Signed-off-by: Wangshanshan <30051912+dominicshanshan@users.noreply.github.com>
|
2025-09-25 21:02:35 +08:00 |
|
Yechan Kim
|
f77aca9f2c
|
[TRTLLM-7385][feat] Optimize Qwen2/2.5-VL performance (#7250)
Signed-off-by: yechank <161688079+yechank-nvidia@users.noreply.github.com>
|
2025-09-22 03:40:02 -07:00 |
|
Kaiyu Xie
|
6eef19297f
|
[None] [chore] cherry pick changes on slurm scripts from release/1.1.0rc2 (#7750)
Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
|
2025-09-16 16:07:13 +08:00 |
|
Yechan Kim
|
12102e2d48
|
[TRTLLM-6772][feat] Multimodal benchmark_serving support (#6622)
Signed-off-by: yechank <161688079+yechank-nvidia@users.noreply.github.com>
|
2025-08-12 19:34:02 -07:00 |
|
Zero Zeng
|
4b4b91ab51
|
[None][feat] improve dataloading for benchmark_dataset by using batch… (#6548)
Signed-off-by: Zero Zeng <38289304+zerollzeng@users.noreply.github.com>
|
2025-08-11 09:50:41 +08:00 |
|
Kaiyu Xie
|
7b09a415c1
|
fix: Make the bench serving script compatible with different usages (#5905)
Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
|
2025-07-10 19:36:26 +08:00 |
|
Kaiyu Xie
|
2eb6502b1d
|
feat: Add support for TRTLLM CustomDataset (#5511)
Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
|
2025-06-26 18:27:37 +08:00 |
|
Kaiyu Xie
|
c5ae3272b9
|
feat: Make benchmark_serving part of the library (#5428)
Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
|
2025-06-25 23:13:56 +08:00 |
|
Yi Zhang
|
e44f7687af
|
feat: Add no_kv_cache_reuse option and streaming support for trtllm serve bench (#4971)
Signed-off-by: Yi Zhang <187001205+yizhang-nv@users.noreply.github.com>
|
2025-06-18 13:37:31 +08:00 |
|
Pengyun Lin
|
bac22ff7b5
|
[feat] support sharegpt downloading in benchmark_serving (#4578)
Signed-off-by: Pengyun Lin <81065165+LinPoly@users.noreply.github.com>
|
2025-05-30 17:27:53 +08:00 |
|
Kaiyu Xie
|
52d4302dda
|
bench: TRTLLM-4936 Port benchmark_serving.py (#4011)
Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
Signed-off-by: jiahanc <173873397+jiahanc@users.noreply.github.com>
Co-authored-by: jiahanc <173873397+jiahanc@users.noreply.github.com>
|
2025-05-07 09:45:14 +08:00 |
|