TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-31 00:01:22 +08:00

History

Kaiyu Xie 2631f21089 Update (#2978 ) Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>		2025-03-23 16:39:35 +08:00
..
auto_deploy	Update (#2978 )	2025-03-23 16:39:35 +08:00
compilation	Update (#2978 )	2025-03-23 16:39:35 +08:00
modeling	Update (#2978 )	2025-03-23 16:39:35 +08:00
multi_gpu	Update (#2978 )	2025-03-23 16:39:35 +08:00
multi_gpu_modeling	Update (#2978 )	2025-03-23 16:39:35 +08:00
speculative	Update TensorRT-LLM (#2936 )	2025-03-18 21:25:19 +08:00
thop	Update (#2978 )	2025-03-23 16:39:35 +08:00
helpers.py	Update TensorRT-LLM (#2936 )	2025-03-18 21:25:19 +08:00
pattern_watcher.py	Update TensorRT-LLM (#2936 )	2025-03-18 21:25:19 +08:00
test_attention.py	Update (#2978 )	2025-03-23 16:39:35 +08:00
test_autotuner.py	Update (#2978 )	2025-03-23 16:39:35 +08:00
test_flashinfer_attention.py	Update (#2978 )	2025-03-23 16:39:35 +08:00
test_flashinfer_star_attn.py	Update (#2978 )	2025-03-23 16:39:35 +08:00
test_fp4_bmm_quantize.py	Update TensorRT-LLM (#2936 )	2025-03-18 21:25:19 +08:00
test_fp4_gemm_quantize.py	Update TensorRT-LLM (#2936 )	2025-03-18 21:25:19 +08:00
test_fp4_linear.py	Update TensorRT-LLM (#2936 )	2025-03-18 21:25:19 +08:00
test_fp8_block_scale_gemm.py	Update TensorRT-LLM (#2936 )	2025-03-18 21:25:19 +08:00
test_fp8_linear.py	Update TensorRT-LLM (#2936 )	2025-03-18 21:25:19 +08:00
test_fp8_quantize.py	Update TensorRT-LLM (#2936 )	2025-03-18 21:25:19 +08:00
test_fused_moe.py	Update TensorRT-LLM (#2936 )	2025-03-18 21:25:19 +08:00
test_moe_routing.py	Update TensorRT-LLM (#2936 )	2025-03-18 21:25:19 +08:00
test_moe.py	Update TensorRT-LLM (#2936 )	2025-03-18 21:25:19 +08:00
test_overlap_scheduler_input.json	Update TensorRT-LLM (#2936 )	2025-03-18 21:25:19 +08:00
test_overlap_scheduler.py	Update TensorRT-LLM (#2936 )	2025-03-18 21:25:19 +08:00
test_pytorch_model_engine.py	Update (#2978 )	2025-03-23 16:39:35 +08:00
test_vanilla_attention.py	Update (#2978 )	2025-03-23 16:39:35 +08:00