| Name | Last commit | Date |
| --- | --- | --- |
| _torch | Add running E2E LoRA flow (#3648) | 2025-04-23 11:19:41 +08:00 |
| api_stability | Add smart router for moe (#3641) | 2025-04-23 12:21:59 +08:00 |
| disaggregated | feat: Disaggregated router class (#3584) | 2025-04-19 00:34:12 +08:00 |
| others | test: reorganize tests folder hierarchy (#2996) | 2025-03-27 12:07:53 +08:00 |
| tools | Update TensorRT-LLM (#2936) | 2025-03-18 21:25:19 +08:00 |
| trt | Unify two versions of AllReduce custom op (#3032) | 2025-04-22 21:58:42 +08:00 |
| utils | feat: Add FP8 support for SM 120 (#3248) | 2025-04-14 16:05:41 -07:00 |
| dump_checkpoint_stats.py | Update TensorRT-LLM (#2936) | 2025-03-18 21:25:19 +08:00 |
| profile_utils.py | Update TensorRT-LLM (#2936) | 2025-03-18 21:25:19 +08:00 |
| test_model_runner_cpp.py | Update TensorRT-LLM (#2936) | 2025-03-18 21:25:19 +08:00 |