|
_torch
|
[fix] Fix llama4 + eagle3 (#3998)
|
2025-05-08 19:20:27 -04:00 |
|
disaggregated
|
feat: Disaggregated router class (#3584)
|
2025-04-19 00:34:12 +08:00 |
|
llmapi
|
feat: support multi lora adapters and TP (#3885)
|
2025-05-08 23:45:45 +08:00 |
|
others
|
test: reorganize tests folder hierarchy (#2996)
|
2025-03-27 12:07:53 +08:00 |
|
tools
|
Update TensorRT-LLM (#2936)
|
2025-03-18 21:25:19 +08:00 |
|
trt
|
Unify two versions of AllReduce custom op (#3032)
|
2025-04-22 21:58:42 +08:00 |
|
utils
|
refactor: Move ModelSpec to core library (#3980)
|
2025-05-04 01:39:09 +08:00 |
|
dump_checkpoint_stats.py
|
Update TensorRT-LLM (#2936)
|
2025-03-18 21:25:19 +08:00 |
|
profile_utils.py
|
Update TensorRT-LLM (#2936)
|
2025-03-18 21:25:19 +08:00 |
|
test_model_runner_cpp.py
|
Update TensorRT-LLM (#2936)
|
2025-03-18 21:25:19 +08:00 |