|
common
|
feat: chunked prefill for MLA (Blackwell) (#4651)
|
2025-06-26 09:01:00 +08:00 |
|
executor_worker
|
Update TensorRT-LLM (#2792)
|
2025-02-18 21:27:39 +08:00 |
|
kernels
|
Fix GEMM+AR nvbugs 5219533,5127801,5072306 (#5969)
|
2025-07-11 10:22:02 -07:00 |
|
plugins
|
Fix GEMM+AR nvbugs 5219533,5127801,5072306 (#5969)
|
2025-07-11 10:22:02 -07:00 |
|
pybind
|
Fix GEMM+AR nvbugs 5219533,5127801,5072306 (#5969)
|
2025-07-11 10:22:02 -07:00 |
|
runtime
|
Fix GEMM+AR nvbugs 5219533,5127801,5072306 (#5969)
|
2025-07-11 10:22:02 -07:00 |
|
testing
|
refactor: Move ModelSpec to core library (#3980)
|
2025-05-04 01:39:09 +08:00 |