|
batch_manager
|
refactor: decoder state setup (#5093)
|
2025-06-30 11:09:43 +02:00 |
|
common
|
feat: chunked prefill for MLA (Blackwell) (#4651)
|
2025-06-26 09:01:00 +08:00 |
|
executor
|
[TRTLLM-5000][feat] NGrams V2 (#4569)
|
2025-06-27 23:00:17 +08:00 |
|
executor_worker
|
Update TensorRT-LLM (#2792)
|
2025-02-18 21:27:39 +08:00 |
|
kernels
|
feat: W4A16 GEMM (#4232)
|
2025-07-01 10:36:05 +03:00 |
|
pybind
|
refactor: decoder state setup (#5093)
|
2025-06-30 11:09:43 +02:00 |
|
runtime
|
refactor: decoder state setup (#5093)
|
2025-06-30 11:09:43 +02:00 |
|
testing
|
refactor: Move ModelSpec to core library (#3980)
|
2025-05-04 01:39:09 +08:00 |
|
thop
|
feat: W4A16 GEMM (#4232)
|
2025-07-01 10:36:05 +03:00 |