TensorRT-LLMs/examples/models
jmydurant 8836990bde
[TRTLLM-3602][feat] support nvfp4 model and fp8 kv cache for MLA chunked prefill (Blackwell) (#5475)
Signed-off-by: Mingyang Jiang <13463932+jmydurant@users.noreply.github.com>
2025-06-26 22:18:08 +08:00
contrib    feat: add HyperCLOVAX-SEED-Vision support in refactored way (#4799)    2025-06-09 11:04:04 +08:00
core       [TRTLLM-3602][feat] support nvfp4 model and fp8 kv cache for MLA chunked prefill (Blackwell) (#5475)    2025-06-26 22:18:08 +08:00