Mirror of https://github.com/NVIDIA/TensorRT-LLM.git (synced 2026-01-23 12:12:39 +08:00)
* One of the tactics is not supported during dispatch.
* final_hidden_states should be unpacked if it is not min_latency_mode.

Signed-off-by: Yukun He <23156053+hyukn@users.noreply.github.com>

| Name |
|---|
| allreduce_gemm |
| fp8_blockscale_gemm |
| fp8_rowwise_gemm |
| fpA_intB_gemm |
| fused_gated_gemm |
| int8_gemm |
| python |
| CMakeLists.txt |
| cutlass_heuristic.cpp |
| cutlass_heuristic.h |
| cutlass_preprocessors.cpp |
| cutlass_preprocessors.h |
| cutlass_type_conversion.h |