mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced 2026-02-17 16:25:05 +08:00
* add mixtral7x8b fp8 test with fixed cutlass fp8 moe gemm Signed-off-by: Faraz Khoubsirat <58580514+farazkh80@users.noreply.github.com> * update cutlass versions Signed-off-by: Faraz Khoubsirat <58580514+farazkh80@users.noreply.github.com> * added internal cutlass with fix and docker update Signed-off-by: Faraz Khoubsirat <58580514+farazkh80@users.noreply.github.com> * added mixtral to pro 6000 Signed-off-by: Faraz Khoubsirat <58580514+farazkh80@users.noreply.github.com> --------- Signed-off-by: Faraz Khoubsirat <58580514+farazkh80@users.noreply.github.com> |
||
|---|---|---|
| .. | ||
| .gitignore | ||
| examples_test_list.txt | ||
| llm_multinodes_function_test.txt | ||
| llm_release_gb20x.txt | ||
| llm_release_perf_multinode_test.txt | ||
| llm_release_rtx_pro_6000.txt | ||
| llm_sanity_test.txt | ||
| trt_llm_integration_perf_sanity_test.yml | ||
| trt_llm_integration_perf_test.yml | ||
| trt_llm_release_perf_cluster_test.yml | ||
| trt_llm_release_perf_sanity_test.yml | ||
| trt_llm_release_perf_test.yml | ||