Mirror of https://github.com/NVIDIA/TensorRT-LLM.git (synced 2026-01-14 06:27:45 +08:00)
* increase A30 for cpp test
* enable parallel run test for gpt_executor
* clean
* decrease freeGpuMemoryFraction of cpp tests
* fix

Signed-off-by: junq <22017000+QiJune@users.noreply.github.com>

| Name |
|---|
| blockKeyTest.cpp |
| cacheTransceiverTest.cpp |
| CMakeLists.txt |
| guidedDecoderTest.cpp |
| modelSpec.cpp |
| modelSpec.h |
| modelSpecBinding.cpp |
| peftCacheManagerTest.cpp |
| trtEncoderModelTest.cpp |
| trtGptModelRealDecoderTest.cpp |
| trtGptModelTest.cpp |