TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-14 06:27:45 +08:00

Yukun He 28b9a81c58 [TRTLLM-4500][feat] Add serialization/deserialization options for AutoTuner profiling cache (#7738)
To make the AutoTuner profiling cache deterministic, serialization and deserialization are introduced so that the cache can be stored on disk in JSON format. Set TLLM_AUTOTUNER_CACHE_PATH to the path where the cache file should be stored.
Signed-off-by: Yukun He <23156053+hyukn@users.noreply.github.com>
2025-09-29 07:40:51 +08:00
test_autotuner.py	[TRTLLM-4500][feat] Add serialization/deserialization options for AutoTuner profiling cache (#7738)	2025-09-29 07:40:51 +08:00
test_share_tensor.py	[None][ci] move unittests to sub-directories (#6635)	2025-08-20 05:42:22 -04:00
test_virtual_memory.py	[TRTLLM-4629][feat] Add support of CUDA13 and sm103 devices (#7568)	2025-09-16 09:56:18 +08:00
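
The commit message for test_autotuner.py describes storing a profiling cache on disk as JSON at a path taken from TLLM_AUTOTUNER_CACHE_PATH. The following is a minimal sketch of that general idea using only the Python standard library. It is not the TensorRT-LLM implementation: the function names (cache_path_from_env, save_cache, load_cache), the tuple-shaped keys, and the "tactic" field are all hypothetical and chosen for illustration. Only the environment variable name comes from the commit message.

```python
import json
import os

# Environment variable name taken from the commit message above.
ENV_VAR = "TLLM_AUTOTUNER_CACHE_PATH"


def cache_path_from_env():
    """Return the configured cache path, or None if the variable is unset."""
    return os.environ.get(ENV_VAR)


def save_cache(cache, path):
    """Write a {tuple_key: value} cache to `path` as JSON.

    JSON object keys must be strings, so each entry is stored as a
    {"key": [...], "value": ...} record instead (an assumed layout).
    Entries are sorted by key so the file content is reproducible.
    """
    entries = [
        {"key": list(key), "value": value}
        for key, value in sorted(cache.items(), key=lambda kv: kv[0])
    ]
    with open(path, "w", encoding="utf-8") as f:
        json.dump(entries, f, indent=2)


def load_cache(path):
    """Read a cache written by save_cache and restore the tuple keys."""
    with open(path, encoding="utf-8") as f:
        entries = json.load(f)
    return {tuple(entry["key"]): entry["value"] for entry in entries}
```

With the variable set, a caller could run save_cache(cache, cache_path_from_env()) after profiling and load_cache(cache_path_from_env()) on the next start, so that later runs reuse the same recorded choices rather than re-profiling.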