TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-31 16:21:07 +08:00

History

HuiGao-NV c51e90d7d7 fix: don't perform memory estimation for start_attention (#3485 ) * fix: don't perform memory estimation for start_attention * Enable tests of unittest/_torch/multi_gpu Signed-off-by: Hui Gao <huig@nvidia.com>		2025-04-12 11:34:46 +08:00
..
dev	Update (#2978 )	2025-03-23 16:39:35 +08:00
qa	test: add torch flow test case in qa test list (#3404 )	2025-04-11 16:57:41 +08:00
test-db	test: Add DeepSeek-V3-Lite PP=4 cases (#3454 )	2025-04-12 00:09:12 +08:00
waives.txt	fix: don't perform memory estimation for start_attention (#3485 )	2025-04-12 11:34:46 +08:00