TensorRT-LLMs/tests/unittest/api_stability
Simeng Liu 8cf3faa26a
[feat] Auto-enable ngram with concurrency <= 32. (#6232)
Signed-off-by: Simeng Liu <simengl@nvidia.com>
Signed-off-by: Mike Iovine <miovine@nvidia.com>
Signed-off-by: Mike Iovine <mike.iovine7@gmail.com>
Co-authored-by: Mike Iovine <miovine@nvidia.com>
Co-authored-by: Mike Iovine <mike.iovine7@gmail.com>
2025-07-31 18:45:51 -04:00
references chore: remove unused kv_cache_dtype in api reference (#6444) 2025-07-29 14:57:20 -04:00
references_committed [feat] Auto-enable ngram with concurrency <= 32. (#6232) 2025-07-31 18:45:51 -04:00
api_stability_core.py [TRTLLM-5061] chore: add status tags to LLM API reference (#5707) 2025-07-28 15:57:07 +08:00
test_llm_api.py [TRTLLM-5208][BREAKING CHANGE] chore: make pytorch LLM the default (#5312) 2025-06-20 03:01:10 +08:00