TensorRT-LLMs/tests/unittest/api_stability/references_committed
Simeng Liu 8cf3faa26a
[feat] Auto-enable ngram with concurrency <= 32. (#6232)
Signed-off-by: Simeng Liu <simengl@nvidia.com>
Signed-off-by: Mike Iovine <miovine@nvidia.com>
Signed-off-by: Mike Iovine <mike.iovine7@gmail.com>
Co-authored-by: Mike Iovine <miovine@nvidia.com>
Co-authored-by: Mike Iovine <mike.iovine7@gmail.com>
2025-07-31 18:45:51 -04:00
..
completion_output.yaml feat: Support Top-K logprobs and prompt_logprobs in LLMAPI (#3388) 2025-05-01 12:47:14 -04:00
llm.yaml [feat] Auto-enable ngram with concurrency <= 32. (#6232) 2025-07-31 18:45:51 -04:00
request_output.yaml feat: Support Top-K logprobs and prompt_logprobs in LLMAPI (#3388) 2025-05-01 12:47:14 -04:00
sampling_params.yaml cleanup logprob params (#4039) 2025-05-07 00:50:16 +08:00