TensorRT-LLMs/tests/unittest/api_stability/references
nv-guomingz 578430e64c
[TRTLLM-5530][BREAKING CHANGE]: enhance the llm args pytorch config part 1(cuda_graph_config) (#5014)
Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>
2025-06-30 11:05:40 +08:00
| File | Last commit | Date |
| --- | --- | --- |
| batched_logits_processor.yaml | test: [TRTLLM-4334] Create 1.0 criteria scope from API stability references (#3069) | 2025-03-26 18:14:35 +08:00 |
| calib_config.yaml | test: [TRTLLM-4334] Create 1.0 criteria scope from API stability references (#3069) | 2025-03-26 18:14:35 +08:00 |
| completion_output.yaml | [TRTLLM-6104] feat: add request_perf_metrics to LLMAPI (#5497) | 2025-06-27 17:03:05 +02:00 |
| guided_decoding_params.yaml | feat: Support the Structural Tag in guided decoding (#4066) | 2025-05-12 17:24:50 +08:00 |
| llm.yaml | [TRTLLM-5530][BREAKING CHANGE]: enhance the llm args pytorch config part 1(cuda_graph_config) (#5014) | 2025-06-30 11:05:40 +08:00 |
| logits_processor.yaml | feat: LogitsProcessor in PyTorch backend (#3145) | 2025-05-01 14:15:30 -07:00 |
| quant_config.yaml | feat: [Deepseek] Add trtllm-gen MOE FP4 MOE backend (#3387) | 2025-04-21 10:01:33 +08:00 |
| request_output.yaml | feat: Support Top-K logprobs and prompt_logprobs in LLMAPI (#3388) | 2025-05-01 12:47:14 -04:00 |
| sampling_params.yaml | chore: Cleanup deprecated APIs from LLM-API (part 1/2) (#3732) | 2025-05-07 13:20:25 +08:00 |