TensorRT-LLMs/tests/unittest/api_stability/references
Shunkangz 67a3fd858b
[None][feat] Add support of scheduling attention dp request (#6246)
Signed-off-by: Shunkang <182541032+Shunkangz@users.noreply.github.co>
Signed-off-by: Patrice Castonguay <55748270+pcastonguay@users.noreply.github.com>
Co-authored-by: Shunkang <182541032+Shunkangz@users.noreply.github.co>
Co-authored-by: Patrice Castonguay <55748270+pcastonguay@users.noreply.github.com>
2025-08-01 20:38:01 -04:00
| File | Last commit | Date |
| --- | --- | --- |
| batched_logits_processor.yaml | test: [TRTLLM-4334] Create 1.0 criteria scope from API stability references (#3069) | 2025-03-26 18:14:35 +08:00 |
| calib_config.yaml | test: [TRTLLM-4334] Create 1.0 criteria scope from API stability references (#3069) | 2025-03-26 18:14:35 +08:00 |
| completion_output.yaml | [TRTLLM-6104] feat: add request_perf_metrics to LLMAPI (#5497) | 2025-06-27 17:03:05 +02:00 |
| guided_decoding_params.yaml | feat: Support the Structural Tag in guided decoding (#4066) | 2025-05-12 17:24:50 +08:00 |
| llm.yaml | [None][feat] Add support of scheduling attention dp request (#6246) | 2025-08-01 20:38:01 -04:00 |
| logits_processor.yaml | feat: LogitsProcessor in PyTorch backend (#3145) | 2025-05-01 14:15:30 -07:00 |
| quant_config.yaml | feat: [Deepseek] Add trtllm-gen MOE FP4 MOE backend (#3387) | 2025-04-21 10:01:33 +08:00 |
| request_output.yaml | feat: Support Top-K logprobs and prompt_logprobs in LLMAPI (#3388) | 2025-05-01 12:47:14 -04:00 |
| sampling_params.yaml | chore: Cleanup deprecated APIs from LLM-API (part 1/2) (#3732) | 2025-05-07 13:20:25 +08:00 |