TensorRT-LLMs/cpp/tensorrt_llm/pybind/executor
Yixin Dong c90ebadd84
feat: Support the Structural Tag in guided decoding (#4066)
* finish
* update
* update
* fix
* exc overlap scheduler
* add test
* fix api ref

Signed-off-by: Ubospica <ubospica@gmail.com>
Signed-off-by: Enwei Zhu <21126786+syuoni@users.noreply.github.com>
Co-authored-by: Enwei Zhu <21126786+syuoni@users.noreply.github.com>
2025-05-12 17:24:50 +08:00
File | Last commit | Date
bindings.cpp | fix: Properly get decoding mode according to same logic as cpp. (#4026) | 2025-05-06 21:53:17 +08:00
bindings.h | fix: Move all casters to customCasters. (#3945) | 2025-05-02 19:08:28 +08:00
executor.cpp | Update TensorRT-LLM (#2436) | 2024-11-12 15:27:49 +08:00
executor.h | Update TensorRT-LLM (#2562) | 2024-12-11 00:31:05 -08:00
executorConfig.cpp | [TRTLLM-3429] feat: Overlap scheduling in C++ runtime (#3625) | 2025-05-06 15:06:46 +02:00
executorConfig.h | fix: Move all casters to customCasters. (#3945) | 2025-05-02 19:08:28 +08:00
request.cpp | feat: Support the Structural Tag in guided decoding (#4066) | 2025-05-12 17:24:50 +08:00
request.h | fix: Move all casters to customCasters. (#3945) | 2025-05-02 19:08:28 +08:00