TensorRT-LLMs/cpp/tensorrt_llm/pybind
Yixin Dong c90ebadd84
feat: Support the Structural Tag in guided decoding (#4066)
* finish

Signed-off-by: Ubospica <ubospica@gmail.com>

* update

Signed-off-by: Ubospica <ubospica@gmail.com>

* update

Signed-off-by: Ubospica <ubospica@gmail.com>

* fix

Signed-off-by: Enwei Zhu <21126786+syuoni@users.noreply.github.com>

* exc overlap scheduler

Signed-off-by: Enwei Zhu <21126786+syuoni@users.noreply.github.com>

* add test

Signed-off-by: Enwei Zhu <21126786+syuoni@users.noreply.github.com>

* fix api ref

Signed-off-by: Enwei Zhu <21126786+syuoni@users.noreply.github.com>

---------

Signed-off-by: Ubospica <ubospica@gmail.com>
Signed-off-by: Enwei Zhu <21126786+syuoni@users.noreply.github.com>
Co-authored-by: Enwei Zhu <21126786+syuoni@users.noreply.github.com>
2025-05-12 17:24:50 +08:00
batch_manager   [TRTLLM-3429] feat: Overlap scheduling in C++ runtime (#3625)  2025-05-06 15:06:46 +02:00
common          fix: Move all casters to customCasters. (#3945)  2025-05-02 19:08:28 +08:00
executor        feat: Support the Structural Tag in guided decoding (#4066)  2025-05-12 17:24:50 +08:00
runtime         fix: Move all casters to customCasters. (#3945)  2025-05-02 19:08:28 +08:00
testing         refactor: Move ModelSpec to core library (#3980)  2025-05-04 01:39:09 +08:00
userbuffers     fix: Move all casters to customCasters. (#3945)  2025-05-02 19:08:28 +08:00
bindings.cpp    refactor: Move ModelSpec to core library (#3980)  2025-05-04 01:39:09 +08:00
CMakeLists.txt  refactor: Move ModelSpec to core library (#3980)  2025-05-04 01:39:09 +08:00