mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced 2026-01-14 06:27:45 +08:00
feat(scaffolding): make sampling_params only setable by controller Signed-off-by: Zhenhuan Chen <chenzhh3671@gmail.com> |
||
|---|---|---|
| .. | ||
| README.md | ||
| stream_generation_controller.py | ||
| stream_generation_run.py | ||
This example shows how to use the StreamGenerationTask and stream_generation_handler to enable efficient streaming-based generation workflows.
How to run the example?
python stream_generation_run.py
See more detail on tensorrt_llm/scaffolding/contrib/AsyncGeneration/README.md.