TensorRT-LLMs/curl_responses_client.sh at ba1cb6831de605011596bb45daef1df6b46e5cbe - TensorRT-LLMs - Gitea: Git with a cup of tea

kanshan/TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-14 06:27:45 +08:00

JunyiXu-nv af899d2fe7

[TRTLLM-9860][doc] Add docs and examples for Responses API (#9946 )

Signed-off-by: Junyi Xu <219237550+JunyiXu-nv@users.noreply.github.com>

2025-12-14 21:46:13 -08:00

10 lines

239 B

Bash

Raw Blame History

 #! /usr/bin/env bash
 curl http://localhost:8000/v1/responses \
     -H "Content-Type: application/json" \
     -d '{
         "model": "TinyLlama-1.1B-Chat-v1.0",
         "input": "Where is New York?",
         "max_output_tokens": 16
     }'