TensorRT-LLMs/docs/source/dev-on-cloud/dev-on-runpod.md
WeiHaocheng 228e453780
doc: add doc ahout developent on cloud or runpod (#3194)
Signed-off-by: fredw <20514172+WeiHaocheng@users.noreply.github.com>
2025-04-02 18:10:56 +08:00

1.7 KiB

(dev-on-runpod)=

Develop TensorRT-LLM on Runpod

Runpod is a popular cloud platform among many researchers. This doc describes how to develop TensorRT-LLM on Runpod.

Prepare

Create a Runpod account

Please refer to the Runpod Getting Started.

Configure SSH Key

Please refer to the Configure SSH Key.

Note that we can skip the step of "Start your Pod. Make sure of the following things" here as we will introduce it below.

Build the TensorRT-LLM Docker Image and Upload to DockerHub

Please refer to the Build Image to DockerHub.

Note that the docker image must enable ssh access. See on Enable ssh access to the container.

Create a Pod Template

Click "Template" bottom on the menus and click "Create Template" bottom.

Fill the docker image link of DockerHub such as docker.io/<your_dockerhub_username>/tensorrt_llm:devel on "Docker Image" field.

Fill "22" into "Expose TCP Ports" field.

Fill

sleep infinity

into 'Container Start Command' field.

Connect to the Pod

Please refer to the Connect to the Pod.

You can connect the pod with SSH or Web Terminal.

If you want to connect the pod with SSH, you can copy the command from "SSH over exposed TCP" field and run it on your host.

In some scenarios such as using a team account, your public key has not been added to the pod successfully. You can directly add this command to the 'Container Start Command' field as:

bash -c 'echo "<your_public_key>" >> ~/.ssh/authorized_keys;sleep infinity'

Enjoy your development!