TensorRT-LLMs/tensorrt_llm/executor
bhsueh_NV f77252e9ff
fix bug of create cuda stream as default parameter which will be init… (#3764)
* fix bug where a CUDA stream was created as a default parameter value, so it was initialized at import time

Signed-off-by: bhsueh <11360707+byshiue@users.noreply.github.com>

* add torch.cuda.Stream() for the leader node

* fix pre-commit issue

2025-04-28 08:16:03 +08:00
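
The bug named in the commit message above matches a general Python pitfall: a default parameter value is evaluated once, when the `def` statement runs (normally at module import), not on each call. The sketch below illustrates the pattern only; it is not the code from proxy.py. `FakeStream`, `launch_buggy`, and `launch_fixed` are hypothetical names, and `FakeStream` stands in for `torch.cuda.Stream` so the example runs without a GPU.

```python
class FakeStream:
    """Hypothetical stand-in for torch.cuda.Stream; counts constructions."""
    created = 0

    def __init__(self):
        FakeStream.created += 1


# Problem pattern: the default is built when this `def` statement executes,
# i.e. as a side effect of importing the module, and is then shared by every call.
def launch_buggy(stream=FakeStream()):
    return stream


created_at_definition = FakeStream.created  # a stream exists before any call


# Safer pattern: use None as a sentinel and construct the stream inside the call.
def launch_fixed(stream=None):
    if stream is None:
        stream = FakeStream()
    return stream


created_after_fixed_def = FakeStream.created  # defining launch_fixed built nothing

first = launch_fixed()
second = launch_fixed()
```

With a real CUDA stream, the import-time construction would presumably touch the CUDA runtime as soon as the module is imported, which is why deferring creation to call time is the usual remedy.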
__init__.py Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
executor.py Add running E2E LoRA flow (#3648) 2025-04-23 11:19:41 +08:00
ipc.py fix: Use hmac authentication for pickle encryption (#3384) 2025-04-17 00:40:13 +08:00
postproc_worker.py feat: return logits in PyTorch flow (#3221) 2025-04-24 16:56:03 -07:00
proxy.py fix bug of create cuda stream as default parameter which will be init… (#3764) 2025-04-28 08:16:03 +08:00
request.py Add running E2E LoRA flow (#3648) 2025-04-23 11:19:41 +08:00
result.py feat: return logits in PyTorch flow (#3221) 2025-04-24 16:56:03 -07:00
utils.py feat: return logits in PyTorch flow (#3221) 2025-04-24 16:56:03 -07:00
worker.py feat: return logits in PyTorch flow (#3221) 2025-04-24 16:56:03 -07:00