mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced 2026-01-24 12:42:54 +08:00
Add Piecewise CUDA Graph Support Signed-off-by: Yi Zhang <187001205+yizhang-nv@users.noreply.github.com> |
||
|---|---|---|
| .. | ||
| __init__.py | ||
| flashinfer.py | ||
| interface.py | ||
| star_flashinfer.py | ||
| trtllm.py | ||
| utils.py | ||
| vanilla.py | ||