This website requires JavaScript.
Explore
Help
Sign In
kanshan
/
TensorRT-LLMs
Watch
1
Star
0
Fork
0
You've already forked TensorRT-LLMs
mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced
2026-02-05 02:31:33 +08:00
Code
Issues
Actions
1
Packages
Projects
Releases
Wiki
Activity
92ae490410
TensorRT-LLMs
/
cpp
/
tensorrt_llm
/
runtime
History
Balaram Buddharaju
a792c23dcf
[TRTLLM-9465][fix] Swap TP-CP grouping order (
#10350
)
...
Signed-off-by: Balaram Buddharaju <169953907+brb-nv@users.noreply.github.com>
2026-01-05 20:08:03 +08:00
..
moeLoadBalancer
[None][feat] add flag for EPLB to force using GDRCopy (
#8650
)
2025-10-29 13:33:26 +08:00
utils
[TRTLLM-7349][feat] Adding new orchestrator type -- ray (
#7520
)
2025-10-04 08:12:24 +08:00
bufferManager.cpp
bufferView.h
[None] [refactor] Minor cleanup and improvements (
#7619
)
2025-10-03 11:40:06 +02:00
CMakeLists.txt
[TRTLLM-7349][feat] Adding new orchestrator type -- ray (
#7520
)
2025-10-04 08:12:24 +08:00
cudaMemPool.cpp
cudaMemPool.h
decoderState.cpp
[None][refactor] Simplify decoder state initialization for speculative decoding (
#6869
)
2025-08-22 18:44:17 +02:00
decodingLayerWorkspace.cpp
decodingLayerWorkspace.h
decodingOutput.cpp
eagleBuffers.cpp
explicitDraftTokensBuffers.cpp
explicitDraftTokensModule.h
gptDecoder.cpp
[None][feat] Support ignored prompt length for penalties via new sampling config parameter (
#8127
)
2025-10-27 13:12:31 -04:00
gptDecoderBatched.cpp
[None][fix] Introduce inline namespace to avoid symbol collision (
#9541
)
2025-12-12 23:32:15 +08:00
gptJsonConfig.cpp
iBuffer.cpp
[TRTLLM-4629] [feat] Add support of CUDA13 and sm103 devices (
#7568
)
2025-09-16 09:56:18 +08:00
ipcNvlsMemory.cu
ipcSocket.cpp
ipcSocket.h
ipcUtils.cpp
iTensor.cpp
jsonSerialization.h
layerProfiler.cpp
layerProfiler.h
lookaheadBuffers.cpp
loraCache.cpp
loraManager.cpp
loraManager.h
loraModule.cpp
loraUtils.cpp
loraUtils.h
mcastDeviceMemory.cpp
[
https://nvbugs/5489015
][fix] Support communicator split in MNNVL allreduce and fix the binding issues. (
#7387
)
2025-09-17 07:43:20 +08:00
mcastDeviceMemory.h
[
https://nvbugs/5489015
][fix] Support communicator split in MNNVL allreduce and fix the binding issues. (
#7387
)
2025-09-17 07:43:20 +08:00
mcastGPUBuffer.h
[
https://nvbugs/5489015
][fix] Support communicator split in MNNVL allreduce and fix the binding issues. (
#7387
)
2025-09-17 07:43:20 +08:00
memoryCounters.cpp
ncclCommunicator.cpp
ncclCommunicator.h
promptTuningParams.cpp
runtimeKernels.cu
runtimeKernels.h
tensorView.h
tllmBuffers.cpp
tllmBuffers.h
tllmLogger.cpp
tllmRuntime.cpp
tllmRuntime.h
[None][chroe] Rename TensorRT-LLM to TensorRT LLM for source code. (
#7851
)
2025-09-25 21:02:35 +08:00
tllmStreamReaders.cpp
tllmStreamReaders.h
torch.h
[None][feat] KV Cache Connector API (
#7228
)
2025-08-28 23:09:27 -04:00
torchUtils.h
torchView.h
virtualMemory.cpp
[None][fix] Correct virtual memory allocation alignment (
#9491
)
2025-12-01 10:59:19 +08:00
workerPool.cpp
workerPool.h
worldConfig.cpp
[TRTLLM-9465][fix] Swap TP-CP grouping order (
#10350
)
2026-01-05 20:08:03 +08:00