This website requires JavaScript.
Explore
Help
Sign In
kanshan
/
TensorRT-LLMs
Watch
1
Star
0
Fork
0
You've already forked TensorRT-LLMs
mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced
2026-01-14 06:27:45 +08:00
Code
Issues
Actions
1
Packages
Projects
Releases
Wiki
Activity
3e71ec7a76
TensorRT-LLMs
/
cpp
/
tensorrt_llm
/
runtime
History
Xiwen Yu
38ef850552
Merge remote-tracking branch 'gitlab/main' into user/xiweny/merge_0901
...
Signed-off-by: Xiwen Yu <13230610+VALLIS-NERIA@users.noreply.github.com>
2025-09-01 11:46:44 +08:00
..
moeLoadBalancer
utils
bufferManager.cpp
bufferView.h
CMakeLists.txt
cudaMemPool.cpp
cudaMemPool.h
decoderState.cpp
decodingLayerWorkspace.cpp
decodingLayerWorkspace.h
decodingOutput.cpp
eagleBuffers.cpp
explicitDraftTokensBuffers.cpp
explicitDraftTokensModule.h
gptDecoder.cpp
gptDecoderBatched.cpp
gptJsonConfig.cpp
iBuffer.cpp
ipcNvlsMemory.cu
ipcSocket.cpp
ipcSocket.h
ipcUtils.cpp
iTensor.cpp
jsonSerialization.h
layerProfiler.cpp
layerProfiler.h
lookaheadBuffers.cpp
loraCache.cpp
loraManager.cpp
loraManager.h
loraModule.cpp
loraUtils.cpp
loraUtils.h
mcastDeviceMemory.cpp
mcastDeviceMemory.h
mcastGPUBuffer.h
memoryCounters.cpp
ncclCommunicator.cpp
ncclCommunicator.h
promptTuningParams.cpp
runtimeKernels.cu
runtimeKernels.h
tensorView.h
tllmBuffers.cpp
tllmBuffers.h
tllmLogger.cpp
tllmRuntime.cpp
tllmRuntime.h
tllmStreamReaders.cpp
tllmStreamReaders.h
torch.h
torchUtils.h
torchView.h
virtualMemory.cpp
workerPool.cpp
workerPool.h
worldConfig.cpp