| .. |
|
banWordsLayer.cpp
|
refactor: Remove enforced sorted order of batch slots (#3502)
|
2025-07-14 17:23:02 +02:00 |
|
banWordsLayer.h
|
Update TensorRT-LLM (#2184)
|
2024-09-03 12:14:23 +02:00 |
|
baseLayer.h
|
Update TensorRT-LLM (#2184)
|
2024-09-03 12:14:23 +02:00 |
|
beamSearchLayer.cu
|
refactor: Remove enforced sorted order of batch slots (#3502)
|
2025-07-14 17:23:02 +02:00 |
|
beamSearchLayer.h
|
Feat: Variable-Beam-Width-Search (VBWS) part4 (#3979)
|
2025-05-12 22:32:29 +02:00 |
|
CMakeLists.txt
|
Update TensorRT-LLM (#2849)
|
2025-03-04 18:44:00 +08:00 |
|
decodingLayer.cpp
|
Feat: Variable-Beam-Width-Search (VBWS) part4 (#3979)
|
2025-05-12 22:32:29 +02:00 |
|
decodingLayer.h
|
open source 4dbf696ae9b74a26829d120b67ab8443d70c8e58 (#2297)
|
2024-10-08 12:19:19 +02:00 |
|
decodingParams.h
|
[None][feat] Support ignored prompt length for penalties via new sampling config parameter (#8127)
|
2025-10-27 13:12:31 -04:00 |
|
dynamicDecodeLayer.cpp
|
Feat: Variable-Beam-Width-Search (VBWS) part4 (#3979)
|
2025-05-12 22:32:29 +02:00 |
|
dynamicDecodeLayer.h
|
Update TensorRT-LLM (#2582)
|
2024-12-16 21:50:47 -08:00 |
|
eagleDecodingLayer.cpp
|
refactor: Remove enforced sorted order of batch slots (#3502)
|
2025-07-14 17:23:02 +02:00 |
|
eagleDecodingLayer.h
|
Update TensorRT-LLM (#2502)
|
2024-11-26 16:51:34 +08:00 |
|
explicitDraftTokensLayer.cpp
|
refactor: Remove enforced sorted order of batch slots (#3502)
|
2025-07-14 17:23:02 +02:00 |
|
explicitDraftTokensLayer.h
|
Update TensorRT-LLM (#2849)
|
2025-03-04 18:44:00 +08:00 |
|
externalDraftTokensLayer.cpp
|
refactor: Remove enforced sorted order of batch slots (#3502)
|
2025-07-14 17:23:02 +02:00 |
|
externalDraftTokensLayer.h
|
Update TensorRT-LLM (#2755)
|
2025-02-11 03:01:00 +00:00 |
|
layersFactory.h
|
Update TensorRT-LLM (#2436)
|
2024-11-12 15:27:49 +08:00 |
|
layerUtils.h
|
v1.2 (#3082)
|
2025-03-26 23:31:29 +08:00 |
|
lookaheadAlgorithm.cpp
|
Update TensorRT-LLM (#2849)
|
2025-03-04 18:44:00 +08:00 |
|
lookaheadAlgorithm.h
|
Update TensorRT-LLM (#2849)
|
2025-03-04 18:44:00 +08:00 |
|
lookaheadDecodingLayer.cpp
|
refactor: Remove enforced sorted order of batch slots (#3502)
|
2025-07-14 17:23:02 +02:00 |
|
lookaheadDecodingLayer.h
|
Update TensorRT-LLM (#2333)
|
2024-10-15 15:28:40 +08:00 |
|
lookaheadDecodingUtils.h
|
Update TensorRT-LLM (#2253)
|
2024-09-24 17:27:31 +02:00 |
|
lookaheadPoolManager.cpp
|
Update TensorRT-LLM (#2849)
|
2025-03-04 18:44:00 +08:00 |
|
lookaheadPoolManager.h
|
Update TensorRT-LLM (#1763)
|
2024-06-11 16:59:02 +08:00 |
|
medusaDecodingLayer.cpp
|
refactor: Remove enforced sorted order of batch slots (#3502)
|
2025-07-14 17:23:02 +02:00 |
|
medusaDecodingLayer.h
|
Update TensorRT-LLM (#2184)
|
2024-09-03 12:14:23 +02:00 |
|
penaltyLayer.cpp
|
[None][feat] Support ignored prompt length for penalties via new sampling config parameter (#8127)
|
2025-10-27 13:12:31 -04:00 |
|
penaltyLayer.h
|
[None][feat] Support ignored prompt length for penalties via new sampling config parameter (#8127)
|
2025-10-27 13:12:31 -04:00 |
|
samplingLayer.cpp
|
refactor: Remove enforced sorted order of batch slots (#3502)
|
2025-07-14 17:23:02 +02:00 |
|
samplingLayer.h
|
Update TensorRT-LLM (#2755)
|
2025-02-11 03:01:00 +00:00 |
|
stopCriteriaLayer.cpp
|
Feat: Variable-Beam-Width-Search (VBWS) part4 (#3979)
|
2025-05-12 22:32:29 +02:00 |
|
stopCriteriaLayer.h
|
Update TensorRT-LLM (#2184)
|
2024-09-03 12:14:23 +02:00 |
|
topKSamplingLayer.cpp
|
refactor: Remove enforced sorted order of batch slots (#3502)
|
2025-07-14 17:23:02 +02:00 |
|
topKSamplingLayer.h
|
Update TensorRT-LLM (#2413)
|
2024-11-05 16:27:06 +08:00 |
|
topPSamplingLayer.cpp
|
refactor: Remove enforced sorted order of batch slots (#3502)
|
2025-07-14 17:23:02 +02:00 |
|
topPSamplingLayer.h
|
Update TensorRT-LLM (#2460)
|
2024-11-19 18:30:34 +08:00 |