TensorRT-LLMs/cpp/tensorrt_llm/layers
2025-10-27 13:12:31 -04:00
banWordsLayer.cpp refactor: Remove enforced sorted order of batch slots (#3502) 2025-07-14 17:23:02 +02:00
banWordsLayer.h Update TensorRT-LLM (#2184) 2024-09-03 12:14:23 +02:00
baseLayer.h Update TensorRT-LLM (#2184) 2024-09-03 12:14:23 +02:00
beamSearchLayer.cu refactor: Remove enforced sorted order of batch slots (#3502) 2025-07-14 17:23:02 +02:00
beamSearchLayer.h Feat: Variable-Beam-Width-Search (VBWS) part4 (#3979) 2025-05-12 22:32:29 +02:00
CMakeLists.txt Update TensorRT-LLM (#2849) 2025-03-04 18:44:00 +08:00
decodingLayer.cpp Feat: Variable-Beam-Width-Search (VBWS) part4 (#3979) 2025-05-12 22:32:29 +02:00
decodingLayer.h open source 4dbf696ae9b74a26829d120b67ab8443d70c8e58 (#2297) 2024-10-08 12:19:19 +02:00
decodingParams.h [None][feat] Support ignored prompt length for penalties via new sampling config parameter (#8127) 2025-10-27 13:12:31 -04:00
dynamicDecodeLayer.cpp Feat: Variable-Beam-Width-Search (VBWS) part4 (#3979) 2025-05-12 22:32:29 +02:00
dynamicDecodeLayer.h Update TensorRT-LLM (#2582) 2024-12-16 21:50:47 -08:00
eagleDecodingLayer.cpp refactor: Remove enforced sorted order of batch slots (#3502) 2025-07-14 17:23:02 +02:00
eagleDecodingLayer.h Update TensorRT-LLM (#2502) 2024-11-26 16:51:34 +08:00
explicitDraftTokensLayer.cpp refactor: Remove enforced sorted order of batch slots (#3502) 2025-07-14 17:23:02 +02:00
explicitDraftTokensLayer.h Update TensorRT-LLM (#2849) 2025-03-04 18:44:00 +08:00
externalDraftTokensLayer.cpp refactor: Remove enforced sorted order of batch slots (#3502) 2025-07-14 17:23:02 +02:00
externalDraftTokensLayer.h Update TensorRT-LLM (#2755) 2025-02-11 03:01:00 +00:00
layersFactory.h Update TensorRT-LLM (#2436) 2024-11-12 15:27:49 +08:00
layerUtils.h v1.2 (#3082) 2025-03-26 23:31:29 +08:00
lookaheadAlgorithm.cpp Update TensorRT-LLM (#2849) 2025-03-04 18:44:00 +08:00
lookaheadAlgorithm.h Update TensorRT-LLM (#2849) 2025-03-04 18:44:00 +08:00
lookaheadDecodingLayer.cpp refactor: Remove enforced sorted order of batch slots (#3502) 2025-07-14 17:23:02 +02:00
lookaheadDecodingLayer.h Update TensorRT-LLM (#2333) 2024-10-15 15:28:40 +08:00
lookaheadDecodingUtils.h Update TensorRT-LLM (#2253) 2024-09-24 17:27:31 +02:00
lookaheadPoolManager.cpp Update TensorRT-LLM (#2849) 2025-03-04 18:44:00 +08:00
lookaheadPoolManager.h Update TensorRT-LLM (#1763) 2024-06-11 16:59:02 +08:00
medusaDecodingLayer.cpp refactor: Remove enforced sorted order of batch slots (#3502) 2025-07-14 17:23:02 +02:00
medusaDecodingLayer.h Update TensorRT-LLM (#2184) 2024-09-03 12:14:23 +02:00
penaltyLayer.cpp [None][feat] Support ignored prompt length for penalties via new sampling config parameter (#8127) 2025-10-27 13:12:31 -04:00
penaltyLayer.h [None][feat] Support ignored prompt length for penalties via new sampling config parameter (#8127) 2025-10-27 13:12:31 -04:00
samplingLayer.cpp refactor: Remove enforced sorted order of batch slots (#3502) 2025-07-14 17:23:02 +02:00
samplingLayer.h Update TensorRT-LLM (#2755) 2025-02-11 03:01:00 +00:00
stopCriteriaLayer.cpp Feat: Variable-Beam-Width-Search (VBWS) part4 (#3979) 2025-05-12 22:32:29 +02:00
stopCriteriaLayer.h Update TensorRT-LLM (#2184) 2024-09-03 12:14:23 +02:00
topKSamplingLayer.cpp refactor: Remove enforced sorted order of batch slots (#3502) 2025-07-14 17:23:02 +02:00
topKSamplingLayer.h Update TensorRT-LLM (#2413) 2024-11-05 16:27:06 +08:00
topPSamplingLayer.cpp refactor: Remove enforced sorted order of batch slots (#3502) 2025-07-14 17:23:02 +02:00
topPSamplingLayer.h Update TensorRT-LLM (#2460) 2024-11-19 18:30:34 +08:00