TensorRT-LLMs/cpp/include/tensorrt_llm/common
Tracin 6c91f1c7ac
Mxfp8xmxfp4 quant mode(#4978)
Signed-off-by: Tracin <10434017+Tracin@users.noreply.github.com>
Co-authored-by: QI JUN <22017000+QiJune@users.noreply.github.com>
2025-06-10 22:01:37 +08:00
File                 Last commit                                                        Date
algorithm.h          open source 4dbf696ae9b74a26829d120b67ab8443d70c8e58 (#2297)       2024-10-08 12:19:19 +02:00
arrayView.h          Update TensorRT-LLM (#1274)                                        2024-03-12 18:15:52 +08:00
assert.h             feat: NIXL interface integration (#3934)                           2025-05-19 18:18:22 +08:00
cudaBf16Wrapper.h    Update TensorRT-LLM (#1725)                                        2024-06-04 20:26:32 +08:00
cudaFp8Utils.h       Update TensorRT-LLM (#2783)                                        2025-02-13 18:40:22 +08:00
cudaProfilerUtils.h  Update TensorRT-LLM (#1954)                                        2024-07-16 15:30:25 +08:00
cudaUtils.h          Feat: Variable-Beam-Width-Search (VBWS) part4 (#3979)              2025-05-12 22:32:29 +02:00
dataType.h           Update TensorRT-LLM (#2755)                                        2025-02-11 03:01:00 +00:00
logger.h             Update TensorRT-LLM (#2156)                                        2024-08-27 18:20:59 +08:00
optionalRef.h        Update TensorRT-LLM (#2436)                                        2024-11-12 15:27:49 +08:00
quantization.h       Mxfp8xmxfp4 quant mode(#4978)                                      2025-06-10 22:01:37 +08:00
stringUtils.h        chore: improve log-level setting UX (#4352)                        2025-05-16 09:47:44 +01:00
tllmException.h      chore: Stabilize ABI boundary for internal kernel library (#3117)  2025-04-11 15:07:50 +08:00
utils.h              Update TensorRT-LLM (#2755)                                        2025-02-11 03:01:00 +00:00