TensorRT-LLMs/cpp/tensorrt_llm/common
Kaiyu Xie 4bb65f216f
Update TensorRT-LLM (#1274)
* Update TensorRT-LLM

---------

Co-authored-by: meghagarwal <16129366+megha95@users.noreply.github.com>
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
2024-03-12 18:15:52 +08:00
..
allocator.h Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
assert.cpp Update TensorRT-LLM (#1055) 2024-02-06 18:38:07 +08:00
assert.h Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
CMakeLists.txt Initial commit 2023-09-20 00:29:41 -07:00
cublasMMWrapper.cpp Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
cublasMMWrapper.h Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
cublasVersionCheck.h Initial commit 2023-09-20 00:29:41 -07:00
cudaAllocator.cpp Update TensorRT-LLM (#787) 2024-01-02 17:54:32 +08:00
cudaAllocator.h Initial commit 2023-09-20 00:29:41 -07:00
cudaBf16Fallbacks.cuh Update TensorRT-LLM (20240116) (#891) 2024-01-16 20:03:11 +08:00
cudaBf16Wrapper.h Initial commit 2023-09-20 00:29:41 -07:00
cudaDriverWrapper.cpp Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
cudaDriverWrapper.h Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
cudaFp8Utils.cu Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
cudaFp8Utils.h Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
cudaTypeUtils.cuh Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
cudaUtils.h Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
customAllReduceUtils.h Update TensorRT-LLM (20240116) (#891) 2024-01-16 20:03:11 +08:00
dataType.h Update TensorRT-LLM (#524) 2023-12-01 22:27:51 +08:00
envUtils.cpp Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
envUtils.h Update TensorRT-LLM (20240116) (#891) 2024-01-16 20:03:11 +08:00
logger.cpp Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
logger.h Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
memoryUtils.cu Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
memoryUtils.h Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
mpiUtils.cpp Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
nvtxUtils.h Update TensorRT-LLM (#1098) 2024-02-18 15:48:08 +08:00
quantization.h Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
reduceKernelUtils.cuh Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
stlUtils.h Kaiyu/update main (#5) 2023-10-18 22:38:53 +08:00
stringUtils.cpp Update TensorRT-LLM (#787) 2024-01-02 17:54:32 +08:00
stringUtils.h Update TensorRT-LLM (#524) 2023-12-01 22:27:51 +08:00
tensor.cpp Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
tensor.h Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
tensorConversion.h Update TensorRT-LLM (#787) 2024-01-02 17:54:32 +08:00
tllmException.cpp Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
tllmException.h Update TensorRT-LLM (#787) 2024-01-02 17:54:32 +08:00
workspace.h Update TensorRT-LLM (#667) 2023-12-15 22:14:51 +08:00