TensorRT-LLMs/tensorrt_llm/inputs
Pamela Peng b818a912d7
[https://nvbugs/5540752][fix] Support quantized Phi4 MM models (#8190)
Signed-off-by: Pamela <179191831+pamelap-nvidia@users.noreply.github.com>
2025-10-20 06:36:09 -04:00
..
__init__.py [TRTLLM-6780][fix] Add multimodal data to dummy requests during memory profiling (#7539) 2025-10-16 17:49:22 +02:00
data.py [TRTLLM-3925, https://nvbugs/5245262] [fix] Normalize LLM.generate API (#3985) 2025-05-07 11:06:23 +08:00
multimodal.py [TRTLLM-7385][feat] Optimize Qwen2/2.5-VL performance (#7250) 2025-09-22 03:40:02 -07:00
registry.py [TRTLLM-6780][fix] Add multimodal data to dummy requests during memory profiling (#7539) 2025-10-16 17:49:22 +02:00
utils.py [https://nvbugs/5540752][fix] Support quantized Phi4 MM models (#8190) 2025-10-20 06:36:09 -04:00