mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced 2026-02-18 08:45:05 +08:00
- Fix c_sf shape calculation: use pad_up(m, 128) // 128 for non-128-aligned m - Change c_sf dtype to uint8 to match fp4_utils.py SF_DTYPE - Add scale factor shape and value validation in unit test - Fix test to handle padded scale factors correctly Signed-off-by: Zongfei Jing <20381269+zongfeijing@users.noreply.github.com> |
||
|---|---|---|
| .. | ||
| __init__.py | ||
| cpp_custom_ops.py | ||
| cute_dsl_custom_ops.py | ||
| flashinfer_custom_ops.py | ||
| torch_custom_ops.py | ||
| trtllm_gen_custom_ops.py | ||
| userbuffers_custom_ops.py | ||