TensorRT-LLMs/tests/integration/test_lists
William Zhang 478b6b20a1
[#9230][refactor] Replace nemotron patches with custom model implementation (#9751)
[#9230][refactor] Replace nemotron patches with custom model implementation

* Why?

Patching for nemotron H models was growing out of hand, and made certain
optimizations more complex than they needed to be.

* What?

This commit finally gets rid of them, and replaces them with the custom
model implementation in `modeling_nemotron_h.py`.

Closes #9230
Closes NvBug 5747867

Signed-off-by: William Zhang <133824995+2ez4bz@users.noreply.github.com>
2025-12-18 19:36:27 -08:00
..
dev Update (#2978) 2025-03-23 16:39:35 +08:00
qa [TRTC-71][feat] Add regression testing for config database (#9832) 2025-12-18 16:15:38 -08:00
test-db [None][feat] Support Mooncake transfer engine as a cache transceiver backend (#8309) 2025-12-19 10:09:51 +08:00
waives.txt [#9230][refactor] Replace nemotron patches with custom model implementation (#9751) 2025-12-18 19:36:27 -08:00