kanshan/TensorRT-LLMs
Mirror of https://github.com/NVIDIA/TensorRT-LLM.git (synced 2026-01-14 06:27:45 +08:00)
TensorRT-LLMs/tests/integration/test_lists
Jinyang Yuan  dafc28fb85  fix: Fix FMHA-based MLA in the generation phase and add MLA unit test (#3863)  2025-04-29 09:09:43 +08:00
dev         Update (#2978)  2025-03-23 16:39:35 +08:00
qa          waive failed case in perf test, change default max_batch_size to 512 and write config.json to output log (#3657)  2025-04-22 14:51:45 +08:00
test-db     fix: Fix FMHA-based MLA in the generation phase and add MLA unit test (#3863)  2025-04-29 09:09:43 +08:00
waives.txt  test: [CI] Add failed cases into waives.txt (#3867)  2025-04-28 14:32:48 +08:00