mirror of https://github.com/NVIDIA/TensorRT-LLM.git (synced 2026-01-13 22:18:36 +08:00)
[TRTLLM-8532][chore] clean warmup method of ModelEngine (#8264)
auto-close-inactive-issues.yml #4713:Scheduled
[None][chore] AutoDeplopy: Update expert section on yaml configuration in README (#8370)
auto-close-inactive-issues.yml #4712:Scheduled
[TLLM-6777][feature] Support SWA KV cache reuse OOW block detach (#7922)
auto-close-inactive-issues.yml #4711:Scheduled
[None][fix] AD test_trtllm_bench to use small model config and skip loading weights (#8149)
auto-close-inactive-issues.yml #4710:Scheduled
[None][chore] Waive failing pre-merge test on main (#8282)
auto-close-inactive-issues.yml #4709:Scheduled
[None][fix] Add Lock to protect mReqeustToSession (#8085)
auto-close-inactive-issues.yml #4708:Scheduled
[None][chore] Restore asserts in pytorch flow LoRA tests (#8227)
auto-close-inactive-issues.yml #4707:Scheduled
[None][chore] Waive some tests failing on main post merge (#8186)
auto-close-inactive-issues.yml #4706:Scheduled
[None][infra] Skip failed cases for main (#8176)
auto-close-inactive-issues.yml #4705:Scheduled
[TRTLLM-8413][chore] resolve sampling defaults in OpenAI API backend (#8121)
auto-close-inactive-issues.yml #4704:Scheduled
[https://nvbugs/5521949][fix] Re-enable test_bielik_11b_v2_2_instruct_multi_lora, fix its API use with pytorch flow LoRA (#8146)
auto-close-inactive-issues.yml #4703:Scheduled
[None][autodeploy] small refactors on attention matching (#8079)
auto-close-inactive-issues.yml #4702:Scheduled
[None] [refactor] Minor cleanup and improvements (#7619)
auto-close-inactive-issues.yml #4701:Scheduled
[https://nvbugs/5556020][chore] waive test_eagle3 (#8119)
auto-close-inactive-issues.yml #4700:Scheduled
[TRTLLM-8031][feat] Add chunked return_generation_logits logic (#7831)
auto-close-inactive-issues.yml #4699:Scheduled
[https://nvbugs/5547414][fix] avoid downloading Tiny llama from HF (#8071)
auto-close-inactive-issues.yml #4698:Scheduled
[https://nvbugs/5542867][fix] Fix the non-determinism issue in the mm_encoder test (#8033)
auto-close-inactive-issues.yml #4697:Scheduled
[TRTLLM-4500][feat] Add serialization/deserialization options for AutoTuner profiling cache (#7738)
auto-close-inactive-issues.yml #4696:Scheduled
[TRTLLM-4500][feat] Add serialization/deserialization options for AutoTuner profiling cache (#7738)
auto-close-inactive-issues.yml #4695:Scheduled
[TRTLLM-4500][feat] Add serialization/deserialization options for AutoTuner profiling cache (#7738)
auto-close-inactive-issues.yml #4694:Scheduled
[TRTLLM-4500][feat] Add serialization/deserialization options for AutoTuner profiling cache (#7738)
auto-close-inactive-issues.yml #4693:Scheduled
[TRTLLM-4500][feat] Add serialization/deserialization options for AutoTuner profiling cache (#7738)
auto-close-inactive-issues.yml #4692:Scheduled
[TRTLLM-4500][feat] Add serialization/deserialization options for AutoTuner profiling cache (#7738)
auto-close-inactive-issues.yml #4691:Scheduled
[TRTLLM-4500][feat] Add serialization/deserialization options for AutoTuner profiling cache (#7738)
auto-close-inactive-issues.yml #4690:Scheduled
[TRTLLM-4500][feat] Add serialization/deserialization options for AutoTuner profiling cache (#7738)
auto-close-inactive-issues.yml #4689:Scheduled
[None][doc] Scaffolding tech blog fix a typo (#8042)
auto-close-inactive-issues.yml #4688:Scheduled
[None][doc] Scaffolding tech blog fix a typo (#8042)
auto-close-inactive-issues.yml #4687:Scheduled
[None][doc] Scaffolding tech blog fix a typo (#8042)
auto-close-inactive-issues.yml #4686:Scheduled
[None][doc] Scaffolding tech blog fix a typo (#8042)
auto-close-inactive-issues.yml #4685:Scheduled
[None][doc] Scaffolding tech blog fix a typo (#8042)
auto-close-inactive-issues.yml #4684:Scheduled