mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced 2026-01-13 14:07:21 +08:00
[TRTLLM-9522][test] cover LLM API `multi_modal_embeddings` (#9963)
auto-close-inactive-issues.yml #4802:Scheduled
[None][doc] Update Qwen3-Next doc by adding known issues section (#10582)
auto-close-inactive-issues.yml #4801:Scheduled
[None][chore] Update AutoDeploy model list (#10505)
auto-close-inactive-issues.yml #4800:Scheduled
[TRTLLM-9932][test] add kimi_k2 single node perf test (#10436)
auto-close-inactive-issues.yml #4799:Scheduled
[None][test] Remove most TRT-backend test cases in llm_perf_nim.yml (#10487)
auto-close-inactive-issues.yml #4797:Scheduled
[https://nvbugs/5749988][fix] Remove redundant qwen3 spec dec test (#10387)
auto-close-inactive-issues.yml #4796:Scheduled
[TRTLLM-9767][feat] Fixed recursive node traversals (#10379)
auto-close-inactive-issues.yml #4795:Scheduled
[None][fix] Decrease Pre Merge Perf Tests (#10390)
auto-close-inactive-issues.yml #4794:Scheduled
[#10056][chore] AutoDeploy: Enable Nemo SuperV3 accuracy test (#10308)
auto-close-inactive-issues.yml #4792:Scheduled
[#10056][fix] AutoDeploy: Handle deletion of nested params in sharding (#10376)
auto-close-inactive-issues.yml #4791:Scheduled
[None][chore] Add failed cases into waives.txt (#10354)
auto-close-inactive-issues.yml #4790:Scheduled
[https://nvbugs/5766986][fix] fixed the shard_all_unprocessed default value to align with the default.yml (#10271)
auto-close-inactive-issues.yml #4789:Scheduled
[None][infra] Some improvements for Slurm execution path in the CI (#10316)
auto-close-inactive-issues.yml #4788:Scheduled
[None][ci] Move remaining DGX-B200 tests to LBD (#9876)
auto-close-inactive-issues.yml #4787:Scheduled
[None][infra] Check in most recent lock file from nightly pipeline
auto-close-inactive-issues.yml #4786:Scheduled
[https://nvbugs/5633700][fix] Cache tiktoken vocab for gpt-oss (#10219)
auto-close-inactive-issues.yml #4785:Scheduled
[TRTLLM-9578][feat] make PDL enabled by default (#9695)
auto-close-inactive-issues.yml #4784:Scheduled
[None][infra] Check GB200 coherent GPU mapping (#10253)
auto-close-inactive-issues.yml #4783:Scheduled
[None] [feat] skip batch_tokenize_prompts in CustomDataset (#10214)
auto-close-inactive-issues.yml #4782:Scheduled
[None][chore] NVLinkOneSided AlltoAll Support zero local_num_tokens. (#9822)
auto-close-inactive-issues.yml #4781:Scheduled
[https://nvbugs/5702793][fix] Fix view operation on uncontiguous tensor (#10147)
auto-close-inactive-issues.yml #4780:Scheduled
[https://nvbugs/5753250][infra] Further waive all tests in _test_openai_responses.py (#10176)
auto-close-inactive-issues.yml #4779:Scheduled
[https://nvbugs/5722653][fix] Unwaive fixed test (#10157)
auto-close-inactive-issues.yml #4778:Scheduled
[https://nvbugs/5456493][feat] Add fp8 bmm on sm120 (#9687)
auto-close-inactive-issues.yml #4777:Scheduled
[TRTLLM-9680][perf] Optimize TRTLLMSampler log_probs performance (Core fix has been merged via #9353) (#9655)
auto-close-inactive-issues.yml #4776:Scheduled
[TRTLLM-9181][feat] improve disagg-server prometheus metrics; synchronize workers' clocks when workers are dynamic (#9726)
auto-close-inactive-issues.yml #4775:Scheduled
[https://nvbugs/5540979][fix] Potential fix for 5540979 (#9716)
auto-close-inactive-issues.yml #4774:Scheduled
[None][infra] Waive failed tests for main branch on 12/14 (#9982)
auto-close-inactive-issues.yml #4773:Scheduled