TensorRT-LLMs/tensorrt_llm/executor
William Zhang a6a88985cf
[TRTLLM-9409][feat] Pass MRoPE tensors for EPD disagg (#9758)
* Why?

Certain VLMs like the Qwen family need more than just the multimodal
embeddings in the language model, and need MRoPE position IDs and
deltas. Prior to this commit, only the embeddings could be communicated
from the encoder worker to the prefill worker.

* What?

This commit extends the `DisaggregatedParams` to include the MRoPE
information. It also adjusts several pieces of code required to
communicate that between E, P and D workers.

Closes TRTLLM-9409.

Signed-off-by: William Zhang <133824995+2ez4bz@users.noreply.github.com>
2025-12-22 06:32:49 -05:00
..
rpc [None][doc] update readme for rpc (#9972) 2025-12-15 10:16:50 +08:00
__init__.py chore: rename ExecutorBindingsWorker/Proxy (#4716) 2025-05-29 10:32:35 +08:00
base_worker.py [https://nvbugs/5651854][fix] Fix dist-serving perf by clearing CPU affinity (#9549) 2025-12-03 01:17:03 +08:00
executor.py [TRTLLM-9144][fix] enhance RPC robustness (#8711) 2025-12-02 21:37:59 +08:00
ipc.py [https://nvbugs/5720482][fix] Fix test rpc streaming (#9902) 2025-12-13 01:14:43 -08:00
postproc_worker.py [None][feat] perf_metrics endpoint functionality improvement (#8005) 2025-10-02 17:43:25 -07:00
proxy.py [None][refactor] simplify get_stats and get_kvcache_events with rpc (#9980) 2025-12-22 18:23:43 +08:00
ray_executor.py [TRTLLM-9784][fix] Resolve port conflicts (#9780) 2025-12-12 22:10:01 -08:00
ray_gpu_worker.py [TRTLLM-9784][fix] Resolve port conflicts (#9780) 2025-12-12 22:10:01 -08:00
request.py [None][feat] Add opentelemetry tracing (#5897) 2025-10-27 18:51:07 +08:00
result.py [TRTLLM-9409][feat] Pass MRoPE tensors for EPD disagg (#9758) 2025-12-22 06:32:49 -05:00
rpc_proxy_mixin.py [None][refactor] simplify get_stats and get_kvcache_events with rpc (#9980) 2025-12-22 18:23:43 +08:00
rpc_proxy.py [None][refactor] simplify get_stats and get_kvcache_events with rpc (#9980) 2025-12-22 18:23:43 +08:00
rpc_worker_mixin.py [None][refactor] simplify get_stats and get_kvcache_events with rpc (#9980) 2025-12-22 18:23:43 +08:00
rpc_worker.py [None][fix] enable hmac in RPC (#9745) 2025-12-07 08:24:46 +08:00
utils.py [None][refactor] simplify get_stats and get_kvcache_events with rpc (#9980) 2025-12-22 18:23:43 +08:00
worker.py [None][refactor] simplify get_stats and get_kvcache_events with rpc (#9980) 2025-12-22 18:23:43 +08:00