Files
Alexey Kopytko cc9e331213 SYCL: improve MoE prefill throughput (#23142)
- change `k_copy_src1_to_contiguous` so that uses a precomputed contiguous mapping where all rows "owned" by an expert are in one slice with a know starts and ends
- switch the `O(n_as * n_routed_rows)` contraption to a counting sort-based procedure with `O(n_as + n_routed_rows)` complexity
2026-05-22 15:50:17 +03:00
..
2026-04-14 17:32:58 +03:00
2026-04-16 17:21:28 +08:00
2026-05-16 20:06:23 +08:00
2026-03-25 12:53:16 +02:00