mirror of
https://git.suyu.dev/suyu/suyu
synced 2026-02-20 00:18:30 -06:00
The existing implementation only supports 64 invoc-per-subgroup GPUs, and misbehaves on adreno when invocations need to be split into 4 emulated subgroups.