mirror of
https://git.suyu.dev/suyu/suyu
synced 2025-04-17 09:44:08 -05:00

The existing implementation only supports 64 invoc-per-subgroup GPUs, and misbehaves on adreno when invocations need to be split into 4 emulated subgroups.