mirror of
https://git.suyu.dev/suyu/suyu
synced 2025-01-24 00:26:57 -06:00
68ed60cee4
The existing implementation only supports 64 invoc-per-subgroup GPUs, and misbehaves on adreno when invocations need to be split into 4 emulated subgroups.