GPU parallel radix sort: work-efficient scan-based digit passes on manycore architectures

Class:
Algorithm