Improving tasks throughput on accelerators using OpenCL command concurrency