CUDA, embedded systems, assembly intrinsics, memory-constrained systems, anything low level: these are the core expertise of Cveler. It is also not always possible to run everything on your overcrowded DSPs, in which case the workload needs to be rebalanced between DSPs, GPUs, and CPUs.
