Slide View : Parallel Computer Architecture and Programming : 15-418/618 Spring 2017

nemo

Because CUDA assigns blocks (of CUDA threads) to cores as resources become available, the scheduler can keep the cores busy without us having to specify a fixed number of cores beforehand. In that sense it is similar to how ISPC launches 'tasks'. It also lets the same code run on GPUs with different specifications.
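
To make the idea concrete, here is a minimal CPU-side sketch in C++. It is only a model of the behavior described above, not how the CUDA hardware scheduler or the ISPC task system is actually implemented. The function name `scale_in_blocks` and the parameters `block_size` and `num_workers` are hypothetical: each worker thread stands in for one core, and each "block" is an independent chunk of the array. The work is split by problem size alone, and whichever worker is free grabs the next block.

```cpp
#include <algorithm>
#include <atomic>
#include <cassert>
#include <thread>
#include <vector>

// Hypothetical model: multiply every element of `in` by `a`, one block at a time.
// The number of blocks depends only on the input size and block_size,
// never on num_workers (the stand-in for the number of cores).
std::vector<float> scale_in_blocks(const std::vector<float>& in, float a,
                                   int block_size, int num_workers) {
    assert(block_size > 0 && num_workers > 0);
    const int n = static_cast<int>(in.size());
    const int num_blocks = (n + block_size - 1) / block_size;  // ceiling division

    std::vector<float> out(in.size());
    std::atomic<int> next_block{0};  // shared counter of blocks not yet started

    // Each worker repeatedly claims the next unstarted block until none remain.
    auto worker = [&]() {
        for (;;) {
            const int b = next_block.fetch_add(1);
            if (b >= num_blocks) break;
            const int begin = b * block_size;
            const int end = std::min(begin + block_size, n);
            for (int i = begin; i < end; ++i) {
                out[i] = a * in[i];  // blocks cover disjoint ranges, so no races
            }
        }
    };

    std::vector<std::thread> pool;
    for (int w = 0; w < num_workers; ++w) pool.emplace_back(worker);
    for (auto& t : pool) t.join();
    return out;
}
```

Calling `scale_in_blocks(data, 2.0f, 4, 1)` and `scale_in_blocks(data, 2.0f, 4, 8)` should produce the same result. Only the number of workers draining the block queue changes, which is the property that lets the same code run on machines with different core counts.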