Slide View : Parallel Computer Architecture and Programming : 15-418/618 Spring 2016

Previous | Next --- Slide 78 of 79

arcticx

NVIDIA provides a quite detailed but concise tutorial on CUDA programming. The example used there is like the convolution example used in this class.

bysreg

"Threads in a thread block actually do run concurrently". To my understanding, the threads are divided into warps, and in one core, it can only run limited amount of warps at the same time, so how can it run concurrently?