Slide View : Parallel Computer Architecture and Programming : 15-418/618 Fall 2016

Slide 49 of 78

caiqifang

Is there a particular reason we define THREADS_PER_BLK as 128?

llcoolj

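I think it's based on how thread scheduling works? The GPU schedules threads in warps of 32, so block sizes are usually picked as a multiple of 32, and 128 is 4 warps.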

ferozenaina

This is just an example; 128 is not a required value. On CUDA compute capability 6.1, a block can have at most 1024 threads in total, meaning the product of its x, y, and z dimensions cannot exceed 1024.
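To make the limits in this thread concrete, here is a minimal host-side sketch in plain C++ (no CUDA calls) of the arithmetic behind choosing a block size. The helper names `block_shape_fits`, `blocks_needed`, and `is_warp_multiple` are hypothetical and not from the lecture code. The constants are assumptions: 1024 threads per block as quoted above, a warp size of 32, and per-dimension caps of 1024, 1024, and 64 for x, y, and z.

```cpp
// Assumed device limits (hard-coded here rather than queried from the GPU).
constexpr unsigned kMaxThreadsPerBlock = 1024; // total threads per block
constexpr unsigned kMaxBlockDimX = 1024;       // assumed per-dimension caps
constexpr unsigned kMaxBlockDimY = 1024;
constexpr unsigned kMaxBlockDimZ = 64;
constexpr unsigned kWarpSize = 32;

// The value used on the slide; any size that passes the checks below would work.
constexpr unsigned THREADS_PER_BLK = 128;

// Hypothetical helper: true if an (x, y, z) block shape respects both the
// per-dimension caps and the limit on the total number of threads.
constexpr bool block_shape_fits(unsigned x, unsigned y, unsigned z) {
    return x >= 1 && y >= 1 && z >= 1 &&
           x <= kMaxBlockDimX && y <= kMaxBlockDimY && z <= kMaxBlockDimZ &&
           x * y * z <= kMaxThreadsPerBlock;
}

// Hypothetical helper: number of blocks needed to cover n elements with one
// thread per element, rounding up so the last partial block is included.
constexpr unsigned blocks_needed(unsigned n, unsigned threads_per_blk) {
    return n / threads_per_blk + (n % threads_per_blk != 0 ? 1u : 0u);
}

// Hypothetical helper: true if the block size divides evenly into warps.
constexpr bool is_warp_multiple(unsigned threads_per_blk) {
    return threads_per_blk % kWarpSize == 0;
}
```

With these helpers, `blocks_needed(N, THREADS_PER_BLK)` would give the grid size for a 1D launch over `N` elements, and a shape such as 32 x 32 x 1 passes `block_shape_fits` while 32 x 32 x 2 does not. In real code the limits would more likely be read from the device than hard-coded.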