Previous | Next --- Slide 52 of 79
Back to Lecture Thumbnails
IntergalacticPeanutMaker
  • 128 threads per block --> #define THREADS_PER_BLK 128
  • B bytes of local data per thread --> automatically allocated
  • 130 floats (520 bytes) of shared space per thread block --> __shared__ float support[THREADS_PER_BLK+2]

Launch 8K thread blocks --> 1024*1024/128 = 8192