In a uniform memory access setting, the performance of interleaved assignment wouldn't be any different from blocked assignment?
It would different depending on the difference in complexity on various data, like we saw in hw1.
In a uniform memory access setting, the performance of interleaved assignment wouldn't be any different from blocked assignment?
It would different depending on the difference in complexity on various data, like we saw in hw1.