Slide View : Parallel Computer Architecture and Programming : 15-418/618 Spring 2017

Efficiently Evaluating Deep Networks

Previous | Next --- Slide 38 of 51

Back to Lecture Thumbnails

chenh1

The convolution in Fourier domain can be referenced here.

eourcs

Interesting to note that the original FFT algorithm (the Cooley-Tukey algorithm) is one of the early examples of a trivially parallelizable divide-and-conquer algorithm. No asymptotic improvements have been made to computing circular convolution since then, but there is a very healthy area of research around decreasing the number of multiplications for certain data sets. For example, Overlap-add is an efficient way of computing convolution when you have a large image and a small filter (which is often the case in deep learning).

themj

The Winograd algorithm is based off the premise that it is possible to uniquely given its remainder with respect to the given moduli. (Assumptions are that moduli Are relatively prime)