Note that the first fully connected layer in VGG is very expensive in terms of memory: there are 7x7x512x4096 weights on that layer.

A recent paper showed that something like 97% of those weights could be removed (set to 0) without reducing the accuracy of the network. This could lead to much faster performance on bandwidth-bound machines.
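The pruning idea can be sketched with simple magnitude pruning: zero out the weights smallest in absolute value until the target sparsity is hit. This is a minimal illustration, not the exact method from the paper; the matrix shape here is a scaled-down stand-in for the real fc6 weights.

```python
import numpy as np

rng = np.random.default_rng(0)
# Hypothetical stand-in for the fc6 weight matrix (real shape: 7*7*512 x 4096).
# Scaled down so the sketch runs instantly.
W = rng.standard_normal((1024, 256)).astype(np.float32)

# Magnitude pruning: zero the 97% of weights with the smallest absolute value.
sparsity = 0.97
threshold = np.quantile(np.abs(W), sparsity)
W_pruned = np.where(np.abs(W) < threshold, 0.0, W)

frac_zero = 1.0 - np.count_nonzero(W_pruned) / W_pruned.size
print(frac_zero)  # close to 0.97
```

A sparse matrix like this can then be stored and multiplied in a compressed format, which is where the bandwidth savings come from.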
rsvaidya
How do we get the value of 392 MB for the fully connected layer's weight memory?

Is it something like 7x7x512 (the earlier layer's output) * 4096 * 4 bytes ≈ 411 MB? What am I counting extra?
Nothing extra — it's a units difference: 411*1000*1000 bytes ~= 392*1024*1024 bytes. The 392 MB figure uses binary megabytes (1024^2 bytes), while your 411 MB uses decimal megabytes.
Thanks @crow