Slide View : Parallel Computer Architecture and Programming : 15-418/618 Spring 2016

Previous | Next --- Slide 21 of 42

Renegade

As a matter of fact, asynchrony has been widely adopted by many modern Distributed Machine Learning frameworks, such as Microsoft Adam, the Parameter Server by Mu Li et al. Besides great speedup from relaxation of synchronisation, the accuracy of model can also be improved surprisingly.

jellybean

Any idea why the accuracy would improve? I find it counter-intuitive that relaxing synchronization improves accuracy. Maybe some level of randomness is beneficial?