“One Weird Trick for Parallelizing Convolutional Neural Networks”, 2014-04-23 (; similar):
I present a new way to parallelize the training of convolutional neural networks across multiple GPUs.
The method scales better than all alternatives when applied to modern convolutional neural networks.