“One Weird Trick for Parallelizing Convolutional Neural Networks”, Alex Krizhevsky2014-04-23 (; similar)⁠:

I present a new way to parallelize the training of convolutional neural networks across multiple GPUs.

The method scales better than all alternatives when applied to modern convolutional neural networks.