“A Connection Between Score Matching and Denoising Autoencoders”, Pascal Vincent2011-07-01 (, )⁠:

[cf. “Diffusion models are autoencoders”] Denoising autoencoders have been previously shown to be competitive alternatives to restricted Boltzmann machines for unsupervised pretraining of each layer of a deep architecture.

We show that a simple denoising autoencoder training criterion is equivalent to matching the score (with respect to the data) of a specific energy-based model to that of a nonparametric Parzen density estimator of the data.

This yields several useful insights: