Bibliography (7):
https://www.lesswrong.com/posts/mxa7XZ8ajE2oarWcr/lawrencec-s-shortform?commentId=pEqfzPMpqsnhaGrNK
‘end-to-end’ directory
Attention Is All You Need
Wikipedia Bibliography:
Deep learning
Transformer (deep learning architecture)
Attention (machine learning)
https://en.wikipedia.org/wiki/Neural_network