Bibliography (4):

  1. Contrastive Representation Learning: A Framework and Review

  2. https://github.com/facebookresearch/NPM

  3. ByT5: Towards a token-free future with pre-trained byte-to-byte models

  4. Wikipedia Bibliography:

    1. Softmax function