Bibliography (54):

  1. Recurrent Neural Network Based Language Model § Dynamic Evaluation

  2. CQK Is The First Unused TLA § Effective GPT-4 Programming

  3. Generative AI for Professional Services

  4. Sudowrite - Best AI Writing Partner for Fiction

  5. design#future-tag-features

  6. Optical Character Recognition (OCR) in Google Docs

  7. ‘self-attention’ directory

  8. Generating Sequences With Recurrent Neural Networks

  9. Dynamic Evaluation of Transformer Language Models

  10. In-Context Pretraining (ICP): Language Modeling Beyond Document Boundaries

  11. TTT-NN: Test-Time Training on Nearest Neighbors for Large Language Models

  12. Improving Neural Language Models with a Continuous Cache

  13. Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context

  14. Compressive Transformers for Long-Range Sequence Modeling

  15. Memorizing Transformers

  16. Scaling Data-Constrained Language Models

  17. A Neural Corpus Indexer for Document Retrieval

  18. Absolute Unit NNs: Regression-Based MLPs for Everything § Memorize All The Things

  19. Many-Shot In-Context Learning

  20. https://openai.com/pricing#fine-tuning-models

  21. Faster SGD training by minibatch persistency

  22. LoRA: Low-Rank Adaptation of Large Language Models

  23. LoRA vs Full Fine-tuning: An Illusion of Equivalence

  24. When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method

  25. A Comparative Study between Full-Parameter and LoRA-based Fine-Tuning on Chinese Instruction Data for Instruction Following Large Language Model

  26. ‘low-precision NN’ directory

  27. Decision Transformer: Reinforcement Learning via Sequence Modeling

  28. Gato: A Generalist Agent

  29. Toolformer: Language Models Can Teach Themselves to Use Tools

  30. DAgger: A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning

  31. holy-war#bitrot

  32. https://maggieappleton.com/lm-sketchbook#daemons

  33. The Turing Complete User

  34. Naked objects: a technique for designing more expressive systems