Bibliography (12):

  1. https://x.com/siyan_zhao/status/1805277462890492321

  2. https://arxiv.org/pdf/2406.11233#page=16

  3. https://gwern.net/doc/ai/nn/fully-connected/2024-zhao-figure1-llmshavemuchrougherdecisionboundariesthanmlpsorsvmsordecisiontrees.png

  4. LLaMA-2: Open Foundation and Fine-Tuned Chat Models

  5. Mistral-7B

  6. Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

  7. https://openai.com/index/hello-gpt-4o/

  8. GPT-3: Language Models are Few-Shot Learners