-
$2024
-
A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play
-
Attention Is All You Need
-
Dataset Distillation
-
An Empirical Model of Large-Batch Training
-
Beyond AI for Wafer Scale Compute: Setting Records in Computational Fluid Dynamics
-
Introducing Cerebras Inference: AI at Instant Speed
-
12 Hours Later, Groq Deploys Llama-3-Instruct (8 & 70B)
-
Etched Is Making the Biggest Bet in AI
-
$2022
-
Pipelined Backpropagation at Scale: Training Large Models without Batches
-