Bibliography (7):

  1. DiM: Scaling Diffusion Mamba with Bidirectional SSMs for Efficient Image and Video Generation

  2. Learning to (Learn at Test Time): RNNs with Expressive Hidden States

  3. Attention Is All You Need

  4. Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality

  5. Gated Delta Networks: Improving Mamba-2 with Delta Rule

  6. https://test-time-training.github.io/video-dit