https://github.com/wiedersehne/Paramixer
ChordMixer: A Scalable Neural Attention Model for Sequences with Different Lengths
Long Range Arena (LRA): A Benchmark for Efficient Transformers
Temporal Convolutional Networks: A Unified Approach to Action Segmentation