-
https://github.com/wiedersehne/Paramixer
-
ChordMixer: A Scalable Neural Attention Model for Sequences with Different Lengths
-
Long Range Arena (LRA): A Benchmark for Efficient Transformers
-
Temporal Convolutional Networks: A Unified Approach to Action Segmentation
-