S4: Efficiently Modeling Long Sequences with Structured State Spaces
LSSL: Combining Recurrent, Convolutional, and Continuous-time Models with Linear State-Space Layers
LMU: Legendre Memory Units: Continuous-Time Representation in Recurrent Neural Networks
GRU: Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation
Wikipedia Bibliography: