- See Also
- Gwern
- “Nenex: A Neural Personal Wiki Idea”, Gwern 2023
- Links
- “Neural Spline Fields for Burst Image Fusion and Layer Separation”, Chugunov et al 2023
- “Test-Time Adaptation of Discriminative Models via Diffusion Generative Feedback”, Prabhudesai et al 2023
- “In-Context Pretraining (ICP): Language Modeling Beyond Document Boundaries”, Shi et al 2023
- “OSD: Online Speculative Decoding”, Liu et al 2023
- “Test-Time Training on Video Streams”, Wang et al 2023
- “TTT-NN: Test-Time Training on Nearest Neighbors for Large Language Models”, Hardt & Sun 2023
- “FWL: Meta-Learning Fast Weight Language Models”, Clark et al 2022
- “Don’t Stop the Training: Continuously-Updating Self-Supervised Algorithms Best Account for Auditory Responses in the Cortex”, Orhan et al 2022
- “Reconsidering the Past: Optimizing Hidden States in Language Models”, Yoshida & Gimpel 2021
- “Mind the Gap: Assessing Temporal Generalization in Neural Language Models § Dynamic Evaluation”, Lazaridou et al 2021 (page 7 org deepmind)
- “Mogrifier LSTM”, Melis et al 2019
- “Dynamic Evaluation of Transformer Language Models”, Krause et al 2019
- “Dynamic Evaluation of Neural Sequence Models”, Krause et al 2017
- “Bayesian Recurrent Neural Networks”, Fortunato et al 2017
- “Learning Simpler Language Models With the Differential State Framework”, Ororbia II et al 2017
- “Neural Episodic Control”, Pritzel et al 2017
- “Multiplicative LSTM for Sequence Modelling”, Krause et al 2016
- “Generating Sequences With Recurrent Neural Networks”, Graves 2013
- “Recurrent Neural Network Based Language Model § Dynamic Evaluation”, Mikolov et al 2010 (page 2)
- “Fast Text Compression With Neural Networks”, Mahoney 2000
- Wikipedia
- Miscellaneous
- Link Bibliography
See Also
Gwern
“Nenex: A Neural Personal Wiki Idea”, Gwern 2023
Links
“Neural Spline Fields for Burst Image Fusion and Layer Separation”, Chugunov et al 2023
“Test-Time Adaptation of Discriminative Models via Diffusion Generative Feedback”, Prabhudesai et al 2023
“In-Context Pretraining (ICP): Language Modeling Beyond Document Boundaries”, Shi et al 2023
“OSD: Online Speculative Decoding”, Liu et al 2023
“Test-Time Training on Video Streams”, Wang et al 2023
“TTT-NN: Test-Time Training on Nearest Neighbors for Large Language Models”, Hardt & Sun 2023
“FWL: Meta-Learning Fast Weight Language Models”, Clark et al 2022
“Don’t Stop the Training: Continuously-Updating Self-Supervised Algorithms Best Account for Auditory Responses in the Cortex”, Orhan et al 2022
“Reconsidering the Past: Optimizing Hidden States in Language Models”, Yoshida & Gimpel 2021
“Mind the Gap: Assessing Temporal Generalization in Neural Language Models § Dynamic Evaluation”, Lazaridou et al 2021 (page 7 org deepmind)
“Mogrifier LSTM”, Melis et al 2019
“Dynamic Evaluation of Transformer Language Models”, Krause et al 2019
“Dynamic Evaluation of Neural Sequence Models”, Krause et al 2017
“Bayesian Recurrent Neural Networks”, Fortunato et al 2017
“Learning Simpler Language Models With the Differential State Framework”, Ororbia II et al 2017
“Neural Episodic Control”, Pritzel et al 2017
“Multiplicative LSTM for Sequence Modelling”, Krause et al 2016
“Generating Sequences With Recurrent Neural Networks”, Graves 2013
“Recurrent Neural Network Based Language Model § Dynamic Evaluation”, Mikolov et al 2010 (page 2)
“Fast Text Compression With Neural Networks”, Mahoney 2000
Wikipedia
Miscellaneous
- /doc/ai/nn/dynamic-evaluation/2023-hardt-figure7-bitesperbyteforgpt2large.png
- /doc/ai/nn/dynamic-evaluation/2023-hardt-figure8-bitesperbyteforgptneo.jpg
- https://benkrause.github.io/blog/human-level-text-prediction/
- https://www.latent.space/p/fastai#%C2%A7replacing-fine-tuning-with-continued-pre-training
Link Bibliography
- https://arxiv.org/abs/2307.05014: “Test-Time Training on Video Streams”, Wang et al 2023
- https://arxiv.org/abs/2305.18466: “TTT-NN: Test-Time Training on Nearest Neighbors for Large Language Models”, Hardt & Sun 2023
- https://arxiv.org/abs/2212.02475#google: “FWL: Meta-Learning Fast Weight Language Models”, Clark et al 2022
- https://arxiv.org/abs/2112.08653: “Reconsidering the Past: Optimizing Hidden States in Language Models”, Yoshida & Gimpel 2021
- https://arxiv.org/pdf/2102.01951.pdf#page=7&org=deepmind: “Mind the Gap: Assessing Temporal Generalization in Neural Language Models § Dynamic Evaluation”, Lazaridou et al 2021
- https://arxiv.org/abs/1909.01792#deepmind: “Mogrifier LSTM”, Melis et al 2019
- https://arxiv.org/abs/1904.08378: “Dynamic Evaluation of Transformer Language Models”, Krause et al 2019