‘compression’ directory

See Also
Gwern
Links
Miscellaneous
Bibliography

See Also

Gwern

“Quantifying Truesight With SAEs ”, Gwern 2025

Quantifying Truesight With SAEs

“Research Ideas ”, Gwern 2017

Research Ideas

“Umineko: The Hopium Of The Magics ”, Gwern 2018

Umineko: The Hopium Of The Magics

“The `sort –key` Trick ”, Gwern 2014

The sort –key Trick

“Against Copyright ”, Gwern 2008

Against Copyright

Links

“LSTM or Transformer As ‘Malware Packer’ ”, Bednarskiwsieci 2025

LSTM or Transformer as ‘malware packer’

“OpenAI Charges by the Minute, So Make the Minutes Shorter ”, Mandis 2025

OpenAI Charges by the Minute, So Make the Minutes Shorter

“The JPEG XL Image Coding System: History, Features, Coding Tools, Design Rationale, and Future ”, Sneyers et al 2025

The JPEG XL Image Coding System: History, Features, Coding Tools, Design Rationale, and Future

“Hypernym Mercury: Token Optimization Through Semantic Field Constriction And Reconstruction From Hypernyms. A New Text Compression Method ”, Forrester & Sulea 2025

Hypernym Mercury: Token Optimization Through Semantic Field Constriction And Reconstruction From Hypernyms. A New Text Compression Method

“Generative Modeling in Latent Space [Why VAEs Etc Work] ”, Dieleman 2025

Generative modeling in latent space [why VAEs etc work]

“SuperBPE: Space Travel for Language Models ”, Liu et al 2025

SuperBPE: Space Travel for Language Models

“Deep Learning Is Not So Mysterious or Different ”, Wilson 2025

Deep Learning is Not So Mysterious or Different

“PtrHash: Minimal Perfect Hashing at RAM Throughput ”, Koerkamp 2025

PtrHash: Minimal Perfect Hashing at RAM Throughput

“How Unix `spell` Ran in 64kB RAM: How Do You Fit a Dictionary in 64kb RAM? Unix Engineers Solved It With Clever Data Structures and Compression Tricks ”, Upadhyay 2025

How Unix spell Ran in 64kB RAM: How do you fit a dictionary in 64kb RAM? Unix engineers solved it with clever data structures and compression tricks

“Optimal Bounds for Open Addressing Without Reordering ”, Farach-Colton et al 2025

Optimal Bounds for Open Addressing Without Reordering

“The Complexity Dynamics of Grokking ”, DeMoss et al 2024

The Complexity Dynamics of Grokking

“WebP: The WebPage Compression Format ”, Sireneva 2024

WebP: The WebPage compression format :

View HTML:

/doc/www/purplesyringa.moe/21a7fc568307faab0c7c540b9a9b27f66af53ef9.html

“Investigating Learning-Independent Abstract Reasoning in Artificial Neural Networks ”, Barak & Loewenstein 2024

Investigating learning-independent abstract reasoning in artificial neural networks

“Uncheatable_eval: Evaluating LLMs With Dynamic Data ”, Jellyfish042 2024

uncheatable_eval: Evaluating LLMs with Dynamic Data :

View HTML:

/doc/www/github.com/4b0720aa1d823f3509d517179ddfcb2b9de6a4c0.html

“SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound ”, Liu et al 2024

SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound

“Compression Represents Intelligence Linearly ”, Huang et al 2024

Compression Represents Intelligence Linearly

“Training LLMs over Neurally Compressed Text ”, Lester et al 2024

Training LLMs over Neurally Compressed Text

“Tokenization Is More Than Compression ”, Schmidt et al 2024

Tokenization Is More Than Compression

“Infini-Gram: Scaling Unbounded n-Gram Language Models to a Trillion Tokens ”, Liu et al 2024

Infini-gram: Scaling Unbounded n-gram Language Models to a Trillion Tokens

“Language Modeling Is Compression ”, Delétang et al 2023

Language Modeling Is Compression

“UltraLogLog: A Practical and More Space-Efficient Alternative to HyperLogLog for Approximate Distinct Counting ”, Ertl 2023

UltraLogLog: A Practical and More Space-Efficient Alternative to HyperLogLog for Approximate Distinct Counting

“Bayesian Flow Networks ”, Graves et al 2023

Bayesian Flow Networks

“`ts_zip`: Text Compression Using Large Language Models [RWKV 169M V4] ”, Bellard 2023

ts_zip: Text Compression using Large Language Models [RWKV 169M v4] :

View HTML:

/doc/www/bellard.org/7f1da4826605365242b7d40d83f8e13785239846.html

“Gzip versus Bag-Of-Words for Text Classification With k-NN ”, Opitz 2023

Gzip versus bag-of-words for text classification with k-NN

“High-Fidelity Audio Compression With Improved RVQGAN ”, Kumar et al 2023

High-Fidelity Audio Compression with Improved RVQGAN

“White-Box Transformers via Sparse Rate Reduction ”, Yu et al 2023

White-Box Transformers via Sparse Rate Reduction

“How to Enumerate Trees from a Context-Free Grammar ”, Piantadosi 2023

How to enumerate trees from a context-free grammar

“DIRAC: Neural Image Compression With a Diffusion-Based Decoder ”, Goose et al 2023

DIRAC: Neural Image Compression with a Diffusion-Based Decoder

“Less Is More: Parameter-Free Text Classification With Gzip ”, Jiang et al 2022

Less is More: Parameter-Free Text Classification with Gzip

“Low-Bitrate Redundancy Coding of Speech Using a Rate-Distortion-Optimized Variational Autoencoder ”, Valin et al 2022

Low-Bitrate Redundancy Coding of Speech Using a Rate-Distortion-Optimized Variational Autoencoder

“RGB No More: Minimally-Decoded JPEG Vision Transformers ”, Park & Johnson 2022

RGB no more: Minimally-decoded JPEG Vision Transformers

“High Fidelity Neural Audio Compression ”, Défossez et al 2022

High Fidelity Neural Audio Compression

“Compressing Multidimensional Weather and Climate Data into Neural Networks ”, Huang & Hoefler 2022

Compressing multidimensional weather and climate data into neural networks

“T2CI-GAN: Text to Compressed Image Generation Using Generative Adversarial Network ”, Rajesh et al 2022

T2CI-GAN: Text to Compressed Image generation using Generative Adversarial Network

“DiffC: Lossy Compression With Gaussian Diffusion ”, Theis et al 2022

DiffC: Lossy Compression with Gaussian Diffusion

“What Is a Succinct Rank Data Structure? How Does It Work? ”, templatetypedef 2022

What is a succinct rank data structure? How does it work?

“MuZero With Self-Competition for Rate Control in VP9 Video Compression ”, Mandhane et al 2022

MuZero with Self-competition for Rate Control in VP9 Video Compression

“A Deep Dive into an NSO Zero-Click IMessage Exploit: Remote Code Execution ”, Beer & Groß 2021

A deep dive into an NSO zero-click iMessage exploit: Remote Code Execution

“Tiny Pointers ”, Bender et al 2021

Tiny Pointers

“SZ3: A Modular Framework for Composing Prediction-Based Error-Bounded Lossy Compressors ”, Liang et al 2021

SZ3: A Modular Framework for Composing Prediction-Based Error-Bounded Lossy Compressors

“Palette: Image-To-Image Diffusion Models ”, Saharia et al 2021

Palette: Image-to-Image Diffusion Models

“Autoregressive Diffusion Models ”, Hoogeboom et al 2021

Autoregressive Diffusion Models

“Variational Diffusion Models ”, Kingma et al 2021

Variational Diffusion Models

“Rip Van Winkle’s Razor, a Simple New Estimate for Adaptive Data Analysis ”, Arora & Zhang 2021

Rip van Winkle’s Razor, a Simple New Estimate for Adaptive Data Analysis

“Why Are Tar.xz Files 15× Smaller When Using Python’s Tar Library Compared to MacOS Tar? ”, Lindestøkke 2021

Why are tar.xz files 15× smaller when using Python’s tar library compared to macOS tar?

“Generating Images With Sparse Representations ”, Nash et al 2021

Generating Images with Sparse Representations

“Rip Van Winkle’s Razor: A Simple Estimate of Overfit to Test Data ”, Arora & Zhang 2021

Rip van Winkle’s Razor: A Simple Estimate of Overfit to Test Data

“Generative Speech Coding With Predictive Variance Regularization ”, Kleijn et al 2021

Generative Speech Coding with Predictive Variance Regularization

“1-Bit Adam: Communication Efficient Large-Scale Training With Adam’s Convergence Speed ”, Tang et al 2021

1-bit Adam: Communication Efficient Large-Scale Training with Adam’s Convergence Speed

“Scaling Laws for Autoregressive Generative Modeling ”, Henighan et al 2020

Scaling Laws for Autoregressive Generative Modeling

“Password Similarity Using Probabilistic Data Structures ”, Berardi et al 2020

Password similarity using probabilistic data structures

“Not-So-BigGAN: Generating High-Fidelity Images on Small Compute With Wavelet-Based Super-Resolution ”, Han et al 2020

not-so-BigGAN: Generating High-Fidelity Images on Small Compute with Wavelet-based Super-Resolution

“Zip Files: History, Explanation and Implementation ”, Wennborg 2020

Zip Files: History, Explanation and Implementation

“The 1-Bit Instrument: The Fundamentals of 1-Bit Synthesis, Their Implementational Implications, and Instrumental Possibilities ”, Troise 2020

The 1-Bit Instrument: The Fundamentals of 1-Bit Synthesis, Their Implementational Implications, and Instrumental Possibilities

“People Prefer Simpler Content When There Are More Choices: A Time Series Analysis of Lyrical Complexity in Six Decades of American Popular Music ”, Varnum et al 2019

People Prefer Simpler Content When There Are More Choices: A Time Series Analysis of Lyrical Complexity in Six Decades of American Popular Music

“Bit-Swap: Recursive Bits-Back Coding for Lossless Compression With Hierarchical Latent Variables ”, Kingma et al 2019

Bit-Swap: Recursive Bits-Back Coding for Lossless Compression with Hierarchical Latent Variables

“Unraveling the JPEG: JPEG Images Are Everywhere in Our Digital Lives, but behind the Veil of Familiarity Lie Algorithms That Remove Details That Are Imperceptible to the Human Eye. This Produces the Highest Visual Quality With the Smallest File Size—But What Does That Look Like? Let’s See What Our Eyes Can’t See! ”, Shehata 2019

Unraveling the JPEG: JPEG images are everywhere in our digital lives, but behind the veil of familiarity lie algorithms that remove details that are imperceptible to the human eye. This produces the highest visual quality with the smallest file size—but what does that look like? Let’s see what our eyes can’t see!

“Practical Lossless Compression With Latent Variables Using Bits Back Coding ”, Townsend et al 2019

Practical Lossless Compression with Latent Variables using Bits Back Coding

“Associative Compression Networks for Representation Learning ”, Graves et al 2018

Associative Compression Networks for Representation Learning

“The Description Length of Deep Learning Models ”, Blier & Ollivier 2018

The Description Length of Deep Learning Models

“SignSGD: Compressed Optimization for Non-Convex Problems ”, Bernstein et al 2018

signSGD: Compressed Optimization for Non-Convex Problems

“Lempel-Ziv: a ‘1-Bit Catastrophe’ but Not a Tragedy ”, Lagarde & Perifel 2017

Lempel-Ziv: a ‘1-bit catastrophe’ but not a tragedy

A Mind Is Born, Akesson 2017

A Mind Is Born

“A Mind Is Born Video [256 Byte Demo] ”, Akesson 2017

A Mind Is Born video [256 byte demo] :

https://www.youtube.com/watch?v=sWblpsLZ-O8

“BBhash: Fast and Scalable Minimal Perfect Hashing for Massive Key Sets ”, Limasset et al 2017

BBhash: Fast and scalable minimal perfect hashing for massive key sets

“Wuffs: Wrangling Untrusted File Formats Safely ”, Tao 2017

Wuffs: Wrangling Untrusted File Formats Safely :

View HTML:

/doc/www/github.com/f13e69ccf202b39a2e3f561ae7534f180903658c.html

“Full Resolution Image Compression With Recurrent Neural Networks ”, Toderici et al 2016

Full Resolution Image Compression with Recurrent Neural Networks

“On Learning to Think: Algorithmic Information Theory for Novel Combinations of Reinforcement Learning Controllers and Recurrent Neural World Models ”, Schmidhuber 2015

On Learning to Think: Algorithmic Information Theory for Novel Combinations of Reinforcement Learning Controllers and Recurrent Neural World Models

“Compress and Control ”, Veness et al 2014

Compress and Control

“A Really Simple Approximation of Smallest Grammar ”, Jeż 2014

A really simple approximation of smallest grammar

“One Billion Word Benchmark for Measuring Progress in Statistical Language Modeling ”, Chelba et al 2013

One Billion Word Benchmark for Measuring Progress in Statistical Language Modeling

“The Thermodynamics of Prediction ”, Still et al 2012

The thermodynamics of prediction

“Notes on a New Philosophy of Empirical Science ”, Burfoot 2011

Notes on a New Philosophy of Empirical Science

“Universal Entropy of Word Ordering Across Linguistic Families ”, Montemurro & Zanette 2011

Universal Entropy of Word Ordering Across Linguistic Families

“Google-Wide Profiling: A Continuous Profiling Infrastructure for Data Centers ”, Ren et al 2010

Google-Wide Profiling: A Continuous Profiling Infrastructure for Data Centers

“New Strategy of Lossy Text Compression ”, Al-Dubaee & Ahmad 2010

New Strategy of Lossy Text Compression

“A Monte Carlo AIXI Approximation ”, Veness et al 2009

A Monte Carlo AIXI Approximation

“A Machine Learning Perspective on Predictive Coding With PAQ8 and New Applications ”, Knoll 2009

A Machine Learning Perspective on Predictive Coding with PAQ8 and New Applications :

View PDF:

/doc/www/www.byronknoll.com/16eabf617641b695866278adadce672d0cfe182c.pdf

“Seam Carving for Content-Aware Image Resizing ”, Avidan & Shamir 2007

Seam carving for content-aware image resizing

“The Bayesian Brain: the Role of Uncertainty in Neural Coding and Computation ”, Knill & Pouget 2004

The Bayesian brain: the role of uncertainty in neural coding and computation

“Clustering by Compression ”, Cilibrasi & Vitanyi 2003

Clustering by compression

“Data Compression and Entropy Estimates by Non-Sequential Recursive Pair Substitution ”, Grassberger 2002

Data Compression and Entropy Estimates by Non-sequential Recursive Pair Substitution

“Compression and Information Leakage of Plaintext ”, Kelsey 2002

Compression and Information Leakage of Plaintext

“Estimating and Comparing Entropy across Written Natural Languages Using PPM Compression ”, Behr et al 2002

Estimating and Comparing Entropy across Written Natural Languages Using PPM Compression

“The Similarity Metric ”, Li et al 2001

The similarity metric

“Language Trees and Zipping ”, Benedetto et al 2001

Language Trees and Zipping

“Redundancy Reduction Revisited ”, Barlow 2001

Redundancy reduction revisited

“Optimization Is Easy and Learning Is Hard in the Typical Function ”, English 2000

Optimization is easy and learning is hard in the typical function

“Fast Text Compression With Neural Networks ”, Mahoney 2000

Fast Text Compression with Neural Networks

“Text Compression As a Test for Artificial Intelligence ”, Mahoney 1999

Text Compression as a Test for Artificial Intelligence

“An Information-Theoretic Model for Steganography ”, Cachin 1998

An Information-Theoretic Model for Steganography

“The Art of Computer Programming, Volume 3: Sorting & Searching § Chapter 6, Searching: Hashing: History ”, Knuth 1998

The Art of Computer Programming, Volume 3: Sorting & Searching § Chapter 6, Searching: Hashing: History :

View PDF:

/doc/cs/algorithm/information/compression/1998-knuth-taocp-v3-sortingandsearching-hashinghistory.pdf

“Low-Complexity Art ”, Schmidhuber 1997

Low-Complexity Art

“The Entropy Of English Using Ppm-Based Models ”, Teahan 1996

The Entropy Of English Using Ppm-Based Models :

View PDF:

/doc/cs/algorithm/information/compression/1996-teahan.pdf

“Measuring the Complexity of Writing Systems ”, Bosch et al 1994

Measuring the complexity of writing systems

“Entropy of Natural Languages: Theory and Experiment ”, Levitin & Reingold 1994

Entropy of natural languages: Theory and experiment

“Space-Efficient Static Trees and Graphs ”, Jacobson 1989

Space-efficient static trees and graphs

“Succinct Static Data Structures ”, Jacobson 1988

Succinct static data structures

“Development of a Spelling List ”, McIlroy 1982

Development of a Spelling List

“Possible Principles Underlying the Transformations of Sensory Messages ”, Barlow 1961

Possible Principles Underlying the Transformations of Sensory Messages

“Prediction and Entropy of Printed English ”, Shannon 1951

Prediction and entropy of printed English

“About the Test Data ”

About the Test Data

“Timm S. Mueller ”

Timm S. Mueller

“An Estimate of an Upper Bound for the Entropy of English ”

An estimate of an upper bound for the entropy of English :

View PDF:

/doc/www/aclanthology.org/ee294b20a9260a974d5bf5202ba5a38e141f6b7b.pdf

“Codec2: a Whole Podcast on a Floppy Disk ”

Codec2: a whole Podcast on a Floppy Disk :

View HTML:

/doc/www/auphonic.com/d11f6883e3e6248e8b858475f79c7d391f5d63f3.html

“Finding Near-Duplicates With Jaccard Similarity and MinHash ”

Finding near-duplicates with Jaccard similarity and MinHash :

View HTML:

/doc/www/blog.nelhage.com/823ca6aed7f4694a027316f8407ec5aea7e254e6.html

“How We Shrank Our Trip Planner till It Didn’t Need Data. ”

How we shrank our trip planner till it didn’t need data. :

View HTML:

/doc/www/blog.transitapp.com/6c2459d226425068bb7eddec4ae82f26c6baf464.html#c293

“The Complexity Dynamics of Grokking [Blog] ”, DeMoss et al 2025

The Complexity Dynamics of Grokking [blog] :

View HTML:

/doc/www/brantondemoss.com/4a3ef0cba7954230e30875b15ffbf2c0611950da.html

“Statistical Inference Through Data Compression ”

Statistical Inference Through Data Compression :

View HTML:

/doc/www/complearn.org/5af1e4a5d67ab84d46ac933f01132c9e0a2002cf.html

“A Proposal for Common Crawl to Consider Moving Compression from Gzip to Zstandard ”

A Proposal for Common Crawl to Consider Moving Compression from Gzip to Zstandard :

View HTML:

/doc/www/github.com/2256cff7531a5dab159382679b40023bf6d9e318.html#performance-tuning

“Compression Represents Intelligence Linearly [Code] ”

Compression Represents Intelligence Linearly [code] :

View HTML:

/doc/www/github.com/f8d2badbe21687a400993008fc11ca12d5fc6552.html

“ChessPositionRanking/img/2389704906374985477664262349386869232706664089.png at Main • Tromp/ChessPositionRanking ”

ChessPositionRanking/img/2389704906374985477664262349386869232706664089.png at main • tromp/ChessPositionRanking

“Llm-Compression Data ”

llm-compression data

“King James Programming ”

King James Programming

“LZ4—Extremely Fast Compression ”

LZ4—Extremely fast compression

“Relation of Word Order and Compression Ratio and Degree of Structure ”

Relation of Word Order and Compression Ratio and Degree of Structure :

View HTML:

/doc/www/web.archive.org/832f88f436be10149be2e6b4dce785449906a3f1.html

“That Alien Message ”, Yudkowsky 2025

That Alien Message

Sort By Magic

Annotations sorted by machine learning into inferred 'tags'. This provides an alternative way to browse: instead of by date order, one can browse in topic order. The 'sorted' list has been automatically clustered into multiple sections & auto-labeled for easier browsing.

Beginning with the newest annotation, it uses the embedding of each annotation to attempt to create a list of nearest-neighbor annotations, creating a progression of topics. For more details, see the link.

`hashing`

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

`compression-theory`

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

`neural-compression`

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

Wikipedia (48)

Algorithmic information theory
Asymmetric Numeral Systems :

https://en.wikipedia.org/wiki/Asymmetric_Numeral_Systems
Asymmetric numeral systems
BLUF (communication) § Use in the military :

https://en.wikipedia.org/wiki/BLUF_(communication)#Use_in_the_military
Bit array :

https://en.wikipedia.org/wiki/Bit_array
Brotli
Burrows-Wheeler transform
Bzip2 :

https://en.wikipedia.org/wiki/Bzip2
https://en.wikipedia.org/wiki/Chroma_subsampling :

https://en.wikipedia.org/wiki/Chroma_subsampling
Code golf
Cuckoo filter :

https://en.wikipedia.org/wiki/Cuckoo_filter
Free energy principle
Hashlife
Huffman coding
Hutter Prize
JBIG2
JPEG § Quantization :

https://en.wikipedia.org/wiki/JPEG#Quantization
JPEG XL :

https://en.wikipedia.org/wiki/JPEG_XL
Kolmogorov complexity
LZ77 and LZ78
Lempel-Ziv-Markov chain algorithm
Locality-sensitive hashing
Matt Mahoney :

https://en.wikipedia.org/wiki/Matt_Mahoney
MinHash :

https://en.wikipedia.org/wiki/MinHash
Minimum description length
Minimum message length :

https://en.wikipedia.org/wiki/Minimum_message_length
PAQ :

https://en.wikipedia.org/wiki/PAQ
Perfect hash function :

https://en.wikipedia.org/wiki/Perfect_hash_function
Phil Katz :

https://en.wikipedia.org/wiki/Phil_Katz
Prediction by partial matching :

https://en.wikipedia.org/wiki/Prediction_by_partial_matching
Predictive coding
Prefix code :

https://en.wikipedia.org/wiki/Prefix_code
Random projection
Ray Solomonoff :

https://en.wikipedia.org/wiki/Ray_Solomonoff
Rzip
SDCH :

https://en.wikipedia.org/wiki/SDCH
SimHash :

https://en.wikipedia.org/wiki/SimHash
Snappy (compression) :

https://en.wikipedia.org/wiki/Snappy_(compression)
Solid compression :

https://en.wikipedia.org/wiki/Solid_compression
Solomonoff’s theory of inductive inference
Succinct data structure :

https://en.wikipedia.org/wiki/Succinct_data_structure
Suffix array :

https://en.wikipedia.org/wiki/Suffix_array
The Complexity of Songs
Trie
Universal artificial intelligence :

https://en.wikipedia.org/wiki/Universal_artificial_intelligence
Z-order curve :

https://en.wikipedia.org/wiki/Z-order_curve
Zip bomb
Zstd :

https://en.wikipedia.org/wiki/Zstd

Miscellaneous

Bibliography

https://arxiv.org/abs/2503.13423: “SuperBPE: Space Travel for Language Models ”, Alisa Liu, Jonathan Hayase, Valentin Hofmann, Sewoong Oh, Noah A. Smith, Yejin Choi

link-bibliography
https://arxiv.org/abs/2404.09937: “Compression Represents Intelligence Linearly ”, Yuzhen Huang, Jinghan Zhang, Zifei Shan, Junxian He

link-bibliography
https://arxiv.org/abs/2212.09410: “Less Is More: Parameter-Free Text Classification With Gzip ”, Zhiying Jiang, Matthew Y. R. Yang, Mikhail Tsirlin, Raphael Tang, Jimmy Lin

link-bibliography
https://arxiv.org/abs/2210.13438#facebook: “High Fidelity Neural Audio Compression ”, Alexandre Défossez, Jade Copet, Gabriel Synnaeve, Yossi Adi

link-bibliography
https://www.offconvex.org/2021/04/07/ripvanwinkle/: “Rip Van Winkle’s Razor, a Simple New Estimate for Adaptive Data Analysis ”, Sanjeev Arora, Yi Zhang

link-bibliography
https://arxiv.org/abs/2102.02888#microsoft: “1-Bit Adam: Communication Efficient Large-Scale Training With Adam’s Convergence Speed ”, Hanlin Tang, Shaoduo Gan, Ammar Ahmad Awan, Samyam Rajbhandari, Conglong Li, Xiangru Lian, Ji Liu, Ce Zhang, Yuxiong He

link-bibliography
https://arxiv.org/abs/2010.14701#openai: “Scaling Laws for Autoregressive Generative Modeling ”, Tom Henighan, Jared Kaplan, Mor Katz, Mark Chen, Christopher Hesse, Jacob Jackson, Heewoo Jun, Tom B. Brown, Prafulla Dhariwal, Scott Gray, Chris Hallacy, Benjamin Mann, Alec Radford, Aditya A. Ramesh, Nick Ryder, Daniel M. Ziegler, John Schulman, Dario Amodei, Sam McCandlish

link-bibliography
https://arxiv.org/abs/2009.04433: “Not-So-BigGAN: Generating High-Fidelity Images on Small Compute With Wavelet-Based Super-Resolution ”, Seungwook Han, Akash Srivastava, Cole Hurwitz, Prasanna Sattigeri, David D. Cox

link-bibliography
1997-schmidhuber.pdf: “Low-Complexity Art ”, Jürgen Schmidhuber

link-bibliography
1994-vandenbosch.pdf: “Measuring the Complexity of Writing Systems ”, Antal van den Bosch, Alain Contenty, Walter Daelemansz, Beatrice de Gelder

link-bibliography
1961-barlow.pdf: “Possible Principles Underlying the Transformations of Sensory Messages ”, H. B. Barlow

link-bibliography

[Quote Of The Day]

[Site Of The Day]

[Annotation Of The Day]

[adblock public service announcement]