- See Also
- Gwern
- Links
- “Do LLMs Estimate Uncertainty Well in Instruction-Following?”, Heo et al 2024
- “SimpleStrat: Diversifying Language Model Generation With Stratification”, Wong et al 2024
- “Me, Myself, and AI: The Situational Awareness Dataset (SAD) for LLMs”, Laine et al 2024
- “Be like a Goldfish, Don’t Memorize! Mitigating Memorization in Generative LLMs”, Hans et al 2024
- “Superposed Decoding: Multiple Generations from a Single Autoregressive Inference Pass”, Shen et al 2024
- “Probabilistic Inference in Language Models via Twisted Sequential Monte Carlo”, Zhao et al 2024
- “Σ-GPTs: A New Approach to Autoregressive Models”, Pannatier et al 2024
- “LLM Evaluators Recognize and Favor Their Own Generations”, Panickssery et al 2024
- “Re-Evaluating GPT-4’s Bar Exam Performance”, Martínez 2024
- “Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking”, Zelikman et al 2024
- “Monitoring AI-Modified Content at Scale: A Case Study on the Impact of ChatGPT on AI Conference Peer Reviews”, Liang et al 2024
- “Chain-Of-Thought Reasoning Without Prompting”, Wang & Zhou 2024
- “The Non-Effect of Sampling Temperature on Problem Solving in GPT-3.5/GPT-4”, Renze & Guven 2024
- “Blending Is All You Need: Cheaper, Better Alternative to Trillion-Parameters LLM”, Lu et al 2024
- “GIVT: Generative Infinite-Vocabulary Transformers”, Tschannen et al 2023
- “Universal Self-Consistency for Large Language Model Generation”, Chen et al 2023
- “Controlled Text Generation via Language Model Arithmetic”, Dekoninck et al 2023
- “Language Model Inversion”, Morris et al 2023
- “SEDD: Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution”, Lou et al 2023
- “Let Models Speak Ciphers: Multiagent Debate through Embeddings”, Pham et al 2023
- “Contrastive Decoding Improves Reasoning in Large Language Models”, O’Brien & Lewis 2023
- “Accelerating LLM Inference With Staged Speculative Decoding”, Spector & Ré 2023
- “Efficient Guided Generation for Large Language Models”, Willard & Louf 2023
- “Copy Is All You Need”, Lan et al 2023
- “Stay on Topic With Classifier-Free Guidance”, Sanchez et al 2023
- “Sequential Monte Carlo Steering of Large Language Models Using Probabilistic Programs”, Lew et al 2023
- “How Language Model Hallucinations Can Snowball”, Zhang et al 2023
- “Tractable Control for Autoregressive Language Generation”, Zhang et al 2023
- “MUX-PLMs: Pre-Training Language Models With Data Multiplexing”, Murahari et al 2023
- “Characterizing Attribution and Fluency Tradeoffs for Retrieval-Augmented Large Language Models”, Aksitov et al 2023
- “DataMUX: Data Multiplexing for Neural Networks”, Murahari et al 2023
- “Witscript 3: A Hybrid AI System for Improvising Jokes in a Conversation”, Toplyn 2023
- “A Survey on Text Generation Using Generative Adversarial Networks”, Rosa & Papa 2022
- “Fast Inference from Transformers via Speculative Decoding”, Leviathan et al 2022
- “The CRINGE Loss: Learning What Language Not to Model”, Adolphs et al 2022
- “Contrastive Decoding: Open-Ended Text Generation As Optimization”, Li et al 2022
- “Help Me Write a Poem: Instruction Tuning As a Vehicle for Collaborative Poetry Writing (CoPoet)”, Chakrabarty et al 2022
- “Contrastive Search Is What You Need For Neural Text Generation”, Su & Collier 2022
- “Arithmetic Sampling: Parallel Diverse Decoding for Large Language Models”, Vilnis et al 2022
- “Most Language Models Can Be Poets Too: An AI Writing Assistant and Constrained Text Generation Studio”, Roush et al 2022
- “Ask Me Anything (AMA): A Simple Strategy for Prompting Language Models”, Arora et al 2022
- “Out of One, Many: Using Language Models to Simulate Human Samples”, Argyle et al 2022
- “Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned”, Ganguli et al 2022
- “Effidit: Your AI Writing Assistant”, Shi et al 2022
- “DIRECTOR: Generator-Classifiers For Supervised Language Modeling”, Arora et al 2022
- “RankGen: Improving Text Generation With Large Ranking Models”, Krishna et al 2022
- “Time Control: Language Modeling via Stochastic Processes”, Wang et al 2022
- “Controllable Natural Language Generation With Contrastive Prefixes”, Qian et al 2022
- “Using Natural Language Prompts for Machine Translation”, Garcia & Firat 2022
- “A Contrastive Framework for Neural Text Generation”, Su et al 2022
- “Typical Decoding for Natural Language Generation”, Meister et al 2022
- “FIGARO: Generating Symbolic Music With Fine-Grained Artistic Control”, Rütte et al 2022
- “A Survey of Controllable Text Generation Using Transformer-Based Pre-Trained Language Models”, Zhang et al 2022
- “FRUIT: Faithfully Reflecting Updated Information in Text”, Logan et al 2021
- “NeuroLogic A✱esque Decoding: Constrained Text Generation With Lookahead Heuristics”, Lu et al 2021
- “Relating Neural Text Degeneration to Exposure Bias”, Chiang & Chen 2021
- “Program Synthesis With Large Language Models”, Austin et al 2021
- “Scarecrow: A Framework for Scrutinizing Machine Text”, Dou et al 2021
- “Time-Aware Language Models As Temporal Knowledge Bases”, Dhingra et al 2021
- “Machine Translation Decoding beyond Beam Search”, Leblond et al 2021
- “Controllable Generation from Pre-Trained Language Models via Inverse Prompting”, Zou et al 2021
- “Improving Diversity of Neural Text Generation via Inverse Probability Weighting”, Zhang et al 2021
- “There Once Was a Really Bad Poet, It Was Automated but You Didn’t Know It”, Wang et al 2021
- “A✱ Search Without Expansions: Learning Heuristic Functions With Deep Q-Networks”, Agostinelli et al 2021
- “MAUVE: Measuring the Gap Between Neural Text and Human Text Using Divergence Frontiers”, Pillutla et al 2021
- “Prefix-Tuning: Optimizing Continuous Prompts for Generation”, Li & Liang 2021
- “Bot-Adversarial Dialogue for Safe Conversational Agents”, Xu et al 2021
- “Collaborative Storytelling With Large-Scale Neural Language Models”, Nichols et al 2020
- “NeuroLogic Decoding: (Un)supervised Neural Text Generation With Predicate Logic Constraints”, Lu et al 2020
- “Interacting With GPT-2 to Generate Controlled and Believable Musical Sequences in ABC Notation”, Geerlings & Meroño-Peñuela 2020
- “Training Independent Subnetworks for Robust Prediction”, Havasi et al 2020
- “MEGATRON-CNTRL: Controllable Story Generation With External Knowledge Using Large-Scale Language Models”, Xu et al 2020
- “Weird AI Yankovic: Generating Parody Lyrics”, Riedl 2020
- “A Systematic Characterization of Sampling Algorithms for Open-Ended Language Generation”, Nadeem et al 2020
- “GeDi: Generative Discriminator Guided Sequence Generation”, Krause et al 2020
- “Mirostat: A Neural Text Decoding Algorithm That Directly Controls Perplexity”, Basu et al 2020
- “Progressive Generation of Long Text”, Tan et al 2020
- “This Word Does Not Exist”, Dimson 2020
- “True_poetry: Poetry Generator by GPT-2 With Meter and Rhyme Constraints”, Summers-Stay 2020
- “Blender: A State-Of-The-Art Open Source Chatbot”, Roller et al 2020
- “Trading Off Diversity and Quality in Natural Language Generation”, Zhang et al 2020
- “Rapformer: Conditional Rap Lyrics Generation With Denoising Autoencoders”, Nikolov et al 2020
- “A Hundred Visions and Revisions”, Binder 2020
- “Top-K Training of GANs: Improving GAN Performance by Throwing Away Bad Samples”, Sinha et al 2020
- “Towards a Human-Like Open-Domain Chatbot”, Adiwardana et al 2020
- “Controlling Text Generation With Plug and Play Language Models”, Liu et al 2019
- “Plug and Play Language Models: A Simple Approach to Controlled Text Generation”, Dathathri et al 2019
- “CTRL: A Conditional Transformer Language Model For Controllable Generation”, Keskar et al 2019
- “Neural Text Generation With Unlikelihood Training”, Welleck et al 2019
- “GROVER: Defending Against Neural Fake News”, Zellers et al 2019
- “The Curious Case of Neural Text Degeneration”, Holtzman et al 2019
- “Good News, Everyone! Context Driven Entity-Aware Captioning for News Images”, Biten et al 2019
- “GPT-2 Neural Network Poetry”, Gwern & Presser 2019
- “Insertion Transformer: Flexible Sequence Generation via Insertion Operations”, Stern et al 2019
- “Blockwise Parallel Decoding for Deep Autoregressive Models”, Stern et al 2018
- “Language GANs Falling Short”, Caccia et al 2018
- “Discriminator Rejection Sampling”, Azadi et al 2018
- “OCD: Optimal Completion Distillation for Sequence Learning”, Sabour et al 2018
- “Controlling Linguistic Style Aspects in Neural Language Generation”, Ficler & Goldberg 2017
- “Six Challenges for Neural Machine Translation”, Koehn & Knowles 2017
- “Language Generation With Recurrent Generative Adversarial Networks without Pre-Training”, Press et al 2017
- “A Deep Reinforced Model for Abstractive Summarization”, Paulus et al 2017
- “Learning to Generate Reviews and Discovering Sentiment”, Radford et al 2017
- “Improving Neural Machine Translation With Conditional Sequence Generative Adversarial Nets”, Yang et al 2017
- “Tuning Recurrent Neural Networks With Reinforcement Learning”, Jaques et al 2017
- “Google’s Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation”, Johnson et al 2016
- “WaveNet: A Generative Model for Raw Audio”, Oord et al 2016
- “Sequence Level Training With Recurrent Neural Networks”, Ranzato et al 2015
- “Generative Concatenative Nets Jointly Learn to Write and Classify Reviews”, Lipton et al 2015
- “Semi-Supervised Sequence Learning”, Dai & Le 2015
- “Scheduled Sampling for Sequence Prediction With Recurrent Neural Networks”, Bengio et al 2015
- “Controlling GPT-3 With Logit Bias”
- “Feature: Beam Search for Improving Global Quality of New Text Samples”
- “Exclude Top Choices (XTC): A Sampler That Boosts Creativity, Breaks Writing Clichés, and Inhibits Non-Verbatim Repetition”
- “Prompting Diverse Ideas: Increasing AI Idea Variance”
- “Pixels Still Beat Text: Attacking the OpenAI CLIP Model With Text Patches and Adversarial Pixel Perturbations”
- “Me, Myself, and AI: The Situational Awareness Dataset (SAD) for LLMs”
- “Apple or iPod? Easy Fix for Adversarial Textual Attacks on OpenAI’s CLIP Model!”
- Sort By Magic
- Wikipedia
- Miscellaneous
- Bibliography
See Also
Gwern
“Research Ideas”, Gwern 2017
“GPT-3 Creative Fiction”, Gwern 2020
“Choose-Your-Own-Adventure AI Dungeon Games”, Gwern 2021
“RNN Metadata for Mimicking Author Style”, Gwern 2015
Links
“Do LLMs Estimate Uncertainty Well in Instruction-Following?”, Heo et al 2024
“SimpleStrat: Diversifying Language Model Generation With Stratification”, Wong et al 2024
“Me, Myself, and AI: The Situational Awareness Dataset (SAD) for LLMs”, Laine et al 2024
“Be like a Goldfish, Don’t Memorize! Mitigating Memorization in Generative LLMs”, Hans et al 2024
“Superposed Decoding: Multiple Generations from a Single Autoregressive Inference Pass”, Shen et al 2024
“Probabilistic Inference in Language Models via Twisted Sequential Monte Carlo”, Zhao et al 2024
“Σ-GPTs: A New Approach to Autoregressive Models”, Pannatier et al 2024
“LLM Evaluators Recognize and Favor Their Own Generations”, Panickssery et al 2024
“Re-Evaluating GPT-4’s Bar Exam Performance”, Martínez 2024
“Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking”, Zelikman et al 2024
“Monitoring AI-Modified Content at Scale: A Case Study on the Impact of ChatGPT on AI Conference Peer Reviews”, Liang et al 2024
“Chain-Of-Thought Reasoning Without Prompting”, Wang & Zhou 2024
“The Non-Effect of Sampling Temperature on Problem Solving in GPT-3.5/GPT-4”, Renze & Guven 2024
“Blending Is All You Need: Cheaper, Better Alternative to Trillion-Parameters LLM”, Lu et al 2024
“GIVT: Generative Infinite-Vocabulary Transformers”, Tschannen et al 2023
“Universal Self-Consistency for Large Language Model Generation”, Chen et al 2023
“Controlled Text Generation via Language Model Arithmetic”, Dekoninck et al 2023
“Language Model Inversion”, Morris et al 2023
“SEDD: Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution”, Lou et al 2023
“Let Models Speak Ciphers: Multiagent Debate through Embeddings”, Pham et al 2023
“Contrastive Decoding Improves Reasoning in Large Language Models”, O’Brien & Lewis 2023
“Accelerating LLM Inference With Staged Speculative Decoding”, Spector & Ré 2023
“Efficient Guided Generation for Large Language Models”, Willard & Louf 2023
“Copy Is All You Need”, Lan et al 2023
“Stay on Topic With Classifier-Free Guidance”, Sanchez et al 2023
“Sequential Monte Carlo Steering of Large Language Models Using Probabilistic Programs”, Lew et al 2023
“How Language Model Hallucinations Can Snowball”, Zhang et al 2023
“Tractable Control for Autoregressive Language Generation”, Zhang et al 2023
“MUX-PLMs: Pre-Training Language Models With Data Multiplexing”, Murahari et al 2023
“Characterizing Attribution and Fluency Tradeoffs for Retrieval-Augmented Large Language Models”, Aksitov et al 2023
“DataMUX: Data Multiplexing for Neural Networks”, Murahari et al 2023
“Witscript 3: A Hybrid AI System for Improvising Jokes in a Conversation”, Toplyn 2023
“A Survey on Text Generation Using Generative Adversarial Networks”, Rosa & Papa 2022
“Fast Inference from Transformers via Speculative Decoding”, Leviathan et al 2022
“The CRINGE Loss: Learning What Language Not to Model”, Adolphs et al 2022
“Contrastive Decoding: Open-Ended Text Generation As Optimization”, Li et al 2022
“Help Me Write a Poem: Instruction Tuning As a Vehicle for Collaborative Poetry Writing (CoPoet)”, Chakrabarty et al 2022
“Contrastive Search Is What You Need For Neural Text Generation”, Su & Collier 2022
“Arithmetic Sampling: Parallel Diverse Decoding for Large Language Models”, Vilnis et al 2022
“Most Language Models Can Be Poets Too: An AI Writing Assistant and Constrained Text Generation Studio”, Roush et al 2022
“Ask Me Anything (AMA): A Simple Strategy for Prompting Language Models”, Arora et al 2022
“Out of One, Many: Using Language Models to Simulate Human Samples”, Argyle et al 2022
“Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned”, Ganguli et al 2022
“Effidit: Your AI Writing Assistant”, Shi et al 2022
“DIRECTOR: Generator-Classifiers For Supervised Language Modeling”, Arora et al 2022
“RankGen: Improving Text Generation With Large Ranking Models”, Krishna et al 2022
“Time Control: Language Modeling via Stochastic Processes”, Wang et al 2022
“Controllable Natural Language Generation With Contrastive Prefixes”, Qian et al 2022
“Using Natural Language Prompts for Machine Translation”, Garcia & Firat 2022
“A Contrastive Framework for Neural Text Generation”, Su et al 2022
“Typical Decoding for Natural Language Generation”, Meister et al 2022
“FIGARO: Generating Symbolic Music With Fine-Grained Artistic Control”, Rütte et al 2022
“A Survey of Controllable Text Generation Using Transformer-Based Pre-Trained Language Models”, Zhang et al 2022
“FRUIT: Faithfully Reflecting Updated Information in Text”, Logan et al 2021
“NeuroLogic A✱esque Decoding: Constrained Text Generation With Lookahead Heuristics”, Lu et al 2021
“Relating Neural Text Degeneration to Exposure Bias”, Chiang & Chen 2021
“Program Synthesis With Large Language Models”, Austin et al 2021
“Scarecrow: A Framework for Scrutinizing Machine Text”, Dou et al 2021
“Time-Aware Language Models As Temporal Knowledge Bases”, Dhingra et al 2021
“Machine Translation Decoding beyond Beam Search”, Leblond et al 2021
“Controllable Generation from Pre-Trained Language Models via Inverse Prompting”, Zou et al 2021
“Improving Diversity of Neural Text Generation via Inverse Probability Weighting”, Zhang et al 2021
“There Once Was a Really Bad Poet, It Was Automated but You Didn’t Know It”, Wang et al 2021
“A✱ Search Without Expansions: Learning Heuristic Functions With Deep Q-Networks”, Agostinelli et al 2021
“MAUVE: Measuring the Gap Between Neural Text and Human Text Using Divergence Frontiers”, Pillutla et al 2021
“Prefix-Tuning: Optimizing Continuous Prompts for Generation”, Li & Liang 2021
“Bot-Adversarial Dialogue for Safe Conversational Agents”, Xu et al 2021
“Collaborative Storytelling With Large-Scale Neural Language Models”, Nichols et al 2020
“NeuroLogic Decoding: (Un)supervised Neural Text Generation With Predicate Logic Constraints”, Lu et al 2020
“Interacting With GPT-2 to Generate Controlled and Believable Musical Sequences in ABC Notation”, Geerlings & Meroño-Peñuela 2020
“Training Independent Subnetworks for Robust Prediction”, Havasi et al 2020
“MEGATRON-CNTRL: Controllable Story Generation With External Knowledge Using Large-Scale Language Models”, Xu et al 2020
“Weird AI Yankovic: Generating Parody Lyrics”, Riedl 2020
“A Systematic Characterization of Sampling Algorithms for Open-Ended Language Generation”, Nadeem et al 2020
“GeDi: Generative Discriminator Guided Sequence Generation”, Krause et al 2020
“Mirostat: A Neural Text Decoding Algorithm That Directly Controls Perplexity”, Basu et al 2020
“Progressive Generation of Long Text”, Tan et al 2020
“This Word Does Not Exist”, Dimson 2020
“True_poetry: Poetry Generator by GPT-2 With Meter and Rhyme Constraints”, Summers-Stay 2020
“Blender: A State-Of-The-Art Open Source Chatbot”, Roller et al 2020
“Trading Off Diversity and Quality in Natural Language Generation”, Zhang et al 2020
“Rapformer: Conditional Rap Lyrics Generation With Denoising Autoencoders”, Nikolov et al 2020
“A Hundred Visions and Revisions”, Binder 2020
“Top-K Training of GANs: Improving GAN Performance by Throwing Away Bad Samples”, Sinha et al 2020
“Towards a Human-Like Open-Domain Chatbot”, Adiwardana et al 2020
“Controlling Text Generation With Plug and Play Language Models”, Liu et al 2019
“Plug and Play Language Models: A Simple Approach to Controlled Text Generation”, Dathathri et al 2019
“CTRL: A Conditional Transformer Language Model For Controllable Generation”, Keskar et al 2019
“Neural Text Generation With Unlikelihood Training”, Welleck et al 2019
“GROVER: Defending Against Neural Fake News”, Zellers et al 2019
“The Curious Case of Neural Text Degeneration”, Holtzman et al 2019
“Good News, Everyone! Context Driven Entity-Aware Captioning for News Images”, Biten et al 2019
“GPT-2 Neural Network Poetry”, Gwern & Presser 2019
“Insertion Transformer: Flexible Sequence Generation via Insertion Operations”, Stern et al 2019
“Blockwise Parallel Decoding for Deep Autoregressive Models”, Stern et al 2018
“Language GANs Falling Short”, Caccia et al 2018
“Discriminator Rejection Sampling”, Azadi et al 2018
“OCD: Optimal Completion Distillation for Sequence Learning”, Sabour et al 2018
“Controlling Linguistic Style Aspects in Neural Language Generation”, Ficler & Goldberg 2017
“Six Challenges for Neural Machine Translation”, Koehn & Knowles 2017
“Language Generation With Recurrent Generative Adversarial Networks without Pre-Training”, Press et al 2017
“A Deep Reinforced Model for Abstractive Summarization”, Paulus et al 2017
“Learning to Generate Reviews and Discovering Sentiment”, Radford et al 2017
“Improving Neural Machine Translation With Conditional Sequence Generative Adversarial Nets”, Yang et al 2017
“Tuning Recurrent Neural Networks With Reinforcement Learning”, Jaques et al 2017
“Google’s Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation”, Johnson et al 2016
“WaveNet: A Generative Model for Raw Audio”, Oord et al 2016
“Sequence Level Training With Recurrent Neural Networks”, Ranzato et al 2015
“Generative Concatenative Nets Jointly Learn to Write and Classify Reviews”, Lipton et al 2015
“Semi-Supervised Sequence Learning”, Dai & Le 2015
“Scheduled Sampling for Sequence Prediction With Recurrent Neural Networks”, Bengio et al 2015
“Controlling GPT-3 With Logit Bias”
“Feature: Beam Search for Improving Global Quality of New Text Samples”
“Exclude Top Choices (XTC): A Sampler That Boosts Creativity, Breaks Writing Clichés, and Inhibits Non-Verbatim Repetition”
“Prompting Diverse Ideas: Increasing AI Idea Variance”
“Pixels Still Beat Text: Attacking the OpenAI CLIP Model With Text Patches and Adversarial Pixel Perturbations”
“Me, Myself, and AI: The Situational Awareness Dataset (SAD) for LLMs”
“Apple or iPod? Easy Fix for Adversarial Textual Attacks on OpenAI’s CLIP Model!”
Sort By Magic
Annotations sorted by machine learning into inferred 'tags'. This provides an alternative way to browse: instead of by date order, one can browse in topic order. The 'sorted' list has been automatically clustered into multiple sections & auto-labeled for easier browsing.
Beginning with the newest annotation, it uses each annotation’s embedding to find its nearest-neighbor annotations, chaining them into a progression of topics; a minimal sketch of this ordering follows the tag list below. For more details, see the link.
- artistic-ai
- text-optimization
- controllable
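As a rough illustration of that embedding-based ordering, here is a minimal sketch (assumptions for illustration only: the greedy chaining strategy, cosine similarity, and all names are hypothetical, not the site’s actual pipeline, which additionally clusters the list into sections and auto-labels them):

```python
# Minimal sketch: greedily chain annotations by embedding similarity,
# starting from the newest, so adjacent entries form a topical progression.
# (Hypothetical reimplementation; the real pipeline is not specified here.)
import numpy as np

def sort_by_similarity(embeddings: np.ndarray) -> list[int]:
    """Greedy nearest-neighbor ordering; row 0 is the newest annotation."""
    # Normalize rows so dot products equal cosine similarities.
    unit = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    order, remaining = [0], set(range(1, len(unit)))
    while remaining:
        last = unit[order[-1]]
        # Place next the unvisited annotation most similar to the last one placed.
        nearest = max(remaining, key=lambda i: float(unit[i] @ last))
        order.append(nearest)
        remaining.remove(nearest)
    return order

# Toy example: rows are annotation embeddings, newest first.
# sort_by_similarity(np.array([[1.0, 0.0], [0.0, 1.0], [0.9, 0.1]]))  # -> [0, 2, 1]
```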
Wikipedia
Miscellaneous
- /doc/ai/nn/sampling/2022-arora-figure4-amapromptgenerationscalingvskshotwithmodelsize.jpg
- /doc/ai/nn/sampling/2020-roller-facebook-blenderchatbot-ratedperformancevshumans.jpg
- https://chat.openai.com/share/04add58f-2052-4b60-ae2a-ab708c29088f
- https://homepages.inf.ed.ac.uk/abmayne/publications/sennrich2016NAACL.pdf
- https://mi.eng.cam.ac.uk/projects/cued-rnnlm/papers/Interspeech15.pdf
- https://openai.com/index/introducing-structured-outputs-in-the-api/#_5PYjnV1iAHOPKPupDztdZk
- https://workshop2015.iwslt.org/downloads/IWSLT_2015_RP_13.pdf
Bibliography
- https://arxiv.org/abs/2407.04694: “Me, Myself, and AI: The Situational Awareness Dataset (SAD) for LLMs”, Laine et al 2024
- https://arxiv.org/abs/2405.18400: “Superposed Decoding: Multiple Generations from a Single Autoregressive Inference Pass”, Shen et al 2024
- https://arxiv.org/abs/2404.13076: “LLM Evaluators Recognize and Favor Their Own Generations”, Panickssery et al 2024
- https://link.springer.com/article/10.1007/s10506-024-09396-9: “Re-Evaluating GPT-4’s Bar Exam Performance”, Martínez 2024
- https://arxiv.org/abs/2403.09629: “Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking”, Zelikman et al 2024
- https://arxiv.org/abs/2312.02116: “GIVT: Generative Infinite-Vocabulary Transformers”, Tschannen et al 2023
- https://arxiv.org/abs/2309.09117#facebook: “Contrastive Decoding Improves Reasoning in Large Language Models”, O’Brien & Lewis 2023
- https://arxiv.org/abs/2306.17806#eleutherai: “Stay on Topic With Classifier-Free Guidance”, Sanchez et al 2023
- https://arxiv.org/abs/2306.03081: “Sequential Monte Carlo Steering of Large Language Models Using Probabilistic Programs”, Lew et al 2023
- https://arxiv.org/abs/2305.13534: “How Language Model Hallucinations Can Snowball”, Zhang et al 2023
- https://arxiv.org/abs/2302.12441: “MUX-PLMs: Pre-Training Language Models With Data Multiplexing”, Murahari et al 2023
- https://arxiv.org/abs/2212.11119: “A Survey on Text Generation Using Generative Adversarial Networks”, Rosa & Papa 2022
- https://arxiv.org/abs/2210.15097: “Contrastive Decoding: Open-Ended Text Generation As Optimization”, Li et al 2022
- https://arxiv.org/abs/2210.13669: “Help Me Write a Poem: Instruction Tuning As a Vehicle for Collaborative Poetry Writing (CoPoet)”, Chakrabarty et al 2022
- https://arxiv.org/abs/2210.14140: “Contrastive Search Is What You Need For Neural Text Generation”, Su & Collier 2022
- https://arxiv.org/abs/2210.15458#google: “Arithmetic Sampling: Parallel Diverse Decoding for Large Language Models”, Vilnis et al 2022
- https://aclanthology.org/2022.cai-1.2.pdf: “Most Language Models Can Be Poets Too: An AI Writing Assistant and Constrained Text Generation Studio”, Roush et al 2022
- https://arxiv.org/abs/2210.02441: “Ask Me Anything (AMA): A Simple Strategy for Prompting Language Models”, Arora et al 2022
- https://www.anthropic.com/red_teaming.pdf: “Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned”, Ganguli et al 2022
- https://arxiv.org/abs/2202.11822#google: “Using Natural Language Prompts for Machine Translation”, Garcia & Firat 2022
- https://arxiv.org/abs/2107.01294#allen: “Scarecrow: A Framework for Scrutinizing Machine Text”, Dou et al 2021
- https://arxiv.org/abs/2101.00190: “Prefix-Tuning: Optimizing Continuous Prompts for Generation”, Li & Liang 2021
- https://aclanthology.org/2021.naacl-main.235.pdf#facebook: “Bot-Adversarial Dialogue for Safe Conversational Agents”, Xu et al 2021
- https://www.thisworddoesnotexist.com/: “This Word Does Not Exist”, Dimson 2020
- https://ai.meta.com/blog/state-of-the-art-open-source-chatbot/: “Blender: A State-Of-The-Art Open Source Chatbot”, Roller et al 2020
- https://arxiv.org/abs/2004.03965: “Rapformer: Conditional Rap Lyrics Generation With Denoising Autoencoders”, Nikolov et al 2020
- https://www.uber.com/blog/pplm/: “Controlling Text Generation With Plug and Play Language Models”, Liu et al 2019
- https://arxiv.org/abs/1909.05858#salesforce: “CTRL: A Conditional Transformer Language Model For Controllable Generation”, Keskar et al 2019
- gpt-2: “GPT-2 Neural Network Poetry”, Gwern & Presser 2019
- https://arxiv.org/abs/1811.02549: “Language GANs Falling Short”, Caccia et al 2018