Diffusion Model Alignment Using Direct Preference Optimization
A General Theoretical Paradigm to Understand Learning from Human Preferences
Direct Preference Optimization (DPO): Your Language Model is Secretly a Reward Model
Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-oriented Dialogue Systems
Rank-Smoothed Pairwise Learning In Perceptual Quality Assessment
Top-K Off-Policy Correction for a REINFORCE Recommender System
OptionGAN: Learning Joint Reward-Policy Options using Generative Adversarial Inverse Reinforcement Learning
Spectral Method and Regularized MLE Are Both Optimal for Top-K Ranking
The competitiveness of games in professional sports leagues
Just Sort It! A Simple and Effective Approach to Active Preference Learning
On the Complexity of Best Arm Identification in Multi-Armed Bandit Models
Bayesian Active Learning for Classification and Preference Learning
Aggregating inconsistent information: Ranking and clustering
Do More Expensive Wines Taste Better? Evidence from a Large Sample of Blind Tastings
Paired Comparison Models for Ranking National Soccer Teams
Sympercents: symmetric percentage differences on the 100 loge scale simplify the presentation of log transformed data
The Analysis of Sequential Experiments with Feedback to Subjects
Rating the ratings: Assessing the psychometric quality of rating data
The Rating of Chessplayers, Past and Present (Second Edition)
Optimal Selection Based On Relative Rank (the ‘Secretary Problem’)
Valuing Research Works by Eliciting Comparisons from EA Researchers
Getting Things in Order: An Introduction to the R Package seriation
2020-04-03-florianloitsch-tenkilogramsofchocolatetournament-data.ods
2017-wills-figure10-percentageofgamesatdifferentpredictedpercentagesofteamaheadforgoodbyprofessionalsportindustry.jpg
https://tor-lattimore.com/downloads/book/book.pdf#page=412
https://www.oreilly.com/radar/what-we-learned-from-a-year-of-building-with-llms-part-i/
Rating the ratings: Assessing the psychometric quality of rating data
%252Fdoc%252Fstatistics%252Forder%252Fcomparison%252F1980-saal.pdf.html
Wikipedia Bibliography: