- See Also
- Links
- “Reputation Inflation”, Et Al 2022
- “Bayesian Inference of the Climbing Grade Scale”, 2021
- “PiRank: Learning To Rank via Differentiable Sorting”, Et Al 2020
- “Rank-Smoothed Pairwise Learning In Perceptual Quality Assessment”, Et Al 2020
- “Self-Play Learning Without a Reward Metric”, Et Al 2019
- “GPT-2 Preference Learning for Music Generation”, 2019
- “Group Testing: An Information Theory Perspective”, Et Al 2019
- “Top-K Off-Policy Correction for a REINFORCE Recommender System”, Et Al 2018
- “Open Questions”, 2018
- “Comparison Based Learning from Weak Oracles”, Et Al 2018
- “OptionGAN: Learning Joint Reward-Policy Options Using Generative Adversarial Inverse Reinforcement Learning”, Et Al 2017
- “Analogical-based Bayesian Optimization”, Et Al 2017
- “Spectral Method and Regularized MLE Are Both Optimal for Top-K Ranking”, Et Al 2017
- “The Competitiveness of Games in Professional Sports Leagues”, 2017
- “Deep Reinforcement Learning from Human Preferences”, Et Al 2017
- “D-TS: Double Thompson Sampling for Dueling Bandits”, 2016
- “Resorting Media Ratings”, 2015
- “Just Sort It! A Simple and Effective Approach to Active Preference Learning”, 2015
- “On the Complexity of Best Arm Identification in Multi-Armed Bandit Models”, Et Al 2014
- “The Paradox of Publicity: How Awards Can Negatively Affect the Evaluation of Quality”
- “Consumer Ability to Detect the Taste of Total Dissolved Solids”
- “Bayesian Active Learning for Classification and Preference Learning”, Et Al 2011
- “TDS and Temperature Affect Consumer Taste Preferences”
- “Case Studies in Bayesian Computation Using INLA”, 2010
- “Sorting from Noisy Information”, 2009
- “Can People Distinguish Pâté From Dog Food? [preprint]”, Et Al 2009
- “Aggregating Inconsistent Information: Ranking and Clustering”, Et Al 2008
- “Pure Exploration for Multi-Armed Bandit Problems”, Et Al 2008
- “Do More Expensive Wines Taste Better? Evidence from a Large Sample of Blind Tastings”, Et Al 2008
- “Noisy Sorting Without Resampling”, 2007
- “Bayesian Adaptive Exploration”, 2003
- “Sympercents: Symmetric Percentage Differences on the 100 LogE Scale Simplify the Presentation of Log Transformed Data”, 2000
- “Born Again Group Testing: Multiaccess Communications”, 1985
- “The Rating of Chessplayers, Past and Present (Second Edition)”, 1978
- “Metacritic Has A (File-Drawer) Problem”
- Miscellaneous
- Link Bibliography
See Also
Links
“Reputation Inflation”, Et Al 2022 (2022-05-03)
“Bayesian Inference of the Climbing Grade Scale”, 2021 (2021-11-15)
“PiRank: Learning To Rank via Differentiable Sorting”, Et Al 2020 (2020-12-12)
“Rank-Smoothed Pairwise Learning In Perceptual Quality Assessment”, Et Al 2020 (2020-09-30)
“Self-Play Learning Without a Reward Metric”, Et Al 2019 (2019-12-16)
“GPT-2 Preference Learning for Music Generation”, 2019 (2019-12-16)
“Group Testing: An Information Theory Perspective”, Et Al 2019
“Top-K Off-Policy Correction for a REINFORCE Recommender System”, Et Al 2018 (2018-12-06)
“Open Questions”, 2018 (2018-10-17)
“Comparison Based Learning from Weak Oracles”, Et Al 2018 (2018-02-20)
“OptionGAN: Learning Joint Reward-Policy Options Using Generative Adversarial Inverse Reinforcement Learning”, Et Al 2017 (2017-09-20)
“Analogical-based Bayesian Optimization”, Et Al 2017 (2017-09-19)
“Spectral Method and Regularized MLE Are Both Optimal for Top-K Ranking”, Et Al 2017 (2017-07-31)
“The Competitiveness of Games in Professional Sports Leagues”, 2017 (2017-07-01)
“Deep Reinforcement Learning from Human Preferences”, Et Al 2017 (2017-06-12)
“D-TS: Double Thompson Sampling for Dueling Bandits”, 2016 (2016-04-25)
“Resorting Media Ratings”, 2015 (2015-09-07)
“Just Sort It! A Simple and Effective Approach to Active Preference Learning”, 2015 (2015-02-19)
“On the Complexity of Best Arm Identification in Multi-Armed Bandit Models”, Et Al 2014 (2014-07-16)
“The Paradox of Publicity: How Awards Can Negatively Affect the Evaluation of Quality”
“Consumer Ability to Detect the Taste of Total Dissolved Solids”
“Bayesian Active Learning for Classification and Preference Learning”, Et Al 2011 (2011-12-24)
“TDS and Temperature Affect Consumer Taste Preferences”
“Case Studies in Bayesian Computation Using INLA”, 2010
“Sorting from Noisy Information”, 2009 (2009-10-07)
“Can People Distinguish Pâté From Dog Food? [preprint]”, Et Al 2009 (2009-04-01)
“Aggregating Inconsistent Information: Ranking and Clustering”, Et Al 2008 (2008-11)
“Pure Exploration for Multi-Armed Bandit Problems”, Et Al 2008 (2008-02-19)
“Do More Expensive Wines Taste Better? Evidence from a Large Sample of Blind Tastings”, Et Al 2008
“Noisy Sorting Without Resampling”, 2007 (2007-07-06)
“Bayesian Adaptive Exploration”, 2003
“Sympercents: Symmetric Percentage Differences on the 100 LogE Scale Simplify the Presentation of Log Transformed Data”, 2000 (2000-11-08)
“Born Again Group Testing: Multiaccess Communications”, 1985
“The Rating of Chessplayers, Past and Present (Second Edition)”, 1978
“Metacritic Has A (File-Drawer) Problem”
Miscellaneous
- 2019-aquafina.pdf, 2019
- 2007-karp.pdf, 2007
- 2002-pelc.pdf, 2002
- 1961-slater.pdf, 1961
Link Bibliography
- gpt-2-preference-learning: “GPT-2 Preference Learning for Music Generation”, Gwern Branwen
- question: “Open Questions”, Gwern Branwen
- resorter: “Resorting Media Ratings”, Gwern Branwen