Bibliography:

  1. ‘order statistics’ tag

  2. ‘sorting’ tag

  3. Open Questions

  4. GPT-2 Preference Learning for Music Generation

  5. Resorting Media Ratings

  6. Epistemic Calibration and Searching the Space of Truth

  7. Predicting the direction of phenotypic difference

  8. Diffusion Model Alignment Using Direct Preference Optimization

  9. A General Theoretical Paradigm to Understand Learning from Human Preferences

  10. On the Optimal Bounds for Noisy Computing

  11. Direct Preference Optimization (DPO): Your Language Model is Secretly a Reward Model

  12. Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-oriented Dialogue Systems

  13. Reputation Inflation

  14. Bayesian inference of the climbing grade scale

  15. PiRank: Learning To Rank via Differentiable Sorting

  16. Rank-Smoothed Pairwise Learning In Perceptual Quality Assessment

  17. Self-Play Learning Without a Reward Metric

  18. Group Testing: An Information Theory Perspective

  19. Top-K Off-Policy Correction for a REINFORCE Recommender System

  20. Comparison Based Learning from Weak Oracles

  21. OptionGAN: Learning Joint Reward-Policy Options using Generative Adversarial Inverse Reinforcement Learning

  22. Analogical-based Bayesian Optimization

  23. Spectral Method and Regularized MLE Are Both Optimal for Top-K Ranking

  24. The competitiveness of games in professional sports leagues

  25. Deep reinforcement learning from human preferences

  26. PBO: Preferential Bayesian Optimization

  27. D-TS: Double Thompson Sampling for Dueling Bandits

  28. Just Sort It! A Simple and Effective Approach to Active Preference Learning

  29. On the Complexity of Best Arm Identification in Multi-Armed Bandit Models

  30. Bayesian Active Learning for Classification and Preference Learning

  31. Case studies in Bayesian computation using INLA

  32. Sorting from Noisy Information

  33. Can People Distinguish Pâté From Dog Food? [preprint]

  34. Aggregating inconsistent information: Ranking and clustering

  35. Pure Exploration for Multi-Armed Bandit Problems

  36. Do More Expensive Wines Taste Better? Evidence from a Large Sample of Blind Tastings

  37. Noisy Sorting Without Resampling

  38. Noisy binary search and its applications

  39. Paired Comparison Models for Ranking National Soccer Teams

  40. Bayesian Adaptive Exploration

  41. How Dangerous Are Drinking Drivers?

  42. Sympercents: symmetric percentage differences on the 100 loge scale simplify the presentation of log transformed data

  43. Born Again Group Testing: Multiaccess Communications

  44. The Analysis of Sequential Experiments with Feedback to Subjects

  45. Rating the ratings: Assessing the psychometric quality of rating data

  46. The Rating of Chessplayers, Past and Present (Second Edition)

  47. Optimal Selection Based On Relative Rank (the ‘Secretary Problem’)

  48. 7b58146dd047c771e9c48520dbaa8d978c61578d.pdf

  49. Inconsistencies in a Schedule of Paired Comparisons

  50. Metacritic Has A (File-Drawer) Problem

  51. Valuing Research Works by Eliciting Comparisons from EA Researchers

  52. Futurama Theorem

  53. f1a939193fef59455a7d3f2bafc5fcf4bf9bd5bb.html

  54. Getting Things in Order: An Introduction to the R Package seriation

  55. design#future-tag-features

    [Transclude the forward-link's context]

  56. 2020-04-03-florianloitsch-tenkilogramsofchocolatetournament-data.ods

  57. 2019-nathanwpyle-strangeplanet-ihaveattemptedscience.jpg

  58. 2017-wills-figure10-percentageofgamesatdifferentpredictedpercentagesofteamaheadforgoodbyprofessionalsportindustry.jpg

  59. 2002-pelc.pdf

  60. https://chat.lmsys.org/

  61. https://github.com/ealdwulf/bbchop

  62. https://github.com/hiAndrewQuinn/resorter

  63. https://tor-lattimore.com/downloads/book/book.pdf#page=412

  64. 762d4ee657af5f8ac4c3f5096fac3c5ba87c71f0.pdf#page=412

  65. https://www.oreilly.com/radar/what-we-learned-from-a-year-of-building-with-llms-part-i/

  66. https://x.com/Altimor/status/1838688432501240107

  67. Rating the ratings: Assessing the psychometric quality of rating data

  68. %252Fdoc%252Fstatistics%252Forder%252Fcomparison%252F1980-saal.pdf.html