Bibliography:

  1. ‘statistics’ tag

  2. ‘election forecast’ tag

  3. ‘inner monologue (AI)’ tag

  4. ‘inner-monologue (psych)’ tag

  5. ‘Fermi problems’ tag

  6. About This Website

  7. Embryo Selection For Intelligence

  8. LSD microdosing RCT

  9. Complexity no Bar to AI

  10. Darknet Market mortality risks

  11. Technology Forecasting: The Garden of Forking Paths

  12. Predicting Google closures

  13. The Ones Who Walk Towards Acre

  14. History of Iterated Embryo Selection

  15. Nootropics

  16. In Defense of Inclusionism

  17. Zeo sleep self-experiments

  18. Long Bets as Charitable Giving Opportunity

  19. Summers of Code, 2006–2013

  20. ‘HP: Methods of Rationality’ review statistics

  21. ‘Methods of Rationality’ predictions

  22. Wikipedia & Knol: Why Knol Already Failed

  23. NGE Rebuild Predictions

  24. Choosing Software

  25. Hail Jeffrey Wernick [Pre-Hanson Employment & Conditional Prediction Markets]

  26. Genetically-diverse crowds are wiser

  27. The Death and Life of Prediction Markets at Google: Over the past Two Decades, Google Has Hosted Two Different Internal Platforms for Predictions. Why Did the First One Fail—And Will the Other Endure?

  28. Can Language Models Use Forecasting Strategies?

  29. ChatGPT Can Predict the Future when it Tells Stories Set in the Future About the Past

  30. Chronos: Learning the Language of Time Series

  31. Crowd prediction systems: Markets, polls, and elite forecasters

  32. Academics are more specific, and practitioners more sensitive, in forecasting interventions to strengthen democratic attitudes

  33. Large Language Model Prediction Capabilities: Evidence from a Real-World Forecasting Tournament

  34. Cognitive Biases: Mistakes or Missing Stakes?

  35. Evaluating Superhuman Models with Consistency Checks

  36. Self-Resolving Prediction Markets for Unverifiable Outcomes

  37. Incentivizing honest performative predictions with proper scoring rules

  38. Deep Learning based Forecasting: a case study from the online fashion industry

  39. On the Accuracy, Media Representation, and Public Perception of Psychological Scientists’ Judgments of Societal Change

  40. Long-Range Subjective-Probability Forecasts of Slow-Motion Variables in World Politics: Exploring Limits on Expert Judgment

  41. Conditioning Predictive Models: Risks and Strategies

  42. The unlikelihood effect: When knowing more creates the perception of less

  43. Forecasting with trees

  44. Does constructing a belief distribution truly reduce overconfidence?

  45. Reconciling Individual Probability Forecasts

  46. Augur: a Decentralized Oracle and Prediction Market Platform (v2.0)

  47. An appropriate verbal probability lexicon for communicating surgical risks is unlikely to exist

  48. A simple cognitive method to improve the prediction of matters of taste by exploiting the within-person wisdom-of-crowd effect

  49. Language Models (Mostly) Know What They Know

  50. Forecasting Future World Events with Neural Networks

  51. Modeling Transformative AI Risks (MTAIR) Project—Summary Report

  52. Taking a Disagreeing Perspective Improves the Accuracy of People’s Quantitative Estimates

  53. The forecast trap

  54. Teaching Models to Express Their Uncertainty in Words

  55. Politicizing mask-wearing: predicting the success of behavioral interventions among Republicans and Democrats in the US

  56. ‘Two truths and a lie’ as a class-participation activity

  57. DeepMind: The Podcast—Excerpts on AGI

  58. Many heads are more utilitarian than one

  59. Uncalibrated Models Can Improve Human-AI Collaboration

  60. A 680,000-person megastudy of nudges to encourage vaccination in pharmacies

  61. TACTiS: Transformer-Attentional Copulas for Time Series

  62. Dream Interpretation from a Cognitive and Cultural Evolutionary Perspective: The Case of Oneiromancy in Traditional China

  63. M5 accuracy competition: Results, findings, and conclusions

  64. Megastudies improve the impact of applied behavioral science

  65. Forecasting Skills in Experimental Markets: Illusion or Reality?

  66. Strategically overconfident (to a fault): How self-promotion motivates advisor confidence

  67. Market expectations of a warming climate

  68. Long-Range Transformers for Dynamic Spatiotemporal Forecasting

  69. Sigmoids behaving badly: why they usually cannot predict the future as well as they seem to promise

  70. Wise teamwork: Collective confidence calibration predicts the effectiveness of group discussion

  71. Alignment Problems With Current Forecasting Platforms

  72. Behavioral scientists and laypeople misestimate societal effects of COVID-19

  73. How the wisdom of crowds, and of the crowd within, are affected by expertise

  74. Mind the Gap: Assessing Temporal Generalization in Neural Language Models § Scaling

  75. Kaggle forecasting competitions: An overlooked learning opportunity

  76. Danny Hernandez on forecasting and the drivers of AI progress

  77. AI and Efficiency: We’re releasing an analysis showing that since 2012 the amount of compute needed to train a neural net to the same performance on ImageNet classification has been decreasing by a factor of 2 every 16 months

  78. ForecastQA: A Question Answering Challenge for Event Forecasting with Temporal Text Data

  79. LightGBM: A Highly Efficient Gradient Boosting Decision Tree

  80. Predicting History

  81. N-BEATS: Neural basis expansion analysis for interpretable time series forecasting

  82. Evidence on good forecasting practices from the Good Judgment Project

  83. Forecasting Transformative AI: An Expert Survey

  84. The Wisdom of Crowds Approach to Influenza-Rate Forecasting

  85. Predicting replication outcomes in the Many Labs 2 study

  86. The wisdom of the inner crowd in three large natural experiments

  87. When Will AI Exceed Human Performance? Evidence from AI Experts

  88. DeepAR: Probabilistic Forecasting with Autoregressive Recurrent Networks

  89. Roosevelt Predicted to Win: Revisiting the 1936 Literary Digest Poll

  90. Looking Across and Looking Beyond the Knowledge Frontier: Intellectual Distance, Novelty, and Resource Allocation in Science

  91. Mechanical Versus Clinical Data Combination in Selection and Admissions Decisions: A Meta-Analysis

  92. Credit Suisse Global Investment Returns Yearbook 2013

  93. General knowledge norms: Updated and expanded from the Nelson & Narens 1980 norms

  94. Statistical Basis for Predicting Technological Progress

  95. Learning Performance of Prediction Markets with Kelly Bettors

  96. Can physicians accurately predict which patients will lose weight, improve nutrition and increase physical activity?

  97. A Prediction Market for Macro-Economic Variables

  98. Why Do Humans Reason? Arguments for an Argumentative Theory

  99. ba03e8d7db678948a7585a947ea8a4eac13d6abf.pdf

  100. Goodbye 2010

  101. Applying the Fermi Estimation Technique to Business Problems

  102. Predicting the Next Big Thing: Success as a Signal of Poor Judgment

  103. Conditions for intuitive expertise: A failure to disagree

  104. Keep Your Identity Small

  105. Measuring the Crowd Within: Probabilistic Representations Within Individuals

  106. The Meta-Analysis of Clinical Judgment Project: 56 Years of Accumulated Research on Clinical Versus Statistical Prediction

  107. A systematic review on communicating with patients about evidence

  108. Principles of Forecasting: A Handbook for Researchers and Practitioners

  109. Who Is Arguing About the Cat? Moral Action and Enlightenment According to Dōgen

  110. Eliminating the Hindsight Bias

  111. Forecasting Records by Maximum Likelihood

  112. Tools for Thought

  113. Beyond Regression: New Tools for Prediction and Analysis in the Behavioral Sciences

  114. The Futurists

  115. 2022 Expert Survey on Progress in AI

  116. Prediction Markets in The Corporate Setting

  117. Tales from Prediction Markets

  118. George Orwell: In Front of Your Nose

  119. a75d76403cd4d1831187e17ce122c5c0ba842a65.html

  120. Why Did Renewables Become so Cheap so Fast?

  121. Performance Curve Database

  122. Mining the Silver Lining of the Trump Presidency

  123. How to Get Good

  124. A Failed Attempt at Market Manipulation

  125. Predicting the Future With Data+logistic Regression

  126. 31fa2d7a62f734dcba7cf9447271099c79b70e32.html

  127. Prediction Markets: Tales from the Election

  128. Using Learning Curve Theory to Redefine Moore's Law

  129. Forecasting S-Curves Is Hard

  130. f3f0ba74e238496d501d91728b9d3a05a6170a4b.html

  131. Science Fiction As Foresight

  132. The Track Record of Futurists Seems ... Fine

  133. Why Sigmoids Are so Hard to Predict

  134. Can AI Outpredict Humans? Results From Metaculus’s Q3 AI Forecasting Benchmark [No]

  135. 42bce519109425c49a8c8d37ba3769f21c551f94.html

  136. Violating the EMH—Prediction Markets

  137. Getting GPT-3 to Predict Metaculus Questions

  138. Maths Writer/cowritter Needed: How You Can't Distinguish Early Exponential from Early Sigmoid

  139. First Extracorporeal Human Pregnancy

  140. Demographically Diverse Crowds Are Typically Not Much Wiser Than Homogeneous Crowds

  141. How Accurate Are Our Predictions?

  142. Why the State Department’s INR Intelligence Agency May Be the Best in DC

  143. design#future-tag-features

    [Transclude the forward-link's context]

  144. 2022-gelman-figure3-overconfidenceinstudentclassroomcalibrationexercise.jpg

  145. 2022-zou-figure1-exampleautocastdatapointcomparingforecastingoflanguagemodeltohumanforecasters.png

  146. 2022-zou-table2-modelaccuracyonautocastpredictiondatasetbyquestiontypeandmodeltype.png

  147. 2021-hutcherson-figure1-expertandlaymancoronavirusforecastsvsreality.jpg

  148. 2021-milkman-figure1-actualvspredictedeffectsofgymgoingincentivesinrandomizedexperiments.png

  149. 2019-aiimpacts-goodforecasting-gjp-ensemblingperformance.jpg

  150. 2019-mellers-supplement.zip

  151. 2017-vandolder-figure1-scalingofaccuracyofcrowdestimatesbywithinpersonvsbetweenperson.jpg

  152. 2008-vul-figure1-wisdomofinnercrowdimprovespredictions.jpg

  153. 2002-crichton-whyspeculate.html

  154. 2001-rowe.pdf

  155. 1996-rowe.pdf

  156. 1986-osberg.pdf

  157. 1978-fischhoff.pdf

  158. gjp-season4-allpages.maff

  159. https://astralcodexten.substack.com/p/who-predicted-2022

  160. https://blog.polybdenum.com/2023/08/01/how-i-came-second-out-of-999-in-the-salem-center-prediction-market-tournament-without-knowing-anything-about-prediction-markets-and-what-i-learned-along-the-way-part-1.html

  161. https://forum.effectivealtruism.org/posts/H7xWzvwvkyywDAEkL/creating-a-database-for-base-rates

  162. https://forum.effectivealtruism.org/posts/akn2BFhhM9CzwpLEA/wisdom-of-the-crowd-vs-the-best-of-the-best-of-the-best

  163. https://joecarlsmith.com/2023/05/08/predictable-updating-about-ai-risk/

  164. https://manifold.markets/IsaacKing/will-this-markets-probability-be-at

  165. https://marginalrevolution.com/marginalrevolution/2013/12/shiller-on-trills.html

  166. https://news.manifold.markets/p/isaac-kings-whales-vs-minnows-and

  167. https://niplav.github.io/notes.html#Subscripts_for_Probabilities

  168. 3bfea91ca98f0a91ce2b3d1485818f942b03bef9.html#Subscripts_for_Probabilities

  169. https://niplav.site/decompose.html#Small_Experiment

  170. https://progressforum.org/posts/49tFxpkaxkQ748BAu/working-draft-of-getting-the-conditions-right-progress-in

  171. https://sumrevija.si/en/eng-peter-watts-the-wisdom-of-crowds-sum11/

  172. https://time.com/6556168/when-ai-outsmart-humans/

  173. 6ed9020336d83d99170b949cec916aa4956131be.html

  174. https://vitalik.eth.limo/general/2024/01/30/cryptoai.html

  175. https://web.archive.org/web/20070714204136/http://www.michaelcrichton.net/speech-whyspeculate.html

  176. a7d0fb64f6a9dc28d0d0c3a30a5f93448a49666a.html

  177. https://worksinprogress.co/issue/why-prediction-markets-arent-popular/

  178. d48a3d598ad5acd07ca75f0fc1774a66dd577c84.html

  179. https://www.astralcodexten.com/p/against-learning-from-dramatic-events

  180. 2160083ebf7c30339a7c7845f4bf81dc1c9dfe11.html

  181. https://www.astralcodexten.com/p/crowds-are-wise-and-ones-a-crowd

  182. 86086ecadd275353334cf99d05b84b3914fb38e2.html

  183. https://www.astralcodexten.com/p/the-buying-things-from-a-store-faq

  184. https://www.astralcodexten.com/p/the-extinction-tournament

  185. https://www.lesswrong.com/posts/68TGNutjDcBcq6PCZ/bitcoin-cryonics-fund#5Da2f8n9aXmfJ7FYA

  186. https://www.lesswrong.com/posts/epgCXiv3Yy3qgcsys/you-can-t-predict-a-game-of-pinball#wjLFhiWWacByqyu6a

  187. https://www.lesswrong.com/posts/t5W87hQF5gKyTofQB/ufo-betting-put-up-or-shut-up#7qyFLsx9WQJdZfpjC

  188. https://www.lesswrong.com/posts/ufW5LvcwDuL6qjdBT/latent-variables-for-prediction-markets-motivation-technical

  189. https://www.macroscience.org/p/the-frontier-of-scientific-plausibility

  190. a510032f97efcf1eaa298f61f8a4f75e3201ad08.html

  191. https://www.maximum-progress.com/p/grading-extropian-predictions

  192. https://www.metaculus.com/questions/notebooks/8702/the-promise-and-impact-of-the-next-generation-of-weight-loss-drugs/

  193. https://www.mlsafety.org/

  194. cacff2a3087644018d13271adfc659d90beb37a1.html

  195. https://www.nature.com/articles/d41586-023-02120-8

  196. https://x.com/JoshuaBlake_/status/1686272169813143552

  197. Large Language Model Prediction Capabilities: Evidence from a Real-World Forecasting Tournament

  198. https%253A%252F%252Farxiv.org%252Fabs%252F2310.13014.html

  199. On the Accuracy, Media Representation, and Public Perception of Psychological Scientists’ Judgments of Societal Change

  200. %252Fdoc%252Fsociology%252F2023-hutcherson.pdf.html

  201. Forecasting with trees

  202. https%253A%252F%252Fwww.sciencedirect.com%252Fscience%252Farticle%252Fpii%252FS0169207021001679.html

  203. Augur: a Decentralized Oracle and Prediction Market Platform (v2.0)

  204. %252Fdoc%252Fstatistics%252Fprediction%252F2022-peterson.pdf.html

  205. Language Models (Mostly) Know What They Know

  206. Saurav Kadavath

  207. About Me

  208. Andy Jones

  209. Sam Bowman

  210. https://jack-clark.net/about/

  211. Sam McCandlish

  212. Jared Kaplan

  213. https%253A%252F%252Farxiv.org%252Fabs%252F2207.05221%2523anthropic.html

  214. Forecasting Future World Events with Neural Networks

  215. Andy Zou

  216. Mantas Mazeika

  217. Jacob Steinhardt

  218. Owain Evans, AI Alignment Researcher

  219. https://people.eecs.berkeley.edu/~hendrycks/

  220. https%253A%252F%252Farxiv.org%252Fabs%252F2206.15474.html

  221. Politicizing mask-wearing: predicting the success of behavioral interventions among Republicans and Democrats in the US

  222. https%253A%252F%252Fwww.nature.com%252Farticles%252Fs41598-022-10524-1.html

  223. ‘Two truths and a lie’ as a class-participation activity

  224. %252Fdoc%252Fstatistics%252Fprediction%252F2022-gelman.pdf.html

  225. DeepMind: The Podcast—Excerpts on AGI

  226. https%253A%252F%252Fwww.lesswrong.com%252Fposts%252FSbAgRYo8tkHwhd9Qx%252Fdeepmind-the-podcast-excerpts-on-agi.html

  227. A 680,000-person megastudy of nudges to encourage vaccination in pharmacies

  228. Michelle N. Meyer—About

  229. https%253A%252F%252Fwww.pnas.org%252Fdoi%252F10.1073%252Fpnas.2115126119.html

  230. Dream Interpretation from a Cognitive and Cultural Evolutionary Perspective: The Case of Oneiromancy in Traditional China

  231. %252Fdoc%252Fstatistics%252Fprediction%252F2022-hong.pdf.html

  232. M5 accuracy competition: Results, findings, and conclusions

  233. https%253A%252F%252Fwww.sciencedirect.com%252Fscience%252Farticle%252Fpii%252FS0169207021001874.html

  234. Megastudies improve the impact of applied behavioral science

  235. %252Fdoc%252Fstatistics%252Fprediction%252F2021-milkman.pdf.html

  236. Mind the Gap: Assessing Temporal Generalization in Neural Language Models § Scaling

  237. https%253A%252F%252Farxiv.org%252Fabs%252F2102.01951%2523scaling%2526org%253Ddeepmind.html

  238. Evidence on good forecasting practices from the Good Judgment Project

  239. https%253A%252F%252Faiimpacts.org%252Fevidence-on-good-forecasting-practices-from-the-good-judgment-project-an-accompanying-blog-post%252F.html

  240. The wisdom of the inner crowd in three large natural experiments

  241. %252Fdoc%252Fstatistics%252Fprediction%252F2017-vandolder.pdf.html

  242. Mechanical Versus Clinical Data Combination in Selection and Admissions Decisions: A Meta-Analysis

  243. %252Fdoc%252Fstatistics%252Fprediction%252F2013-kuncel.pdf.html

  244. General knowledge norms: Updated and expanded from the Nelson & Narens 1980 norms

  245. https%253A%252F%252Flink.springer.com%252Farticle%252F10.3758%252Fs13428-012-0307-9.html

  246. Goodbye 2010

  247. https%253A%252F%252Fwww.vetta.org%252F2010%252F12%252Fgoodbye-2010%252F.html

  248. Conditions for intuitive expertise: A failure to disagree

  249. %252Fdoc%252Fpsychology%252Fcognitive-bias%252Fillusion-of-depth%252F2009-kahneman.pdf.html