‘DeepSeek’ directory

Annotations sorted by machine learning into inferred 'tags'. This provides an alternative way to browse: instead of by date order, one can browse in topic order. The 'sorted' list has been automatically clustered into multiple sections & auto-labeled for easier browsing.

Beginning with the newest annotation, it uses the embedding of each annotation to attempt to create a list of nearest-neighbor annotations, creating a progression of topics. For more details, see the link.

Wikipedia (1)

Liang Wenfeng :

https://en.wikipedia.org/wiki/Liang_Wenfeng

Miscellaneous

Bibliography

https://www.reuters.com/world/china/deepseek-r2-launch-stalled-ceo-balks-progress-information-reports-2025-06-26/: “DeepSeek R2 Launch Stalled As CEO Balks at Progress, The Information Reports ”, Deborah Sophia

link-bibliography
https://arxiv.org/abs/2505.09343#deepseek: “Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures ”, Chenggang Zhao, Chengqi Deng, Chong Ruan, Damai Dai, Huazuo Gao, Jiashi Li, Liyue Zhang, Panpan Huang, Shangyan Zhou, Shirong Ma, Wenfeng Liang, Ying He, Yuqing Wang, Yuxuan Liu, Y. X. Wei

link-bibliography
https://sinopsis.cz/en/chinas-superstition-boom-in-a-godless-state/: “China’s Superstition Boom in a Godless State § DeepSeek’s Occult Tech Boom ”, Ansel Li

link-bibliography
https://arxiv.org/abs/2505.07215: “Measuring General Intelligence With Generated Games ”, Vivek Verma, David Huang, William Chen, Dan Klein, Nicholas Tomlin

link-bibliography
https://arxiv.org/abs/2504.14379: “The Geometry of Self-Verification in a Task-Specific Reasoning Model ”, Andrew Lee, Lihao Sun, Chris Wendler, Fernanda Viégas, Martin M. Wattenberg

link-bibliography
https://arxiv.org/abs/2503.21934: “Proof or Bluff? Evaluating LLMs on 2025 USA Math Olympiad ”, Ivo Petrov, Jasper Dekoninck, Lyuben Baltadzhiev, Maria Drencheva, Kristian Minchev, Mislav Balunović, Nikola Jovanović, Martin Vechev

link-bibliography
https://arxiv.org/abs/2503.16219: “Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn’t ”, Quy-Anh Dang, Chris Ngo

link-bibliography
https://arxiv.org/abs/2502.12896: “None of the Others: a General Technique to Distinguish Reasoning from Memorization in Multiple-Choice LLM Evaluation Benchmarks ”, Eva Sánchez Salido, Julio Gonzalo, Guillermo Marco

link-bibliography
https://toloka.ai/blog/r1-is-not-on-par-with-o1-and-the-difference-is-qualitative-not-quantitative/: “DS R1 Is Not on Par With OA O1, and the Difference Is Qualitative, Not Quantitative: Long-Tail Benchmarks Reveal Gaps ”, Vitaliy Polshkov

link-bibliography
https://arxiv.org/abs/2501.08156: “Are DeepSeek R1 And Other Reasoning Models More Faithful? ”, James Chua, Owain Evans

link-bibliography
https://arxiv.org/abs/2402.03300#deepseek: “DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models ”, Zhihong Shao, Peiyi Wang, Qihao Zhu, Runxin Xu, Junxiao Song, Xiao Bi, Haowei Zhang, Mingchuan Zhang, Y. K. Li, Y. Wu, Daya Guo

link-bibliography