‘DeepSeek’ tag
- See Also
Links
- “Anomalous Tokens in DeepSeek-V3 & Deep-Seek-R1”, Henry 2025
- “DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning”, Guo et al 2025
- “On DeepSeek’s R1”, Mowshowitz 2025
- “The Hyperfitting Phenomenon: Sharpening and Stabilizing LLMs for Open-Ended Text Generation”, Carlsson et al 2024
- “DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data”, Xin et al 2024
- “DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models”, Shao et al 2024
- “DeepSeek LLM: Scaling Open-Source Language Models With Longtermism”, Bi et al 2024
- “The Madness of High-Flyer [DeepSeek]: The Approach to LLM by an AI Giant That Few See”, 暗涌Waves & Nebula 2023
- “How Has DeepSeek Improved the Transformer Architecture?”
- “TinyZero”, Pan 2025
- “Deepseek-Ai/DeepSeek-V3”
- “HuggingFace: DeepSeek-R1”
- “Was Zuck Right about Chinese AI Models?”
- “Interview With Deepseek Founder: We’re Done Following. It’s Time to Lead.”
- “Deepseek: The Quiet Giant Leading China’s AI Race”
- “DeepSeek”
- “Two Interviews With the Founder of DeepSeek”
- ryunuck
- teortaxesTex
- Miscellaneous
- Bibliography
See Also
Links
“Anomalous Tokens in DeepSeek-V3 & Deep-Seek-R1”, Henry 2025
“DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning”, Guo et al 2025
“On DeepSeek’s R1”, Mowshowitz 2025
“The Hyperfitting Phenomenon: Sharpening and Stabilizing LLMs for Open-Ended Text Generation”, Carlsson et al 2024
“DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data”, Xin et al 2024
“DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models”, Shao et al 2024
“DeepSeek LLM: Scaling Open-Source Language Models With Longtermism”, Bi et al 2024
“The Madness of High-Flyer [DeepSeek]: The Approach to LLM by an AI Giant That Few See”, 暗涌Waves & Nebula 2023
“How Has DeepSeek Improved the Transformer Architecture?”
“TinyZero”, Pan 2025
“Deepseek-Ai/DeepSeek-V3”
“HuggingFace: DeepSeek-R1”
“Was Zuck Right about Chinese AI Models?”
“Interview With Deepseek Founder: We’re Done Following. It’s Time to Lead.”
“Deepseek: The Quiet Giant Leading China’s AI Race”
“DeepSeek”
“Two Interviews With the Founder of DeepSeek”
ryunuck
teortaxesTex
Miscellaneous
Bibliography
- https://arxiv.org/abs/2402.03300#deepseek: “DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models”,