- See Also
- Links
- Sort By Magic
- Miscellaneous
- Bibliography
See Also
Links
“The Structure of the Token Space for Large Language Models”, Robinson et al 2024
“OpenAI Begins Training next AI Model As It Battles Safety Concerns: Executive Appears to Backtrack on Start-Up’s Vision of Building ‘Superintelligence’ After Exits from ‘Superalignment’ Team”, Criddle 2024
“VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?”, Liu et al 2024
“How Tech Giants Cut Corners to Harvest Data for AI: OpenAI, Google and Meta Ignored Corporate Policies, Altered Their Own Rules and Discussed Skirting Copyright Law As They Sought Online Information to Train Their Newest Artificial Intelligence Systems”, Metz et al 2024
“Hal-Eval: A Universal and Fine-Grained Hallucination Evaluation Framework for Large Vision Language Models”, Jiang et al 2024
“Using Hallucinations to Bypass GPT-4’s Filter”, Lemkin 2024
“The Carbon Emissions of Writing and Illustrating Are Lower for AI Than for Humans”, Tomlinson et al 2024
“Altman Says ChatGPT Will Have to Evolve in ‘Uncomfortable’ Ways”, Heath 2024
“RAG vs Fine-Tuning: Pipelines, Tradeoffs, and a Case Study on Agriculture”, Balaguer et al 2024
“Gemini vs GPT-4-V: A Preliminary Comparison and Combination of Vision-Language Models Through Qualitative Cases”, Qi et al 2023
“Exploiting Novel GPT-4 APIs”, Pelrine et al 2023
“ByteDance Is Secretly Using OpenAI’s Tech to Build a Competitor”, Heath 2023
“Did I Get Sam Altman Fired from OpenAI?: Nathan’s Red-Teaming Experience, Noticing How the Board Was Not Aware of GPT-4 Jailbreaks & Had Not Even Tried GPT-4 prior to Its Early Release”, Labenz 2023
“Did I Get Sam Altman Fired from OpenAI? § GPT-4-Base”, Labenz 2023
“Inside the Chaos at OpenAI: Sam Altman’s Weekend of Shock and Drama Began a Year Ago, With the Release of ChatGPT”, Hao & Warzel 2023
“Instruction-Following Evaluation for Large Language Models”, Zhou et al 2023
“Everything of Thoughts: Defying the Law of Penrose Triangle for Thought Generation”, Ding et al 2023
“Holistic Analysis of Hallucination in GPT-4-V(ision): Bias and Interference Challenges”, Cui et al 2023
“Scalable and Transferable Black-Box Jailbreaks for Language Models via Persona Modulation”, Shah et al 2023
“Does GPT-4 Pass the Turing Test?”, Jones & Bergen 2023
“The Generative AI Paradox: "What It Can Create, It May Not Understand"”, West et al 2023
“Interpreting Learned Feedback Patterns in Large Language Models”, Marks et al 2023
“PAIR: Jailbreaking Black Box Large Language Models in 20 Queries”, Chao et al 2023
“The Dawn of LMMs: Preliminary Explorations With GPT-4-V(ision)”, Yang et al 2023
“Summarization Is (Almost) Dead”, Pu et al 2023
“Non-Determinism in GPT-4 Is Caused by Sparse MoE”, 152334H 2023
“Do Models Explain Themselves? Counterfactual Simulatability of Natural Language Explanations”, Chen et al 2023
“Large Language Models As Superpositions of Cultural Perspectives”, Kovač et al 2023
“Large Language Models Understand and Can Be Enhanced by Emotional Stimuli”, Li et al 2023
“AI Is a Lot of Work: As the Technology Becomes Ubiquitous, a Vast Tasker Underclass Is Emerging—And Not Going Anywhere”, Dzieza 2023
“Why YouTube Could Give Google an Edge in AI”, Victor 2023
“LTX by Broadridge Launches BondGPT™ Powered by OpenAI GPT-4”, Broadridge 2023
“LIMA: Less Is More for Alignment”, Zhou et al 2023
“Tree of Thoughts (ToT): Deliberate Problem Solving With Large Language Models”, Yao et al 2023
“Speak, Memory: An Archaeology of Books Known to ChatGPT/GPT-4”, Chang et al 2023
“What’s AGI, and Why Are AI Experts Skeptical? ChatGPT and Other Bots Have Revived Conversations on Artificial General Intelligence. Scientists Say Algorithms Won’t Surpass You Any Time Soon”, Rogers 2023
“ChemCrow: Augmenting Large-Language Models With Chemistry Tools”, Bran et al 2023
“Introducing Microsoft 365 Copilot—Your Copilot for Work”, Spataro 2023
“Sam Altman on What Makes Him ‘Super Nervous’ About AI: The OpenAI Co-Founder Thinks Tools like GPT-4 Will Be Revolutionary. But He’s Wary of Downsides”, Swisher 2023
“GPT-4 Technical Report”, OpenAI 2023
“GPT-4 Is Coming Next Week—And It Will Be Multimodal, Says Microsoft Germany”, Hahn 2023
“As AI Booms, Lawmakers Struggle to Understand the Technology: Tech Innovations Are Again Racing ahead of Washington’s Ability to Regulate Them, Lawmakers and AI Experts Said”, Kang & Satariano 2023
“Allen & Overy Breaks the Internet (and New Ground) With Co-Pilot Harvey”, Hill 2023
“Context on the NVIDIA ChatGPT Opportunity—And Ramifications of Large Language Model Enthusiasm”, davidtayar5 2023
“New Bing, and an Interview With Kevin Scott and Sam Altman About the Microsoft-OpenAI Partnership”, Thompson 2023
“How ChatGPT Kicked Off an AI Arms Race: Even inside the Company, the Chatbot’s Popularity Has Come As Something of a Shock”, Roose 2023
“Connor Leahy on Aliens, Ethics, Economics, Memetics, and Education § GPT-4”, Leahy 2023
“OpenAI CEO Sam Altman on GPT-4: ‘People Are Begging to Be Disappointed and They Will Be’”, Vincent 2023
“Microsoft Bets Big on the Creator of ChatGPT in Race to Dominate AI: As a New Chatbot Wows the World With Its Conversational Talents, a Resurgent Tech Giant Is Poised to Reap the Benefits While Doubling down on a Relationship With the Start-Up OpenAI”, Metz & Weise 2023
“What to Expect When You’re Expecting…GPT-4. What Comes After ChatGPT? 7 Predictions for 2023 § GPT-4”, Marcus 2022
“Here’s What I Saw at an AI Hackathon: AI Gossip, Celebrity Sightings, Tech Trends—And Some Great Projects”, Shipper 2022
“6 New Theories About AI: Software With Superpowers § GPT-4”, Armstrong 2022
“Have talked to several people who have used GPT-4, they all say it is uncanny how good it is”, immad 2022
“GPT-4 Rumors From Silicon Valley: People Are Saying Things…”, Romero 2022
“Stack More Layers Is Fine As GPT-4 Is about to Show but There Are Superior Routes…”, EMostaque 2022
“Chinchilla: Training Compute-Optimal Large Language Models”, Hoffmann et al 2022
“Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer”, Yang et al 2022
“Sparse Is Enough in Scaling Transformers”, Jaszczur et al 2021
“An Analysis of AI Political Preferences from a European Perspective”
Sort By Magic
Annotations sorted by machine learning into inferred 'tags'. This provides an alternative way to browse: instead of by date order, one can browse in topic order. The 'sorted' list has been automatically clustered into multiple sections & auto-labeled for easier browsing.
Beginning with the newest annotation, it uses the embedding of each annotation to attempt to create a list of nearest-neighbor annotations, creating a progression of topics. For more details, see the link.
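As a rough illustration of that ordering step, here is a minimal sketch under assumed details (the embedding model, the greedy nearest-neighbor chaining, and the `sort_by_similarity` helper are all hypothetical, not the site's actual code):

```python
# Minimal sketch of the "sort by magic" ordering described above: starting
# from the newest annotation, greedily hop to the most-similar unvisited
# annotation by cosine similarity of embeddings, yielding a topic progression.
# Embeddings are assumed to come from some text-embedding model (not shown);
# the clustering & auto-labeling into tag sections is likewise omitted.
import numpy as np

def sort_by_similarity(titles: list[str], embeddings: np.ndarray) -> list[str]:
    """Greedy nearest-neighbor ordering of annotations by embedding similarity."""
    # Normalize rows so that dot products equal cosine similarities.
    vecs = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    order = [0]                              # index 0 = newest annotation
    remaining = set(range(1, len(titles)))
    while remaining:
        current = vecs[order[-1]]
        # Next item: the unvisited annotation most similar to the current one.
        nxt = max(remaining, key=lambda i: float(current @ vecs[i]))
        order.append(nxt)
        remaining.remove(nxt)
    return [titles[i] for i in order]
```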
ai-ethics
instruction-following
multimodal
hallucination-issues
gpt-future
Miscellaneous
- /doc/ai/nn/transformer/gpt/4/2024-06-30-openai-chatgpt4o-systemprompt.txt
- https://github.com/spdustin/ChatGPT-AutoExpert/blob/main/System%20Prompts.md
- https://newsletter.pragmaticengineer.com/p/scaling-chatgpt#%C2%A7five-scaling-challenges
- https://nostalgebraist.tumblr.com/post/705192637617127424/gpt-4-prediction-it-wont-be-very-useful
- https://openai.com/blog/new-models-and-developer-products-announced-at-devday
- https://simonwillison.net/2023/Oct/14/multi-modal-prompt-injection/
- https://terrytao.wordpress.com/2023/06/19/ai-anthology/#comment-678803
- https://www.lesswrong.com/posts/DaaFce3hBoEzYhdvz/how-well-did-manifold-predict-gpt-4
- https://www.lesswrong.com/posts/qdStMFDMrWAnTqNWL/gpt-4-predictions
- https://www.lesswrong.com/posts/sY3a4Rfa48CgteBEm/chatgpt-can-learn-indirect-control
- https://www.lesswrong.com/posts/t9svvNPNmFf5Qa3TA/mysteries-of-mode-collapse#pfHTedu4GKaWoxD5K
- https://www.lesswrong.com/posts/tbJdxJMAiehewGpq2/impressions-from-base-gpt-4
- https://www.lesswrong.com/posts/tt7WtqiEyEiLmAecZ/what-will-gpt-4-be-incapable-of
- https://www.linkedin.com/pulse/building-new-bing-jordi-ribas/
- https://www.metaculus.com/questions/14305/when-will-gpt-4-be-announced/
- https://www.reddit.com/r/singularity/comments/1dmap17/possible_timelines_for_gpt45_and_gpt5/
- https://www.wired.com/story/cerebras-chip-cluster-neural-networks-ai/
Bibliography
- https://arxiv.org/abs/2410.08993: “The Structure of the Token Space for Large Language Models”, Robinson et al 2024
- https://arxiv.org/abs/2404.05955: “VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?”, Liu et al 2024
- https://www.nytimes.com/2024/04/06/technology/tech-giants-harvest-data-artificial-intelligence.html: “How Tech Giants Cut Corners to Harvest Data for AI: OpenAI, Google and Meta Ignored Corporate Policies, Altered Their Own Rules and Discussed Skirting Copyright Law As They Sought Online Information to Train Their Newest Artificial Intelligence Systems”, Metz et al 2024
- https://www.nature.com/articles/s41598-024-54271-x: “The Carbon Emissions of Writing and Illustrating Are Lower for AI Than for Humans”, Tomlinson et al 2024
- https://arxiv.org/abs/2401.08406#microsoft: “RAG vs Fine-Tuning: Pipelines, Tradeoffs, and a Case Study on Agriculture”, Balaguer et al 2024
- https://arxiv.org/abs/2312.15011: “Gemini vs GPT-4-V: A Preliminary Comparison and Combination of Vision-Language Models Through Qualitative Cases”, Qi et al 2023
- https://cognitiverevolution.substack.com/p/did-i-get-sam-altman-fired-from-openai: “Did I Get Sam Altman Fired from OpenAI?: Nathan’s Red-Teaming Experience, Noticing How the Board Was Not Aware of GPT-4 Jailbreaks & Had Not Even Tried GPT-4 prior to Its Early Release”, Labenz 2023
- https://www.theatlantic.com/technology/archive/2023/11/sam-altman-open-ai-chatgpt-chaos/676050/: “Inside the Chaos at OpenAI: Sam Altman’s Weekend of Shock and Drama Began a Year Ago, With the Release of ChatGPT”, Hao & Warzel 2023
- https://arxiv.org/abs/2311.03287: “Holistic Analysis of Hallucination in GPT-4-V(ision): Bias and Interference Challenges”, Cui et al 2023
- https://arxiv.org/abs/2310.08419: “PAIR: Jailbreaking Black Box Large Language Models in 20 Queries”, Chao et al 2023
- https://arxiv.org/abs/2309.17421: “The Dawn of LMMs: Preliminary Explorations With GPT-4-V(ision)”, Yang et al 2023
- https://152334h.github.io/blog/non-determinism-in-gpt-4/: “Non-Determinism in GPT-4 Is Caused by Sparse MoE”, 152334H 2023
- https://arxiv.org/abs/2307.07870: “Large Language Models As Superpositions of Cultural Perspectives”, Kovač et al 2023
- https://arxiv.org/abs/2307.11760#microsoft: “Large Language Models Understand and Can Be Enhanced by Emotional Stimuli”, Li et al 2023
- https://www.theverge.com/features/23764584/ai-artificial-intelligence-data-notation-labor-scale-surge-remotasks-openai-chatbots: “AI Is a Lot of Work: As the Technology Becomes Ubiquitous, a Vast Tasker Underclass Is Emerging—And Not Going Anywhere”, Dzieza 2023
- https://www.theinformation.com/articles/why-youtube-could-give-google-an-edge-in-ai: “Why YouTube Could Give Google an Edge in AI”, Victor 2023
- https://arxiv.org/abs/2305.10601#deepmind: “Tree of Thoughts (ToT): Deliberate Problem Solving With Large Language Models”, Yao et al 2023
- https://www.wired.com/story/what-is-artificial-general-intelligence-agi-explained/: “What’s AGI, and Why Are AI Experts Skeptical? ChatGPT and Other Bots Have Revived Conversations on Artificial General Intelligence. Scientists Say Algorithms Won’t Surpass You Any Time Soon”, Rogers 2023
- https://blogs.microsoft.com/blog/2023/03/16/introducing-microsoft-365-copilot-your-copilot-for-work/: “Introducing Microsoft 365 Copilot—Your Copilot for Work”, Spataro 2023
- https://nymag.com/intelligencer/2023/03/on-with-kara-swisher-sam-altman-on-the-ai-revolution.html: “Sam Altman on What Makes Him ‘Super Nervous’ About AI: The OpenAI Co-Founder Thinks Tools like GPT-4 Will Be Revolutionary. But He’s Wary of Downsides”, Swisher 2023
- https://www.heise.de/news/GPT-4-is-coming-next-week-and-it-will-be-multimodal-says-Microsoft-Germany-7540972.html: “GPT-4 Is Coming Next Week—And It Will Be Multimodal, Says Microsoft Germany”, Hahn 2023
- https://www.nytimes.com/2023/03/03/technology/artificial-intelligence-regulation-congress.html: “As AI Booms, Lawmakers Struggle to Understand the Technology: Tech Innovations Are Again Racing ahead of Washington’s Ability to Regulate Them, Lawmakers and AI Experts Said”, Kang & Satariano 2023
- https://legaltechnology.com/2023/02/16/allen-overy-breaks-the-internet-and-new-ground-with-co-pilot-harvey/: “Allen & Overy Breaks the Internet (and New Ground) With Co-Pilot Harvey”, Hill 2023
- https://x.com/davidtayar5/status/1627690520456691712: “Context on the NVIDIA ChatGPT Opportunity—And Ramifications of Large Language Model Enthusiasm”, davidtayar5 2023
- https://stratechery.com/2023/new-bing-and-an-interview-with-kevin-scott-and-sam-altman-about-the-microsoft-openai-partnership/: “New Bing, and an Interview With Kevin Scott and Sam Altman About the Microsoft-OpenAI Partnership”, Thompson 2023
- https://www.nytimes.com/2023/02/03/technology/chatgpt-openai-artificial-intelligence.html: “How ChatGPT Kicked Off an AI Arms Race: Even inside the Company, the Chatbot’s Popularity Has Come As Something of a Shock”, Roose 2023
- https://www.theverge.com/23560328/openai-gpt-4-rumor-release-date-sam-altman-interview: “OpenAI CEO Sam Altman on GPT-4: ‘People Are Begging to Be Disappointed and They Will Be’”, Vincent 2023
- https://www.nytimes.com/2023/01/12/technology/microsoft-openai-chatgpt.html: “Microsoft Bets Big on the Creator of ChatGPT in Race to Dominate AI: As a New Chatbot Wows the World With Its Conversational Talents, a Resurgent Tech Giant Is Poised to Reap the Benefits While Doubling down on a Relationship With the Start-Up OpenAI”, Metz & Weise 2023
- https://garymarcus.substack.com/p/what-to-expect-when-youre-expecting: “What to Expect When You’re Expecting…GPT-4. What Comes After ChatGPT? 7 Predictions for 2023 § GPT-4”, Marcus 2022
- https://every.to/chain-of-thought/the-knee-of-the-exponential-curve: “Here’s What I Saw at an AI Hackathon: AI Gossip, Celebrity Sightings, Tech Trends—And Some Great Projects”, Shipper 2022
- https://www.thealgorithmicbridge.com/p/gpt-4-rumors-from-silicon-valley: “GPT-4 Rumors From Silicon Valley: People Are Saying Things…”, Romero 2022
- https://x.com/EMostaque/status/1563715319566409729: “Stack More Layers Is Fine As GPT-4 Is about to Show but There Are Superior Routes…”, EMostaque 2022
- https://arxiv.org/abs/2203.15556#deepmind: “Chinchilla: Training Compute-Optimal Large Language Models”, Hoffmann et al 2022
- https://arxiv.org/abs/2203.03466#microsoft: “Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer”, Yang et al 2022
- https://arxiv.org/abs/2111.12763#google: “Sparse Is Enough in Scaling Transformers”, Jaszczur et al 2021