‘Whisper NN’ directory

See Also
Links
Miscellaneous
Bibliography

See Also

Parent (‘GPT’ tag)

Links

“OpenAI Charges by the Minute, So Make the Minutes Shorter”, Mandis 2025

OpenAI Charges by the Minute, So Make the Minutes Shorter

“How Tech Giants Cut Corners to Harvest Data for AI: OpenAI, Google and Meta Ignored Corporate Policies, Altered Their Own Rules and Discussed Skirting Copyright Law As They Sought Online Information to Train Their Newest Artificial Intelligence Systems”, Metz et al 2024

How Tech Giants Cut Corners to Harvest Data for AI: OpenAI, Google and Meta ignored corporate policies, altered their own rules and discussed skirting copyright law as they sought online information to train their newest artificial intelligence systems

“Careless Whisper: Speech-To-Text Hallucination Harms”, Koenecke et al 2024

Careless Whisper: Speech-to-Text Hallucination Harms

“Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling”, Gandhi et al 2023

Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling

“Whisper-AT: Noise-Robust Automatic Speech Recognizers Are Also Strong General Audio Event Taggers”, Gong et al 2023

Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers

“Why YouTube Could Give Google an Edge in AI”, Victor 2023

Why YouTube Could Give Google an Edge in AI

“WhisperX: Time-Accurate Speech Transcription of Long-Form Audio”, Bain et al 2023

WhisperX: Time-Accurate Speech Transcription of Long-Form Audio

“Whisper: Robust Speech Recognition via Large-Scale Weak Supervision”, Radford et al 2022

Whisper: Robust Speech Recognition via Large-Scale Weak Supervision

“ESB: A Benchmark For Multi-Domain End-To-End Speech Recognition”, Gandhi et al 2022

ESB: A Benchmark For Multi-Domain End-to-End Speech Recognition

“The History of Speech Recognition to the Year 2030”, Hannun 2021

The History of Speech Recognition to the Year 2030

“The History of Speech Recognition to the Year 2030 [Blog]”, Hannun 2021

The History of Speech Recognition to the Year 2030 [blog]

View HTML:

/doc/www/awni.github.io/4cd4c7eaf803a808b8ea623005f67672af20a2fd.html

“SpeechStew: Simply Mix All Available Speech Recognition Data to Train One Large Neural Network”, Chan et al 2021

SpeechStew: Simply Mix All Available Speech Recognition Data to Train One Large Neural Network

Miscellaneous

Bibliography

https://www.nytimes.com/2024/04/06/technology/tech-giants-harvest-data-artificial-intelligence.html: “How Tech Giants Cut Corners to Harvest Data for AI: OpenAI, Google and Meta Ignored Corporate Policies, Altered Their Own Rules and Discussed Skirting Copyright Law As They Sought Online Information to Train Their Newest Artificial Intelligence Systems”, Cade Metz, Cecilia Kang, Sheera Frenkel, Stuart A. Thompson, Nico Grant

link-bibliography
https://arxiv.org/abs/2311.00430: “Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling”, Sanchit Gandhi, Patrick von Platen, Alexander M. Rush

link-bibliography
https://www.theinformation.com/articles/why-youtube-could-give-google-an-edge-in-ai: “Why YouTube Could Give Google an Edge in AI”, Jon Victor

link-bibliography
https://arxiv.org/abs/2212.04356#openai: “Whisper: Robust Speech Recognition via Large-Scale Weak Supervision”, Alec Radford, Jong Wook Kim, Tao Xu, Greg Brockman, Christine McLeavey, Ilya Sutskever

link-bibliography
https://arxiv.org/abs/2210.13352#huggingface: “ESB: A Benchmark For Multi-Domain End-To-End Speech Recognition”, Sanchit Gandhi, Patrick von Platen, Alexander M. Rush

link-bibliography
https://arxiv.org/abs/2104.02133#google: “SpeechStew: Simply Mix All Available Speech Recognition Data to Train One Large Neural Network”, William Chan, Daniel Park, Chris Lee, Yu Zhang, Quoc V. Le, Mohammad Norouzi

link-bibliography

[Quote Of The Day]

[Site Of The Day]

[Annotation Of The Day]

[adblock public service announcement]