Continuous Autoregressive Models with Noise Augmentation Avoid Error Accumulation
GSoC 2024: Differentiable Logic for Interactive Systems and Generative Music
SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound
An accurate and rapidly calibrating speech neuroprosthesis
A Disney director tried—and failed—to use an AI Hans Zimmer to create a soundtrack
Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
TANGO: Text-to-Audio Generation using Instruction-Tuned LLM and Latent Diffusion Model
CLaMP: Contrastive Language-Music Pre-training for Cross-Modal Symbolic Music Information Retrieval
Speak, Read and Prompt (SPEAR-TTS): High-Fidelity Text-to-Speech with Minimal Supervision
Msanii: High Fidelity Music Synthesis on a Shoestring Budget
Rock Guitar Tablature Generation via Natural Language Processing
VALL-E: Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers
Hierarchical Diffusion Models for Singing Voice Neural Vocoder
RealSinger: Ultra-Realistic Singing Voice Generation via Stochastic Differential Equations
MeloForm: Generating Melody with Musical Form based on Expert Systems and Neural Networks
AI composer bias: Listeners like music less when they think it was composed by an AI
Multitrack Music Transformer: Learning Long-Term Dependencies in Music with Diverse Instruments
BigVGAN: A Universal Neural Vocoder with Large-Scale Training
CLAP: Learning Audio Concepts From Natural Language Supervision
Tradformer: A Transformer Model of Traditional Music Transcriptions
SymphonyNet: Symphony Generation with Permutation Invariant Language Model
General-purpose, long-context autoregressive modeling with Perceiver AR
FIGARO: Generating Symbolic Music with Fine-Grained Artistic Control
MLP Singer: Towards Rapid Parallel Korean Singing Voice Synthesis
PriorGrad: Improving Conditional Denoising Diffusion Models with Data-Dependent Adaptive Prior
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
Interacting with GPT-2 to Generate Controlled and Believable Musical Sequences in ABC Notation
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
DeepSinger: Singing Voice Synthesis with Data Mined From the Web
Learning Music Helps You Read: Using Transfer to Study Linguistic Structure in Language Models
Pop Music Transformer: Beat-based Modeling and Generation of Expressive Pop Piano Compositions
Writing the Next American Hit: Using GPT-2 to Explore the Possibility of Creating Successful AI-Generated Song Lyrics
Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram
MuseNet: a deep neural network that can generate 4-minute musical compositions with 10 different instruments, and can combine styles from country to Mozart to the Beatles
Generative Modeling with Sparse Transformers: We’ve developed the Sparse Transformer, a deep neural network which sets new records at predicting what comes next in a sequence—whether text, images, or sound. It uses an algorithmic improvement of the attention mechanism to extract patterns from sequences 30× longer than possible previously
Music Transformer: Generating Music with Long-Term Structure
This Time with Feeling: Learning Expressive Musical Performance
The challenge of realistic music generation: modeling raw audio at scale
Towards Deep Modeling of Music Semantics using EEG Regularizers
Objective-Reinforced Generative Adversarial Networks (ORGAN) for Sequence Generation Models
Neural Audio Synthesis of Musical Notes with WaveNet Autoencoders
Tuning Recurrent Neural Networks with Reinforcement Learning
SampleRNN: An Unconditional End-to-End Neural Audio Generation Model
Staring Emmy Straight in the Eye—And Doing My Best Not to Flinch
Connectionist Music Composition Based on Melodic, Stylistic, and Psychophysical Constraints [Technical report CU-CS-495-90]
Autoregressive Long-Context Music Generation With Perceiver AR
midi2abc: Program to Convert MIDI Format Files to Abc Notation
'It's the Screams of the Damned!' The Eerie AI World of Deepfake Music
Inside the Discord Where Thousands of Rogue Producers Are Making AI Music
2023-wang-figure1-vallevoicesynthesisautoregressivearchitecture.jpg
2023-girdhar-figure5-objectdetectioninimageswithaudioqueriesinimagebindrequiringnoretraining.png
2020-07-07-nshepperd-openaijukebox-gpt3-theuniverseisaglitch.mp3
2020-04-15-gpt2-midi-pop_midi-setgggeneraloperationsmcnewtonmixx.mp3
2020-03-30-fifteenai-twilightsparkle-sel-presentdaypresenttime.mp3
2019-11-10-gpt2-irish-spaceless-50variantsonynbollanbane.mp3
https://app.suno.ai/song/1e726e3d-c6b4-4b42-9576-e03169f29165/
https://app.suno.ai/song/f81be5c0-3de9-4940-95e1-cfb780d3aa5e/
https://arstechnica.com/ai/2024/02/mastering-music-is-hard-can-one-click-ai-make-it-easy/
https://blog.metabrainz.org/2022/02/16/acousticbrainz-making-a-hard-decision-to-end-the-project/
https://blog.youtube/inside-youtube/ai-and-music-experiment/
https://colab.research.google.com/github/sberbank-ai/music-composer/blob/main/src/Music_Composer_Demo_Colab.ipynb
https://colinmeloy.substack.com/p/i-had-chatgpt-write-a-decemberists
https://deepmind.google/discover/blog/transforming-the-future-of-music-creation/
https://magenta.tensorflow.org/blog/2017/06/01/waybackprop
https://mtg.upf.edu/system/files/publications/Font-Roma-Serra-ACMM-2013.pdf
https://news.play.ht/post/introducing-playht2-0-the-state-of-the-art-generative-voice-ai-model-for-conversational-speech
https://openai.com/blog/navigating-the-challenges-and-opportunities-of-synthetic-voices
https://research.google/blog/google-research-2022-beyond-language-vision-and-generative-models/
https://suno.com/song/0013a4ad-8373-4b80-994a-e04ae57518cd
https://suno.com/song/0a2d2dd8-16aa-47f2-9316-e6946b431bb4
https://suno.com/song/10c8a965-f872-4829-9210-8fa27bdc87c5
https://suno.com/song/1991fffb-fe12-4fa4-8db7-470e40be70c0
https://suno.com/song/1b2710da-a0da-48dc-833d-46b70ee08102
https://suno.com/song/25992548-470d-4bea-85e7-6514fa5b7664
https://suno.com/song/25e1ebd7-84cd-4b3d-a6b7-0b2f93fa638e
https://suno.com/song/341ddaf1-08d5-46a9-8cb5-1742cb22eaf5
https://suno.com/song/366db53d-002d-4590-a2a0-0547458c911c
https://suno.com/song/41c6c7e7-ac7b-43e9-993b-d9f3c8c1b3cb
https://suno.com/song/5be76253-bd58-4930-8265-4d768ed5069e
https://suno.com/song/63df5758-c533-447a-be4a-06dcb5abdbbf
https://suno.com/song/674e86cb-0395-414a-a291-a4c11a9efc4d
https://suno.com/song/79b82aa4-0335-4231-8dfb-b272fa7536b6
https://suno.com/song/8c2c7394-44aa-4fe4-ba38-edd7bea54d3d
https://suno.com/song/9e071497-0547-4539-be8b-b62b8dad63a8
https://suno.com/song/a1d34143-c9f1-4ca5-8642-d2c68d8f3564
https://suno.com/song/ae044a64-27de-40da-9689-0e7485daf698
https://suno.com/song/b1d81bcd-1a9b-4639-9fb0-8462852132c4
https://suno.com/song/c13d9aff-9d63-45cc-95fa-892608ae1f23
https://suno.com/song/da6d4a83-1001-4694-8c28-648a6e8bad0a
https://suno.com/song/e6ef4aca-46c6-499d-aec5-bf87aee2c2ac
https://suno.com/song/f78dedbb-55a2-41e9-84bb-4abe0e4c36f7
https://suno.com/song/f7fc9610-f4fb-4d11-a56d-6c8617422d52
https://suno.com/song/fdacb428-ff27-4f2e-832d-cc318ce8cf7b
https://www.404media.co/harry-styles-one-direction-ai-leaked-songs/
https://www.chinatalk.media/p/reflections-from-neurips-the-worlds#%C2%A7chinas-ai-generated-youtube-propaganda
https://www.engadget.com/drew-carey-made-a-radio-show-with-ai-fans-werent-pleased-143014038.html
https://www.karolpiczak.com/papers/Piczak2015-ESC-Dataset.pdf
https://www.lesswrong.com/posts/DfqcyGXcFcukYbWZ5/i-measure-google-s-musiclm-over-3-months-as-it-appears-to-go
https://www.lesswrong.com/posts/YMo5PuXnZDwRjhHhE/i-have-been-a-good-bing
https://www.newyorker.com/magazine/2024/02/05/inside-the-music-industrys-high-stakes-ai-experiments
https://www.reddit.com/r/singularity/comments/13h0zyy/i_really_crank_out_music_tracks_with_musiclm_this/
https://www.rollingstone.com/music/music-features/suno-ai-chatgpt-for-music-1234982307/
https://www.rollingstone.com/music/music-features/udio-ai-music-chatgpt-suno-1235001675/
https://www.theguardian.com/music/2022/feb/18/confucius-beowulf-and-an-ai-called-kevin-everything-everythings-search-for-hope-in-strange-places
https://www.theguardian.com/technology/2024/apr/28/bbc-presenters-likeness-used-in-advert-after-firm-tricked-by-ai-generated-voice
https://www.vice.com/en/article/k7z8be/torswats-computer-generated-ai-voice-swatting
https://arxiv.org/abs/2305.09636#google
https://arxiv.org/abs/2305.05665#facebook
https://raw.githubusercontent.com/flavioschneider/master-thesis/main/audio_diffusion_thesis.pdf
https://arxiv.org/abs/2301.02111#microsoft
https://arxiv.org/abs/2210.13438#facebook
https://arxiv.org/abs/2210.07508#sony
https://arxiv.org/abs/2206.04658#nvidia
https://arxiv.org/abs/2202.07765#deepmind
https://arxiv.org/abs/1910.11480#naver
https://openai.com/research/musenet
https://magenta.tensorflow.org/music-transformer