- See Also
- Links
- “A Disney Director Tried—and Failed—to Use an AI Hans Zimmer to Create a Soundtrack”, Heikkilä 2023
- “Whisper-AT: Noise-Robust Automatic Speech Recognizers Are Also Strong General Audio Event Taggers”, Gong et al 2023
- “Voice Conversion With Just Nearest Neighbors”, Baas et al 2023
- “SoundStorm: Efficient Parallel Audio Generation”, Borsos et al 2023
- “ImageBind: One Embedding Space To Bind Them All”, Girdhar et al 2023
- “TANGO: Text-to-Audio Generation Using Instruction-Tuned LLM and Latent Diffusion Model”, Ghosal et al 2023
- “CLaMP: Contrastive Language-Music Pre-training for Cross-Modal Symbolic Music Information Retrieval”, Wu et al 2023
- “Speak, Read and Prompt (SPEAR-TTS): High-Fidelity Text-to-Speech With Minimal Supervision”, Kharitonov et al 2023
- “Archisound: Audio Generation With Diffusion”, Schneider 2023
- “Msanii: High Fidelity Music Synthesis on a Shoestring Budget”, Maina 2023
- “Rock Guitar Tablature Generation via Natural Language Processing”, Casco-Rodriguez 2023
- “VALL-E: Neural Codec Language Models Are Zero-Shot Text to Speech Synthesizers”, Wang et al 2023
- “Robust Speech Recognition via Large-Scale Weak Supervision”, Radford et al 2022
- “High Fidelity Neural Audio Compression”, Défossez et al 2022
- “Hierarchical Diffusion Models for Singing Voice Neural Vocoder”, Takahashi et al 2022
- “RealSinger: Ultra-Realistic Singing Voice Generation via Stochastic Differential Equations”, Anonymous 2022
- “AudioLM: a Language Modeling Approach to Audio Generation”, Borsos et al 2022
- “MeloForm: Generating Melody With Musical Form Based on Expert Systems and Neural Networks”, Lu et al 2022
- “AI Composer Bias: Listeners like Music Less When They Think It Was Composed by an AI”, Shank et al 2022
- “Musika! Fast Infinite Waveform Music Generation”, Pasini & Schlüter 2022
- “Multitrack Music Transformer: Learning Long-Term Dependencies in Music With Diverse Instruments”, Dong et al 2022
- “CLAP: Learning Audio Concepts From Natural Language Supervision”, Elizalde et al 2022
- “BigVGAN: A Universal Neural Vocoder With Large-Scale Training”, Lee et al 2022
- “Tradformer: A Transformer Model of Traditional Music Transcriptions”, Casini & Sturm 2022
- “SymphonyNet: Symphony Generation With Permutation Invariant Language Model”, Liu et al 2022
- “It’s Raw! Audio Generation With State-Space Models”, Goel et al 2022
- “General-purpose, Long-context Autoregressive Modeling With Perceiver AR”, Hawthorne et al 2022
- “FIGARO: Generating Symbolic Music With Fine-Grained Artistic Control”, Rütte et al 2022
- “Steerable Discovery of Neural Audio Effects”, Steinmetz & Reiss 2021
- “Semi-Supervised Music Tagging Transformer”, Won et al 2021
- “AudioCLIP: Extending CLIP to Image, Text and Audio”, Guzhov et al 2021
- “MLP Singer: Towards Rapid Parallel Korean Singing Voice Synthesis”, Tae et al 2021
- “PriorGrad: Improving Conditional Denoising Diffusion Models With Data-Dependent Adaptive Prior”, Lee et al 2021
- “DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism”, Liu et al 2021
- “Symbolic Music Generation With Diffusion Models”, Mittal et al 2021
- “Interacting With GPT-2 to Generate Controlled and Believable Musical Sequences in ABC Notation”, Geerlings & Meroño-Peñuela 2020
- “HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis”, Kong et al 2020
- “AI Song Contest: Human-AI Co-Creation in Songwriting”, Huang et al 2020
- “DeepSinger: Singing Voice Synthesis With Data Mined From the Web”, Ren et al 2020
- “Learning Music Helps You Read: Using Transfer to Study Linguistic Structure in Language Models”, Papadimitriou & Jurafsky 2020
- “15.ai”, Fifteen-kun & Pony Preservation Project 2020
- “Pop Music Transformer: Beat-based Modeling and Generation of Expressive Pop Piano Compositions”, Huang & Yang 2020
- “Writing the Next American Hit: Using GPT-2 to Explore the Possibility of Creating Successful AI-Generated Song Lyrics”, Barrio 2020
- “GPT-2 Preference Learning for Music Generation”, Gwern 2019
- “Encoding Musical Style With Transformer Autoencoders”, Choi et al 2019
- “GPT-2 Folk Music”, Branwen & Presser 2019
- “Parallel WaveGAN: A Fast Waveform Generation Model Based on Generative Adversarial Networks With Multi-resolution Spectrogram”, Yamamoto et al 2019
- “Low-dimensional Embodied Semantics for Music and Language”, Raposo et al 2019
- “MuseNet: a Deep Neural Network That Can Generate 4-minute Musical Compositions With 10 Different Instruments, and Can Combine Styles from Country to Mozart to the Beatles”, Payne 2019
- “Generative Modeling With Sparse Transformers: We’ve Developed the Sparse Transformer, a Deep Neural Network Which Sets New Records at Predicting What Comes next in a Sequence—whether Text, Images, or Sound. It Uses an Algorithmic Improvement of the attention Mechanism to Extract Patterns from Sequences 30× Longer Than Possible Previously”, Child & Gray 2019
- “Music Transformer: Generating Music With Long-Term Structure”, Huang et al 2018
- “FloWaveNet: A Generative Flow for Raw Audio”, Kim et al 2018
- “Piano Genie”, Donahue et al 2018
- “Music Transformer”, Huang et al 2018
- “This Time With Feeling: Learning Expressive Musical Performance”, Oore et al 2018
- “The Challenge of Realistic Music Generation: Modelling Raw Audio at Scale”, Dieleman et al 2018
- “The Sound of Pixels”, Zhao et al 2018
- “Efficient Neural Audio Synthesis”, Kalchbrenner et al 2018
- “Generating Structured Music through Self-Attention”, Huang et al 2018
- “Towards Deep Modeling of Music Semantics Using EEG Regularizers”, Raposo et al 2017
- “Objective-Reinforced Generative Adversarial Networks (ORGAN) for Sequence Generation Models”, Guimaraes et al 2017
- “Neural Audio Synthesis of Musical Notes With WaveNet Autoencoders”, Engel et al 2017
- “Tuning Recurrent Neural Networks With Reinforcement Learning”, Jaques et al 2017
- “SampleRNN: An Unconditional End-to-End Neural Audio Generation Model”, Mehri et al 2016
- “WaveNet: A Generative Model for Raw Audio”, Oord et al 2016
- “The ABC Music Standard 2.1: §3.1.1: X: Reference Number”, Walshaw 2011
- “Staring Emmy Straight in the Eye—And Doing My Best Not to Flinch”, Hofstadter & Cope 2001
- “Connectionist Music Composition Based on Melodic, Stylistic, and Psychophysical Constraints [Technical Report CU-CS–495–90]”, Mozer 1990
- Sort By Magic
- Wikipedia
- Miscellaneous
- Link Bibliography
See Also
Links
“A Disney Director Tried—and Failed—to Use an AI Hans Zimmer to Create a Soundtrack”, Heikkilä 2023
“Whisper-AT: Noise-Robust Automatic Speech Recognizers Are Also Strong General Audio Event Taggers”, Gong et al 2023
“Voice Conversion With Just Nearest Neighbors”, Baas et al 2023
“SoundStorm: Efficient Parallel Audio Generation”, Borsos et al 2023
“ImageBind: One Embedding Space To Bind Them All”, Girdhar et al 2023
“TANGO: Text-to-Audio Generation Using Instruction-Tuned LLM and Latent Diffusion Model”, Ghosal et al 2023
“CLaMP: Contrastive Language-Music Pre-training for Cross-Modal Symbolic Music Information Retrieval”, Wu et al 2023
“Speak, Read and Prompt (SPEAR-TTS): High-Fidelity Text-to-Speech With Minimal Supervision”, Kharitonov et al 2023
“Archisound: Audio Generation With Diffusion”, Schneider 2023
“Msanii: High Fidelity Music Synthesis on a Shoestring Budget”, Maina 2023
“Rock Guitar Tablature Generation via Natural Language Processing”, Casco-Rodriguez 2023
“VALL-E: Neural Codec Language Models Are Zero-Shot Text to Speech Synthesizers”, Wang et al 2023
“Robust Speech Recognition via Large-Scale Weak Supervision”, Radford et al 2022
“High Fidelity Neural Audio Compression”, Défossez et al 2022
“Hierarchical Diffusion Models for Singing Voice Neural Vocoder”, Takahashi et al 2022
“RealSinger: Ultra-Realistic Singing Voice Generation via Stochastic Differential Equations”, Anonymous 2022
“AudioLM: a Language Modeling Approach to Audio Generation”, Borsos et al 2022
“MeloForm: Generating Melody With Musical Form Based on Expert Systems and Neural Networks”, Lu et al 2022
“AI Composer Bias: Listeners like Music Less When They Think It Was Composed by an AI”, Shank et al 2022
“Musika! Fast Infinite Waveform Music Generation”, Pasini & Schlüter 2022
“Multitrack Music Transformer: Learning Long-Term Dependencies in Music With Diverse Instruments”, Dong et al 2022
“CLAP: Learning Audio Concepts From Natural Language Supervision”, Elizalde et al 2022
“BigVGAN: A Universal Neural Vocoder With Large-Scale Training”, Lee et al 2022
“Tradformer: A Transformer Model of Traditional Music Transcriptions”, Casini & Sturm 2022
“SymphonyNet: Symphony Generation With Permutation Invariant Language Model”, Liu et al 2022
“It’s Raw! Audio Generation With State-Space Models”, Goel et al 2022
“General-purpose, Long-context Autoregressive Modeling With Perceiver AR”, Hawthorne et al 2022
“FIGARO: Generating Symbolic Music With Fine-Grained Artistic Control”, Rütte et al 2022
“Steerable Discovery of Neural Audio Effects”, Steinmetz & Reiss 2021
“Semi-Supervised Music Tagging Transformer”, Won et al 2021
“AudioCLIP: Extending CLIP to Image, Text and Audio”, Guzhov et al 2021
“MLP Singer: Towards Rapid Parallel Korean Singing Voice Synthesis”, Tae et al 2021
“PriorGrad: Improving Conditional Denoising Diffusion Models With Data-Dependent Adaptive Prior”, Lee et al 2021
“DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism”, Liu et al 2021
“Symbolic Music Generation With Diffusion Models”, Mittal et al 2021
“Interacting With GPT-2 to Generate Controlled and Believable Musical Sequences in ABC Notation”, Geerlings & Meroño-Peñuela 2020
“HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis”, Kong et al 2020
“AI Song Contest: Human-AI Co-Creation in Songwriting”, Huang et al 2020
“DeepSinger: Singing Voice Synthesis With Data Mined From the Web”, Ren et al 2020
“Learning Music Helps You Read: Using Transfer to Study Linguistic Structure in Language Models”, Papadimitriou & Jurafsky 2020
“15.ai”, Fifteen-kun & Pony Preservation Project 2020
“Pop Music Transformer: Beat-based Modeling and Generation of Expressive Pop Piano Compositions”, Huang & Yang 2020
“Writing the Next American Hit: Using GPT-2 to Explore the Possibility of Creating Successful AI-Generated Song Lyrics”, Barrio 2020
“GPT-2 Preference Learning for Music Generation”, Gwern 2019
“Encoding Musical Style With Transformer Autoencoders”, Choi et al 2019
“GPT-2 Folk Music”, Branwen & Presser 2019
“Parallel WaveGAN: A Fast Waveform Generation Model Based on Generative Adversarial Networks With Multi-resolution Spectrogram”, Yamamoto et al 2019
“Low-dimensional Embodied Semantics for Music and Language”, Raposo et al 2019
“MuseNet: a Deep Neural Network That Can Generate 4-minute Musical Compositions With 10 Different Instruments, and Can Combine Styles from Country to Mozart to the Beatles”, Payne 2019
“Generative Modeling With Sparse Transformers: We’ve Developed the Sparse Transformer, a Deep Neural Network Which Sets New Records at Predicting What Comes next in a Sequence—whether Text, Images, or Sound. It Uses an Algorithmic Improvement of the attention Mechanism to Extract Patterns from Sequences 30× Longer Than Possible Previously”, Child & Gray 2019
“Music Transformer: Generating Music With Long-Term Structure”, Huang et al 2018
“FloWaveNet: A Generative Flow for Raw Audio”, Kim et al 2018
“Piano Genie”, Donahue et al 2018
“Music Transformer”, Huang et al 2018
“This Time With Feeling: Learning Expressive Musical Performance”, Oore et al 2018
“The Challenge of Realistic Music Generation: Modelling Raw Audio at Scale”, Dieleman et al 2018
“The Sound of Pixels”, Zhao et al 2018
“Efficient Neural Audio Synthesis”, Kalchbrenner et al 2018
“Generating Structured Music through Self-Attention”, Huang et al 2018
“Towards Deep Modeling of Music Semantics Using EEG Regularizers”, Raposo et al 2017
“Objective-Reinforced Generative Adversarial Networks (ORGAN) for Sequence Generation Models”, Guimaraes et al 2017
“Neural Audio Synthesis of Musical Notes With WaveNet Autoencoders”, Engel et al 2017
“Tuning Recurrent Neural Networks With Reinforcement Learning”, Jaques et al 2017
“SampleRNN: An Unconditional End-to-End Neural Audio Generation Model”, Mehri et al 2016
“WaveNet: A Generative Model for Raw Audio”, Oord et al 2016
“The ABC Music Standard 2.1: §3.1.1: X: Reference Number”, Walshaw 2011
“Staring Emmy Straight in the Eye—And Doing My Best Not to Flinch”, Hofstadter & Cope 2001
“Connectionist Music Composition Based on Melodic, Stylistic, and Psychophysical Constraints [Technical Report CU-CS–495–90]”, Mozer 1990
Sort By Magic
Annotations sorted by machine learning into inferred 'tags'. This provides an alternative way to browse: instead of by date order, one can browse in topic order. The 'sorted' list has been automatically clustered into multiple sections & auto-labeled for easier browsing.
Beginning with the newest annotation, it uses the embedding of each annotation to attempt to create a list of nearest-neighbor annotations, creating a progression of topics. For more details, see the link.
music-generation
music-transformer
audio-synthesis
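The nearest-neighbor progression described above can be sketched as a greedy walk over annotation embeddings. This is an illustrative reconstruction, not the site's actual implementation: the function name `magic_sort`, the toy 2-D embeddings, and the start-at-newest convention are all assumptions.

```python
# Hypothetical sketch of the "Sort By Magic" ordering: starting from the
# newest annotation, repeatedly step to the most-similar unvisited
# annotation, producing a progression of topics rather than a date order.
import numpy as np

def magic_sort(embeddings: np.ndarray) -> list[int]:
    """Order items so each follows its nearest unvisited neighbor (cosine)."""
    n = len(embeddings)
    # Normalize rows so a dot product equals cosine similarity.
    unit = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    order = [0]                      # begin with the newest annotation (index 0)
    remaining = set(range(1, n))
    while remaining:
        current = unit[order[-1]]
        # Pick the unvisited annotation most similar to the current one.
        best = max(remaining, key=lambda i: float(current @ unit[i]))
        order.append(best)
        remaining.remove(best)
    return order

# Toy example: two loose clusters; clustered points end up adjacent.
emb = np.array([[1.0, 0.0], [0.0, 1.0], [0.9, 0.1], [0.1, 0.9]])
print(magic_sort(emb))  # → [0, 2, 3, 1]
```

Clustering the resulting chain into labeled sections (the tags above) would then be a separate step, e.g. cutting the chain where similarity between consecutive items drops.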
Wikipedia
Miscellaneous
- /doc/ai/music/2023-wang-figure1-vallevoicesynthesisautoregressivearchitecture.png
- /doc/ai/music/2021-07-03-purplesmartai-applejack-navysealcopypasta.mp3
- /doc/ai/music/2020-07-07-nshepperd-openaijukebox-gpt3-theuniverseisaglitch.mp3
- /doc/ai/music/2020-04-15-gpt2-midi-thesessionsabc-9199774293.mp3
- /doc/ai/music/2020-04-15-gpt2-midi-thesessionsabc-6791639443.mp3
- /doc/ai/music/2020-04-15-gpt2-midi-thesessionsabc-6473931123.mp3
- /doc/ai/music/2020-04-15-gpt2-midi-thesessionsabc-3400691.mp3
- /doc/ai/music/2020-04-15-gpt2-midi-thesessionsabc-33762535.mp3
- /doc/ai/music/2020-04-15-gpt2-midi-thesessionsabc-3374184109.mp3
- /doc/ai/music/2020-04-15-gpt2-midi-thesessionsabc-3308925389.mp3
- /doc/ai/music/2020-04-15-gpt2-midi-thesessionsabc-32506201.mp3
- /doc/ai/music/2020-04-15-gpt2-midi-thesessionsabc-2625946.mp3
- /doc/ai/music/2020-04-15-gpt2-midi-thesessionsabc-19337613.mp3
- /doc/ai/music/2020-04-15-gpt2-midi-thesessionsabc-1838864.mp3
- /doc/ai/music/2020-04-15-gpt2-midi-thesessionsabc-1772291.mp3
- /doc/ai/music/2020-04-15-gpt2-midi-thesessionsabc-145036110185.mp3
- /doc/ai/music/2020-04-15-gpt2-midi-thesessionsabc-11877811957.mp3
- /doc/ai/music/2020-04-15-gpt2-midi-pop_midi-settttiestodaffatta.mp3
- /doc/ai/music/2020-04-15-gpt2-midi-pop_midi-setsssgtroscnpelciope.mp3
- /doc/ai/music/2020-04-15-gpt2-midi-pop_midi-setgggeneraloperationsmcnewtonmixx.mp3
- /doc/ai/music/2020-04-15-gpt2-midi-pop_mid-ismirlmd_matchedg.mp3
- /doc/ai/music/2020-04-15-gpt2-midi-lmd_full-ee_214dda09c3020.mp3
- /doc/ai/music/2020-04-15-gpt2-midi-lmd_full-dd83b4c18d8897e6.mp3
- /doc/ai/music/2020-04-15-gpt2-midi-lmd_full-99ebb19c3ffcaac7.mp3
- /doc/ai/music/2020-04-15-gpt2-midi-lmd_full-8861e24a8b983dff.mp3
- /doc/ai/music/2020-04-15-gpt2-midi-lmd_full-554f3a38f2676bfe.mp3
- /doc/ai/music/2020-04-01-fifteenai-twilightsparkle-telephonecall.mp3
- /doc/ai/music/2020-03-30-fifteenai-twilightsparkle-sel-presentdaypresenttime.mp3
- /doc/ai/music/2020-03-28-fifteenai-ensemble-hellofellowhumans.mp3
- /doc/ai/music/2020-03-06-fifteenai-twilightsparkle-sithcode.mp3
- /doc/ai/music/2020-01-26-gwern-gpt2-preferencelearning-datacode.tar.xz
- /doc/ai/music/2020-01-25-gpt2-rl-final-bourreeasixdebriantes.mp3
- /doc/ai/music/2019-12-22-gpt2-preferencelearning-gwern-abcmusic.patch
- /doc/ai/music/2019-12-09-gpt2-abccombined-samples-top_p0.95.txt
- /doc/ai/music/2019-12-04-gpt2-combinedabc-invereshieshouse.mp3
- /doc/ai/music/2019-11-10-gpt2-irish-spaceless-50variantsonynbollanbane.mp3
- /doc/ai/music/2019-11-09-gpt2-nospaces-samples-top_p0.99.txt
- /doc/ai/music/2019-11-09-gpt2-irish-spaceless-50medley-topp0.99.mp3
- /doc/ai/music/2019-10-23-gwern-gpt2-folkrnn-irishmusic-samples.txt
- http://mtg.upf.edu/system/files/publications/Font-Roma-Serra-ACMM-2013.pdf
- https://blog.metabrainz.org/2022/02/16/acousticbrainz-making-a-hard-decision-to-end-the-project/
- https://blog.research.google/2023/01/google-research-2022-beyond-language.html
- https://blog.youtube/inside-youtube/ai-and-music-experiment/
- https://colinmeloy.substack.com/p/i-had-chatgpt-write-a-decemberists
- https://deepmind.google/discover/blog/transforming-the-future-of-music-creation/
- https://pitchfork.com/features/article/ai-music-experimentation-or-automation/
- https://twitter.com/vatsal_aggarwal/status/1612536555708743680
- https://www.404media.co/harry-styles-one-direction-ai-leaked-songs/
- https://www.danieldjohnson.com/2015/08/03/composing-music-with-recurrent-neural-networks/
- https://www.engadget.com/drew-carey-made-a-radio-show-with-ai-fans-werent-pleased-143014038.html
- https://www.karolpiczak.com/papers/Piczak2015-ESC-Dataset.pdf
- https://www.nytimes.com/2020/01/07/magazine/hologram-musicians.html
- https://www.vice.com/en/article/k7z8be/torswats-computer-generated-ai-voice-swatting
Link Bibliography
- https://arxiv.org/abs/2305.09636#google : “SoundStorm: Efficient Parallel Audio Generation”, Zalán Borsos, Matt Sharifi, Damien Vincent, Eugene Kharitonov, Neil Zeghidour, Marco Tagliasacchi
- https://arxiv.org/abs/2305.05665#facebook : “ImageBind: One Embedding Space To Bind Them All”, Rohit Girdhar, Alaaeldin El-Nouby, Zhuang Liu, Mannat Singh, Kalyan Vasudev Alwala, Armand Joulin, Ishan Misra
- https://arxiv.org/abs/2304.13731 : “TANGO: Text-to-Audio Generation Using Instruction-Tuned LLM and Latent Diffusion Model”, Deepanway Ghosal, Navonil Majumder, Ambuj Mehrish, Soujanya Poria
- https://raw.githubusercontent.com/flavioschneider/master-thesis/main/audio_diffusion_thesis.pdf : “Archisound: Audio Generation With Diffusion”, Flavio Schneider
- https://arxiv.org/abs/2301.02111#microsoft : “VALL-E: Neural Codec Language Models Are Zero-Shot Text to Speech Synthesizers”
- https://arxiv.org/abs/2210.13438#facebook : “High Fidelity Neural Audio Compression”, Alexandre Défossez, Jade Copet, Gabriel Synnaeve, Yossi Adi
- https://arxiv.org/abs/2210.07508#sony : “Hierarchical Diffusion Models for Singing Voice Neural Vocoder”, Naoya Takahashi, Mayank Kumar Singh, Yuki Mitsufuji
- 2022-shank.pdf : “AI Composer Bias: Listeners like Music Less When They Think It Was Composed by an AI”, Daniel B. Shank, Courtney Stefanik, Cassidy Stuhlsatz, Kaelyn Kacirek, Amy M. Belfi
- https://arxiv.org/abs/2206.04658#nvidia : “BigVGAN: A Universal Neural Vocoder With Large-Scale Training”, Sang-gil Lee, Wei Ping, Boris Ginsburg, Bryan Catanzaro, Sungroh Yoon
- https://arxiv.org/abs/2202.09729 : “It’s Raw! Audio Generation With State-Space Models”, Karan Goel, Albert Gu, Chris Donahue, Christopher Ré
- https://arxiv.org/abs/2202.07765#deepmind : “General-purpose, Long-context Autoregressive Modeling With Perceiver AR”
- https://arxiv.org/abs/2106.13043 : “AudioCLIP: Extending CLIP to Image, Text and Audio”, Andrey Guzhov, Federico Raue, Jörn Hees, Andreas Dengel
- https://fifteen.ai/ : “15.ai”, Fifteen-kun, Pony Preservation Project
- gpt-2-preference-learning : “GPT-2 Preference Learning for Music Generation”, Gwern
- gpt-2-music : “GPT-2 Folk Music”, Gwern Branwen, Shawn Presser
- https://openai.com/research/musenet : “MuseNet: a Deep Neural Network That Can Generate 4-minute Musical Compositions With 10 Different Instruments, and Can Combine Styles from Country to Mozart to the Beatles”, Christine Payne
- https://magenta.tensorflow.org/music-transformer : “Music Transformer: Generating Music With Long-Term Structure”, Cheng-Zhi Anna Huang, Ian Simon, Monica Dinculescu
- https://arxiv.org/abs/1811.02155 : “FloWaveNet: A Generative Flow for Raw Audio”, Sungwon Kim, Sang-gil Lee, Jongyoon Song, Jaehyeon Kim, Sungroh Yoon
- 2018-huang.pdf : “Generating Structured Music through Self-Attention”