Academic literature on the topic 'Generative audio models'
Consult the lists of relevant articles, books, theses, conference reports, and other scholarly sources on the topic 'Generative audio models.'
Journal articles on the topic "Generative audio models"
Evans, Zach, Scott H. Hawley, and Katherine Crowson. "Musical audio samples generated from joint text embeddings." Journal of the Acoustical Society of America 152, no. 4 (2022): A178. http://dx.doi.org/10.1121/10.0015956.
Kang, Hyunju, Geonhee Han, Yoonjae Jeong, and Hogun Park. "AudioGenX: Explainability on Text-to-Audio Generative Models." Proceedings of the AAAI Conference on Artificial Intelligence 39, no. 17 (2025): 17733–41. https://doi.org/10.1609/aaai.v39i17.33950.
Samson, Grzegorz. "Perspectives on Generative Sound Design: A Generative Soundscapes Showcase." Arts 14, no. 3 (2025): 67. https://doi.org/10.3390/arts14030067.
Jeong, Yujin, Yunji Kim, Sanghyuk Chun, and Jiyoung Lee. "Read, Watch and Scream! Sound Generation from Text and Video." Proceedings of the AAAI Conference on Artificial Intelligence 39, no. 17 (2025): 17590–98. https://doi.org/10.1609/aaai.v39i17.33934.
Wang, Heng, Jianbo Ma, Santiago Pascual, Richard Cartwright, and Weidong Cai. "V2A-Mapper: A Lightweight Solution for Vision-to-Audio Generation by Connecting Foundation Models." Proceedings of the AAAI Conference on Artificial Intelligence 38, no. 14 (2024): 15492–501. http://dx.doi.org/10.1609/aaai.v38i14.29475.
Ji, Wenliang, Ming Jin, and Yixin Chen. "Optimization of Digital Media Content Generation and Communication Effect Combined with Deep Learning Technology." Journal of Combinatorial Mathematics and Combinatorial Computing 127a (April 15, 2025): 1449–66. https://doi.org/10.61091/jcmcc127a-084.
Sakirin, Tam, and Siddartha Kusuma. "A Survey of Generative Artificial Intelligence Techniques." Babylonian Journal of Artificial Intelligence 2023 (March 10, 2023): 10–14. http://dx.doi.org/10.58496/bjai/2023/003.
Broad, Terence, Frederic Fol Leymarie, and Mick Grierson. "Network Bending: Expressive Manipulation of Generative Models in Multiple Domains." Entropy 24, no. 1 (2021): 28. http://dx.doi.org/10.3390/e24010028.
Cao, Yongnian, Xuechun Yang, and Rui Sun. "Generative AI Models Theoretical Foundations and Algorithmic Practices." Journal of Industrial Engineering and Applied Science 3, no. 1 (2025): 1–9. https://doi.org/10.70393/6a69656173.323633.
Aldausari, Nuha, Arcot Sowmya, Nadine Marcus, and Gelareh Mohammadi. "Video Generative Adversarial Networks: A Review." ACM Computing Surveys 55, no. 2 (2023): 1–25. http://dx.doi.org/10.1145/3487891.
Dissertations / Theses on the topic "Generative audio models"
Douwes, Constance. "On the Environmental Impact of Deep Generative Models for Audio." Electronic Thesis or Diss., Sorbonne université, 2023. http://www.theses.fr/2023SORUS074.
Caillon, Antoine. "Hierarchical temporal learning for multi-instrument and orchestral audio synthesis." Electronic Thesis or Diss., Sorbonne université, 2023. http://www.theses.fr/2023SORUS115.
Nishikimi, Ryo. "Generative, Discriminative, and Hybrid Approaches to Audio-to-Score Automatic Singing Transcription." Doctoral thesis, Kyoto University, 2021. http://hdl.handle.net/2433/263772.
Chemla Romeu Santos, Axel Claude André. "Manifold Representations of Musical Signals and Generative Spaces." Doctoral thesis, Università degli Studi di Milano, 2020. http://hdl.handle.net/2434/700444.
Guenebaut, Boris. "Automatic Subtitle Generation for Sound in Videos." Thesis, University West, Department of Economics and IT, 2009. http://urn.kb.se/resolve?urn=urn:nbn:se:hv:diva-1784.
Scarlato, Michele. "Sicurezza di rete, analisi del traffico e monitoraggio." Master's thesis, Alma Mater Studiorum - Università di Bologna, 2012. http://amslaurea.unibo.it/3223/.
Mehri, Soroush. "Sequential modeling, generative recurrent neural networks, and their applications to audio." Thesis, 2016. http://hdl.handle.net/1866/18762.
Books on the topic "Generative audio models"
Osipov, Vladimir. Control and audit of the activities of a commercial organization: external and internal. INFRA-M Academic Publishing LLC., 2021. http://dx.doi.org/10.12737/1137320.
Kazimagomedov, Abdulla, Aida Abdulsalamova, M. Mel'nikov, and N. Gadzhiev. Analysis of the activities of a commercial bank. INFRA-M Academic Publishing LLC., 2022. http://dx.doi.org/10.12737/1831614.
Nikiforova, Elena, Lyudmila Kupriyanova, and Ol'ga Shnayder. Management accounting and analysis. INFRA-M Academic Publishing LLC., 2025. https://doi.org/10.12737/2122904.
Colmeiro, José. Peripheral Visions / Global Sounds. Liverpool University Press, 2018. http://dx.doi.org/10.5949/liverpool/9781786940308.001.0001.
Aguayo, Angela J. Documentary Resistance. Oxford University Press, 2019. http://dx.doi.org/10.1093/oso/9780190676216.001.0001.
Book chapters on the topic "Generative audio models"
Huzaifah, Muhammad, and Lonce Wyse. "Deep Generative Models for Musical Audio Synthesis." In Handbook of Artificial Intelligence for Music. Springer International Publishing, 2021. http://dx.doi.org/10.1007/978-3-030-72116-9_22.
Gallagher, Sean, Ben Gelman, Salma Taoufiq, et al. "Phishing and Social Engineering in the Age of LLMs." In Large Language Models in Cybersecurity. Springer Nature Switzerland, 2024. http://dx.doi.org/10.1007/978-3-031-54827-7_8.
Boll, Antônio Oss, Letícia Maria Puttlitz, Heloísa Oss Boll, and Rodrigo Mor Malossi. "Beyond Audio Signals: Generative Model-Based Speaker Diarization in Portuguese." In Lecture Notes in Computer Science. Springer Nature Switzerland, 2025. https://doi.org/10.1007/978-3-031-79029-4_17.
Ye, Sheng, Yu-Hui Wen, Yanan Sun, et al. "Audio-Driven Stylized Gesture Generation with Flow-Based Model." In Lecture Notes in Computer Science. Springer Nature Switzerland, 2022. http://dx.doi.org/10.1007/978-3-031-20065-6_41.
Wyse, Lonce, Purnima Kamath, and Chitralekha Gupta. "Sound Model Factory: An Integrated System Architecture for Generative Audio Modelling." In Artificial Intelligence in Music, Sound, Art and Design. Springer International Publishing, 2022. http://dx.doi.org/10.1007/978-3-031-03789-4_20.
Farkas, Michal, and Peter Lacko. "Using Advanced Audio Generating Techniques to Model Electrical Energy Load." In Engineering Applications of Neural Networks. Springer International Publishing, 2017. http://dx.doi.org/10.1007/978-3-319-65172-9_4.
Golani, Mati, and Shlomit S. Pinter. "Generating a Process Model from a Process Audit Log." In Lecture Notes in Computer Science. Springer Berlin Heidelberg, 2003. http://dx.doi.org/10.1007/3-540-44895-0_10.
Ma, Bin, Weixun Li, Huifeng Li, et al. "Generate Unnoticeable Adversarial Examples on Audio Classification Models with Multi-perspective Objectives." In Lecture Notes in Networks and Systems. Springer Nature Switzerland, 2024. https://doi.org/10.1007/978-3-031-74443-3_20.
de Berardinis, Jacopo, Valentina Anita Carriero, Nitisha Jain, et al. "The Polifonia Ontology Network: Building a Semantic Backbone for Musical Heritage." In The Semantic Web – ISWC 2023. Springer Nature Switzerland, 2023. http://dx.doi.org/10.1007/978-3-031-47243-5_17.
Kim, Sang-Kyun, Doo Sun Hwang, Ji-Yeun Kim, and Yang-Seock Seo. "An Effective News Anchorperson Shot Detection Method Based on Adaptive Audio/Visual Model Generation." In Lecture Notes in Computer Science. Springer Berlin Heidelberg, 2005. http://dx.doi.org/10.1007/11526346_31.
Conference papers on the topic "Generative audio models"
Roman, Robin San, Pierre Fernandez, Antoine Deleforge, Yossi Adi, and Romain Serizel. "Latent Watermarking of Audio Generative Models." In ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2025. https://doi.org/10.1109/icassp49660.2025.10889782.
Akman, Alican, Qiyang Sun, and Björn W. Schuller. "Audio Explanation Synthesis with Generative Foundation Models." In ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2025. https://doi.org/10.1109/icassp49660.2025.10890370.
Yang, Qian, Jin Xu, Wenrui Liu, et al. "AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension." In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, 2024. http://dx.doi.org/10.18653/v1/2024.acl-long.109.
Heydari, Mojtaba, Mehrez Souden, Bruno Conejo, and Joshua Atkins. "ImmerseDiffusion: A Generative Spatial Audio Latent Diffusion Model." In ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2025. https://doi.org/10.1109/icassp49660.2025.10889311.
Kushwaha, Saksham Singh, Jianbo Ma, Mark R. P. Thomas, Yapeng Tian, and Avery Bruni. "Diff-SAGe: End-to-End Spatial Audio Generation Using Diffusion Models." In ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2025. https://doi.org/10.1109/icassp49660.2025.10888882.
Liang, Yiwei, and Ming Li. "Vivid Background Audio Generation Based on Large Language Models and AudioLDM." In 2024 IEEE 14th International Symposium on Chinese Spoken Language Processing (ISCSLP). IEEE, 2024. https://doi.org/10.1109/iscslp63861.2024.10800334.
Gao, Yiming. "A systematic research of text-to-audio generation with diffusion models." In Fifth International Conference on Signal Processing and Computer Science (SPCS 2024), edited by Haiquan Zhao and Lei Chen. SPIE, 2025. https://doi.org/10.1117/12.3053123.
Kyaw, Kaung Myat, and Jonathan Hoyin Chan. "A Framework for Synthetic Audio Conversations Generation Using Large Language Models." In 2024 IEEE/WIC International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT). IEEE, 2024. https://doi.org/10.1109/wi-iat62293.2024.00056.
Li, Jiaqi, Dongmei Wang, Xiaofei Wang, et al. "Investigating Neural Audio Codecs For Speech Language Model-Based Speech Generation." In 2024 IEEE Spoken Language Technology Workshop (SLT). IEEE, 2024. https://doi.org/10.1109/slt61566.2024.10832266.
Yang, Jie, and Feilong Bao. "MDG: Multilingual Co-speech Gesture Generation with Low-level Audio Representation and Diffusion Models." In 2024 International Conference on Asian Language Processing (IALP). IEEE, 2024. http://dx.doi.org/10.1109/ialp63756.2024.10661182.
Reports on the topic "Generative audio models"
Decleir, Cyril, Mohand-Saïd Hacid, and Jacques Kouloumdjian. A Database Approach for Modeling and Querying Video Data. Aachen University of Technology, 1999. http://dx.doi.org/10.25368/2022.90.
Vakaliuk, Tetiana A., Valerii V. Kontsedailo, Dmytro S. Antoniuk, Olha V. Korotun, Iryna S. Mintii, and Andrey V. Pikilnyak. Using game simulator Software Inc in the Software Engineering education. [n.p.], 2020. http://dx.doi.org/10.31812/123456789/3762.