Literatura académica sobre el tema "Generative audio models"
Crea una cita precisa en los estilos APA, MLA, Chicago, Harvard y otros
Consulte las listas temáticas de artículos, libros, tesis, actas de conferencias y otras fuentes académicas sobre el tema "Generative audio models".
Junto a cada fuente en la lista de referencias hay un botón "Agregar a la bibliografía". Pulsa este botón, y generaremos automáticamente la referencia bibliográfica para la obra elegida en el estilo de cita que necesites: APA, MLA, Harvard, Vancouver, Chicago, etc.
También puede descargar el texto completo de la publicación académica en formato pdf y leer en línea su resumen siempre que esté disponible en los metadatos.
Artículos de revistas sobre el tema "Generative audio models"
Evans, Zach, Scott H. Hawley, and Katherine Crowson. "Musical audio samples generated from joint text embeddings." Journal of the Acoustical Society of America 152, no. 4 (2022): A178. http://dx.doi.org/10.1121/10.0015956.
Texto completoKang, Hyunju, Geonhee Han, Yoonjae Jeong, and Hogun Park. "AudioGenX: Explainability on Text-to-Audio Generative Models." Proceedings of the AAAI Conference on Artificial Intelligence 39, no. 17 (2025): 17733–41. https://doi.org/10.1609/aaai.v39i17.33950.
Texto completoSamson, Grzegorz. "Perspectives on Generative Sound Design: A Generative Soundscapes Showcase." Arts 14, no. 3 (2025): 67. https://doi.org/10.3390/arts14030067.
Texto completoJeong, Yujin, Yunji Kim, Sanghyuk Chun, and Jiyoung Lee. "Read, Watch and Scream! Sound Generation from Text and Video." Proceedings of the AAAI Conference on Artificial Intelligence 39, no. 17 (2025): 17590–98. https://doi.org/10.1609/aaai.v39i17.33934.
Texto completoWang, Heng, Jianbo Ma, Santiago Pascual, Richard Cartwright, and Weidong Cai. "V2A-Mapper: A Lightweight Solution for Vision-to-Audio Generation by Connecting Foundation Models." Proceedings of the AAAI Conference on Artificial Intelligence 38, no. 14 (2024): 15492–501. http://dx.doi.org/10.1609/aaai.v38i14.29475.
Texto completoJi, Wenliang, Ming Jin, and Yixin Chen. "Optimization of Digital Media Content Generation and Communication Effect Combined with Deep Learning Technology." Journal of Combinatorial Mathematics and Combinatorial Computing 127a (April 15, 2025): 1449–66. https://doi.org/10.61091/jcmcc127a-084.
Texto completoSakirin, Tam, and Siddartha Kusuma. "A Survey of Generative Artificial Intelligence Techniques." Babylonian Journal of Artificial Intelligence 2023 (March 10, 2023): 10–14. http://dx.doi.org/10.58496/bjai/2023/003.
Texto completoBroad, Terence, Frederic Fol Leymarie, and Mick Grierson. "Network Bending: Expressive Manipulation of Generative Models in Multiple Domains." Entropy 24, no. 1 (2021): 28. http://dx.doi.org/10.3390/e24010028.
Texto completoCao, Yongnian, Xuechun Yang, and Rui Sun. "Generative AI Models Theoretical Foundations and Algorithmic Practices." Journal of Industrial Engineering and Applied Science 3, no. 1 (2025): 1–9. https://doi.org/10.70393/6a69656173.323633.
Texto completoAldausari, Nuha, Arcot Sowmya, Nadine Marcus, and Gelareh Mohammadi. "Video Generative Adversarial Networks: A Review." ACM Computing Surveys 55, no. 2 (2023): 1–25. http://dx.doi.org/10.1145/3487891.
Texto completoTesis sobre el tema "Generative audio models"
Douwes, Constance. "On the Environmental Impact of Deep Generative Models for Audio." Electronic Thesis or Diss., Sorbonne université, 2023. http://www.theses.fr/2023SORUS074.
Texto completoCaillon, Antoine. "Hierarchical temporal learning for multi-instrument and orchestral audio synthesis." Electronic Thesis or Diss., Sorbonne université, 2023. http://www.theses.fr/2023SORUS115.
Texto completoNishikimi, Ryo. "Generative, Discriminative, and Hybrid Approaches to Audio-to-Score Automatic Singing Transcription." Doctoral thesis, Kyoto University, 2021. http://hdl.handle.net/2433/263772.
Texto completoCHEMLA, ROMEU SANTOS AXEL CLAUDE ANDRE'. "MANIFOLD REPRESENTATIONS OF MUSICAL SIGNALS AND GENERATIVE SPACES." Doctoral thesis, Università degli Studi di Milano, 2020. http://hdl.handle.net/2434/700444.
Texto completoGuenebaut, Boris. "Automatic Subtitle Generation for Sound in Videos." Thesis, University West, Department of Economics and IT, 2009. http://urn.kb.se/resolve?urn=urn:nbn:se:hv:diva-1784.
Texto completoScarlato, Michele. "Sicurezza di rete, analisi del traffico e monitoraggio." Master's thesis, Alma Mater Studiorum - Università di Bologna, 2012. http://amslaurea.unibo.it/3223/.
Texto completoMehri, Soroush. "Sequential modeling, generative recurrent neural networks, and their applications to audio." Thèse, 2016. http://hdl.handle.net/1866/18762.
Texto completoLibros sobre el tema "Generative audio models"
Osipov, Vladimir. Control and audit of the activities of a commercial organization: external and internal. INFRA-M Academic Publishing LLC., 2021. http://dx.doi.org/10.12737/1137320.
Texto completoKazimagomedov, Abdulla, Aida Abdulsalamova, M. Mel'nikov, and N. Gadzhiev. Analysis of the activities of a commercial bank. INFRA-M Academic Publishing LLC., 2022. http://dx.doi.org/10.12737/1831614.
Texto completoNikiforova, Elena, Lyudmila Kupriyanova, and Ol'ga Shnayder. Management accounting and analysis. INFRA-M Academic Publishing LLC., 2025. https://doi.org/10.12737/2122904.
Texto completoColmeiro, José. Peripheral Visions / Global Sounds. Liverpool University Press, 2018. http://dx.doi.org/10.5949/liverpool/9781786940308.001.0001.
Texto completoAguayo, Angela J. Documentary Resistance. Oxford University Press, 2019. http://dx.doi.org/10.1093/oso/9780190676216.001.0001.
Texto completoCapítulos de libros sobre el tema "Generative audio models"
Huzaifah, Muhammad, and Lonce Wyse. "Deep Generative Models for Musical Audio Synthesis." In Handbook of Artificial Intelligence for Music. Springer International Publishing, 2021. http://dx.doi.org/10.1007/978-3-030-72116-9_22.
Texto completoGallagher, Sean, Ben Gelman, Salma Taoufiq, et al. "Phishing and Social Engineering in the Age of LLMs." In Large Language Models in Cybersecurity. Springer Nature Switzerland, 2024. http://dx.doi.org/10.1007/978-3-031-54827-7_8.
Texto completoBoll, Antônio Oss, Letícia Maria Puttlitz, Heloísa Oss Boll, and Rodrigo Mor Malossi. "Beyond Audio Signals: Generative Model-Based Speaker Diarization in Portuguese." In Lecture Notes in Computer Science. Springer Nature Switzerland, 2025. https://doi.org/10.1007/978-3-031-79029-4_17.
Texto completoYe, Sheng, Yu-Hui Wen, Yanan Sun, et al. "Audio-Driven Stylized Gesture Generation with Flow-Based Model." In Lecture Notes in Computer Science. Springer Nature Switzerland, 2022. http://dx.doi.org/10.1007/978-3-031-20065-6_41.
Texto completoWyse, Lonce, Purnima Kamath, and Chitralekha Gupta. "Sound Model Factory: An Integrated System Architecture for Generative Audio Modelling." In Artificial Intelligence in Music, Sound, Art and Design. Springer International Publishing, 2022. http://dx.doi.org/10.1007/978-3-031-03789-4_20.
Texto completoFarkas, Michal, and Peter Lacko. "Using Advanced Audio Generating Techniques to Model Electrical Energy Load." In Engineering Applications of Neural Networks. Springer International Publishing, 2017. http://dx.doi.org/10.1007/978-3-319-65172-9_4.
Texto completoGolani, Mati, and Shlomit S. Pinter. "Generating a Process Model from a Process Audit Log." In Lecture Notes in Computer Science. Springer Berlin Heidelberg, 2003. http://dx.doi.org/10.1007/3-540-44895-0_10.
Texto completoMa, Bin, Weixun Li, Huifeng Li, et al. "Generate Unnoticeable Adversarial Examples on Audio Classification Models with Multi-perspective Objectives." In Lecture Notes in Networks and Systems. Springer Nature Switzerland, 2024. https://doi.org/10.1007/978-3-031-74443-3_20.
Texto completode Berardinis, Jacopo, Valentina Anita Carriero, Nitisha Jain, et al. "The Polifonia Ontology Network: Building a Semantic Backbone for Musical Heritage." In The Semantic Web – ISWC 2023. Springer Nature Switzerland, 2023. http://dx.doi.org/10.1007/978-3-031-47243-5_17.
Texto completoKim, Sang-Kyun, Doo Sun Hwang, Ji-Yeun Kim, and Yang-Seock Seo. "An Effective News Anchorperson Shot Detection Method Based on Adaptive Audio/Visual Model Generation." In Lecture Notes in Computer Science. Springer Berlin Heidelberg, 2005. http://dx.doi.org/10.1007/11526346_31.
Texto completoActas de conferencias sobre el tema "Generative audio models"
Roman, Robin San, Pierre Fernandez, Antoine Deleforge, Yossi Adi, and Romain Serizel. "Latent Watermarking of Audio Generative Models." In ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2025. https://doi.org/10.1109/icassp49660.2025.10889782.
Texto completoAkman, Alican, Qiyang Sun, and Björn W. Schuller. "Audio Explanation Synthesis with Generative Foundation Models." In ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2025. https://doi.org/10.1109/icassp49660.2025.10890370.
Texto completoYang, Qian, Jin Xu, Wenrui Liu, et al. "AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension." In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, 2024. http://dx.doi.org/10.18653/v1/2024.acl-long.109.
Texto completoHeydari, Mojtaba, Mehrez Souden, Bruno Conejo, and Joshua Atkins. "ImmerseDiffusion: A Generative Spatial Audio Latent Diffusion Model." In ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2025. https://doi.org/10.1109/icassp49660.2025.10889311.
Texto completoKushwaha, Saksham Singh, Jianbo Ma, Mark R. P. Thomas, Yapeng Tian, and Avery Bruni. "Diff-SAGe: End-to-End Spatial Audio Generation Using Diffusion Models." In ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2025. https://doi.org/10.1109/icassp49660.2025.10888882.
Texto completoLiang, Yiwei, and Ming Li. "Vivid Background Audio Generation Based on Large Language Models and AudioLDM." In 2024 IEEE 14th International Symposium on Chinese Spoken Language Processing (ISCSLP). IEEE, 2024. https://doi.org/10.1109/iscslp63861.2024.10800334.
Texto completoGao, Yiming. "A systematic research of text-to-audio generation with diffusion models." In Fifth International Conference on Signal Processing and Computer Science (SPCS 2024), edited by Haiquan Zhao and Lei Chen. SPIE, 2025. https://doi.org/10.1117/12.3053123.
Texto completoKyaw, Kaung Myat, and Jonathan Hoyin Chan. "A Framework for Synthetic Audio Conversations Generation Using Large Language Models." In 2024 IEEE/WIC International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT). IEEE, 2024. https://doi.org/10.1109/wi-iat62293.2024.00056.
Texto completoLi, Jiaqi, Dongmei Wang, Xiaofei Wang, et al. "Investigating Neural Audio Codecs For Speech Language Model-Based Speech Generation." In 2024 IEEE Spoken Language Technology Workshop (SLT). IEEE, 2024. https://doi.org/10.1109/slt61566.2024.10832266.
Texto completoYang, Jie, and Feilong Bao. "MDG:Multilingual Co-speech Gesture Generation with Low-level Audio Representation and Diffusion Models." In 2024 International Conference on Asian Language Processing (IALP). IEEE, 2024. http://dx.doi.org/10.1109/ialp63756.2024.10661182.
Texto completoInformes sobre el tema "Generative audio models"
Decleir, Cyril, Mohand-Saïd Hacid, and Jacques Kouloumdjian. A Database Approach for Modeling and Querying Video Data. Aachen University of Technology, 1999. http://dx.doi.org/10.25368/2022.90.
Texto completoVakaliuk, Tetiana A., Valerii V. Kontsedailo, Dmytro S. Antoniuk, Olha V. Korotun, Iryna S. Mintii, and Andrey V. Pikilnyak. Using game simulator Software Inc in the Software Engineering education. [б. в.], 2020. http://dx.doi.org/10.31812/123456789/3762.
Texto completo