Academic literature on the topic 'Automated audio captioning'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the lists of relevant articles, books, theses, conference reports, and other scholarly sources on the topic 'Automated audio captioning.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Journal articles on the topic "Automated audio captioning"
Bokhove, Christian, and Christopher Downey. "Automated generation of ‘good enough’ transcripts as a first step to transcription of audio-recorded data." Methodological Innovations 11, no. 2 (2018): 205979911879074. http://dx.doi.org/10.1177/2059799118790743.
Full textP. Jayanth, K. Lakshmi Sree, K. Karthik Kumar Reddy, G. Om Prakash, and G. Reddy Prasad. "Vision-to-Voice: AI for generating Description & Audio of Visual Content." International Research Journal of Innovations in Engineering and Technology 09, Special Issue ICCIS (2025): 206–13. https://doi.org/10.47001/irjiet/2025.iccis-202533.
Full textSejal Pawar, Shruti Mulay, Jivani Suryawanshi, Vaishnavi Walgude, and Prof. K. V. Patil. "Enhancing Traffic Scene and Understanding Through Image Captioning and Audio." International Research Journal on Advanced Engineering and Management (IRJAEM) 6, no. 07 (2024): 2423–29. http://dx.doi.org/10.47392/irjaem.2024.0349.
Full textHaapaniemi, Riku, Annamaria Mesaros, Manu Harju, Irene Martín Morató, and Maija Hirvonen. "Primerjava semiotične konceptualizacije prevoda z besedilom, ki ga tvori UI." STRIDON: Journal of Studies in Translation and Interpreting 4, no. 1 (2024): 25–51. http://dx.doi.org/10.4312/stridon.4.1.25-51.
Full textSankalp, Kala, and Sridhar Ranganathan Prof. "Deep Learning Based Lipreading for Video Captioning." Engineering and Technology Journal 9, no. 05 (2024): 3935–46. https://doi.org/10.5281/zenodo.11120548.
Full textKoenecke, Allison, Andrew Nam, Emily Lake, et al. "Racial disparities in automated speech recognition." Proceedings of the National Academy of Sciences 117, no. 14 (2020): 7684–89. http://dx.doi.org/10.1073/pnas.1915768117.
Full textMirzaei, Maryam Sadat, Kourosh Meshgi, Yuya Akita, and Tatsuya Kawahara. "Partial and synchronized captioning: A new tool to assist learners in developing second language listening skill." ReCALL 29, no. 2 (2017): 178–99. http://dx.doi.org/10.1017/s0958344017000039.
Full textGuo, Rundong. "Advancing real-time close captioning: blind source separation and transcription for hearing impairments." Applied and Computational Engineering 30, no. 1 (2024): 125–30. http://dx.doi.org/10.54254/2755-2721/30/20230084.
Full textPrabhala, Jagat Chaitanya, Venkatnareshbabu K, and Ragoju Ravi. "OPTIMIZING SIMILARITY THRESHOLD FOR ABSTRACT SIMILARITY METRIC IN SPEECH DIARIZATION SYSTEMS: A MATHEMATICAL FORMULATION." Applied Mathematics and Sciences An International Journal (MathSJ) 10, no. 1/2 (2023): 1–10. http://dx.doi.org/10.5121/mathsj.2023.10201.
Full textNam, Somang, and Deborah Fels. "Simulation of Subjective Closed Captioning Quality Assessment Using Prediction Models." International Journal of Semantic Computing 13, no. 01 (2019): 45–65. http://dx.doi.org/10.1142/s1793351x19400038.
Full textDissertations / Theses on the topic "Automated audio captioning"
Labbé, Etienne. "Description automatique des événements sonores par des méthodes d'apprentissage profond." Electronic Thesis or Diss., Université de Toulouse (2023-....), 2024. http://www.theses.fr/2024TLSES054.
Full textBook chapters on the topic "Automated audio captioning"
M., Nivedita, AsnathVictyPhamila Y., Umashankar Kumaravelan, and Karthikeyan N. "Voice-Based Image Captioning System for Assisting Visually Impaired People Using Neural Networks." In Principles and Applications of Socio-Cognitive and Affective Computing. IGI Global, 2022. http://dx.doi.org/10.4018/978-1-6684-3843-5.ch011.
Full textVenturini, Shamira, Michaela Mae Vann, Martina Pucci, and Giulia M. L. Bencini. "Towards a More Inclusive Learning Environment: The Importance of Providing Captions That Are Suited to Learners’ Language Proficiency in the UDL Classroom." In Studies in Health Technology and Informatics. IOS Press, 2022. http://dx.doi.org/10.3233/shti220884.
Full textConference papers on the topic "Automated audio captioning"
Tan, Liwen, Yi Zhou, Yin Liu, and Wang Chen. "Enhanced Automated Audio Captioning Method Based on Heterogeneous Feature Fusion." In 2024 IEEE International Conference on Software System and Information Processing (ICSSIP). IEEE, 2024. https://doi.org/10.1109/icssip63203.2024.11012230.
Full textKim, Minkyu, Kim Sung-Bin, and Tae-Hyun Oh. "Prefix Tuning for Automated Audio Captioning." In ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2023. http://dx.doi.org/10.1109/icassp49357.2023.10096877.
Full textDrossos, Konstantinos, Sharath Adavanne, and Tuomas Virtanen. "Automated audio captioning with recurrent neural networks." In 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA). IEEE, 2017. http://dx.doi.org/10.1109/waspaa.2017.8170058.
Full textChen, Chen, Nana Hou, Yuchen Hu, Heqing Zou, Xiaofeng Qi, and Eng Siong Chng. "Interactive Auido-text Representation for Automated Audio Captioning with Contrastive Learning." In Interspeech 2022. ISCA, 2022. http://dx.doi.org/10.21437/interspeech.2022-10510.
Full textLiu, Weizhuo, and Zhe Gao. "Improving Automated Audio Captioning with LLM Decoder and BEATs Audio Encoder." In CSAI 2024: 2024 8th International Conference on Computer Science and Artificial Intelligence (CSAI). ACM, 2024. https://doi.org/10.1145/3709026.3709118.
Full textKim, Jaeyeon, Jaeyoon Jung, Jinjoo Lee, and Sang Hoon Woo. "EnCLAP: Combining Neural Audio Codec and Audio-Text Joint Embedding for Automated Audio Captioning." In ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2024. http://dx.doi.org/10.1109/icassp48485.2024.10446672.
Full textLiu, Jizhong, Gang Li, Junbo Zhang, et al. "Enhancing Automated Audio Captioning via Large Language Models with Optimized Audio Encoding." In Interspeech 2024. ISCA, 2024. http://dx.doi.org/10.21437/interspeech.2024-65.
Full textYe, Zhongjie, Yuqing Wang, Helin Wang, Dongchao Yang, and Yuexian Zou. "FeatureCut: An Adaptive Data Augmentation for Automated Audio Captioning." In 2022 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC). IEEE, 2022. http://dx.doi.org/10.23919/apsipaasc55919.2022.9980325.
Full textKoh, Andrew, Soham Tiwari, and Chng Eng Siong. "Automated Audio Captioning with Epochal Difficult Captions for curriculum learning." In 2022 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC). IEEE, 2022. http://dx.doi.org/10.23919/apsipaasc55919.2022.9980242.
Full textWijngaard, Gijs, Elia Formisano, Bruno L. Giordano, and Michel Dumontier. "ACES: Evaluating Automated Audio Captioning Models on the Semantics of Sounds." In 2023 31st European Signal Processing Conference (EUSIPCO). IEEE, 2023. http://dx.doi.org/10.23919/eusipco58844.2023.10289793.
Full text