Journal articles on the topic 'Tacotron-2'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the top 29 journal articles for your research on the topic 'Tacotron-2.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Browse journal articles on a wide variety of disciplines and organise your bibliography correctly.
Liu, Yifan, and Jin Zheng. "Es-Tacotron2: Multi-Task Tacotron 2 with Pre-Trained Estimated Network for Reducing the Over-Smoothness Problem." Information 10, no. 4 (2019): 131. http://dx.doi.org/10.3390/info10040131.
Full textTran, Duc Chung. "The first FOSD-tacotron-2-based text-to-speech application for Vietnamese." Bulletin of Electrical Engineering and Informatics 10, no. 2 (2021): 898–903. http://dx.doi.org/10.11591/eei.v10i2.2539.
Full textTran, Duc Chung. "The First Vietnamese FOSD-Tacotron-2-based Text-to-Speech Model Dataset." Data in Brief 31 (August 2020): 105775. http://dx.doi.org/10.1016/j.dib.2020.105775.
Full textDuc, Chung Tran. "The first FOSD-tacotron-2-based text-to-speech application for Vietnamese." Bulletin of Electrical Engineering and Informatics 10, no. 2 (2021): 898~903. https://doi.org/10.11591/eei.v10i2.2539.
Full textRono, Kelvin Kiptoo, Ciira Wa Maina, and Elijah Mwangi. "Development of a Kiswahili Text-to-Speech System based on Tacotron 2 and Wave Net Vocoder." International Journal of Electrical and Electronics Engineering 10, no. 2 (2023): 75–83. http://dx.doi.org/10.14445/23488379/ijeee-v10i2p107.
Full textGarcía, Víctor, Inma Hernáez, and Eva Navas. "Evaluation of Tacotron Based Synthesizers for Spanish and Basque." Applied Sciences 12, no. 3 (2022): 1686. http://dx.doi.org/10.3390/app12031686.
Full textSirohi, Anant. "Research Paper on Text to Audio Converter using NLP." International Journal for Research in Applied Science and Engineering Technology 13, no. 5 (2025): 1313–16. https://doi.org/10.22214/ijraset.2025.70467.
Full textSavkova, Tatiana, Ivan Opirskyy, and Dmytro Sabodashko. "STUDYING THE RESISTANCE OF BIOMETRIC AUTHENTICATION SYSTEMS TO ATTACKS USING VOICE CLONING TECHNOLOGY BASED ON DEEP NEURAL NETWORKS." Cybersecurity: Education, Science, Technique 2, no. 26 (2024): 27–43. https://doi.org/10.28925/2663-4023.2024.26.670.
Full textAbuali, Batool, and Mohamad-Bassam Kurdy. "Full Diacritization of the Arabic Text to Improve Screen Readers for the Visually Impaired." Advances in Human-Computer Interaction 2022 (July 18, 2022): 1–9. http://dx.doi.org/10.1155/2022/1186678.
Full textOyucu, Saadin. "A Novel End-to-End Turkish Text-to-Speech (TTS) System via Deep Learning." Electronics 12, no. 8 (2023): 1900. http://dx.doi.org/10.3390/electronics12081900.
Full textZhang, Jing-Xuan, Korin Richmond, Zhen-Hua Ling, and Lirong Dai. "TaLNet: Voice Reconstruction from Tongue and Lip Articulation with Transfer Learning from Text-to-Speech Synthesis." Proceedings of the AAAI Conference on Artificial Intelligence 35, no. 16 (2021): 14402–10. http://dx.doi.org/10.1609/aaai.v35i16.17693.
Full textTiwari, Kartik. "Deep Learning Based TTS-STT Model with Transliteration for Indic Languages." International Journal for Research in Applied Science and Engineering Technology 9, no. 12 (2021): 2207–13. http://dx.doi.org/10.22214/ijraset.2021.39689.
Full textTorres Núñez del Prado, Paola. "AIELSON: A neural spoken-word poetry generator with a distinct South American voice." Journal of Interdisciplinary Voice Studies 7, no. 1 (2022): 11–33. http://dx.doi.org/10.1386/jivs_00052_1.
Full textGonzález-Docasal, Ander, and Aitor Álvarez. "Enhancing Voice Cloning Quality through Data Selection and Alignment-Based Metrics." Applied Sciences 13, no. 14 (2023): 8049. http://dx.doi.org/10.3390/app13148049.
Full textP. Jayanth, K. Lakshmi Sree, K. Karthik Kumar Reddy, G. Om Prakash, and G. Reddy Prasad. "Vision-to-Voice: AI for generating Description & Audio of Visual Content." International Research Journal of Innovations in Engineering and Technology 09, Special Issue ICCIS (2025): 206–13. https://doi.org/10.47001/irjiet/2025.iccis-202533.
Full textZhang, Lusheng, Shie Wu, and Zhongxun Wang. "Phoneme-Aware Hierarchical Augmentation and Semantic-Aware SpecAugment for Low-Resource Cantonese Speech Recognition." Sensors 25, no. 14 (2025): 4288. https://doi.org/10.3390/s25144288.
Full textSang, Songzhen, and Wanlin Li. "P‐3.10: Research on Key Technologies of Virtual Digital Human." SID Symposium Digest of Technical Papers 56, S1 (2025): 901–4. https://doi.org/10.1002/sdtp.18960.
Full textQiu, Zeyu, Jun Tang, Yaxin Zhang, Jiaxin Li, and Xishan Bai. "A Voice Cloning Method Based on the Improved HiFi-GAN Model." Computational Intelligence and Neuroscience 2022 (October 11, 2022): 1–12. http://dx.doi.org/10.1155/2022/6707304.
Full textGaldino, Julio Cesar, Ariadne Nascimento Matos, Flaviane Romani Fernandes Svartman, and Sandra Maria Aluisio. "The evaluation of prosody in speech synthesis: a systematic review." Journal of the Brazilian Computer Society 31, no. 1 (2025): 466–87. https://doi.org/10.5753/jbcs.2025.5468.
Full textLi, Naihan, Shujie Liu, Yanqing Liu, Sheng Zhao, and Ming Liu. "Neural Speech Synthesis with Transformer Network." Proceedings of the AAAI Conference on Artificial Intelligence 33 (July 17, 2019): 6706–13. http://dx.doi.org/10.1609/aaai.v33i01.33016706.
Full textAziz, Azrul Fahmi Abdul, Sabrina Tiun, and Noraini Ruslan. "End to End Text to Speech Synthesis for Malay Language using Tacotron and Tacotron 2." International Journal of Advanced Computer Science and Applications 14, no. 6 (2023). http://dx.doi.org/10.14569/ijacsa.2023.0140644.
Full textPatole, Prof Mrunalinee, Akhilesh Pandey, Kaustubh Bhagwat, Mukesh Vaishnav, and Salikram Chadar. "A Survey on “Text-to-Speech Systems for Real-Time Audio Synthesis”." International Journal of Advanced Research in Science, Communication and Technology, June 10, 2021, 375–79. http://dx.doi.org/10.48175/ijarsct-1400.
Full textChen, Lijiang, Jie Ren, Pengfei Chen, Xia Mao, and Qi Zhao. "Limited text speech synthesis with electroglottograph based on Bi-LSTM and modified Tacotron-2." Applied Intelligence, March 12, 2022. http://dx.doi.org/10.1007/s10489-021-03075-x.
Full textRono, Kelvin Kiptoo, Dr Ciira wa Maina, and Prof Elijah Mwangi. "Development of a Kiswahili Text-to-Speech System Based on Tacotron 2 and WaveNet Vocoder." SSRN Electronic Journal, 2022. http://dx.doi.org/10.2139/ssrn.4027431.
Full textSarasola, Xabier, Ander Corral, Igor Leturia, and Iñigo Morcillo. "Hizlari-bektore manipulazioaren bidezko genero-anbiguoko hizketaren sintesia euskaraz." EKAIA Euskal Herriko Unibertsitateko Zientzia eta Teknologia Aldizkaria, September 24, 2024. http://dx.doi.org/10.1387/ekaia.26334.
Full text"Cross-Language Speech Synthesis using Transfer Learning." REST Journal on Data Analytics and Artificial Intelligence 4, no. 1 March 2025 (2025): 631–35. https://doi.org/10.46632/jdaai/4/1/80.
Full textSatija, Ishita, Vina Lomte, Yash Wani, Digisha Kaneria, and Shubham Yadav. "Text-To-Speech Synthesis Using Transfer Learning." International Journal of Advanced Research in Science, Communication and Technology, April 9, 2021, 139–44. http://dx.doi.org/10.48175/ijarsct-956.
Full textBhuvan Shridhar and Barath M. "Autoregressive Speech-To-Text Alignment is a Critical Component of Neural Text-To-Speech (TTS) Models." International Journal of Scientific Research in Science, Engineering and Technology, December 5, 2022, 310–16. http://dx.doi.org/10.32628/ijsrset229643.
Full textNikoghosyan, K. H. "LEVERAGING PAUSE DETECTION FOR ENHANCED TTS DATASET GENERATION." Proceedings of National Polytechnic University of Armenia. INFORMATION TECHNOLOGIES, ELECTRONICS, RADIO ENGINEERING, 2024. https://doi.org/10.53297/18293336-2024.2-45.
Full text