Academic literature on the topic 'Speech-to-text (STT)'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the lists of relevant articles, books, theses, conference reports, and other scholarly sources on the topic 'Speech-to-text (STT).'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Journal articles on the topic "Speech-to-text (STT)"
Duc, Chung Tran, Long Nguyen Duc, and Fadzil Hassan Mohd. "Development and testing of an FPT.AI-based voicebot." Bulletin of Electrical Engineering and Informatics 9, no. 6 (2020): 2388–95. https://doi.org/10.11591/eei.v9i6.2620.
Full textJournal, IJSREM. "A Review on Speech-to-Text." INTERANTIONAL JOURNAL OF SCIENTIFIC RESEARCH IN ENGINEERING AND MANAGEMENT 08, no. 03 (2024): 1–13. http://dx.doi.org/10.55041/ijsrem29004.
Full textBarkovska, Olesia. "RESEARCH INTO SPEECH-TO-TEXT TRANFROMATION MODULE IN THE PROPOSED MODEL OF A SPEAKER’S AUTOMATIC SPEECH ANNOTATION." Innovative Technologies and Scientific Solutions for Industries, no. 4 (22) (December 31, 2022): 5–13. http://dx.doi.org/10.30837/itssi.2022.22.005.
Full textB, Mupini, Chaputsira S, and Sibanda Bk. "Survey on Speech to Text Modelling for the Shona Language." Survey on Speech to Text Modelling for the Shona Language 9, no. 1 (2024): 4. https://doi.org/10.5281/zenodo.10609671.
Full textYang, Hui Jae, Eun-Byel Oh, and Jung-Mee Kim. "Comparison of Automatic Speech Recognition System for School-aged Children’s Narratives: Naver Clova Speech and Google Speech-to-Text." Communication Sciences & Disorders 28, no. 1 (2023): 30–38. http://dx.doi.org/10.12963/csd.23952.
Full textDinata, Candra, Diyah Puspitaningrum, and Ernawati Erna. "IMPLEMENTASI TEKNIK DYNAMIC TIME WARPING (DTW) PADA APLIKASI SPEECH TO TEXT." JURNAL TEKNIK INFORMATIKA 10, no. 1 (2018): 49–58. http://dx.doi.org/10.15408/jti.v10i1.6816.
Full textG, Thimmaraja Yadava, G. Nagaraja B, Yogesh Kumaran S, C. Ramachandra A, and M. Arun Kumar N. "Development of Small Vocabulary Continuous Speech-to-Text System for Kannada Language/Dialects." Indian Journal of Science and Technology 15, no. 45 (2022): 2476–81. https://doi.org/10.17485/IJST/v15i45.1884.
Full textSchwarz, Nikolai, Khia A. Johnson, and Molly Babel. "Exploring the variable efficacy of Google speech-to-text with spontaneous bilingual speech in Cantonese and English." Journal of the Acoustical Society of America 150, no. 4 (2021): A357. http://dx.doi.org/10.1121/10.0008580.
Full textp, Ms SANDHUSTA,. "Speech -To -Text Translation Using Hugging Face Model." INTERANTIONAL JOURNAL OF SCIENTIFIC RESEARCH IN ENGINEERING AND MANAGEMENT 08, no. 04 (2024): 1–5. http://dx.doi.org/10.55041/ijsrem30348.
Full textTran, Duc Chung, Duc Long Nguyen, and Mohd Fadzil Hassan. "Development and testing of an FPT.AI-based voicebot." Bulletin of Electrical Engineering and Informatics 9, no. 6 (2020): 2388–95. http://dx.doi.org/10.11591/eei.v9i6.2620.
Full textBooks on the topic "Speech-to-text (STT)"
Kon'kov, Vladimir, and Tat'yana Surikova. Linguistic foundations of business communication. INFRA-M Academic Publishing LLC., 2021. http://dx.doi.org/10.12737/1062745.
Full textSingh, Anushka. Sedition in Liberal Democracies. Oxford University Press, 2018. http://dx.doi.org/10.1093/oso/9780199481699.001.0001.
Full textKasabov, Nikola. Foundations of Neural Networks, Fuzzy Systems, and Knowledge Engineering. The MIT Press, 1996. http://dx.doi.org/10.7551/mitpress/3071.001.0001.
Full textPerry, Seth. Bible Culture and Authority in the Early United States. Princeton University Press, 2018. http://dx.doi.org/10.23943/princeton/9780691179131.001.0001.
Full textUfimtseva, Nataliya V., Iosif A. Sternin, and Elena Yu Myagkova. Russian psycholinguistics: results and prospects (1966–2021): a research monograph. Institute of Linguistics, Russian Academy of Sciences, 2021. http://dx.doi.org/10.30982/978-5-6045633-7-3.
Full textBook chapters on the topic "Speech-to-text (STT)"
Wong Shuk Man, Cecilia, and Patrick Chu Chun-Kau. "Improving Cantonese speech-to-text (STT) recognition by using a pronunciation model." In Applying Technology to Language and Translation. Routledge, 2024. https://doi.org/10.4324/9781003399261-6.
Full textDuran, Logan Wong, Mithil Darur, and Kok Zuea Tang. "Development and Prototyping of a Speech-To-Text (STT) Model and Lip Recognition Software for Tiny Machine Learning (TinyML) Systems." In Proceedings in Technology Transfer. Springer Nature Singapore, 2025. https://doi.org/10.1007/978-981-96-3770-6_39.
Full textCope, Bill, Mary Kalantzis, and Anastasia Olga (Olnancy) Tzirides. "Chapter 13. Meaning without borders." In Studies in Bilingualism. John Benjamins Publishing Company, 2024. http://dx.doi.org/10.1075/sibil.66.13cop.
Full textPasupuleti, Murali Krishna. "Decoding Conversational AI: From Text to Context with NLP." In Natural Language Processing Unveiled: Bridging Human-AI Communication. National Education Services, 2024. http://dx.doi.org/10.62311/nesx/97814.
Full text"Text-to-Speech (TTS) Synthesis." In The Electrical Engineering Handbook - Six Volume Set. CRC Press, 2018. http://dx.doi.org/10.1201/9781315219677-22.
Full textNarupiyakul Lalita, Sirinaovakul Booncharoen, and Cercone Nick. "Thai Syllable Analysis for Rule-Based Text to Speech System." In Frontiers in Artificial Intelligence and Applications. IOS Press, 2013. https://doi.org/10.3233/978-1-61499-258-5-139.
Full textMiller, Jim, and Regina Weinert. "Focus Constructions." In Spontaneous Spoken Language. Oxford University PressOxford, 1998. http://dx.doi.org/10.1093/oso/9780198236566.003.0005.
Full textMoore, Colette. "Before Quotation Marks." In Speech Representation in the History of English. Oxford University Press, 2020. http://dx.doi.org/10.1093/oso/9780190918064.003.0002.
Full textWongkia Wararat, Naruedomkul Kanlaya, and Cercone Nick. "I-Math: an Intelligent Accessible Mathematics system for People with Visual Impairment." In Frontiers in Artificial Intelligence and Applications. IOS Press, 2013. https://doi.org/10.3233/978-1-61499-258-5-83.
Full textSánchez, Àlex, Sergio Montoya, Josep Escrig, and Ivan Huerta. "Automatic Catalan Keyword Spotting Database Generator." In Frontiers in Artificial Intelligence and Applications. IOS Press, 2024. http://dx.doi.org/10.3233/faia240421.
Full textConference papers on the topic "Speech-to-text (STT)"
Minixhofer, Christoph, Ondřej Klejch, and Peter Bell. "TTSDS - Text-to-Speech Distribution Score." In 2024 IEEE Spoken Language Technology Workshop (SLT). IEEE, 2024. https://doi.org/10.1109/slt61566.2024.10832178.
Full textTseng, Cindy, Yun Tang, and Vijendra Raj Apsingekar. "Transducer Consistency Regularization For Speech to Text Applications." In 2024 IEEE Spoken Language Technology Workshop (SLT). IEEE, 2024. https://doi.org/10.1109/slt61566.2024.10832230.
Full textWang, Hankun, Chenpeng Du, Yiwei Guo, Shuai Wang, Xie Chen, and Kai Yu. "Attention-Constrained Inference For Robust Decoder-Only Text-to-Speech." In 2024 IEEE Spoken Language Technology Workshop (SLT). IEEE, 2024. https://doi.org/10.1109/slt61566.2024.10832301.
Full textWang, Changhan, Yun Tang, Xutai Ma, Anne Wu, Dmytro Okhonko, and Juan Pino. "Fairseq S2T: Fast Speech-to-Text Modeling with Fairseq." In Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing: System Demonstrations. Association for Computational Linguistics, 2020. http://dx.doi.org/10.18653/v1/2020.aacl-demo.6.
Full textGuo, Haohan, Fenglong Xie, Kun Xie, et al. "SoCodec: A Semantic-Ordered Multi-Stream Speech Codec For Efficient Language Model Based Text-to-Speech Synthesis." In 2024 IEEE Spoken Language Technology Workshop (SLT). IEEE, 2024. https://doi.org/10.1109/slt61566.2024.10832247.
Full textYamauchi, Kazuki, Yuki Saito, and Hiroshi Saruwatari. "Cross-Dialect Text-to-Speech In Pitch-Accent Language Incorporating Multi-Dialect Phoneme-Level Bert." In 2024 IEEE Spoken Language Technology Workshop (SLT). IEEE, 2024. https://doi.org/10.1109/slt61566.2024.10832155.
Full textMa, Min, Gary Wang, Kyle Kastner, Isaac Caswell, Charles Yoon, and Andrew Rosenberg. "Enhancing Low-Resource Spoken Language Identification Via Cross-Modality Retrieval and Cross-Lingual Text-to-Speech Synthesis." In 2024 IEEE Spoken Language Technology Workshop (SLT). IEEE, 2024. https://doi.org/10.1109/slt61566.2024.10832238.
Full textWu, Haibin, Xiaofei Wang, Sefik Emre Eskimez, et al. "Laugh Now Cry Later: Controlling Time-Varying Emotional States of Flow-Matching-Based Zero-Shot Text-To-Speech." In 2024 IEEE Spoken Language Technology Workshop (SLT). IEEE, 2024. https://doi.org/10.1109/slt61566.2024.10832181.
Full textKarl, Anderson Luiz, Guilherme Sales Fernandes, Leonardo Augusto Pires, Yvens R. Serpa, and Carlos Caminha. "Synthetic AI Data Pipeline for Domain-Specific Speech-to-Text Solutions." In Simpósio Brasileiro de Tecnologia da Informação e da Linguagem Humana. Sociedade Brasileira de Computação, 2024. https://doi.org/10.5753/stil.2024.245336.
Full textde Lima, Pedro L. S., and Cláudio E. C. Campelo. "Disfluency Detection and Removal in Speech Transcriptions via Large Language Models." In Simpósio Brasileiro de Tecnologia da Informação e da Linguagem Humana. Sociedade Brasileira de Computação, 2024. https://doi.org/10.5753/stil.2024.245417.
Full textReports on the topic "Speech-to-text (STT)"
Yatsymirska, Mariya. SOCIAL EXPRESSION IN MULTIMEDIA TEXTS. Ivan Franko National University of Lviv, 2021. http://dx.doi.org/10.30970/vjo.2021.49.11072.
Full text