Academic literature on the topic 'Speech-to-text systems'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the lists of relevant articles, books, theses, conference reports, and other scholarly sources on the topic 'Speech-to-text systems.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Journal articles on the topic "Speech-to-text systems"
Eide, Ellen M. "Training of text-to-speech systems." Journal of the Acoustical Society of America 115, no. 5 (2004): 1874. http://dx.doi.org/10.1121/1.1757180.
Full textChoi, Yeunju, Youngmoon Jung, Younggwan Kim, Youngjoo Suh, and Hoirin Kim. "An end-to-end synthesis method for Korean text-to-speech systems." Phonetics and Speech Sciences 10, no. 1 (March 2018): 39–48. http://dx.doi.org/10.13064/ksss.2018.10.1.039.
Full textKuzmin, A., and S. Ivanov. "Speech to Text System for Noisy and Quiet Speech." Journal of Physics: Conference Series 2096, no. 1 (November 1, 2021): 012071. http://dx.doi.org/10.1088/1742-6596/2096/1/012071.
Full textVan Bezooijen, Renée, and Louis C. W. Pols. "Evaluating text-to-speech systems: Some methodological aspects." Speech Communication 9, no. 4 (August 1990): 263–70. http://dx.doi.org/10.1016/0167-6393(90)90002-q.
Full textSunitha, Dr K. V. N., and P. Sunitha Devi. "Text Normalization for Telugu Text-to-Speech Synthesis." INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY 11, no. 2 (October 10, 2013): 2241–49. http://dx.doi.org/10.24297/ijct.v11i2.1176.
Full textTran, Oanh Thi, and Viet The Bui. "Neural Text Normalization in Speech-to-Text Systems with Rich Features." Applied Artificial Intelligence 35, no. 3 (January 11, 2021): 193–205. http://dx.doi.org/10.1080/08839514.2020.1842108.
Full textGreene, Beth G., and John S. Logan. "Segmental intelligibility of synthetic speech produced by eight text‐to‐speech systems." Journal of the Acoustical Society of America 79, S1 (May 1986): S25. http://dx.doi.org/10.1121/1.2023130.
Full textSirivara, Sudheer. "Compressing and using a concatenative speech database in text-to-speech systems." Journal of the Acoustical Society of America 122, no. 1 (2007): 32. http://dx.doi.org/10.1121/1.2756497.
Full textNazemi, Azadeh, Iain Murray, and David A. McMeekin. "Multilingual Text to Speech in embedded systems using RC8660." INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY 13, no. 4 (April 30, 2014): 4374–81. http://dx.doi.org/10.24297/ijct.v13i4.2859.
Full textMolbæk Hansen, Peter. "Syntax, morphology, and phonology in text-to-speech systems." Annual Report of the Institute of Phonetics University of Copenhagen 23 (January 1, 1989): 119–52. http://dx.doi.org/10.7146/aripuc.v23i.131904.
Full textDissertations / Theses on the topic "Speech-to-text systems"
Chan, Ngor-chi. "Text-to-speech conversion for Putonghua /." [Hong Kong : University of Hong Kong], 1990. http://sunzi.lib.hku.hk/hkuto/record.jsp?B12929475.
Full text陳我智 and Ngor-chi Chan. "Text-to-speech conversion for Putonghua." Thesis, The University of Hong Kong (Pokfulam, Hong Kong), 1990. http://hub.hku.hk/bib/B31209580.
Full textBreitenbücher, Mark. "Textvorverarbeitung zur deutschen Version des Festival Text-to-Speech Synthese Systems." [S.l.] : Universität Stuttgart , Fakultät Philosophie, 1997. http://www.bsz-bw.de/cgi-bin/xvms.cgi?SWB6783514.
Full textBaloyi, Ntsako. "A text-to-speech synthesis system for Xitsonga using hidden Markov models." Thesis, University of Limpopo (Turfloop Campus), 2012. http://hdl.handle.net/10386/1021.
Full textThis research study focuses on building a general-purpose working Xitsonga speech synthesis system that is as far as can be possible reasonably intelligible, natural sounding, and flexible. The system built has to be able to model some of the desirable speaker characteristics and speaking styles. This research project forms part of the broader national speech technology project that aims at developing spoken language systems for human-machine interaction using the eleven official languages of South Africa (SA). Speech synthesis is the reverse of automatic speech recognition (which receives speech as input and converts it to text) in that it receives text as input and produces synthesized speech as output. It is generally accepted that most people find listening to spoken utterances better that reading the equivalent of such utterances. The Xitsonga speech synthesis system has been developed using a hidden Markov model (HMM) speech synthesis method. The HMM-based speech synthesis (HTS) system synthesizes speech that is intelligible, and natural sounding. This method can synthesize speech on a footprint of only a few megabytes of training speech data. The HTS toolkit is applied as a patch to the HTK toolkit which is a hidden Markov model toolkit primarily designed for use in speech recognition to build and manipulate hidden Markov models.
Engell, Trond Bøe. "TaleTUC: Text-to-Speech and Other Enhancements to Existing Bus Route Information Systems." Thesis, Norges teknisk-naturvitenskapelige universitet, Institutt for datateknikk og informasjonsvitenskap, 2012. http://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-18920.
Full textLambert, Tanya. "Databases for concatenative text-to-speech synthesis systems : unit selection and knowledge-based approach." Thesis, University of East Anglia, 2005. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.421192.
Full textLevefeldt, Christer. "Evaluation of NETtalk as a means to extract phonetic features from text for synchronization with speech." Thesis, University of Skövde, Department of Computer Science, 1998. http://urn.kb.se/resolve?urn=urn:nbn:se:his:diva-173.
Full textThe background for this project is a wish to automate synchronization of text and speech. The idea is to present speech through speakers synchronized word-for-word with text appearing on a monitor.
The solution decided upon is to use artificial neural networks, ANNs, to convert both text and speech into streams made up of sets of phonetic features and then matching these two streams against each other. Several text-to-feature ANN designs based on the NETtalk system are implemented and evaluated. The extraction of phonetic features from speech and the synchronization itself are not implemented, but some assessments are made regarding their possible performances. The performance of a finished system is not possible to determine, but a NETtalk-based ANN is believed to be suitable for such a system using phonetic features for synchronization.
Yoon, Kyuchul. "Building a prosodically sensitive diphone database for a Korean text-to-speech synthesis system." Connect to this title online, 2005. http://rave.ohiolink.edu/etdc/view?acc%5Fnum=osu1119010941.
Full textTitle from first page of PDF file. Document formatted into pages; contains xxii, 291 p.; also includes graphics (some col.) Includes bibliographical references (p. 210-216). Available online via OhioLINK's ETD Center
Thorstensson, Niklas. "A knowledge-based grapheme-to-phoneme conversion for Swedish." Thesis, University of Skövde, Department of Computer Science, 2002. http://urn.kb.se/resolve?urn=urn:nbn:se:his:diva-731.
Full textA text-to-speech system is a complex system consisting of several different modules such as grapheme-to-phoneme conversion, articulatory and prosodic modelling, voice modelling etc.
This dissertation is aimed at the creation of the initial part of a text-to-speech system, i.e. the grapheme-to-phoneme conversion, designed for Swedish. The problem area at hand is the conversion of orthographic text into a phonetic representation that can be used as a basis for a future complete text-to speech system.
The central issue of the dissertation is the grapheme-to-phoneme conversion and the elaboration of rules and algorithms required to achieve this task. The dissertation aims to prove that it is possible to make such a conversion by a rule-based algorithm with reasonable performance. Another goal is to find a way to represent phonotactic rules in a form suitable for parsing. It also aims to find and analyze problematic structures in written text compared to phonetic realization.
This work proposes a knowledge-based grapheme-to-phoneme conversion system for Swedish. The system suggested here is implemented, tested, evaluated and compared to other existing systems. The results achieved are promising, and show that the system is fast, with a high degree of accuracy.
Mhlana, Siphe. "Development of isiXhosa text-to-speech modules to support e-Services in marginalized rural areas." Thesis, University of Fort Hare, 2011. http://hdl.handle.net/10353/495.
Full textBooks on the topic "Speech-to-text systems"
Taylor, Paul. Text-to-speech synthesis. Cambridge, UK: Cambridge University Press, 2009.
Find full textAn introduction to text-to-speech synthesis. Dordrecht: Kluwer Academic Publishers, 1997.
Find full textAllen, Jonathan. From text to speech: The MITalk system. Cambridge [Cambridgeshire]: Cambridge University Press, 1987.
Find full textRao, K. Sreenivasa. Predicting Prosody from Text for Text-to-Speech Synthesis. New York, NY: Springer New York, 2012.
Find full textWilliam, Sproat Richard, and Lucent Technologies (Firm), eds. Multilingual text-to-speech synthesis: The Bell Labs approach. Dordrecht: Kluwer, 1998.
Find full textvan, Heuven Vincent, and Pols, Louis C. W., 1941-, eds. Analysis and synthesis of speech: Strategic research towards high-quality text-to-speech generation. Berlin: Mouton de Gruyter, 1993.
Find full textSabourin, Conrad. Computational speech processing: Speech analysis, recognition, understanding, compression, transmission, coding, synthesis, text to speech systems, speech to tactile displays, speaker identification, prosody processing : bibliography. Montréal: Infolingua, 1994.
Find full textMeisel, William S. The telephony voice user interface: Applications of speech recognition, text-to-speech, and speaker verification over the telephone. Tarzana, CA: TMA Associates, 1998.
Find full textZaripov, Ruslan, and Lev Gavrilov. Technology consecutive translation. ru: INFRA-M Academic Publishing LLC., 2017. http://dx.doi.org/10.12737/24842.
Full textBook chapters on the topic "Speech-to-text systems"
Bhuta, Samyak, and S. Rama Mohan. "Gujarati Text – To – Speech System." In Information Systems for Indian Languages, 311. Berlin, Heidelberg: Springer Berlin Heidelberg, 2011. http://dx.doi.org/10.1007/978-3-642-19403-0_59.
Full textStüker, Sebastian, Kevin Kilgour, and Jan Niehues. "Quaero Speech-to-Text and Text Translation Evaluation Systems." In High Performance Computing in Science and Engineering '10, 529–42. Berlin, Heidelberg: Springer Berlin Heidelberg, 2011. http://dx.doi.org/10.1007/978-3-642-15748-6_38.
Full textStüker, Sebastian, Kevin Kilgour, and Florian Kraft. "Quaero 2010 Speech-to-Text Evaluation Systems." In High Performance Computing in Science and Engineering '11, 607–18. Berlin, Heidelberg: Springer Berlin Heidelberg, 2012. http://dx.doi.org/10.1007/978-3-642-23869-7_44.
Full textYoon, HyoJeon, Dinh Tuyen Hoang, Ngoc Thanh Nguyen, and Dosam Hwang. "Cross-Lingual Korean Speech-to-Text Summarization." In Intelligent Information and Database Systems, 198–206. Cham: Springer International Publishing, 2019. http://dx.doi.org/10.1007/978-3-030-14799-0_17.
Full textKamath, K. Sanjana, K. Raghavendra N. Bhat, Charishma, and Pearl Infancia D’souza. "Kannada Text-to-Speech System using MATLAB." In Advances in VLSI, Signal Processing, Power Electronics, IoT, Communication and Embedded Systems, 187–96. Singapore: Springer Singapore, 2021. http://dx.doi.org/10.1007/978-981-16-0443-0_15.
Full textSingh, Parminder, and Gurpreet Singh Lehal. "Text-To-Speech Synthesis System for Punjabi Language." In Information Systems for Indian Languages, 302–3. Berlin, Heidelberg: Springer Berlin Heidelberg, 2011. http://dx.doi.org/10.1007/978-3-642-19403-0_54.
Full textQuazza, Silvia, and Henk van den Heuvel. "The Use of Lexica in Text-to-Speech Systems." In Text, Speech and Language Technology, 207–33. Dordrecht: Springer Netherlands, 2000. http://dx.doi.org/10.1007/978-94-010-9458-0_7.
Full textXydas, Gerasimos, and Georgios Kouroupetroglou. "Augmented Auditory Representation of e-Texts for Text-to-Speech Systems." In Text, Speech and Dialogue, 134–41. Berlin, Heidelberg: Springer Berlin Heidelberg, 2001. http://dx.doi.org/10.1007/3-540-44805-5_17.
Full textPanda, Soumya Priyadarsini, and Ajit Kumar Nayak. "A Rule-Based Concatenative Approach to Speech Synthesis in Indian Language Text-to-Speech Systems." In Advances in Intelligent Systems and Computing, 523–31. New Delhi: Springer India, 2014. http://dx.doi.org/10.1007/978-81-322-2009-1_59.
Full textVích, Robert, Jan Nouza, and Martin Vondra. "Automatic Speech Recognition Used for Intelligibility Assessment of Text-to-Speech Systems." In Verbal and Nonverbal Features of Human-Human and Human-Machine Interaction, 136–48. Berlin, Heidelberg: Springer Berlin Heidelberg, 2008. http://dx.doi.org/10.1007/978-3-540-70872-8_10.
Full textConference papers on the topic "Speech-to-text systems"
Santen, Jan P. H. van. "Timing in text-to-speech systems." In 3rd European Conference on Speech Communication and Technology (Eurospeech 1993). ISCA: ISCA, 1993. http://dx.doi.org/10.21437/eurospeech.1993-12.
Full textLee, Sangho, and Yung-Hwan Oh. "A text analyzer for Korean text-to-speech systems." In 4th International Conference on Spoken Language Processing (ICSLP 1996). ISCA: ISCA, 1996. http://dx.doi.org/10.21437/icslp.1996-430.
Full textBapat, Abhijit V., and Lalit K. Nagalkar. "Phonetic Speech Analysis for Speech to Text Conversion." In 2008 IEEE Region 10 and the Third international Conference on Industrial and Information Systems (ICIIS). IEEE, 2008. http://dx.doi.org/10.1109/iciinfs.2008.4798390.
Full textRybarova, Renata, Gonzalo del Corral, and Gregor Rozinaj. "Diphone spanish text-to-speech synthesizer." In 2015 International Conference on Systems, Signals and Image Processing (IWSSIP). IEEE, 2015. http://dx.doi.org/10.1109/iwssip.2015.7314192.
Full textGaved, Maggie. "Pronunciation and text normalisation in applied text-to-speech systems." In 3rd European Conference on Speech Communication and Technology (Eurospeech 1993). ISCA: ISCA, 1993. http://dx.doi.org/10.21437/eurospeech.1993-206.
Full textZemirli, Z. "ARAB_TTS: An Arabic Text To Speech Synthesis." In IEEE International Conference on Computer Systems and Applications, 2006. IEEE, 2006. http://dx.doi.org/10.1109/aiccsa.2006.205206.
Full textBreen, Andrew, Barry Eggleton, Peter Dion, and Steve Minnis. "Refocussing on the text normalisation process in text-to-speech systems." In 7th International Conference on Spoken Language Processing (ICSLP 2002). ISCA: ISCA, 2002. http://dx.doi.org/10.21437/icslp.2002-90.
Full textGopinath, Deepa P., J. Divya Sree, Reshmi Mathew, S. J. Rekhila, and Achuthsankar S. Nair. "Duration Analysis for Malayalam Text-To-Speech Systems." In 9th International Conference on Information Technology (ICIT'06). IEEE, 2006. http://dx.doi.org/10.1109/icit.2006.48.
Full textDiehl, F., M. J. F. Gales, M. Tomalin, and P. C. Woodland. "Phonetic pronunciations for arabic speech-to-text systems." In ICASSP 2008 - 2008 IEEE International Conference on Acoustics, Speech and Signal Processing. IEEE, 2008. http://dx.doi.org/10.1109/icassp.2008.4517924.
Full textKlavans, Judith L., and Evelyne Tzoukermann. "Machine-readable dictionaries in text-to-speech systems." In the 15th conference. Morristown, NJ, USA: Association for Computational Linguistics, 1994. http://dx.doi.org/10.3115/991250.991305.
Full textReports on the topic "Speech-to-text systems"
Furey, John, Austin Davis, and Jennifer Seiter-Moser. Natural language indexing for pedoinformatics. Engineer Research and Development Center (U.S.), September 2021. http://dx.doi.org/10.21079/11681/41960.
Full textChornodon, Myroslava. FEAUTURES OF GENDER IN MODERN MASS MEDIA. Ivan Franko National University of Lviv, February 2021. http://dx.doi.org/10.30970/vjo.2021.49.11064.
Full text