Log in

Relevant bibliographies by topics / Automatic speech recognition system (ASR) / Dissertations / Theses

To see the other types of publications on this topic, follow the link: Automatic speech recognition system (ASR).

Dissertations / Theses on the topic 'Automatic speech recognition system (ASR)'

Author: Grafiati

Published: 5 June 2025

Last updated: 2 August 2025

Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles

Select a source type:

Consult the top 50 dissertations / theses for your research on the topic 'Automatic speech recognition system (ASR).'

Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.

You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.

Browse dissertations / theses on a wide variety of disciplines and organise your bibliography correctly.

1

Laryea, Joycelyn, and Nipunika Jayasundara. "Automatic Speech Recognition System for Somali in the interest of reducing Maternal Morbidity and Mortality." Thesis, Högskolan Dalarna, Mikrodataanalys, 2020. http://urn.kb.se/resolve?urn=urn:nbn:se:du-34436.

Full text

Abstract:

Developing an Automatic Speech Recognition (ASR) system for the Somali language, though not novel, is not actively explored; hence there has been no success in a model for conversational speech. Neither are related works accessible as open-source. The unavailability of digital data is what labels Somali as a low resource language and poses the greatest impediment to the development of an ASR for Somali. The incentive to develop an ASR system for the Somali language is to contribute to reducing the Maternal Mortality Rate (MMR) in Somalia. Researchers acquire interview audio data regarding mate

APA, Harvard, Vancouver, ISO, and other styles

2

Sklar, Alexander Gabriel. "Channel Modeling Applied to Robust Automatic Speech Recognition." Scholarly Repository, 2007. http://scholarlyrepository.miami.edu/oa_theses/87.

Full text

Abstract:

In automatic speech recognition systems (ASRs), training is a critical phase to the system?s success. Communication media, either analog (such as analog landline phones) or digital (VoIP) distort the speaker?s speech signal often in very complex ways: linear distortion occurs in all channels, either in the magnitude or phase spectrum. Non-linear but time-invariant distortion will always appear in all real systems. In digital systems we also have network effects which will produce packet losses and delays and repeated packets. Finally, one cannot really assert what path a signal will take, and

APA, Harvard, Vancouver, ISO, and other styles

3

Karlsson, Joakim. "The integration of automatic speech recognition into the air traffic control system." Thesis, Massachusetts Institute of Technology, 1990. http://hdl.handle.net/1721.1/42184.

Full text

APA, Harvard, Vancouver, ISO, and other styles

4

Haque, Serajul. "Perceptual features for speech recognition." University of Western Australia. School of Electrical, Electronic and Computer Engineering, 2008. http://theses.library.uwa.edu.au/adt-WU2008.0187.

Full text

Abstract:

Automatic speech recognition (ASR) is one of the most important research areas in the field of speech technology and research. It is also known as the recognition of speech by a machine or, by some artificial intelligence. However, in spite of focused research in this field for the past several decades, robust speech recognition with high reliability has not been achieved as it degrades in presence of speaker variabilities, channel mismatch condi- tions, and in noisy environments. The superb ability of the human auditory system has motivated researchers to include features of human perception

APA, Harvard, Vancouver, ISO, and other styles

5

Gong, XiangQi. "Ellection markup language (EML) based tele-voting system." Thesis, University of the Western Cape, 2009. http://etd.uwc.ac.za/index.php?module=etd&action=viewtitle&id=gen8Srv25Nme4_5841_1350999620.

Full text

Abstract:

Elections are one of the most fundamental activities of a democratic society. As is the case in any other aspect of life, developments in technology have resulted changes in the voting procedure from using the traditional paper-based voting to voting by use of electronic means, or e-voting. E-voting involves using different forms of electronic means like<br>voting machines, voting via the Internet, telephone, SMS and digital interactive television. This thesis concerns voting by telephone, or televoting, it starts by giving a brief overview and evaluation of various models and technologies tha

APA, Harvard, Vancouver, ISO, and other styles

6

Tomashenko, Natalia. "Speaker adaptation of deep neural network acoustic models using Gaussian mixture model framework in automatic speech recognition systems." Thesis, Le Mans, 2017. http://www.theses.fr/2017LEMA1040/document.

Full text

Abstract:

Les différences entre conditions d'apprentissage et conditions de test peuvent considérablement dégrader la qualité des transcriptions produites par un système de reconnaissance automatique de la parole (RAP). L'adaptation est un moyen efficace pour réduire l'inadéquation entre les modèles du système et les données liées à un locuteur ou un canal acoustique particulier. Il existe deux types dominants de modèles acoustiques utilisés en RAP : les modèles de mélanges gaussiens (GMM) et les réseaux de neurones profonds (DNN). L'approche par modèles de Markov cachés (HMM) combinés à des GMM (GMM-HM

APA, Harvard, Vancouver, ISO, and other styles

7

暁芸, 王., and Xiaoyun Wang. "Phoneme set design for second language speech recognition." Thesis, https://doors.doshisha.ac.jp/opac/opac_link/bibid/BB13044980/?lang=0, 2017. https://doors.doshisha.ac.jp/opac/opac_link/bibid/BB13044980/?lang=0.

Full text

Abstract:

本論文は第二言語話者の発話を高精度で認識するための音素セットの構成方法に関する研究結果を述べている．本論文では，第二言語話者の発話をネイティブ話者の発話とは異なる音響特徴量の頻度分布を持つ情報源とみなし，これを表現する適切な音素セットを構築する手法を提案している．具体的には，対象とする第二言語と母語との調音位置や調音様式などの類似性に加え，同音異義語の発生による単語識別性能の低下を総合した基準に基づき，最適な音素セットを決定する．提案手法を日本人学生の英語発話の音声認識に適用し，種々の条件下で認識精度の向上を検証した．<br>This dissertation focuses on the problem caused by confused mispronunciation to improve the recognition performance of second language speech. A novel method considering integrated acoustic and linguistic features is proposed to derive a reduced phoneme set for L2 speech recognition. The customized phoneme set is created with a phon

APA, Harvard, Vancouver, ISO, and other styles

8

Hartmann, William. "ASR-Driven Binary Mask Estimation for Robust Automatic Speech Recognition." The Ohio State University, 2012. http://rave.ohiolink.edu/etdc/view?acc_num=osu1338244649.

Full text

APA, Harvard, Vancouver, ISO, and other styles

9

TURRISI, Rosanna. "On Deep Learning strategies to address Automatic Speech Recognition (ASR) for dysarthric speech." Doctoral thesis, Università degli studi di Ferrara, 2021. http://hdl.handle.net/11392/2488127.

Full text

Abstract:

This thesis explores deep learning techniques to improve Automatic Speech Recognition (ASR) for people affected by dysarthria. Dysarthria is a widely spread motor disorder causing high speech unintelligibility and, often, also motor control abnormalities. Hence, ASR-based technologies may represent the only possibility for dysarthric individuals to interact with other people or machines. Unfortunately, traditional ASR systems fail in presence of dysarthric speech. For instance, we tested Google Speech API and IBM on a subset of the TORGO dataset. These provide more than 80% of WER, while the

APA, Harvard, Vancouver, ISO, and other styles

10

Kraal, Ben James, and n/a. "Considering design for automatic speech recognition in use." University of Canberra. Information Sciences and Engineering, 2006. http://erl.canberra.edu.au./public/adt-AUC20070514.092924.

Full text

Abstract:

Talking to a computer is hard. Large vocabulary automatic speech recognition (ASR) systems are difficult to use and yet they are used by many people in their daily work. This thesis addresses the question: How is ASR used and made usable and useful in the workplace now? To answer these questions I went into two workplaces where ASR is currently used and one where ASR could be used in the future. This field work was done with designing in mind. ASR dictation systems are currently used in the Australian Public Service (APS) by people who suffer chronic workplace overuse injuries and in the Hansa

APA, Harvard, Vancouver, ISO, and other styles

11

Lin, Alvin. "Video Based Automatic Speech Recognition Using Neural Networks." DigitalCommons@CalPoly, 2020. https://digitalcommons.calpoly.edu/theses/2343.

Full text

Abstract:

Neural network approaches have become popular in the field of automatic speech recognition (ASR). Most ASR methods use audio data to classify words. Lip reading ASR techniques utilize only video data, which compensates for noisy environments where audio may be compromised. A comprehensive approach, including the vetting of datasets and development of a preprocessing chain, to video-based ASR is developed. This approach will be based on neural networks, namely 3D convolutional neural networks (3D-CNN) and Long short-term memory (LSTM). These types of neural networks are designed to take in temp

APA, Harvard, Vancouver, ISO, and other styles

12

Mossberg, Zimon. "Achieving Automatic Speech Recognition for Swedish using the Kaldi toolkit." Thesis, KTH, Skolan för datavetenskap och kommunikation (CSC), 2016. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-194178.

Full text

Abstract:

The meager offering of online commercial Swedish Automatic Speech Recognition ser-vices prompts the effort to develop a speech recognizer for Swedish using the open sourcetoolkit Kaldi and publicly available NST speech corpus. Using a previous Kaldi recipeseveral GMM-HMM models are trained and evaluated against commercial options toallow for reasoning of the performance of a customized solution for Automatic SpeechRecognition to that of commercial services. The evaluation takes both accuracy andcomputational speed into consideration. Initial results of the evaluation indicate a sys-tematic bia

APA, Harvard, Vancouver, ISO, and other styles

13

Keyvani, Alireza. "Robustness in ASR : an experimental study of the interrelationship between discriminant feature-space transformation, speaker normalization and environment compensation." Thesis, McGill University, 2007. http://digitool.Library.McGill.CA:80/R/?func=dbin-jump-full&object_id=99772.

Full text

Abstract:

This thesis addresses the general problem of maintaining robust automatic speech recognition (ASR) performance under diverse speaker populations, channel conditions, and acoustic environments. To this end, the thesis analyzes the interactions between environment compensation techniques, frequency warping based speaker normalization, and discriminant feature-space transformation (DFT). These interactions were quantified by performing experiments on the connected digit utterances comprising the Aurora 2 database, using continuous density hidden Markov models (HMM) representing individual digits.

APA, Harvard, Vancouver, ISO, and other styles

14

Swietojanski, Paweł. "Learning representations for speech recognition using artificial neural networks." Thesis, University of Edinburgh, 2016. http://hdl.handle.net/1842/22835.

Full text

Abstract:

Learning representations is a central challenge in machine learning. For speech recognition, we are interested in learning robust representations that are stable across different acoustic environments, recording equipment and irrelevant inter– and intra– speaker variabilities. This thesis is concerned with representation learning for acoustic model adaptation to speakers and environments, construction of acoustic models in low-resource settings, and learning representations from multiple acoustic channels. The investigations are primarily focused on the hybrid approach to acoustic modelling ba

APA, Harvard, Vancouver, ISO, and other styles

15

Badenhorst, Jacob Andreas Cornelius. "Data sufficiency analysis for automatic speech recognition / by J.A.C. Badenhorst." Thesis, North-West University, 2009. http://hdl.handle.net/10394/3994.

Full text

Abstract:

The languages spoken in developing countries are diverse and most are currently under-resourced from an automatic speech recognition (ASR) perspective. In South Africa alone, 10 of the 11 official languages belong to this category. Given the potential for future applications of speech-based information systems such as spoken dialog system (SDSs) in these countries, the design of minimal ASR audio corpora is an important research area. Specifically, current ASR systems utilise acoustic models to represent acoustic variability, and effective ASR corpus design aims to optimise the amount of rele

APA, Harvard, Vancouver, ISO, and other styles

16

Basson, Willem Diederick. "Improving Grapheme-based speech recognition through P2G transliteration / W.D. Basson." Thesis, North-West University, 2014. http://hdl.handle.net/10394/11068.

Full text

Abstract:

Grapheme-based speech recognition systems are faster to develop, but typically do not reach the same level of performance as phoneme-based systems. Using Afrikaans speech recognition as a case study, we first analyse the reasons for the discrepancy in performance, before introducing a technique for improving the performance of standard grapheme-based systems. It is found that by handling a relatively small number of irregular words through phoneme-to-grapheme (P2G) transliteration – transforming the original orthography of irregular words to an ‘idealised’ orthography – grapheme-based accuracy

APA, Harvard, Vancouver, ISO, and other styles

17

Ahmad, Nasir. "A motion based approach for audio-visual automatic speech recognition." Thesis, Loughborough University, 2011. https://dspace.lboro.ac.uk/2134/8564.

Full text

Abstract:

The research work presented in this thesis introduces novel approaches for both visual region of interest extraction and visual feature extraction for use in audio-visual automatic speech recognition. In particular, the speaker‘s movement that occurs during speech is used to isolate the mouth region in video sequences and motionbased features obtained from this region are used to provide new visual features for audio-visual automatic speech recognition. The mouth region extraction approach proposed in this work is shown to give superior performance compared with existing colour-based lip segme

APA, Harvard, Vancouver, ISO, and other styles

18

Gregori, Alessandro <1975&gt. "Automatic Speech Recognition (ASR) and NMT for Interlingual and Intralingual Communication: Speech to Text Technology for Live Subtitling and Accessibility." Doctoral thesis, Alma Mater Studiorum - Università di Bologna, 2021. http://amsdottorato.unibo.it/9931/1/Gregori_Alessandro_tesi.pdf.

Full text

Abstract:

Considered the increasing demand for institutional translation and the multilingualism of international organizations, the application of Artificial Intelligence (AI) technologies in multilingual communications and for the purposes of accessibility has become an important element in the production of translation and interpreting services (Zetzsche, 2019). In particular, the widespread use of Automatic Speech Recognition (ASR) and Neural Machine Translation (NMT) technology represents a recent development in the attempt of satisfying the increasing demand for interinstitutional, multilingual co

APA, Harvard, Vancouver, ISO, and other styles

19

Kocour, Martin. "Automatic Speech Recognition System Continually Improving Based on Subtitled Speech Data." Master's thesis, Vysoké učení technické v Brně. Fakulta informačních technologií, 2019. http://www.nusl.cz/ntk/nusl-399164.

Full text

Abstract:

V dnešnej dobe systémy rozpoznávania reči s veľkým slovníkom dosahujú pomerne vysoké presnosti. Za ich výsledkami však často stoja desiatky ba až stovky hodín manuálne oanotovaných trénovacích dát. Takéto dáta sú často bežne nedostupné alebo pre požadovaný jazyk vôbec neexistujú. Možným riešením je použitie bežne dostupných no menej kvalitných audiovizuálnych dát. Táto práca sa zaoberá technikou zpracovania práve takýchto dát a ich použitím pre trénovanie akustických modelov. Ďalej táto práca pojednáva o možnom využití týchto dát pre kontinuálne vylepšovanie modelov, kedže tieto dáta sú prakti

APA, Harvard, Vancouver, ISO, and other styles

20

Bengio, Yoshua. "Connectionist models applied to automatic speech recognition." Thesis, McGill University, 1987. http://digitool.Library.McGill.CA:80/R/?func=dbin-jump-full&object_id=63920.

Full text

APA, Harvard, Vancouver, ISO, and other styles

21

Narayanan, Arun. "Computational auditory scene analysis and robust automatic speech recognition." The Ohio State University, 2014. http://rave.ohiolink.edu/etdc/view?acc_num=osu1401460288.

Full text

APA, Harvard, Vancouver, ISO, and other styles

22

Sánchez, Cortina Isaías. "Confidence Measures for Automatic and Interactive Speech Recognition." Doctoral thesis, Universitat Politècnica de València, 2016. http://hdl.handle.net/10251/61473.

Full text

Abstract:

[EN] This thesis work contributes to the field of the {Automatic Speech Recognition} (ASR). And particularly to the {Interactive Speech Transcription} and {Confidence Measures} (CM) for ASR. The main goals of this thesis work can be summarised as follows: 1. To design IST methods and tools to tackle the problem of improving automatically generated transcripts. 2. To assess the designed IST methods and tools on real-life tasks of transcription in large educational repositories of video lectures. 3. To improve the reliability of the IST by improving the underlying (CM). Abstracts: The {Automati

APA, Harvard, Vancouver, ISO, and other styles

23

Tran, Michael. "An approach to a robust speaker recognition system." Diss., This resource online, 1994. http://scholar.lib.vt.edu/theses/available/etd-06062008-164814/.

Full text

APA, Harvard, Vancouver, ISO, and other styles

24

De, Vries Nicolaas Johannes. "Effective automatic speech recognition data collection for under–resourced languages / de Vries N.J." Thesis, North-West University, 2011. http://hdl.handle.net/10394/7354.

Full text

Abstract:

As building transcribed speech corpora for under–resourced languages plays a pivotal role in developing automatic speech recognition (ASR) technologies for such languages, a key step in developing these technologies is the effective collection of ASR data, consisting of transcribed audio and associated meta data. The problem is that no suitable tool currently exists for effectively collecting ASR data for such languages. The specific context and requirements for effectively collecting ASR data for underresourced languages, render all currently known solutions unsuitable for such a task. Such r

APA, Harvard, Vancouver, ISO, and other styles

25

Andersstuen, Runar, and Christoffer Jun Marcussen. "TaleTUC : Automatic Speech Recognition for a Bus Route Information System." Thesis, Norges teknisk-naturvitenskapelige universitet, Institutt for datateknikk og informasjonsvitenskap, 2012. http://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-20102.

Full text

Abstract:

With the constant increase in smartphone sales, integrated sensors have becomeavailable to the average user. This allows for mobile applications to utilise theusers context to provide more accurate information. The popularity of smartphones also attract developers to create audio functionalities that have earlier been restricted to calling interfaces. There is an increasing interest for Automatic Speech Recognition (ASR) services aimed at everyday tasks, where Apples release of SIRI is a good example of a system that has contributed to the gained popularity. This report describes T

APA, Harvard, Vancouver, ISO, and other styles

26

Millard, Benjamin J. "Oral Proficiency Assessment of French Using an Elicited Imitation Test and Automatic Speech Recognition." BYU ScholarsArchive, 2011. https://scholarsarchive.byu.edu/etd/2690.

Full text

Abstract:

Testing oral proficiency is an important, but often neglected part of the foreign language classroom. Currently accepted methods in testing oral proficiency are timely and expensive. Some work has been done to test and implement new assessment methods, but have focused primarily on English or Spanish (Graham et al. 2008). In this thesis, I demonstrate that the processes established for English and Spanish elicited imitation (EI) testing are relevant to French EI testing. First, I document the development, implementation and evaluation of an EI test to assess French oral proficiency. I also det

APA, Harvard, Vancouver, ISO, and other styles

27

Gibson, Marcia Rose. "A feasibility study on the use of a voice recognition system for training delivery." Diss., This resource online, 1990. http://scholar.lib.vt.edu/theses/available/etd-08252008-162853/.

Full text

APA, Harvard, Vancouver, ISO, and other styles

28

Castro, Ceron Ivan Francisco, and Badillo Andrea Graciela Garcia. "A Keyword Based Interactive Speech Recognition System for Embedded Applications." Thesis, Mälardalens högskola, Akademin för innovation, design och teknik, 2011. http://urn.kb.se/resolve?urn=urn:nbn:se:mdh:diva-12479.

Full text

Abstract:

Speech recognition has been an important area of research during the past decades. The usage of automatic speech recognition systems is rapidly increasing among different areas, such as mobile telephony, automotive, healthcare, robotics and more. However, despite the existence of many speech recognition systems, most of them use platform specific and non-publicly available software. Nevertheless, it is possible to develop speech recognition systems using already existing open source technology. The aim of this master's thesis is to develop an interactive and speaker independent speech recognit

APA, Harvard, Vancouver, ISO, and other styles

29

Lee, Spencer Jaehoon Gilbert Juan E. "Post-speech-recognition processiing in domain-specific text-corpus-based distributed listening system analysis, interpretation and selection of speech recognition results /." Auburn, Ala., 2006. http://repo.lib.auburn.edu/2006%20Summer/Theses/LEE_SPENCER_7.pdf.

Full text

APA, Harvard, Vancouver, ISO, and other styles

30

Johnson, Kevin. "Identification and correction of speech repairs in the context of an automatic speech recognition system." Thesis, Durham University, 1997. http://etheses.dur.ac.uk/5306/.

Full text

Abstract:

Recent advances in automatic speech recognition systems for read (dictated) speech have led researchers to confront the problem of recognising more spontaneous speech. A number of problems, such as disfluencies, appear when read speech is replaced with spontaneous speech. In this work we deal specifically with what we class as speech-repairs. Most disfluency processes deal with speech-repairs at the sentence level. This is too late in the process of speech understanding. Speech recognition systems have problems recognising speech containing speech-repairs. The approach taken in this work is to

APA, Harvard, Vancouver, ISO, and other styles

31

Dong, Junda. "Designing a Visual Front End in Audio-Visual Automatic Speech Recognition System." DigitalCommons@CalPoly, 2015. https://digitalcommons.calpoly.edu/theses/1382.

Full text

Abstract:

Audio-visual automatic speech recognition (AVASR) is a speech recognition technique integrating audio and video signals as input. Traditional audio-only speech recognition system only uses acoustic information from an audio source. However the recognition performance degrades significantly in acoustically noisy environments. It has been shown that visual information also can be used to identify speech. To improve the speech recognition performance, audio-visual automatic speech recognition has been studied. In this paper, we focus on the design of the visual front end of an AVASR system, which

APA, Harvard, Vancouver, ISO, and other styles

32

吳建雄 and Jianxiong Wu. "A parallel distributed processing system for machine recognition of speech signals." Thesis, The University of Hong Kong (Pokfulam, Hong Kong), 1991. http://hub.hku.hk/bib/B31232887.

Full text

APA, Harvard, Vancouver, ISO, and other styles

33

Wu, Jianxiong. "A parallel distributed processing system for machine recognition of speech signals /." [Hong Kong : University of Hong Kong], 1991. http://sunzi.lib.hku.hk/hkuto/record.jsp?B13068568.

Full text

APA, Harvard, Vancouver, ISO, and other styles

34

Casali, Sherry Perdue. "The effects of recognition accuracy and vocabulary size of a speech recognition system on task performance and user acceptance." Thesis, Virginia Tech, 1988. http://hdl.handle.net/10919/43383.

Full text

APA, Harvard, Vancouver, ISO, and other styles

35

Ho, Man-chung. "A recognizer of Guangdonghua : development of speech controlled telephone directory system /." Hong Kong : University of Hong Kong, 1999. http://sunzi.lib.hku.hk/hkuto/record.jsp?B20346578.

Full text

APA, Harvard, Vancouver, ISO, and other styles

36

Intilisano, Antonio Rosario. "Spoken dialog systems: from automatic speech recognition to spoken language understanding." Doctoral thesis, Università di Catania, 2016. http://hdl.handle.net/10761/3920.

Full text

APA, Harvard, Vancouver, ISO, and other styles

37

Noori, Asaad F. "An investigation of the feasabiltiy of neurophysiologically and psycholinguistically automatic speech recognition system." Thesis, King's College London (University of London), 1996. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.321736.

Full text

APA, Harvard, Vancouver, ISO, and other styles

38

Collingham, Russell James. "Towards an automatic speech recognition system for use by deaf students in lectures." Thesis, Durham University, 1994. http://etheses.dur.ac.uk/5840/.

Full text

Abstract:

According to the Royal National Institute for Deaf people there are nearly 7.5 million hearing-impaired people in Great Britain. Human-operated machine transcription systems, such as Palantype, achieve low word error rates in real-time. The disadvantage is that they are very expensive to use because of the difficulty in training operators, making them impractical for everyday use in higher education. Existing automatic speech recognition systems also achieve low word error rates, the disadvantages being that they work for read speech in a restricted domain. Moving a system to a new domain requ

APA, Harvard, Vancouver, ISO, and other styles

39

Holmberg, Marcus. "Speech encoding in the human auditory periphery : modeling and quantitative assessment by means of automatic speech recognition /." Düsseldorf : VDI-Verl, 2009. http://d-nb.info/999124897/04.

Full text

APA, Harvard, Vancouver, ISO, and other styles

40

Jeon, Woojay. "Speech Analysis and Cognition Using Category-Dependent Features in a Model of the Central Auditory System." Diss., Georgia Institute of Technology, 2006. http://hdl.handle.net/1853/14061.

Full text

Abstract:

It is well known that machines perform far worse than humans in recognizing speech and audio, especially in noisy environments. One method of addressing this issue of robustness is to study physiological models of the human auditory system and to adopt some of its characteristics in computers. As a first step in studying the potential benefits of an elaborate computational model of the primary auditory cortex (A1) in the central auditory system, we qualitatively and quantitatively validate the model under existing speech processing recognition methodology. Next, we develop new insights and ide

APA, Harvard, Vancouver, ISO, and other styles

41

何敏聰 and Man-chung Ho. "A recognizer of Guangdonghua: development of speech controlled telephone directory system." Thesis, The University of Hong Kong (Pokfulam, Hong Kong), 1999. http://hub.hku.hk/bib/B31220903.

Full text

APA, Harvard, Vancouver, ISO, and other styles

42

Duckitt, William. "The design of a high-performance, floating-point embedded system for speech recognition and audio research purposes." Thesis, Link to the online version, 2008. http://hdl.handle.net/10019/824.

Full text

APA, Harvard, Vancouver, ISO, and other styles

43

Lefèbvre, Claude. "An investigation of the use of an auditory model in an automatic speech recognition system." Thesis, University of Ottawa (Canada), 1986. http://hdl.handle.net/10393/4788.

Full text

APA, Harvard, Vancouver, ISO, and other styles

44

Xue, Sukui, and 薛苏葵. "Voice-enabled CAD system." Thesis, The University of Hong Kong (Pokfulam, Hong Kong), 2010. http://hub.hku.hk/bib/B45461405.

Full text

APA, Harvard, Vancouver, ISO, and other styles

45

Tran, Thi-Anh-Xuan. "Acoustic gesture modeling. Application to a Vietnamese speech recognition system." Thesis, Université Grenoble Alpes (ComUE), 2016. http://www.theses.fr/2016GREAT023/document.

Full text

Abstract:

La sélection de caractéristiques acoustiques appropriées est essentielle dans tout système de traitement de la parole. Pendant près de 40 ans, la parole a été généralement considérée comme une séquence de signaux quasi-stables (voyelles) séparés par des transitions (consonnes). Bien qu‟un grand nombre d'études documentent clairement l'importance de la coarticulation, et révèlent que les cibles articulatoires et acoustiques ne sont pas indépendantes du contexte, l‟hypothèse que chaque voyelle présente une cible acoustique qui peut être spécifiée d'une manière indépendante du contexte reste très

APA, Harvard, Vancouver, ISO, and other styles

46

Geoffroy, Nancy Anne. "Measuring Speech Intelligibility in Voice Alarm Communication Systems." Link to electronic thesis, 2005. http://www.wpi.edu/Pubs/ETD/Available/etd-050405-192800/.

Full text

Abstract:

Thesis (M.S.) -- Worcester Polytechnic Institute.<br>Keywords: speech intelligibility; voice alarm communication system; common intelligibility scale (CIS); speech transmission index (STI). Includes bibliographical references (p. 80-82).

APA, Harvard, Vancouver, ISO, and other styles

47

Ogun, Sewade. "Generating diverse synthetic data for ASR training data augmentation." Electronic Thesis or Diss., Université de Lorraine, 2024. http://www.theses.fr/2024LORR0116.

Full text

Abstract:

Au cours des deux dernières décennies, le taux d'erreur des systèmes de reconnaissance automatique de la parole (RAP) a chuté drastiquement, les rendant ainsi plus utiles dans les applications réelles. Cette amélioration peut être attribuée à plusieurs facteurs, dont les nouvelles architectures utilisant des techniques d'apprentissage profond, les nouveaux algorithmes d'entraînement, les ensembles de données d'entraînement grands et diversifiés, et l'augmentation des données. En particulier, les jeux de données d'entraînement de grande taille ont été essentiels pour apprendre des représentatio

APA, Harvard, Vancouver, ISO, and other styles

48

Kuffel, Robert F. "Speech recognition software : an alternative to reduce ship control manning /." Thesis, Monterey, Calif. : Springfield, Va. : Naval Postgraduate School ; Available from National Technical Information Service, 2004. http://library.nps.navy.mil/uhtbin/hyperion/04Mar%5FKuffel.pdf.

Full text

Abstract:

Thesis (M.S. in Information Systems and Operations)--Naval Postgraduate School, March 2004.<br>Thesis advisor(s): Russell Gottfried, Monique P. Fargues. Includes bibliographical references (p. 43-45). Also available online.

APA, Harvard, Vancouver, ISO, and other styles

49

Atkinson, Karen A. "FRIC : an expert system to recognize fricatives /." Online version of thesis, 1987. http://hdl.handle.net/1850/8805.

Full text

APA, Harvard, Vancouver, ISO, and other styles

50

Paul, Sheuli [Verfasser], Michael [Akademischer Betreuer] Richter, and Steven [Akademischer Betreuer] Liu. "Dynamic Automatic Noisy Speech Recognition System (DANSR) = Dynamische automatische verrauschte Spracherkennung / Sheuli Paul. Betreuer: Michael Richter ; Steven Liu." Kaiserslautern : Technische Universität Kaiserslautern, 2014. http://d-nb.info/1050977211/34.

Full text

APA, Harvard, Vancouver, ISO, and other styles

We offer discounts on all premium plans for authors whose works are included in thematic literature selections. Contact us to get a unique promo code!