Log in

Relevant bibliographies by topics / Speech Features / Dissertations / Theses

To see the other types of publications on this topic, follow the link: Speech Features.

Dissertations / Theses on the topic 'Speech Features'

Author: Grafiati

Published: 4 June 2021

Last updated: 26 July 2025

Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles

Select a source type:

Consult the top 50 dissertations / theses for your research on the topic 'Speech Features.'

Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.

You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.

Browse dissertations / theses on a wide variety of disciplines and organise your bibliography correctly.

1

Haque, Serajul. "Perceptual features for speech recognition." University of Western Australia. School of Electrical, Electronic and Computer Engineering, 2008. http://theses.library.uwa.edu.au/adt-WU2008.0187.

Full text

Abstract:

Automatic speech recognition (ASR) is one of the most important research areas in the field of speech technology and research. It is also known as the recognition of speech by a machine or, by some artificial intelligence. However, in spite of focused research in this field for the past several decades, robust speech recognition with high reliability has not been achieved as it degrades in presence of speaker variabilities, channel mismatch condi- tions, and in noisy environments. The superb ability of the human auditory system has motivated researchers to include features of human perception

APA, Harvard, Vancouver, ISO, and other styles

2

Leung, Ka Yee. "Combining acoustic features and articulatory features for speech recognition /." View Abstract or Full-Text, 2002. http://library.ust.hk/cgi/db/thesis.pl?ELEC%202002%20LEUNGK.

Full text

Abstract:

Thesis (M. Phil.)--Hong Kong University of Science and Technology, 2002.<br>Includes bibliographical references (leaves 92-96). Also available in electronic version. Access restricted to campus users.

APA, Harvard, Vancouver, ISO, and other styles

3

Lileikytė, Rasa. "Quality estimation of speech recognition features." Doctoral thesis, Lithuanian Academic Libraries Network (LABT), 2012. http://vddb.laba.lt/obj/LT-eLABa-0001:E.02~2012~D_20120302_090132-92071.

Full text

Abstract:

The accuracy of speech recognition system depends on characteristics of employed speech recognition features and classifier. Evaluating the accuracy of speech recognition system in ordinary way, the error of speech recognition system has to be calculated for each type of explored feature system and each type of classifier. The amount of such calculations can be reduced if the quality of explored feature system is estimated. Accordingly, the researches were made for quality estimation of speech recognition features. The proposed method for quality estimation of speech recognition features is ba

APA, Harvard, Vancouver, ISO, and other styles

4

Matthews, Iain. "Features for audio-visual speech recognition." Thesis, University of East Anglia, 1998. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.266736.

Full text

APA, Harvard, Vancouver, ISO, and other styles

5

Rankin, D. "Extraction of features from speech spectra." Thesis, Queen's University Belfast, 1985. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.373541.

Full text

APA, Harvard, Vancouver, ISO, and other styles

6

Ore, Brian M. "Multilingual Articulatory Features for Speech Recognition." Wright State University / OhioLINK, 2007. http://rave.ohiolink.edu/etdc/view?acc_num=wright1176169264.

Full text

APA, Harvard, Vancouver, ISO, and other styles

7

Droppo, J. G. "Time-frequency features for speech recognition /." Thesis, Connect to this title online; UW restricted, 2000. http://hdl.handle.net/1773/5965.

Full text

APA, Harvard, Vancouver, ISO, and other styles

8

Iliev, Alexander Iliev. "Emotion Recognition Using Glottal and Prosodic Features." Scholarly Repository, 2009. http://scholarlyrepository.miami.edu/oa_dissertations/515.

Full text

Abstract:

Emotion conveys the psychological state of a person. It is expressed by a variety of physiological changes, such as changes in blood pressure, heart beat rate, degree of sweating, and can be manifested in shaking, changes in skin coloration, facial expression, and the acoustics of speech. This research focuses on the recognition of emotion conveyed in speech. There were three main objectives of this study. One was to examine the role played by the glottal source signal in the expression of emotional speech. The second was to investigate whether it can provide improved robustness in real-world

APA, Harvard, Vancouver, ISO, and other styles

9

Väyrynen, E. (Eero). "Emotion recognition from speech using prosodic features." Doctoral thesis, Oulun yliopisto, 2014. http://urn.fi/urn:isbn:9789526204048.

Full text

Abstract:

Abstract Emotion recognition, a key step of affective computing, is the process of decoding an embedded emotional message from human communication signals, e.g. visual, audio, and/or other physiological cues. It is well-known that speech is the main channel for human communication and thus vital in the signalling of emotion and semantic cues for the correct interpretation of contexts. In the verbal channel, the emotional content is largely conveyed as constant paralinguistic information signals, from which prosody is the most important component. The lack of evaluation of affect and emotional

APA, Harvard, Vancouver, ISO, and other styles

10

Gu, Y. "Perceptually-based features in automatic speech recognition." Thesis, Swansea University, 1991. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.637182.

Full text

Abstract:

Interspeaker variability of speech features is one of most important problems in automatic speech recognition (ASR), and makes speaker-independent systems much more difficult to achieve than speaker-dependent ones. The work described in the Thesis examines two ideas to overcome this problem. The first attempts to extract more reliable speech features by perceptually-based modelling; the second investigates the speaker variability in this speech feature and reduces its effects by a speaker normalisation scheme. The application of human speech perception in automatic speech recognition is discus

APA, Harvard, Vancouver, ISO, and other styles

11

Saenko, Ekaterina 1976. "Articulatory features for robust visual speech recognition." Thesis, Massachusetts Institute of Technology, 2004. http://hdl.handle.net/1721.1/28736.

Full text

Abstract:

Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2004.<br>Includes bibliographical references (p. 99-105).<br>This thesis explores a novel approach to visual speech modeling. Visual speech, or a sequence of images of the speaker's face, is traditionally viewed as a single stream of contiguous units, each corresponding to a phonetic segment. These units are defined heuristically by mapping several visually similar phonemes to one visual phoneme, sometimes referred to as a viseme. However, experimental evidence shows that phonetic models

APA, Harvard, Vancouver, ISO, and other styles

12

Lal, Partha. "Cross-lingual automatic speech recognition using tandem features." Thesis, University of Edinburgh, 2011. http://hdl.handle.net/1842/5773.

Full text

Abstract:

Automatic speech recognition requires many hours of transcribed speech recordings in order for an acoustic model to be effectively trained. However, recording speech corpora is time-consuming and expensive, so such quantities of data exist only for a handful of languages — there are many languages for which little or no data exist. Given that there are acoustic similarities between different languages, it may be fruitful to use data from a well-supported source language for the task of training a recogniser in a target language with little training data. Since most languages do not share a com

APA, Harvard, Vancouver, ISO, and other styles

13

Domont, Xavier. "Hierarchical spectro-temporal features for robust speech recognition." Münster Verl.-Haus Monsenstein und Vannerdat, 2009. http://d-nb.info/1001282655/04.

Full text

APA, Harvard, Vancouver, ISO, and other styles

14

Mubarak, Omer Mohsin Electrical Engineering &amp Telecommunications Faculty of Engineering UNSW. "Speech and music discrimination using short-time features." Awarded by:University of New South Wales. Electrical Engineering & Telecommunications, 2006. http://handle.unsw.edu.au/1959.4/31954.

Full text

Abstract:

This thesis addresses the problem of classifying an audio stream as either speech or music, an issue which is beginning to receive increasing attention due to its wide range of applications. Various techniques have been presented in last decade to discriminate between speech and music. However, their accuracy is still not sufficient since music can refer to a very broad class of signals due to the large number of musical instruments found in audio data. Performance can also be further compromised in noisy conditions, which are unavoidable in some practical situations. This thesis presents an a

APA, Harvard, Vancouver, ISO, and other styles

15

Alnadabi, Muhammad Saeid Muhammad. "Speech/music discrimination : novel features in time domain." Thesis, Durham University, 2010. http://etheses.dur.ac.uk/206/.

Full text

Abstract:

This research aimed to find novel features that can be used to discriminate between speech and music in the time domain for the purpose of data retrieval. The study used speech and music data that were recorded in standard anechoic chambers and sampled at 44.1 kHz. Two types of new features were found and thoroughly examined: the Ratio of Silent Frames (RSF) feature and the Time Series Events (TSE) set of features. The Receiver Operating Characteristics (ROC) curves were used to assess each one of the proposed features as well as certain relevant features from the literature for the purpose of

APA, Harvard, Vancouver, ISO, and other styles

16

Harte, Naomi Antonia. "Segmental phonetic features and models for speech recognition." Thesis, Queen's University Belfast, 1999. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.287466.

Full text

APA, Harvard, Vancouver, ISO, and other styles

17

Schuy, Lars. "Speech features and their significance in speaker recognition." Thesis, University of Sussex, 2002. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.288845.

Full text

Abstract:

This thesis addresses the significance of speech features within the task of speaker recognition. Motivated by the perception of simple attributes like `loud', `smooth', `fast', more than 70 new speech features are developed. A set of basic speech features like pitch, loudness and speech speed are combined together with these new features in a feature set, one set per utterance. A neural network classifier is used to evaluate the significance of these features by creating a speaker recognition system and analysing the behaviour of successfully trained single-speaker networks. An in-depth analy

APA, Harvard, Vancouver, ISO, and other styles

18

Berg, Brian LaRoy. "Investigating Speaker Features From Very Short Speech Records." Diss., Virginia Tech, 2001. http://hdl.handle.net/10919/28691.

Full text

Abstract:

A procedure is presented that is capable of extracting various speaker features, and is of particular value for analyzing records containing single words and shorter segments of speech. By taking advantage of the fast convergence properties of adaptive filtering, the approach is capable of modeling the nonstationarities due to both the vocal tract and vocal cord dynamics. Specifically, the procedure extracts the vocal tract estimate from within the closed glottis interval and uses it to obtain a time-domain glottal signal. This procedure is quite simple, requires minimal manual intervention

APA, Harvard, Vancouver, ISO, and other styles

19

Bryant, Gregory Alan. "Prosodic features of verbal irony in spontaneous speech /." Diss., Digital Dissertations Database. Restricted to UC campuses, 2004. http://uclibs.org/PID/11984.

Full text

APA, Harvard, Vancouver, ISO, and other styles

20

Juneja, Amit. "Speech recognition based on phonetic features and acoustic landmarks." College Park, Md. : University of Maryland, 2004. http://hdl.handle.net/1903/2148.

Full text

Abstract:

Thesis (Ph. D.) -- University of Maryland, College Park, 2004.<br>Thesis research directed by: Electrical Engineering. Title from t.p. of PDF. Includes bibliographical references. Published by UMI Dissertation Services, Ann Arbor, Mich. Also available in paper.

APA, Harvard, Vancouver, ISO, and other styles

21

ALENCAR, VLADIMIR FABREGAS SURIGUE DE. "EFFICIENT FEATURES AND INTERPOLATION DOMAINS IN DISTRIBUTED SPEECH RECOGNITION." PONTIFÍCIA UNIVERSIDADE CATÓLICA DO RIO DE JANEIRO, 2005. http://www.maxwell.vrac.puc-rio.br/Busca_etds.php?strSecao=resultado&nrSeq=6201@1.

Full text

Abstract:

COORDENAÇÃO DE APERFEIÇOAMENTO DO PESSOAL DE ENSINO SUPERIOR<br>Com o crescimento gigantesco da Internet e dos sistemas de comunicações móveis celulares, as aplicações de processamento de voz nessas redes têm despertado grande interesse . Um problema particularmente importante nessa área consiste no reconhecimento de voz em um sistema servidor, baseado nos parâmetros acústicos calculados e quantizados no terminal do usuário (Reconhecimento de Voz Distribuído). Como em geral estes parâmetros não são os mais indicados como atributos de voz para o sistema de reconhecimento remoto, é import

APA, Harvard, Vancouver, ISO, and other styles

22

Meng, Helen M. "The use of distinctive features for automatic speech recognition." Thesis, Massachusetts Institute of Technology, 1991. http://hdl.handle.net/1721.1/13279.

Full text

APA, Harvard, Vancouver, ISO, and other styles

23

Lareau, Jonathan. "Application of shifted delta cepstral features for GMM language identification /." Electronic version of thesis, 2006. https://ritdml.rit.edu/dspace/handle/1850/2686.

Full text

APA, Harvard, Vancouver, ISO, and other styles

24

Javadi, Ailar. "Bio-inspired noise robust auditory features." Thesis, Georgia Institute of Technology, 2012. http://hdl.handle.net/1853/44801.

Full text

Abstract:

The purpose of this work is to investigate a series of biologically inspired modifications to state-of-the-art Mel- frequency cepstral coefficients (MFCCs) that may improve automatic speech recognition results. We have provided recommendations to improve speech recognition results de- pending on signal-to-noise ratio levels of input signals. This work has been motivated by noise-robust auditory features (NRAF). In the feature extraction technique, after a signal is filtered using bandpass filters, a spatial derivative step is used to sharpen the results, followed by an envelope detector (recti

APA, Harvard, Vancouver, ISO, and other styles

25

Schutte, Kenneth Thomas 1979. "Parts-based models and local features for automatic speech recognition." Thesis, Massachusetts Institute of Technology, 2009. http://hdl.handle.net/1721.1/53301.

Full text

Abstract:

Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2009.<br>Cataloged from PDF version of thesis.<br>Includes bibliographical references (p. 101-108).<br>While automatic speech recognition (ASR) systems have steadily improved and are now in widespread use, their accuracy continues to lag behind human performance, particularly in adverse conditions. This thesis revisits the basic acoustic modeling assumptions common to most ASR systems and argues that improvements to the underlying model of speech are required to address these shortcomi

APA, Harvard, Vancouver, ISO, and other styles

26

Tang, Min Ph D. Massachusetts Institute of Technology. "Large vocabulary continuous speech recognition using linguistic features and constraints." Thesis, Massachusetts Institute of Technology, 2005. http://hdl.handle.net/1721.1/33203.

Full text

Abstract:

Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2005.<br>This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections.<br>Includes bibliographical references (leaves 111-123).<br>Automatic speech recognition (ASR) is a process of applying constraints, as encoded in the computer system (the recognizer), to the speech signal until ambiguity is satisfactorily resolved to the extent that only one sequence of words is hypothesized. Such constraints fall

APA, Harvard, Vancouver, ISO, and other styles

27

Herms, Robert. "Effective Speech Features for Cognitive Load Assessment: Classification and Regression." Universitätsverlag Chemnitz, 2018. https://monarch.qucosa.de/id/qucosa%3A33346.

Full text

Abstract:

This thesis is about the effectiveness of speech features for cognitive load assessment, with particular attention being paid to new perspectives of this research area. A new cognitive load database, called CoLoSS, is introduced containing speech recordings of users who performed a learning task. Various acoustic features from different categories including prosody, voice quality, and spectrum are investigated in terms of their relevance. Moreover, Teager energy parameters, which have proven highly successful in stress detection, are introduced for cognitive load assessment and it is demonstra

APA, Harvard, Vancouver, ISO, and other styles

28

Christensen, Carl V. "Fluency Features and Elicited Imitation as Oral Proficiency Measurement." BYU ScholarsArchive, 2012. https://scholarsarchive.byu.edu/etd/3114.

Full text

Abstract:

The objective and automatic grading of oral language tests has been the subject of significant research in recent years. Several obstacles lie in the way of achieving this goal. Recent work has suggested a testing technique called elicited imitation (EI) can be used to accurately approximate global oral proficiency. This testing methodology, however, does not incorporate some fundamental aspects of language such as fluency. Other work has suggested another testing technique, simulated speech (SS), as a supplement to EI that can provide automated fluency metrics. In this work, I investigate a c

APA, Harvard, Vancouver, ISO, and other styles

29

Rask, Linnea. "Prosodic Features in Child-directed Speech during the Child's First Year." Thesis, Stockholms universitet, Avdelningen för fonetik, 2015. http://urn.kb.se/resolve?urn=urn:nbn:se:su:diva-118382.

Full text

Abstract:

This study investigates prosodic features of child-directed speech during the child’s first year, using the automated prosodic annotation software Prosogram. From previous studies on first language acquisition and child-directed speech we know that speech directed to infants and small children is characterised by exaggerated use of several prosodic features, including a higher pitch, livelier pitch movement and slower speech rate. Annotation of these phenomena has previously been done manually, which is time consuming and includes a risk of circularity. If we can use semi-automated systems to

APA, Harvard, Vancouver, ISO, and other styles

30

Cooper, Douglas. "Speech Detection using Gammatone Features and One-Class Support Vector Machine." Master's thesis, University of Central Florida, 2013. http://digital.library.ucf.edu/cdm/ref/collection/ETD/id/5923.

Full text

Abstract:

A network gateway is a mechanism which provides protocol translation and/or validation of network traffic using the metadata contained in network packets. For media applications such as Voice-over-IP, the portion of the packets containing speech data cannot be verified and can provide a means of maliciously transporting code or sensitive data undetected. One solution to this problem is through Voice Activity Detection (VAD). Many VAD's rely on time-domain features and simple thresholds for efficient speech detection however this doesn't say much about the signal being passed. More sophisticate

APA, Harvard, Vancouver, ISO, and other styles

31

Tyson, Na'im R. "Exploration of Acoustic Features for Automatic Vowel Discrimination in Spontaneous Speech." The Ohio State University, 2012. http://rave.ohiolink.edu/etdc/view?acc_num=osu1339695879.

Full text

APA, Harvard, Vancouver, ISO, and other styles

32

Pennington, Mark. "The phonetics and phonology of glottal manner features." [Bloomington, Ind.] : Indiana University, 2005. http://wwwlib.umi.com/dissertations/fullcit/3202900.

Full text

Abstract:

Thesis (Ph. D.)--Indiana University, Dept. of Linguistics, 2005.<br>Title from PDF t.p. (viewed Nov. 10, 2008). Source: Dissertation Abstracts International, Volume: 67-01, Section: A, page: 0167. Adviser: Robert F. Port.

APA, Harvard, Vancouver, ISO, and other styles

33

Otterson, Scott. "Use of speaker location features in meeting diarization /." Thesis, Connect to this title online; UW restricted, 2008. http://hdl.handle.net/1773/15463.

Full text

APA, Harvard, Vancouver, ISO, and other styles

34

Perera, Katharine. "The development of prosodic features in children's oral reading." Thesis, University of Manchester, 1989. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.276094.

Full text

APA, Harvard, Vancouver, ISO, and other styles

35

Jeon, Woojay. "Speech Analysis and Cognition Using Category-Dependent Features in a Model of the Central Auditory System." Diss., Georgia Institute of Technology, 2006. http://hdl.handle.net/1853/14061.

Full text

Abstract:

It is well known that machines perform far worse than humans in recognizing speech and audio, especially in noisy environments. One method of addressing this issue of robustness is to study physiological models of the human auditory system and to adopt some of its characteristics in computers. As a first step in studying the potential benefits of an elaborate computational model of the primary auditory cortex (A1) in the central auditory system, we qualitatively and quantitatively validate the model under existing speech processing recognition methodology. Next, we develop new insights and ide

APA, Harvard, Vancouver, ISO, and other styles

36

Kim, Jonathan Chongkang. "Classification of affect using novel voice and visual features." Diss., Georgia Institute of Technology, 2014. http://hdl.handle.net/1853/54301.

Full text

Abstract:

Emotion adds an important element to the discussion of how information is conveyed and processed by humans; indeed, it plays an important role in the contextual understanding of messages. This research is centered on investigating relevant features for affect classification, along with modeling the multimodal and multitemporal nature of emotion. The use of formant-based features for affect classification is explored. Since linear predictive coding (LPC) based formant estimators often encounter problems with modeling speech elements, such as nasalized phonemes and give inconsistent results fo

APA, Harvard, Vancouver, ISO, and other styles

37

Clark, Tracy M. "A Study of Features and Processes Towards Real-time Speech Word Recognition." Thesis, University of Canterbury. Electrical and Electronic Engineering, 1993. http://hdl.handle.net/10092/7561.

Full text

Abstract:

Word recognition techniques are reviewed. An exhaustive comparative study of many of the factors that affect recognition accuracy is presented. Experiments centred on four major areas of word recognition are described: pre-processing techniques, recognition features, recognition algorithms and distance measures. Recognition accuracy, in the context of each of these four areas, is investigated using the digit vocabulary spoken by 10 New Zealand (6 male and 4 female) and 38 American (20 male and 18 female) speakers. Pre-processing techniques examined are the type of window, the length of the dat

APA, Harvard, Vancouver, ISO, and other styles

38

GarciÌa, MariÌa Susana Avila. "Automatic tracking of 3D vocal tract features during speech production using MRI." Thesis, University of Southampton, 2006. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.437111.

Full text

APA, Harvard, Vancouver, ISO, and other styles

39

Peso, Pablo. "Spatial features of reverberant speech : estimation and application to recognition and diarization." Thesis, Imperial College London, 2016. http://hdl.handle.net/10044/1/45664.

Full text

Abstract:

Distant talking scenarios, such as hands-free calling or teleconference meetings, are essential for natural and comfortable human-machine interaction and they are being increasingly used in multiple contexts. The acquired speech signal in such scenarios is reverberant and affected by additive noise. This signal distortion degrades the performance of speech recognition and diarization systems creating troublesome human-machine interactions. This thesis proposes a method to non-intrusively estimate room acoustic parameters, paying special attention to a room acoustic parameter highly correlated

APA, Harvard, Vancouver, ISO, and other styles

40

Albright, Anthony J. "Social Dialect Features of Military Speech: A Sociolinguistic Study of Fargo Veterans." Diss., North Dakota State University, 2019. https://hdl.handle.net/10365/29589.

Full text

Abstract:

This mixed-methods study examines the potential existence of a military dialect separate from regional or social dialects experienced by civilians. In particular, how similar is the military-related storytelling lexicon of veterans in the Fargo-Moorhead area to the lexicon set forth in training bases and training manuals used by the U.S. military? The lexicon used by veterans in storytelling can sometimes seem opaque to an audience. It is typically dense with meaning borne by a few coded words. These words carry a contextual burden that can be better understood by an appeal to the dialect from

APA, Harvard, Vancouver, ISO, and other styles

41

Chan, Oscar. "Prosodic features for a maximum entropy language model." University of Western Australia. School of Electrical, Electronic and Computer Engineering, 2008. http://theses.library.uwa.edu.au/adt-WU2008.0244.

Full text

Abstract:

A statistical language model attempts to characterise the patterns present in a natural language as a probability distribution defined over word sequences. Typically, they are trained using word co-occurrence statistics from a large sample of text. In some language modelling applications, such as automatic speech recognition (ASR), the availability of acoustic data provides an additional source of knowledge. This contains, amongst other things, the melodic and rhythmic aspects of speech referred to as prosody. Although prosody has been found to be an important factor in human speech recognitio

APA, Harvard, Vancouver, ISO, and other styles

42

Kamandulytė, Laura. "Acquisition of Lithuanian adjective: lexical and morphosyntactic features." Doctoral thesis, Lithuanian Academic Libraries Network (LABT), 2010. http://vddb.laba.lt/obj/LT-eLABa-0001:E.02~2010~D_20100208_143347-79866.

Full text

Abstract:

The main results of the PhD thesis are described in the summary. The main objective of the thesis is to define the acquisition of lexical and morphosyntactic features of Lithuanian adjective. The analysis is based on the large quantity of linguistic data (~ 400 000 words), which is the corpus of four children compiled according the method of longitudinal observation. The study has interchangeably applied several methods of analysis: longitudinal observation, corpus linguistics, error analysis and comparative method. Statements to be Defended 1. Lexical features of adjectives are acquired by c

APA, Harvard, Vancouver, ISO, and other styles

43

Sun, Rui. "The evaluation of the stability of acoustic features in affective conveyance across multiple emotional databases." Diss., Georgia Institute of Technology, 2013. http://hdl.handle.net/1853/49041.

Full text

Abstract:

The objective of the research presented in this thesis was to systematically investigate the computational structure for cross-database emotion recognition. The research consisted of evaluating the stability of acoustic features, particularly the glottal and Teager Energy based features, and investigating three normalization methods and two data fusion techniques. One of the challenges of cross-database training and testing is accounting for the potential variation in the types of emotions expressed as well as the recording conditions. In an attempt to alleviate the impact of these types of va

APA, Harvard, Vancouver, ISO, and other styles

44

Johnson, Earl E. "Dispensing Rates of Four Common Hearing Aid Product Features: Associations with Variations in Practice among Audiologists." Digital Commons @ East Tennessee State University, 2010. https://dc.etsu.edu/etsu-works/1711.

Full text

APA, Harvard, Vancouver, ISO, and other styles

45

Swart, Philippa H. "Prosodic features of imperatives in Xhosa : implications for a text-to-speech system." Thesis, Stellenbosch : Stellenbosch University, 2000. http://hdl.handle.net/10019.1/51891.

Full text

Abstract:

Thesis (MA)--University of Stellenbosch, 2000.<br>ENGLISH ABSTRACT: This study focuses on the prosodic features of imperatives and the role of prosodies in the development of a text-to-speech (TIS) system for Xhosa, an African tone language. The perception of prosody is manifested in suprasegmental features such as fundamental frequency (pitch), intensity (loudness) and duration (length). Very little experimental research has been done on the prosodic features of any grammatical structures (moods and tenses) in Xhosa, therefore it has not yet been determined how and to what degree the d

APA, Harvard, Vancouver, ISO, and other styles

46

Ng, Wing-yee. "Speech acts of school-aged children with autistic features with different communication partners." Click to view the E-thesis via HKU Scholors Hub, 2005. http://lookup.lib.hku.hk/lookup/bib/B38279319.

Full text

Abstract:

Thesis (B.Sc)--University of Hong Kong, 2005.<br>"A dissertation submitted in partial fulfilment of the requirements for the Bachelor of Science (Speech and Hearing Sciences), The University of Hong Kong, June 30, 2005." Also available in print.

APA, Harvard, Vancouver, ISO, and other styles

47

Koniaris, Christos. "A study on selecting and optimizing perceptually relevant features for automatic speech recognition." Licentiate thesis, Stockholm : Kungliga Tekniska högskolan, 2009. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-11470.

Full text

APA, Harvard, Vancouver, ISO, and other styles

48

Wong, Jimmy Pui Fung. "The use of prosodic features in Chinese speech recognition and spoken language processing /." View Abstract or Full-Text, 2003. http://library.ust.hk/cgi/db/thesis.pl?ELEC%202003%20WONG.

Full text

Abstract:

Thesis (M.Phil.)--Hong Kong University of Science and Technology, 2003.<br>Includes bibliographical references (leaves 97-101). Also available in electronic version. Access restricted to campus users.

APA, Harvard, Vancouver, ISO, and other styles

49

Johnson, Earl E. "Despite Having More Advanced Features, Hearing Aids Hold Line on Retail Price." Digital Commons @ East Tennessee State University, 2008. https://dc.etsu.edu/etsu-works/1968.

Full text

Abstract:

Despite the growing popularity of state-of-the-art open-canal mini-BTEs, which were discussed in last month's Cover Story, and the increasing prevalence of advanced hearing aid features, the annual Hearing Journal/Audiology Onlinedispenser survey conducted in January found only small, inflation-appropriate increases in the average retail price of hearing aids over the past 3 years. Specifically, the average price of the hearing aids that participating dispensers reported selling in 2007 was $1986, only marginally higher than the average prices of $1912 and $1868 in 2006 and 2005, respectively.

APA, Harvard, Vancouver, ISO, and other styles

50

Gangireddy, Siva Reddy. "Recurrent neural network language models for automatic speech recognition." Thesis, University of Edinburgh, 2017. http://hdl.handle.net/1842/28990.

Full text

Abstract:

The goal of this thesis is to advance the use of recurrent neural network language models (RNNLMs) for large vocabulary continuous speech recognition (LVCSR). RNNLMs are currently state-of-the-art and shown to consistently reduce the word error rates (WERs) of LVCSR tasks when compared to other language models. In this thesis we propose various advances to RNNLMs. The advances are: improved learning procedures for RNNLMs, enhancing the context, and adaptation of RNNLMs. We learned better parameters by a novel pre-training approach and enhanced the context using prosody and syntactic features.

APA, Harvard, Vancouver, ISO, and other styles

We offer discounts on all premium plans for authors whose works are included in thematic literature selections. Contact us to get a unique promo code!