Dissertations / Theses on the topic 'Speech - Signal Processing'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the top 50 dissertations / theses for your research on the topic 'Speech - Signal Processing.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Browse dissertations / theses on a wide variety of disciplines and organise your bibliography correctly.
Little, M. A. "Biomechanically informed nonlinear speech signal processing." Thesis, University of Oxford, 2007. http://ora.ox.ac.uk/objects/uuid:6f5b84fb-ab0b-42e1-9ac2-5f6acc9c5b80.
Full textWells, Ian. "Digital signal processing architectures for speech recognition." Thesis, University of the West of England, Bristol, 1995. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.294705.
Full textMorris, Robert W. "Enhancement and recognition of whispered speech." Diss., Available online, Georgia Institute of Technology, 2004:, 2003. http://etd.gatech.edu/theses/available/etd-04082004-180338/unrestricted/morris%5frobert%5fw%5f200312%5fphd.pdf.
Full textCoetzee, H. J. "The development of a new objective speech quality measure for speech coding applications." Diss., Georgia Institute of Technology, 1990. http://hdl.handle.net/1853/15474.
Full textRex, James Alexander. "Microphone signal processing for speech recognition in cars." Thesis, University of Southampton, 2000. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.326728.
Full textShah, Afnan Arafat. "Improving automatic speech recognition transcription through signal processing." Thesis, University of Southampton, 2017. https://eprints.soton.ac.uk/418970/.
Full textWu, Ping. "Kohonen self-organising neural networks in speech signal processing." Thesis, University of Reading, 1994. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.386985.
Full textStringer, Paul David. "Binaural signal processing for the enhancement of speech perception." Thesis, University of York, 1995. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.282296.
Full textHanna, Salim Alia. "Digital signal processing algorithms for speech coding and recognition." Thesis, Imperial College London, 1987. http://hdl.handle.net/10044/1/46268.
Full textToner, Edward. "The enhancement of noise corrupted speech signals." Thesis, University of the West of Scotland, 1993. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.359727.
Full textAnderson, David Verl. "Audio signal enhancement using multi-resolution sinusoidal modeling." Diss., Georgia Institute of Technology, 1999. http://hdl.handle.net/1853/15394.
Full textEdwards, Richard. "Advanced signal processing techniques for pitch synchronous sinusoidal speech coders." Thesis, University of Surrey, 2007. http://epubs.surrey.ac.uk/833/.
Full textSpittle, Gary. "An investigation into improving speech intelligibility using binaural signal processing." Thesis, University of York, 2009. http://etheses.whiterose.ac.uk/1141/.
Full textCanagarajah, Cedric Nishanthan. "Digital signal processing techniques for speech enhancement in hearing aids." Thesis, University of Cambridge, 1993. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.260433.
Full textWang, Tianyu Tom. "Toward an interpretive framework of two-dimensional speech-signal processing." Thesis, Massachusetts Institute of Technology, 2011. http://hdl.handle.net/1721.1/65520.
Full textCataloged from PDF version of thesis.
Includes bibliographical references (p. 177-179).
Traditional representations of speech are derived from short-time segments of the signal and result in time-frequency distributions of energy such as the short-time Fourier transform and spectrogram. Speech-signal models of such representations have had utility in a variety of applications such as speech analysis, recognition, and synthesis. Nonetheless, they do not capture spectral, temporal, and joint spectrotemporal energy fluctuations (or "modulations") present in local time-frequency regions of the time-frequency distribution. Inspired by principles from image processing and evidence from auditory neurophysiological models, a variety of twodimensional (2-D) processing techniques have been explored in the literature as alternative representations of speech; however, speech-based models are lacking in this framework. This thesis develops speech-signal models for a particular 2-D processing approach in which 2-D Fourier transforms are computed on local time-frequency regions of the canonical narrowband or wideband spectrogram; we refer to the resulting transformed space as the Grating Compression Transform (GCT). We argue for a 2-D sinusoidal-series amplitude modulation model of speech content in the spectrogram domain that relates to speech production characteristics such as pitch/noise of the source, pitch dynamics, formant structure and dynamics, and offset/onset content. Narrowband- and wideband-based models are shown to exhibit important distinctions in interpretation and oftentimes "dual" behavior. In the transformed GCT space, the modeling results in a novel taxonomy of signal behavior based on the distribution of formant and onset/offset content in the transformed space via source characteristics. Our formulation provides a speech-specific interpretation of the concept of "modulation" in 2-D processing in contrast to existing approaches that have done so either phenomenologically through qualitative analyses and/or implicitly through data-driven machine learning approaches. One implication of the proposed taxonomy is its potential for interpreting transformations of other time-frequency distributions such as the auditory spectrogram which is generally viewed as being "narrowband"/"wideband" in its low/high-frequency regions. The proposed signal model is evaluated in several ways. First, we perform analysis of synthetic speech signals to characterize its properties and limitations. Next, we develop an algorithm for analysis/synthesis of spectrograms using the model and demonstrate its ability to accurately represent real speech content. As an example application, we further apply the models in cochannel speaker separation, exploiting the GCT's ability to distribute speaker-specific content and often recover overlapping information through demodulation and interpolation in the 2-D GCT space. Specifically, in multi-pitch estimation, we demonstrate the GCT's ability to accurately estimate separate and crossing pitch tracks under certain conditions. Finally, we demonstrate the model's ability to separate mixtures of speech signals using both prior and estimated pitch information. Generalization to other speech-signal processing applications is proposed.
by Tianyu Tom Wang.
Ph.D.
Larreategui, Mikel. "High-quality text-to-speech synthesis using sinusoidal techniques." Thesis, Staffordshire University, 1996. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.309790.
Full textLucey, Simon. "Audio-visual speech processing." Thesis, Queensland University of Technology, 2002. https://eprints.qut.edu.au/36172/7/SimonLuceyPhDThesis.pdf.
Full textAllred, Daniel Jackson. "Evaluation and Comparison of Beamforming Algorithms for Microphone Array Speech Processing." Thesis, Georgia Institute of Technology, 2006. http://hdl.handle.net/1853/11606.
Full textMészáros, Tomáš. "Speech Analysis for Processing of Musical Signals." Master's thesis, Vysoké učení technické v Brně. Fakulta informačních technologií, 2015. http://www.nusl.cz/ntk/nusl-234974.
Full textElvira, Jose M. "Neural networks for speech and speaker recognition." Thesis, Staffordshire University, 1994. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.262314.
Full textOberhofer, Robert. "Pitch adaptive variable bitrate CELP speech coding." Thesis, University of Ulster, 1998. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.264811.
Full textSepehr, H. "Advanced adaptive signal processing techniques for low complexity speech enhancement applications." Thesis, University College London (University of London), 2011. http://discovery.ucl.ac.uk/1306808/.
Full textFallatah, Anwar. "Speech Auditory Brainstem Response Signal Processing: Estimation, Modeling, Detection, and Enhancement." Thesis, Université d'Ottawa / University of Ottawa, 2019. http://hdl.handle.net/10393/39699.
Full textTrinkaus, Trevor R. "Perceptual coding of audio and diverse speech signals." Diss., Georgia Institute of Technology, 1999. http://hdl.handle.net/1853/13883.
Full textRao, Hrishikesh. "Paralinguistic event detection in children's speech." Diss., Georgia Institute of Technology, 2015. http://hdl.handle.net/1853/54332.
Full textLu, Nan. "Development of new digital signal processing procedures and applications to speech, electromyography and image processing." Thesis, University of Liverpool, 2008. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.445962.
Full textSmith, Daniel. "An analysis of blind signal separation for real time application." Access electronically, 2006. http://www.library.uow.edu.au/adt-NWU/public/adt-NWU20070815.152400/index.html.
Full textIkram, Muhammad Zubair. "Multichannel blind separation of speech signals in a reverberant environment." Diss., Georgia Institute of Technology, 2001. http://hdl.handle.net/1853/15023.
Full textNayfeh, Taysir H. "Multi-signal processing for voice recognition in noisy environments." Thesis, This resource online, 1991. http://scholar.lib.vt.edu/theses/available/etd-10222009-125021/.
Full textChan, Arthur Yu Chung. "Robust speech recognition against unknown short-time noise /." View Abstract or Full-Text, 2002. http://library.ust.hk/cgi/db/thesis.pl?ELEC%202002%20CHAN.
Full textIncludes bibliographical references (leaves 119-125). Also available in electronic version. Access restricted to campus users.
Doukas, Nikolaos. "Voice activity detection using energy based measures and source separation." Thesis, Imperial College London, 1998. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.245220.
Full textKale, Kaustubh R. "Low complexity, narrow baseline beamformer for hand-held devices." [Gainesville, Fla.] : University of Florida, 2003. http://purl.fcla.edu/fcla/etd/UFE0001223.
Full textHild, Kenneth E. "Blind separation of convolutive mixtures using Renyi's divergence." [Gainesville, Fla.] : University of Florida, 2003. http://purl.fcla.edu/fcla/etd/UFE0002387.
Full textErtan, Ali Erdem. "Pitch-synchronous processing of speech signal for improving the quality of low bit rate speech coders." Diss., Georgia Institute of Technology, 2004. http://hdl.handle.net/1853/36534.
Full textErtan, Ali Erdem. "Pitch-synchronous processing of speech signal for improving the quality of low bit rate speech coders." Available online, Georgia Institute of Technology, 2004:, 2003. http://etd.gatech.edu/theses/available/etd-06072004-131138/unrestricted/ertan%5Fali%5Fe%5F200405%5Fphd.pdf.
Full textVita. Includes bibliographical references (leaves 221-226).
Bakheet, Mohammed. "Improving Speech Recognition for Arabic language Using Low Amounts of Labeled Data." Thesis, Linköpings universitet, Institutionen för datavetenskap, 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-176437.
Full textBirkenes, Øystein. "A Framework for Speech Recognition using Logistic Regression." Doctoral thesis, Norwegian University of Science and Technology, Faculty of Information Technology, Mathematics and Electrical Engineering, 2007. http://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-1599.
Full textAlthough discriminative approaches like the support vector machine or logistic regression have had great success in many pattern recognition application, they have only achieved limited success in speech recognition. Two of the difficulties often encountered include 1) speech signals typically have variable lengths, and 2) speech recognition is a sequence labeling problem, where each spoken utterance corresponds to a sequence of words or phones.
In this thesis, we present a framework for automatic speech recognition using logistic regression. We solve the difficulty of variable length speech signals by including a mapping in the logistic regression framework that transforms each speech signal into a fixed-dimensional vector. The mapping is defined either explicitly with a set of hidden Markov models (HMMs) for the use in penalized logistic regression (PLR), or implicitly through a sequence kernel to be used with kernel logistic regression (KLR). Unlike previous work that has used HMMs in combination with a discriminative classification approach, we jointly optimize the logistic regression parameters and the HMM parameters using a penalized likelihood criterion.
Experiments show that joint optimization improves the recognition accuracy significantly. The sequence kernel we present is motivated by the dynamic time warping (DTW) distance between two feature vector sequences. Instead of considering only the optimal alignment path, we sum up the contributions from all alignment paths. Preliminary experiments with the sequence kernel show promising results.
A two-step approach is used for handling the sequence labeling problem. In the first step, a set of HMMs is used to generate an N-best list of sentence hypotheses for a spoken utterance. In the second step, these sentence hypotheses are rescored using logistic regression on the segments in the N-best list. A garbage class is introduced in the logistic regression framework in order to get reliable probability estimates for the segments in the N-best lists. We present results on both a connected digit recognition task and a continuous phone recognition task.
Sukittanon, Somsak. "Modulation scale analysis : theory and application for nonstationary signal classification /." Thesis, Connect to this title online; UW restricted, 2004. http://hdl.handle.net/1773/5875.
Full textWilson, Leslie. "The Music Muse." Thesis, Virginia Tech, 1996. http://hdl.handle.net/10919/36769.
Full textMaster of Science
George, E. Bryan. "An analysis-by-synthesis approach to sinusoidal modeling applied to speech and music signal processing." Diss., Georgia Institute of Technology, 1991. http://hdl.handle.net/1853/15747.
Full textSalvi, Giampiero. "Mining Speech Sounds : Machine Learning Methods for Automatic Speech Recognition and Analysis." Doctoral thesis, Stockholm : KTH School of Computer Science and Comunication, 2006. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-4111.
Full textSkjei, Thomas. "Real-Time Fundamental Frequency Estimation Algorithm for Disconnected Speech." VCU Scholars Compass, 2011. http://scholarscompass.vcu.edu/etd/191.
Full textFaubel, Friedrich [Verfasser], and Dietrich [Akademischer Betreuer] Klakow. "Statistical signal processing techniques for robust speech recognition / Friedrich Faubel. Betreuer: Dietrich Klakow." Saarbrücken : Saarländische Universitäts- und Landesbibliothek, 2016. http://d-nb.info/1090875703/34.
Full textChan, C. F. "Low bit-rate speech coding : A parallel processing approach using digital signal processors." Thesis, University of Essex, 1986. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.375652.
Full textMeyer, Georg. "Models of neurons in the ventral cochlear nucleus : signal processing and speech recognition." Thesis, Keele University, 1993. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.334715.
Full text健紘, 大田, and Kenko Ota. "Studies in signal processing for robust speech recognition in noisy and reverberant environments." Thesis, https://doors.doshisha.ac.jp/opac/opac_link/bibid/BB10268908/?lang=0, 2008. https://doors.doshisha.ac.jp/opac/opac_link/bibid/BB10268908/?lang=0.
Full textMcLurg, Craig J. (Craig James) Carleton University Dissertation Engineering Electrical. "Hardware and software for a speech and signal processing subsystem for a multiprocessor." Ottawa, 1987.
Find full textTryfou, Georgina. "Time-frequency reassignment for acoustic signal processing. From speech to singing voice applications." Doctoral thesis, University of Trento, 2017. http://eprints-phd.biblio.unitn.it/2562/2/PhD-Thesis.pdf.
Full textBakir, Tariq Saad. "Blind adaptive dereverberation of speech signals using a microphone array." Diss., Available online, Georgia Institute of Technology, 2004:, 2004. http://etd.gatech.edu/theses/available/etd-06072004-131047/unrestricted/bakir%5Ftariq%5Fs%5F200405%5Fphd.pdf.
Full textLeis, John W. "Spectral coding methods for speech compression and speaker identification." Thesis, Queensland University of Technology, 1998. https://eprints.qut.edu.au/36062/7/36062_Digitised_Thesis.pdf.
Full text