Dissertations / Theses on the topic 'Gaussian Mixture Model'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the top 50 dissertations / theses for your research on the topic 'Gaussian Mixture Model.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Browse dissertations / theses on a wide variety of disciplines and organise your bibliography correctly.
Lan, Jing. "Gaussian mixture model based system identification and control." [Gainesville, Fla.] : University of Florida, 2006. http://purl.fcla.edu/fcla/etd/UFE0014640.
Full textLu, Liang. "Subspace Gaussian mixture models for automatic speech recognition." Thesis, University of Edinburgh, 2013. http://hdl.handle.net/1842/8065.
Full textVakil, Sam. "Gaussian mixture model based coding of speech and audio." Thesis, McGill University, 2004. http://digitool.Library.McGill.CA:80/R/?func=dbin-jump-full&object_id=81575.
Full textThis work introduces a coding scheme which works in a perceptual auditory domain. The input high dimensional frames of audio and speech are transformed to power spectral domain, using either DFT or MDCT. The log spectral vectors are then transformed to the excitation domain. In the quantizer section the vectors are DCT transformed and decorrelated. This operation gives the possibility of using diagonal covariances in modelling the data. Finally, a GMM based VQ is performed on the vectors.
In the decoder part the inverse operations are done. However, in order to prevent negative power spectrum elements due to inverse perceptual transformation in the decoder, instead of direct inversion, a Nonnegative Least Squares Algorithm has been used to switch back to frequency domain. For the sake of comparison, a reference subband based "Excitation Distortion coder" is implemented and comparing the resulting coded files showed a better performance for the proposed GMM based coder.
Sadarangani, Nikhil 1979. "An improved Gaussian mixture model algorithm for background subtraction." Thesis, Massachusetts Institute of Technology, 2002. http://hdl.handle.net/1721.1/87293.
Full textIncludes bibliographical references (leaves 71-72).
by Nikhil Sadarangani.
M.Eng.
Stuttle, Matthew Nicholas. "A gaussian mixture model spectral representation for speech recognition." Thesis, University of Cambridge, 2003. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.620077.
Full textWang, Juan. "Estimation of individual treatment effect via Gaussian mixture model." HKBU Institutional Repository, 2020. https://repository.hkbu.edu.hk/etd_oa/839.
Full textDelport, Marion. "A spatial variant of the Gaussian mixture of regressions model." Diss., University of Pretoria, 2017. http://hdl.handle.net/2263/65883.
Full textDissertation (MSc)--University of Pretoria, 2017.
Statistics
MSc
Unrestricted
Malsiner-Walli, Gertraud, Sylvia Frühwirth-Schnatter, and Bettina Grün. "Model-based clustering based on sparse finite Gaussian mixtures." Springer, 2016. http://dx.doi.org/10.1007/s11222-014-9500-2.
Full textTran, Denis. "A study of bit allocation for Gaussian mixture model quantizers and image coders /." Thesis, McGill University, 2005. http://digitool.Library.McGill.CA:80/R/?func=dbin-jump-full&object_id=83937.
Full textFirst, a Greedy level allocation algorithm is developed based on the philosophy of the Greedy algorithin but, it does so level by level, considering the best benefit and bit cost yielded by an allocation. The Greedy level allocation algorithm is computationally intensive in general, thus we discuss combining it with other algorithms to obtain lower costs.
Second, another algorithm solving problems of negative bit allocations and integer level is proposed. The level allocations are to keep a certain ratio with respect to each other throughout the algorithm in order to remain closest to the condition for lowest distortion. Moreover, the original formula assumes a 6dB gain for each added bit, which is not generally true. The algorithm presents a new parameter k, which controls the benefit of adding one bit, usually set at 0.5 in the high-rate optimal bit allocation formula for MSE calling the new algorithm, the Two-Stage Iterative Bit Allocation (TSIBA) algorithm. Simulations show that modifying the bit allocation formula effectively brings about some gains over the previous methods.
The formula containing the new parameter is generalized into a, formula introducing a new parameter which weights not only the variances but also the dimensions, training the new parameter on their distribution function. The TSIBA was an a-posteriori decision algorithm, where the decision on which value of k to select for lowest distortion was decided after computing all distortions. The Generalized TSIBA (GTSIBA), on the other hand, uses a training procedure to estimate which weighting factor to set for each dimension at a certain bit rate. Simulation results show yet another improvement when using the Generalized TSIBA over all previous methods.
Shashidhar, Sanda, and Amirisetti Sravya. "Online Handwritten Signature Verification System : using Gaussian Mixture Model and Longest Common Sub-Sequences." Thesis, Blekinge Tekniska Högskola, Institutionen för tillämpad signalbehandling, 2017. http://urn.kb.se/resolve?urn=urn:nbn:se:bth-15807.
Full textMoradiannejad, Ghazaleh. "People Tracking Under Occlusion Using Gaussian Mixture Model and Fast Level Set Energy Minimization." Thèse, Université d'Ottawa / University of Ottawa, 2013. http://hdl.handle.net/10393/24304.
Full textWebb, Grayson. "A Gaussian Mixture Model based Level Set Method for Volume Segmentation in Medical Images." Thesis, Linköpings universitet, Beräkningsmatematik, 2018. http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-148548.
Full textLindström, Kevin. "Fault Clustering With Unsupervised Learning Using a Modified Gaussian Mixture Model and Expectation Maximization." Thesis, Linköpings universitet, Fordonssystem, 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-176535.
Full textDahlqwist, Elisabeth. "Birthweight-specific neonatal health : With application on data from a tertiaryhospital in Tanzania." Thesis, Uppsala universitet, Statistiska institutionen, 2014. http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-227531.
Full textKullmann, Emelie. "Speech to Text for Swedish using KALDI." Thesis, KTH, Optimeringslära och systemteori, 2016. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-189890.
Full textDe senaste åren har olika tillämpningar inom människa-dator interaktion och främst taligenkänning hittat sig ut på den allmänna marknaden. Många system och tekniska produkter stöder idag tjänsterna att transkribera tal och diktera text. Detta gäller dock främst de större språken och sällan finns samma stöd för mindre språk som exempelvis svenskan. I detta examensprojekt har en modell för taligenkänning på svenska ut- vecklas. Det är genomfört på uppdrag av Sveriges Radio som skulle ha stor nytta av en fungerande taligenkänningsmodell på svenska. Modellen är utvecklad i ramverket Kaldi. Två tillvägagångssätt för den akustiska träningen av modellen är implementerade och prestandan för dessa två är evaluerade och jämförda. Först tränas en modell med användningen av Hidden Markov Models och Gaussian Mixture Models och slutligen en modell där Hidden Markov Models och Deep Neural Networks an- vänds, det visar sig att den senare uppnår ett bättre resultat i form av måttet Word Error Rate.
Tomashenko, Natalia. "Speaker adaptation of deep neural network acoustic models using Gaussian mixture model framework in automatic speech recognition systems." Thesis, Le Mans, 2017. http://www.theses.fr/2017LEMA1040/document.
Full textDifferences between training and testing conditions may significantly degrade recognition accuracy in automatic speech recognition (ASR) systems. Adaptation is an efficient way to reduce the mismatch between models and data from a particular speaker or channel. There are two dominant types of acoustic models (AMs) used in ASR: Gaussian mixture models (GMMs) and deep neural networks (DNNs). The GMM hidden Markov model (GMM-HMM) approach has been one of the most common technique in ASR systems for many decades. Speaker adaptation is very effective for these AMs and various adaptation techniques have been developed for them. On the other hand, DNN-HMM AMs have recently achieved big advances and outperformed GMM-HMM models for various ASR tasks. However, speaker adaptation is still very challenging for these AMs. Many adaptation algorithms that work well for GMMs systems cannot be easily applied to DNNs because of the different nature of these models. The main purpose of this thesis is to develop a method for efficient transfer of adaptation algorithms from the GMM framework to DNN models. A novel approach for speaker adaptation of DNN AMs is proposed and investigated. The idea of this approach is based on using so-called GMM-derived features as input to a DNN. The proposed technique provides a general framework for transferring adaptation algorithms, developed for GMMs, to DNN adaptation. It is explored for various state-of-the-art ASR systems and is shown to be effective in comparison with other speaker adaptation techniques and complementary to them
Kawaguchi, Nobuo, Katsuhiko Kaji, Susumu Fujita, 信夫 河口, 克彦 梶, and 迪. 藤田. "Gaussian Mixture Model を用いた無線LAN位置推定手法." 一般社団法人情報処理学会, 2010. http://hdl.handle.net/2237/15430.
Full textKawaguchi, Nobuo, Katsuhiko Kaji, Susumu Fujita, 信夫 河口, 克彦 梶, and 迪. 藤田. "Gaussian Mixture Model を用いた無線LAN位置推定手法." 一般社団法人情報処理学会, 2011. http://hdl.handle.net/2237/15440.
Full textMinh, Tuan Pham, Tomohiro Yoshikawa, Takeshi Furuhashi, and Kaita Tachibana. "Robust feature extractions from geometric data using geometric algebra." IEEE, 2009. http://hdl.handle.net/2237/13896.
Full textYang, Chenguang. "Security in Voice Authentication." Digital WPI, 2014. https://digitalcommons.wpi.edu/etd-dissertations/79.
Full textZhao, David Yuheng. "Model Based Speech Enhancement and Coding." Doctoral thesis, Stockholm : Kungliga Tekniska högskolan, 2007. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-4412.
Full textBere, Alphonce. "Some non-standard statistical dependence problems." University of the Western Cape, 2016. http://hdl.handle.net/11394/4868.
Full textThe major result of this thesis is the development of a framework for the application of pair-mixtures of copulas to model asymmetric dependencies in bivariate data. The main motivation is the inadequacy of mixtures of bivariate Gaussian models which are commonly fitted to data. Mixtures of rotated single parameter Archimedean and Gaussian copulas are fitted to real data sets. The method of maximum likelihood is used for parameter estimation. Goodness-of-fit tests performed on the models giving the highest log-likelihood values show that the models fit the data well. We use mixtures of univariate Gaussian models and mixtures of regression models to investigate the existence of bimodality in the distribution of the widths of autocorrelation functions in a sample of 119 gamma-ray bursts. Contrary to previous findings, our results do not reveal any evidence of bimodality. We extend a study by Genest et al. (2012) of the power and significance levels of tests of copula symmetry, to two copula models which have not been considered previously. Our results confirm that for small sample sizes, these tests fail to maintain their 5% significance level and that the Cramer-von Mises-type statistics are the most powerful.
Bekli, Zeid, and William Ouda. "A performance measurement of a Speaker Verification system based on a variance in data collection for Gaussian Mixture Model and Universal Background Model." Thesis, Malmö universitet, Fakulteten för teknik och samhälle (TS), 2018. http://urn.kb.se/resolve?urn=urn:nbn:se:mau:diva-20122.
Full textZhang, Di. "INFORMATION THEORETIC CRITERIA FOR IMAGE QUALITY ASSESSMENT BASED ON NATURAL SCENE STATISTICS." Thesis, University of Waterloo, 2006. http://hdl.handle.net/10012/2842.
Full textThe goal of objective image quality assessment is to introduce a computational quality metric that can predict image or video quality. Many methods have been proposed in the past decades. Traditionally, measurements convert the spatial data into some other feature domains, such as the Fourier domain, and detect the similarity, such as mean square distance or Minkowsky distance, between the test data and the reference or perfect data, however only limited success has been achieved. None of the complicated metrics show any great advantage over other existing metrics.
The common idea shared among many proposed objective quality metrics is that human visual error sensitivities vary in different spatial and temporal frequency and directional channels. In this thesis, image quality assessment is approached by proposing a novel framework to compute the lost information in each channel not the similarities as used in previous methods. Based on natural scene statistics and several image models, an information theoretic framework is designed to compute the perceptual information contained in images and evaluate image quality in the form of entropy.
The thesis is organized as follows. Chapter I give a general introduction about previous work in this research area and a brief description of the human visual system. In Chapter II statistical models for natural scenes are reviewed. Chapter III proposes the core ideas about the computation of the perceptual information contained in the images. In Chapter IV, information theoretic criteria for image quality assessment are defined. Chapter V presents the simulation results in detail. In the last chapter, future direction and improvements of this research are discussed.
Roy, Tamoghna. "BER Modeling for Interference Canceling Adaptive NLMS Equalizer." Thesis, Virginia Tech, 2014. http://hdl.handle.net/10919/78055.
Full textMaster of Science
Zelenty, Jennifer Evelyn. "Effects of nickel and manganese on the embrittlement of low-copper pressure vessel steels." Thesis, University of Oxford, 2016. https://ora.ox.ac.uk/objects/uuid:28b9151f-1644-470b-abc7-48ff82bcffdd.
Full textZhang, Lin. "Semiparametric Bayesian Kernel Survival Model for Highly Correlated High-Dimensional Data." Diss., Virginia Tech, 2018. http://hdl.handle.net/10919/95040.
Full textPHD
Sikora, Jan. "Statický model scény." Master's thesis, Vysoké učení technické v Brně. Fakulta elektrotechniky a komunikačních technologií, 2013. http://www.nusl.cz/ntk/nusl-220128.
Full textXu, Li. "Statistical Methods for Variability Management in High-Performance Computing." Diss., Virginia Tech, 2021. http://hdl.handle.net/10919/104184.
Full textDoctor of Philosophy
This dissertation focuses on three projects that are all related to statistical methods in performance variability management in high-performance computing (HPC). HPC systems are computer systems that create high performance by aggregating a large number of computing units. The performance of HPC is measured by the throughput of a benchmark called the IOZone Filesystem Benchmark. The performance variability is the variation among throughputs when the system configuration is fixed. Variability management involves studying the relationship between performance variability and the system configuration. In Chapter 2, we use several existing prediction models to predict the standard deviation of throughputs given different system configurations and compare the accuracy of predictions. We also conduct HPC system optimization using the chosen prediction model as the objective function. In Chapter 3, we use the mixture model to determine the number of modes in the distribution of throughput under different system configurations. In addition, we develop a model to determine the number of additional runs for future benchmark experiments. In Chapter 4, we develop a statistical model that can predict the throughout distributions given the system configurations. We also compare the prediction of summary statistics of the throughput distributions with existing prediction models.
Oliveira, Luan Soares. "Classificação de fluxos de dados não estacionários com algoritmos incrementais baseados no modelo de misturas gaussianas." Universidade de São Paulo, 2015. http://www.teses.usp.br/teses/disponiveis/55/55134/tde-06042016-143503/.
Full textLearning concepts from data streams differs significantly from traditional batch learning. In batch learning there is an implicit assumption that the concept to be learned is static and does not evolve significantly over time. On the other hand, in data stream learning the concepts to be learned may evolve over time. This evolution is called concept drift, and makes the creation of a fixed training set be no longer applicable. Incremental learning paradigm is a promising approach for learning in a data stream setting. However, in the presence of concept drifts, out dated concepts can cause misclassifications. Several incremental Gaussian mixture models methods have been proposed in the literature, but these algorithms lack an explicit policy to discard outdated concepts. In this work, a new incremental algorithm for data stream with concept drifts based on Gaussian Mixture Models is proposed. The proposed methodis compared to various algorithms widely used in the literature, and the results show that it is competitive with them invarious scenarios, overcoming them in some cases.
Wu, Jingwen. "Model-based clustering and model selection for binned data." Thesis, Supélec, 2014. http://www.theses.fr/2014SUPL0005/document.
Full textThis thesis studies the Gaussian mixture model-based clustering approaches and the criteria of model selection for binned data clustering. Fourteen binned-EM algorithms and fourteen bin-EM-CEM algorithms are developed for fourteen parsimonious Gaussian mixture models. These new algorithms combine the advantages in computation time reduction of binning data and the advantages in parameters estimation simplification of parsimonious Gaussian mixture models. The complexities of the binned-EM and the bin-EM-CEM algorithms are calculated and compared to the complexities of the EM and the CEM algorithms respectively. In order to select the right model which fits well the data and satisfies the clustering precision requirements with a reasonable computation time, AIC, BIC, ICL, NEC, and AWE criteria, are extended to binned data clustering when the proposed binned-EM and bin-EM-CEM algorithms are used. The advantages of the different proposed methods are illustrated through experimental studies
Mangayyagari, Srikanth. "Voice recognition system based on intra-modal fusion and accent classification." [Tampa, Fla.] : University of South Florida, 2007. http://purl.fcla.edu/usf/dc/et/SFE0002229.
Full textAndrésen, Anton, and Adam Håkansson. "Comparing unsupervised clustering algorithms to locate uncommon user behavior in public travel data : A comparison between the K-Means and Gaussian Mixture Model algorithms." Thesis, Tekniska Högskolan, Jönköping University, JTH, Datateknik och informatik, 2020. http://urn.kb.se/resolve?urn=urn:nbn:se:hj:diva-49243.
Full textPalmer, Jason Allan. "Variational and scale mixture representations of non-Gaussian densities for estimation in the Bayesian Linear Model sparse coding, independent component analysis, and minimum entropy segmentation /." Diss., Connect to a 24 p. preview or request complete full text in PDF format. Access restricted to UC campuses, 2006. http://wwwlib.umi.com/cr/ucsd/fullcit?p3237562.
Full textTitle from first page of PDF file (viewed December 13, 2006). Available via ProQuest Digital Dissertations. Vita. Includes bibliographical references (p. 143-150).
Xiao, Ying. "New tools for unsupervised learning." Diss., Georgia Institute of Technology, 2014. http://hdl.handle.net/1853/52995.
Full textRaman, Pujita. "Speaker Identification and Verification Using Line Spectral Frequencies." Thesis, Virginia Tech, 2015. http://hdl.handle.net/10919/52964.
Full textMaster of Science
Miyajima, Chiyomi, Yoshihiro Nishiwaki, Koji Ozawa, Toshihiro Wakita, Katsunobu Itou, Kazuya Takeda, and Fumitada Itakura. "Driver Modeling Based on Driving Behavior and Its Evaluation in Driver Identification." IEEE, 2007. http://hdl.handle.net/2237/9623.
Full textJia, Jia. "Interactive Imaging via Hand Gesture Recognition." Thesis, University of Bradford, 2009. http://hdl.handle.net/10454/4259.
Full textBusch, Andrew W. "Wavelet transform for texture analysis with application to document analysis." Thesis, Queensland University of Technology, 2004. https://eprints.qut.edu.au/15908/1/Andrew_Busch_Thesis.pdf.
Full textBusch, Andrew W. "Wavelet Transform For Texture Analysis With Application To Document Analysis." Queensland University of Technology, 2004. http://eprints.qut.edu.au/15908/.
Full textLidija, Krstanović. "Mera sličnosti između modela Gausovih smeša zasnovana na transformaciji prostora parametara." Phd thesis, Univerzitet u Novom Sadu, Fakultet tehničkih nauka u Novom Sadu, 2017. https://www.cris.uns.ac.rs/record.jsf?recordId=104904&source=NDLTD&language=en.
Full textThis thesis studies the possibility that the parameters of Gaussian components of aparticular Gaussian Mixture Model (GMM) lie approximately on a lower-dimensionalsurface embedded in the cone of positive definite matrices. For that case, we delivernovel, more efficient similarity measure between GMMs, by LPP-like projecting thecomponents of a particular GMM, from the high dimensional original parameter space,to a much lower dimensional space. Thus, finding the distance between two GMMs inthe original space is reduced to finding the distance between sets of lowerdimensional euclidian vectors, pondered by corresponding weights. The proposedmeasure is suitable for applications that utilize high dimensional feature spaces and/orlarge overall number of Gaussian components. We confirm our results on artificial, aswell as real experimental data.
Jose, Neenu. "SPEAKER AND GENDER IDENTIFICATION USING BIOACOUSTIC DATA SETS." UKnowledge, 2018. https://uknowledge.uky.edu/ece_etds/120.
Full textBjarnason, Brynjar Smári. "Clustering metagenome contigs using coverage with CONCOCT." Thesis, KTH, Skolan för datavetenskap och kommunikation (CSC), 2017. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-208944.
Full textMetagenomik möjliggör analys av arvsmassor i mikrobiella floror utan att först behöva odla mikroorgansimerna. Metoden innebär att man läser korta DNA-snuttar som sedan pusslas ihop till längre genomfragment (kontiger). Genom att gruppera kontiger som härstammar från samma organism kan man sedan återskapa mer eller mindre fullständiga genom, men detta är en svår bioinformatisk utmaning. Målsättningen med det här projektet var att utvärdera precisionen med vilken mjukvaran CONCOCT, som vi nyligen utvecklat, grupperar kontiger som härstammar från samma organism baserat på information om kontigernas sekvenskomposition och abundansprofil över olika prover. Vi testade hur olika parametrar påverkade klustringen av kontiger i artificiella metagenomdataset av olika komplexitet som vi skapade in silico genom att blanda data från tidigare sekvenserade genom. Parametrarna som testades rörde indata såväl som den statistiska modell som CONCOCT använder för att utföra klustringen. Parametrarna varierades en i taget medan de andra parametrarna hölls konstanta. Antalet kluster hölls också konstant och motsvarade antalet olika organismer i flororna. Bäst resultat erhölls då vi använde en låst kovariansmodell och använde principalkomponenter som förklarade 90% av variansen, samt filtrerade bort kontiger som var kortare än 3000 baspar. Vi fick också bäst resultat då vi använde alla tillgängliga prover. Därefter använde vi dessa parameterinställningar och lät CONCOCT själv bestämma lämpligt antal kluster i dataseten med “Bayesian Information Criterion” - metoden som då var implementerad i CONCOCT. Detta gav otillfredsställande resultat med i regel för få och för stora kluster. Därför testade vi en alternativ metod, “Dirichlet Process Gaussian Mixture Model”, för att uppskatta antal kluster. Denna metod gav avsevärt bättre resultat och i senare versioner av CONCOCT har en liknande metod implementerats.
Lai, Chu-Shiuan, and 賴竹煖. "Gaussian Mixture of Background and Shadow Model." Thesis, 2010. http://ndltd.ncl.edu.tw/handle/38760050190895130218.
Full text國立臺灣師範大學
資訊工程研究所
98
In this paper, we integrate shadow information into the background model of a scene in an attempt to detect both shadows and foreground objects at a time. Since shadows accompanying foreground objects are viewed as parts of the foreground objects, shadows will be extracted as well during foreground object detection. Shadows can distort object shapes and may connect multiple objects into one object. On the other hand, shadows tell the directions of light sources. In other words, shadows can be advantageous as well as disadvantageous. To begin, we use an adaptive Gaussian mixture model to describe the background of a scene. Based on this preliminary background model, we extract foreground objects and their accompanying shadows. Shadows are next separated from foreground objects through a series of intensity and color analyses. The characteristics of shadows are finally determined with the principal component analysis method and are embedded as an additional Gaussian in the background model. Experimental results demonstrated the feasibility of the proposed background model. Keywords: Dynamic scene, Adaptive Gaussian Mixture Model, Foreground detection, Shadow detection
Sue, Yung-Chun, and 蘇詠鈞. "Specified Gestures Identification using Gaussian Mixture Model." Thesis, 2012. http://ndltd.ncl.edu.tw/handle/15219540833691164661.
Full text清雲科技大學
電子工程所
100
Sign language recognition technique is composed by the hand images detection and the hand gestures recognition. Hand images detection is locating the sign language select, sign language capture, the palm and fingers part from the sensed image, and rotating them to the appropriate hand posture, both are the important pre-processing for sign language identification and recognition. This paper first introduced sequentially throughout the study practices, as well as the process of image pre-processing instructions. The major work in the hand gestures recognition is to identify the variance of the fingers. In this paper the creation of sign language image of slash encoding, Department of the advantages of slash encoding the difference between your fingers the number of changes, and the Gaussian mixture model (GMM) to establish the model of sign language and identification. Such as poor recognition rate is adjusted probability distribution of weight values to improve the recognition rate. The entire the paper Shushing is the Gaussian mixture model (GMM), slash code, adjust the probability distribution of the weight value. Finally, after adjusting the probability distribution of weight values, we learned from the conclusion that the overall recognition results rose to 98.33%from 92.66% of the original, so changing the probability distribution of the weight value can effectively improve the recognition rate.
莊清乾. "Automatic Bird Songs Recognition using Gaussian Mixture Model." Thesis, 2008. http://ndltd.ncl.edu.tw/handle/09774268339453426682.
Full text中華大學
資訊工程學系(所)
96
In this paper, Gaussian mixture models (GMM) were applied to identify bird species from their sounds. First, each syllable corresponding to a piece of vocalization is manually segmented. Two-dimension MFCC (TDMFCC), dynamic two-dimension MFCC (DTDMFCC), and normalized audio spectrum envelope (NASE) modulation coefficients are calculated for each syllable and regarded as the vocalization features of each syllable. Principal component analysis (PCA) is used to reduce the feature space dimension of the original input features vector space. GMM is used to cluster the feature vectors from the same bird species into several groups with each group represented by a Gaussian distribution. The self-splitting Gaussian mixture learning (SGML) algorithm is then employed to find an appropriate number of Gaussian components for each GMM. In addition, a model selection algorithm based on the Bayesian information criterion (BIC) is applied to select the optimal model between GMM and extended VQ (EVQ) according to the amount of training data available. Linear discriminant analysis (LDA) is finally exploited to increase the classification accuracy at a lower dimensional feature vector space. In our experiments, the combination of TDMFCC, DTDMFCC, and NASE modulation coefficients achieve the average classification accuracy of 83.9% for the classification of 28 bird species.
LIN, YU-JUNG, and 林昱融. "Modified Gaussian Mixture Model Applied to Speaker Verification." Thesis, 2016. http://ndltd.ncl.edu.tw/handle/33cbau.
Full text中央警察大學
刑事警察研究所
104
Gaussian mixture model (GMM) is a combination of a plurality of Gaussian probability density function, it can be smoothly approximate the probability density distribution of any arbitrary shape. In various areas of pattern recognition, it has a good recognition results. However, during building the speaker model process, we must determine the parameters of each Gaussian probability density function through constantly iterative calculation, the calculation process is quite complex. This paper presents modified Gaussian mixture model, each characteristic for recognition has its own independent Gaussian probability density function. Since the process without iteration, it can significantly reduce the amount of calculation. And the speaker verification results show that it can still maintain a good recognition results. In this paper, we use Mel frequency cepstral coefficients(MFCCs) as the voice characteristic for speaker verification. The average error rate for speaker verification on Gaussian mixture model is 0.5901%, while it on modified Gaussian mixture model is 1.6700%, the gap between them was 1.0799%. The error rate of two methods during the speaker verification has less difference. But in the speaker model build process, modified Gaussian mixture model does not need to go through an iterative calculation. The calculation ways and time are more simple and faster than Gaussian mixture model. It can also be another consideration of algorithm for more speed and convenience.
Teng, Yu-Hsiang, and 鄧玉祥. "Establishment of Background Image using Gaussian Mixture Model." Thesis, 2010. http://ndltd.ncl.edu.tw/handle/28291711624177215723.
Full text義守大學
資訊工程學系碩士班
98
In the field of object tracking, a background image is often established before further processing. One typical approach for object tracking is to filter out irrelevant background images as well as reserving only the pixel information of the foreground. For such method, it would be more convenient to construct the background image in advance for further processing. Although background subtraction and frame differences are easy to implement, they are easily affected by external environment. For instance, the changes of light source or the sway of leaves could mistake the background as the foreground. However, the Gaussian mixture model uses statistical method to generate the background images, so it could use the statistical method to distinguish the changes of the light source as well as slight movements. Therefore, the background objects will not be regarded as the foreground objects. Exponential smoothing is based on time series with stability or regularity, therefore the time series can be reasonably extended. Exponential smoothing needs more complete historical information to accurately and objectively predict the future trend, followed by the forecast conclusion. The study used a static camera, so the image information will not change violently. The stationary background image displayed in each frames will be relevant with previous frames; as a result, exponential smoothing method can be used in establishing background images. The exponential smoothing for Gaussian mixture model proposed in this study is to use the characteristics of exponential smoothing to predict the future trends. This method can effectively construct the background image when there are many moving objects. Also, the computation is efficient in comparison to the traditional Gaussian mixture model method.
Verma, Nishchal Kumar. "Gaussian mixture model based non-additive fuzzy systems." Thesis, 2006. http://localhost:8080/xmlui/handle/12345678/5595.
Full textChang, Zhi-Jie, and 張智傑. "Language Identification based on Gaussian Mixture Model Tokenizer and Language Model." Thesis, 2005. http://ndltd.ncl.edu.tw/handle/81926368859761798108.
Full text