Academic literature on the topic 'LPC coefficients'

Consult the lists of relevant articles, books, theses, conference reports, and other scholarly sources on the topic 'LPC coefficients.'

Journal articles on the topic "LPC coefficients"

1

Choi, Jae-Seung. "Speaker Recognition using LPC cepstrum Coefficients and Neural Network." Journal of the Korean Institute of Information and Communication Engineering 15, no. 12 (December 31, 2011): 2521–26. http://dx.doi.org/10.6109/jkiice.2011.15.12.2521.

2

Olive, Joseph P. "Mixed spectral representation—Formants and LPC coefficients." Journal of the Acoustical Society of America 85, S1 (May 1989): S59. http://dx.doi.org/10.1121/1.2027054.

3

Jung, Won-Jin, and Moo-Young Kim. "Quantization of LPC Coefficients Using a Multi-frame AR-model." Journal of the Acoustical Society of Korea 31, no. 2 (February 29, 2012): 93–99. http://dx.doi.org/10.7776/ask.2012.31.2.093.

4

Pérez, María Salomé, and Enrique Carrera. "LPC-based Feature Coefficients for Voice Authentication Tasks." MASKAY 2, no. 1 (November 1, 2012): 73. http://dx.doi.org/10.24133/maskay.v2i1.151.

Abstract:
Voice authentication is a promising biometric technique based on extracting important information from the speech signal by means of computing a vector of feature coefficients. Based on that, this paper evaluates the effectiveness of linear predictive coefficients when combined with other simple metrics in voice authentication tasks. Linear predictive coefficients were chosen due to their relatively good performance and their not-so-complicated structures when compared to other similar alternatives. All the feature coefficients have been evaluated through an extensive parameter space study in order to apprehend the main limitations and potentials of voice authentication under different scenarios. For such an evaluation, a classifier based on artificial neural networks has been implemented.
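
The abstract above does not say how its linear predictive coefficients are computed. As a minimal sketch, assuming the common autocorrelation method solved with the Levinson-Durbin recursion, the coefficients of one speech frame could be obtained as follows. The function names (`autocorrelation`, `levinson_durbin`, `lpc`) and the sign convention A(z) = 1 + a1 z^-1 + ... + ap z^-p are illustrative choices, not taken from the paper.

```python
def autocorrelation(frame, max_lag):
    """Biased autocorrelation r[0..max_lag] of one frame (hypothetical helper)."""
    n = len(frame)
    return [sum(frame[i] * frame[i + lag] for i in range(n - lag))
            for lag in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Solve the LPC normal equations for the given autocorrelation values.

    Returns (a, err): a[0] == 1.0 and a[1..order] are the predictor
    polynomial coefficients; err is the final prediction error energy.
    """
    a = [1.0] + [0.0] * order
    err = r[0]
    for m in range(1, order + 1):
        if err <= 0.0:
            # Silent or perfectly predictable frame: stop refining the model.
            break
        acc = r[m] + sum(a[j] * r[m - j] for j in range(1, m))
        k = -acc / err                      # reflection coefficient of stage m
        prev = a[:]
        for j in range(1, m):
            a[j] = prev[j] + k * prev[m - j]
        a[m] = k
        err *= (1.0 - k * k)
    return a, err


def lpc(frame, order):
    """LPC coefficients of one frame via the autocorrelation method."""
    return levinson_durbin(autocorrelation(frame, order), order)


# Usage: a decaying exponential is predicted by x[n] = 0.9 * x[n-1],
# so a[1] comes out close to -0.9 under the sign convention above.
coeffs, residual_energy = lpc([0.9 ** n for n in range(200)], 2)
```

In practice the frame would usually be windowed (for example with a Hamming window) before the autocorrelation is taken; that step is left out here to keep the sketch short.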
5

Hong Kook Kim, Seung Ho Choi, and Hwang Soo Lee. "On approximating line spectral frequencies to LPC cepstral coefficients." IEEE Transactions on Speech and Audio Processing 8, no. 2 (March 2000): 195–99. http://dx.doi.org/10.1109/89.824705.

6

Sanches, I. "From LPC to normalised autocorrelation coefficients through a matrix." Electronics Letters 34, no. 4 (1998): 333. http://dx.doi.org/10.1049/el:19980310.

7

Mohd Ali, Yusnita, Alhan Farhanah Abd Rahim, Emilia Noorsal, Zuhaila Mat Yassin, Nor Fadzilah Mokhtar, and Mohamad Helmy Ramlan. "Fuzzy-based voiced-unvoiced segmentation for emotion recognition using spectral feature fusions." Indonesian Journal of Electrical Engineering and Computer Science 19, no. 1 (July 1, 2020): 196. http://dx.doi.org/10.11591/ijeecs.v19.i1.pp196-206.

Abstract:
Despite abundant growth in automatic emotion recognition system (ERS) studies using various feature extraction techniques and classifiers, few sources have sought to improve the system through pre-processing techniques. This paper proposes a smart pre-processing stage using a fuzzy logic inference system (FIS) based on the Mamdani engine and simple time-based features, i.e. zero-crossing rate (ZCR) and short-time energy (STE), to initially identify a frame as voiced (V) or unvoiced (UV). Mel-frequency cepstral coefficients (MFCC) and linear prediction coefficients (LPC) were tested with K-nearest neighbours (KNN) classifiers to evaluate the proposed FIS V-UV segmentation. The authors also introduce two feature fusions, of MFCC and of LPC with formants, to obtain better performance. In the experiments the proposed system surpassed the conventional ERS, with a rise in accuracy rate of between 3.7% and 9.0%. The fusion of LPC and formants, named SFF LPC-fmnt, gave a promising result of between 1.3% and 5.1% higher accuracy than its baseline features in classifying neutral, angry, happy and sad emotions. The best accuracy rates for male and female speakers were 79.1% and 79.9% respectively, obtained with the SFF MFCC-fmnt fusion technique.
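
The two time-based features named in the abstract above can be sketched per frame as below. This shows only the feature computation; the fuzzy (Mamdani) inference stage that the paper builds on top of them is not reproduced. The function names and the exact normalisations (crossings per adjacent sample pair, mean squared amplitude) are assumptions, since the abstract does not define them.

```python
def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ (zero counts as positive)."""
    if len(frame) < 2:
        return 0.0
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)


def short_time_energy(frame):
    """Mean squared amplitude of the frame."""
    if not frame:
        return 0.0
    return sum(x * x for x in frame) / len(frame)


# Usage: a rapidly alternating frame has the maximum ZCR, as is typical of
# noise-like unvoiced speech, while voiced frames tend to show a lower ZCR
# and a higher energy.
noisy = [1.0, -1.0, 1.0, -1.0]
zcr, ste = zero_crossing_rate(noisy), short_time_energy(noisy)
```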
8

Singh, Mandeep, and Gurpreet Singh. "Word recognition from speech signal using linear predictive coding and spectrum analysis." International Journal of Engineering & Technology 7, no. 3 (July 16, 2018): 1531. http://dx.doi.org/10.14419/ijet.v7i3.13285.

Abstract:
This paper presents a technique for isolated word recognition from a speech signal using spectrum analysis and Linear Predictive Coding (LPC). The study analyzes only those words that are commonly used by criminals during telephone conversations. Since each word is characterized by a unique frequency spectrum signature, the spectrum of the speech signal is analyzed using certain statistical parameters. These parameters help in recognizing a particular word, as each word has a unique value of a feature, which distinguishes one word from another. The second method is based on LPC coefficients: analysis of the features extracted from them helps identify a specific word in the input speech signal. Finally, the best features from the two methods are combined into a proposed hybrid technique. An accuracy of 94% was achieved for a sample size of 400 spoken words.
9

Subhashini, PPS. "Text-Independent Speaker Recognition Using Combined LPC and MFC Coefficients." International Journal of Research in Engineering and Technology 03, no. 06 (June 25, 2014): 508–14. http://dx.doi.org/10.15623/ijret.2014.0306095.

10

Moriya, Takehiro. "Method for the modification of LPC coefficients of acoustic signals." Journal of the Acoustical Society of America 104, no. 5 (November 1998): 2554. http://dx.doi.org/10.1121/1.423836.


Dissertations / Theses on the topic "LPC coefficients"

1

Crosmer, Joel R. "Very low bit rate speech coding using the line spectrum pair transformation of the LPC coefficients." Diss., Georgia Institute of Technology, 1985. http://hdl.handle.net/1853/15739.

2

Alvarenga, Rodrigo Jorge. "Reconhecimento de comandos de voz por redes neurais." Universidade de Taubaté, 2012. http://www.bdtd.unitau.br/tedesimplificado/tde_busca/arquivo.php?codArquivo=587.

Abstract:
Speech recognition systems are widely used in industry, in the improvement of human operations and procedures, and in entertainment and recreation. The specific objective of this study was to design and develop a voice recognition system capable of identifying voice commands regardless of the speaker. The main purpose of the system is to control the movement of robots, with applications in industry and in assisting people with physical disabilities. Decisions are made by a neural network trained on the distinctive features of the speech of 16 speakers. The command samples were collected by convenience sampling (with respect to age and sex) to ensure greater discrimination between voice characteristics and thus achieve generalization of the neural network. Preprocessing consisted of determining the endpoints of each spoken command and adaptive Wiener filtering. Each speech command was segmented into 200 windows with 25% overlap. The features used were the zero-crossing rate, the short-term energy, and the mel-frequency cepstral coefficients. The first two linear predictive coding coefficients and the prediction error were also tested. The neural network classifier was a multilayer perceptron trained by the backpropagation algorithm. Several experiments were performed to choose thresholds, practical values, features, and neural network configurations. The results were considered very good, reaching an accuracy rate of 89.16% under the worst-case conditions for the sampling of the commands.
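
The segmentation into overlapping windows mentioned in the abstract above can be sketched as follows. The function name `frame_signal`, the rounding of the hop size, and the choice to drop a trailing partial window are assumptions; the thesis abstract gives only the window count (200) and the overlap (25%).

```python
def frame_signal(samples, frame_len, overlap=0.25):
    """Split a sequence into fixed-length windows that overlap by the given fraction.

    A trailing window that would run past the end of the signal is dropped.
    """
    if frame_len <= 0 or not 0.0 <= overlap < 1.0:
        raise ValueError("frame_len must be positive and overlap in [0, 1)")
    hop = max(1, int(round(frame_len * (1.0 - overlap))))
    return [samples[start:start + frame_len]
            for start in range(0, len(samples) - frame_len + 1, hop)]


# Usage: windows of 4 samples with 25% overlap advance by 3 samples.
frames = frame_signal(list(range(10)), 4, 0.25)
```

To obtain a fixed number of windows per command, as the abstract describes, the window length would have to be derived from each command's duration; that step is not shown.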
3

Kubánková, Anna. "Automatická klasifikace digitálních modulací." Doctoral thesis, Vysoké učení technické v Brně. Fakulta elektrotechniky a komunikačních technologií, 2008. http://www.nusl.cz/ntk/nusl-233424.

Abstract:
This dissertation deals with a new method for digital modulation recognition. The introduction summarizes the history and present state of the topic and describes existing methods together with their characteristic properties. Recognition by means of artificial neural networks is presented in more detail. After the objective of the dissertation is set, the digital modulations chosen for recognition are described theoretically: FSK, MSK, BPSK, QPSK, and QAM-16, which are the modulations most used in modern communication systems. The proposed method is based on the analysis of magnitude and phase spectrograms of the modulated signals. Histograms of the spectrograms are used to examine their properties. They provide information on the number of carrier frequencies in the signal, which is used for FSK and MSK recognition, and on the number of phase states, by which BPSK, QPSK, and QAM-16 are classified. Spectrograms in which the characteristic attributes of the modulations are visible are obtained when the segment length equals the symbol length. It was found that, with a known symbol length, the modulation can be recognized correctly at a signal-to-noise ratio of at least 0 dB. The symbol length must therefore be detected before the spectrogram is calculated. Four methods were designed for this purpose: the autocorrelation function, cepstrum analysis, the wavelet transform, and LPC coefficients. These methods were implemented as algorithms and analyzed with signals disturbed by white Gaussian noise and phase noise, and with signals passed through a multipath fading channel. Detection by means of cepstrum analysis proved the most suitable and reliable. Finally, the new method for digital modulation recognition was verified with signals passed through a channel with properties close to those of a real one.
4

Fayad, Layal. "Caractérisation de la nouvelle chambre de simulation atmosphérique CHARME et étude de la réaction d’ozonolyse d’un COV biogénique, le γ-terpinène." Thesis, Littoral, 2019. https://documents.univ-littoral.fr/access/content/group/50b76a52-4e4b-4ade-a198-f84bc4e1bc3c/BULCO/Th%C3%A8ses/LPCA/These_Fayad_Layal.pdf.

Abstract:
The study of atmospheric processes is among the central topics of current environmental research. The most direct and significant way to investigate the transformation of pollutants and the formation of aerosols in the atmosphere is to simulate these processes under controlled and simplified conditions. To this end, a new simulation chamber, CHARME (CHamber for the Atmospheric Reactivity and the Metrology of the Environment), has been designed in the Laboratory of Physico-Chemistry of the Atmosphere (LPCA) at the University of Littoral Côte d’Opale (ULCO). CHARME is also dedicated to the development and validation of new spectroscopic approaches for the metrology of atmospheric species including gases, particles and radicals. The first aim of this research was to characterize all the technical, physical and chemical parameters of this new chamber and to optimize the methods for studying the atmospheric reactivity of volatile organic compounds (VOCs) and simulating the formation of secondary organic aerosols (SOA). The results of numerous experiments and tests show that CHARME is a suitable tool for reproducing chemical reactions occurring in the troposphere. The second research objective was to investigate the reaction of the biogenic VOC γ-terpinene with ozone. The rate coefficient at (294 ± 2) K and atmospheric pressure was determined and the gas-phase oxidation products were identified. The physical state and hygroscopicity of the secondary organic aerosols were also studied. To the author's knowledge, this work represents the first study on SOA formation from the ozonolysis of γ-terpinene.
5

Guérin, Frédéric. "ÉMISSION DE GAZ A EFFET DE SERRE (CO2, CH4) PAR UNE RETENUE DE BARRAGE HYDROÉLECTRIQUE EN ZONE TROPICALE (PETIT-SAUT, GUYANE FRANÇAISE) :EXPÉRIMENTATION ET MODÉLISATION." Phd thesis, Université Paul Sabatier - Toulouse III, 2006. http://tel.archives-ouvertes.fr/tel-00079947.

Abstract:
The emissions of carbon dioxide (CO2) and methane (CH4) and the carbon cycle in the Petit-Saut reservoir and the Sinnamary River (French Guiana) were studied with the aim of developing a coupled hydrodynamic-biogeochemical model. Developing this model required the study of three processes controlling these emissions: (i) the production of CO2 and CH4 during the degradation of organic matter (OM) from soils and plants, (ii) the aerobic oxidation of CH4 in the water column of the reservoir, and (iii) the gas exchange processes at the air-water interface.
Over 10 years, atmospheric emissions proved to be very significant, particularly during the first three years after impoundment, and then decreased over time. While 50% of the CO2 emissions occur at the surface of the lake, the CH4 emissions are mainly located downstream of the turbines.
The atmospheric emissions result from the degradation of the OM (soil and biomass from the tropical forest) flooded during impoundment, and their decrease over time follows from the exhaustion of the OM stock. After 10 years, 20% of the carbon stock had been mineralized and emitted to the atmosphere as CO2 and CH4. Aerobic CH4 oxidation converts more than 95% of the CH4 diffusing from the hypolimnion into CO2 in the water column of the lake, and 40% of the CH4 entering the river downstream. At the scale of the reservoir, this process is responsible for the oxidation of 90% of the CH4 produced and for 30% of the total CO2 emissions. The CH4 and CO2 that reach the surface waters of the reservoir are emitted to the atmosphere by diffusive fluxes. The study of this gas transfer process at the air-water interface shows that, in a tropical environment, diffusive fluxes are enhanced by high temperatures and rainfall events.
The model is based on the hydrodynamic model SYMPHONIE 2D and on the biogeochemical modules developed in this study from the kinetic data of the processes studied. The simulated vertical profiles of temperature, oxygen, CO2 and CH4 are well reproduced. This model lays the groundwork for an operational modeling tool for the Petit-Saut reservoir as well as for other reservoirs in tropical environments.
6

Guérin, Frédéric. "Emission de gaz à effet de serre (CO2,CH4) par une retenue de barrage hydroélectrique en zone tropicale (Petit-saut, Guyane française) : expérimentation et modélisation." Toulouse 3, 2006. https://tel.archives-ouvertes.fr/tel-00079947.

Abstract:
The emissions of carbon dioxide (CO2) and methane (CH4) and the carbon cycle in the Petit-Saut reservoir and in the Sinnamary River (French Guiana) were studied with the aim of developing a coupled physical/biogeochemical model. The development of this model required the study of three processes controlling these emissions: (i) CO2 and CH4 production during the mineralization, under anoxic conditions, of organic matter (OM) from soils and plants, (ii) aerobic CH4 oxidation in the water column of the lake, and (iii) the processes involved in gas exchange at the air-water interface. Over 10 years, atmospheric emissions were shown to be very significant, particularly in the first three years following the reservoir impoundment, and then decreased with time. While 50% of the CO2 emissions take place at the surface of the lake, the CH4 emissions are mainly localized downstream of the turbines. The atmospheric emissions result from the degradation of OM (soil and biomass originating from the tropical forest) flooded during impoundment, and their reduction with time results from the exhaustion of the OM stock. Ten years after impoundment, 20% of the carbon stock had been mineralized and emitted to the atmosphere in the form of CO2 and CH4. Aerobic CH4 oxidation transforms more than 95% of the CH4 diffusing upward from the hypolimnion into CO2 in the water column of the lake, and 40% of the CH4 entering the river downstream of the dam. In the whole Petit-Saut system, this process is responsible for the oxidation of 90% of the CH4 produced and for 30% of the total CO2 emissions. The CH4 and CO2 that reach the water surface of the reservoir and of the river downstream of the dam are emitted to the atmosphere by diffusive flux. The study of this gas transfer process at the air-water interface shows that, in a tropical environment, diffusive fluxes are enhanced by elevated temperatures and rainfall events.
The model is based on the hydrodynamic model SYMPHONIE 2D and the biogeochemical model developed during this study from the kinetic data of the studied processes. The simulated vertical profiles of temperature, oxygen, CO2 and CH4 are well reproduced. This model lays the groundwork for an operational modeling tool for the Petit-Saut reservoir as well as for other reservoirs in tropical environments.
7

Hong, Wei-ping, and 洪偉玶. "Usefulness of the LPC-Residue and LPC Coefficient in Text-Independent Speaker Verification." Thesis, 2007. http://ndltd.ncl.edu.tw/handle/95586261639335208085.

Abstract:
Master's thesis
National Kaohsiung First University of Science and Technology
Institute of Computer and Communication Engineering
Academic year 95
This thesis examines the usefulness of the LPC residue and the LPC coefficients in a speaker verification system. In the front-end feature extraction, the magnitude spectrum is first obtained from a 32 ms short-time segment of pre-emphasized speech and processed by a mel-scale filterbank. The filterbank output is then cosine transformed to produce the cepstral coefficients; the zeroth cepstral coefficient is not used in the feature vector. The coefficients are then passed to Gaussian mixture models (GMM), whose components are interpreted as representing broad acoustic classes. Finally, the system parameters are estimated by maximum likelihood. The system has two inputs: the original speech and the residual signal. The experiments show that the equal error rate (EER) obtained with MFCC is better than that obtained with LPCC. A new feature vector is also derived that combines MFCC extracted from the original signal with LPCC extracted from the residual signal, so that the two feature sets complement each other. With this new feature vector, the EER is better than with the MFCC feature vector alone.
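
Parts of the front end described in the abstract above (pre-emphasis, the mel scale, and the cosine transform that drops the zeroth coefficient) can be sketched as below. This is not the thesis's implementation: the pre-emphasis coefficient 0.97, the mel formula 2595 log10(1 + f/700), the unscaled DCT-II, and all function names are common conventions assumed here, and the filterbank itself is not shown.

```python
import math


def pre_emphasis(samples, coeff=0.97):
    """First-order high-pass filter y[n] = x[n] - coeff * x[n-1] (coeff is an assumed value)."""
    if not samples:
        return []
    return [samples[0]] + [samples[i] - coeff * samples[i - 1]
                           for i in range(1, len(samples))]


def hz_to_mel(freq_hz):
    """One widely used mel-scale mapping; other variants exist."""
    return 2595.0 * math.log10(1.0 + freq_hz / 700.0)


def mel_to_hz(mel):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def cepstra_from_log_energies(log_energies, n_ceps):
    """Unscaled DCT-II of log filterbank energies, returning c[1..n_ceps].

    The zeroth coefficient is skipped, as in the abstract above.
    """
    m = len(log_energies)
    return [sum(e * math.cos(math.pi * k * (i + 0.5) / m)
                for i, e in enumerate(log_energies))
            for k in range(1, n_ceps + 1)]


# Usage: a flat log spectrum carries no spectral shape, so every coefficient
# above the zeroth is (numerically) zero.
flat = cepstra_from_log_energies([2.0] * 8, 4)
```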

Book chapters on the topic "LPC coefficients"

1

Ratanpara, Tushar, and Narendra Patel. "Singer Identification Using MFCC and LPC Coefficients from Indian Video Songs." In Advances in Intelligent Systems and Computing, 275–82. Cham: Springer International Publishing, 2015. http://dx.doi.org/10.1007/978-3-319-13728-5_31.

2

Soto-Murillo, Manuel A., Karen E. Villagrana Bañuelos, Julieta G. Rodriguez-Ruiz, Jared D. Salinas-González, Carlos E. Galván-Tejada, Hamurabi Gamboa-Rosales, and Jorge I. Galván-Tejada. "Classification of Heart Health by LPC and MFCC Coefficients and Statistical Features." In IFMBE Proceedings, 104–12. Cham: Springer International Publishing, 2019. http://dx.doi.org/10.1007/978-3-030-30648-9_15.

3

Trabelsi, Imen, and Med Salim Bouhlel. "Comparison of Several Acoustic Modeling Techniques for Speech Emotion Recognition." In Cognitive Analytics, 283–93. IGI Global, 2020. http://dx.doi.org/10.4018/978-1-7998-2460-2.ch015.

Abstract:
Automatic Speech Emotion Recognition (SER) is a current research topic in the field of Human Computer Interaction (HCI) with a wide range of applications. The purpose of speech emotion recognition system is to automatically classify speaker's utterances into different emotional states such as disgust, boredom, sadness, neutral, and happiness. The speech samples in this paper are from the Berlin emotional database. Mel Frequency cepstrum coefficients (MFCC), Linear prediction coefficients (LPC), linear prediction cepstrum coefficients (LPCC), Perceptual Linear Prediction (PLP) and Relative Spectral Perceptual Linear Prediction (Rasta-PLP) features are used to characterize the emotional utterances using a combination between Gaussian mixture models (GMM) and Support Vector Machines (SVM) based on the Kullback-Leibler Divergence Kernel. In this study, the effect of feature type and its dimension are comparatively investigated. The best results are obtained with 12-coefficient MFCC. Utilizing the proposed features a recognition rate of 84% has been achieved which is close to the performance of humans on this database.
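
The abstract above lists both LPC and LPCC features. LPCC are commonly derived from the LPC coefficients by a short recursion rather than by taking a cepstrum directly; a minimal sketch of that recursion follows. It assumes the all-pole model H(z) = 1/A(z) with A(z) = 1 + a1 z^-1 + ... + ap z^-p, omits the gain term c0, and uses a hypothetical function name; the chapter's own procedure is not given in the abstract.

```python
def lpc_to_cepstrum(a, n_ceps):
    """Convert LPC polynomial coefficients a = [1, a1, ..., ap] to cepstral
    coefficients c[1..n_ceps] of the all-pole model 1/A(z)."""
    p = len(a) - 1
    c = [0.0] * (n_ceps + 1)
    for n in range(1, n_ceps + 1):
        # Only terms with 1 <= n - m <= p contribute to the sum.
        acc = 0.0
        for m in range(max(1, n - p), n):
            acc += m * c[m] * a[n - m]
        a_n = a[n] if n <= p else 0.0
        c[n] = -a_n - acc / n
    return c[1:]


# Usage: for a single pole at 0.9, i.e. A(z) = 1 - 0.9 z^-1, the cepstrum
# is 0.9**n / n, which the recursion reproduces.
cep = lpc_to_cepstrum([1.0, -0.9], 3)
```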
4

"Appendix A: Alternative Representations of the LPC Coefficients." In Speech Recognition Over Digital Channels, 225–26. Chichester, UK: John Wiley & Sons, Ltd, 2006. http://dx.doi.org/10.1002/0470024720.app1.

5

Aggarwal, Gaurav, and Latika Singh. "Comparisons of Speech Parameterisation Techniques for Classification of Intellectual Disability Using Machine Learning." In Research Anthology on Physical and Intellectual Disabilities in an Inclusive Society, 828–47. IGI Global, 2022. http://dx.doi.org/10.4018/978-1-6684-3542-7.ch046.

Abstract:
Classification of intellectually disabled children through manual assessment of speech at an early age is inconsistent, subjective, time-consuming and prone to error. This study attempts to classify children with intellectual disabilities using two speech feature extraction techniques: Linear Predictive Coding (LPC) based cepstral parameters, and Mel-frequency cepstral coefficients (MFCC). Four classification models are employed: k-nearest neighbour (k-NN), support vector machine (SVM), linear discriminant analysis (LDA) and radial basis function neural network (RBFNN). Forty-eight speech samples from each group are analyzed, taken from subjects of similar age and socio-economic background. The effects of different frame lengths combined with the number of filterbanks in MFCC, and of different frame lengths combined with the LPC order, are also examined for better accuracy. The experimental outcomes show that the proposed technique can help speech pathologists estimate intellectual disability at an early age.
6

Koppula, Neeraja, K. Sarada, Ibrahim Patel, R. Aamani, and K. Saikumar. "Identification and Recognition of Speaker Voice Using a Neural Network-Based Algorithm." In Handbook of Research on Innovations and Applications of AI, IoT, and Cognitive Technologies, 278–89. IGI Global, 2021. http://dx.doi.org/10.4018/978-1-7998-6870-5.ch019.

Abstract:
This chapter explains the speech signal in moving objects depending on the recognition field by retrieving the name of individual voice speech and speaker personality. The feasibility of accurately identifying a speaker rests exclusively on vocal features, as voice interaction with machines is becoming more pervasive in tasks such as telephony, banking transactions, and the transfer of information from speech databases. This review presents the identification of text-dependent speakers, which distinguishes a single speaker from a known population. Features are extracted from speech signals recorded for six speakers. Feature extraction is accomplished using LPC coefficients, AMDF computation, and DFT. The neural network is trained with these features as input. For later comparison, the features are stored as templates. The features needed to characterize the speakers were obtained and analyzed using the backpropagation algorithm against a template pattern.
7

Kong, Weiping, Yinli Bi, Wenjiang Huang, Lingli Tang, Chuanrong Li, and Lingling Ma. "Nondestructive Evaluation of Inoculation Effects of AMF and Bradyrhizobium japonicum on Soybean under Drought Stress From Reflectance Spectroscopy." In Soybean for Human Consumption and Animal Feed. IntechOpen, 2020. http://dx.doi.org/10.5772/intechopen.88673.

Abstract:
Precise estimation of the leaf chlorophyll content (LCC) and leaf water content (LWC) of soybean using remote sensing technology provides a new avenue for the nondestructive evaluation of the inoculation effects of arbuscular mycorrhizal fungi (AMF) and Bradyrhizobium japonicum (BJ) on soybean growth. In this study, a series of pot experiments was conducted in the greenhouse: soybean inoculated with Glomus intraradices (G.i, an AMF species), soybean co-inoculated with G.i and BJ, and non-inoculated soybean were grown under drought stress (DS) and normal irrigation (NI) conditions. Leaf spectra, LCC and LWC were measured on the 28th and 56th days after inoculation. Two new simple ratio (SR) indices, derived from the first derivative spectral reflectance at λ1 nm (Dλ1) and the raw spectral reflectance at λ2 nm (Rλ2), were developed to estimate LCC and LWC. The results indicate that under DS, plants inoculated with G.i had higher LCC and LWC than the non-inoculated plants, followed by the counterparts co-inoculated with G.i and BJ. Linear estimation models based on D650/Rred edge and D1680/R680 achieved greatly improved accuracy in quantifying the LCC and LWC of soybean under inoculation and drought stress treatments, with coefficients of determination of 0.63 and 0.76, respectively.
8

Kaur, Taranjit, and Balwinder Singh Dhaliwal. "Design of Linear Phase FIR Low Pass Filter Using Mutation-Based Particle Swarm Optimization Technique." In Applications of Artificial Intelligence in Electrical Engineering, 344–58. IGI Global, 2020. http://dx.doi.org/10.4018/978-1-7998-2718-4.ch017.

Abstract:
This chapter presents a mutation-based particle swarm optimization (PSO) approach for designing a linear phase digital low pass FIR filter (LPF). Since conventional gradient-based methods are susceptible to being trapped in local optima, stochastic search methods have proven to be effective in a multi-dimensional non-linear environment. In this chapter, an LPF with 20 coefficients has been designed. Since filter design is a multidimensional optimization problem, mutation helps to maintain diversity in the swarm population and thereby to control the local search and the convergence to the global optimum efficiently. Given the filter specifications to be realized, the Mutation PSO (MPSO) tries to meet the ideal frequency response characteristics by generating an optimal set of filter coefficients. The simulation results have been compared with basic PSO and with state-of-the-art works on filter design. The results show that the proposed technique outperforms them not only in convergence speed but also in the quality of the solution obtained.
9

Mahapatra, Nirmal Kumar, and Tuhin Bera. "Generalised Single-Valued Neutrosophic Number and Its Application to Neutrosophic Linear Programming." In Neutrosophic Sets in Decision Analysis and Operations Research, 180–214. IGI Global, 2020. http://dx.doi.org/10.4018/978-1-7998-2555-5.ch009.

Abstract:
In this chapter, the concept of single valued neutrosophic number (SVN-Number) is presented in a generalized way. Using this notion, a crisp linear programming problem (LP-problem) is extended to a neutrosophic linear programming problem (NLP-problem). The coefficients of the objective function of a crisp LP-problem are considered as generalized single valued neutrosophic number (GSVN-Number). This modified form of LP-problem is here called an NLP-problem. An algorithm is developed to solve NLP-problem by simplex method. Finally, this simplex algorithm is applied to a real-life problem. The problem is illustrated and solved numerically.
10

Srivastava, Prashant K., Swati Suman, and Smita Pandey. "Monitoring Changes in Urban Cover Using Landsat Satellite Images and Demographical Information." In Environmental Information Systems, 981–95. IGI Global, 2019. http://dx.doi.org/10.4018/978-1-5225-7033-2.ch043.

Abstract:
Monitoring urban cover is very important for planners, managers, and governmental and non-governmental organizations seeking to optimize the use of urban resources and minimize environmental losses. This study analyzes the changes that occurred in urban green cover over the period 1991-2001, using multi-date Landsat satellite image data over the Varanasi district, India, and relates them to demographic changes. The Support Vector Machines (SVMs) classifier has been used for image classification. Urbanization indicators such as the Land Consumption Ratio (LCR) and the Land Absorption Coefficient (LAC) were also used in order to understand the changes in urban cover and population dynamics. All the analyses indicate significant changes in urban cover with increasing population at both spatial and temporal scales.

Conference papers on the topic "LPC coefficients"

1

Grass, J., and P. Kabal. "Methods of improving vector-scalar quantization of LPC coefficients." In [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing. IEEE, 1991. http://dx.doi.org/10.1109/icassp.1991.150425.

2

Hyungseob Han, Sangjin Cho, and Uipil Chong. "Fault diagnosis system using LPC coefficients and neural network." In 2010 International Forum on Strategic Technology (IFOST). IEEE, 2010. http://dx.doi.org/10.1109/ifost.2010.5667999.

3

Feifei, Wang, and Xu Weizhang. "A comparison of algorithms for the calculation of LPC coefficients." In 2014 International Conference on Information Science, Electronics and Electrical Engineering (ISEEE). IEEE, 2014. http://dx.doi.org/10.1109/infoseee.2014.6948119.

4

Evans, Ward R. "A Comprative Study of the Karhunen-Loeve Transform Applied to Selected LPC Coefficients." In 1987 IEEE Military Communications Conference - Crisis Communications: The Promise and Reality. IEEE, 1987. http://dx.doi.org/10.1109/milcom.1987.4795263.

5

Struwe, Kevin. "Voiced-Unvoiced Classification of Speech Using a Neural Network Trained with LPC Coefficients." In 2017 International Conference on Control, Artificial Intelligence, Robotics & Optimization (ICCAIRO). IEEE, 2017. http://dx.doi.org/10.1109/iccairo.2017.20.

6

Ramachandran, Ravi P., M. M. Sondhi, N. Seshadri, and B. S. Atal. "Combined vector and scalar codebook for robust quantization of linear predictive coefficients (LPC) parameters." In SPIE's 1994 International Symposium on Optics, Imaging, and Instrumentation, edited by Richard J. Mammone and J. David Murley, Jr. SPIE, 1994. http://dx.doi.org/10.1117/12.191880.

7

Alshaer, Hisham, Martha Garcia, M. Hossein Radfar, Geoffrey R. Fernie, and T. Douglas Bradley. "Detection of upper airway narrowing via classification of LPC coefficients: Implications for obstructive sleep apnea diagnosis." In ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2011. http://dx.doi.org/10.1109/icassp.2011.5946495.

8

Guan, C., Y. Chen, and B. Wu. "Direct modulation on LPC coefficients with application to speech enhancement and improving the performance of speech recognition in noise." In Proceedings of ICASSP '93. IEEE, 1993. http://dx.doi.org/10.1109/icassp.1993.319242.

9

Leinhos, Dirk C., Norbert R. Schmid, and Leonhard Fottner. "The Influence of Transient Inlet Distortions on the Instability Inception of a Low Pressure Compressor in a Turbofan Engine." In ASME Turbo Expo 2000: Power for Land, Sea, and Air. American Society of Mechanical Engineers, 2000. http://dx.doi.org/10.1115/2000-gt-0505.

Abstract:
While past studies of compressor flow instabilities in the presence of inlet distortions have been carried out with steady distortions, the investigation presented here focuses on the influence of transient inlet distortions, as generated by the variable geometry engine intakes of super- and hypersonic aircraft, on the characteristic and the nature of the instability inception of an LPC. The flow patterns of the distortions (total pressure distortion with a superimposed co- or counter-rotating swirl) are adopted from a hypersonic concept aircraft. A LARZAC 04 twin-spool turbofan was operated with transient inlet distortions, generated by a moving delta-wing, and with steady total pressure distortions, starting close to the LPC's stability limit until it stalled. High frequency pressure signals were recorded at different engine power settings. Instabilities are investigated with regard to the inception process and the early detection of stall precursors, in order to provide data for a future stability control device. It turned out that the transient distortion does not influence the surge margin of the LPC compared to the steady distortion, but that it changes the nature of stall inception. The pressure traces are analysed in the time and frequency domains and also with tools such as Spatial FFT, Power Spectral Density and Travelling Wave Energy. A Wavelet Transformation algorithm is applied as well. With clean inlet flow the compressor exhibits different types of stall inception depending on the engine speed, whereas under steady or transient distortions stall is always initiated by spike-type disturbances. Modal disturbances are present in the mid speed range, but they do not grow into stall; rather, they interact with the inlet flow and produce short length-scale disturbances. The early warning times obtained prior to stall are adversely affected by transient distortions in some cases. The problem of appropriate thresholding becomes evident. The best warning times have been acquired using a statistical evaluation of the Wavelet coefficients, which might be promising to apply in a staged active control system that could include different phases of detection and actuation depending on the current precursor.
10

Kubota, Shohei, Ryoichiro Yoshida, and Yoshimitsu Kuroki. "Coefficient Constraint LIC with ADMM." In 2018 International Conference on Intelligent Informatics and Biomedical Sciences (ICIIBMS). IEEE, 2018. http://dx.doi.org/10.1109/iciibms.2018.8549950.
