To see the other types of publications on this topic, follow the link: Voice parameters.

Journal articles on the topic 'Voice parameters'

Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles

Select a source type:

Consult the top 50 journal articles for your research on the topic 'Voice parameters.'

Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.

You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.

Browse journal articles on a wide variety of disciplines and organise your bibliography correctly.

1

Mittal, Vikas, and R. K. Sharma. "Classification of Pathological Voices Using Glottal Signal Parameters." Journal of Computational and Theoretical Nanoscience 16, no. 9 (2019): 3999–4002. http://dx.doi.org/10.1166/jctn.2019.8284.

Full text
Abstract:
The discrimination of voice signals has numerous applications in diagnosing of pathologies related to voice. This paper discussed about the glottal signal that is bound to recognize two sorts of voice issue: Laryngitis and Laryngeal dystonia (LD). The parameters of the glottal signal fill in as contribution to classifiers that characterizes into three unique gatherings of speakers: speakers with Laryngitis; with laryngeal dystonia (LD); lastly speakers with healthy voices. The database is made out of voice accounts containing tests of three gatherings. The classifiers SVM provided 60%, KNN provided 70% and Ensemble provided 80% classification accuracy in the case of Laryngitis. Voice signals of patients affected with Laryngeal dystonia were also collected and tested with same classifiers and the Accuracy of 90%, 80% and 50% were obtained with SVM, KNN and Ensemble respectively.
APA, Harvard, Vancouver, ISO, and other styles
2

Krishna, Murari Bansal, and Raj Singh Prithvi. "Development of a Tool to Objectively Identify the Normal Human Voice." International Journal of Toxicological and Pharmacological Research 12, no. 5 (2022): 189–96. https://doi.org/10.5281/zenodo.11902797.

Full text
Abstract:
<strong>Background: &nbsp;</strong>Acoustic analysis is used to assist in differential diagnosis, documentation, and evaluation of treatment for voice disorders. Clinical data has shown that Jitter, Shimmer, Mean Pitch, and Harmonic Noise Ratio are the indices of voice pathology. A voice with some periodicity can now be analyzed with a computerized acoustic analyzer, a relatively newer technique that can be widely used in clinical practice. Voice is an acoustic output of the vibrations of the vocal folds and is the basic source of speech. In contrast, speech is a meaningful acoustic output created by the modulation of voice by organs of articulation into basic building blocks, the &lsquo;phonemes&rsquo;. Phonemes help in distinguishing one word from another in a particular language. Some sounds like clicks, whistling, and whispering can be produced by organs of articulations without voice. Because of anatomical, physiological, racial, cultural, and social factors, every human voice is unique and fingerprints is the signature of each individual.&nbsp;<strong>Aim:</strong>&nbsp;To create a database of normal voices, analyze and identify different parameters of these voices and hence identify benchmarks of normal voices.&nbsp;<strong>Material and Method:</strong>&nbsp;The study were conducted in the department of Otolaryngology. Voice samples of 250 normal males and 250 normal females aged between 19 to 30 years were collected using a sustained vowel /a/ which was recorded and analyzed using a freely downloadable software &ldquo;PRAAT&rdquo;. The parameters like Jitter, Shimmer, and Pitch were derived, and mean, SD, and range of voice parameters were calculated. The Microphone was held at a distance of 5cm in front of the lips and 3 cm above the breath stream. Each person was first trained to produce sustained vowel /a/ by the examiner herself through utterance of the voice at comfortable loudness and pitch.&nbsp;<strong>Results:&nbsp;</strong>In males, the value of parameters was mean pitch (124.05), jitter (0.011), and shimmer (0.04). In females the parameters were mean pitch (212.27), jitter (0.01), shimmer (0.06). The Pitch in females is consistently higher than in males. Jitter Range is almost the same in both males and females. There is not much difference in Shimmer between Males and Females.&nbsp;<strong>Conclusion:&nbsp;</strong>Voice can be objectively analyzed using Acoustic Parameters like mean Pitch, Jitter, and Shimmer. Most of the studies show that normal voice parameters depend on gender, region, methodology of the voice collection, software and hardware used, different algorithms used for calculations and the setup, etc. Hence every institution should standardize the method of acoustic analysis for its own consumption. For the purpose, we have created a huge database of the voices of normal young adults. Voices can be objectively analyzed using acoustic parameters like mean pitch, jitter, shimmer, and harmonic-to-noise ratio. A large database yields more reliable normative parameters. Institutions should develop their own standard protocol for the selection of subjects, recording of voices, and their analysis.
APA, Harvard, Vancouver, ISO, and other styles
3

Choi, Hee-Jin, and Ji-Yeoun Lee. "Comparative Study between Healthy Young and Elderly Subjects: Higher-Order Statistical Parameters as Indices of Vocal Aging and Sex." Applied Sciences 11, no. 15 (2021): 6966. http://dx.doi.org/10.3390/app11156966.

Full text
Abstract:
The objective of this study was to test higher-order statistical (HOS) parameters for the classification of young and elderly voice signals and identify gender- and age-related differences through HOS analysis. This study was based on data from 116 subjects (58 females and 58 males) extracted from the Saarbruecken voice database. In the gender analysis, the same number of voice samples were analyzed for each sex. Further, we conducted experiments on the voices of elderly people using gender analysis. Finally, we reviewed the standards and reference models to reduce sex and gender bias. The acoustic parameters were extracted from young and elderly voice signals using Praat and a time–frequency analysis program (TF32). Additionally, we investigated the gender- and age-related differences in HOS parameters. Young and elderly voice signals significantly differed in normalized skewness (p = 0.005) in women and normalized kurtosis (p = 0.011) in men. Therefore, normalized skewness is a useful parameter for distinguishing between young and elderly female voices, and normalized kurtosis is essential for distinguishing between young and elderly male voices. We will continue to investigate parameters that represent important information in elderly voice signals.
APA, Harvard, Vancouver, ISO, and other styles
4

Mittal, Vikas, and R. K. Sharma. "Vocal Folds Analysis for Detection and Classification of Voice Disorder." International Journal of E-Health and Medical Communications 12, no. 4 (2021): 97–119. http://dx.doi.org/10.4018/ijehmc.20210701.oa6.

Full text
Abstract:
The detection and description of pathological voice are the most important applications of voice profiling. Currently, techniques like laryngostroboscopy or surgical microlarynoscopy are popularly used for the diagnosis of voice pathologies but are invasive in nature. Disorders of vocal folds impact the quality of voice, and therefore, the accuracy of voice profiling is reduced. This paper presents a better solution to differentiate normal and pathological voices based on the glottal, physical, and acoustic and equivalent electrical parameters. These parameters have been correlated using mathematical equations and models. Results reveal that the glottal flow is strongly influenced by physical parameters like stiffness and viscosity of vocal folds in case of pathological voice. However, their direct measurement requires complex invasive medical procedures or costly and complex electronic hardware arrangements in case of non-invasive methods. Glottal parameters, on the other hand, facilitate much simpler estimation of vocal folds disorders. In this work, the authors have presented two non-invasive approaches for better accuracy and least complexity for differentiating normal and pathological voices: 1) by using correlation of glottal and physical parameters, 2)by using acoustic and equivalent electrical parameters.
APA, Harvard, Vancouver, ISO, and other styles
5

Cistola, Giorgia, Alex Peiró-Lilja, Guillermo Cámbara, Ineke van der Meulen, and Mireia Farrús. "Influence of TTS Systems Performance on Reaction Times in People with Aphasia." Applied Sciences 11, no. 23 (2021): 11320. http://dx.doi.org/10.3390/app112311320.

Full text
Abstract:
Text-to-speech (TTS) systems provide fundamental reading support for people with aphasia and reading difficulties. However, artificial voices are more difficult to process than natural voices. The current study is an extended analysis of the results of a clinical experiment investigating which, among three artificial voices and a digitised human voice, is more suitable for people with aphasia and reading impairments. Such results show that the voice synthesised with Ogmios TTS, a concatenative speech synthesis system, caused significantly slower reaction times than the other three voices used in the experiment. The present study explores whether and what voice quality metrics are linked to delayed reaction times. For this purpose, the voices were analysed using an automatic assessment of intelligibility, naturalness, and jitter and shimmer voice quality parameters. This analysis revealed that Ogmios TTS, in general, performed worse than the other voices in all parameters. These observations could explain the significantly delayed reaction times in people with aphasia and reading impairments when listening to Ogmios TTS and could open up consideration about which TTS to choose for compensative devices for these patients based on the voice analysis of these parameters.
APA, Harvard, Vancouver, ISO, and other styles
6

Ramani, Sai Aishwarya, Eric J. Hunter, and Lady Catherine Cantor Cutiva. "Acoustic speech parameter relationships with voice disorders and phrase differences." Journal of the Acoustical Society of America 153, no. 3_supplement (2023): A295. http://dx.doi.org/10.1121/10.0018908.

Full text
Abstract:
While acoustic speech analysis is non-invasive, the utility has been mixed due to the range of voice types. For vocal health practitioners to efficiently and quickly assess and document voice changes, knowing which voice parameter would be sensitive to vocal change is crucial. Using a database of 296 individual voices including 8 voice pathology types and typical voice samples, the sensitivity of a range of acoustic speech parameters to differentiate common voice pathology types was investigated. Both traditional and contemporary acoustic speech metrics were estimated for the samples using a custom MATLAB script and PRAAT (e.g., jitter, shimmer HNR, CPPS, Alpha ratio, PPE). Analysis then evaluate the predictability value of the metrics to discriminate pathology type. From the pool of parameters, 11 were able to identify pathological voices from normal controls and several of the parameters were more sensitive to some pathology. For example, CPPs and jitter values could discriminate neuropathological voices whereas HNR and Shimmer cold discriminate muscle-based pathologies. These results indicate how the sensitivity of acoustic speech metrics to the voice pathology types can allow for the identification of individual metrics (or combinations of metrics) which could be used to track changes in vocal health.
APA, Harvard, Vancouver, ISO, and other styles
7

Mittal, Vikas, and R. K. Sharma. "Electrical Modeling of Two Tube Vocal Tract for Voice Pathology Detection." Sensor Letters 17, no. 12 (2019): 943–46. http://dx.doi.org/10.1166/sl.2019.4168.

Full text
Abstract:
Voice pathology is interrelated to a vocal folds complication, and the vocal tract area which is attached to vocal folds shows a variable shapes in case of voice pathology. This variable shapes are tested to make the division of healthy and pathological voices. So, Length and area of vocal tract are used for finding electrical parameters using transmission line theory by considering the vocal tract as a sequence of tubes. The proposed work is used voice database from the Saarbrucken Voice Database (SVD) and real-time data from the Maharishi Markandeshwar (M.M) hospital, Mullana. The proposed method achieves a higher accuracy of 91.7 percent with electrical parameters compared to 88.2 percent with the acoustic parameters and 85.3 percent with physical parameters in the detection of voice pathology.
APA, Harvard, Vancouver, ISO, and other styles
8

Peterson, K. Linnea, Katherine Verdolini-Marston, Julie M. Barkmeier, and Henry T. Hoffman. "Comparison of Aerodynamic and Electroglottographic Parameters in Evaluating Clinically Relevant Voicing Patterns." Annals of Otology, Rhinology & Laryngology 103, no. 5 (1994): 335–46. http://dx.doi.org/10.1177/000348949410300501.

Full text
Abstract:
The purpose of the present study was to identify one or more aerodynamic or electroglottographic measures that distinguish among voicing patterns that are clinically relevant for nodule pathogenesis and regression: a presumably pathogenic pattern (pressed voice), a neutral pattern (normal voice), and two presumably therapeutic patterns (resonant voice and breathy voice). Trained subjects with normal voices produced several tokens of each voice type on sustained vowels /a/, /i/, and /u/. For each token, maximum flow declination rate, alternating current flow, and minimum flow were obtained from inverse-filtered airflow signals, and closed quotient and closing time were obtained from electroglottographic signals. The results indicate that for /a/ and /i/ (but not for /u/), the closed quotient provides a sensitive tool for distinguishing the voice types in physiologically interpretable directions. Further, post-hoc analyses confirmed a direct relationship between the closed quotient and videoscopic ratings of laryngeal adduction, which previous work links to nodule pathogenesis and regression.
APA, Harvard, Vancouver, ISO, and other styles
9

Šiupšinskienė, Nora, Kęstutis Adamonis, and Robert Toohill. "Usefulness of assessment of voice capabilities in female patients with reflux-related dysphonia." Medicina 45, no. 12 (2009): 978. http://dx.doi.org/10.3390/medicina45120125.

Full text
Abstract:
Objectives. To analyze vocal capabilities in patients diagnosed with reflux related dysphonia versus controls with healthy voice with selection of the most informative discriminating quantitative parameters and to assess voice changes following treatment. Material and methods. Six parameters of voice range profile (VRP) and five parameters of speech range profile were taken and analyzed from 60 dysphonic outpatient females with laryngopharyngeal reflux (LPR) diagnosed by reflux-related atypical and typical symptoms, videolaryngoscopic findings, upper gastrointestinal endoscopy, and positive response to empiric 3-month omeprazole treatment. Seventy-six females with healthy voice served as controls. Results. All six parameters of voice range profile and three of 5 parameters of speech range profile showed significant differences comparing LPR patients with controls before omeprazole treatment (P&lt;0.05). Logistic regression analysis revealed VRP maximum-minimum intensity range to be the most informative parameter for discrimination between reflux-related dysphonic and healthy voices (overall prediction accuracy, 86.8%). A threshold value of significant parameter was stated using the receiver operating characteristic curve. Treatment with omeprazole significantly improved voice quality showing the greatest changes in the mean scores of majority of voice range profile parameters. Conclusions. Vocal capabilities, especially evaluated by voice range profile, are restricted in LPR female patients in comparison to subjects with healthy voice. Quantitative voice assessment with voice range profile may add more objective aspect for screening dysphonia and could be used as a criterion of evaluation of treatment efficacy in such patients.
APA, Harvard, Vancouver, ISO, and other styles
10

Schlegel, Patrick, Kirsten Wong, Mamdouh Aker, Yazeed Alhiyari, and Jennifer Long. "Objective Assessment of Porcine Voice Acoustics for Laryngeal Surgical Modeling." Applied Sciences 11, no. 10 (2021): 4489. http://dx.doi.org/10.3390/app11104489.

Full text
Abstract:
Pigs have become important animal models in voice research. Several objective parameters exist to characterize the pig voice, but it is not clear which of them are sensitive to the impaired voice quality after laryngeal injury or surgery. In order to conduct meaningful voice research in pigs, it is critical to have standard functional voice outcome measures that can distinguish between normal and impaired voices. For this reason, we investigated 17 acoustic parameters before and early after surgery in three Yucatan mini pigs. Four parameters showed consistent changes between pre- and post-surgery recordings, mostly related to decreased spectral energy in higher frequencies after surgery. We recommend two of these, 50% spectral energy quartile (Q50) and Flux, for objective functional voice assessment of pigs undergoing laryngeal surgery. The long-term goal of this process is to enable quantitative voice outcome tracking of laryngeal surgical interventions in porcine models.
APA, Harvard, Vancouver, ISO, and other styles
11

Kreiman, Jody, Bruce R. Gerratt, Kristin Precoda, and Gerald S. Berke. "Individual Differences in Voice Quality Perception." Journal of Speech, Language, and Hearing Research 35, no. 3 (1992): 512–20. http://dx.doi.org/10.1044/jshr.3503.512.

Full text
Abstract:
Sixteen listeners (10 expert, 6 naive) judged the dissimilarity of pairs of voices drawn from pathological and normal populations. Separate nonmetric multidimensional scaling solutions were calculated for each listener and voice set. The correlations between individual listeners’ dissimilarity ratings were low However, scaling solutions indicated that each subject judged the voices in a reliable, meaningful way. Listeners differed more from one another in their judgments of the pathological voices (which varied widely on a number of acoustic parameters) than they did for the normal voices (which formed a much more homogeneous set acoustically). The acoustic features listeners used to judge dissimilarity were predictable from the characteristics of the stimulus sets’ only parameters that showed substantial variability were perceptually salient across listeners. These results are consistent with prototype models of voice perception They suggest that traditional means of assessing listener reliability n voice perception tasks may not be appropriate, and highlight the importance of using explicit comparisons between stimuli when studying voice quality perception
APA, Harvard, Vancouver, ISO, and other styles
12

Lee, Ji-Yeoun, Ji-Hye Park, Ji-Na Lee, and Ah-Ra Jung. "Investigation of the Clinical Effectiveness and Prognostic Factors of Voice Therapy in Voice Disorders: A Pilot Study." Applied Sciences 13, no. 20 (2023): 11523. http://dx.doi.org/10.3390/app132011523.

Full text
Abstract:
Examining the relationship between the prognostic factors and the effectiveness of voice therapy is a crucial step in developing personalized treatment strategies for individuals with voice disorders. This study recommends using the multilayer perceptron model (MLP) to comprehensively analyze the prognostic factors, with various parameters, including personal habits and acoustic parameters, that can influence the effectiveness of before-and-after voice therapy in individuals with speech disorders. Various methods, including the assessment of personal characteristics, acoustic analysis, statistical analysis, binomial logistic regression analysis, and MLP, are implemented in this experiment. Accuracies of 87.5% and 85.71% are shown for the combination of optimal input parameters for female and male voices, respectively, through the MLP model. This fact validates the selection of input parameters when building our model. Good prognostic indicators for the clinical effectiveness of voice therapy in voice disorders are jitter (post-treatment) for females and MPT (pre-treatment) for males. The results are expected to provide a foundation for modeling research utilizing artificial intelligence in voice therapy for voice disorders. In terms of follow-up studies, it will be necessary to conduct research that utilizes big data to analyze the optimal parameters for predicting the clinical effectiveness of voice disorders.
APA, Harvard, Vancouver, ISO, and other styles
13

Wang, Xiaochen, and Tao Wang. "Voice Recognition and Evaluation of Vocal Music Based on Neural Network." Computational Intelligence and Neuroscience 2022 (May 20, 2022): 1–9. http://dx.doi.org/10.1155/2022/3466987.

Full text
Abstract:
Artistic voice is the artistic life of professional voice users. In the process of selecting and cultivating artistic performing talents, the evaluation of voice even occupies a very important position. Therefore, an appropriate evaluation of the artistic voice is crucial. With the development of art education, how to scientifically evaluate artistic voice training methods and fairly select artistic voice talents is an urgent need for objective evaluation of artistic voice. The current evaluation methods for artistic voices are time-consuming, laborious, and highly subjective. In the objective evaluation of artistic voice, the selection of evaluation acoustic parameters is very important. Attempt to extract the average energy, average frequency error, and average range error of singing voice by using speech analysis technology as the objective evaluation acoustic parameters, use neural network method to objectively evaluate the singing quality of artistic voice, and compare with the subjective evaluation of senior professional teachers. In this paper, voice analysis technology is used to extract the first formant, third formant, fundamental frequency, sound range, fundamental frequency perturbation, first formant perturbation, third formant perturbation, and average energy of singing acoustic parameters. By using BP neural network methods, the quality of singing was evaluated objectively and compared with the subjective evaluation of senior vocal professional teachers. The results show that the BP neural network method can accurately and objectively evaluate the quality of singing voice by using the evaluation parameters, which is helpful in scientifically guiding the selection and training of artistic voice talents.
APA, Harvard, Vancouver, ISO, and other styles
14

DESCHLER, DANIEL G., E. THOMAS DOHERTY, CHARLES G. REED, and MARK I. SINGER. "Quantitative and Qualitative Analysis of Tracheoesophageal Voice after Pectoralis Major Flap Reconstruction of the Neopharynx." Otolaryngology–Head and Neck Surgery 118, no. 6 (1998): 771–76. http://dx.doi.org/10.1016/s0194-5998(98)70267-0.

Full text
Abstract:
Although tracheoesophageal voice restoration is accepted after reconstruction of the neopharynx with the pectoralis major myocutaneous flap, the character of such voice is not well described. Six patients reconstructed with the pectoralis major flap after laryngopharyngectomy underwent successful voice restoration with the Blom-Singer prosthesis. Voice was evaluated by a standardized protocol and compared with voices of control subjects treated with total laryngectomy and similar voice restoration. The patients with pectoralis major flaps produced similar intensity levels for soft voice (53.7 dB vs. 55.6 dB) and loud voice (61.3 dB vs. 65.3 dB) when compared with controls ( p &gt; 0.05). No significant differences ( p &gt; 0.05) were noted for fundamental frequency (F0) between patients with pectoralis major flaps and controls for soft (62.3 Hz vs. 85.4 Hz) and loud (109.8 Hz vs. 133.8 Hz) voice. Jitter was also comparable. Trained and naive listeners completed qualitative analyses for 10 parameters and judged that control patients had significantly better voice for most parameters. This finding demonstrates that dependable voice is attainable after pectoralis major flap reconstruction of the neopharynx. Although this voice does not differ significantly from voice after standard laryngectomy for acoustic parameters, perceptual analysis does reveal significant differences. (Otolaryngol Head Neck Surg 1998;118:771-6.)
APA, Harvard, Vancouver, ISO, and other styles
15

Lee, Megan M., and Brett R. Myers. "Associations of acoustic parameters and gender dysphoria in transmasculine individuals." Journal of the Acoustical Society of America 153, no. 3_supplement (2023): A158. http://dx.doi.org/10.1121/10.0018497.

Full text
Abstract:
Gender dysphoria refers to a feeling of discomfort due to a mismatch between a person’s biological sex and gender identity. Voice is a secondary sex marker, and it may contribute to feelings of unease or dissatisfaction if the voice does not match a person’s gender identity. The Utrecht Gender Dysphoria Scale—Gender Spectrum (UGDS-GS) was designed to measure a person's degree of gender dysphoria. Transmasculine individuals were assigned female at birth and identify as male or masculine non-binary, and acoustic studies focusing on transmasculine voices are notably sparse. In a study including 20 transmasculine individuals, we compared UGDS-GS results to the following acoustic parameters: vocal tract length (VTL) estimates, fundamental frequency (f0), standard deviation of f0, and cepstral peak prominence smoothed (CPPS). We found a significant positive correlation between each acoustic measure and gender dysphoria. We also found robust associations between each of these measures and a voice-related quality of life measure. These findings indicate that the sound of one’s voice may be a strong predictor of gender dysphoria.
APA, Harvard, Vancouver, ISO, and other styles
16

Chou, Adriana, Colin Schrof, Evan Polce, Maia Braden, James McMurray, and Jack Jiang. "Comparing the Nonlinear Dynamic Acoustic Parameters of Healthy Adult and Pediatric Voices." Annals of Otology, Rhinology & Laryngology 127, no. 12 (2018): 937–45. http://dx.doi.org/10.1177/0003489418803394.

Full text
Abstract:
Objectives: The aims of this study were to compare nondysphonic adult and pediatric voices using linear and nonlinear acoustic parameters and to evaluate the ability of adult spectrum convergence ratio (SCR) and rate of divergence (ROD) reference values to correctly identify a pediatric voice type as periodic or aperiodic. Methods: Twenty adult and 36 pediatric nondysphonic voice samples were collected and analyzed using linear and nonlinear acoustic parameters. Absence of voice disorder was confirmed using perceptual acoustic and spectral analysis. Mean values for jitter, shimmer, SCR, and ROD were compared between adults and children, across specific age groups, and within genders. Using adult reference values for SCR and ROD, samples were classified as primarily periodic or aperiodic and typed using spectral analysis. Rates of accurate typing were also compared between subject groups. Results: Overall, jitter and shimmer were similar among the adult and pediatric age groups. ROD was significantly different among the 3 pediatric and 1 adult group; the pediatric age groups were similar to one another. Adult SCR was also significantly different from all of the pediatric age groups. In adult men, ROD and SCR were significantly different from all of the pediatric age groups; the pediatric age groups were similar to one another. In female subjects, ROD was significantly different among all age groups. The ROD and SCR reference values were significantly better at categorizing adult voice types compared with pediatric voice types. Conclusions: In healthy subjects, SCR and ROD have discriminatory power for identifying adult versus pediatric voices, while jitter and shimmer cannot differentiate between the 2 groups. However, age- and gender-specific pediatric reference values must be determined to accurately classify voice types using SCR and ROD.
APA, Harvard, Vancouver, ISO, and other styles
17

Silva, Wellington da, and Ana Carolina Constantini. "Speech task affects the objective evaluation of dysphonic voices." Journal of Speech Sciences 7, no. 1 (2018): 1–15. http://dx.doi.org/10.20396/joss.v7i1.14988.

Full text
Abstract:
The acoustic analysis of speech has proved useful in the clinical evaluation of dysphonia, for it allows an objective assessment of the voice. However, the literature has suggested that the type of speech task used to obtain voice samples from patients (sustained vowel or connected speech) may affect both the perceptual and the acoustic evaluation of dysphonic voices. This study aimed at investigating whether the type of speech task significantly influences the acoustic analysis of dysphonic voices. Five acoustic parameters related to voice quality (cepstral peak prominence, difference between the magnitudes of the first and second harmonics, harmonics-to-noise ratio, jitter and shimmer) were automatically computed from voice samples of 5 female and 5 male subjects with and without dysphonia. These recordings consisted of three types of speech task: connected speech, count and sustained vowel. Analyses of variance with repeated measures showed that all five acoustic parameters were significantly affected by speech task. Further analyses through the Duncan’s multiple-range test indicated that the type of speech task may also influence the discrimination of dysphonic voices. It is concluded that speech task affects the acoustic assessment of dysphonic voices by significantly raising or reducing the values of the acoustic parameters.
APA, Harvard, Vancouver, ISO, and other styles
18

Mr., G.B. Gour, and V. Udayashankara Dr. "Voice Activity Detection and Pitch analysis in Pathological Voices." International Journal of Trend in Scientific Research and Development 1, no. 5 (2017): 423–28. https://doi.org/10.31142/ijtsrd2324.

Full text
Abstract:
This paper presents the voice activity detection VAD using basic parameters. The voice is found to have hoarseness in case of thyroid and laryngeal cancer patients. It is useful for the better analysis of pathological voices in presence of background noise. Automatic voice pathology detection and classification is important in the voice disorder assessment. Here, two pitch detection algorithms are used for the analysis. This enhanced voice enables the early detection and diagnosis of type of pathology related to the patient. Mr. G.B. Gour | Dr. V. Udayashankara &quot;Voice Activity Detection and Pitch analysis in Pathological Voices&quot; Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-1 | Issue-5 , August 2017, URL: https://www.ijtsrd.com/papers/ijtsrd2324.pdf
APA, Harvard, Vancouver, ISO, and other styles
19

Hina Zameer, Sidra Abid Syed, Marium Raziq, Muhammad Muzammil Khan, Sania Tanvir, and Shahzad Nasim. "Automated Acoustic Evaluation of Voice Disorders: A Comprehensive Study on Parameter Analysis Using ANN." Asian Bulletin of Big Data Management 3, no. 2 (2024): 94–102. http://dx.doi.org/10.62019/abbdm.v3i2.93.

Full text
Abstract:
Voice analysis serves as a crucial tool in diagnosing voice abnormalities, offering a non-invasive alternative to intrusive procedures. This research digs into a comprehensive study of voice disorder assessment methodologies, focusing on acoustic analysis and classification. The study employs various parameters such as Jitter, Shimmer, and Harmonic-to-Noise Ratio (HNR) alongside an Artificial Neural Network (ANN) classifier. Utilizing the Saarbruecken Voice Database the research aims to distinguish between healthy and dysphonic voices across genders. Principal Component Analysis (PCA) aids in feature selection, enhancing model accuracy. The results exhibit distinct precision levels in male and female groups, showcasing the effectiveness of specific parameters in classification.
APA, Harvard, Vancouver, ISO, and other styles
20

Rinta, Tiija. "Children's speaking and singing voices are one voice: evidence from perceptual analyses of independent voice parameters." Journal of Music, Technology and Education 2, no. 2 (2009): 127–40. http://dx.doi.org/10.1386/jmte.2.2-3.127_1.

Full text
APA, Harvard, Vancouver, ISO, and other styles
21

Pevtsova, V. V., V. V. Shilenkova, and O. V. Fedoseeva. "OBJECTIVE PARAMETERS OF VOICE STRENGTH." Russian Otorhinolaryngology 81, no. 2 (2016): 86–91. http://dx.doi.org/10.18692/1810-4800-2016-2-86-91.

Full text
APA, Harvard, Vancouver, ISO, and other styles
22

Morgon, A. "Anatomical Parameters of the Voice." Acta Oto-Laryngologica 105, no. 5-6 (1988): 420–24. http://dx.doi.org/10.3109/00016488809119495.

Full text
APA, Harvard, Vancouver, ISO, and other styles
23

Dzulfikar, Helmy, Sisdarmanto Adinandra, and Erika Ramadhani. "The Comparison of Audio Analysis Using Audio Forensic Technique and Mel Frequency Cepstral Coefficient Method (MFCC) as the Requirement of Digital Evidence." Jurnal Online Informatika 6, no. 2 (2021): 145. http://dx.doi.org/10.15575/join.v6i2.702.

Full text
Abstract:
Audio forensics is the application of science and scientific methods in handling digital evidence in the form of audio. In this regard, the audio supports the disclosure of various criminal cases and reveals the necessary information needed in the trial process. So far, research related to audio forensics is more on human voices that are recorded directly, either by using a voice recorder or voice recordings on smartphones, which are available on Google Play services or iOS Store. This study compares the analysis of live voices (human voices) with artificial voices on Google Voice and other artificial voices. This study implements the audio forensic analysis, which involves pitch, formant, and spectrogram as the parameters. Besides, it also analyses the data by using feature extraction using the Mel Frequency Cepstral Coefficient (MFCC) method, the Dynamic Time Warping (DTW) method, and applying the K-Nearest Neighbor (KNN) algorithm. The previously made live voice recording and artificial voice are then cut into words. Then, it tests the chunk from the voice recording. The testing of audio forensic techniques with the Praat application obtained similar words between live and artificial voices and provided 40,74% accuracy of information. While the testing by using the MFCC, DTW, KNN methods with the built systems by using Matlab, obtained similar word information between live voice and artificial voice with an accuracy of 33.33%.
APA, Harvard, Vancouver, ISO, and other styles
24

McFarlane, Stephen C., Teri L. Holt-Romeo, Alfred S. Lavorato, and Lyle Warner. "Unilateral Vocal Fold Paralysis." American Journal of Speech-Language Pathology 1, no. 1 (1991): 45–48. http://dx.doi.org/10.1044/1058-0360.0101.45.

Full text
Abstract:
Voice samples of 16 adult patients with vocal fold paralysis and 6 normal adults were recorded and evaluated by 27 listeners (9 speech-language pathologists, 9 otolaryngologists, and 9 lay listeners). The listeners rated the voices of the speakers on a 10-point scale on six vocal parameters (pitch, loudness, hoarseness, vocal roughness, breathiness, and overall quality). The 16 patients represented three different treatment approaches (teflon injection, 4; voice therapy, 6; and muscle-nerve reinnervation surgery, 6). Voice therapy and muscle nerve reinnervation surgery were both rated more successful than teflon injection in terms of improvement from pretreatment to posttreatment voices for all six vocal parameters. Normal speakers’ voices were rated higher than the voices of any treatment group. Speech-language pathologists were more reliable listeners than the other two listener groups. A case is made for using a period of trial voice therapy while waiting for possible spontaneous nerve healing in unilateral vocal fold paralysis patients who do not have aspiration problems and who have a competent cough. The majority of patients studied presented with unilateral recurrent laryngeal nerve paralysis, and none recovered vocal fold function during the course of the study.
APA, Harvard, Vancouver, ISO, and other styles
25

Wogalter, Michael S., and Rana S. Barzegar. "Intended Carefulness Ratings for Voiced Warning Statements." Proceedings of the Human Factors and Ergonomics Society Annual Meeting 44, no. 22 (2000): 686–89. http://dx.doi.org/10.1177/154193120004402250.

Full text
Abstract:
The parameters that affect the efficacy of voiced warnings have only recently begun to be investigated. The current study examines the effects on intended carefulness ratings for voiced warning statements considering the following factors: sex of speaker, sex of participant, voice style (monotone, emotional, whisper), and sound level (low, high). Warning statements voiced in the emotional voice style and in a higher sound level produced the higher intended carefulness ratings. Implications for the design of voiced warnings are discussed.
APA, Harvard, Vancouver, ISO, and other styles
26

Kosztyła-Hojna, Bożena, Anna Łobaczuk-Sitnik, Maciej Zdrojkowski, et al. "Analysis of voice quality parameters in patients with vestibular voice." Otolaryngologia Polska 73, no. 3 (2019): 11–15. http://dx.doi.org/10.5604/01.3001.0012.8373.

Full text
Abstract:
Abstract Vestibular voice includes participation of larynx structures which are absent in physiological process. Vestibular phonation may be desired when vocal folds are damaged as in paralytic dysphonia, or undesired in marginal hyperfunction. Vestibular voice may result from psychogenic dysphonia – phononeurosis. The aim of the study is perceptive evaluation of vestibular voice, objective larynx visualization, acoustic and aerodynamic examination. The study included 40 patients: 20 with vestibular voice, 20 with euphonic voice. Voice quality has been evaluated using perceptual GRBAS scale. Endoscopic and stroboscopic larynx examination used Endo-STROB-EL-Xion GmbH with visual tract. High-Speed Digital Imaging (HSDI) and High Speed (HS) camera registered true vocal folds vibrations. Acoustic evaluation of voice with DiagnoScope Specjalista, DiagNova Technologies included analysis of F0, Jitter, Shimmer, NHR, nonharmonic components. MPT has been analyzed. In examined group, hoarseness (95%), roughness (75%) and voice strain (55%) have been recorded. Endoscopy revealed edema of vestibular folds with dilation of vessels covering glottis. Stroboscopy and HSDI confirmed coexistence of hyperfunctional (95%) or paralytic (5%) dysphonia. Acoustic assessment revealed increase in Jitter, Shimmer, NHR and decrease in F0 and MPT. The vestibular voice is observed most frequently in women with hyperfunctional dysphonia (phononeuroses) or in paralytic dysphonia. Visualization techniques confirm the coexistence of vestibular folds hypertrophy and edema with vibration disorders. In the perceptual assessment, vestibular voice was hoarse, rough and strained. Acoustic examination showed increase of Jitter, Shimmer, NHR, presence of nonharmonic components and decrease of F0 and MPT.
APA, Harvard, Vancouver, ISO, and other styles
27

Ilić-Savić, Ivana, and Mirjana Petrović-Lazić. "Comparison of voice acoustic parameters in persons with organic voice disorders." Acta Facultatis Medicae Naissensis 41, no. 3 (2024): 376–83. http://dx.doi.org/10.5937/afmnai41-48292.

Full text
Abstract:
Introduction/Aim. Voice is an acoustic phenomenon that must be viewed multidimensionally. In the quantitative assessment of voice characteristics, the use of acoustic parameters becomes a necessary method of analysis. The aim of the paper is to examine whether there are differences in the acoustic parameters of the voice (the average value of the fundamental frequency - Fo, variations in the height of the fundamental frequency - Jita, and variations in the intensity of the fundamental tone of the larynx - ShdB in people with organic voice disorders with reference to gender and age). Methods. The research included 30 subjects with organic voice disorders (inflammatory processes in the larynx and various allergic diseases), of both sexes (female n = 13, male n = 17), aged from 13 to 85 years. The first age group consisted of respondents aged 13 - 40 years, and the second age group consisted of respondents aged 41 - 85 years. The acoustic parameters of the voice were processed with a computer program for multidimensional vocal analysis model 4300 of the "Kay Elemetrics" voice laboratory. Results. The results of the analyzed parameters (Fo, Jita, ShdB) show that the mentioned parameters deviate from the reference values in women. Analyzing the acoustic parameters of the voice in older and younger subjects with organic voice disorders, it can be seen that there are significant differences between the two groups of subjects in the acoustic parameters of the voice in older subjects. Conclusion. Considering the importance of voice and speech for social development, these research findings imply the need to describe the quantitative and microstructural characteristics of the voice in individuals with organic voice disorders, thereby providing vocal pathologists with a better understanding of the relationship between the investigated independent factors of sex, age, smoking status, and voice, improving their ability in the process of diagnosis, therapy and rehabilitation of organic voice disorders.
APA, Harvard, Vancouver, ISO, and other styles
28

Owczarek, Kalina, Piotr Niewiadomski, and Jurek Olszewski. "Acoustic and function analysis of the voice in patients with functional and organic dysphonia by using the DiagnoScope “Specjalista” computer program." Otolaryngologia Polska 73, no. 2 (2019): 1–5. http://dx.doi.org/10.5604/01.3001.0013.1535.

Full text
Abstract:
Introduction: The aim of the dissertation was to evaluate the parameters of acoustic and function analysis of the voice in patients with functional and organic dysphonia by using the DiagnoScope “Specjalista” computer program. Material and methods: The study was performed in 131 people aged 21-82 years (the mean age was 48,34 years), including 75 women, at the age of 21-75 years and 56 men at the age of 22-82 years, all treated in the Clinic of Otolaryngology, Laryngological Oncology, Audiology and Phoniatrics of the Military Medical Academy Memorial University Hospital – Central Veterans' Hospital. The study participants were divided into 3 groups: I – 45 patients aged 22-82 years with functional dysphonia (hyperfunctional type dysphonia), II – 45 patients aged 28-80 years with chronic hypertrophic laryngitis (polyp of the vocal fold, hypertrophy of the vocal fold, Reinke-type edema-hypertrophic changes in the vocal folds), III – 41 individuals, including students of the Faculty of Military Medicine of the Medical University of Łódź, at the age of 21-70 years, without disease symptoms within the vocal organ. The following examinations were performed: a laryngological subject and object examination, a videolaryngoscopic and stroboscopic examination of the larynx (including the GRBAS scale), and a videolaryngostroboscopic-phoniatric examination, encompassing the following parameters: regularity of vibrations of the vocal folds, amplitude of vibrations, marginal shift, glottis closure, voice range, the way of creating the voice, voice reposition, time of phonation) and an acoustic and voice performance diagnostic analysis by the means of the DiagnoScope “Specjalista” program. Results: The results obtained suggest that the differentiation of pathological voices from normal voices is statistically significant for parameters such as: Jitter, RAP, PPQ, AFO, SimpleQ, Shimmer, APQ, Q, APQ, HPQh, RHPQ, RHPQh, R2H, U2H, U2Hl, U2Hh, NHR, Yg. In the function analysis, statistically significant differences were found in all parameters. Conclusions: The videolaryngostroboscopic examination, voice rating scales, and the objective analysis of the voice provide an appropriate set of tests that determine the nature of voice disorders. The acoustic and function analysis of the voice enables an early diagnosis of dysphonia, both functional and organic. The acoustic and function analysis is an objective method of assessment of the voice disorders which renders it possible to use it in judicature.
APA, Harvard, Vancouver, ISO, and other styles
29

Liu, Boquan, Evan Polce, Hayley Raj, and Jack Jiang. "Quantification of Voice Type Components Present in Human Phonation Using a Modified Diffusive Chaos Technique." Annals of Otology, Rhinology & Laryngology 128, no. 10 (2019): 921–31. http://dx.doi.org/10.1177/0003489419848451.

Full text
Abstract:
Purpose: Signal typing has been used to categorize healthy and disordered voices; however, human voices are likely comprised of differing proportions of periodic type 1 elements, type 2 elements that are periodic with modulations, aperiodic type 3 elements, and stochastic type 4 elements. A novel diffusive chaos method is presented to detect the distribution of voice types within a signal with the goal of providing an objective and clinically useful tool for evaluating the voice. It was predicted that continuous calculation of the diffusive chaos parameter throughout the voice sample would allow for construction of comprehensive voice type component profiles (VTCP). Methods: One hundred thirty-five voice samples of sustained /a/ vowels were randomly selected from the Disordered Voice Database Model 4337. All samples were classified according to the voice type paradigm using spectrogram analysis, yielding 34 type 1, 35 type 2, 42 type 3, and 24 type 4 voice samples. All samples were then analyzed using the diffusive chaos method, and VTCPs were generated to show the distribution of the 4 voice type components (VTC). Results: The proportions of VTC1 varied significantly between the majority of the traditional voice types ( P &lt; .001). Three of the 4 VTCs of type 3 voices were significantly different from the VTCs of type 4 voices ( P &lt; .001). These results were compared to calculations of spectrum convergence ratio, which did not vary significantly between voice types 1 and 2 or 2 and 3. Conclusion: The diffusive chaos method demonstrates proficiency in generating comprehensive VTCPs for disordered voices with varying severity. In contrast to acoustic parameters that provide a single measure of disorder, VTCPs can be used to detect subtler changes by observing variations in each VTC over time. This method also provides the advantage of quantifying stochastic noise components that are due to breathiness in the voice.
APA, Harvard, Vancouver, ISO, and other styles
30

Myszel, Karol, and Agata Szkiełkowska. "Acoustic voice parameters in hearing-impaired, school-aged children. Research study outcomes." Journal of Clinical Otorhinolaryngology 3, no. 3 (2021): 01–06. http://dx.doi.org/10.31579/2692-9562/034.

Full text
Abstract:
Aim/Background: Individuals deprived of full hearing ability suffer from many problems in their social, professional, and personal lives. Prolonged hearing deprivation, particularly in children, results in many consequences including loss of control of one’s voice. Impaired hearing control of the voice leads to dysphonia of different degrees. The aim of this study was to analyse acoustic voice parameters in hearing-impaired, school-aged children (7–12 years) and the influence of treatment type on voice quality. Material and method: The subjects in this study were a group of 83 Polish school children, aged 7–12 years. ENT and phoniatric examination were performed on each subject prior to inclusion. Acoustic voice analysis was performed for each individual. The linguistic material was a prolonged [a] vowel. The acoustic analysis was performed with a digital spectrograph KAY 4300B for multidimensional voice analysis MDVP (Multi-Dimensional Voice Profile). The criterion to include patients in the study group was a profound prelingual, sensorineural hypoacusis. Within the study group, four subgroups were categorised according to the use of hearing devices: 20 children without any type of hearing device (HL), 20 children with hearing aids (HA), and 20 children with cochlear implants (CI). The children using hearing aids and cochlear implants were included in the study 6 months after the hearing rehabilitation was implemented. The control group (NH) consisted of 23 normal-hearing children, in the same age group, without any co-existing voice disorders. The statistical level of significance p &lt; 0.05 was used. Results: The selected, clinically significant voice parameters were analysed for the groups of children in the study.
APA, Harvard, Vancouver, ISO, and other styles
31

Mittal, Vikas, and R. K. Sharma. "Classification of Parkinson Disease Based on Analysis and Synthesis of Voice Signal." International Journal of Healthcare Information Systems and Informatics 16, no. 4 (2021): 1–22. http://dx.doi.org/10.4018/ijhisi.20211001.oa30.

Full text
Abstract:
The most important application of voice profiling is pathological voice detection. Parkinson's disease is a chronic neurological degenerative disease affecting the central nervous system responsible for essentially progressive evolution movement disorders. 70% to 90% of Parkinson’s disease (PD) patients show an affected voice. This paper proposes a methodology for PD based on acoustic, glottal, physical, and electrical parameters. The results show that the acoustic parameter is more important in the case of Parkinson’s disease as compared to glottal and physical parameters. The authors achieved 97.2% accuracy to differentiate Parkinson and healthy voice using jitter to pitch ratio proposed algorithm. The Authors also proposed an algorithm of poles calculation of the vocal tract to find formants of the vocal tract. Further, formants are used for finding the transfer function of vocal tract filter. In the end, the authors suggested parameters of the electrical vocal tract model are also changed in the case of PD voices.
APA, Harvard, Vancouver, ISO, and other styles
32

Dahmani, Mohamed, and Mhania Guerti. "Recurrence Quantification Analysis of Glottal Signal as non Linear Tool for Pathological Voice Assessment and Classification." International Arab Journal of Information Technology 17, no. 6 (2020): 857–66. http://dx.doi.org/10.34028/iajit/17/6/4.

Full text
Abstract:
Automatic detection and assessment of Vocal Folds pathologies using signal processing techniques knows an extensively challenge use in the voice or speech research community. This paper contributes the application of the Recurrence Quantification Analysis (RQA) to a glottal signal waveform in order to evaluate the dynamic process of Vocal Folds (VFs) for diagnosis and classify the voice disorders. The proposed solution starts by extracting the glottal signal waveform from the voice signal through an inverse filtering algorithm. In the next step, the parameters of RQA are determined via the Recurrent Plot (RP) structure of the glottal signal where the normal voice is considered as a reference. Finally, these parameters are used as input features set of a hybrid Particle Swarm Optimization-Support Vector Machines (PSO-SVM) algorithms to segregate between normal and pathological voices. For the test validation, we have adopted the collection of Saarbrucken Voice Database (SVD) where we have selected the long vowel /a:/ of 133 normal samples and 260 pathological samples uttered by four groups of subjects : persons having suffered from vocal folds paralysis, persons having vocal folds polyps, persons having spasmodic dysphonia and normal voices. The obtained results show the effectiveness of RQA applied to the glottal signal as a features extraction technique. Indeed, the PSO-SVM as a classification method presented an effective tool for assessment and diagnosis of pathological voices with an accuracy of 97.41%
APA, Harvard, Vancouver, ISO, and other styles
33

Knyazev, Sergey V. "On the interaction of phonetic parameters implementing the voiced / voiceless phonological opposition in Standard Modern Russian." Sibirskiy filologicheskiy zhurnal, no. 4 (2021): 137–53. http://dx.doi.org/10.17223/18137083/77/11.

Full text
Abstract:
The paper reports new data obtained in the experimental study of voice coarticulation of voiced and voiceless obstruents adjacent to sonorant depending on the place and manner of articulation of these consonants in Standard Modern Russian. The experimental results revealed the voice coarticulation of the obstruent in word-internal clusters of [sonorant + obstruent + sonorant] coronal consonants, possibly due to the preceding homorganic nasal consonant. In the case of sonorants [nasal + voiceless stop + vibrant] that are not identical in place and manner of articulation, the closure part of the dental stop becomes voiced throughout, with this phonation type accommodation not leading, nevertheless, to the voiced / voiceless phoneme neutralization since the contrast in question is still maintained by phonetic parameters other than voice (phonation itself). These are closure duration, burst duration, and relative overall intensity. On the contrary, in the case of dental sonorants [nasal + voiceless stop + nasal] being identical in place and manner of articulation, the contrast in burst duration is eliminated since no burst of dental stop is found in the position before homorganic nasal, with the closure part of the stop not acquiring voicing to prevent the voiced / voiceless phoneme neutralization. In conclusion, it is argued that in Standard Modern Russian, the phonetic parameter [relative overall intensity] is less significant in the hierarchical structure of distinctive phonological feature than [closure voicing] and [burst duration] ones since it cannot serve as the only parameter distinguishing the voiced and voiceless obstruents in the intersonorant position.
APA, Harvard, Vancouver, ISO, and other styles
34

Mitrovic, Slobodan, Rajko Jovic, Vesna Aleksic, and Biserka Cvejic. "Parameters of phoniatric examinations for solo singers." Medical review 55, no. 7-8 (2002): 309–13. http://dx.doi.org/10.2298/mpns0208309m.

Full text
Abstract:
Introduction A phoniatrist analyzes the professional's voice at the beginning of his vocal studies or career but also later, in cases of voice disorder. Phoniatric examination of professional singers must be done according to "all inclusive" protocols of examination. Such protocols must establish the status of basic elements of phonatory system: activator generator and resonator of voice and articulatory space. Anamnesis All patients requiring phoniatric examination no matter if they are candidates for professional singers, need to provide anamnestic data about their previous problems regarding voice or singing. Clinical examination This examination is necessary and it must include: examination of nose, cavum oris, pharynx, ears and larynx. Subjective acoustic analysis This analysis is based on evaluation of physiological and pathophysiological manifestations of voice. Musical voice range Determination of musical voice range during phoniatric examination does not intend to make any classification of voice, nor to suggest to vocal teacher what he should count upon from future singers. Musical range can be determined only by a phoniatrist skilled in music or with musical training, but first of all vocal teacher. Objective acoustic analysis These methods are used for examination of phonatory function, or laryngeal pathology. They are not invasive and give objective and quantitative information. They include laryngostroboscopy, spectral analysis of voice (sonography) and fundamental parameters of voice signal (computer program). Speech examination Articulation is very important for solo singers, because good articulation contributes to qualitative emission of sound and expression of emotions. Hearing tests Tonal-threshold audiometry is performed as a hearing test. Additional tests They include rhinomanometry, vital capacity measurements maximal phonation time and phonation quotient. Conclusion Phoniatric examination is a necessary proceeding which must be performed before admission to the academy of solo singing, and then during singers' education and career. The phoniatric protocol must include a minimal number of parameters, which can be increased if required. All parameters of phoniatric examination must be adequatly evaluated by experts.
APA, Harvard, Vancouver, ISO, and other styles
35

Lee, Ji-Yeoun. "Experimental Evaluation of Deep Learning Methods for an Intelligent Pathological Voice Detection System Using the Saarbruecken Voice Database." Applied Sciences 11, no. 15 (2021): 7149. http://dx.doi.org/10.3390/app11157149.

Full text
Abstract:
This work is focused on deep learning methods, such as feedforward neural network (FNN) and convolutional neural network (CNN), for pathological voice detection using mel-frequency cepstral coefficients (MFCCs), linear prediction cepstrum coefficients (LPCCs), and higher-order statistics (HOSs) parameters. In total, 518 voice data samples were obtained from the publicly available Saarbruecken voice database (SVD), comprising recordings of 259 healthy and 259 pathological women and men, respectively, and using /a/, /i/, and /u/ vowels at normal pitch. Significant differences were observed between the normal and the pathological voice signals for normalized skewness (p = 0.000) and kurtosis (p = 0.000), except for normalized kurtosis (p = 0.051) that was estimated in the /u/ samples in women. These parameters are useful and meaningful for classifying pathological voice signals. The highest accuracy, 82.69%, was achieved by the CNN classifier with the LPCCs parameter in the /u/ vowel in men. The second-best performance, 80.77%, was obtained with a combination of the FNN classifier, MFCCs, and HOSs for the /i/ vowel samples in women. There was merit in combining the acoustic measures with HOS parameters for better characterization in terms of accuracy. The combination of various parameters and deep learning methods was also useful for distinguishing normal from pathological voices.
APA, Harvard, Vancouver, ISO, and other styles
36

Arias, Mariano Rosique, JosÉ Luis RamÓN, Matilde Campos, and Juan JimÉNez Cervantes. "Acoustic analysis of the voice in phonatory fistuloplasty after total laryngectomy." Otolaryngology–Head and Neck Surgery 122, no. 5 (2000): 743–47. http://dx.doi.org/10.1067/mhn.2000.98359.

Full text
Abstract:
A comparative study of the voice with sustained phonation of the vowel /a/ was made in 3 groups of male patients: (1) 20 patients receiving total laryngectomy for epidermoid carcinoma of the larynx who had acquired good voice quality after a phonatory fistuloplasty with a Herrmann voice prosthesis; (2) 20 patients undergoing total laryngectomy for epidermoid carcinoma of the larynx who had learned esophageal speech; and (3) 20 subjects with normal voices. Statistical analysis yielded significant differences in fundamental voice frequency between the 3 groups, with the patients with phonatory prostheses revealing the closest to a normal voice. For other parameters used, such as jitter, shimmer, and harmonics/noise ratio, voice quality with a phonatory prosthesis was similar to that obtained with esophageal speech.
APA, Harvard, Vancouver, ISO, and other styles
37

Arias, Mariano Rosique, José Luis Ramón, Matilde Campos, and Juan Jiménez Cervantes. "Acoustic Analysis of the Voice in Phonatory Fistuloplasty after Total Laryngectomy." Otolaryngology–Head and Neck Surgery 122, no. 5 (2000): 743–47. http://dx.doi.org/10.1016/s0194-5998(00)70208-7.

Full text
Abstract:
A comparative study of the voice with sustained phonation of the vowel /a/ was made in 3 groups of male patients: (1) 20 patients receiving total laryngectomy for epidermoid carcinoma of the larynx who had acquired good voice quality after a phonatory fistuloplasty with a Herrmann voice prosthesis; (2) 20 patients undergoing total laryngectomy for epidermoid carcinoma of the larynx who had learned esophageal speech; and (3) 20 subjects with normal voices. Statistical analysis yielded significant differences in fundamental voice frequency between the 3 groups, with the patients with phonatory prostheses revealing the closest to a normal voice. For other parameters used, such as jitter, shimmer, and harmonics/noise ratio, voice quality with a phonatory prosthesis was similar to that obtained with esophageal speech.
APA, Harvard, Vancouver, ISO, and other styles
38

Kreiman, Jody, Bruce R. Gerratt, and Kristin Precoda. "Listener Experience and Perception of Voice Quality." Journal of Speech, Language, and Hearing Research 33, no. 1 (1990): 103–15. http://dx.doi.org/10.1044/jshr.3301.103.

Full text
Abstract:
Five speech-language clinicians and 5 naive listeners rated the similarity of pairs of normal and dysphonic voices. Multidimensional scaling was used to determine the voice characteristics that were perceptually important for each voice set and listener group. Solution spaces were compared to determine if clinical experience affects perceptual strategies. Naive and expert listeners attended to different aspects of voice quality when judging the similarity of voices, for both normal and pathological voices. All naive listeners used similar perceptual strategies; however, individual clinicians differed substantially in the parameters they considered important when judging similarity. These differences were large enough to suggest that care must be taken when using data averaged across clinicians, because averaging obscures important aspects of an individual’s perceptual behavior.
APA, Harvard, Vancouver, ISO, and other styles
39

Pimenta, Regina Aparecida, María Eugenia Dájer, Adriana Hachiya, Domingos Hiroshi Tsuji, and Arlindo Neto Montagnoli. "Parameters Acoustic and High-speed kymography identified effects of voiced vibration and vocal fry exercises." CoDAS 25, no. 6 (2013): 577–83. http://dx.doi.org/10.1590/s2317-17822014000100010.

Full text
Abstract:
Purpose: To identify the immediate effects of voiced vibration and vocal fry exercises on healthy subjects by means of acoustic parameters and high-speed kymography. Methods: Thirty healthy subjects (18 women and 12 men, aged from 19 to 45 years old) participated in this study. The voices were recorded at the range of 44.100 samples/second, before and after of the vocal exercises. We developed a computational routine to extract jitter and shimmer. High-speed kymography was generated from laryngeal images for the analysis of the phase times: closed (CPh), open (OPh), of closing (cPh) and of opening (oPh) and used the paired t-Student test and the Mann-Whitney test, with a significance level of 0.05. Results: After voiced vibrations, acoustic parameters showed reduced jitter for both genders (p=0.018 for men and p&lt;0.01 for women) and reduced shimmer for female voices (p&lt;0.01). There was a decrease in CPh (p=0.046) and cPh (p=0.026) and an increase in OPh (p=0.05) in female vocal folds. After vocal fry, we identified decreased jitter (p&lt;0.01) in female voices and cPh (p=0.026) in male vocal folds. Conclusion: We observed more positive immediate effects of the voiced vibrations, mainly in voice quality and vocal folds among females. Nevertheless, studies with larger male sample and investigation of the appropriate time of vocal fry are necessary to confirm the results of this search.
APA, Harvard, Vancouver, ISO, and other styles
40

Novák, Alexej, and Jan Vokřál. "Acoustic Parameters for the Evaluation of Voice of Future Voice Professionals." Folia Phoniatrica et Logopaedica 47, no. 5 (1995): 279–85. http://dx.doi.org/10.1159/000266361.

Full text
APA, Harvard, Vancouver, ISO, and other styles
41

Ikuma, Takeshi, Andrew J. McWhorter, Lacey Adkins, and Melda Kunduk. "Development of Parameters towards Voice Bifurcations." Applied Sciences 11, no. 12 (2021): 5469. http://dx.doi.org/10.3390/app11125469.

Full text
Abstract:
Pathological vocal folds are known to exhibit multiple oscillation patterns, depending on tissue imbalance, subglottal pressure level, and other factors. This includes mid-phonation changes due to bifurcations in the underlying voice source system. Knowledge of when changes in oscillation patterns occur is helpful in the assessments of voice disorders, and the knowledge could be transformed into useful objective measures. Mid-phonation bifurcations can occur in rapid succession; hence, a fast classification of oscillation pattern is critical to minimize the averaging of data across bifurcations. This paper proposes frequency-ratio based short-term measures, named harmonic disturbance factor (HDF) and biphonic index (BI), towards the detection of the bifurcations. For the evaluation of HDF and BI, a frequency selection algorithm for glottal source signals is devised, and its efficacy is demonstrated with the glottal area waveforms of four cases, representing the wide range of oscillatory behaviors. The HDF and BI exhibit clear transitions when the voice bifurcations are apparent in the spectrograms. The presented proof-of-concept experiment’s outcomes warrant a larger scale study to formalize the parameters of the frequency selection algorithm.
APA, Harvard, Vancouver, ISO, and other styles
42

Leonov, A. S., and V. N. Sorokin. "Accuracy in determining voice source parameters." Acoustical Physics 60, no. 6 (2014): 687–93. http://dx.doi.org/10.1134/s1063771014050078.

Full text
APA, Harvard, Vancouver, ISO, and other styles
43

Tafiadis, Dionysios, Georgios Tatsis, Nausica Ziavra, and Eugenia I. Toki. "Voice Data on Female Smokers: Coherence between the Voice Handicap Index and Acoustic Voice Parameters." AIMS Medical Science 4, no. 2 (2017): 151–63. http://dx.doi.org/10.3934/medsci.2017.2.151.

Full text
APA, Harvard, Vancouver, ISO, and other styles
44

Adak, Smritikana, Rashmimala Pradhan, Sujyotsna Jena, Subhalaxmi Pradhan, Lulup Kumar Sahoo, and Mamata Swain. "Impact of nurse and beloved family member’s voice stimulus on the level of consciousness and physiological parameters in comatose patients." Journal of Integrative Nursing 7, no. 1 (2025): 33–41. https://doi.org/10.4103/jin.jin_133_24.

Full text
Abstract:
ABSTRACT Objective: The objective of this study was to compare the effect of nurse and beloved family member’s recording voice on consciousness and physical parameters in patients with coma state. Materials and Methods: A randomized control trial parallel group design was conducted among 45 comatose patients divided into two intervention groups, i.e. nurse voice stimulus group, receiving nurses voice with standard care, family members voice stimulus group receiving their beloved family member voice with standard care and one control group receiving only standard care in medicine intensive care unit. The intervention was provided three times a day, each lasting 5 min for 7 days in addition to standard care. Repeated measure analysis of variance and independent t-test were used to compare within and between groups, respectively. Results: The study found significant differences in Glasgow coma scale (GCS) scores within both the nurse (F = 2.78, P = 0.042) and family member (F = 10.27, P = 0.0001) voice stimulus groups over 7 days. Comparing GCS scores between intervention groups showed significant variations before (P = 0.028), during (P = 0.047), and after (P = 0.036) the intervention on day 7. Comparing GCS scores between the family members’ voice stimulus group and the control group, significant changes were observed on days 5 and 7 (P = 0.043, 0.030, 0.030, and 0.014, 0.012, 0.012) before, during, and after the intervention. Conclusions: The use of beloved family members’ voices proved more effective in elevating the patients’ level of consciousness compared to both the nurse voice stimulus group and the control group.
APA, Harvard, Vancouver, ISO, and other styles
45

Li, Xiao Wu, and Hong Bo Ouyang. "Evaluating Artistic Voice of Singing Objectivly Using BPNN." Advanced Materials Research 546-547 (July 2012): 1240–44. http://dx.doi.org/10.4028/www.scientific.net/amr.546-547.1240.

Full text
Abstract:
In this paper, an objective way to evaluate artistic voice of singing is discussed. The model transforms artistic voice evaluation indicators into qualified data as BP network input and takes fuzzy synthetic evaluation results as output. The authors take F1(the first Formant), F3(the third Formant), vocal range, perturbation of F1, perturbation of F3 and average energy as the evaluating parameters and assess the quality of singing voices with BPNN(back propagation neural network). The results are then compared with the subjective evaluation of experienced professionals. Experiments show that BP neural network is effective to evaluate the singing voices, thus to be helpful to scientific guidance of selecting and training the talent of artistic voice.
APA, Harvard, Vancouver, ISO, and other styles
46

Dong, Li, and Jiangping Kong. "Electroglottographic Analysis of the Voice in Young Male Role of Kunqu Opera." Applied Sciences 11, no. 9 (2021): 3930. http://dx.doi.org/10.3390/app11093930.

Full text
Abstract:
The phonation types used in the young male role in Kunqu Opera were investigated. Two national young male role singers volunteered as the subjects. Each singer performed three voice conditions: singing, stage speech, and reading lyrics. Three electroglottogram parameters, the fundamental frequency, contact quotient, and speed quotient, were analyzed. Electroglottogram parameters were different between voice conditions. Five phonation types were found by clustering analysis in singing and stage speech: (1) breathy voice, (2) high adduction modal voice, (3) modal voice, (4) untrained falsetto, and (5) high adduction falsetto. The proportion of each phonation type was not identical in singing and stage speech. The relationship between phonation type and pitch was multiple to one in the low pitch range, and one to one in the high pitch range. The sound pressure levels were related to the phonation types. Five phonation types, instead of only the two phonation types (modal voice and falsetto) that are identified in traditional Kunqu Opera singing theory, were concomitantly used in the young male role’s artistic voices. These phonation types were more similar to those of the young female roles than to those of the other male roles in the Kunqu Opera.
APA, Harvard, Vancouver, ISO, and other styles
47

Zhang, Xiaojun, Lingling Gu, Wei Wei, Di Wu, Zhi Tao, and Heming Zhao. "Pathological Voice Source Analysis System Using a Flow Waveform-Matched Biomechanical Model." Applied Bionics and Biomechanics 2018 (July 2, 2018): 1–13. http://dx.doi.org/10.1155/2018/3158439.

Full text
Abstract:
Voice production occurs through vocal cord and vibration coupled to glottal airflow. Vocal cord lesions affect the vocal system and lead to voice disorders. In this paper, a pathological voice source analysis system is designed. This study integrates nonlinear dynamics with an optimized asymmetric two-mass model to explore nonlinear characteristics of vocal cord vibration, and changes in acoustic parameters, such as fundamental frequency, caused by distinct subglottal pressure and varying degrees of vocal cord paralysis are analyzed. Various samples of sustained vowel /a/ of normal and pathological voices were extracted from MEEI (Massachusetts Eye and Ear Infirmary) database. A fitting procedure combining genetic particle swarm optimization and a quasi-Newton method was developed to optimize the biomechanical model parameters and match the targeted voice source. Experimental results validate the applicability of the proposed model to reproduce vocal cord vibration with high accuracy, and show that paralyzed vocal cord increases the model coupling stiffness.
APA, Harvard, Vancouver, ISO, and other styles
48

Vipperla, Ravichander, Steve Renals, and Joe Frankel. "Ageing Voices: The Effect of Changes in Voice Parameters on ASR Performance." EURASIP Journal on Audio, Speech, and Music Processing 2010, no. 1 (2010): 525783. http://dx.doi.org/10.1186/1687-4722-2010-525783.

Full text
APA, Harvard, Vancouver, ISO, and other styles
49

Rantala, Leena, and Erkki Vilkman. "Relationship between subjective voice complaints and acoustic parameters in female teachers' voices." Journal of Voice 13, no. 4 (1999): 484–95. http://dx.doi.org/10.1016/s0892-1997(99)80004-6.

Full text
APA, Harvard, Vancouver, ISO, and other styles
50

Van Lancker, Diana, Jody Kreiman, and Karen Emmorey. "Familiar voice recognition: patterns and parameters Part I: Recognition of backward voices." Journal of Phonetics 13, no. 1 (1985): 19–38. http://dx.doi.org/10.1016/s0095-4470(19)30723-5.

Full text
APA, Harvard, Vancouver, ISO, and other styles
We offer discounts on all premium plans for authors whose works are included in thematic literature selections. Contact us to get a unique promo code!

To the bibliography