Dissertations / Theses on the topic 'Acoustic phonetics'

Consult the top 50 dissertations / theses for your research on the topic 'Acoustic phonetics.'

1

Teixeira, de Jesus Luis Miguel. "Acoustic phonetics of European Portuguese fricative consonants." Thesis, University of Southampton, 2001. https://eprints.soton.ac.uk/426721/.

Abstract:
The production of fricatives is not yet fully understood because the mechanism is particularly complex. Studies of Portuguese fricatives have been very limited, so in this thesis a novel methodology of corpus design and temporal and spectral analysis techniques were developed to enhance our description of the acoustic properties and to increase our understanding of the production of fricatives. The data presented in this thesis could be used to improve the naturalness of synthetic speech. Corpora were devised that included the fricatives /f, v, s, z, ʃ, ʒ/ in the following contexts: sustained, repeated nonsense words of the form /PV1CV2/, Portuguese words containing fricatives in frame sentences, and the same set of words in sentences. Four subjects (two male, two female) were recorded saying the corpora, using a microphone in the acoustic far-field and a laryngograph. Temporal analysis of the fricatives revealed a large number of devoiced examples. Analysis of variance showed that devoicing was significantly more likely for word-final fricatives and posterior place of articulation. In addition to the fricatives listed above, we also noticed other fricatives occurring as allophones of /R, ɾ/ in 100 words out of 365. Durations of the fricative segments were comparable to /R, ɾ/ and thus shorter on average than fricatives /f, v, s, z, ʃ, ʒ/. Some of the speech segments were continuous "noisy signals" very similar to those of fricatives. The spectral peak frequencies of the fricatives occurring in place of /R/ were compared to the other fricatives, which indicated a place of articulation further back than /ʃ, ʒ/, and compared to velar and uvular fricative results previously reported for other languages.
These comparisons indicated that the uvular fricatives [χ, ʁ] and the voiceless tapped alveolar [ɾ̥] were given the phonological role of /R/ and /ɾ/ respectively, though these fricatives have not previously been reported as phones of standard European Portuguese. The fricative spectra were parameterised in terms of our knowledge of the underlying aeroacoustics. The parameters spectral slope, frequency of maximum amplitude, and dynamic amplitude were developed to characterise fricative spectra. The parameters behaved as predicted for changes in effort level, voicing, and location within the fricative. Some combinations were also useful for separating the fricatives by place or by sibilance. A preliminary cross-language study of Portuguese and English fricatives produced by two bilingual siblings is also presented. Although results for Portuguese and English fricatives seem to be very similar, this may be due to the use by bilinguals of different production strategies from monolinguals, which attenuate cross-language acoustical contrasts. The English corpus developed for the bilingual subjects could be used to study monolingual English speakers.
2

Flory, Yvonne. "The impact of head and body postures on the acoustic speech signal." Thesis, University of Cambridge, 2015. https://www.repository.cam.ac.uk/handle/1810/247436.

Abstract:
This dissertation is aimed at investigating the impact of postural changes within speakers on the acoustic speech signal to complement research on articulatory changes under the same conditions. The research is therefore relevant for forensic phonetics, where quantifying within-speaker variation is vital for the accuracy of speaker comparison. To this end, two acoustic studies were carried out to quantify the influence of five head positions and three body orientations on the acoustic speech signal. Results show that there is a consistent change in the third formant, a change which was most evident in the body orientation measurements, and to a lesser extent in the head position data. Analysis of the results with respect to compensation strategies indicates that speakers employ different strategies to compensate for these perturbations to their vocal tract. Some speakers did not exhibit large differences in their speech signal, while others appeared to compensate much less. Across all speakers, the effect was much stronger in what were deemed ‘less natural’ postures. That is, speakers were apparently less able to predict and compensate for the impact of prone body orientation on their speech than for that of the more natural supine orientation. In addition to the acoustic studies, a perception experiment assessed whether listeners could make use of acoustic cues to determine the posture of the speaker. Stimuli were chosen with, by design, stronger or weaker acoustic cues to posture, in order to elicit a possible difference in identification performance. Listeners were nevertheless not able to identify above chance whether a speaker was sitting or lying in prone body orientation even when hearing the set with stronger cues.
Further combined articulatory and acoustic research will have to be carried out to disentangle which articulatory behaviours correlate with the acoustic changes presented in order to draw a more comprehensive picture of the effects of postural variation on speech.
3

Yuan, Anthony. "Acoustic study of the Cantonese diphthongs." Click to view the E-thesis via HKUTO, 1996. http://sunzi.lib.hku.hk/hkuto/record/B36209375.

Abstract:
Thesis (B.Sc)--University of Hong Kong, 1996.
"A dissertation submitted in partial fulfilment of the requirements for the Bachelor of Science (Speech and Hearing Sciences), The University of Hong Kong, April 29, 1996." Also available in print.
4

So, Ka-pak, and 蘇家柏. "Articulatory-acoustic relation in Cantonese vowels." Thesis, The University of Hong Kong (Pokfulam, Hong Kong), 2003. http://hub.hku.hk/bib/B26772449.

5

Yeung, Ho-yan. "Vowels of Hong Kong English from an acoustic perspective /." Click to view the E-thesis via HKU Scholars Hub, 2007. http://lookup.lib.hku.hk/lookup/bib/B42006235.

Abstract:
Thesis (B.Sc)--University of Hong Kong, 2007.
"A dissertation submitted in partial fulfilment of the requirements for the Bachelor of Science (Speech and Hearing Sciences), The University of Hong Kong, June 30, 2007." Includes bibliographical references (p. 29-30). Also available in print.
6

Krull, Diana. "Acoustic Properties as Predictors of Perceptual Responses : a Study of Swedish Voiced Stops." Doctoral thesis, Stockholms universitet, Institutionen för lingvistik, 1988. http://urn.kb.se/resolve?urn=urn:nbn:se:su:diva-40213.

Abstract:
In speech recognition algorithms and certain theories of speech perception the interpretation of the signal is based on "distance scores" for comparisons of the signal with stored references; in these theories, perception is seen as a product of stimulus and experience. The aim of the present thesis is to evaluate such distance measures by investigating the perceptual confusions of the Swedish voiced stops [b, d, ɖ, g] in systematically varied fragments of vowel-consonant-vowel stimuli providing 25 vowel contexts for each consonant. To what extent can perceptual identifications be accounted for in terms of the acoustic properties of the stimuli? Short stimulus segments following stop release, chosen to elicit perceptual confusions, constituted the main material for this investigation. The resulting confusions were shown to form a regular pattern depending mainly on the acute/grave dimension of the following vowel. The acoustic distances calculated were based partly on formant frequencies at the consonant-vowel boundary, partly on filter-band spectra. Both models provided distance measures which revealed regular patterns related in their essentials to the confusions. However, the predictive capacity of both models was improved by including the dynamic properties of the stimuli in the distance measures. The highest correlation between predicted and observed percent confusions, r=.85, was obtained with the formant-based model. The asymmetries in the listeners' confusions were also shown to be predictable given acoustic data on the following vowel and were included in the calculations.
To order the book, send an e-mail to exp@ling.su.se
7

Deshmukh, Om Dadaji. "Synergy of acoustic-phonetics and auditory modeling towards robust speech recognition." College Park, Md. : University of Maryland, 2006. http://hdl.handle.net/1903/3952.

Abstract:
Thesis (Ph. D.) -- University of Maryland, College Park, 2006.
Thesis research directed by: Electrical Engineering. Title from t.p. of PDF. Includes bibliographical references. Published by UMI Dissertation Services, Ann Arbor, Mich. Also available in paper.
8

Hartinger, Mariam. "Untersuchungen der Sprechmotorik von Polterern mit Hilfe der Elektromagnetischen Mediosagittalen Artikulographie (EMMA) /." Frankfurt am Main [u.a.] : Lang, 2008. http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&doc_number=016243303&line_number=0004&func_code=DB_RECORDS&service_type=MEDIA.

9

Leung, Ka Yee. "Combining acoustic features and articulatory features for speech recognition /." View Abstract or Full-Text, 2002. http://library.ust.hk/cgi/db/thesis.pl?ELEC%202002%20LEUNGK.

Abstract:
Thesis (M. Phil.)--Hong Kong University of Science and Technology, 2002.
Includes bibliographical references (leaves 92-96). Also available in electronic version. Access restricted to campus users.
10

Sellman, R. Thomas. "Word hypothesis from undifferentiated, errorful phonetic strings /." Online version of thesis, 1993. http://hdl.handle.net/1850/11727.

11

Tsay, Suhchuan Jane. "Phonological pitch." Diss., The University of Arizona, 1994. http://hdl.handle.net/10150/186900.

Abstract:
The theory proposed in this thesis, Phonological Pitch, concerns the representation and behavior of the tone feature. It is a formally simple phonological theory constrained by a set of explicit extragrammatical principles. Phonological Pitch contains two major grammatical mechanisms. First, tone is represented with a single multivalued feature (Pitch) whose value can range from 1 to n, where n is a language-specific number with no universal upper limit. Second, the Contiguity Hypothesis states that tone groups in rules must always form contiguous sets, though these groups can vary from rule to rule. Phonological Pitch can be so simple because the power of the grammatical theory is constrained with independently necessary extragrammatical factors. Specifically, limits on the number of tone levels arise from learnability and perceptual constraints, which can be precisely formalized, that also play a role in nonlinguistic domains. Similarly, the Contiguity Hypothesis is derived from psychoacoustic constraints on discriminating between acoustically similar pitches. Other perceptual and physiological constraints explain patterns in the typology of contour tones and in the interactions of tone with other features. The empirical support for Phonological Pitch includes the following. First, languages are attested with as many as five distinct tone levels, and the number of languages with n tone levels gradually decreases as n increases, rather than dropping off abruptly at some point. An analysis using learnability and perceptual constraints can explain this gradual drop better than a universal grammatical upper limit. Second, tone rules can transpose sets of tones up or down by a fixed interval, a fact which is easier to formalize with a single multivalued feature than with a set of binary features. Third, tone groups do not form universal natural classes nor groups with noncontiguous tones, as other tone theories predict. 
Fourth, tone interacts not only with laryngeal features like voicing, but also with nonlaryngeal features like vowel height, and both the existence and relative rarity of tone-vowel height interactions imply that understanding tone interactions requires reference to extragrammatical physiological factors.
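The two mechanisms described in this abstract, a single multivalued Pitch feature ranging from 1 to n and the Contiguity Hypothesis, can be illustrated with a small sketch. This is not the thesis's own formalism: the function names and the five-level example language are hypothetical, chosen only to suggest why transposition by a fixed interval is simple to state over an integer-valued feature.

```python
def is_contiguous(tone_group):
    """Return True if the pitch levels form a gap-free set, e.g. {2, 3, 4}."""
    levels = sorted(set(tone_group))
    return bool(levels) and levels[-1] - levels[0] + 1 == len(levels)


def transpose(tone_group, interval, n_levels):
    """Shift every pitch level by a fixed interval, staying within 1..n_levels."""
    shifted = {level + interval for level in tone_group}
    if any(level < 1 or level > n_levels for level in shifted):
        raise ValueError("transposition leaves the language's pitch range")
    return shifted


# Hypothetical language with five pitch levels (an assumption for illustration).
N_LEVELS = 5
print(is_contiguous({2, 3, 4}))         # a gap-free tone group
print(is_contiguous({1, 3}))            # a group with a gap
print(transpose({2, 3}, 1, N_LEVELS))   # the same group one level higher
```

With binary features, the same upward shift would have to be restated for each feature combination, which is the contrast the abstract draws.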
12

Ericsdotter, Christine. "Articulatory-Acoustic Relationships in Swedish Vowel Sounds." Doctoral thesis, Stockholm : Department of Linguistics, Stockholm University, 2005. http://urn.kb.se/resolve?urn=urn:nbn:se:su:diva-699.

13

Poon, Man-wai May. "Acoustic cues for the perception of aspiration in Cantonese initial stops." Click to view the E-thesis via HKUTO, 2000. http://sunzi.lib.hku.hk/hkuto/record/B36207688.

Abstract:
Thesis (B.Sc)--University of Hong Kong, 2000.
"A dissertation submitted in partial fulfilment of the requirements for the Bachelor of Science (Speech and Hearing Sciences), The University of Hong Kong, May 10, 2000." Also available in print.
14

Yan, Kam-sum Tom. "Dyspraxia of speech in a British family an acoustic study of diphthong production /." Click to view the E-thesis via HKU Scholars Hub, 2003. http://lookup.lib.hku.hk/lookup/bib/B38890999.

Abstract:
Thesis (B.Sc.)--University of Hong Kong, 2003.
"A dissertation submitted in partial fulfilment of the requirements for the Bachelor of Science (Speech and Hearing Sciences), The University of Hong Kong, April 30, 2003." Includes bibliographical references (p. 29-31). Also available in print.
15

Miller-Ockhuizen, Amanda. "Grounding Jui'hoansi Root Phonotactics : the phonetics of the guttural OCP and other acoustic modulations /." The Ohio State University, 2001. http://rave.ohiolink.edu/etdc/view?acc_num=osu148639945196266.

16

Chang, Wing-yin Maureen. "Perceptual and acoustic differences between aging voice and dysphonic voice." Click to view the E-thesis via HKUTO, 2001. http://sunzi.lib.hku.hk/hkuto/record/B36207810.

Abstract:
Thesis (B.Sc)--University of Hong Kong, 2001.
"A dissertation submitted in partial fulfilment of the requirements for the Bachelor of Science (Speech and Hearing Sciences), The University of Hong Kong, May 4, 2001." Also available in print.
17

Ng, Yuk-sim Cherry. "Perceptual and acoustic analysis of dysarthric dysphonia direct magnitude estimation versus interval scaling /." Click to view the E-thesis via HKUTO, 2002. http://sunzi.lib.hku.hk/hkuto/record/B36208425.

Abstract:
Thesis (B.Sc)--University of Hong Kong, 2002.
"A dissertation submitted in partial fulfilment of the requirements for the Bachelor of Science (Speech and Hearing Sciences), The University of Hong Kong, May 10, 2002." Also available in print.
18

Cobb, Katherine N. "La pronunciacion de vocales atonas en espanol: La aplicacion de reglas fonologicas por parte de hablantes no-nativos del espanol." Thesis, The University of Arizona, 2009. http://hdl.handle.net/10150/193438.

Abstract:
The production of stressed and unstressed vowels by three groups of Spanish speakers (two groups differing in years of experience and one native group) was examined. The speakers took part in a production experiment whose data were subjected to several analyses. Robust effects of years of practice with a second language were found, in that the advanced speakers were able to reproduce vowels that the intermediate speakers did not produce. For stressed vowels, /e/ is the most difficult vowel for non-native speakers to learn, followed by /a/ and /u/, while /o/ and /i/ are easier. For unstressed vowels, /e/ and /o/ are the most difficult for non-native speakers to learn, followed by /a/ and /u/, and finally by /i/, which is the easiest.
19

Tam, So-may May. "Acoustic analysis of word-initial stop consonants in profoundly hearing impaired speakers." Click to view the E-thesis via HKUTO, 1996. http://sunzi.lib.hku.hk/hkuto/record/B36209338.

Abstract:
Thesis (B.Sc)--University of Hong Kong, 1996.
"A dissertation submitted in partial fulfilment of the requirements for the Bachelor of Science (Speech and Hearing Sciences), The University of Hong Kong, April 29, 1996." Also available in print.
20

Weaver, Andrea Lynn. "The Effect of a Lingual Magnet on Fricative Production: An Acoustic Evaluation of Placement and Adaptation." Diss., CLICK HERE for online access, 2005. http://contentdm.lib.byu.edu/ETD/image/etd1009.pdf.

21

Machado, Aline 1989. "Uso de técnicas acústicas para verificação de locutor em simulação experimental." [s.n.], 2014. http://repositorio.unicamp.br/jspui/handle/REPOSIP/271134.

Abstract:
Advisor: Plínio Almeida Barbosa
Dissertation (master's) - Universidade Estadual de Campinas, Instituto de Estudos da Linguagem
Summary: This research investigates the effectiveness of a set of acoustic measures for recognizing the speech of one individual in a group of ten speakers of Brazilian Portuguese. One subject from this group was drawn at random and designated the "criminal". The measures used include the frequencies of the first two vowel formants, mean fundamental frequency, the duration of syllable-sized and vowel-sized units, formant dynamics, and the standard deviation of consonantal interval durations (ΔC). All the selected excerpts come from interviewees divided into two groups: (i) outdoor interviews and (ii) telephone recordings (mobile to mobile). The individuals are speakers of Brazilian Portuguese from the states of São Paulo, Rio Grande do Sul, Pará and Bahia. The study gives a historical overview of forensic phonetics, the methods of analysis used throughout its history, the acoustic parameters most used in different recording contexts (direct and by mobile phone), and which of them were most significant in this research. In our results, the parameters least affected by the change of transmission channel were rhythm and timing parameters, such as duration, speech rate and ΔC, together with one parameter measuring formant dynamics, the rate of movement of the second formant. The temporal measures, being the most variable between subjects, had great discriminating power. The statistical tests indicated that three of the individuals studied showed similarities to the "criminal".
Abstract: The aim of this research is to use acoustic techniques to recognize a subject in a group of ten speakers of Brazilian Portuguese and to point out the most relevant acoustic parameters for speaker recognition in that group. The analysis of the first two formants of the oral vowels, fundamental frequency, speech length, formant movement rate, syllable-sized duration, intensity and ΔC (standard deviation of consonantal interval durations of the collected samples) will help identify an individual from within the group. All the samples are from interviews made in a poorly treated acoustic environment and into a mobile phone. Moreover, the samples of one of the speakers (the "criminal"), which were collected in an acoustically treated room, will simulate the questioned pattern of the forensic situation.
Master's
Linguistics
Master in Linguistics
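The ΔC measure named in the abstract above is defined there as the standard deviation of consonantal interval durations. A minimal sketch of that computation follows. The interval durations are invented for illustration and are not data from the dissertation, and the use of the sample (n - 1) standard deviation is an assumption, since the abstract does not say which estimator was used.

```python
import statistics


def delta_c(consonantal_durations):
    """Standard deviation of consonantal interval durations (same unit as the input)."""
    if len(consonantal_durations) < 2:
        raise ValueError("need at least two consonantal intervals")
    # Assumption: sample standard deviation; statistics.pstdev would give
    # the population version instead.
    return statistics.stdev(consonantal_durations)


# Hypothetical consonantal interval durations in seconds.
example_intervals = [0.08, 0.12, 0.10, 0.14, 0.06]
print(round(delta_c(example_intervals), 4))
```

Because ΔC depends only on interval durations, it is the kind of timing measure that the summary reports as varying least when the transmission channel changes.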
22

Pontes, Lucas de Almeida [UNESP]. "O espraiamento da nasalização do português do Brasil." Universidade Estadual Paulista (UNESP), 2014. http://hdl.handle.net/11449/115625.

Abstract:
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)
Nasal spreading has been observed since the first grammar of Portuguese (FERNÃO DE OLIVEIRA, 1536) and by orthographers (MADUREIRA FEIJÓ, 1734; GONÇALVES VIANA, 1892). Nasal spreading is an assimilatory process that occurs when the nasalization of a consonant spreads over an adjacent vocalic segment. It can also occur as an assimilation process in other contexts. This dissertation gives a general review of the phonetic and phonological literature that refers to any kind of nasal spreading phenomenon. The corpus investigated acoustically is a set of sentences spoken by a native speaker of the Paulista dialect. The acoustic analysis examined the formant structure of nasalized segments and of oral segments that can be nasalized through nasal spreading. Given the difficulty of identifying nasal formants with the Praat program, different techniques were used to achieve a better result, and this question is discussed in the dissertation. The dissertation contributes to a better understanding of nasal spreading and of how to describe nasal formants, and presents an analysis of data from Brazilian Portuguese with special reference to the Paulista dialect.
23

Pontes, Lucas de Almeida. "O espraiamento da nasalização do português do Brasil /." Araraquara, 2014. http://hdl.handle.net/11449/115625.

Abstract:
Advisor: Luiz Carlos Cagliari
Committee: Larissa Cristina Berti
Committee: Vera Pacheco
Abstract: Nasal spreading has been observed since the first grammar of Portuguese (FERNÃO DE OLIVEIRA, 1536) and by orthographers (MADUREIRA FEIJÓ, 1734; GONÇALVES VIANA, 1892). Nasal spreading is an assimilatory process that occurs when the nasalization of a consonant spreads over an adjacent vocalic segment. It can also occur as an assimilation process in other contexts. This dissertation gives a general review of the phonetic and phonological literature that refers to any kind of nasal spreading phenomenon. The corpus investigated acoustically is a set of sentences spoken by a native speaker of the Paulista dialect. The acoustic analysis examined the formant structure of nasalized segments and of oral segments that can be nasalized through nasal spreading. Given the difficulty of identifying nasal formants with the Praat program, different techniques were used to achieve a better result, and this question is discussed in the dissertation. The dissertation contributes to a better understanding of nasal spreading and of how to describe nasal formants, and presents an analysis of data from Brazilian Portuguese with special reference to the Paulista dialect.
Master
24

Falahati, Ardestani Reza. "Gradient and Categorical Consonant Cluster Simplification in Persian: An Ultrasound and Acoustic Study." Thèse, Université d'Ottawa / University of Ottawa, 2013. http://hdl.handle.net/10393/26117.

Abstract:
The main goal of this thesis is to investigate the nature of an optional consonant deletion process, through an articulatory and acoustic study of word-final consonant clusters in Persian. Persian word-final coronal stops are optionally deleted when they are preceded by obstruents or the homorganic nasal /n/. For example, the final clusters in the words /næft/ “oil”, /suχt/ “burnt” and /qæsd/ “intention” are optionally simplified in fast/casual speech, resulting in: [næf], [suχ], and [qæs]. What is not clear from this traditional description is whether the coronal stop is truly deleted, or if a coronal gesture is produced, but not heard, because it is obscured by the adjacent consonants. According to Articulatory Phonology (Browman & Goldstein 1986, 1988, 1989, 1990a, 1990b, 1992, 2001), the articulatory gestures of the deleted segments can still exist even if the segments are not heard. In this dissertation, ultrasound imaging was used to determine whether coronal consonant deletion in Persian is categorical or gradient, and the acoustic consequences of cluster simplification were investigated through duration and spectral measures. This phonetic study enables an account for the optional nature of the cluster simplification process. A general phonological account is provided for the simplification of coda clusters with rising sonority, and the acoustic and articulatory investigation focuses on the simplification of clusters with coronal stops. Ten Persian-speaking graduate students from the University of Ottawa and Carleton University, five male and five female, aged 25-38 participated in the articulatory and acoustic study. Audio and real time ultrasound video recordings were made while subjects had a guided conversation with a native speaker of Persian. 662 tokens of word-final coronal clusters were auditorily classified into unsimplified and simplified according to whether they contained an audible [t]. 
Singleton coda consonants and singleton /t/s were also captured as controls. The end of the constriction plateau of C1 and beginning of constriction plateau of C3 were used to define a time interval in which to measure the coronal gesture as the vertical distance between the tongue blade and the palate. Smoothing Splines ANOVA was used in a novel way to compare tongue blade height over time across the three conditions. The articulatory results of this study showed that the gestures of the deleted segments are often still present. More specifically, the findings showed that of the clusters that sounded simplified, some truly had no [t] gesture, some had gestural overlap, and some had reduced gestures. In order to explain the optional nature of the simplification process, it is argued that the simplified tokens are the result of two independent mechanisms. Inevitable mechanical and physiological effects generate gesturally reduced and overlapped tokens whereas planned language-specific behaviors driven by phonological rules or abstract cognitive representations result in no [t]-gesture output. The findings of this study support the main arguments presented in Articulatory Phonology regarding the underlying reasons for sound patterns and sound change. The results of this study are further used to examine different sound change models. It is argued that the simplified tokens with totally deleted [t] gesture could be the result of speakers changing their representations based on other people’s gestural overlap. This would be instances of the Choice and Chance categories in Blevins’ (2004) CCC sound change model. The acoustic results did not find any major cues which could distinguish simplified tokens from controls. It is argued that articulatory data should form an integral part of phonetic studies.
25

Ahn, Hyunkee. "Post-release phonatory processes in English and Korean : acoustic correlates and implications for Korean phonology /." Digital version accessible at:, 1999. http://wwwlib.umi.com/cr/utexas/main.

26

Drager, Katie. "A Sociophonetic Ethnography of Selwyn Girls' High." Thesis, University of Canterbury. Languages, Cultures and Linguistics, 2009. http://hdl.handle.net/10092/4185.

Abstract:
This thesis reports on findings from a year-long sociolinguistic ethnography at an all girls’ high school in New Zealand which is referred to as Selwyn Girls’ High (SGH). The study combines the qualitative methods of ethnography with the quantitative methods of acoustic phonetic analysis and experimental design. At the school, there were a number of different groups (e.g. The PCs, The Pasifika Group, The BBs), each forming a community of practice where the different members actively constructed their unique social personae within the context of the group. There was a dichotomy between the groups based on whether they ate lunch in the common room (CR) or not (NCR) and this division reflected the individual speakers’ stance on whether they viewed themselves as “normal” or different from other girls at the school. In-depth acoustic analysis was conducted on tokens of the word like from the girls’ speech. This is a word with a number of different pragmatic functions, such as quotative like (I was LIKE “yeah okay”), discourse particle like (It was LIKE so boring), and lexical verb like (I LIKE your socks). The results provide evidence of acoustically gradient variation in the girls’ realisations of the word like that is both grammatically and socially conditioned. For example, quotative like was more likely to have a shorter /l/ to vowel duration ratio and be less diphthongal than either discourse particle like or grammatical like and there was a significant difference in /k/ realisation depending on a combination of the token’s pragmatic function and whether the speaker ate lunch in the CR or not. Additionally, three speech perception experiments were conducted in order to examine the girls’ sensitivity to the relationship between phonetic variants, lemma-based information, and social factors. 
The results indicate that perceivers were able to distinguish between auditory tokens of the different functions of like in a manner that was consistent with trends observed in production. Perceivers were also able to extract social information about the speaker depending on phonetic cues in the stimuli. Taken together, the results provide evidence that lemmas with a shared wordform can have different phonetic realisations, that individuals can manipulate these realisations in the construction of their social personae, and that individuals can use lemma-based phonetic trends from production to identify a word. These results have implications for how phonetic, lemma, and social information are stored in the mind and, together, they are used to inform a unified model of speech production, perception and identity construction.
27

Kulikov, Vladimir. "Voicing and voice assimilation in Russian stops." Diss., University of Iowa, 2012. https://ir.uiowa.edu/etd/3327.

Abstract:
The main objective of this thesis is to investigate acoustic cues for the voicing contrast in stops in Russian for effects of speaking rate and phonetic environment. Although the laryngeal contrast in Russian is assumed to be a [voice] contrast, very few experimental studies have looked at the acoustic properties of Russian voiced and voiceless stops. Most claims about acoustic properties of stops and phonological processes that affect them (voice assimilation and final devoicing) have been made based on impressionistic transcriptions. The present study provides evidence that (1) voicing in voiced stops is affected by speaking rate manipulation, (2) stops in Russian retain underlying voicing contrast in presonorant position and voice assimilation occurs only in obstruent clusters, and (3) phonological processes of voice assimilation and final devoicing do not result in complete neutralization. The target of the investigation is voiced and voiceless intervocalic stops, stops in clusters, and final stops in different prosodic positions within a word and at the phrase level. The acoustic cues to voicing (duration of voicing, stop closure duration, vowel duration, f0, and F1) were measured from the production data of 14 monolingual speakers of Russian recorded in Russia. Speakers produced words and phrases with target stops in three speaking rate conditions: list reading, slow rate and fast rate. The data were analyzed in 5 blocks focusing on (1) word-internal stops, (2) voice assimilation in stops in prepositions, (3) cases of so-called "sonorant transparency", (4) voice assimilation in stops before /v/, and (5) voicing processes across a word boundary. The results of the study present a challenge to the widely-held assumption that phonological processes precede phonetic processes at the phonology-phonetics interface. It is shown that the underlying contrast leaves traces on assimilated and devoiced stops. 
To account for the findings, a phonology-phonetics interface that allows interaction between the modules is required. In addition, the results show that temporal cues are affected by speaking rate manipulation, but the effect of rate on voicing is found only in voiced stops. Duration of voicing and VOT in voiceless stops are not affected by speaking rate. The results also show that C2 has no effect on voicing in C1 stops in obstruent-sonorant-obstruent clusters; thus no "phonological sonorant transparency to voice assimilation" is found in Russian. Rather, the study provides evidence that there is variation in production of voicing in stops in prepositions, and that voice assimilation in stops before /v/ followed by a voiced obstruent is optional for some speakers.
28

Engbrecht, Jeffery W. "Word hypothesis of phonetic strings using hidden Markov models /." Online version of thesis, 1990. http://hdl.handle.net/1850/10604.

29

Campos, Luana Caroline Pereira 1986. "Radialista : análise acústica da variação entoacional na fala profissional e na fala coloquial." [s.n.], 2012. http://repositorio.unicamp.br/jspui/handle/REPOSIP/271140.

Abstract:
Advisor: Plínio Almeida Barbosa
Dissertation (master's) - Universidade Estadual de Campinas, Instituto de Estudos da Linguagem
Abstract: The main purpose of this work is to study the prosodic differences between the colloquial and professional speech of a radio announcer by analyzing intonational and rhythmic parameters: tonal range, median fundamental frequency (f0), pitch accent shape and tonal alignment, as well as speech rate and pauses. The research subject was a 39-year-old male announcer at an AM radio station in Campinas, where he hosts a daily local variety program. The recordings were made in a quiet room with a professional microphone, directly to the computer through its sound card. The data were analyzed acoustically with the acoustic analysis software PRAAT (www.praat.org), focusing on f0 as the acoustic correlate of intonation. Data were collected through an informal semi-structured interview (interview), in which the experimenter asked the announcer to talk colloquially about various subjects usually covered in the media at the time of recording. During the interview, the announcer spontaneously narrated a goal scored by his soccer team; this stretch was analyzed separately as a simulation of soccer game commentary (simulation). The entire interview was orthographically transcribed, and 10 excerpts were selected and rewritten as formal text for a reading task. From each of the 10 excerpts, 5 sentences were written in the style of journalistic headlines, and the 29 headlines most similar to real radio headlines were selected for reading. Some time later, the announcer was asked to read the rewritten excerpts in a professional manner, as if he were on air. This reading was performed at his habitual speech rate (normal reading), at a fast speech rate (fast reading), and, for a part of it, imitating the professional style of soccer game narration (soccer reading).
To validate the speaking styles, an auditory perception test was conducted to check whether the announcer had performed the requested tasks effectively: listeners compared normal reading with the interview, and soccer reading with normal reading. The purpose of the study was to observe the differences and strategies that announcers use to attract listeners' attention during broadcasting. After analyzing the data, we conclude that, when fundamental frequency (f0) is similar, the announcer differentiates speech styles first by varying speech rate, and then by varying the duration of silent pauses and the rate of pitch accent production. When speech rate is similar, as in fast reading and soccer reading, he differentiates the styles first by varying median f0, and also by varying the duration of silent pauses and the pitch accent rate.
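The parameters named in this abstract (median f0, tonal range, speech rate) can be illustrated with a minimal sketch. It assumes that f0 values in Hz have already been extracted for the voiced frames of each style, for example with PRAAT, and that the syllable count and total duration are known. The function names and the numbers are hypothetical and are not taken from the thesis; expressing tonal range in semitones is also an assumption, since the abstract does not state the unit used.

```python
import math
import statistics


def tonal_range_semitones(f0_hz):
    """Span between the lowest and highest f0 value, in semitones."""
    return 12.0 * math.log2(max(f0_hz) / min(f0_hz))


def describe_style(f0_hz, n_syllables, duration_s):
    """Summarise one speaking style from voiced-frame f0 values (Hz)."""
    return {
        "median_f0_hz": statistics.median(f0_hz),
        "range_st": tonal_range_semitones(f0_hz),
        "speech_rate_syll_per_s": n_syllables / duration_s,
    }


# Made-up values for illustration only; not data from the thesis.
interview = describe_style(
    [100.0, 110.0, 120.0, 150.0, 200.0], n_syllables=40, duration_s=10.0
)
print(interview)
```

Computing the same summary for each style (interview, normal reading, fast reading, soccer reading) would give the kind of side-by-side comparison the abstract describes.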
Master's
Linguistics
Master in Linguistics
30

Vassoler, Aline Mara de Oliveira. "Aspectos acústicos da fala na representação teatral das emoções." Universidade de São Paulo, 2011. http://www.teses.usp.br/teses/disponiveis/8/8139/tde-03102011-105152/.

Abstract:
This study investigated how emotions are differentiated through acoustic aspects of speech in the theatrical representation of emotions in Brazilian Portuguese (PB), focusing on the behavior of the fundamental frequency (F0) and the intonational aspects of four emotions (happiness, anger, fear and sadness) and of neutral speech. Although the concept of emotion is difficult to define, theoretical support was sought in established authors such as SCHERER (1985), EKMAN (1984) and DESCARTES (1998), and in representatives of the Semiotics of Passions such as GREIMAS; FONTANILLE (1993). The earliest accounts of the emotions, given by Descartes (1998) under the name of passions, drew physiological inferences that were still immature but of great descriptive value. F0, vocal intensity, speech rate (LAUKKA, 1994) and spectral energy (BANSE; SCHERER, 1996) are the parameters most closely related to emotions, since they reflect the speaker's respiratory, phonatory and articulatory changes. Intonation, represented acoustically by F0 variation, is also frequently used to differentiate emotions, since it is related to the speaker's vocal expression. The speech sample was produced by three professional actresses aged between 40 and 50, with an average of 20 years in the profession. The voices were recorded at the Estúdio Multimeios laboratory of the Centro de Computação Eletrônica of the Universidade de São Paulo (CCE-USP) using the sound editing software Sound Forge 9. The actresses read the text (corpus) in neutral speech and interpreting happiness, anger, fear and sadness, with five repetitions of each interpretation and of the neutral speech, for a total of 25 recordings per actress. The theory of simulacra, as upheld by the Semiotics of Passions, was used to justify the choice of emotion portrayed by actors rather than spontaneous speech or evoked emotion.
The corpus, the text read by the actresses, had already been used by FIGUEIREDO (1993); one reason for choosing it is that it is an arid text that does not evoke emotions merely by being read. The text was segmented into smaller linguistic units (three sentences) so that the data could be analyzed more effectively. The theoretical basis for this segmentation is the Prosodic Phonology proposed by Nespor and Vogel (1986). To obtain F0 values, the sentences were further segmented into syllables. F0 values and intonation contours were obtained with the software Praat 5.1.23 and associated scripts. The data were analyzed with descriptive statistics. Actresses SL and KK interpreted happiness in the highest frequency range, followed by anger. Only actress AA interpreted fear with the highest mean F0 values, followed by happiness. In general, sadness had lower F0 values than all the other emotions, and lower than neutral speech. The behavior of F0 corroborates the literature reviewed. The study of sentence intonation in the four emotions and in neutral speech made it possible to identify the interference of emotion in the linguistic aspects of the sentence, especially for fear: declarative sentences in Brazilian Portuguese are expected to have falling contours, not rising ones as occurred with this emotion. The results contribute to the differentiation of emotions; however, more subjects need to be included and the data analyzed in greater statistical detail.
31

Pessotti, Antonio Carlos Silvano 1969. "Efeitos do treinamento e da prática vocal profissional sobre o canto e a fala." [s.n.], 2012. http://repositorio.unicamp.br/jspui/handle/REPOSIP/269058.

Abstract:
Advisor: Eleonora Cavalcante Albano
Thesis (doctorate) - Universidade Estadual de Campinas, Instituto de Estudos da Linguagem
Abstract: This study observes the effects of professional vocal training and practice, based on three hypotheses about similarities and differences between speech and singing: 1) speech differs among groups, 2) singing is similar in the two groups of singers, 3) speech and singing have similarities and differences influenced by musical education and training. The song Conselhos, by Carlos Gomes, was chosen as the corpus; its text was read and sung five times by three groups of five subjects each: solo singers (SOL), choir singers (COR) and radio broadcasters (LOC). Data obtained from the musical score were analyzed with nonparametric procedures, and acoustic data from the recordings of speech and singing (with and without accompaniment) were analyzed with parametric procedures. The nonparametric analyses showed that the musical score preserves linguistic constraints, with no loss of phonological function or linguistic relevance. The differences observed in the analysis of speech duration support the first hypothesis. These differences suggest an influence of training, a distinction of the groups with professional practice, and maintenance of the prosodic hierarchy. The gradient among groups in the analysis of speech intonation separates the singers from the broadcasters and indicates an influence of professional vocal training. The analysis of vowel space estimates highlighted the stressed vowels, which the groups use as markers of expressiveness. Intensity in speech set apart broadcasters and soloists as the groups with professional vocal practice. In speech, the soloists stand out with high formant values and tonal proximity to the score, which could explain the tendency of singers with more practice and training to read a text in a manner close to singing.
The analysis of overall duration in unaccompanied singing shows a similarity between the singers that reflects the second hypothesis, concerning musical education. Differences in the duration of the linguistic variables in singing reflect the influence of practice and training. The analysis of intonation indicated that the choir singers tie tuning to the rhythmic pulse, and that accompaniment makes this task easier. The analysis of vowel space estimates in singing showed similar areas across groups. The distinction between them appears in the stressed vowels, which the soloists centralize less and produce with higher intensity and formants. The comparison between acoustic data and score-based estimates in singing showed the soloists with intonation close to the score. The stressed vowels were investigated further in speech, in singing, and across the two modalities. In speech, the differences between groups are explained by training, since soloists and broadcasters are trained to produce a wider mouth opening. The articulatory consequences would be larynx lowering and reduction of the pharyngeal space. The investigation of stressed vowels in singing showed raised formants for the soloists, implying lip spreading (or larynx raising, a practice not recommended in singing pedagogy). The comparison between modalities indicated an influence of musical proficiency or vocal training, in search of a comfortable vocal posture. These results in speech and singing indicate a possible gestural transfer from singing to speech, and vice versa, as occurs in second language acquisition. This phenomenon would result from biomechanical adaptation, consistent with Gestural Phonology.
Doctorate
Linguistics
Doctor in Linguistics
32

Cattelain, Thibault. "Production des consonnes plosives du français : du contrôle des bruits de plosion." Thesis, Université Grenoble Alpes (ComUE), 2019. http://www.theses.fr/2019GREAS011/document.

Abstract:
The study of the production of stop consonants (/p/, /b/, etc.) is of particular interest for understanding the motor control of speech production, because producing these consonants requires fine coordination of the three levels of production: breathing, vocal fold vibration and articulation. The aim of my thesis is to study how respiratory, laryngeal and articulatory gestures are coordinated to control the variation of certain acoustic features of stop consonants, specifically the acoustic characteristics of their bursts (intensity, duration, spectrum), which are crucial for stop consonant intelligibility. An important part of the thesis also addresses the muscular control of this gestural coordination, in particular of lip gestures in the production of bilabial stops. These goals required preliminary methodological work to compare, develop and implement different techniques for measuring and estimating the articulatory effort of speech production from physiological and mechanical points of view (lip movement kinematics, force sensors, orofacial electromyography).
This exploration led to the acquisition of a large database (acoustic and physiological) of French stop consonant productions by about twenty healthy adult speakers, including 2 phonation modes (modal and whispered), 2 speech rates (normal and fast) and several levels of articulatory effort. The analysis of this database confirmed relationships already established in conversational speech between burst acoustic intensity and maximum intra-oral pressure (or lip opening velocity for labial consonants), and between spectral parameters of the burst (skewness and kurtosis) and articulatory parameters of tongue displacement for alveolar and velar consonants.
Other relationships, not described in the literature, were observed in conversational speech: 1- burst acoustic intensity increases when the degree of lip compression and lip closing velocity increase, for labial consonants; 2- burst acoustic intensity increases when the tangential velocity of the tongue elevation movement increases, for palatal consonants; 3- the degree of lip compression and the lip closing and opening velocities increase significantly when the activity of the Orbicularis Oris Superior (OOS) and Depressor Labii Inferioris (DLI) muscles increases (in the phases of the movement where they are agonists). These relationships vary with phonation mode (in whispered speech, the emphasis is on kinematic parameters at the expense of aerodynamic, articulatory and temporal ones) and with speech rate (most physiological and articulatory parameters lose their effectiveness in controlling acoustic characteristics as rate increases).
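The spectral parameters of the burst mentioned in this abstract (skewness and kurtosis) are commonly computed as moments of the power spectrum treated as a distribution over frequency. The following is a minimal sketch of that general technique, not the procedure used in the thesis, which the abstract does not specify. The function name, the input format (paired lists of frequencies and power values) and the choice of excess kurtosis (normal distribution = 0) are assumptions.

```python
import math


def spectral_moments(freqs_hz, power):
    """Return (centroid, sd, skewness, excess kurtosis) of a power spectrum.

    The power values are normalised to sum to 1 and used as weights over
    the frequency axis.
    """
    total = sum(power)
    if total <= 0:
        raise ValueError("spectrum has no energy")
    weights = [p / total for p in power]
    centroid = sum(w * f for w, f in zip(weights, freqs_hz))
    variance = sum(w * (f - centroid) ** 2 for w, f in zip(weights, freqs_hz))
    if variance == 0:
        raise ValueError("all energy is at a single frequency")
    third = sum(w * (f - centroid) ** 3 for w, f in zip(weights, freqs_hz))
    fourth = sum(w * (f - centroid) ** 4 for w, f in zip(weights, freqs_hz))
    skewness = third / variance ** 1.5
    kurtosis = fourth / variance ** 2 - 3.0  # excess kurtosis (assumption)
    return centroid, math.sqrt(variance), skewness, kurtosis


# Made-up symmetric spectrum for illustration only; not data from the thesis.
centroid, sd, skew, kurt = spectral_moments([1000.0, 2000.0, 3000.0], [1.0, 2.0, 1.0])
print(centroid, sd, skew, kurt)
```

A spectrum with energy concentrated at low frequencies and a tail toward high frequencies gives positive skewness; a sharply peaked spectrum gives higher kurtosis than a flat one.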
33

Pritchard, Sonia. "A Cross-language Study of the Production and Perception of Palatalized Consonants." Thèse, Université d'Ottawa / University of Ottawa, 2012. http://hdl.handle.net/10393/22882.

Abstract:
The goal of this dissertation was to investigate experimentally the phonetic qualities of the palatalized consonants of Standard Bulgarian. The term ‘palatalized’ refers to consonants (e.g., [tʲ, dʲ]) which are articulated with a secondary palatal gesture superimposed on the primary gesture associated with their plain counterparts (e.g., [t, d]). An acoustic study investigated the claim (Horálek, 1950; Choi, 1998; Ignateva-Tsoneva, 2008) that the palatalized consonants of Standard Bulgarian have undergone depalatalization, which was defined as the decomposition of a secondary palatal [ ʲ ] gesture into a palatal glide [j]. A cross-language comparison was performed. Russian (e.g., [tʲulʲ], ‘silk net’) and British English (e.g., [tju:lip], ‘tulip’) data served as a baseline against which the Bulgarian data (e.g., [tʲul], ‘silk net’) were evaluated. Subjects’ productions of words were recorded for acoustic analyses. The F1, F2 and F3 frequencies of the critical segments were analyzed with a Smoothing Spline ANOVA (Gu, 2002). The analyses indicated that the Bulgarian palatalized consonants were identical to the Russian palatalized consonants, but different from the consonant-palatal glide sequences of British English. It was concluded that Bulgarian palatalized consonants have not undergone depalatalization. A perception study employed two variations of the gating task (Grosjean, 1980): audio-only and audio-visual. The results of the audio-only experiment indicated that Bulgarian and Russian listeners needed only the information associated with the palatalization portion of the consonant to identify it as palatalized. Bulgarian subjects did not need the transitions with the following vowel (Tilkov, 1983) to identify a consonant as palatalized. The Russian subjects of Richey’s (2000) experiment did not need the formant transitions either to identify the secondary palatal gesture. 
These findings provide further evidence that the palatalized consonants of Standard Bulgarian have not undergone depalatalization. The purpose of the audio-visual experiment was to investigate whether Bulgarian and Russian listeners use visual information to identify palatalized consonants. The results from this experiment were not as clear-cut as those from the audio-only experiment. Factors such as insufficient visual information at earlier gates, as well as attentional load, are being considered as possible confounds. In addition, an improved methodology for an audio-visual perception study is outlined. Experimental evidence from the acoustic and perception studies points to similarities in the phonetic shape of the palatalized consonants of Bulgarian and Russian. However, the phonological distribution of these segments is very different in the respective languages. I argue against a one-to-one mapping between the phonetic and phonological representations of the Bulgarian palatalized consonants. Based on distributional evidence, I propose that at the level of phonology they consist of a sequence of /CjV/.
34

Torres, Juan Félix. "Estimation of glottal source features from the spectral envelope of the acoustic speech signal." Diss., Georgia Institute of Technology, 2010. http://hdl.handle.net/1853/34736.

Abstract:
Speech communication encompasses diverse types of information, including phonetics, affective state, voice quality, and speaker identity. From a speech production standpoint, the acoustic speech signal can be mainly divided into glottal source and vocal tract components, which play distinct roles in rendering the various types of information it contains. Most deployed speech analysis systems, however, do not explicitly represent these two components as distinct entities, as their joint estimation from the acoustic speech signal becomes an ill-defined blind deconvolution problem. Nevertheless, because of the desire to understand glottal behavior and how it relates to perceived voice quality, there has been continued interest in explicitly estimating the glottal component of the speech signal. To this end, several inverse filtering (IF) algorithms have been proposed, but they are unreliable in practice because of the blind formulation of the separation problem. In an effort to develop a method that can bypass the challenging IF process, this thesis proposes a new glottal source information extraction method that relies on supervised machine learning to transform smoothed spectral representations of speech, which are already used in some of the most widely deployed and successful speech analysis applications, into a set of glottal source features. A transformation method based on Gaussian mixture regression (GMR) is presented and compared to current IF methods in terms of feature similarity, reliability, and speaker discrimination capability on a large speech corpus, and potential representations of the spectral envelope of speech are investigated for their ability to represent glottal source variation in a predictable manner. The proposed system was found to produce glottal source features that reasonably matched their IF counterparts in many cases, while being less susceptible to spurious errors. 
The development of the proposed method entailed a study into the aspects of glottal source information that are already contained within the spectral features commonly used in speech analysis, yielding an objective assessment regarding the expected advantages of explicitly using glottal information extracted from the speech signal via currently available IF methods, versus the alternative of relying on the glottal source information that is implicitly contained in spectral envelope representations.
35

Muenz, Harald. "A portfolio of compositions with commentary." Thesis, Brunel University, 2015. http://bura.brunel.ac.uk/handle/2438/11808.

36

Rossi, Aglael Juliana Aparecida Gama. "Relações entre desenvolvimento linguistico e neuromotor : a aquisição da duração no portugues brasileiro." [s.n.], 1999. http://repositorio.unicamp.br/jspui/handle/REPOSIP/270656.

Abstract:
Advisor: Eleonora Cavalcante Albano
Thesis (doctorate) - Universidade Estadual de Campinas, Instituto de Estudos da Linguagem
Made available in DSpace on 2018-07-25. Previous issue date: 1999
Abstract: This work discusses the relationship between linguistic and neuromotor development, based on a study of the acquisition of duration in Brazilian Portuguese (BP). It starts from a literature that compares groups of children of different ages with a control group of adults on duration measures; many of these studies have shown that acoustic segment durations are longer and more variable in young children's speech than in adults' speech. The hypotheses advanced to explain this fact refer to neuromotor and statistical factors. Our work criticizes the experiments in the literature and pursues the hypothesis that, in BP, a linguistic factor, namely the degree of stress of the acoustic segment as a function of its position in the word and in the sentence, influences the difference between adults' and children's duration measures, since children do not reduce acoustic segments in unstressed positions as adults do. We ran an experiment in which sentences produced by the author were repeated three times by the subjects. The analyses compared subjects within and among three pairs, each formed by the author and one of the following: a child of 4 years and 1 month (M), a child of 4 years and 9 months (E), and their teacher (S). The acoustic segments of each sentence, corresponding to vowels, consonants, syllables and words, were demarcated for their stress position in the word (pre-stressed, stressed and post-stressed) and in the sentence (absolute beginning and end). A statistical analysis via t test compared the subjects in each pair as to the mean and standard deviation of the differences in duration for each acoustic segment type in each stress position. The hypothesis to be tested was whether the mean of the differences in duration would be statistically equal to zero. We found that the younger child (M) had already acquired the duration parameter for the production of stress, but not for the unstressed elements that take part in building the rhythmic alternations of BP. Child E's production is very close to the adult pattern for vowels, but not for consonants, syllables and words.
Concerning variability, although the children present higher standard deviation values, their duration contours for the different kinds of acoustic segments in the different stress positions do not diverge from the one obtained for the adult pair. In syllable-sized and word-sized segments, coordination with overlapping gestures was found to be greater in the adults' speech than in the children's speech, apparently leading to more coarticulation. We adopt dynamic models to explain the higher variability of the children's speech in the production of vowel and consonant segments: it is seen as reflecting lesser coordination among articulatory gestures and a lower oscillation frequency of the articulators, which hinders the production of reduced and stable segments in unstressed positions. Also in line with dynamic models, we propose that the children studied have an oscillator for macrorhythmic variations in the duration contour of BP, probably at the level of the sentence, but not for variations at the level of the syllable, as shown by the lesser coordination and coarticulation among the articulatory gestures involved in their production.
Doctorate
Doctor of Linguistics
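The paired comparison described in the abstract (testing whether the mean of the duration differences within a pair is zero) can be sketched as follows. This is a minimal illustration, not the thesis's actual procedure: the function name `paired_t_statistic` and the duration values are hypothetical, and only the t statistic and its degrees of freedom are computed.

```python
import math
import statistics


def paired_t_statistic(subject_ms, model_ms):
    """Return (t, degrees_of_freedom) for H0: mean paired difference == 0.

    Both arguments are equal-length lists of segment durations in
    milliseconds, one value per repeated token.
    """
    if len(subject_ms) != len(model_ms):
        raise ValueError("paired samples must have the same length")
    diffs = [s - m for s, m in zip(subject_ms, model_ms)]
    n = len(diffs)
    if n < 2:
        raise ValueError("at least two pairs are required")
    mean_diff = statistics.mean(diffs)
    sd_diff = statistics.stdev(diffs)  # sample standard deviation (n - 1)
    if sd_diff == 0:
        raise ValueError("all differences are identical; t is undefined")
    std_error = sd_diff / math.sqrt(n)
    return mean_diff / std_error, n - 1


# Hypothetical durations (ms) of one segment type in one stress position.
child = [210.0, 180.0, 250.0]
adult = [200.0, 160.0, 220.0]
t_value, dof = paired_t_statistic(child, adult)
```

The resulting t value would then be compared against the t distribution with `dof` degrees of freedom to decide whether the mean difference departs from zero.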
37

Berro, Jean Paulo Indrigo. "Análise fonético-acústica do plural das palavras terminadas em -ão /." Araraquara, 2018. http://hdl.handle.net/11449/154473.

Abstract:
Advisor: Luiz Carlos Cagliari
Committee member: Daniel Soares da Costa
Committee member: Luciana Mercês Ribeiro Santos
Abstract: Brazilians often hear that their language is quite homogeneous. However, a territory of 8.5 million square kilometers, with approximately 200 million inhabitants, high rates of illiteracy, and cultural plurality, could not by any means be linguistically homogeneous. The present study examines how certain speakers pronounce the nasal diphthongs -ões, -ães and -ãos, analyzing the segments [ɐ̃], [õ], [ɪ̃] and [ʊ̃] that make up these diphthongs. The focus of the research is an acoustic investigation. Once the acoustic patterns have been established, the work makes a comparative analysis between the acoustic and the articulatory patterns. The Praat program is used for the acoustic analysis. With the help of Praat, and following Fant's Acoustic Theory of Speech Production (1960), the recorded utterances were segmented and, for the relevant segments, the formant structure was investigated with the facilities provided by Praat. The aim of the research is oriented towards how the speakers pronounce the nasal diphthongs. The study also took the perception factor into consideration, since unexpected variations emerged. Phonologically, the plural form of Portuguese nasal diphthongs may be interpreted as /-awNS/.
Master's
38

Santos, Jeylla Salomé Barbosa dos. "As realizações de /R/ em coda silábica na comunidade de Porto da Rua, litoral norte de Alagoas : análise lingüística e sociolinguística." Universidade Federal de Alagoas, 2010. http://repositorio.ufal.br/handle/riufal/487.

Abstract:
In the light of the Theory of Variation and Change and of Generative Phonology, this study investigates the realization of the /R/ segment in the community of Porto da Rua (northern coast of Alagoas). The phonetic environment in which this realization occurs was determined, and the influence of extralinguistic factors was analyzed. The corpus consisted of audio-recorded speech data from spontaneous narratives produced by 48 informants, men and women born (and having always lived) in the community. The categorization of the data and the statistical analysis were done using the VARBRUL package. Data were coded according to linguistic and social factor groups (GF). The objective was thus to study the correlation between linguistic phenomena and stratified external variables (gender, age and education). The results indicated that the variant under study may be undergoing a process of linguistic change, since those responsible for its realization are the informants without schooling and those over 50 years of age.
Fundação de Amparo a Pesquisa do Estado de Alagoas
39

Prakup, Barbara L. "Acoustic measures of the voices of older singers and non-singers." [Kent, Ohio] : Kent State University, 2009. http://rave.ohiolink.edu/etdc/view?acc%5Fnum=kent1239673429.

Abstract:
Thesis (Ph.D.)--Kent State University, 2009.
Title from PDF t.p. (viewed Jan. 21, 2010). Advisor: Peter Mueller. Keywords: acoustic measures, voice, older singers, non-singers. Includes bibliographical references (p. 78-91).
40

Bekker, Ian. "The vowels of South African English / Ian Bekker." Thesis, North-West University, 2008. http://hdl.handle.net/10394/2003.

Abstract:
This thesis provides a comparative analysis of vowel quality in South African English (SAE) using the following data: firstly, the existing impressionistic literature on SAE and other relevant accents of English, the former of which is subject to a critical review; secondly, acoustic data from a similar range of accents, including new SAE data, collected and instrumentally analyzed specifically for the purposes of this research. These various data are used to position, on both a descriptive and theoretical level, the SAE vowel system. In addition, and in the service of providing a careful reconstruction of the linguistic history of this variety, it offers a three-stage koinéization model which helps, in many respects, to illuminate the respective roles played by endogenous and exogenous factors in SAE’s development. More generally, the analysis is focussed on rendering explicit the extent to which the synchronic status and diachronic development of SAE more generally, and SAE vowel quality more particularly, provides support for a number of descriptive and theoretical frameworks, including those provided in Labov (1994), Torgersen and Kerswill (2004), Trudgill (2004) and Schneider (2003; 2007). With respect to these frameworks, and based on the results of the analysis, it proposes an extension to Schneider’s (2007) Dynamic Model, shows Trudgill’s (2004) model of new-dialect formation to be inadequate in accounting for some of the SAE data, provides evidence that SAE is a possibly imminent but ‘conservative’ member of Torgersen and Kerswill’s (2004) SECS-Shift and uses SAE data to question the applicability of the SECS-Shift to FOOT-Fronting. Furthermore, this thesis provides evidence that SAE has undergone an indexically driven arrestment of the Diphthong and Southern Shifts and a subsequent and related diffusion of GenSAE values at the expense of BrSAE ones. 
Similarly, it shows that SAE’s possible participation in the SECS-Shift constitutes an effective chain-shift reversal ‘from above’. It stresses that, in order to understand such phenomena, recourse needs to be made to a theory of indexicality that takes into account the unique sociohistorical development of SAE and its speakers. Lastly, the adoption of the three-stage koinéization model mentioned above highlights the merits of considering both endogenous and exogenous factors in the historical reconstruction of new-dialect formation and, for research into SAE in particular, strengthens the case for further investigation into the possible effects of 19th-century Afrikaans/Dutch, Yiddish and north-of-England dialects on the formation of modern SAE.
Thesis (Ph.D. (English))--North-West University, Potchefstroom Campus, 2009.
41

Turnbull, Rory. "Assessing the listener-oriented account of predictability-based phonetic reduction." The Ohio State University, 2015. http://rave.ohiolink.edu/etdc/view?acc_num=osu1429796768.

42

Boyle, Molly. "Bit O’ the Auld Craic: An Acoustic Analysis of the Vowel System of the English of South Roscommon." Scholarship @ Claremont, 2017. http://scholarship.claremont.edu/scripps_theses/1022.

Abstract:
The present study aims to address the question of how vowel quality varies between rural and town-dwelling male speakers of Irish-English in South Roscommon, Ireland. Previous studies have identified four distinct varieties of Irish-English in Ireland: the Eastern, South & Western, Midland, and Northern varieties, loosely based on the political provinces of Munster, Connaught, Leinster, and Ulster. County Roscommon straddles the provinces of Connaught and Leinster, complicating the presence of phonological features associated with one of two different ‘accent regions’. The last phonological study carried out in Roscommon was by Patrick Leo Henry in 1957. While this was a promising start in assessing regional distinctions, rural ones in particular, the lack of recent studies leaves a sizeable gap that does not address modern changes in the linguistic landscape of Ireland, nor the availability of modern methods of acoustic analysis. In particular, the present study investigates the pre-nasal merging of front unrounded vowels /ɛ/ and /ɪ/, vowel centralization, and a lower /æ/, associated with the Western variety of Irish English. Factors such as supraregionalization lead to my hypothesis that rural speakers will demonstrate a higher frequency of the vowel features associated with the Western variety. To assess the frequency of certain vowel sounds, twenty participants were recorded and formant data were extracted for F1 and F2 values of the tokens. It was found that the rural speakers in Roscommon demonstrated a more prominent merger between /ɛ/ and /ɪ/, a lower [æ], and an overall trend toward centralization.
43

Carvalho, Kelly Cristiane Henschel Pobbe de. "Descrição fonético-acústica das vibrantes no português e no espanhol /." Assis : [s.n.], 2004. http://hdl.handle.net/11449/102467.

Abstract:
Advisor: Rafael Eugenio Hoyos-Andrade
Committee member: Adelaide Hercília P. Silva
Committee member: Gisele Domingos do Mar
Committee member: Mirian Therezinha da Matta Machado
Committee member: Zilda Maria Zapparoli Castro Melo
Abstract: This dissertation observes and contrasts, from an acoustic point of view, the realizations of the so-called vibrant consonants (trills and taps) in Portuguese and in Spanish, in the different phonetic contexts in which they appear in both languages. The analysis was carried out with Multi-Speech, a speech analysis program for Windows produced by Kay Elemetrics, which provided the waveforms and spectrograms needed for an acoustic description of the selected sounds and, from it, a contrastive description of the "r-type" consonants. The study was limited to the Portuguese spoken in the Assis region (interior of São Paulo, Brazil) and to the Spanish spoken in Bogotá (Colombia). The data were recorded in an acoustically isolated room with a professional cassette recorder, in the Language Laboratory of the Faculdade de Ciências e Letras de Assis (UNESP). Although this study has a primarily descriptive character, it may eventually help those devoted to the teaching and learning of Portuguese and Spanish as foreign languages, since it provides relevant contrastive information about the phonetic component of the two languages with regard to trills and taps.
Doctorate
44

Almalki, Hussain. "Acoustic Investigation of Production of Clusters by Saudi Second Language Learners of English." FIU Digital Commons, 2014. http://digitalcommons.fiu.edu/etd/1235.

Abstract:
Production errors made by second language (L2) learners of English have been attributed to markedness, L1 transfer or input frequency (cf. Major, 2001; Edwards & Zampini, 2008; Baptista, Rauber, & Watkins, 2009). This thesis examines the production, by two groups of Saudi Arabian L2 English learners, of 17 English initial consonant clusters (e.g., /pr/ in “pray”) that stand in a markedness relationship: clusters with a greater sonority distance between the first and second consonants are unmarked, and clusters with a smaller sonority distance are marked. It also explores the effect of input frequency and L1 transfer. Participants were asked to read 60 sentences, and their reading was recorded for acoustic analysis. Analysis showed that “prothesis” was always used to simplify the clusters, and that the duration of the prothetic vowel tended to get longer as clusters became more marked. Intermediate participants had a greater degree of difficulty in producing the clusters and tended to insert a longer prothetic vowel in general. Markedness explained the performance on #sC clusters; however, performance on non-#sC clusters was best explained by L1 transfer. Results further indicated that input frequency was irrelevant to this study.
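The markedness relationship described in the abstract can be illustrated with a small sketch. The five-step sonority scale and the plain-letter consonant symbols below are assumptions chosen for illustration (a common textbook ranking), not necessarily the scale or the 17 clusters used in the thesis, and the name `sonority_distance` is hypothetical.

```python
# Assumed sonority scale: stops < fricatives < nasals < liquids < glides.
SONORITY = {
    **dict.fromkeys("ptkbdg", 1),  # stops
    **dict.fromkeys("fsvz", 2),    # fricatives
    **dict.fromkeys("mn", 3),      # nasals
    **dict.fromkeys("lr", 4),      # liquids
    **dict.fromkeys("wj", 5),      # glides
}


def sonority_distance(cluster):
    """Sonority of the second consonant minus that of the first."""
    first, second = cluster
    return SONORITY[second] - SONORITY[first]


# Larger distance = less marked, so sort from least to most marked.
clusters = ["st", "pr", "sn", "fl"]
least_to_most_marked = sorted(clusters, key=sonority_distance, reverse=True)
```

Under this assumed scale, #sC clusters such as "st" have a negative distance (a sonority reversal), which is one reason they are often treated separately from other onsets.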
45

Meneses, Francisco 1986. "As vogais desvozeadas no Português Brasileiro = investigação acústico-articulatória = Devoiced vowels in Brazilian portuguese : an acoustic-articulatory investigation." [s.n.], 2012. http://repositorio.unicamp.br/jspui/handle/REPOSIP/270636.

Abstract:
Advisor: Eleonora Cavalcante Albano
Dissertation (master's) - Universidade Estadual de Campinas, Instituto de Estudos da Linguagem
Made available in DSpace on 2018-08-21. Previous issue date: 2012
Abstract: This work examines the production of devoiced vowels in Brazilian Portuguese (BP). The goal is to show, from a dynamic point of view, that there are vowel traces in the signal of so-called devoiced syllables, a fact which undermines the hypothesis of vowel syncope. In addition, we seek a theoretical synthesis of the acoustic cues found in order to relate them to devoiced vowel articulation. To this end, an acoustic-articulatory analysis was performed on recordings of read carrier sentences. The sentences were read by six female subjects from Vitória da Conquista - BA, without speech or hearing complaints. The recordings were made in an acoustic booth, using a digital recorder. The instrumental analysis was performed using the PRAAT software. The following measurements were made: the duration of the syllable and of its fricative noise, the centroid of the noise, and the formant centralization ratio (FCR) and vowel space area (VSA) of vowels in devoicing context. The results show gradient devoicing, manifested in three ways: both the duration measurements and the first spectral moment measurements showed that remnants of the vowel gesture persist in the fricative noise; FCR and VSA data show that vowels undergo a great reduction in magnitude in the devoicing context. The results point to an effect of gesture overlap rather than vowel syncope. This hypothesis would not fit into traditional phonological descriptions, which usually deal with symbolic operations. The phenomenon under study can thus be illuminated by Gestural Phonology (BROWMAN AND GOLDSTEIN, 1992; BALL AND KENT, 1997; ALBANO, 2001), which, unlike traditional representations, is able to express gradient realizations, as it successfully incorporates the factors of timing and magnitude, directly related to the idea of articulator movement.
Master's
Linguistics
Master in Linguistics
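The three measures named in the abstract (noise centroid, FCR and VSA) can be sketched as below. This is an illustration under stated assumptions, not the thesis's procedure: FCR and VSA follow the widely used three-corner-vowel definitions based on /i/, /a/ and /u/, whereas the thesis may have used a different vowel set; the function names and formant values are hypothetical, and the centroid is computed from an already estimated power spectrum.

```python
def formant_centralization_ratio(corners):
    """FCR = (F2u + F2a + F1i + F1u) / (F2i + F1a); rises with centralization."""
    (f1_i, f2_i), (f1_a, f2_a), (f1_u, f2_u) = (
        corners["i"], corners["a"], corners["u"])
    return (f2_u + f2_a + f1_i + f1_u) / (f2_i + f1_a)


def vowel_space_area(corners):
    """Area (Hz^2) of the /i a u/ triangle in the F1-F2 plane."""
    (f1_i, f2_i), (f1_a, f2_a), (f1_u, f2_u) = (
        corners["i"], corners["a"], corners["u"])
    return 0.5 * abs(
        f1_i * (f2_a - f2_u) + f1_a * (f2_u - f2_i) + f1_u * (f2_i - f2_a))


def spectral_centroid(freqs_hz, power):
    """First spectral moment: power-weighted mean frequency."""
    total = sum(power)
    if total == 0:
        raise ValueError("spectrum has no energy")
    return sum(f * p for f, p in zip(freqs_hz, power)) / total


# Hypothetical mean (F1, F2) values in Hz for the three corner vowels.
corners = {"i": (300.0, 2300.0), "a": (750.0, 1300.0), "u": (350.0, 800.0)}
fcr = formant_centralization_ratio(corners)
vsa = vowel_space_area(corners)
```

A reduced vowel gesture would be expected to push FCR up and VSA down relative to the same vowels in a voiced context.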
46

Ramírez, Cruz María Manjavacas. "Estudio acústico de las vocales del español a partir de grabaciones de habla infantil." Doctoral thesis, Universitat de Barcelona, 2016. http://hdl.handle.net/10803/401094.

Abstract:
Within the small body of literature on vowel sounds at early ages, very few studies have undertaken to record and phonetically transcribe these segments across the first three years of life, and fewer still have analysed their first two formants acoustically. The reasons are the difficulty of obtaining the data, the difficulty of transcribing speech samples from such young subjects, and the complexity of acoustically examining utterances that show the peculiarities of very young children's speech. The scarcity of such studies in the international literature and their absence for Spanish (at least over so long a study period) motivated this doctoral thesis, whose main objectives are the acoustic analysis of the first two formants of isolated vowel productions by two Castilian Spanish-speaking children between 0;4 and 3;0 years, and the phonetic transcription of those productions. As for the methodology, the spontaneous productions of a boy and a girl were recorded weekly between 0;4 and 3;0 years in different family settings. The vowel sounds that met previously established criteria were then extracted, spectrograms were made, the first two formants of each vowel were measured, and formant charts were created to locate the signals in the vowel space. Next, five adult native speakers of Spanish took a perception test in which they classified the sounds they heard according to the vowel categories of Spanish. Finally, statistical analyses were carried out to determine variation in formant frequencies. The results were as follows: 1. The values of the first two formants of the vowels analysed in infants and young children are far higher than those of adults. 2. The most frequent vowels in the first months of life are mid and low central vowels and front vowels. 3. In the first months of life, vowels overlap in the vowel space. 4. Some vowel sounds can be articulated shortly after birth, but others, such as [u], require anatomical maturation and improved motor control. 5. Despite the inverse relation established by Fant (1960) between vocal tract length and formant frequencies, formant values do not fall significantly during the first year. 6. Although children acquiring the same target language have similar vowel inventories, individual differences can be found. The conclusions reached were as follows: 1. Infants' formant values are far higher than those of adults. 2. Most vowel sounds produced by an infant in the first months of life are mid and low central vowels and front vowels. 3. Among the back vowels, [u] is the last to appear, and it does not do so until the infant is at least six months old. 4. Infants' vowel productions are determined, at least in part, by the ambient language in which they are immersed. 5. Dispersion fields formed around all the vowels analysed, because the children articulated the same vowel with small differences in formant values. 6. The variability of formant values within a vowel may be because the infant's vocal tract is not yet configured like that of an adult. 7. The values of the first two formants do not change significantly during the first year of life. 8. There is inter-individual variation both in the number of vowels and in their vowel quality.
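The inverse relation between vocal tract length and formant frequencies mentioned in the abstract can be illustrated with the textbook idealisation of the vocal tract as a uniform tube closed at the glottis and open at the lips, whose resonances are F_n = (2n - 1)c / (4L). The sketch below is only an illustration of that idealisation, not the method used in the thesis. The speed of sound (350 m/s), the tract lengths (0.08 m for an infant, 0.17 m for an adult) and the function name `tube_resonances` are assumptions chosen for the example.

```python
# Assumed speed of sound in warm, humid air (illustrative value, not from the thesis).
SPEED_OF_SOUND_M_S = 350.0


def tube_resonances(length_m, n_formants=2, c=SPEED_OF_SOUND_M_S):
    """Resonances in Hz of a uniform tube closed at one end and open at the other.

    Uses the quarter-wavelength formula F_n = (2n - 1) * c / (4 * L).
    """
    if length_m <= 0:
        raise ValueError("length_m must be positive")
    return [(2 * n - 1) * c / (4.0 * length_m) for n in range(1, n_formants + 1)]


# Hypothetical tract lengths, chosen only to show the direction of the effect.
infant_formants = tube_resonances(0.08)
adult_formants = tube_resonances(0.17)

print("infant F1, F2 (Hz):", [round(f) for f in infant_formants])
print("adult  F1, F2 (Hz):", [round(f) for f in adult_formants])
```

Under these assumptions the shorter tube yields higher resonances, which is the direction of the infant versus adult difference the abstract reports. Real vocal tracts are not uniform tubes, so the numbers are indicative only.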
47

Carvalho, Kelly Cristiane Henschel Pobbe de [UNESP]. "Descrição fonético-acústica das vibrantes no português e no espanhol." Universidade Estadual Paulista (UNESP), 2004. http://hdl.handle.net/11449/102467.

Abstract:
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)
This work observes and contrasts, from an acoustic point of view, the realizations of the so-called vibrant consonants (trills and taps) in Portuguese and Spanish across the different phonetic contexts in which they occur. The analysis used Multi-Speech, a speech analysis program for Windows produced by Kay Elemetrics, which provided the waveforms and spectrograms needed for the acoustic description of the selected sounds and for a contrastive description of the r-type consonants. The recordings were made in an acoustically isolated room with a professional cassette recorder, at the Language Laboratory of the Faculdade de Ciências e Letras de Assis (UNESP), with speakers from the Assis region (inland São Paulo state, Brazil) and from the city of Bogotá (Colombia). Although the study is primarily descriptive, it may help those engaged in teaching and learning Portuguese and Spanish as foreign languages, since it provides relevant contrastive information about the sound systems of the two languages with respect to vibrant consonants.
48

Demasi, Rita de Cássia Benevides. "A ditongação nasal no português brasileiro: uma análise acústico-aerodinâmica da fala." Universidade de São Paulo, 2010. http://www.teses.usp.br/teses/disponiveis/8/8139/tde-15032010-123909/.

Abstract:
Studies characterizing the acoustics of nasal vowels are numerous, but there are few studies of nasal diphthongization. This phenomenon emerges from the constellation of articulatory gestures, which can be observed through acoustic and aerodynamic parameters. The aim of this work is to analyse the gestural configuration between tongue movement and the opening and closing gesture of the velum during the production of nasal diphthongs in Brazilian Portuguese. We show the effects of coarticulation on the acoustic output and how it is configured, based on recordings of acoustic and aerodynamic data. The material was recorded with the EVA Portable 2 device (EVA Portátil 2), which allowed the acoustic output and the airflow data to be recorded simultaneously. The corpus consists of twenty diphthongs, ten oral and ten nasal (ten back and ten front), all attested in dictionaries: [p@w, s@w, m@w, k@w, t@w, p@)w), s@)w), m@)w), k@)w), t@)w), dej, sej, frej, hej, lej, te)j), se)j), be)j), a.mej), a.le)j)]. The words were inserted in the carrier sentence "Digo _____ cada dia" [dZi.gU__ ka.d5 dZi5], which was repeated three times by six speakers (three men and three women) of the Paulistano dialect, giving 360 tokens (3 × 6 × 20). As a population control, a second carrier sentence, "Digo ____ todo dia" [dZi.gU__ to.dT dZi5], was repeated by one third of the subjects, one from each group, giving 120 tokens (3 × 2 × 20). Visual inspection was carried out with the Signal Explorer and Phonédit software. The aerodynamic parameters analysed were the shape of the oral and nasal airflow, the maximum rate of nasalization, and the duration of nasal airflow. The acoustic parameters were the movement and configuration of the formants, the values of F0, F1, F2 and F3 extracted for all segments, and the duration of each part of the nasal diphthong: the vowel, the glide and the nasal appendix. The mean, the standard deviation and the ANOVA were computed in Excel, and the formant scatter plots were made with Formant Explorer. The rate of nasalization varied between the sexes (reported as p > 0,5). Women's formant frequencies were higher than men's, and the dispersion of their values, especially for the nasal glide, was more marked. These differences reflect the physiological differences between the groups. The maximum rate of nasal airflow differed significantly between front and back nasal diphthongs (reported as p > 0,5; front > back according to the Portuguese abstract, although the English abstract states the reverse). Acoustically, the formant transitions depend on the syllabic context. The same is not true of the nasal airflow trace, whose contour pattern is independent of syllabic articulation, although the rate of nasalization depends on the articulation. We conclude that there is an aerodynamic pattern resulting from the synchrony of velum and tongue movement, which produces three distinct acoustic phases: nasal vowel, nasal glide and nasal appendix. In 87% of cases, the standard contour of the nasal airflow showed three distinct phases: the first flat, the second a sharp peak, and the third an abrupt drop. We therefore conclude that nasal diphthongs have articulatory, acoustic and aerodynamic dynamics different from those of non-nasalized diphthongs. These patterns reflect adequate control of the variables of the phonetic-phonological system, and the set of articulations together gives rise to a single percept.
49

Martins, Marcus Vinicíus Moreira. "Alvos tonais: unidades fonético-fonológicas da entoação." Universidade de São Paulo, 2017. http://www.teses.usp.br/teses/disponiveis/8/8142/tde-15052018-104642/.

Abstract:
The main purpose of this work is to develop our hypothesis about tonal targets, which would be phonetic-phonological units responsible for mediating between the representational level of intonation and the physical level of F0 implementation. Tonal targets were divided into two major categories, topological and punctual. The former occur within a limited space, which Ferreira-Netto (2008) calls the midtone (tom médio). The boundaries of this space are defined by the tonal differentiation thresholds (TDT), located at +3 and -4 semitones from the midtone. Beyond these thresholds lie the Focus/Emphasis frequency bands. In these bands, tones become relevant events for listeners, who can attribute meanings to them. We call the tones that occur in this region punctual, since they are specific events. To test this hypothesis we applied test 1, in which participants were asked to repeat a pre-recorded sentence with a marked intonational division: in the first part a male voice spoke a declarative phrase, and in the second part a female voice spoke an interrogative phrase. From a total of 15 participants we obtained 24 samples, counting intra-subject repetitions. The analysis was conducted in two stages. In the first, we evaluated the speaker's ability to detect the topological tonal target underlying the first part of the stimulus and to reproduce it. In the second, we evaluated the speaker's ability to detect the punctual target characterized by the interrogative and to implement it in his or her own speech. The first condition was analysed by means of what we call the relationship index (ir), which measures the degree of correlation between the stimulus and the speaker's repetition. The analysis revealed that participants performed the task with great acuity, which suggests that speakers are able to monitor the implementation of fundamental frequency on the basis of the detection of topological targets. The second analysis, on the other hand, shows that the implementation of punctual targets can be random to some extent, since it does not need to respect a specific limit, only a threshold. In the second part of the work we applied a similar method to the analysis of acted emotional speech in three conditions: anger, sadness and neutral. The material was an excerpt from a science book read in these three conditions by professional actresses. The analysis, using hypothesis tests (n = 196, p < 0,005), revealed that the topological targets differed between conditions, which suggests that the intonational space and the frequency variation within it can be a significant cue for distinguishing emotional speech.
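The thresholds stated in the abstract (+3 and -4 semitones from the midtone) can be made concrete with the standard conversion from a frequency ratio to semitones, 12 · log2(f / f_ref). The sketch below is a minimal illustration under stated assumptions, not the procedure used in the thesis: the abstract does not say how the midtone is computed, so it is taken here as a given input in Hz, and the function names `semitones_from` and `classify_tone` are hypothetical.

```python
import math

# Tonal differentiation thresholds as stated in the abstract.
UPPER_THRESHOLD_ST = 3.0   # +3 semitones above the midtone
LOWER_THRESHOLD_ST = -4.0  # -4 semitones below the midtone


def semitones_from(f0_hz, midtone_hz):
    """Distance of f0_hz from midtone_hz in semitones (12 semitones per octave)."""
    if f0_hz <= 0 or midtone_hz <= 0:
        raise ValueError("frequencies must be positive")
    return 12.0 * math.log2(f0_hz / midtone_hz)


def classify_tone(f0_hz, midtone_hz):
    """Label a tone 'topological' inside the threshold band, else 'punctual'.

    The labels follow the abstract's two categories; treating the band edges
    as inclusive is an assumption made for this example.
    """
    distance = semitones_from(f0_hz, midtone_hz)
    if LOWER_THRESHOLD_ST <= distance <= UPPER_THRESHOLD_ST:
        return "topological"
    return "punctual"


# Hypothetical midtone of 200 Hz, used only to exercise the functions.
for f0 in (150.0, 170.0, 220.0, 260.0):
    print(f0, round(semitones_from(f0, 200.0), 2), classify_tone(f0, 200.0))
```

Because the band is asymmetric on a logarithmic scale, a tone has to fall further below the midtone than it has to rise above it before it counts as punctual under these thresholds.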
50

Reidy, Patrick F. "The spectral dynamics of voiceless sibilant fricatives in English and Japanese." The Ohio State University, 2015. http://rave.ohiolink.edu/etdc/view?acc_num=osu1430766545.
