Дисертації з теми "Interfaces vocales"
Оформте джерело за APA, MLA, Chicago, Harvard та іншими стилями
Ознайомтеся з топ-15 дисертацій для дослідження на тему "Interfaces vocales".
Біля кожної праці в переліку літератури доступна кнопка «Додати до бібліографії». Скористайтеся нею – і ми автоматично оформимо бібліографічне посилання на обрану працю в потрібному вам стилі цитування: APA, MLA, «Гарвард», «Чикаго», «Ванкувер» тощо.
Також ви можете завантажити повний текст наукової публікації у форматі «.pdf» та прочитати онлайн анотацію до роботи, якщо відповідні параметри наявні в метаданих.
Переглядайте дисертації для різних дисциплін та оформлюйте правильно вашу бібліографію.
Janer, Mestres Jordi. "Singing-driven interfaces for sound synthesizers." Doctoral thesis, Universitat Pompeu Fabra, 2008. http://hdl.handle.net/10803/7550.
Amb la present recerca, intentem relacionar la veu amb el so dels instruments musicals, tenint en compte tan la descripció del senyal de veu, com les corresponents estratègies de mapeig per un control adequat del sintetitzador.
Proposem dos enfocaments diferents, d'una banda el control d'un sintetitzador de veu cantada, i d'altra banda el control de la síntesi de sons instrumentals. Per aquest últim, suggerim una representació del senyal de veu com a gests vocals, que inclou una sèrie d'algoritmes d'anàlisis de veu. A la vegada, per demostrar els resultats obtinguts, hem desenvolupat dos prototips a temps real.
Los instrumentos musicales digitales se pueden separar en dos componentes: el interfaz de usuario y el motor de sintesis. El interfaz de usuario se ha denominado tradicionalmente controlador musical. El objectivo de esta tesis es el diseño de un interfaz que permita el control de la sintesis de sonidos instrumentales a partir de la voz cantada.
La presente investigación pretende relacionar las caracteristicas de la voz con el sonido de los instrumentos musicales, teniendo en cuenta la descripción de la señal de voz, como las correspondientes estrategias de mapeo para un control apropiado del sintetizador. Se proponen dos enfoques distintos, el control de un sintetizador de voz cantada, y el control de la sintesis de sonidos insturmentales. Para este último, se sugiere una representación de la señal de voz como gestos vocales, incluyendo varios algoritmos de analisis de voz. Los resultados obtenidos se demuestran con dos prototipos a tiempo real.
Digital musical instruments are usually decomposed in two main constituent parts: a user interface and a sound synthesis engine. The user interface is popularly referred as a musical controller, and its design is the primary objective of this dissertation. Under the title of singing-driven interfaces, we aim to design systems that allow controlling the synthesis of musical instruments sounds with the singing voice.
This dissertation searches for the relationships between the voice and the sound of musical instruments by addressing both, the voice signal description, as well as the mapping strategies for a meaningful control of the synthesized sound.
We propose two different approaches, one for controlling a singing voice synthesizer, and another for controlling the synthesis of instrumental sounds. For the latter, we suggest to represent voice signal as vocal gestures, contributing with several voice analysis methods.
To demonstrate the obtained results, we developed two real-time prototypes.
Srivastava, Brij Mohan Lal. "Anonymisation du locuteur : représentation, évaluation et garanties formelles." Thesis, Université de Lille (2018-2021), 2021. https://pepite-depot.univ-lille.fr/LIBRE/EDMADIS/2021/2021LILUB029.pdf.
Large-scale centralized storage of speech data poses severe privacy threats to the speakers. Indeed, the emergence and widespread usage of voice interfaces starting from telephone to mobile applications, and now digital assistants have enabled easier communication between the customers and the service providers. Massive speech data collection allows its users, for instance researchers, to develop tools for human convenience, like voice passwords for banking, personalized smart speakers, etc. However, centralized storage is vulnerable to cybersecurity threats which, when combined with advanced speech technologies like voice cloning, speaker recognition, and spoofing, may endow a malicious entity with the capability to re-identify speakers and breach their privacy by gaining access to their sensitive biometric characteristics, emotional states, personality attributes, pathological conditions, etc.Individuals and the members of civil society worldwide, and especially in Europe, are getting aware of this threat. With firm backing by the GDPR, several initiatives are being launched, including the publication of white papers and guidelines, to spread mass awareness and to regulate voice data so that the citizens' privacy is protected.This thesis is a timely effort to bolster such initiatives and propose solutions to remove the biometric identity of speakers from speech signals, thereby rendering them useless for re-identifying the speakers who spoke them.Besides the goal of protecting the speaker's identity from malicious access, this thesis aims to explore the solutions which do so without degrading the usefulness of speech.We present several anonymization schemes based on voice conversion methods to achieve this two-fold objective. The output of such schemes is a high-quality speech signal that is usable for publication and a variety of downstream tasks.All the schemes are subjected to a rigorous evaluation protocol which is one of the major contributions of this thesis.This protocol led to the finding that the previous approaches do not effectively protect the privacy and thereby directly inspired the VoicePrivacy initiative which is an effort to gather individuals, industry, and the scientific community to participate in building a robust anonymization scheme.We introduce a range of anonymization schemes under the purview of the VoicePrivacy initiative and empirically prove their superiority in terms of privacy protection and utility.Finally, we endeavor to remove the residual speaker identity from the anonymized speech signal using the techniques inspired by differential privacy. Such techniques provide provable analytical guarantees to the proposed anonymization schemes and open up promising perspectives for future research.In practice, the tools developed in this thesis are an essential component to build trust in any software ecosystem where voice data is stored, transmitted, processed, or published. They aim to help the organizations to comply with the rules mandated by civil governments and give a choice to individuals who wish to exercise their right to privacy
Murdoch, Michael J. "Nonverbal vocal interface /." Link to online version, 2006. https://ritdml.rit.edu/dspace/handle/1850/10346.
Hatt, Grégory. "Interface homme-machine intégrant la reconnaissance vocale et l'analyse d'image /." Sion, 2008. http://doc.rero.ch/record/12810?ln=fr.
Martin, Pierre. "C3i systeme de reconnaissance vocale du chinois moderne (chinese ideograms input interface)." Nice, 1994. http://www.theses.fr/1994NICE4809.
CARNEIRO, Maria Isabel Farias. "Abordagem multidimensional para avaliação da acessibilidade de interfaces vocais considerando a modelagem da incerteza." Universidade Federal de Campina Grande, 2014. http://dspace.sti.ufcg.edu.br:8080/jspui/handle/riufcg/1307.
Made available in DSpace on 2018-07-31T19:39:43Z (GMT). No. of bitstreams: 1 MARIA ISABEL FARIA CARNEIRO - DISSERTAÇÃO PPGCC 2014..pdf: 45568096 bytes, checksum: 7fe570750f4904224de8b7e2f76035e2 (MD5) Previous issue date: 2014-03
0 desenvolvimento de interfaces vocais [VUI - Voice User Interface) per se não é uma garantia para um processo interativo de qualidade entre usuários com deficiência visual e sistemas computacionais. Com o intuito de avaliar os problemas de acessibilidade em VUI, a presente pesquisa focalizou a proposição de uma abordagem de avaliação baseada em um conjunto de técnicas já conhecidas pela comunidade de IHC (Interação Homem-Máquina). No tocante a cada técnica utilizada, o problema foi focado a partir de diferentes perspectivas: (i) do usuário, expresso a partir das visões dos usuários sobre o produto, reunidas a partir de uma abordagem de avaliação; (ii) do especialista, expresso sob a forma de análise dos resultados dos desempenhos dos usuários em sessões de teste de acessibilidade; e (iii) da comunidade de acessibilidade, expresso com base em revisões de projeto, a fim de determinar se o projeto da interface está em conformidade com um padrão. Além disso, visando a evidenciar a incerteza associada aos julgamentos do avaliador na inspeção de conformidade do produto, incorporou-se a modelagem de incerteza, a partir da utilização de Redes Bayesianas, possibilitando ao avaliador explicitar os níveis de incerteza associados às inspeções de conformidade do produto a um padrão, por ele realizadas. A abordagem metodológica foi validada a partir de um estudo de caso envolvendo a avaliação da acessibilidade do sistema computacional DOSVOX, desenvolvido na Universidade Federal do Rio de Janeiro (UFRJ), com o objetivo de auxiliar usuários com deficiência visual no uso de sistemas computacionais. No enfoque da inspeção de conformidade, consideraram-se as partes 14 (Diálogos via menus), 17 (Diálogos via preenchimento via formulários) e 171 (Guia de acessibilidade de software) do padrão internacional ISO 9241. Por outro lado, nos enfoques da mensuração de desempenho e da sondagem da satisfação subjetiva do usuário, foram realizados testes de acessibilidade, envolvendo um universo amostrai de 100 usuários. Inicialmente, os participantes foram agrupados como cegos (40 usuários), baixa visão (20 usuários) e sem deficiência visual (40 usuários), de acordo com tipo de deficiência visual. Em seguida, eles foram classificados como principiantes (46 usuários) ou intermediários (54 usuários), de acordo com o nível de conhecimento em Informática e de experiência o produto avaliado. Os dados resultantes dos testes de acessibilidade foram processados estatisticamente, a fim de verificar a correlação entre os desempenhos dos grupos de usuários e entre o desempenho das categorias de usuários de cada grupo. O processamento estatístico dos dados evidenciou a inexistência de diferenças significativas entre os desempenhos dos grupos, bem como entre as categorias de usuários. Por outro lado, a confrontação dos resultados dos três enfoques (mensuração de desempenho do usuário, mensuração da satisfação subjetiva do usuário e inspeção de conformidade do produto a padrões) demonstrou que a abordagem de avaliação proposta produziu resultados complementares e reforçou a relevância da utilização de uma abordagem multimétodos para a avaliação de acessibilidade de interfaces vocais.
Voice interaction design per se does not provide quality assurance of the interactive process for visually impaired users. In this dissertation, a method for evaluating voice user interface (VUI) accessibility based upon a set of techniques already well-known to the HCI (Human-Computer Interaction) community is proposed. For each technique, the problem is focused from a different perspective: (i) the user's perspective, which is expressed as views on the product gathered from an inquiry-based approach; (ii) the specialist's perspective, which is expressed by the analysis of the performance results in accessibility testing sessions; and (iii) the accessibility community's perspective, which is expressed by design reviews to determine whether a user interface design conforms to standards. Additionally, Bayesian networks were used in order to make explicit the uncertainty inherent in conformity inspection processes. A case study with DOSVOX system was performed to validate the proposed approach. DOSVOX system was developed at Federal University of Rio de Janeiro (UFRJ) with the aim of helping visually impaired users use the computer. A conformity inspection was performed in accordance with parts 14 (Menu dialogues), 17 (Form-filling dialogues) 171 (Guidance on software accessibility) of ISO 9241. On the other hand, the user performance measurement and the user subjective satisfaction measurement were conducted via accessibility testing. One hundred subjects were enrolled in this study. First, they were categorized as blind (40 users), low vision (20 users) and non-visually impaired (40 users), according to their visual impairment. Second, they were grouped as novices (46 users) and intermediates (54 users), according to their knowledge level in Informatics and experience with the evaluated product. Accessibility test results were statistically analyzed in order to verify the correlation between category performances and between group performances. No statistically significant differences between the user categories or the user groups were found. On the other hand, data comparison showed that the three strategies adopted (user performance measurement, user satisfaction measurement and standard conformity inspection) add to the evaluation process, producing complimentary data that are significant to the process, and reinforcing the relevance of a multi-layered approach for the accessibility evaluation of voice user interfaces.
Chapman, Jana Lynn. "BYU Vocal Performance Database." BYU ScholarsArchive, 2010. https://scholarsarchive.byu.edu/etd/2146.
Perrotin, Olivier. "Chanter avec les mains : interfaces chironomiques pour les instruments de musique numériques." Thesis, Paris 11, 2015. http://www.theses.fr/2015PA112207/document.
This thesis deals with the real-time control of singing voice synthesis by a graphic tablet, based on the digital musical instrument Cantor Digitalis.The relevance of the graphic tablet for the intonation control is first considered, showing that the tablet provides a more precise pitch control than real voice in experimental conditions.To extend the accuracy of control to any situation, a dynamic pitch warping method for intonation correction is developed. It enables to play under the pitch perception limens preserving at the same time the musician's expressivity. Objective and perceptive evaluations validate the method efficiency.The use of new interfaces for musical expression raises the question of the modalities implied in the playing of the instrument. A third study reveals a preponderance of the visual modality over the auditive perception for the intonation control, due to the introduction of visual clues on the tablet surface. Nevertheless, this is compensated by the expressivity allowed by the interface.The writing or drawing ability acquired since early childhood enables a quick acquisition of an expert control of the instrument. An ensemble of gestures dedicated to the control of different vocal effects is suggested.Finally, an intensive practice of the instrument is made through the Chorus Digitalis ensemble, to test and promote our work. An artistic research has been conducted for the choice of the Cantor Digitalis' musical repertoire. Moreover, a visual feedback dedicated to the audience has been developed, extending the perception of the players' pitch and articulation
Dours, Daniel. "Conception d'un système multiprocesseur traitant un flot continu de données en temps réel pour la réalisation d'une interface vocale intelligente." Grenoble 2 : ANRT, 1986. http://catalogue.bnf.fr/ark:/12148/cb375972845.
Dours, Daniel. "Conception d'un systeme multiprocesseur traitant un flot continu de donnees en temps reel pour la realisation d'une interface vocale intelligente." Toulouse 3, 1986. http://www.theses.fr/1986TOU30107.
Xu, Kele. "Visualisation tridimensionnelle de la langue basée sur des séquences d'image échographique en mode-B." Thesis, Paris 6, 2016. http://www.theses.fr/2016PA066498/document.
A silent speech interface (SSI) is a system to enable speech communication with non-audible signal, that employs sensors to capture non-acoustic features for speech recognition and synthesis. Extracting robust articulatory features from such signals, however, remains a challenge. As the tongue is a major component of the vocal tract, and the most important articulator during speech production, a realistic simulation of tongue motion in 3D can provide a direct, effective visual representation of speech production. This representation could in turn be used to improve the performance of speech recognition of an SSI, or serve as a tool for speech production research and the study of articulation disorders. In this thesis, we explore a novel 3D tongue visualization framework, which combines the 2D ultrasound imaging and 3D physics-based modeling technique. Firstly, different approaches are employed to follow the motion of the tongue in the ultrasound image sequences, which can be divided into two main types of methods: speckle tracking and contour tracking. The methods to track speckles include deformation registration, optical-flow, and local invariant features-based method. Moreover, an image-based tracking re-initialization method is proposed to improve the robustness of speckle tracking. Compared to speckle tracking, the extraction of the contour of the tongue surface from ultrasound images exhibits superior performance and robustness. In this thesis, a novel contour-tracking algorithm is presented for ultrasound tongue image sequences, which can follow the motion of tongue contours over long durations with good robustness. To cope with missing segments caused by noise, or by the tongue midsagittal surface being parallel to the direction of ultrasound wave propagation, active contours with a contour-similarity constraint are introduced, which can be used to provide “prior” shape information. Experiments on synthetic data and on real 60 frame per second data from different subjects demonstrate that the proposed method gives good contour tracking for ultrasound image sequences even over durations of minutes, which can be useful in applications such as speech recognition where very long sequences must be analyzed in their entirety…
BEZERRA, Joelma de Almeida e. Silva. "O coro cênico da Universidade da Amazônia: experienciando uma identidade a partir de um repertório musical." Universidade Federal do Pará, 2015. http://repositorio.ufpa.br/jspui/handle/2011/9985.
Approved for entry into archive by Larissa Silva (larissasilva@ufpa.br) on 2018-06-11T19:52:06Z (GMT) No. of bitstreams: 2 license_rdf: 0 bytes, checksum: d41d8cd98f00b204e9800998ecf8427e (MD5) Dissertacao_CoroCenicoUniversidade.pdf: 5031387 bytes, checksum: b3b69d578b6518d94b96f0724c6f6781 (MD5)
Made available in DSpace on 2018-06-11T19:52:06Z (GMT). No. of bitstreams: 2 license_rdf: 0 bytes, checksum: d41d8cd98f00b204e9800998ecf8427e (MD5) Dissertacao_CoroCenicoUniversidade.pdf: 5031387 bytes, checksum: b3b69d578b6518d94b96f0724c6f6781 (MD5) Previous issue date: 2015-06-26
CAPES - Coordenação de Aperfeiçoamento de Pessoal de Nível Superior
O objetivo principal desta pesquisa foi o de investigar a experiência de construção de uma “identidade”, a partir de um repertório musical do Coro Cênico da UNAMA, que originou a produção da série de CDs Trilhas D’Água, constituído de repertório musical de carimbós, lundus, cantos pastoris, cordões-de-bicho, bois, acalantos e canções, uma releitura do universo musical amazônico para a linguagem do canto-coral. Questões como de que forma a mudança no repertório musical contribuiu para a construção da “identidade” do grupo, as transformações observadas nos coristas quanto ao pensamento, o comportamento e o próprio som gerado, na comunidade acadêmica que o mantém, assim como no contexto cultural em que este grupo se insere são abordadas. A dissertação é estruturada em três seções. A primeira apresenta uma biografia contextualizada do Coro Cênico da UNAMA; a segunda descreve o processo de ensino e aprendizagem do grupo e, a prática musical é analisada na terceira, considerando a esfera social na qual o grupo se insere. São utilizadas fontes bibliográficas sobre mudança musical e identidade musical, à luz da etnomusicologia. Levantamento de dados disponíveis em blogs e sites sobre a prática coral foram realizados, assim como entrevistas “semiestruturadas” e “episódicas” com a comunidade acadêmica, alunos participantes do grupo, antigos alunos, reitor e pró-reitor da instituição, compositores e a regente, considerando a história oral recontada pelos agentes que constroem e executam a prática musical de alguma forma na universidade. Parte-se do princípio de que esses agentes são as pessoas que detém as bases e as concepções do que seja a prática musical na Universidade da Amazônia.
The main goal of this research was to investigate the experience of building an “identity” from a musical repertoire of the Universidade da Amazônia’s Coro Cênico, which led to the production of the CD series “Trilhas D'Água”, consisting of a musical repertoire retelling the Amazon musical universe in the language of choral singing. The following issues are addressed: how the change in musical repertoire contributed to the construction of the group’s "identity"; the changes observed in the chorists about the thinking, behavior and the generated sound itself; the academic community that mantains it, as well as the cultural context in which this group is inserted. The dissertation is structured in three sections. The first presents a contextualized biography of Coro Cênico da UNAMA; the second describes the group’s teaching and learning process, and the musical practice is analyzed in the third, considering the social sphere in which the group operates. Bibliographical sources about musical change and identity in the light of ethnomusicology are used . Data about choir practice available in blogs and sites were collected. Moreover, "semi-structured" and "episodic" interviews with the academic community, students from the group, rector and dean of the institution, composers and conductor were done, considering the oral history retold by the agents that build and perform musical practice at the university somehow . It starts from the principle that these agents are the people who hold the basis and conceptions of what the musical practice at the University of Amazonia is about.
Vacher, Michel. "Analyse sonore et multimodale dans le domaine de l'assistance à domicile." Habilitation à diriger des recherches, Université de Grenoble, 2011. http://tel.archives-ouvertes.fr/tel-00956330.
Rouillard, José. "Hyperdialogue sur Internet." Phd thesis, Université Joseph Fourier (Grenoble), 2000. http://tel.archives-ouvertes.fr/tel-00006753.
Charbonneau, Sylvain. "L'informatisation de l'accueil téléphonique." Thèse, 2004. http://hdl.handle.net/1866/17395.