Dissertations / Theses on the topic 'Automated language translation'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the top 47 dissertations / theses for your research on the topic 'Automated language translation.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Browse dissertations / theses on a wide variety of disciplines and organise your bibliography correctly.
Marshall, Susan LaVonne. "Concept of Operations (CONOPS) for foreign language and speech translation technologies in a coalition military environment." Thesis, Monterey, Calif. : Springfield, Va. : Naval Postgraduate School ; Available from National Technical Information Service, 2005. http://library.nps.navy.mil/uhtbin/hyperion/05Mar%5FMarshall.pdf.
Full textDoni, Pracner. "Translation and Transformation of Low Level Programs." Phd thesis, Univerzitet u Novom Sadu, Prirodno-matematički fakultet u Novom Sadu, 2019. https://www.cris.uns.ac.rs/record.jsf?recordId=110184&source=NDLTD&language=en.
Full textU okviru ove teze se predstavlja pristup radu sa programima niskog nivoa koji omogućava automatsko restrukturiranje i podizanje na više nivoe. Samim tim postaje mnogo lakše razumeti logiku programa što smanjuje vreme razvoja.Proces je dizajniran tako da bude fleksibilan i sastoji se od više nezavisnih alata. Samim tim je lako menjati proces po potrebi, ali i upotrebiti razvijene alate u drugim procesima. Tipično se mogu razlikovati dva glavna koraka. Prvi je prevođenje u jezik WSL,za koji postoji veliki broj transformacija programa koje očuvavaju semantiku. Drugi su transformacije u samom WSL-u. Za potrebe prevođenja su razvijena dva alata, jedan koji radi sa podskupom x86 asemblera i drugi koji radi sa MikroJava bajtkôdom. Rezultat prevođenja je program niskog nivoa u WSL jeziku.Primarni cilj ovog istraživanja je bila potpuna automatizacija odabira transformacija, tako da i korisnici bez iskustva u radu sa sistemom mogu efikasno da primene ovaj proces za svoje potrebe. Sa druge strane zbog fleksibilnosti procesa, iskusni korisnici mogu lakoda ga prošire ili da ga integrišu u neki drugi već postojeći proces.Automatizacija je postignuta pretraživanjem usponom (eng. hill climbing).Eksperimenti vršeni na nekoliko tipova ulaznih programa niskog nivoa su pokazali da rezultati mogu biti izuzetni. Za funkciju pogodnosti je korišćena ugrađena metrika koja daje “težinu” struktura u programu. Kod ulaza za koje je originalni izvorni kôd bio dostupan, krajnje metrike najboljih varijanti prevedenih i transformisanih programa su bile na sličnom nivou. Neki primeri su bolji od originala, dok su drugi bili nešto kompleksniji. Rezultati su uvek pokazivali značajna unapređenja u odnosu na originalni kôd niskog nivoa.
Zogheib, Ali. "Automatic language translation /." Göteborg : IT-universitetet, Chalmers tekniska högskola och Göteborgs universitet, 2007. http://www.ituniv.se/w/index.php?option=com_itu_thesis&Itemid=319.
Full textDavis, Paul C. "Stone Soup Translation: The Linked Automata Model." Connect to this title online, 2002. http://rave.ohiolink.edu/etdc/view?acc%5Fnum=osu1023806593.
Full textTitle from first page of PDF file. Document formatted into pages; contains xvi, 306 p.; includes graphics. Includes abstract and vita. Advisor: Chris Brew, Dept. of Linguistics. Includes indexes. Includes bibliographical references (p. 284-293).
Clark, D. P. "Automatic translation of scene description languages." Thesis, Swansea University, 2006. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.636259.
Full textDürlich, Luise. "Automatic Recognition and Classification of Translation Errors in Human Translation." Thesis, Uppsala universitet, Institutionen för lingvistik och filologi, 2020. http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-420289.
Full textChatterjee, Rajen. "Automatic Post-Editing for Machine Translation." Doctoral thesis, Università degli studi di Trento, 2019. http://hdl.handle.net/11572/242495.
Full textHuang, X. "XTRA : The design and implementation of a fully automatic machine translation system." Thesis, University of Essex, 1987. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.379393.
Full textAverboch, Guillermo Andres. "A system for document analysis, translation, and automatic hypertext linking." Thesis, Virginia Tech, 1995. http://hdl.handle.net/10919/43809.
Full textMaster of Science
Saers, Markus. "Translation as Linear Transduction : Models and Algorithms for Efficient Learning in Statistical Machine Translation." Doctoral thesis, Uppsala universitet, Institutionen för lingvistik och filologi, 2011. http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-135704.
Full textLindgren, Anna. "Semi-Automatic Translation of Medical Terms from English to Swedish : SNOMED CT in Translation." Thesis, Linköpings universitet, Medicinsk informatik, 2011. http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-69736.
Full textDen internationella kliniska terminologin SNOMED CT har översatts från engelska till svenska under ansvar av Socialstyrelsen. Den här studien utfördes för att påvisa om semiautomatiska översättningsmetoder skulle kunna utföra tillräckligt bra översättning med färre resurser än manuell översättning. Den engelsk-svenska medicinska ordlistan TermColl användes som bas för översättning av delmängder av SNOMED CT via översättningsminne och genom statistisk översättning. Med Socialstyrelsens översättningar som referens poängsattes the semiautomatiska översättningarna via BLEU. Resultaten visade att statistisk översättning gav ett betydligt bättre resultat än översättning med översättningsminne, men över lag var resultaten alltför dåliga för att semiautomatisk översättning skulle kunna rekommenderas i detta fall.
Büchse, Matthias. "Algebraic decoder specification: coupling formal-language theory and statistical machine translation." Doctoral thesis, Saechsische Landesbibliothek- Staats- und Universitaetsbibliothek Dresden, 2015. http://nbn-resolving.de/urn:nbn:de:bsz:14-qucosa-159266.
Full textShi, Chunqi. "User-Centered Design of Translation Systems." 京都大学 (Kyoto University), 2013. http://hdl.handle.net/2433/180468.
Full textÖktem, Alp. "Incorporating prosody into neural speech processing pipelines: applications on automatic speech transcription and spoken language machine translation." Doctoral thesis, Universitat Pompeu Fabra, 2019. http://hdl.handle.net/10803/666222.
Full textEn aquesta tesi estudio la inclusió de la prosòdia en dues aplicacions que involucren la comprensió de la parla:~la transcripció automàtica de la parla i la traducció de la llengua oral. En el primer cas, proposo un mètode que utilitza un mecanisme d’atenció sobre seqüències paral·leles de característiques prosòdiques i morfosintàctiques. Els resultats indiquen una precisió de $F_1$=70.3\% en la generació de la puntuació. En el segon cas m'ocupo de la millora de la traducció de la llengua oral utilitzant la prosòdia. Un sistema neural de traducció automàtica format amb un corpus de text en el domini del cinema s’adapta amb característiques de pauses afegides utilitzant un conjunt de dades bilingües prosòdicament anotada. Els resultats mostren que la generació de puntuació prosòdica com a pas previ a la traducció augmenta la precisió de la traducció en un 1\% en termes de BLEU. La codificació de les pauses com a característica addicional encara incrementa la precisió en un altre 1\%. A més a més, amplio el sistema de traducció per a predir conjuntament les característiques de pausa i poder-les utilitzar com a entrada en un sistema de síntesi de veu.
Pereira, José Casimiro. "Natural language generation in the context of multimodal interaction in Portuguese : Data-to-text based in automatic translation." Doctoral thesis, Universidade de Aveiro, 2017. http://hdl.handle.net/10773/21767.
Full textResumo em português não disponivel
To enable the interaction by text and/or speech it is essential that we devise systems capable of translating internal data into sentences or texts that can be shown on screen or heard by users. In this context, it is essential that these natural language generation (NLG) systems provide sentences in the native languages of the users (in our case European Portuguese) and enable an easy development and integration process while providing an output that is perceived as natural. The creation of high quality NLG systems is not an easy task, even for a small domain. The main di culties arise from: classic approaches being very demanding in know-how and development time; a lack of variability in generated sentences of most generation methods; a di culty in easily accessing complete tools; shortage of resources, such as large corpora; and support being available in only a limited number of languages. The main goal of this work was to propose, develop and test a method to convert Data-to-Portuguese, which can be developed with the smallest amount possible of time and resources, but being capable of generating utterances with variability and quality. The thesis defended argues that this goal can be achieved adopting data-driven language generation { more precisely generation based in language translation { and following an Engineering Research Methodology. In this thesis, two Data2Text NLG systems are presented. They were designed to provide a way to quickly develop an NLG system which can generate sentences with good quality. The proposed systems use tools that are freely available and can be developed by people with low linguistic skills. One important characteristic is the use of statistical machine translation techniques and this approach requires only a small natural language corpora resulting in easier and cheaper development when compared to more common approaches. The main result of this thesis is the demonstration that, by following the proposed approach, it is possible to create systems capable of translating information/data into good quality sentences in Portuguese. This is done without major e ort regarding resources creation and with the common knowledge of an experienced application developer. The systems created, particularly the hybrid system, are capable of providing a good solution for problems in data to text conversion.
Mostofian, Nasrin. "A Study on Manual and Automatic Evaluation Procedures and Production of Automatic Post-editing Rules for Persian Machine Translation." Thesis, Uppsala universitet, Institutionen för lingvistik och filologi, 2017. http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-325818.
Full textQuernheim, Daniel. "Bimorphism Machine Translation." Doctoral thesis, Universitätsbibliothek Leipzig, 2017. http://nbn-resolving.de/urn:nbn:de:bsz:15-qucosa-223667.
Full textKučera, Jiří. "A Combination of Automata and Grammars." Master's thesis, Vysoké učení technické v Brně. Fakulta informačních technologií, 2013. http://www.nusl.cz/ntk/nusl-236222.
Full textPapadopoulou, Anthi. "Automatic Error Detection and Correction in Neural Machine Translation : A comparative study of Swedish to English and Greek to English." Thesis, Uppsala universitet, Institutionen för lingvistik och filologi, 2019. http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-385085.
Full textZhou, Mingjie. "Deep networks for sign language video caption." HKBU Institutional Repository, 2020. https://repository.hkbu.edu.hk/etd_oa/848.
Full textSilvestre, Cerdà Joan Albert. "Different Contributions to Cost-Effective Transcription and Translation of Video Lectures." Doctoral thesis, Universitat Politècnica de València, 2016. http://hdl.handle.net/10251/62194.
Full text[ES] Durante estos últimos años, los repositorios multimedia on-line han experimentado un gran crecimiento que les ha hecho establecerse como fuentes fundamentales de conocimiento, especialmente en el área de la educación, donde se han creado grandes repositorios de vídeo charlas educativas para complementar e incluso reemplazar los métodos de enseñanza tradicionales. No obstante, la mayoría de estas charlas no están transcritas ni traducidas debido a la ausencia de soluciones de bajo coste que sean capaces de hacerlo garantizando una calidad mínima aceptable. Soluciones de este tipo son claramente necesarias para hacer que las vídeo charlas sean más accesibles para hablantes de otras lenguas o para personas con discapacidades auditivas. Además, dichas soluciones podrían facilitar la aplicación de funciones de búsqueda y de análisis tales como clasificación, recomendación o detección de plagios, así como el desarrollo de funcionalidades educativas avanzadas, como por ejemplo la generación de resúmenes automáticos de contenidos para ayudar al estudiante a tomar apuntes. Por este motivo, el principal objetivo de esta tesis es desarrollar una solución de bajo coste capaz de transcribir y traducir vídeo charlas con un nivel de calidad razonable. Más específicamente, abordamos la integración de técnicas estado del arte de Reconocimiento del Habla Automático y Traducción Automática en grandes repositorios de vídeo charlas educativas para la generación de subtítulos multilingües de alta calidad sin requerir intervención humana y con un reducido coste computacional. Además, también exploramos los beneficios potenciales que conllevaría la explotación de la información de la que disponemos a priori sobre estos repositorios, es decir, conocimientos específicos sobre las charlas tales como el locutor, la temática o las transparencias, para crear sistemas de transcripción y traducción especializados mediante técnicas de adaptación masiva. Las soluciones propuestas en esta tesis han sido testeadas en escenarios reales llevando a cabo nombrosas evaluaciones objetivas y subjetivas, obteniendo muy buenos resultados. El principal legado de esta tesis, The transLectures-UPV Platform, ha sido liberado públicamente como software de código abierto, y, en el momento de escribir estas líneas, está sirviendo transcripciones y traducciones automáticas para diversos miles de vídeo charlas educativas en nombrosas universidades e instituciones Españolas y Europeas.
[CAT] Durant aquests darrers anys, els repositoris multimèdia on-line han experimentat un gran creixement que els ha fet consolidar-se com a fonts fonamentals de coneixement, especialment a l'àrea de l'educació, on s'han creat grans repositoris de vídeo xarrades educatives per tal de complementar o inclús reemplaçar els mètodes d'ensenyament tradicionals. No obstant això, la majoria d'aquestes xarrades no estan transcrites ni traduïdes degut a l'absència de solucions de baix cost capaces de fer-ho garantint una qualitat mínima acceptable. Solucions d'aquest tipus són clarament necessàries per a fer que les vídeo xarres siguen més accessibles per a parlants d'altres llengües o per a persones amb discapacitats auditives. A més, aquestes solucions podrien facilitar l'aplicació de funcions de cerca i d'anàlisi tals com classificació, recomanació o detecció de plagis, així com el desenvolupament de funcionalitats educatives avançades, com per exemple la generació de resums automàtics de continguts per ajudar a l'estudiant a prendre anotacions. Per aquest motiu, el principal objectiu d'aquesta tesi és desenvolupar una solució de baix cost capaç de transcriure i traduir vídeo xarrades amb un nivell de qualitat raonable. Més específicament, abordem la integració de tècniques estat de l'art de Reconeixement de la Parla Automàtic i Traducció Automàtica en grans repositoris de vídeo xarrades educatives per a la generació de subtítols multilingües d'alta qualitat sense requerir intervenció humana i amb un reduït cost computacional. A més, també explorem els beneficis potencials que comportaria l'explotació de la informació de la que disposem a priori sobre aquests repositoris, és a dir, coneixements específics sobre les xarrades tals com el locutor, la temàtica o les transparències, per a crear sistemes de transcripció i traducció especialitzats mitjançant tècniques d'adaptació massiva. Les solucions proposades en aquesta tesi han estat testejades en escenaris reals duent a terme nombroses avaluacions objectives i subjectives, obtenint molt bons resultats. El principal llegat d'aquesta tesi, The transLectures-UPV Platform, ha sigut alliberat públicament com a programari de codi obert, i, en el moment d'escriure aquestes línies, està servint transcripcions i traduccions automàtiques per a diversos milers de vídeo xarrades educatives en nombroses universitats i institucions Espanyoles i Europees.
Silvestre Cerdà, JA. (2016). Different Contributions to Cost-Effective Transcription and Translation of Video Lectures [Tesis doctoral no publicada]. Universitat Politècnica de València. https://doi.org/10.4995/Thesis/10251/62194
TESIS
Salimi, Jonni. "Machine Translation Of Fictional And Non-fictional Texts : An examination of Google Translate's accuracy on translation of fictional versus non-fictional texts." Thesis, Stockholms universitet, Engelska institutionen, 2014. http://urn.kb.se/resolve?urn=urn:nbn:se:su:diva-106670.
Full textLe, Hai Son. "Continuous space models with neural networks in natural language processing." Phd thesis, Université Paris Sud - Paris XI, 2012. http://tel.archives-ouvertes.fr/tel-00776704.
Full textKervajan, LoÏc. "Contribution à la traduction automatique français/langue des signes française (LSF) au moyen de personnages virtuels : Contribution à la génération automatique de la LSF." Thesis, Aix-Marseille 1, 2011. http://www.theses.fr/2011AIX10172.
Full textSince the law was voted the 11-02-2005 for equal rights and opportunities: places open to anyone (public places, shops, internet, etc.) should welcome the Deaf in French Sign Language (FSL). We have worked on the development of technological tools to promote LSF, especially in machine translation from written French to FSL.Our thesis begins with a presentation of knowledge on FSL (theoretical resources and ways to edit FSL) and follows by further concepts of descriptive grammar. Our working hypothesis is: FSL is a language and, therefore, machine translation is relevant.We describe the language specifications for automatic processing, based on scientific knowledge and proposals of our native FSL speaker informants. We also expose our methodology, and do present the advancement of our work in the formalization of linguistic data based on the specificities of FSL which certain (verbs scheme, adjective and adverb modification, organization of nouns, agreement patterns) require further analysis.We do present the application framework in which we worked on: the machine translation system and virtual characters animation system of France Telecom R&D.After a short avatar technology presentation, we explain our control modalities of the gesture synthesis engine through the exchange format that we developed.Finally, we conclude with an evaluation, researches and developments perspectives that could follow this thesis.Our approach has produced its first results since we have achieved our goal of running the full translation chain: from the input of a sentence in French to the realization of the corresponding sentence in FSL with a synthetic character
Ngo, Ho Anh Khoa. "Generative Probabilistic Alignment Models for Words and Subwords : a Systematic Exploration of the Limits and Potentials of Neural Parametrizations." Electronic Thesis or Diss., université Paris-Saclay, 2021. http://www.theses.fr/2021UPASG014.
Full textAlignment consists of establishing a mapping between units in a bitext, combining a text in a source language and its translation in a target language. Alignments can be computed at several levels: between documents, between sentences, between phrases, between words, or even between smaller units end when one of the languages is morphologically complex, which implies to align fragments of words (morphemes). Alignments can also be considered between more complex linguistic structures such as trees or graphs. This is a complex, under-specified task that humans accomplish with difficulty. Its automation is a notoriously difficult problem in natural language processing, historically associated with the first probabilistic word-based translation models. The design of new models for natural language processing, based on distributed representations computed by neural networks, allows us to question and revisit the computation of these alignments. This research project, therefore, aims to comprehensively understand the limitations of existing statistical alignment models and to design neural models that can be learned without supervision to overcome these drawbacks and to improve the state of art in terms of alignment accuracy
Zamora, Martínez Francisco Julián. "Aportaciones al modelado conexionista de lenguaje y su aplicación al reconocimiento de secuencias y traducción automática." Doctoral thesis, Universitat Politècnica de València, 2012. http://hdl.handle.net/10251/18066.
Full textZamora Martínez, FJ. (2012). Aportaciones al modelado conexionista de lenguaje y su aplicación al reconocimiento de secuencias y traducción automática [Tesis doctoral no publicada]. Universitat Politècnica de València. https://doi.org/10.4995/Thesis/10251/18066
Palancia
Fränne, Ellen. "Google Traduction et le texte idéologique : dans quelle mesure une traduction automatique transmet-elle le contenu idéologique d'un texte?" Thesis, Linnéuniversitetet, Institutionen för språk (SPR), 2017. http://urn.kb.se/resolve?urn=urn:nbn:se:lnu:diva-64460.
Full textAhlert, Hubert. "Um modelo não procedural de especificação e implementação voltado a sistemas transacionais em banco de dados." reponame:Biblioteca Digital de Teses e Dissertações da UFRGS, 1994. http://hdl.handle.net/10183/9006.
Full textThis Ph.D thesis presents a graphic and textual specification model for database transactions systems (ER/T+) and, also, an implementation model for this specification. Suggest a proceduralization technique for declarative specifications using a data flow dependency graph to establish a precedence relation between the diagram flows of the graphics specification language. Furthermore it presents the execution mechanism of the proposal specification language and the behavioral and structural rules for mapping the specification language into corresponding implementation language (C and SQL) constructions. Additionaly, are discussed query optimization aspects for transaction specification language and aspects of nested queries to combine various ER/T+ diagram flows into complex SQL query expressions
Kouniali, Samy Habib. "Désambigüisation de groupes nominaux complexes en conformité avec les connaissances du domaine : application a la traduction automatique." Vandoeuvre-les-Nancy, INPL, 1993. http://www.theses.fr/1993INPL096N.
Full textShibata, Danilo Picagli. "Tradução grafema-fonema para a língua portuguesa baseada em autômatos adaptativos." Universidade de São Paulo, 2008. http://www.teses.usp.br/teses/disponiveis/3/3141/tde-30052008-101100/.
Full textThis work presents a study on the use of adaptive devices for text-to-speech translation. The work focuses on the development of a grapheme-phoneme translation method for Portuguese based on Adaptive Automata and the use of this method in a text-to-speech translation software. The presented method resembles human behavior when handling syllable separation rules, syllable stress definition and influences syllables have on each other. This feature makes the method easy to use with different variations of Portuguese, since these characteristics are invariants of the language. Portuguese spoken nowadays in São Paulo, Brazil has been chosen as the target for analysis and tests in this work. The method has good results for such variation of Portuguese, reaching 95% accuracy rate for grapheme-phoneme translation, clearing the 90% mark after resolution of ambiguous cases in which different representations are accepted for a grapheme and generating phonetic output intelligible for native speakers based on concatenation synthesis using syllables as concatenation units. As final results of this work, a model is presented for grapheme-phoneme translation for Portuguese words based on Adaptive Automata, a methodology to choose the correct phonetic representation for the grapheme in ambiguous cases, a software for Adaptive Automata simulation and a software for grapheme-phoneme translation of texts using both the model of translation and methodology for disambiguation. The latter software was unified with the speech synthesizer developed by Koike et al. (2007) to create a text-to-speech translator for Portuguese. This work evidences the feasibility of text-to-speech translation for Portuguese using Adaptive Automata as the main instrument for such task.
Fehri, Héla. "Reconnaissance automatique des entités nommées arabes et leur traduction vers le français." Thesis, Besançon, 2012. http://www.theses.fr/2012BESA1031/document.
Full textThe translation of named entities (NEs) is a current research topic with regard to the proliferation of electronic documents exchanged through the Internet. So, the need to process these documents with NLP tools becomes necessary and interesting. Formal or semi-formal modeling of these NEs may intervene in both processes of recognition and translation. Indeed, it makes the accumulation of linguistic resources more reliable, limits the impact of linguistic specificities and facilitates the transformation from one representation to another. In this context, we propose a tool for the recognition and translation of Arabic NEs into French, based primarily on formal .representation and a set of transducers. This tool takes into account the integration of a module of transliteration. Its implementation was performed using the NooJ platform and the results obtained proved to be satisfactory
Menacer, Mohamed Amine. "Reconnaissance et traduction automatique de la parole de vidéos arabes et dialectales." Electronic Thesis or Diss., Université de Lorraine, 2020. http://www.theses.fr/2020LORR0157.
Full textThis research has been developed in the framework of the project AMIS (Access to Multilingual Information and opinionS). AMIS is an European project which aims to help people to understand the main idea of a video in a foreign language by generating an automatic summary of it. In this thesis, we focus on the automatic recognition and translation of the speech of Arabic and dialectal videos. The statistical approaches proposed in the literature for automatic speech recognition are language independent and they are applicable to modern standard Arabic. However, this language presents some characteristics that we need to take into consideration in order to boost the performance of the speech recognition system. Among these characteristics we can mention the absence of short vowels in the text, which makes their training by the acoustic model difficult. We proposed several approaches to acoustic and/or language modeling in order to better recognize the Arabic speech. In the Arab world, modern standard Arabic is not the mother tongue, that is why daily conversations are carried out with dialect, an Arabic inspired from modern standard Arabic, but not only. We worked on the adaptation of the speech recognition system developed for the modern standard Arabic to the Algerian dialect, which is one of the most difficult variants of the Arabic language to recognize by automatic speech recognition systems. This is mainly due to the borrowed words from other languages, the code-switching and the lack of resources. Our approach to overcome all these problems is to take advantage from oral and textual data of other languages that have an impact on the dialect in order to train the required models for dialect speech recognition. The resulting text from Arabic speech recognition system was then used for machine translation. As a starting point, we conducted a comparative study between the phrase based approach and the neural approach used in machine translation. Then, we adapted these two approaches to translate the code-switched text. Our study focused on the mix of Arabic and English in a parallel corpus extracted from official documents of the United Nations. In order to prevent the error propagation in the pipeline system, we worked on the adaptation of the vocabulary of the automatic speech recognition system and on the proposition of a new model that directly transforms a speech signal in language A into a sequence of words in another language B
Potet, Marion. "Vers l'intégration de post-éditions d'utilisateurs pour améliorer les systèmes de traduction automatiques probabilistes." Phd thesis, Université de Grenoble, 2013. http://tel.archives-ouvertes.fr/tel-00995104.
Full textNgô, Van Chan. "Formal verification of a synchronous data-flow compiler : from Signal to C." Phd thesis, Université Rennes 1, 2014. http://tel.archives-ouvertes.fr/tel-01067477.
Full textFaab, Gertrud. "A morphosyntacic description of Northern Sotho as a basis for an automated translation from Northern Sotho into English." Thesis, 2010. http://hdl.handle.net/2263/28569.
Full text- language units of Northern Sotho are identified, i.e. the tokens and words that form the language. These are sorted into word class categories (parts of speech), using the descriptions of Taljard et al. (2008) as a basis;
- the formal relationships between these units, wherever possible on the level of parts of speech, are described in the form of productive morpho-syntactic phrase grammar rules. These rules are defined within the framework of generative grammar.
Thesis (PhD)--University of Pretoria, 2010.
African Languages
unrestricted
Nemutamvuni, Mulalo Edward. "Investigating the effectiveness of available tools for translating into tshiVenda." Diss., 2018. http://hdl.handle.net/10500/25563.
Full textAbstracts in English and Venda
This study has investigated the effectiveness of available tools used for translating from English into Tshivenḓa and vice versa with the aim to investigate and determine the effectiveness of these tools. This study dealt with the problem of lack of effective translation tools used to translate between English and Tshivenḓa. Tshivenḓa is one of South Africa’s minority languages. Its (Tshivenḓa) lack of effective translation tools negatively affects language practitioners’ work. This situation is perilous for translation quality assurance. Translation tools, both computer technology and non-computer technology tools abound for developed languages such as English, French and others. Based on the results of this research project, the researcher did make recommendations that could remedy the situation. South Africa is a democratic country that has a number of language-related policies. This then creates a conducive context for stakeholders with language passion to fully develop Tshivenḓa language in all dimensions. The fact is that all languages have evolved and they were all underdeveloped. This vividly shows that Tshivenḓa language development is also possible just like Afrikaans, which never existed on earth before 1652. It (Afrikaans) has evolved and overtaken all indigenous South African languages. This study did review the literature regarding translation and translation tools. The literature was obtained from both published and unpublished sources. The study has used mixed methods research, i.e. quantitative and qualitative research methods. These methods successfully complemented each other throughout the entire research. Data were gathered through questionnaires and interviews wherein both open and closed-ended questions were employed. Both purposive/judgemental and snowball (chain) sampling have been applied in this study. Data analysis was addressed through a combination of methods owing to the nature of mixed methods research. Guided by analytic comparison approach when grouping together related data during data analysis and presentation, both statistical and textual analyses have been vital in this study. Themes were constructed to lucidly present the gathered data. At the last chapters, the researcher discussed the findings and evaluated the entire research before making recommendations and conclusion.
Iyi ṱhoḓisiso yo ita tsedzuluso nga ha kushumele kwa zwishumiswa zwi re hone zwine zwa shumiswa u pindulela u bva kha luambo lwa English u ya kha Tshivenḓa na u bva kha Tshivenḓa u ya kha English ndivho I ya u sedzulusa na u lavhelesa kushumele kwa izwi zwishumiswa uri zwi a thusa naa. Ino ṱhoḓisiso yo shumana na thaidzo ya ṱhahelelo ya zwishumiswa zwa u pindulela zwine zwa shumiswa musi hu tshi pindulelwa vhukati ha English na Tshivenḓa. Tshivenḓa ndi luṅwe lwa nyambo dza Afrika Tshipembe dzine dza ambiwa nga vhathu vha si vhanzhi. U shaea ha zwishumiswa zwa u pindulela zwine zwa shuma nga nḓila I thusaho zwi kwama mushumo wa vhashumi vha zwa nyambo nga nḓila I si yavhuḓi. Iyi nyimele I na mulingo u kwamaho khwaḽithi ya zwo pindulelwaho. Zwishumiswa zwa u pindulela, zwa thekhnoḽodzhi ya khomphiyutha na zwi sa shumisi thekhnoḽodzhi ya khomphiyutha zwo ḓalesa kha nyambo dzo bvelelaho u tou fana na kha English, French na dziṅwe. Zwo sendeka kha mvelelo dza ino thandela ya ṱhoḓisiso, muṱoḓisisi o ita themendelo dzine dza nga fhelisa thaidzo ya nyimele. Afrika Tshipembe ndi shango ḽa demokirasi ḽine ḽa vha na mbekanyamaitele dzo vhalaho nga ha dzinyambo. Izwi zwi ita uri hu vhe na nyimele ine vhafaramikovhe vhane vha funesa nyambo vha kone u bveledza Tshivenḓa kha masia oṱhe. Zwavhukuma ndi zwa uri nyambo dzoṱhe dzi na mathomo nahone dzoṱhe dzo vha dzi songo bvelela. Izwi zwi ita uri zwi vhe khagala uri luambo lwa Tshivenḓa na lwone lu nga bveledzwa u tou fana na luambo lwa Afrikaans lwe lwa vha lu si ho ḽifhasini phanḓa ha ṅwaha wa 1652. Ulu luambo (Afrikaans) lwo vha hone shangoni lwa mbo bveledzwa lwa fhira nyambo dzoṱhe dza fhano hayani Afrika Tshipembe. Kha ino ṱhoḓisiso ho vhaliwa maṅwalwa ane a amba nga ha u pindulela na nga ha zwishumiswa zwa u pindulela. Maṅwalwa e a vhalwa o wanala kha zwiko zwo kanḓiswaho na zwiko zwi songo kanḓiswaho. Ino ṱhoḓisiso yo shumisa ngona dza ṱhoḓisiso dzo ṱanganyiswaho, idzo ngona ndi khwanthithethivi na khwaḽithethivi. Idzi ngona dzo shumisana zwavhuḓisa kha ṱhoḓisiso yoṱhe. Data yo kuvhanganywa hu tshi khou shumiswa dzimbudziso na u tou vhudzisa hune afho ho shumiswa mbudziso dzo vuleaho na dzo valeaho. Ngona dza u nanga sambula muṱoḓisisi o shumisa khaṱulo yawe uri ndi nnyi ane a nga vha a na data yo teaho na u humbela vhavhudziswa uri vha bule vhaṅwe vhathu vha re na data yo teaho ino ṱhoḓisiso. viii Tsenguluso ya data ho ṱanganyiswa ngona dza u sengulusa zwo itiswa ngauri ṱhoḓisiso ino yo ṱanganyisa ngona dza u ita ṱhoḓisiso. Sumbanḓila ho shumiswa tsenguluso ya mbambedzo kha u sengulusa data. Data ine ya fana yo vhewa fhethu huthihi musi hu tshi khou senguluswa na u vhiga. Tsenguluso I shumisaho mbalo/tshivhalo (khwanthithethivi) na I shumisaho maipfi kha ino ngudo dzo shumiswa. Ho vhumbiwa dziṱhoho u itela u ṱana data ye ya kuvhanganywa. Ngei kha ndima dza u fhedza, muṱodisisi o rera nga ha mawanwa, o ṱhaṱhuvha ṱhoḓisiso yoṱhe phanḓa ha u ita themendelo na u vhina.
African Languages
M.A. (African Languages)
Wang, Lih-der, and 王立德. "The Implementation of Automated Translator from Programming Language to Timed Automata." Thesis, 2004. http://ndltd.ncl.edu.tw/handle/y8d82v.
Full text國立臺灣大學
電機工程學研究所
92
General purpose programming languages are rich in functionalities and therefore have complicated structures. Hence verifying software programs written in such programming languages can be very difficult. Even if the technical specifications had been verified to be error-free, there may still be errors introduced by the actual implementation. It is important that the verification is performed on the final program. Based on the work flow, a program can be considered as a real-time system with many processes, where each process is represented by a fragment of the original program. According to the grammars of the specific programming language, an automated translator can be used to translate the programs into corresponding formal verification models. The whole program can then be verified by applying the standard verification techniques to each model individually. The automation of translation can simplify the task of creating a formal model and identify potential errors in the implementation. We will present our implementation of a program that translates basic C programs into timed automata described by Red.
"Semi-automatic grammar induction for bidirectional machine translation." 2002. http://library.cuhk.edu.hk/record=b5895983.
Full textThesis (M.Phil.)--Chinese University of Hong Kong, 2002.
Includes bibliographical references (leaves 137-143).
Abstracts in English and Chinese.
Chapter 1 --- Introduction --- p.1
Chapter 1.1 --- Objectives --- p.3
Chapter 1.2 --- Thesis Outline --- p.5
Chapter 2 --- Background in Natural Language Understanding --- p.6
Chapter 2.1 --- Rule-based Approaches --- p.7
Chapter 2.2 --- Corpus-based Approaches --- p.8
Chapter 2.2.1 --- Stochastic Approaches --- p.8
Chapter 2.2.2 --- Phrase-spotting Approaches --- p.9
Chapter 2.3 --- The ATIS Domain --- p.10
Chapter 2.3.1 --- Chinese Corpus Preparation --- p.11
Chapter 3 --- Semi-automatic Grammar Induction - Baseline Approach --- p.13
Chapter 3.1 --- Background in Grammar Induction --- p.13
Chapter 3.1.1 --- Simulated Annealing --- p.14
Chapter 3.1.2 --- Bayesian Grammar Induction --- p.14
Chapter 3.1.3 --- Probabilistic Grammar Acquisition --- p.15
Chapter 3.2 --- Semi-automatic Grammar Induction 一 Baseline Approach --- p.16
Chapter 3.2.1 --- Spatial Clustering --- p.16
Chapter 3.2.2 --- Temporal Clustering --- p.18
Chapter 3.2.3 --- Post-processing --- p.19
Chapter 3.2.4 --- Four Aspects for Enhancements --- p.20
Chapter 3.3 --- Chapter Summary --- p.22
Chapter 4 --- Semi-automatic Grammar Induction - Enhanced Approach --- p.23
Chapter 4.1 --- Evaluating Induced Grammars --- p.24
Chapter 4.2 --- Stopping Criterion --- p.26
Chapter 4.2.1 --- Cross-checking with Recall Values --- p.29
Chapter 4.3 --- Improvements on Temporal Clustering --- p.32
Chapter 4.3.1 --- Evaluation --- p.39
Chapter 4.4 --- Improvements on Spatial Clustering --- p.46
Chapter 4.4.1 --- Distance Measures --- p.48
Chapter 4.4.2 --- Evaluation --- p.57
Chapter 4.5 --- Enhancements based on Intelligent Selection --- p.62
Chapter 4.5.1 --- Informed Selection between Spatial Clustering and Tem- poral Clustering --- p.62
Chapter 4.5.2 --- Selecting the Number of Clusters Per Iteration --- p.64
Chapter 4.5.3 --- An Example for Intelligent Selection --- p.64
Chapter 4.5.4 --- Evaluation --- p.68
Chapter 4.6 --- Chapter Summary --- p.71
Chapter 5 --- Bidirectional Machine Translation using Induced Grammars ´ؤBaseline Approach --- p.73
Chapter 5.1 --- Background in Machine Translation --- p.75
Chapter 5.1.1 --- Rule-based Machine Translation --- p.75
Chapter 5.1.2 --- Statistical Machine Translation --- p.76
Chapter 5.1.3 --- Knowledge-based Machine Translation --- p.77
Chapter 5.1.4 --- Example-based Machine Translation --- p.78
Chapter 5.1.5 --- Evaluation --- p.79
Chapter 5.2 --- Baseline Configuration on Bidirectional Machine Translation System --- p.84
Chapter 5.2.1 --- Bilingual Dictionary --- p.84
Chapter 5.2.2 --- Concept Alignments --- p.85
Chapter 5.2.3 --- Translation Process --- p.89
Chapter 5.2.4 --- Two Aspects for Enhancements --- p.90
Chapter 5.3 --- Chapter Summary --- p.91
Chapter 6 --- Bidirectional Machine Translation ´ؤ Enhanced Approach --- p.92
Chapter 6.1 --- Concept Alignments --- p.93
Chapter 6.1.1 --- Enhanced Alignment Scheme --- p.95
Chapter 6.1.2 --- Experiment --- p.97
Chapter 6.2 --- Grammar Checker --- p.100
Chapter 6.2.1 --- Components for Grammar Checking --- p.101
Chapter 6.3 --- Evaluation --- p.117
Chapter 6.3.1 --- Bleu Score Performance --- p.118
Chapter 6.3.2 --- Modified Bleu Score --- p.122
Chapter 6.4 --- Chapter Summary --- p.130
Chapter 7 --- Conclusions --- p.131
Chapter 7.1 --- Summary --- p.131
Chapter 7.2 --- Contributions --- p.134
Chapter 7.3 --- Future work --- p.136
Bibliography --- p.137
Chapter A --- Original SQL Queries --- p.144
Chapter B --- Seeded Categories --- p.146
Chapter C --- 3 Alignment Categories --- p.147
Chapter D --- Labels of Syntactic Structures in Grammar Checker --- p.148
"Automatic construction of English/Chinese parallel corpus." 2001. http://library.cuhk.edu.hk/record=b5890676.
Full textThesis (M.Phil.)--Chinese University of Hong Kong, 2001.
Includes bibliographical references (leaves 88-96).
Abstracts in English and Chinese.
ABSTRACT --- p.i
ACKNOWLEDGEMENTS --- p.v
LIST OF TABLES --- p.viii
LIST OF FIGURES --- p.ix
CHAPTERS
Chapter 1. --- INTRODUCTION --- p.1
Chapter 1.1 --- Application of corpus-based techniques --- p.2
Chapter 1.1.1 --- Machine Translation (MT) --- p.2
Chapter 1.1.1.1 --- Linguistic --- p.3
Chapter 1.1.1.2 --- Statistical --- p.4
Chapter 1.1.1.3 --- Lexicon construction --- p.4
Chapter 1.1.2 --- Cross-lingual Information Retrieval (CLIR) --- p.6
Chapter 1.1.2.1 --- Controlled vocabulary --- p.6
Chapter 1.1.2.2 --- Free text --- p.7
Chapter 1.1.2.3 --- Application corpus-based approach in CLIR --- p.9
Chapter 1.2 --- Overview of linguistic resources --- p.10
Chapter 1.3 --- Written language corpora --- p.12
Chapter 1.3.1 --- Types of corpora --- p.13
Chapter 1.3.2 --- Limitation of comparable corpora --- p.16
Chapter 1.4 --- Outline of the dissertation --- p.17
Chapter 2. --- LITERATURE REVIEW --- p.19
Chapter 2.1 --- Research in automatic corpus construction --- p.20
Chapter 2.2 --- Research in translation alignment --- p.25
Chapter 2.2.1 --- Sentence alignment --- p.27
Chapter 2.2.2 --- Word alignment --- p.28
Chapter 2.3 --- Research in alignment of sequences --- p.33
Chapter 3. --- ALIGNMENT AT WORD LEVEL AND CHARACTER LEVEL --- p.35
Chapter 3.1 --- Title alignment --- p.35
Chapter 3.1.1 --- Lexical features --- p.37
Chapter 3.1.2 --- Grammatical features --- p.40
Chapter 3.1.3 --- The English/Chinese alignment model --- p.41
Chapter 3.2 --- Alignment at word level and character level --- p.42
Chapter 3.2.1 --- Alignment at word level --- p.42
Chapter 3.2.2 --- Alignment at character level: Longest matching --- p.44
Chapter 3.2.3 --- Longest common subsequence(LCS) --- p.46
Chapter 3.2.4 --- Applying LCS in the English/Chinese alignment model --- p.48
Chapter 3.3 --- Reduce overlapping ambiguity --- p.52
Chapter 3.3.1 --- Edit distance --- p.52
Chapter 3.3.2 --- Overlapping in the algorithm model --- p.54
Chapter 4. --- ALIGNMENT AT TITLE LEVEL --- p.59
Chapter 4.1 --- Review of score functions --- p.59
Chapter 4.2 --- The Score function --- p.60
Chapter 4.2.1 --- (C matches E) and (E matches C) --- p.60
Chapter 4.2.2 --- Length similarity --- p.63
Chapter 5. --- EXPERIMENTAL RESULTS --- p.69
Chapter 5.1 --- Hong Kong government press release articles --- p.69
Chapter 5.2 --- Hang Seng Bank economic monthly reports --- p.76
Chapter 5.3 --- Hang Seng Bank press release articles --- p.78
Chapter 5.4 --- Hang Seng Bank speech articles --- p.81
Chapter 5.5 --- Quality of the collections and future work --- p.84
Chapter 6. --- CONCLUSION --- p.87
Bibliography
Büchse, Matthias. "Algebraic decoder specification: coupling formal-language theory and statistical machine translation: Algebraic decoder specification: coupling formal-language theory and statistical machine translation." Doctoral thesis, 2014. https://tud.qucosa.de/id/qucosa%3A28493.
Full textPolák, Peter. "Strojový překlad mluvené řeči přes fonetickou reprezentaci zdrojové řeči." Master's thesis, 2020. http://www.nusl.cz/ntk/nusl-416019.
Full text"Semi-automatic acquisition of domain-specific semantic structures." 2000. http://library.cuhk.edu.hk/record=b5890445.
Full textThesis (M.Phil.)--Chinese University of Hong Kong, 2000.
Includes bibliographical references (leaves 99-106).
Abstracts in English and Chinese.
Chapter 1 --- Introduction --- p.1
Chapter 1.1 --- Thesis Outline --- p.5
Chapter 2 --- Background --- p.6
Chapter 2.1 --- Natural Language Understanding --- p.6
Chapter 2.1.1 --- Rule-based Approaches --- p.7
Chapter 2.1.2 --- Stochastic Approaches --- p.8
Chapter 2.1.3 --- Phrase-Spotting Approaches --- p.9
Chapter 2.2 --- Grammar Induction --- p.10
Chapter 2.2.1 --- Semantic Classification Trees --- p.11
Chapter 2.2.2 --- Simulated Annealing --- p.12
Chapter 2.2.3 --- Bayesian Grammar Induction --- p.12
Chapter 2.2.4 --- Statistical Grammar Induction --- p.13
Chapter 2.3 --- Machine Translation --- p.14
Chapter 2.3.1 --- Rule-based Approach --- p.15
Chapter 2.3.2 --- Statistical Approach --- p.15
Chapter 2.3.3 --- Example-based Approach --- p.16
Chapter 2.3.4 --- Knowledge-based Approach --- p.16
Chapter 2.3.5 --- Evaluation Method --- p.19
Chapter 3 --- Semi-Automatic Grammar Induction --- p.20
Chapter 3.1 --- Agglomerative Clustering --- p.20
Chapter 3.1.1 --- Spatial Clustering --- p.21
Chapter 3.1.2 --- Temporal Clustering --- p.24
Chapter 3.1.3 --- Free Parameters --- p.26
Chapter 3.2 --- Post-processing --- p.27
Chapter 3.3 --- Chapter Summary --- p.29
Chapter 4 --- Application to the ATIS Domain --- p.30
Chapter 4.1 --- The ATIS Domain --- p.30
Chapter 4.2 --- Parameters Selection --- p.32
Chapter 4.3 --- Unsupervised Grammar Induction --- p.35
Chapter 4.4 --- Prior Knowledge Injection --- p.40
Chapter 4.5 --- Evaluation --- p.43
Chapter 4.5.1 --- Parse Coverage in Understanding --- p.45
Chapter 4.5.2 --- Parse Errors --- p.46
Chapter 4.5.3 --- Analysis --- p.47
Chapter 4.6 --- Chapter Summary --- p.49
Chapter 5 --- Portability to Chinese --- p.50
Chapter 5.1 --- Corpus Preparation --- p.50
Chapter 5.1.1 --- Tokenization --- p.51
Chapter 5.2 --- Experiments --- p.52
Chapter 5.2.1 --- Unsupervised Grammar Induction --- p.52
Chapter 5.2.2 --- Prior Knowledge Injection --- p.56
Chapter 5.3 --- Evaluation --- p.58
Chapter 5.3.1 --- Parse Coverage in Understanding --- p.59
Chapter 5.3.2 --- Parse Errors --- p.60
Chapter 5.4 --- Grammar Comparison Across Languages --- p.60
Chapter 5.5 --- Chapter Summary --- p.64
Chapter 6 --- Bi-directional Machine Translation --- p.65
Chapter 6.1 --- Bilingual Dictionary --- p.67
Chapter 6.2 --- Concept Alignments --- p.68
Chapter 6.3 --- Translation Procedures --- p.73
Chapter 6.3.1 --- The Matching Process --- p.74
Chapter 6.3.2 --- The Searching Process --- p.76
Chapter 6.3.3 --- Heuristics to Aid Translation --- p.81
Chapter 6.4 --- Evaluation --- p.82
Chapter 6.4.1 --- Coverage --- p.83
Chapter 6.4.2 --- Performance --- p.86
Chapter 6.5 --- Chapter Summary --- p.89
Chapter 7 --- Conclusions --- p.90
Chapter 7.1 --- Summary --- p.90
Chapter 7.2 --- Future Work --- p.92
Chapter 7.2.1 --- Suggested Improvements on Grammar Induction Process --- p.92
Chapter 7.2.2 --- Suggested Improvements on Bi-directional Machine Trans- lation --- p.96
Chapter 7.2.3 --- Domain Portability --- p.97
Chapter 7.3 --- Contributions --- p.97
Bibliography --- p.99
Chapter A --- Original SQL Queries --- p.107
Chapter B --- Induced Grammar --- p.109
Chapter C --- Seeded Categories --- p.111
Sefara, Tshephisho Joseph. "The development of an automatic pronunciation assistant." Thesis, 2019. http://hdl.handle.net/10386/2906.
Full textThe pronunciation of words and phrases in any language involves careful manipulation of linguistic features. Factors such as age, motivation, accent, phonetics, stress and intonation sometimes cause a problem of inappropriate or incorrect pronunciation of words from non-native languages. Pronunciation of words using different phonological rules has a tendency of changing the meaning of those words. This study presents the development of an automatic pronunciation assistant system for under-resourced languages of Limpopo Province, namely, Sepedi, Xitsonga, Tshivenda and isiNdebele. The aim of the proposed system is to help non-native speakers to learn appropriate and correct pronunciation of words/phrases in these under-resourced languages. The system is composed of a language identification module on the front-end side and a speech synthesis module on the back-end side. A support vector machine was compared to the baseline multinomial naive Bayes to build the language identification module. The language identification phase performs supervised multiclass text classification to predict a person’s first language based on input text before the speech synthesis phase continues with pronunciation issues using the identified language. The speech synthesis on the back-end phase is composed of four baseline text-to-speech synthesis systems in selected target languages. These text-to-speech synthesis systems were based on the hidden Markov model method of development. Subjective listening tests were conducted to evaluate the performance of the quality of the synthesised speech using a mean opinion score test. The mean opinion score test obtained good performance results on all targeted languages for naturalness, pronunciation, pleasantness, understandability, intelligibility, overall quality of the system and user acceptance. The developed system has been implemented on a “real-live” production web-server for performance evaluation and stability testing using live data.
Ouyang, Jessica Jin. "Adapting Automatic Summarization to New Sources of Information." Thesis, 2019. https://doi.org/10.7916/d8-5nar-6b61.
Full textQuernheim, Daniel. "Bimorphism Machine Translation." Doctoral thesis, 2016. https://ul.qucosa.de/id/qucosa%3A15589.
Full textvan, Merriënboer Bart. "Sequence-to-sequence learning for machine translation and automatic differentiation for machine learning software tools." Thèse, 2018. http://hdl.handle.net/1866/21743.
Full textVariš, Dušan. "Automatická korektura chyb ve výstupu strojového překladu." Master's thesis, 2016. http://www.nusl.cz/ntk/nusl-352609.
Full text