Dissertations / Theses: 'Malgache (langue) – Traduction automatique'

1

Rakotoalimanana, Herizo David. "Structure morphosyntaxique et modélisation informatique de la langue malgache." Nancy 2, 2000. http://www.theses.fr/2000NAN21035.

Full text

Abstract:

La présente étude s'inscrit dans le cadre d'un essai de traitement automatique de la langue malgache visant à construire un dictionnaire électronique. Notre travail est articulé autour de deux axes principaux : une étude descriptive de la langue d'une part et une modélisation informatique d'autre part. Après un chapitre préliminaire consacré à une présentation générale, nous présentons dans la première partie une analyse des termes malgaches portant tant sur les formes de la structure morphologique ou morphématique que sur leur comportement syntaxique. Les relations entre les constituants d'un énoncé sont détaillées. Nous décrivons les différentes voix des énoncés malgaches et les modalités : le temps, l'aspect et le mode. Dans la deuxième partie, nous formalisons les données linguistiques dégagées dans la partie descriptive pour en faire une implantation informatique d'un système d'Analyseur-Générateur des Termes prédicatifs Malgaches (AGTM). L'analyse automatique d'un terme consiste à repérer ses constituants, identifier leurs relations et assigner leur traduction en français. Notre système fournit toutes les formes dérivées possibles d'un terme. Nous terminons cette partie pratique par le bilan de notre étude et les perspectives d'avenir. La thèse est complétée par des annexes qui rassemblent le module de programmation et les résultats obtenus This study is intented to provide a first framework for the definition of an electronic dictionary of the Malagasy language to be used in the context of Natural Language Processing components. Our work is based, on the one hand, on a descriptive study of the Malagasy language, and, on the other hand, on the proposition of a computer representation of the corresponding phenomena. Following a preliminary chapter dedicated to a general presentation of the geographical and sociological context, the first part of the thesis presents a thorough analysis of Malagasian terms from the point of view of their morphology and syntactic behaviour. In particular, we describe the various voices and modalities of Malagasian utterances : tense, aspect and mood. The second part of the thesis comprises the formel representation that we associate to the linguistic data that have been previously described, so that a possible implementation can be derived in the form of a parser / generator of Malagasian predicative terms (AGTM). The automatic analysis of a term is based upon the identification of its various constituants, the relations that link these together and the identification of a possible translation in French for the term. Our system can also provide all the possible forms that can be derived from a root, given a set of restrictions provided as input. The thesis comprises several appendixes which present the whole software implemented in Prolog, as well as sample results