Log in

Relevant bibliographies by topics / Linguistic Extraction / Dissertations / Theses

To see the other types of publications on this topic, follow the link: Linguistic Extraction.

Dissertations / Theses on the topic 'Linguistic Extraction'

Author: Grafiati

Published: 4 June 2021

Last updated: 14 February 2022

Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles

Select a source type:

Consult the top 50 dissertations / theses for your research on the topic 'Linguistic Extraction.'

Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.

You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.

Browse dissertations / theses on a wide variety of disciplines and organise your bibliography correctly.

1

Mason, Oliver Jan. "The automatic extraction of linguistic information from text corpora." Thesis, University of Birmingham, 2006. http://etheses.bham.ac.uk//id/eprint/116/.

Full text

Abstract:

This is a study exploring the feasibility of a fully automated analysis of linguistic data. It identifies a requirement for large-scale investigations, which cannot be done manually by a human researcher. Instead, methods from natural language processing are suggested as a way to analyse large amounts of corpus data without any human intervention. Human involvement hinders scalability and introduces a bias which prevents studies from being completely replicable. The fundamental assumption underlying this work is that linguistic analysis must be empirical, and that reliance on existing theories

APA, Harvard, Vancouver, ISO, and other styles

2

Nepal, Srijan. "Linguistic Approach to Information Extraction and Sentiment Analysis on Twitter." University of Cincinnati / OhioLINK, 2012. http://rave.ohiolink.edu/etdc/view?acc_num=ucin1342544962.

Full text

APA, Harvard, Vancouver, ISO, and other styles

3

Shahid, Ahmad. "Extraction of linguistic resources from multilingual corpora and their exploitation." Thesis, University of York, 2012. http://etheses.whiterose.ac.uk/2111/.

Full text

Abstract:

Increasing availability of on-line and off-line multilingual resources along with the developments in the related automatic tools that can process this information, such as GIZA++ (Och & Ney 2003), has made it possible to build new multilingual resources that can be used for NLP/IR tasks. Lexicon generation is one such task, which if done by hand is quite expensive with human and capital costs involved. Generation of multilingual lexicons can now be automated, as is done in this research work. Wikipedia, an on-line multilingual resource was gainfully employed to automatically build multilingua

APA, Harvard, Vancouver, ISO, and other styles

4

Pettersson, Eva. "Spelling Normalisation and Linguistic Analysis of Historical Text for Information Extraction." Doctoral thesis, Uppsala universitet, Institutionen för lingvistik och filologi, 2016. http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-269753.

Full text

Abstract:

Historical text constitutes a rich source of information for historians and other researchers in humanities. Many texts are however not available in an electronic format, and even if they are, there is a lack of NLP tools designed to handle historical text. In my thesis, I aim to provide a generic workflow for automatic linguistic analysis and information extraction from historical text, with spelling normalisation as a core component in the pipeline. In the spelling normalisation step, the historical input text is automatically normalised to a more modern spelling, enabling the use of existin

APA, Harvard, Vancouver, ISO, and other styles

5

Lindes, Peter. "OntoSoar: Using Language to Find Genealogy Facts." BYU ScholarsArchive, 2014. https://scholarsarchive.byu.edu/etd/4133.

Full text

Abstract:

There is a need to have an automated system that can read family history books or other historical texts and extract as many genealogy facts as possible from them. Embley and others have applied traditional information extraction techniques to this problem in a system called OntoES with a reasonable amount of success. In parallel much linguistic theory has been developed in the past decades, and Lonsdale and others have built computational embodiments of some of these theories using Soar. In this thesis we introduce a system called OntoSoar which combines the Link Grammar Parser using a gramma

APA, Harvard, Vancouver, ISO, and other styles

6

Martínez, de la Mora Daniela 1983. "The Universality of perceptual and linguistic constraints in the extraction of rule-like patterns : a cross-species comparison." Doctoral thesis, Universitat Pompeu Fabra, 2013. http://hdl.handle.net/10803/113604.

Full text

Abstract:

Studies have shown that linguistic and perceptual constraints are important for speech processing. First, rule-like structures are more easily learned over vowels than over consonants. Second, sequences varying in pitch and duration are grouped following the Iambic – Trochaic Law (ITL). In this research, I investigated the origins of these linguistic and perceptual constraints. My aim was to test if vowels’ acoustic saliency was the reason why they are the preferred target for abstract computations, and to explore the extent to which the principles of the ITL come from evolutionary her

APA, Harvard, Vancouver, ISO, and other styles

7

Danilova, Vera. "Linguistic support for protest event data collection." Doctoral thesis, Universitat Autònoma de Barcelona, 2015. http://hdl.handle.net/10803/374232.

Full text

Abstract:

sta tesis aborda el problema de la cualidad de recopilación automática de datos sobre protestas y propone herramientas de extracción multilíngüe de atributos del evento de protesta para mejorar la calidad de la unidad de análisis. El trabajo incluye la exploración del estado de arte en los dominios de la recopilación automática de datos sobre protestas y la extracción multilíngüe de eventos. En la ausencia de una colección de datos multilíngües sobre protestas anotados por expertos para el aprendizaje supervisado nos enfocamos en el tratamiento de noticias multilíngües basado en patrones lin

APA, Harvard, Vancouver, ISO, and other styles

8

Marcińczuk, Michał. "Pattern Acquisition Methods for Information Extraction Systems." Thesis, Blekinge Tekniska Högskola, Avdelningen för programvarusystem, 2007. http://urn.kb.se/resolve?urn=urn:nbn:se:bth-4291.

Full text

Abstract:

This master thesis treats about Event Recognition in the reports of Polish stockholders. Event Recognition is one of the Information Extraction tasks. This thesis provides a comparison of two approaches to Event Recognition: manual and automatic. In the manual approach regular expressions are used. Regular expressions are used as a baseline for the automatic approach. In the automatic approach three Machine Learning methods were applied. In the initial experiment the Decision Trees, naive Bayes and Memory Based Learning methods are compared. A modification of the standard Memory Based Learning

APA, Harvard, Vancouver, ISO, and other styles

9

Aslam, Irfan. "Semantic frame based automatic extraction of typological information from descriptive grammars." Thesis, Högskolan i Skövde, Institutionen för informationsteknologi, 2019. http://urn.kb.se/resolve?urn=urn:nbn:se:his:diva-17893.

Full text

Abstract:

This thesis project addresses the machine learning (ML) modelling aspects of the problem of automatically extracting typological linguistic information of natural languages spoken in South Asia from annotated descriptive grammars. Without getting stuck into the theory and methods of Natural Language Processing (NLP), the focus has been to develop and test a machine learning (ML) model dedicated to the information extraction part. Starting with the existing state-of-the-art frameworks to get labelled training data through the structured representation of the descriptive grammars, the problem ha

APA, Harvard, Vancouver, ISO, and other styles

10

Oudni, Amal. "Fouille de données par extraction de motifs graduels : contextualisation et enrichissement." Thesis, Paris 6, 2014. http://www.theses.fr/2014PA066437/document.

Full text

Abstract:

Les travaux de cette thèse s'inscrivent dans le cadre de l'extraction de connaissances et de la fouille de données appliquée à des bases de données numériques ou floues afin d'extraire des résumés linguistiques sous la forme de motifs graduels exprimant des corrélations de co-variations des valeurs des attributs, de la forme « plus la température augmente, plus la pression augmente ». Notre objectif est de les contextualiser et de les enrichir en proposant différents types de compléments d'information afin d'augmenter leur qualité et leur apporter une meilleure interprétation. Nous proposons q

APA, Harvard, Vancouver, ISO, and other styles

11

Morsi, Youcef Ihab. "Analyse linguistique et extraction automatique de relations sémantiques des textes en arabe." Thesis, Bourgogne Franche-Comté, 2020. http://www.theses.fr/2020UBFCC019.

Full text

Abstract:

Cette recherche porte sur le développement d’un outil de traitement automatique de la langue arabe standard moderne, au niveau morphologique et sémantique, avec comme objectif final l’extraction d’information dans le domaine de l’innovation technologique en entreprise. En ce qui concerne l’analyse morphologique, notre outil comprend plusieurs traitements successifs qui permettent d’étiqueter et de désambiguïser les occurrences dans les textes : une couche morphologique (Gibran 1.0), qui s’appuie sur les schèmes arabes comme traits distinctifs ; une couche contextuelle (Gibran 2.0), qui fait ap

APA, Harvard, Vancouver, ISO, and other styles

12

Ferraro, Gabriela. "Towards deep content extraction from specialized discourse : the case of verbal relations in patent claims." Doctoral thesis, Universitat Pompeu Fabra, 2012. http://hdl.handle.net/10803/84174.

Full text

Abstract:

This thesis addresses the problem of the development of Natural Language Processing techniques for the extraction and generalization of compositional and functional relations from specialized written texts and, in particular, from patent claims. One of the most demanding tasks tackled in the thesis is, according to the state of the art, the semantic generalization of linguistic denominations of relations between object components and processes described in the texts. These denominations are usually verbal expressions or nominalizations that are too concrete to be used as standard labels

APA, Harvard, Vancouver, ISO, and other styles

13

Cunha, Fanego Iria da. "Hacia un modelo lingüístico de resumen automático de artículos médicos en español." Doctoral thesis, Universitat Pompeu Fabra, 2008. http://hdl.handle.net/10803/7508.

Full text

Abstract:

En esta tesis se presenta un modelo lingüístico de resumen automático de artículos médicos en español que aúna criterios basados en la estructura textual, en las unidades léxicas y la estructura discursiva y sintáctico-comunicativa de los textos. El modelo se crea partiendo de la hipótesis de que los especialistas de cada ámbito emplean estrategias específicas a la hora de resumir. La validación de esta hipótesis mediante experimentos estadísticos permite tomar los artículos médicos acompañados de sus respectivos resúmenes como material de referencia para analizar, de cara a detectar las estra

APA, Harvard, Vancouver, ISO, and other styles

14

da, Cunha Fanego Iria. "Hacia un modelo lingüístico de resumen automático de artículos médicos en español." Doctoral thesis, Universitat Pompeu Fabra, 2008. http://hdl.handle.net/10803/7508.

Full text

Abstract:

En esta tesis se presenta un modelo lingüístico de resumen automático de artículos médicos en español que aúna criterios basados en la estructura textual, en las unidades léxicas y la estructura discursiva y sintáctico-comunicativa de los textos. El modelo se crea partiendo de la hipótesis de que los especialistas de cada ámbito emplean estrategias específicas a la hora de resumir. La validación de esta hipótesis mediante experimentos estadísticos permite tomar los artículos médicos acompañados de sus respectivos resúmenes como material de referencia para analizar, de cara a detectar las estra

APA, Harvard, Vancouver, ISO, and other styles

15

Laguna, Merley da Silva Conrado. "Extração automática de termos simples baseada em aprendizado de máquina." Universidade de São Paulo, 2014. http://www.teses.usp.br/teses/disponiveis/55/55134/tde-11082014-103430/.

Full text

Abstract:

A Mineração de Textos (MT) visa descobrir conhecimento inovador nos textos não estruturados. A extração dos termos que representam os textos de um domínio é um dos passos mais importantes da MT, uma vez que os resultados de todo o processo da MT dependerão, em grande parte, da qualidade dos termos obtidos. Nesta tese, considera-se como termos as unidades lexicais realizadas para designar conceitos em um cenário tematicamente restrito. Para a extração dos termos, pode-se fazer uso de abordagens como: estatística, linguística ou híbrida. Normalmente, para a Mineração de Textos, são utilizados mé

APA, Harvard, Vancouver, ISO, and other styles

16

Santos, Carlos Alberto dos. "Uma an?lise comparativa entre as abordagens lingu?stica e estat?stica para extra??o autom?tica de termos relevantes de corpora." Pontif?cia Universidade Cat?lica do Rio Grande do Sul, 2018. http://tede2.pucrs.br/tede2/handle/tede/8233.

Full text

Abstract:

Submitted by PPG Ci?ncia da Computa??o (ppgcc@pucrs.br) on 2018-07-26T19:48:07Z No. of bitstreams: 1 CARLOS ALBERTO DOS SANTOS_DIS.pdf: 1271475 bytes, checksum: 856ae87ad633d3c772b413816caa43d1 (MD5)<br>Approved for entry into archive by Sheila Dias (sheila.dias@pucrs.br) on 2018-08-01T13:39:36Z (GMT) No. of bitstreams: 1 CARLOS ALBERTO DOS SANTOS_DIS.pdf: 1271475 bytes, checksum: 856ae87ad633d3c772b413816caa43d1 (MD5)<br>Made available in DSpace on 2018-08-01T14:31:21Z (GMT). No. of bitstreams: 1 CARLOS ALBERTO DOS SANTOS_DIS.pdf: 1271475 bytes, checksum: 856ae87ad633d3c772b413816caa43d1 (MD5

APA, Harvard, Vancouver, ISO, and other styles

17

Kreps, Christian John Manfred. "Extraction, movement and dependency theory." Thesis, University College London (University of London), 1998. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.300589.

Full text

APA, Harvard, Vancouver, ISO, and other styles

18

Hegarty, Michael Vincent. "Adjunct extraction and chain configurations." Thesis, Massachusetts Institute of Technology, 1992. http://hdl.handle.net/1721.1/17298.

Full text

APA, Harvard, Vancouver, ISO, and other styles

19

Ben, Salamah Janan. "Extraction de connaissances dans des textes arabes et français par une méthode linguistico-computationnelle." Thesis, Paris 4, 2017. http://www.theses.fr/2017PA040137.

Full text

Abstract:

Dans le cadre de notre thèse, nous avons proposé une approche générique multilingue d'extraction automatique de connaissances. Nous avons validé l‟approche sur l'extraction des événements de variations des cours pétroliers et l‟extraction des expressions temporelles liées à des référentiels. Notre approche est basée sur la constitution de plusieurs cartes sémantiques par analyse des données non structurées afin de formaliser les traces linguistiques textuelles exprimées par des catégories d'un point de vue de fouille. Nous avons mis en place un système expert permettant d‟annoter la présence d

APA, Harvard, Vancouver, ISO, and other styles

20

Piskorski, Jakub. "ExPRESS : extraction pattern recognition engine and specification suite." Universität Potsdam, 2008. http://opus.kobv.de/ubp/volltexte/2008/2722/.

Full text

Abstract:

The emergence of information extraction (IE) oriented pattern engines has been observed during the last decade. Most of them exploit heavily finite-state devices. This paper introduces ExPRESS – a new extraction pattern engine, whose rules are regular expressions over flat feature structures. The underlying pattern language is a blend of two previously introduced IE oriented pattern formalisms, namely, JAPE, used in the widely known GATE system, and the unificationbased XTDL formalism used in SProUT. A brief and technical overview of ExPRESS, its pattern language and the pool of its native lin

APA, Harvard, Vancouver, ISO, and other styles

21

Morgan, Tricia. "A comparative study of hypernymic patterns for knowledge extraction." Thesis, National Library of Canada = Bibliothèque nationale du Canada, 2001. http://www.collectionscanada.ca/obj/s4/f2/dsk3/ftp04/MQ58487.pdf.

Full text

APA, Harvard, Vancouver, ISO, and other styles

22

Tolle, Kristin M. "Domain-independent semantic concept extraction using corpus linguistics, statistics and artificial intelligence techniques." Diss., The University of Arizona, 2003. http://hdl.handle.net/10150/280502.

Full text

Abstract:

For this dissertation two software applications were developed and three experiments were conducted to evaluate the viability of a unique approach to medical information extraction. The first system, the AZ Noun Phraser, was designed as a concept extraction tool. The second application, ANNEE, is a neural net-based entity extraction (EE) system. These two systems were combined to perform concept extraction and semantic classification specifically for use in medical document retrieval systems. The goal of this research was to create a system that automatically (without human interaction) enable

APA, Harvard, Vancouver, ISO, and other styles

23

Marshman, Elizabeth. "The cause relation in biopharmaceutical corpora: English and French patterns for knowledge extraction." Thesis, University of Ottawa (Canada), 2002. http://hdl.handle.net/10393/6385.

Full text

Abstract:

One of the most important aspects of a terminologist's work is extracting conceptual information about terms from texts. Because this task is so time-consuming, researchers are trying to develop tools which will extract conceptual information semi-automatically. Many of these tools are based on the use of linguistic indicators called knowledge patterns. This thesis aims to identify some knowledge patterns in English and French which indicate the conceptual relation of cause and effect. This relation, though not as widely studied as those of generic to specific or part to whole, is critical

APA, Harvard, Vancouver, ISO, and other styles

24

Seemann, Nina [Verfasser], and Andreas [Akademischer Betreuer] Maletti. "Rule extraction for multi bottom-up tree transducers / Nina Seemann ; Betreuer: Andreas Maletti." Stuttgart : Universitätsbibliothek der Universität Stuttgart, 2016. http://d-nb.info/112864648X/34.

Full text

APA, Harvard, Vancouver, ISO, and other styles

25

Middleton, Anthony M. "High-Performance Knowledge-Based Entity Extraction." NSUWorks, 2009. http://nsuworks.nova.edu/gscis_etd/246.

Full text

Abstract:

Human language records most of the information and knowledge produced by organizations and individuals. The machine-based process of analyzing information in natural language form is called natural language processing (NLP). Information extraction (IE) is the process of analyzing machine-readable text and identifying and collecting information about specified types of entities, events, and relationships. Named entity extraction is an area of IE concerned specifically with recognizing and classifying proper names for persons, organizations, and locations from natural language. Extant approaches

APA, Harvard, Vancouver, ISO, and other styles

26

Muthiah, Kalaivahni. "What Happens to the Where, When and How in Malay?" Thesis, University of North Texas, 2000. https://digital.library.unt.edu/ark:/67531/metadc2512/.

Full text

Abstract:

In this thesis, I analyze three positions of the wh-word in Malay and attempt to explain what accounts for the differences between them. Specifically, I consider if the movement of the wh-interrogative is really wh-movement or if something else is going on. In regard to the the in-situ wh-words and the partially moved wh-words, I consider whether these move covertly and if they do, if this is feature movement or covert phrasal movement.

APA, Harvard, Vancouver, ISO, and other styles

27

Hubbertz, Andrew Paul. "Subject clitics and subject extraction in Somali." Thesis, McGill University, 1991. http://catalog.hathitrust.org/api/volumes/oclc/32079883.html.

Full text

APA, Harvard, Vancouver, ISO, and other styles

28

Godby, Carol Jean. "A Computational Study of Lexicalized Noun Phrases in English." The Ohio State University, 2002. http://rave.ohiolink.edu/etdc/view?acc_num=osu1017343683.

Full text

APA, Harvard, Vancouver, ISO, and other styles

29

Fox, Daniel. "Scrambling and extraction constraints in Dari : GB and RRG analyses /." Amherst, Mass. : [s.n.], 2010. http://hdl.handle.net/10009/301.

Full text

APA, Harvard, Vancouver, ISO, and other styles

30

Hätty, Anna [Verfasser], and im Walde Sabine [Akademischer Betreuer] Schulte. "Automatic term extraction for conventional and extended term definitions across domains / Anna Hätty ; Betreuer: Sabine Schulte im Walde." Stuttgart : Universitätsbibliothek der Universität Stuttgart, 2020. http://d-nb.info/1221601245/34.

Full text

APA, Harvard, Vancouver, ISO, and other styles

31

Tabassum, Binte Jafar Jeniya. "Information Extraction From User Generated Noisy Texts." The Ohio State University, 2020. http://rave.ohiolink.edu/etdc/view?acc_num=osu1606315356821532.

Full text

APA, Harvard, Vancouver, ISO, and other styles

32

Foo, Jody. "Computational Terminology : Exploring Bilingual and Monolingual Term Extraction." Licentiate thesis, Linköpings universitet, NLPLAB - Laboratoriet för databehandling av naturligt språk, 2012. http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-75243.

Full text

Abstract:

Terminologies are becoming more important to modern day society as technology and science continue to grow at an accelerating rate in a globalized environment. Agreeing upon which terms should be used to represent which concepts and how those terms should be translated into different languages is important if we wish to be able to communicate with as little confusion and misunderstandings as possible. Since the 1990s, an increasing amount of terminology research has been devoted to facilitating and augmenting terminology-related tasks by using computers and computational methods. One focus for

APA, Harvard, Vancouver, ISO, and other styles

33

Tiedemann, Jörg. "Recycling Translations : Extraction of Lexical Data from Parallel Corpora and their Application in Natural Language Processing." Doctoral thesis, Uppsala University, Department of Linguistics, 2003. http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-3791.

Full text

Abstract:

<p>The focus of this thesis is on re-using translations in natural language processing. It involves the collection of documents and their translations in an appropriate format, the automatic extraction of translation data, and the application of the extracted data to different tasks in natural language processing.</p><p>Five parallel corpora containing more than 35 million words in 60 languages have been collected within co-operative projects. All corpora are sentence aligned and parts of them have been analyzed automatically and annotated with linguistic markup.</p><p>Lexical data are extract

APA, Harvard, Vancouver, ISO, and other styles

34

Lindqvist, Ellinor. "Text Simpliﬁcation and Keyphrase Extraction for Swedish". Thesis, Uppsala universitet, Institutionen för lingvistik och filologi, 2019. http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-385150.

Full text

Abstract:

Attempts have been made in Sweden to increase readability for texts addressed to the public, and ongoing projects are still being conducted by disability associations, private companies and Swedish authorities. In this thesis project, we explore automatic approaches to increase readability trough text simpliﬁcation and keyphrase extraction, with the goal of facilitating text comprehension and readability for people with reading difﬁculties. A combination of handwritten rules and monolingual machine translation was used to simplify the syntactic and lexical content of Swedish texts, and noun ph

APA, Harvard, Vancouver, ISO, and other styles

35

Sil, Avirup. "Entity Information Extraction using Structured and Semi-structured resources." Diss., Temple University Libraries, 2014. http://cdm16002.contentdm.oclc.org/cdm/ref/collection/p245801coll10/id/272966.

Full text

Abstract:

Computer and Information Science<br>Ph.D.<br>Among all the tasks that exist in Information Extraction, Entity Linking, also referred to as entity disambiguation or entity resolution, is a new and important problem which has recently caught the attention of a lot of researchers in the Natural Language Processing (NLP) community. The task involves linking/matching a textual mention of a named-entity (like a person or a movie-name) to an appropriate entry in a database (e.g. Wikipedia or IMDB). If the database does not contain the entity it should return NIL (out-of-database) value. Existing tech

APA, Harvard, Vancouver, ISO, and other styles

36

Conrath, Juliette. "Unsupervised extraction of semantic relations using discourse information." Thesis, Toulouse 3, 2015. http://www.theses.fr/2015TOU30202/document.

Full text

Abstract:

La compréhension du langage naturel repose souvent sur des raisonnements de sens commun, pour lesquels la connaissance de relations sémantiques, en particulier entre prédicats verbaux, peut être nécessaire. Cette thèse porte sur la problématique de l'utilisation d'une méthode distributionnelle pour extraire automatiquement les informations sémantiques nécessaires à ces inférences de sens commun. Des associations typiques entre des paires de prédicats et un ensemble de relations sémantiques (causales, temporelles, de similarité, d'opposition, partie/tout) sont extraites de grands corpus, par l'

APA, Harvard, Vancouver, ISO, and other styles

37

Lilliehöök, Hampus. "Extraction of word senses from bilingual resources using graph-based semantic mirroring." Thesis, Linköpings universitet, Interaktiva och kognitiva system, 2013. http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-91880.

Full text

Abstract:

In this thesis we retrieve semantic information that exists implicitly in bilingual data. We gather input data by repeatedly applying the semantic mirroring procedure. The data is then represented by vectors in a large vector space. A resource of synonym clusters is then constructed by performing K-means centroid-based clustering on the vectors. We evaluate the result manually, using dictionaries, and against WordNet, and discuss prospects and applications of this method.<br>I det här arbetet utvinner vi semantisk information som existerar implicit i tvåspråkig data. Vi samlar indata genom att

APA, Harvard, Vancouver, ISO, and other styles

38

Kohail, Sarah [Verfasser], and Chris [Akademischer Betreuer] Biemann. "Unsupervised Induction of Domain Dependency Graphs - Extracting, Understanding and Visualizing Domain Knowledge / Sarah Kohail ; Betreuer: Chris Biemann." Hamburg : Staats- und Universitätsbibliothek Hamburg, 2019. http://d-nb.info/1201821398/34.

Full text

APA, Harvard, Vancouver, ISO, and other styles

39

Kohail, Sarah Verfasser], and Chris [Akademischer Betreuer] [Biemann. "Unsupervised Induction of Domain Dependency Graphs - Extracting, Understanding and Visualizing Domain Knowledge / Sarah Kohail ; Betreuer: Chris Biemann." Hamburg : Staats- und Universitätsbibliothek Hamburg, 2019. http://nbn-resolving.de/urn:nbn:de:gbv:18-101911.

Full text

APA, Harvard, Vancouver, ISO, and other styles

40

Dhyani, Dushyanta Dhyani. "Boosting Supervised Neural Relation Extraction with Distant Supervision." The Ohio State University, 2018. http://rave.ohiolink.edu/etdc/view?acc_num=osu1524095334803486.

Full text

APA, Harvard, Vancouver, ISO, and other styles

41

Munson, Matthew [Verfasser]. "Biblical Semantics : Applying Digital Methods for Semantic Information Extraction to Current Problems in New Testament Studies / Matthew Munson." Aachen : Shaker, 2017. http://d-nb.info/1149279451/34.

Full text

APA, Harvard, Vancouver, ISO, and other styles

42

Stalpouskaya, Katsiaryna [Verfasser], and Romy [Akademischer Betreuer] Fröhlich. "Automatic extraction of agendas for action from news coverage of violent conflict / Katsiaryna Stalpouskaya ; Betreuer: Romy Fröhlich." München : Universitätsbibliothek der Ludwig-Maximilians-Universität, 2019. http://d-nb.info/1216417784/34.

Full text

APA, Harvard, Vancouver, ISO, and other styles

43

Cederblad, Gustav. "Finding Synonyms in Medical Texts : Creating a system for automatic synonym extraction from medical texts." Thesis, Linköpings universitet, Institutionen för datavetenskap, 2018. http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-149643.

Full text

Abstract:

This thesis describes the work of creating an automatic system for identifying synonyms and semantically related words in medical texts. Before this work, as a part of the project E-care@home, medical texts have been classified as either lay or specialized by both a lay annotator and an expert annotator. The lay annotator, in this case, is a person without any medical knowledge, whereas the expert annotator has professional knowledge in medicine. Using these texts made it possible to create co-occurrences matrices from which the related words could be identified. Fifteen medical terms were cho

APA, Harvard, Vancouver, ISO, and other styles

44

Lindén, Johannes. "Extracting Text into Meta-Data : Improving machine text-understanding of news-media articles." Licentiate thesis, Mittuniversitetet, Institutionen för informationssystem och –teknologi, 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:miun:diva-41775.

Full text

Abstract:

Society is constantly in need of information. It is important to consume event-based information of what is happening around us as well as facts and knowledge. As society grows, the amount of information to consume grows with it. This thesis demonstrates one way to extract and represent knowledge from text in a machine-readable way for news media articles. Three objectives are considered when developing a machine learning system to retrieve categories, entities, relations and other meta-data from text paragraphs. The first is to sort the terminology by topic; this makes it easier for machine l

APA, Harvard, Vancouver, ISO, and other styles

45

Mavrikas, Efthimios. "Entre les mots : méthodes d’analyse informatique du discours idéologique." Thesis, Lyon 2, 2010. http://www.theses.fr/2010LYO22013/document.

Full text

Abstract:

La présente thèse recherche une approche sémantique pour l'extraction et l'analyse du discours idéologique au sein de documents textuels sous forme électronique. Cette approche intègre une méthode qualitative d'analyse de textes issue des sciences sociales (analyse critique du discours) avec une méthode quantitative de raisonnement et extraction d'information à base d'ontologies de domaine et de traitement semi-automatique du langage naturel. L'application centrale du projet de thèse vise à étudier le discours marxiste comme représenté par la collection thématique Archive Internet des Marxiste

APA, Harvard, Vancouver, ISO, and other styles

46

Celebi, Hatice. "Extracting And Analyzing Impoliteness In Corpora A Study Based On Thebritish National Corpus And The Spoken Turkish Corpus." Phd thesis, METU, 2012. http://etd.lib.metu.edu.tr/upload/12615309/index.pdf.

Full text

Abstract:

This study aims to focus on extracting and analyzing impoliteness in corpora in British English and Turkish retrieved from two different corpora British National Corpus (BNC) and Spoken Turkish Corpus (STC), which is under construction. It focuses on conversation as genre in spoken interaction and discusses issues related to impoliteness in a corpus driven linguistics (CDL) approach. It proposes two levels<br>extraction and analysis. Within the CDL framework, the theory or model of impoliteness behind the analysis will be forced by the findings gathered from the extraction of impoliteness. At

APA, Harvard, Vancouver, ISO, and other styles

47

Sjöberg, Agaton. "Extracting Transaction Information from Financial Press Releases." Thesis, Linköpings universitet, Artificiell intelligens och integrerade datorsystem, 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-177688.

Full text

Abstract:

The use cases of Information Extraction (IE) are more or less endless, often consisting of a combination of Named Entity Recognition (NER) and Relation Extraction (RE). One use case of IE is the extraction of transaction information from Norwegian insider transaction Press Releases (PRs), where a transaction consists of at most four entities: the name of the owner performing the transaction, the number of shares transferred, the transaction date, and the price of the shares bought or sold. The relationships between the entities define which entity belongs to which transaction, and whether shar

APA, Harvard, Vancouver, ISO, and other styles

48

Erin, Macmurray. "Discours de presse et veille stratégique d'événements Approche textométrique et extraction d'informations pour la fouille de textes." Phd thesis, Université de la Sorbonne nouvelle - Paris III, 2012. http://tel.archives-ouvertes.fr/tel-00740601.

Full text

Abstract:

Ce travail a pour objet l'étude de deux méthodes de fouille automatique de textes, l'extraction d'informations et la textométrie, toutes deux mises au service de la veille stratégique des événements économiques. Pour l'extraction d'informations, il s'agit d'identifier et d'étiqueter des unités de connaissances, entités nommées -- sociétés, lieux, personnes, qui servent de points d'entrée pour les analyses d'activités ou d'événements économiques -- fusions, faillites, partenariats, impliquant ces différents acteurs. La méthode textométrique, en revanche, met en oeuvre un ensemble de modèles sta

APA, Harvard, Vancouver, ISO, and other styles

49

Kantzola, Evangelia. "Extractive Text Summarization of Greek News Articles Based on Sentence-Clusters." Thesis, Uppsala universitet, Institutionen för lingvistik och filologi, 2020. http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-420291.

Full text

Abstract:

This thesis introduces an extractive summarization system for Greek news articles based on sentence clustering. The main purpose of the paper is to evaluate the impact of three different types of text representation, Word2Vec embeddings, TF-IDF and LASER embeddings, on the summarization task. By taking these techniques into account, we build three different versions of the initial summarizer. Moreover, we create a new corpus of gold standard summaries to evaluate them against the system summaries. The new collection of reference summaries is merged with a part of the MultiLing Pilot 2011 in or

APA, Harvard, Vancouver, ISO, and other styles

50

Grant, Harald. "Extractive Multi-document Summarization of News Articles." Thesis, Linköpings universitet, Institutionen för datavetenskap, 2019. http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-158275.

Full text

Abstract:

Publicly available data grows exponentially through web services and technological advancements. To comprehend large data-streams multi-document summarization (MDS) can be used. In this research, the area of multi-document summarization is investigated. Multiple systems for extractive multi-document summarization are implemented using modern techniques, in the form of the pre-trained BERT language model for word embeddings and sentence classification. This is combined with well proven techniques, in the form of the TextRank ranking algorithm, the Waterfall architecture and anti-redundancy filt

APA, Harvard, Vancouver, ISO, and other styles

We offer discounts on all premium plans for authors whose works are included in thematic literature selections. Contact us to get a unique promo code!