Dissertations / Theses on the topic 'Coreference'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the top 50 dissertations / theses for your research on the topic 'Coreference.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Browse dissertations / theses on a wide variety of disciplines and organise your bibliography correctly.
Nicol, Janet Lee. "Coreference processing during sentence comprehension." Thesis, Massachusetts Institute of Technology, 1988. http://hdl.handle.net/1721.1/14421.
Full textYoon, Chulmin. "Essays on De Jure Coreference." The Ohio State University, 2020. http://rave.ohiolink.edu/etdc/view?acc_num=osu1595589314146257.
Full textHERNANDEZ, ADRIEL GARCIA. "COREFERENCE RESOLUTION FOR THE ENGLISH LANGUAGE." PONTIFÍCIA UNIVERSIDADE CATÓLICA DO RIO DE JANEIRO, 2017. http://www.maxwell.vrac.puc-rio.br/Busca_etds.php?strSecao=resultado&nrSeq=30730@1.
Full textCOORDENAÇÃO DE APERFEIÇOAMENTO DO PESSOAL DE ENSINO SUPERIOR
FUNDAÇÃO DE APOIO À PESQUISA DO ESTADO DO RIO DE JANEIRO
PROGRAMA DE EXCELENCIA ACADEMICA
BOLSA NOTA 10
Um dos problemas encontrados nos sistemas de processamento de linguagem natural é a dificuldade em identificar elementos textuais que se referem à mesma entidade. Este fenômeno é chamado de correferência. Resolver esse problema é parte integrante da compreensão do discurso, permitindo que os usuários da linguagem conectem as partes da informação de fala relativas à mesma entidade. Por conseguinte, a resolução de correferência é um importante foco de atenção no processamento da linguagem natural.Apesar da riqueza das pesquisas existentes, o desempenho atual dos sistemas de resolução de correferência ainda não atingiu um nível satisfatório. Neste trabalho, descrevemos um sistema de aprendizado estruturado para resolução de correferências em restrições que explora duas técnicas: árvores de correferência latente e indução automática de atributos guiadas por entropia. A modelagem de árvore latente torna o problema de aprendizagem computacionalmente viável porque incorpora uma estrutura escondida relevante. Além disso, utilizando um método automático de indução de recursos, podemos construir eficientemente modelos não-lineares, usando algoritmos de aprendizado de modelo linear como, por exemplo, o algoritmo de perceptron estruturado e esparso.Nós avaliamos o sistema para textos em inglês, utilizando o conjunto de dados da CoNLL-2012 Shared Task. Para a língua inglesa, nosso sistema obteve um valor de 62.24 por cento no score oficial dessa competição. Este resultado está abaixo do desempenho no estado da arte para esta tarefa que é de 65.73 por cento. No entanto, nossa solução reduz significativamente o tempo de obtenção dos clusters dos documentos, pois, nosso sistema leva 0.35 segundos por documento no conjunto de testes, enquanto no estado da arte, leva 5 segundos para cada um.
One of the problems found in natural language processing systems, is the difficulty to identify textual elements referring to the same entity, this task is called coreference. Solving this problem is an integral part of discourse comprehension since it allows language users to connect the pieces of speech information concerning to the same entity. Consequently, coreference resolution is a key task in natural language processing.Despite the large efforts of existing research, the current performance of coreference resolution systems has not reached a satisfactory level yet. In this work, we describe a structure learning system for unrestricted coreferencere solution that explores two techniques: latent coreference trees and automatic entropy-guided feature induction. The latent tree modeling makes the learning problem computationally feasible,since it incorporates are levant hidden structure. Additionally,using an automatic feature induction method, we can efciently build enhanced non-linear models using linear model learning algorithms, namely, the structure dandsparse perceptron algorithm. We evaluate the system on the CoNLL-2012 Shared Task closed track data set, for the English portion. The proposed system obtains a 62.24 per cent value on the competition s official score. This result is be low the 65.73 per cent, the state-of-the-art performance for this task. Nevertheless, our solution significantly reduces the time to obtain the clusters of adocument, since, our system takes 0.35 seconds per document in the testing set, while in the state-of-the-art, it takes 5 seconds for each one.
WERNER, ENEIDA FIGUEIRA DE ALMEIDA. "REVISION IN WRITING AND COREFERENCE ISSUES." PONTIFÍCIA UNIVERSIDADE CATÓLICA DO RIO DE JANEIRO, 2018. http://www.maxwell.vrac.puc-rio.br/Busca_etds.php?strSecao=resultado&nrSeq=36163@1.
Full textCOORDENAÇÃO DE APERFEIÇOAMENTO DO PESSOAL DE ENSINO SUPERIOR
PROGRAMA DE SUPORTE À PÓS-GRADUAÇÃO DE INSTS. DE ENSINO
O objetivo desta tese é investigar o processo de revisão da escrita e o processo de estabelecimento da correferência quanto à forma como são monitorados por grupos com diferentes graus de experiência em escrita. A pesquisa insere-se no quadro dos estudos sobre processamento da escrita, focalizando o processo da produção, e ancora-se, teoricamente, no tocante à pesquisa em escrita, no modelo de processamento cognitivo da escrita de Flower e Hayes (1980) e no modelo de revisão de Hayes (1987). Nos estudos da correferência, consideram-se as principais teorias voltadas para a investigação da influência de fatores que favorecem a acessibilidade à memória para seu estabelecimento, a Teoria da Acessibilidade (Ariel, 1990), a Teoria da Centralização (Grosz, Joshi e Weinstein, 1995) e a Hipótese da Carga Informacional (Almor, 1999). Relacionamos as questões teóricas aos dados de natureza cognitiva obtidos por meio de metodologia experimental. O laboratório utilizado foi o LAPAL, na PUC-Rio. Os experimentos conduzidos basearam-se em tarefas de produção e revisão de textos. Foi utilizada a ferramenta de keystroke logging Inputlog (http://www.inputlog.net/) para gravação e análise dos dados. Os participantes eram alunos de graduação e de pós-graduação de uma instituição pública e de uma instituição privada no Rio de Janeiro. No primeiro experimento foram analisados dados de natureza global do processamento da escrita e do processamento da correferência a partir de imagens-estímulos de duas histórias em quadrinhos, sem material verbal. No que tange ao comportamento global do processamento de escrita, foram verificadas medidas relativas ao processo e ao produto do texto produzido (em termos de número de caracteres e de palavras) e também relativas a pausas e tipos de revisões realizadas. No âmbito das medidas voltadas especificamente para o processamento da correferência, foramanalisados dados relacionados aos tipos de expressões referenciais selecionadas para introduzir e retomar entidades discursivas, bem como quanto ao momento em que elementos de retomada foram revistos (revisão do tipo imediata ou posterior) e à natureza do tipo de alteração implementada no que tange ao grau de especificidade do termo usado na substituição (mais/menos específico). O segundo experimento objetivou investigar os fatores que influenciam a escolha de uma expressão referencial anafórica a partir da informação contida no antecedente. Foi conduzida tarefa de revisão com quatro textos de mesmo tipo narrativo. Em cada tipo de texto avaliou-se os tipos de retomadas anafóricas das expressões referenciais em função do grau de ativação de informação na memória favorecido pela acessibilidade ao antecedente. Foram tomadas como variáveis independentes a função sintática do antecedente (mais suj.; menos suj.), o papel temático (mais agente; menos agente), e a distância entre o antecedente e o elemento de retomada (igual período; diferente período). No primeiro experimento os resultados apontaram divergências entre os tipos de revisões efetuadas (imediatas/posteriores) e quanto à proporção de revisões efetuadas (apagamentos/inserções) indicando que o grupo de alunos de pós-graduação empregou mais qualitativamente estratégias e recursos de revisão no monitoramento de seus textos do que os alunos de graduação. No segundo experimento, na análise estatística conduzida para cada grupo separadamente, foi verificado efeito principal de posição sintática (nos 2 grupos), distância (nos 2 grupos), e papel temático (no grupo de pós-graduação). Além disso, verificaram-se efeitos de interação entre posição e distância, e entre posição, papel temático e distância (grupo de graduação) e de posição e distância (grupo de pós-graduação). A qualidade das revisões efetuadas foi diferente, tendo o grupo de alunos de pós-graduação efetuado mais revisões do tipo posterior e percentualmente mais revisões que implicaram modificações na qualidade textual. Em conjunt
The purpose of this doctoral thesis is to investigate the writing process and the process of establishing coreference as to how they are monitored by groups of different degrees of writing experience. The research is part of the study of writing processing, focusing on the production process, and is theoretically anchored in writing research related to the Cognitive Writing Model of Hayes and Flower (1980) and in Hayes s Writing Revision Model (1987). In the studies of coreference, we consider the main theories that investigate the influence of factors that favour accessibility to memory, Accessibility Theory (Ariel, 1990), the Centering Theory (Grosz, Joshi and Weinstein, 1995) and the Information Load Hypothesis (Almor, 1999). We related the theoretical questions to the data captured by means of experimental methodology. The laboratory used was LAPAL, at PUC-Rio. The experiments conducted were based on writing production and revision tasks and we used the technological tool of keystroke logging Inputlog (http://www.inputlog.net/) to record and analyse data. Participants were graduate and post graduate students of public and private institutions in Rio de Janeiro.In the first experiment the data analysed related to production of writing and coreference processing from image-stimuli of two comic strips without verbal material. Concerning the measures related to writing production, we analysed the relation between the process and product in terms of the number of characters and words as well as pauses and the types of revisions made. Regarding the measures of coreference processing, we examined the types of of referential expressions selected to introduce and to establish coreference within discourse entities, as well as data related to the moment when correferential elements were revised (immediate or delayed revisions) and the degree of specificity implied in the alterations worked out. The second experiment aimed to investigate the factors that influence the choice of anaphoric referential expressions from the type of information contained in the antecedent. We conducted an experiment of writing revision consisting of four different texts of the same discursive genre. In each of them we took into account the degree of activation in memory provided by information that favours accessibility to memory stored items. The independent variables were the syntactic function of the antecedent(more subject/less subject), the thematic role of the antecedent (more agent/less agent) and the the distance between the antecedent and the anaphoric referential expression (equal period/different period). Results from the first experiment pointed out differences between the types of revisions (immediate/delayed) and the proportion of revisions made (deletions/insertions) indicating that post-graduate group used more revision strategy resources while monitoring their production as compared to the group of graduates. In the second experiment, statistical analysis conducted for each group separately revealed effects of the factors considered as for syntactic position (in the 2 groups), thematic role (in the post-graduates group) and distance (in both groups). In addition, interaction effects between distance and syntactic position and between position, thematic role and distance (graduates group) and position and distance (post-graduates group) were significant. The quality of the revisions was proven diverse, having post-graduates proceeded to more delayed revisions that imply alteration in overall text quality than the group of graduates. As a whole, the experiments conducted allowed us to identify differences between the experimental groups and suggest evidence that schooling level plays an important role in writing and in the choices made in for coreference processing.
Bodnari, Andreea. "Joint multilingual learning for coreference resolution." Thesis, Massachusetts Institute of Technology, 2014. http://hdl.handle.net/1721.1/91126.
Full text98
Cataloged from PDF version of thesis.
Includes bibliographical references (pages 112-120).
Natural language is a pervasive human skill not yet fully achievable by automated computing systems. The main challenge is understanding how to computationally model both the depth and the breadth of natural languages. In this thesis, I present two probabilistic models that systematically model both the depth and the breadth of natural languages for two different linguistic tasks: syntactic parsing and joint learning of named entity recognition and coreference resolution. The syntactic parsing model outperforms current state-of-the-art models by discovering linguistic information shared across languages at the granular level of a sentence. The coreference resolution system is one of the first attempts at joint multilingual modeling of named entity recognition and coreference resolution with limited linguistic resources. It performs second best on three out of four languages when compared to state-of-the-art systems built with rich linguistic resources. I show that we can simultaneously model both the depth and the breadth of natural languages using the underlying linguistic structure shared across languages.
by Andreea Bodnari.
Ph. D.
Webster, Kellie. "Improved Coreference Resolution Using Cognitive Insights." Thesis, The University of Sydney, 2016. http://hdl.handle.net/2123/15468.
Full textCorazza, Michele. "Coreference Resoultion basata su reti neurali deep." Master's thesis, Alma Mater Studiorum - Università di Bologna, 2017. http://amslaurea.unibo.it/14554/.
Full textNilsson, Kristina. "Hybrid Methods for Coreference Resolution in Swedish." Doctoral thesis, Stockholm : Department of Linguistics, Stockholm University, 2010. http://urn.kb.se/resolve?urn=urn:nbn:se:su:diva-38395.
Full textChristiansen, Thomas Wulstan. "Coreference and noun phrase selection in Italian." Thesis, University of Salford, 2000. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.365982.
Full textKunz, Jenny. "Neural Language Models with Explicit Coreference Decision." Thesis, Uppsala universitet, Institutionen för lingvistik och filologi, 2019. http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-371827.
Full textRolih, Gabi. "Applying Coreference Resolution for Usage in Dialog Systems." Thesis, Uppsala universitet, Institutionen för lingvistik och filologi, 2018. http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-353730.
Full textPatel, Chandankumar Johakhim. "A Performance Analysis Framework for Coreference Resolution Algorithms." Wright State University / OhioLINK, 2016. http://rave.ohiolink.edu/etdc/view?acc_num=wright1471954403.
Full textShrimpton, Luke William. "Efficient techniques for streaming cross document coreference resolution." Thesis, University of Edinburgh, 2017. http://hdl.handle.net/1842/28895.
Full textSapena, Masip Emili. "A constraint-based hypergraph partitioning approach to coreference resolution." Doctoral thesis, Universitat Politècnica de Catalunya, 2012. http://hdl.handle.net/10803/83904.
Full textLa resolució de correferències és una tasca de processament del llenguatge natural que consisteix en determinar les expressions d'un discurs que es refereixen a la mateixa entitat del mon real. La tasca té un efecte directe en la minería de textos així com en moltes tasques de llenguatge natural que requereixin interpretació del discurs com resumidors, responedors de preguntes o traducció automàtica. Resoldre les correferències és essencial si es vol poder “entendre” un text o un discurs. Els objectius d'aquesta tesi es centren en la recerca en resolució de correferències amb aprenentatge automàtic. Concretament, els objectius de la recerca es centren en els següents camps: + Models de classificació: Els models de classificació més comuns a l'estat de l'art estan basats en la classificació independent de parelles de mencions. Més recentment han aparegut models que classifiquen grups de mencions. Un dels objectius de la tesi és incorporar el model entity-mention a l'aproximació desenvolupada. + Representació del problema: Encara no hi ha una representació definitiva del problema. En aquesta tesi es presenta una representació en hypergraf. + Algorismes de resolució. Depenent de la representació del problema i del model de classificació, els algorismes de ressolució poden ser molt diversos. Un dels objectius d'aquesta tesi és trobar un algorisme de resolució capaç d'utilitzar els models de classificació en la representació d'hypergraf. + Representació del coneixement: Per poder administrar coneixement de diverses fonts, cal una representació simbòlica i expressiva d'aquest coneixement. En aquesta tesi es proposa l'ús de restriccions. + Incorporació de coneixement del mon: Algunes correferències no es poden resoldre només amb informació lingüística. Sovint cal sentit comú i coneixement del mon per poder resoldre coreferències. En aquesta tesi es proposa un mètode per extreure coneixement del mon de Wikipedia i incorporar-lo al sistem de resolució. Les contribucions principals d'aquesta tesi son (i) una nova aproximació al problema de resolució de correferències basada en satisfacció de restriccions, fent servir un hypergraf per representar el problema, i resolent-ho amb l'algorisme relaxation labeling; i (ii) una recerca per millorar els resultats afegint informació del mon extreta de la Wikipedia. L'aproximació presentada pot fer servir els models mention-pair i entity-mention de forma combinada evitant així els problemes que es troben moltes altres aproximacions de l'estat de l'art com per exemple: contradiccions de classificacions independents, falta de context i falta d'informació. A més a més, l'aproximació presentada permet incorporar informació afegint restriccions i s'ha fet recerca per aconseguir afegir informació del mon que millori els resultats. RelaxCor, el sistema que ha estat implementat durant la tesi per experimentar amb l'aproximació proposada, ha aconseguit uns resultats comparables als millors que hi ha a l'estat de l'art. S'ha participat a les competicions internacionals SemEval-2010 i CoNLL-2011. RelaxCor va obtenir la segona posició al CoNLL-2010.
Shyu, Eric. "Latent tree structure learning for cross-document coreference resolution." Thesis, Massachusetts Institute of Technology, 2014. http://hdl.handle.net/1721.1/91867.
Full textCataloged from PDF version of thesis.
Includes bibliographical references (pages 77-79).
Cross Document Coreference Resolution (CDCR) is the problem of learning which mentions, coming from several different documents, correspond to the same entity. This thesis approaches the CDCR problem by first turning it into a structure learning problem. A latent tree structure, in which leaves correspond to observed mentions and internal nodes correspond to latent sub-entities, is learned. A greedy clustering heuristic can then be used to select subtrees from the learned tree structure as entities. As with other structure learning problems, it is prudent to envoke Occam's razor and perform regularization to obtain the simplest hypothesis. When the state space consists of tree structures, we can impose a bias on the possible structure. Different aspects of tree structure (i.e. number of edges, depth of the leaves, etc.) can be penalized in these models to improve the generalization of thes models. This thesis draws upon these ideas to provide a new model for CDCR. To learn parameters, we implement a parameter estimation algorithm based on existing stochastic gradient-descent based algorithms and show how to further tune regularization parameters. The latent tree structure is then learned using MCMC inference. We show how structural regularization plays a critical role in the inference procedure. Finally, we empirically show that our model out-performs previous work, without using a sophisticated set of features.
by Eric Shyu.
M. Eng.
Martschat, Sebastian [Verfasser], and Michael [Akademischer Betreuer] Strube. "Structured Representations for Coreference Resolution / Sebastian Martschat ; Betreuer: Michael Strube." Heidelberg : Universitätsbibliothek Heidelberg, 2017. http://d-nb.info/1178009653/34.
Full textCai, Jie [Verfasser], and Michael [Akademischer Betreuer] Strube. "Coreference Resolution via Hypergraph Partitioning / Jie Cai ; Betreuer: Michael Strube." Heidelberg : Universitätsbibliothek Heidelberg, 2013. http://d-nb.info/1179924339/34.
Full textHe, Tian Ye. "Coreference resolution on entities and events for hospital discharge summaries." Thesis, Massachusetts Institute of Technology, 2007. http://hdl.handle.net/1721.1/45977.
Full textThesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2007.
The wealth of medical information contained in electronic medical records (EMRs) and Natural Language Processing (NLP) technologies that can automatically extract information from them have opened the doors to automatic patient-care quality monitoring and medical- assist question answering systems. This thesis studies coreference resolution, an information extraction (IE) subtask that links together specific mentions to each entity. Coreference resolution enables us to find changes in the state of entities and makes it possible to answer questions regarding the information thus obtained. We perform coreference resolution on a specific type of EMR, the hospital discharge summary. We treat coreference resolution as a binary classification problem. Our approach yields insights into the critical features for coreference resolution for entities that fall into five medical semantic categories that commonly appear in discharge summaries.
by Tian Ye He.
M.Eng.
Moosavi, Nafise Sadat [Verfasser], and Michael [Akademischer Betreuer] Strube. "Robustness in Coreference Resolution / Nafise Sadat Moosavi ; Betreuer: Michael Strube." Heidelberg : Universitätsbibliothek Heidelberg, 2020. http://d-nb.info/1205210539/34.
Full textKobdani, Hamidreza [Verfasser], and Hinrich [Akademischer Betreuer] Schütze. "A modular framework for coreference resolution / Hamidreza Kobdani. Betreuer: Hinrich Schütze." Stuttgart : Universitätsbibliothek der Universität Stuttgart, 2012. http://d-nb.info/1021923303/34.
Full textTeixeira, ElisÃngela Nogueira. "Syntactic and semantic preferences on coreference processing: Evidence from eye movements." Universidade Federal do CearÃ, 2013. http://www.teses.ufc.br/tde_busca/arquivo.php?codArquivo=9870.
Full textConselho Nacional de Desenvolvimento CientÃfico e TecnolÃgico
Esta tese tem como objetivo principal contribuir com o desenvolvimento dos estudos psicolinguÃsticos que procuram demonstrar experimentalmente conjecturas teÃricas a respeito do processamento anafÃrico. Tomando por base a Teoria da Acessibilidade (ARIEL, 1991, 2001), a Teoria da CentralizaÃÃo (GROSZ; JOSHI; WEINSTEIN, 1995), os trabalhos em torno da tipicidade do termo antecedente (GARROD; SANFORD, 1977; VAN GOMPEL; LIVERSEDGE; PEARSON, 2004), a HipÃtese da Carga Informacional (ALMOR, 1999) e a HipÃtese da PosiÃÃo do Antecedente (CARMINATI, 2002), trabalhamos com a hipÃtese de que, em perÃodos complexos por coordenaÃÃo e subordinaÃÃo, formados por no mÃximo duas oraÃÃes, a saliÃncia da posiÃÃo sintÃtica de sujeito à o principal fator para a resoluÃÃo anafÃrica em lÃngua portuguesa. Fazendo uso de metodologia experimental on-line e off-line, procuramos evidÃncias para nossa hipÃtese em um conjunto formado por quatro estudos, composto por (i) um experimento de compreensÃo de perÃodos complexos por coordenaÃÃo, em que foram manipulados a posiÃÃo do antecedente e o tipo de relaÃÃo semÃntica entre antecedente e anÃfora; (ii) um experimento de compreensÃo de perÃodos complexos por subordinaÃÃo, em que foram manipulados o tipo da correferÃncia anafÃrica, sob a forma de pronome pleno ou nulo, e a posiÃÃo da correferÃncia, anafÃrica ou catafÃrica; (iii) uma sondagem de produÃÃo de perÃodos complexos com uso de pronomes plenos ou nulos como correferentes; e (iv) uma anÃlise dos movimentos oculares durante a leitura de textos autÃnticos em lÃngua portuguesa com o objetivo de encontrar padrÃes de fixaÃÃo oculares. Os estudos foram realizados em um rastreador ocular de 120 Hz que registrou a cada 8 ms a movimentaÃÃo ocular dos participantes durante a leitura dos estÃmulos. As variÃveis dependentes de movimentaÃÃo ocular analisadas foram: (i) o nÃmero de fixaÃÃes; (ii) o tempo da primeira fixaÃÃo; (iii) a duraÃÃo mÃdia da fixaÃÃo ocular; e (iv) o tempo total de fixaÃÃo. A anÃlise conjunta dos resultados dos experimentos sugere que a resoluÃÃo da anÃfora correferencial nos perÃodos complexos estudados à uma funÃÃo da proeminÃncia sintÃtica da posiÃÃo de sujeito e que a carga de informaÃÃo das expressÃes anafÃricas com conteÃdo semÃntico parece levar a um aumento de custo durante o processamento anafÃrico de um antecedente altamente acessÃvel.
In this dissertation, our main objective is to contribute for the development and understanding of psycholinguistics studies that attempt to experimentally demonstrate relevant theoretical conjectures about anaphoric processing. Under the conceptual frameworks of the Theory of Accessibility (ARIEL, 1991, 2001), the Theory of Centering (GROSZ; JOSHI; WEINSTEIN, 1995), the studies on the typicality of the antecedent term (GARROD; SANFORD, 1977; VAN GOMPEL; LIVERSEDGE; PEARSON, 2004), the Informational Load Hypothesis (ALMOR, 1999), and the Position of Antecedent Hypothesis (CARMINATI, 2002), we propose that the prominence of the syntactic position in complex sentences plays a major role on the anaphoric resolution in the Portuguese language. Adopting a psycholinguistic methodology based on on-line (tracking of eye movements) as well as off-line observations, we searched for evidence to support our hypothesis from the results of the following set of studies: (i) an experiment to evaluate the comprehension of complex sentences due to coordination, in which both the position of the antecedent and the type of semantic relationship between antecedent and anaphora are manipulated; (ii) an experiment to evaluate the comprehension of complex sentences due to subordination, in which both the type of anaphoric coreference, in the form of a plain or null pronoun, and the position of the coreference, anaphoric or cataphoric, are manipulated; (iii) an experiment for generation of complex sentences, using plain or null pronouns as coreferentials; and (iv) a reading experiment of non-manipulated texts to establish a comparative standard for reading flux in Brazilian Portuguese. Our on-line experiments were performed with an eye-tracker of 120 Hz, which allowed eye movements to be recorded at each 8 milliseconds. The following dependent variables related with the eye movement have been analyzed: (i) the number of fixations; (ii) the duration time of the first fixation; (iii) the average duration of the fixations; and (iv) the total time of fixation. The overall analysis of our results, based on the investigation of complex sentences, suggests that the resolution of the coreferential anaphora is a function of the prominence of the subject position. Moreover, the information load of anaphoric expressions with semantic content seems to increase the cost of the anaphoric processing of a highly accessible antecedent.
Tomadaki, Eleftheria. "Cross-document coreference between different types of collateral texts for films." Thesis, University of Surrey, 2006. http://epubs.surrey.ac.uk/844096/.
Full textLassalle, Emmanuel. "Structured learning with latent trees : a joint approach to coreference resolution." Sorbonne Paris Cité, 2015. http://www.theses.fr/2015USPCC273.
Full textThis thesis explores ways to define automated coreference resolution systems by using structured machine leaming techniques. We design supervised models that leam to build coreference clusters from raw text: our main objective is to get model able to process documents globally, in a structurel fashion, to ensure coherent outputs. Our models are trained and evaluated on the English part of die CoNLL-2012 Shared Task annotated corpus with standard metrics. We carry out detailed comparisons of different settings so as to refine our models and design a complete end-to-end coreference resolver. Specifically, we first carry out a preliminary work on improving the way features are employed by linear models for classification: we extend existing work on separating different types of mention pairs to define more accurate classifiers of coreference links. We then define varions structured models based on latent trees to learn to build clusters globally, and not only from die predictions of a mention pair classifier. We study different latent representations (varions shapes and sparsity) and show empirically that die best suited structure is some restricted class of trees related to the best-first rule for selecting coreference links. We further improve this latent representation by integrating anaphoricity modelling jointly with coreference, designing a global (structured at the document level) and joint model outperforming existing models on gold mentions evaluation. We finally design a complete end-to-end resolver and evaluate the improvement obtained by our new modela on detected mentions, a more realistic setting for coreference resolution
Rösiger, Ina [Verfasser], and Jonas [Akademischer Betreuer] Kuhn. "Computational modelling of coreference and bridging resolution / Ina Rösiger ; Betreuer: Jonas Kuhn." Stuttgart : Universitätsbibliothek der Universität Stuttgart, 2019. http://d-nb.info/1184277826/34.
Full textSimeonov, Dimitar N. "The use of coreference resolution for understanding manipulation commands for the PR2 Robot." Thesis, Massachusetts Institute of Technology, 2012. http://hdl.handle.net/1721.1/77077.
Full textCataloged from PDF version of thesis.
Includes bibliographical references (p. 81-84).
Natural language interaction can enable us to interface with robots such as the Personal Robot 2 (PR2), without the need for a special training or equipment. Programming such a robot to follow commands is challenging because natural language has a complex structure and semantics, a model for which needs to be based on linguistic knowledge or learned from examples. In this thesis we first enable the PR2 robot to follow manipulation commands expressed in natural language by applying the Generalized Grounding Graph (G3 ). We model the PR2's actions and their trajectories in the physical environment, define the state-action space and learn a grounding model from an annotated corpus of robot actions aligned with commands. We achieved lower overall performance than previous implementations of G3 had reported. After that, we present an approach for using the linguistic technique of coreference resolution to improve the robot's ability to understand commands consisting of multiple clauses. We constrain the groundings for coreferent phrases to be identical by merging their nodes in the grounding graph. We show that using coreference information increases the robot ability to infer the right action sequence. This brings the robotic capabilities of modeling and understanding natural language closer to our theoretical understanding of discourse.
by Dimitar N. Simeonov.
M.Eng.
Jaffe, Evan. "The Role of Coreference Resolution in Memory- and Expectation-based Models of Human Sentence Processing." The Ohio State University, 2021. http://rave.ohiolink.edu/etdc/view?acc_num=osu1619104248552177.
Full textZhekova, Desislava Verfasser], Sandra [Akademischer Betreuer] [Kübler, and John A. [Akademischer Betreuer] Bateman. "Towards Multilingual Coreference Resolution / Desislava Zhekova. Gutachter: John Bateman ; Sandra Kübler. Betreuer: Sandra Kübler." Bremen : Staats- und Universitätsbibliothek Bremen, 2013. http://d-nb.info/1072078791/34.
Full textGriest, Kenneth Campbell. "An analysis of features used to train entity mention detection and coreference resolution classifiers." Connect to online resource, 2007. http://gateway.proquest.com/openurl?url_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation&res_dat=xri:pqdiss&rft_dat=xri:pqdiss:1447653.
Full textTourille, Julien. "Extracting Clinical Event Timelines : Temporal Information Extraction and Coreference Resolution in Electronic Health Records." Thesis, Université Paris-Saclay (ComUE), 2018. http://www.theses.fr/2018SACLS603/document.
Full textImportant information for public health is contained within Electronic Health Records (EHRs). The vast majority of clinical data available in these records takes the form of narratives written in natural language. Although free text is convenient to describe complex medical concepts, it is difficult to use for medical decision support, clinical research or statistical analysis.Among all the clinical aspects that are of interest in these records, the patient timeline is one of the most important. Being able to retrieve clinical timelines would allow for a better understanding of some clinical phenomena such as disease progression and longitudinal effects of medications. It would also allow to improve medical question answering and clinical outcome prediction systems. Accessing the clinical timeline is needed to evaluate the quality of the healthcare pathway by comparing it to clinical guidelines, and to highlight the steps of the pathway where specific care should be provided.In this thesis, we focus on building such timelines by addressing two related natural language processing topics which are temporal information extraction and clinical event coreference resolution.Our main contributions include a generic feature-based approach for temporal relation extraction that can be applied to documents written in English and in French. We devise a neural based approach for temporal information extraction which includes categorical features.We present a neural entity-based approach for coreference resolution in clinical narratives. We perform an empirical study to evaluate how categorical features and neural network components such as attention mechanisms and token character-level representations influence the performance of our coreference resolution approach
Grishina, Yulia [Verfasser], Manfred [Akademischer Betreuer] Stede, Manfred Gutachter] Stede, and Heike [Gutachter] [Zinsmeister. "Assessing the applicability of annotation projection methods for coreference relations / Yulia Grishina ; Gutachter: Manfred Stede, Heike Zinsmeister ; Betreuer: Manfred Stede." Potsdam : Universität Potsdam, 2019. http://nbn-resolving.de/urn:nbn:de:kobv:517-opus4-425378.
Full textGrishina, Yulia [Verfasser], Manfred [Akademischer Betreuer] Stede, Manfred [Gutachter] Stede, and Heike [Gutachter] Zinsmeister. "Assessing the applicability of annotation projection methods for coreference relations / Yulia Grishina ; Gutachter: Manfred Stede, Heike Zinsmeister ; Betreuer: Manfred Stede." Potsdam : Universität Potsdam, 2019. http://d-nb.info/1218404442/34.
Full textLenas, Erik. "Prerequisites for Extracting Entity Relations from Swedish Texts." Thesis, KTH, Skolan för elektroteknik och datavetenskap (EECS), 2020. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-281275.
Full textNatural Language Processing (NLP) är ett stort och aktuellt forskningsområde idag med många praktiska tillämpningar som sentimentanalys, textkategoriser- ing, maskinöversättning och automatisk textsummering. Forskningen är för när- varande mest inriktad på det engelska språket, men många andra språkområ- den försöker komma ikapp. Det här arbetet fokuserar på ett område inom NLP som kallas informationsextraktion, och mer specifikt relationsextrahering, det vill säga att extrahera relationer mellan namngivna entiteter i en text. Vad det här ar- betet försöker göra är att använda olika maskininlärningstekniker för att skapa en svensk Language Processing Pipeline bestående av part-of-speech tagging, de- pendency parsing, named entity recognition och coreference resolution. Denna pipeline är sedan tänkt att användas som en bas for senare relationsextrahering från svenskt arkivmaterial. Den uppenbara svårigheten med detta ligger i att det är ont om stora, annoterade svenska dataset. Till exempel så finns det inget till- räckligt stort svenskt dataset för coreference resolution. En stor del av detta arbete går därför ut på att skapa en svensk coreference solver genom att implementera distantly supervised machine learning, med vilket menas att använda en engelsk coreference solver på ett oannoterat engelskt-svenskt corpus, och sen använda en word-aligner för att översätta detta maskinannoterade engelska dataset till ett svenskt, och sen träna en svensk coreference solver på detta dataset. Det här arbetet använder Allen NLP:s end-to-end coreference solver, både för att skapa det svenska datasetet, och för att träna den svenska modellen, och uppnår en F1-score på 0.5. Vad gäller named entity recognition så använder det här arbetet Kungliga Bibliotekets BERT-modeller som bas, och uppnår genom detta en F1- score på 0.95. Spacy används som ett enande ramverk för att samla alla dessa NLP-komponenter inom en enda pipeline.
Ritz, Julia. "Discourse-givenness of noun phrases : theoretical and computational models." Phd thesis, Universität Potsdam, 2013. http://opus.kobv.de/ubp/volltexte/2014/7081/.
Full textDie vorliegende Arbeit gibt formale Definitionen der Konzepte Diskursgegebenheit, Koreferenz und Referenz. Zudem wird über Experimente berichtet, Nominalphrasen im Deutschen und Englischen hinsichtlich ihrer Diskursgegebenheit zu klassifizieren. Die Definitionen basieren auf Arbeiten von Bach (1987) zu Referenz, Kibble und van Deemter (2000) zu Koreferenz und der Diskursrepräsentationstheorie (Kamp und Reyle, 1993). In den Experimenten wurden die koreferenzannotierten Korpora MUC-7, OntoNotes und ARRAU (Englisch) und TüBa-D/Z (Deutsch) verwendet. Sie umfassen die Klassifikationsalgorithmen J48 (Entscheidungsbäume), Ripper (regelbasiertes Lernen) und lineare Support Vector Machines. Mehrere neue Klassifikationsmerkmale werden vorgeschlagen, die die Spezifizität der Nominalphrase messen, sowie ihren Kontext abbilden. Mit Hilfe dieser Merkmale kann eine signifikante Verbesserung der Klassifikation erreicht werden.
Goodsell, Thea. "Mental files." Thesis, University of Oxford, 2013. http://ora.ox.ac.uk/objects/uuid:7d7a1146-f770-4951-81a2-2b5dc42d2ecc.
Full textBatista-Navarro, Riza Theresa Bautista. "Information extraction from pharmaceutical literature." Thesis, University of Manchester, 2014. https://www.research.manchester.ac.uk/portal/en/theses/information-extraction-from-pharmaceutical-literature(3f8322b6-8b8d-44eb-a8cd-899026b267b9).html.
Full textRaghavan, Preethi. "MEDICAL EVENT TIMELINE GENERATION FROM CLINICAL NARRATIVES." The Ohio State University, 2014. http://rave.ohiolink.edu/etdc/view?acc_num=osu1397651496.
Full textKonstantinova, Natalia. "Knowledge acquisition from user reviews for interactive question answering." Thesis, University of Wolverhampton, 2013. http://hdl.handle.net/2436/297401.
Full textCastaño, André Casado. "Populando ontologias através de informações em HTML - o caso do currículo lattes." Universidade de São Paulo, 2008. http://www.teses.usp.br/teses/disponiveis/45/45134/tde-12082008-130204/.
Full textLattes Platform is the main database of Brazilian researchers resumés in use nowadays. It stores in a standardized form professional, academic, bibliographical productions and other data from these researchers. From these Lattes resumés database, several types of reports can be generated. The tools available for Lattes platform are unable to detect some of the problems that emerge when generating consolidated reports, such as citation duplicity or bibliographical productions misclassified by their authors, generating an incorrect number of publications. This problem demands a revision performed by the researcher on the reports generated, and the flaws of this process are the main inspiration for this project. In this work we use the Lattes platform resumés database as the source for populating an ontology that is intended to be used to generate reports. We analyze the whole process of information gathering from HTML files and its post-processing to insert them correctly in the ontology, according to its semantics. With this ontology correctly populated, we show some new reports that can be generated and we perform also an analysis of the methods and approaches used in the whole process, highlighting their strengths and weaknesses, detailing the dificulties faced in the automated populating process (instantiation) of an ontology.
Lima, Juciane Nóbrega. "Paralelismo e foco estrutural no processamento da correferência de pronomes e de nomes repetidos." Universidade Federal da Paraíba, 2014. http://tede.biblioteca.ufpb.br:8080/handle/tede/8418.
Full textMade available in DSpace on 2016-07-21T13:48:00Z (GMT). No. of bitstreams: 1 arquivo total.pdf: 1030830 bytes, checksum: affa607bacbace5e6fc95c51c5a74efe (MD5) Previous issue date: 2014-03-25
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - CAPES
The present study has aims to investigate the intrasentencial coreference processing, observing how the processing of coreference of pronouns and repeated names occurs in relation to the focus of their antecedents. We take as initial hypothesis that repeated names take an extra cost to be processed than pronouns, despite the antecedent is more salient or not. This is nominated by the Repeated Name Penalty, postulated by the Informational Load Hypothesis. Almor (1999, 2000). We performed two online self-paced reading, through Psyscope program. The dependent variable of the two experiments was the reading time of the critical segment (repeated name or pronoun) and the independent variables were the type of resumption (pronoun or repeated name) and the position of the antecedent (focused or unfocused) . The difference between the two experiments was that at the first we controlled all experimental sentences to contain pronouns and repeated names in the same position and syntactic function of its background, so in parallel . The second was controlled in order not to contain the conditions in parallel. 28 students from UFPB participated in each experiment. The results of the first experiment show a lower reading time for the pronouns in relation to repeated names regardless if its antecedent was focused or not .The structural focus showed no significant effect on any of the experimental conditions. A possible explanation would be that the effect of structural parallelism overlapped the effect of focus. That's what the results of the second experiment showed. The resumption and antecedent not in parallel time resulted in an significant effect of structural focus. The reading time was faster when the antecedent was focused than when it was not. It was also confirmed Repeated Name Penalty also in this second experiment.
O presente estudo tem como objeto de investigação o processamento correferencial intersentencial, procurando observar como se dá o processamento da correferência de pronomes e de nomes repetidos em relação ao foco dos seus respectivos antecedentes. Tomamos como hipótese inicial que nomes repetidos teriam o processamento mais custoso do que os pronomes, independente da saliência do antecedente. Ou seja, haveria Penalidade do Nome Repetido, postulada pela Hipótese da Carga Informacional de Almor (1999; 2000). Para isso, realizamos dois experimentos com uma tarefa on-line de leitura automonitorada (self-paced reading), por meio do programa Psyscope. A variável dependente dos dois experimentos foi o tempo de leitura do segmento crítico (nome repetido ou pronome). E as variáveis independentes foram: o tipo de retomada (pronome ou nome repetido) e a posição do antecedente (focalizado ou não focalizado). A diferença entre os dois experimentos foi que no primeiro controlamos para que em todas as frases experimentais contivessem pronomes e nomes repetidos na mesma posição e função sintática de seus antecedentes, ou seja, em paralelo. Já no segundo controlamos para que em nenhuma das condições tivessem antecedente e retomada em paralelo. O total de participantes voluntários foi de 28 estudantes da UFPB em cada experimento. Os resultados do primeiro experimento mostram menor tempo de leitura para os pronomes em relação aos nomes repetidos independentemente se o seu antecedente estivesse focalizado ou não. Já o foco estrutural não mostrou efeito significativo em nenhuma das condições experimentais. Uma possível explicação seria a de que o efeito do paralelismo estrutural se sobrepôs ao efeito do foco. Foi o que os resultados do segundo experimento demonstraram. Dessa vez, com retomada e antecedente não paralelo, o efeito de foco estrutural se mostrou significativo, ou seja, a leitura foi mais rápida quando o antecedente estava focalizado do que quando não estava. E foi confirmada Penalidade do Nome Repetido também nesse segundo experimento.
Kaumanns, Franz David [Verfasser], and Hinrich [Akademischer Betreuer] Schütze. "Assessment and analysis of the applicability of recurrent neural networks to natural language understanding with a focus on the problem of coreference resolution / Franz David Kaumanns ; Betreuer: Hinrich Schütze." München : Universitätsbibliothek der Ludwig-Maximilians-Universität, 2016. http://d-nb.info/1121507999/34.
Full textSilva, Jefferson Fontinele da. "Resolução de correferência em múltiplos documentos utilizando aprendizado não supervisionado." Universidade de São Paulo, 2011. http://www.teses.usp.br/teses/disponiveis/55/55134/tde-19072011-144521/.
Full textOne of the problems found in Natural Language Processing (NLP) systems is the difficulty of identifying textual elements that refer to the same entity. This phenomenon, in which the set of textual elements refers to a single entity, is called coreference. Coreference resolution systems can improve the performance of various NLP applications, such as automatic summarization, information extraction systems, question answering systems. Recently, research in NLP has explored the possibility of identifying the coreferent elements in multiple documents. In this context, this work focuses on the development of an unsupervised method for coreference resolution in multiple documents, using Portuguese as the target language. Until now, it is not known any system for this purpose for the Portuguese. The results of the experiments with the system suggest that the developed method is superior to methods based on string matching
Huang, Yin Jou. "Event Centric Approaches in Natural Language Processing." Doctoral thesis, Kyoto University, 2021. http://hdl.handle.net/2433/265210.
Full textFonseca, Evandro Brasil. "Resolu??o de correfer?ncia nominal usando sem?ntica em l?ngua portuguesa." Pontif?cia Universidade Cat?lica do Rio Grande do Sul, 2018. http://tede2.pucrs.br/tede2/handle/tede/8169.
Full textApproved for entry into archive by Sheila Dias (sheila.dias@pucrs.br) on 2018-06-26T14:40:39Z (GMT) No. of bitstreams: 1 EVANDRO BRASIL FONSECA_TES.pdf: 1972824 bytes, checksum: 9fca0c499753cd9d2822c59040e826bf (MD5)
Made available in DSpace on 2018-06-26T14:48:46Z (GMT). No. of bitstreams: 1 EVANDRO BRASIL FONSECA_TES.pdf: 1972824 bytes, checksum: 9fca0c499753cd9d2822c59040e826bf (MD5) Previous issue date: 2018-03-19
Coreference Resolution task is challenging for Natural Language Processing, considering the required linguistic knowledge and the sophistication of language processing techniques involved. Even though it is a demanding task, a motivating factor in the study of this phenomenon is its usefulness. Basically, several Natural Language Processing tasks may benefit from their results, such as named entities recognition, relation extraction between named entities, summarization, sentiment analysis, among others. Coreference Resolution is a process that consists on identifying certain terms and expressions that refer to the same entity. For example, in the sentence ? France is refusing. The country is one of the first in the ranking... ? we can say that [the country] is a coreference of [France]. By grouping these referential terms, we form coreference groups, more commonly known as coreference chains. This thesis proposes a process for coreference resolution between noun phrases for Portuguese, focusing on the use of semantic knowledge. Our proposed approach is based on syntactic-semantic linguistic rules. That is, we combine different levels of linguistic processing, using semantic relations as support, in order to infer referential relations between mentions. Models based on linguistic rules have been efficiently applied in other languages, such as: English, Spanish and Galician. In few words, these models are more efficient than machine learning approaches when we deal with less resourceful languages, since the lack of sample-rich corpora may produce a poor training. The proposed approach is the first model for Portuguese coreference resolution which uses semantic knowledge. Thus, we consider it as the main contribution of this thesis.
A tarefa de Resolu??o de Correfer?ncia ? um grande desafio para a ?rea de Processamento da Linguagem Natural, tendo em vista o conhecimento lingu?stico exigido e a sofistica??o das t?cnicas de processamento da l?ngua empregados. Mesmo sendo uma tarefa desafiadora, um fator motivador do estudo deste fen?meno se d? pela sua utilidade. Basicamente, v?rias tarefas de Processamento da Linguagem Natural podem se beneficiar de seus resultados, como, por exemplo, o reconhecimento de entidades nomeadas, extra??o de rela??o entre entidades nomeadas, sumariza??o, an?lise de sentimentos, entre outras. A Resolu??o de Correfer?ncia ? um processo que consiste em identificar determinados termos e express?es que remetem a uma mesma entidade. Por exemplo, na senten?a ?A Fran?a est? resistindo. O pa?s ? um dos primeiros no ranking...? podemos dizer que [o pa?s] ? uma correfer?ncia de [A Fran?a]. Realizando o agrupamento desses termos referenciais, formamos grupos de men??es correferentes, mais conhecidos como cadeias de correfer?ncia. Esta tese prop?e um processo para a resolu??o de correfer?ncia entre sintagmas nominais para a l?ngua portuguesa, tendo como foco a utiliza??o do conhecimento sem?ntico. Nossa abordagem proposta ? baseada em regras lingu?sticas sint?tico-sem?nticas. Ou seja, combinamos diferentes n?veis de processamento lingu?stico utilizando rela??es sem?nticas como apoio, de forma a inferir rela??es referenciais entre men??es. Modelos baseados em regras lingu?sticas t?m sido aplicados eficientemente em outros idiomas como o ingl?s, o espanhol e o galego. Esses modelos mostram-se mais eficientes que os baseados em aprendizado de m?quina quando lidamos com idiomas menos providos de recursos, dado que a aus?ncia de corpora ricos em amostras pode prejudicar o treino desses modelos. O modelo proposto nesta tese ? o primeiro voltado para a resolu??o de correfer?ncia em portugu?s que faz uso de conhecimento sem?ntico. Dessa forma, tomamos este fator como a principal contribui??o deste trabalho.
Adamček, Adam. "Metody extrakce informací." Master's thesis, Vysoké učení technické v Brně. Fakulta informačních technologií, 2015. http://www.nusl.cz/ntk/nusl-234967.
Full textCorreia, Débora Vasconcelos. "Relações entre memória procedimental e linguagem em pessoas que gaguejam: um estudo com base no processamento da correferência anafórica em português brasileiro." Universidade Federal da Paraíba, 2014. http://tede.biblioteca.ufpb.br:8080/handle/tede/6426.
Full textCoordenação de Aperfeiçoamento de Pessoal de Nível Superior
This dissertation aims to explain how is the processing of coreference in people who stutter (PWS), reflecting on the possibility of an association between stuttering and the presence of difficulties in procedural memory, from the relationship between Alm's Dual Premotor Model (2005) and Ullman´s Declarative/Procedural Model (2001). It is proposed, then, a hypothesis about the connection between the presence of dysfunctions in procedural memory and the linguistic processing of PQG, which was investigated through the ASRT test (Alternating Serial Reaction Time) of procedural memory and two experiments of self-paced reading to the investigation of the phenomenon of inter and intrasentential coreference. In the ASRT test (experiment 1) performed to measure the degree of implicit learning of the participants, the findings suggested a tendency of the groups (PQC and FF) to behave distinctively. PQG showed a pattern of ascending curve, with a positive Spearman's coefficient for the variable cycle, expressing an increase in time of reaction as it increased the number of cycles (stimuli). Which we interpreted as a possible difficulty in the PQG in implicit learning of motor sequences. And the FF showed a descending curve, confirmed by a negative Spearman's coefficient for the variable cycle. Demonstrating that the procedural learning for this group occurred quickly, i.e., the reaction time of the FF reduced as there was an increase in the number of cycles. With these indications that PQG present difficulties in procedural memory, which could interfere in the processing of grammatical aspects according to our hypothesis, we set out to the investigation of the linguistic processing. In experiment 2, the intersentential coreference, performed with the aim at investigating the processing of lexical pronoun (PR) and the repeated name (NR) in the object position between FF and PQC, the results showed that there is no difference in this type of processing between FF and PQC, since both groups showed similar patterns in the average reading time of the critical segment. However, there were a significant effect for the variable tipo de retomada, showing that PR are processed faster than the NR, as previously found by Leitão (2005). Thus, in order to investigate how was grammar functioning in PQG and to attest the hypothesis defended in this dissertation, we set out to the analysis of the phenomenon of coreference in the intrassentential level, in order to isolate the grammatical aspect and eliminate possible interference from the pragmatic and contextual factors. The results pointed to the absence of main effect for the variable group, however, we found a marginally significant interaction effect between the variables group and type of sentence. This interaction can be explained by the fact that the groups react differently to the conditions, departing from the observation that there is an inverse behavior between them, i.e., to the extent that FF are faster in the grammatical condition and slower in agramatical condition, PQG show the opposite pattern. Which corroborates our hypothesis that PQG would have difficulties in perception of breach of grammatical principle. This possibility, confirmed by the statistical evidence foreseen for our findings with the increase of sample, that it directs our search for rejecting the null hypothesis.
Esta dissertação tem por objetivo explanar como se dá o processamento da correferência em pessoas que gaguejam (PQG), refletindo sobre a possibilidade de associação entre a gagueira e a presença de dificuldades na memória procedimental, a partir da relação entre o Modelo Pré-Motor Duplo de Alm (2005) e o Modelo Declarativo/Procedimental de Ullman (2001). Lança-se, então, uma hipótese acerca da conexão entre a presença de disfunções na memória procedimental e o processamento linguístico das PQG, investigada por meio do teste ASRT (Alternating Serial Reaction Time) de memória procedimental e dois experimentos de leitura automonitorada para a investigação do fenômeno da correferência inter e intrassentencial. No teste ASRT (experimento 1) realizado para medir o grau de aprendizagem implícita dos participantes, os resultados encontrados apontaram para uma tendência dos grupos (PQG e FF) a comportarem-se de maneira distinta. As PQG evidenciaram um padrão de curva ascendente, com coeficiente de Spearman positivo para a variável ciclo, expressando um aumento do tempo de reação à medida que se aumentava o número de ciclos (estímulos). O que interpretamos como uma possível dificuldade das PQG na aprendizagem implícita das sequências motoras. E os FF evidenciaram uma curva descendente, confirmada pelo coeficiente de Spearman negativo para a variável ciclo. Demonstrando que a aprendizagem procedimental para este grupo ocorreu de maneira mais rápida, ou seja, o tempo de reação dos FF reduzia à medida que se aumentava o número de ciclos. De posse desses indícios de que as PQG apresentam dificuldades na memória procedimental, o que poderia interferir no processamento dos aspectos gramaticais de acordo com a nossa hipótese, partimos para a investigação do processamento linguístico. No experimento 2, de correferência intersentencial, realizado com o intuito de investigar o processamento do pronome lexical (PR) e do nome repetido (NR) em posição de objeto entre FF e PQG, os resultados obtidos evidenciaram que não há diferença nesse tipo de processamento entre FF e PQG, uma vez que ambos os grupos apresentaram padrões semelhantes no tempo médio de leitura do segmento crítico. No entanto, houve efeito significativo para a variável tipo de retomada, constatando que os PR são mais rapidamente processados do que o NR, conforme já encontrado em Leitão (2005). Dessa forma, a fim de investigar como se dava o funcionamento da gramática nas PQG e atestar de modo mais categórico a hipótese defendida nesta dissertação, partimos para a análise do fenômeno da correferência em nível intrassentencial, objetivando isolar o aspecto gramatical e eliminar as possíveis interferências dos fatores pragmáticos e contextuais. Os resultados obtidos apontaram a ausência de efeito principal para a variável grupo, no entanto, constatou-se um efeito de interação marginalmente significativo entre as variáveis grupo e tipo de sentença. Essa interação pode ser explicada pelo fato de os grupos reagirem diferentemente às condições, partindo da observação que há um comportamento invertido entre eles, ou seja, na medida em que os FF s são mais rápidos na condição gramatical e mais lentos na condição agramatical, as PQG apresentam o padrão oposto. O que corrobora com a nossa hipótese de que as PQG teriam dificuldades na percepção da violação do princípio gramatical. Possibilidade essa, confirmada por meio das evidências estatísticas previstas para os nossos resultados com o aumento da amostra, que direciona a nossa pesquisa para a rejeição da hipótese nula.
Wetzel, Dominikus Emanuel. "Entity-based coherence in statistical machine translation : a modelling and evaluation perspective." Thesis, University of Edinburgh, 2018. http://hdl.handle.net/1842/31522.
Full textShankaranarayanan, S. "Detection of Coreferences in Automatic Specifications Analysis." Thesis, Virginia Tech, 1994. http://hdl.handle.net/10919/42360.
Full textMaster of Science
Gonçalves, Patrícia Nunes. "CorrefSum: revisão da coesão referencial em sumários extrativos." Universidade do Vale do Rio do Sinos, 2008. http://www.repositorio.jesuita.org.br/handle/UNISINOS/2264.
Full textCoordenação de Aperfeiçoamento de Pessoal de Nível Superior
Com o avanço da Internet, cada vez mais convivemos com a sobrecarga de informação. É nesse contexto que a área de sumarização automática de textos tem se tornado uma área proeminente de pesquisa. A sumarização é o processo de discernir as informações mais importantes dos textos para produzir uma versão resumida. Sumarizadores extrativos escolhem as sentenças mais relevantes do texto e as reagrupam para formar o sumário. Muitas vezes, as frases selecionadas do texto não preservam a coesão referencial necessária para o entendimento do texto. O foco deste trabalho é, portanto, na análise e recuperação da coesão referencial desses sumários. O objetivo é desenvolver um sistema que realiza a manutenção da coesão referencial dos sumários extrativos usando como fonte de informação as cadeias de correferência presentes no texto-fonte. Para experimentos e avaliação dos resultados foram utilizados dois sumarizadores: Gist-Summ e SuPor-2. Foram utilizadas duas formas de avaliação: automática e subjetiva. Os resultados
With the advance of Internet technology we see the problem of information overload. In this context, automatic summarization is an important research area. Summarization is the process of identifying the most relevant information brought about in a text and on that basis to rewrite a short version of it. Extractive summarizers choose the most relevant sentences in a text and regroup them to form the summary. Usually the juxtaposition of the selected sentences violate the referential cohesion that is needed for the interpretation of the text. This work focuses on the analysis and recovery of referential cohesion of extractive summaries on the basis of knowledge about correference chains as presented in the source text. Some experiments were undertaken considering the summarizers GistSumm and SuPor-2. Evaluation was done in two ways, automatically and subjectively. The results indicate that this is a promising area of work and ways of advancing in this research are discussed
BOURGEOIS, ROBERT. "Iceo. Intension, coreferences et objets dans la federation de formalismes de specification." Paris 6, 1990. http://www.theses.fr/1990PA066425.
Full textVersley, Yannick [Verfasser], and Erhard [Akademischer Betreuer] Hinrichs. "Resolving Coreferent Bridging in German Newspaper Text / Yannick Versley ; Betreuer: Erhard Hinrichs." Tübingen : Universitätsbibliothek Tübingen, 2010. http://d-nb.info/1161803114/34.
Full text