Academic literature on the topic 'Corpus-based data'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the lists of relevant articles, books, theses, conference reports, and other scholarly sources on the topic 'Corpus-based data.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Journal articles on the topic "Corpus-based data"
Gu, Chonglong. "Corpus triangulation: combining data and methods in corpus-based translation studies." Translator 24, no. 1 (December 6, 2017): 107–10. http://dx.doi.org/10.1080/13556509.2018.1411639.
Full textGamper, Johann, and Oliviero Stock. "Corpus-based terminology." Terminology 5, no. 2 (December 31, 1998): 147–59. http://dx.doi.org/10.1075/term.5.2.05gam.
Full textWolk, Christoph, and Benedikt Szmrecsanyi. "Probabilistic corpus-based dialectometry." Journal of Linguistic Geography 6, no. 1 (April 2018): 56–75. http://dx.doi.org/10.1017/jlg.2018.6.
Full textKhamis, Noorli. "Corpus-based Data for Determining Specialised Language Features." International Journal of Advanced Trends in Computer Science and Engineering 9, no. 1 (February 15, 2020): 36–41. http://dx.doi.org/10.30534/ijatcse/2020/07912020.
Full textMikulová, Marie, Eduard Bejček, Veronika Kolářová, and Jarmila Panevová. "Subcategorization of Adverbial Meanings Based on Corpus Data." Journal of Linguistics/Jazykovedný casopis 68, no. 2 (December 1, 2017): 268–77. http://dx.doi.org/10.1515/jazcas-2017-0036.
Full textBloothooft, Gerrit. "Corpus-based Name Standardization." History and Computing 6, no. 3 (October 1994): 153–67. http://dx.doi.org/10.3366/hac.1994.6.3.153.
Full textSzmrecsanyi, Benedikt, and Christoph Wolk. "Holistic corpus-based dialectology." Revista Brasileira de Linguística Aplicada 11, no. 2 (2011): 561–92. http://dx.doi.org/10.1590/s1984-63982011000200011.
Full textEscudero-Mancebo, David, and Valentín Cardeñoso-Payo. "Applying data mining techniques to corpus based prosodic modeling." Speech Communication 49, no. 3 (March 2007): 213–29. http://dx.doi.org/10.1016/j.specom.2007.01.008.
Full textLyddon, Paul. "Discovering Language Properties through Corpus-Based Dictionary Data Analysis." Vocabulary Learning and Instruction 6, no. 2 (2017): 61–70. http://dx.doi.org/10.7820/vli.v06.2.lyddon.
Full textde Monnink, Inge. "Combining Corpus and Experimental Data." International Journal of Corpus Linguistics 4, no. 1 (August 13, 1999): 77–111. http://dx.doi.org/10.1075/ijcl.4.1.05mon.
Full textDissertations / Theses on the topic "Corpus-based data"
Nolli, Carla Fernanda. "Data-driven learning and corpus-based approaches in language education." Florianópolis, SC, 2006. http://repositorio.ufsc.br/xmlui/handle/123456789/88465.
Full textMade available in DSpace on 2012-10-22T09:21:53Z (GMT). No. of bitstreams: 0
This study focuses on the analysis of conditional sentences examples found in teaching materials (textbooks and grammar books) and compares them with a large corpus in order to verify their frequency and authenticity. In order to do so, the comparison was carried out with the help of a corpus analysis software, which generated a concordance list of the word if. These tokens were analyzed and classified in order to distinguish the three types of conditional sentences studied in this thesis. One of the purposes of this research is also to shed light on an approach that still remains largely unexplored in Brazil, namely Data-Driven Learning (DDL), which explores teaching and learning through corpus linguistics. Este estudo se concentra na análise de exemplos de sentenças condicionais em materiais de ensino (livros textos e gramáticas) e compara-os com um corpus lingüístico a fim de verificar sua freqüência e autenticidade. Para isso, a comparação foi realizada com a ajuda de um software de análise de corpus, que gerou uma lista de concordâncias com a palavra if. Todos os exemplos foram analisados e classificados a fim de detectar os três tipos de sentenças condicionais estudadas nesta dissertação. Um dos objetivos desta pesquisa é também dar ênfase a uma metodologia que ainda permanece muito inexplorada no Brasil, chamada de Aprendizagem a Partir de Dados, que explora o ensino e a aprendizagem através de lingüística de corpus.
Adolphs, Svenja. "Linking lexico-grammar and speech acts : a corpus-based approach." Thesis, University of Nottingham, 2001. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.391412.
Full textMarchewka, Katarzyna M. "Gender agreement in Polish : a study based on elicitation and corpus data." Thesis, University of Surrey, 2016. http://epubs.surrey.ac.uk/809946/.
Full textWang, Lixum. "The use of parallel texts in language learning : computer software and teaching materials for English and Chinese." Thesis, University of Birmingham, 2000. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.368990.
Full textZhang, Min, and 張珉. "Using corpus data in a MOODLE-based self-learning course : teaching education students to 'cite like an academic'." Thesis, The University of Hong Kong (Pokfulam, Hong Kong), 2015. http://hdl.handle.net/10722/211141.
Full textpublished_or_final_version
Education
Doctoral
Doctor of Philosophy
Tsiros, Augoustinos. "A multidimensional sketching interface for visual interaction with corpus-based concatenative sound synthesis." Thesis, Edinburgh Napier University, 2016. http://researchrepository.napier.ac.uk/Output/463438.
Full textVieira, Nataliya Godinho Soares. "Training and discovering corpus-based data driven exercices in english teaching (L2/FL) to native speakers of portuguese (L1)." Master's thesis, Faculdade de Ciências Sociais e Humanas, Universidade Nova de Lisboa, 2012. http://hdl.handle.net/10362/7422.
Full textConsiderando o rápido desenvolvimento das novas tecnologias e o seu uso no ensino de línguas estrangeiras, Linguística de Corpus oferece novas ferramentas e materiais que enriquecem a aprendizagem de uma segunda língua. Este projecto apresenta um quadro de princípios teóricos relacionados com os corpora online e propõe os exemplos de training e discovering corpus-based data-driven exercícios, que são uma contribuição original para o ensino/aprendizagem de Inglês (L2) aos falantes nativos da língua Portuguesa (L1). Os data-driven exercícios, com base em concordâncias extraídas de corpora, proporcionam um ensino-descoberta e envolvem os alunos numa "aprendizagemdescoberta", enriquecendo, deste modo, o desenvolvimento pessoal dos professores e dos alunos. Múltiplas são as finalidades pedagógicas deste projecto relacionadas com a utilização da data-driven learning (DDL) abordagem assim como a aplicação dos recursos baseados em TIC no ensino/aprendizagem das línguas estrangeiras.
Garcia, William Danilo. "Fanfictions, linguística de corpus e aprendizagem direcionada por dados : tarefas de produção escrita com foco no uso autêntico de língua e atividades que visam à autonomia dos alunos de letras em analisar preposições /." São José do Rio Preto, 2020. http://hdl.handle.net/11449/192699.
Full textResumo: A relação da Linguística de Corpus com o Ensino de Línguas, apesar de receber foco mesmo antes do advento dos computadores, se intensificou por volta da década de 90, momento em que pesquisas em corpora de aprendizes e em Aprendizagem Direcionada por Dados foram enfatizadas. Considerado esse estreitamento, esta pesquisa objetiva compilar quatro corpora de aprendizes a partir do uso autêntico da língua com o intuito de desenvolver atividades didáticas direcionadas por dados dos próprios alunos que promovam nos discentes um perfil autônomo de investigação linguística (mais precisamente das preposições with, in, on, at, for e to). No tocante à fundamentação teórica, destacam-se Prabhu (1987), Skehan (1996), Willis (1996), Nunan (2004) e Ellis (2006) a respeito do Ensino de Línguas por Tarefas, Jenkins (2012) e Neves (2014) que discorrem sobre as ficções de fã. Já sobre a Linguística de Corpus, tem-se Sinclair (1991), Berber Sardinha (2000) e Viana (2011). Granger (1998, 2002, 2013) mais relacionado a Corpus de Aprendizes, e Johns (1991, 1994), Berber Sardinha (2011) e Boulton (2010) no que diz respeito à Aprendizagem Direcionada por Dados. Como metodologia, levantaram-se textos escritos pelos alunos a partir de uma tarefa de produção escrita em que eles redigiram uma ficção de fã. Em seguida, esses textos formaram dois corpora de aprendizes iniciais, que foram analisados com o auxílio da ferramenta AntConc (ANTHONY, 2018) no intuito de observar a presença ou não de inadequações ... (Resumo completo, clicar acesso eletrônico abaixo)
Abstract: Although the relation between Corpus Linguistics and Language Teaching has been emphasized even before the advent of computers, it has been highlighted around the 90s. This was the moment when research on learner corpora and Data-Driven Learning was focused. Having said that, this study aimed to compile four learner corpora based on the authentic use of the language. This was done in order to develop data-driven teaching activities that could promote, among the students, an autonomous profile of linguistic investigation (more precisely about the prepositions with, in, on, at, for and to). Concerning the existing literature, we highlight the works of Prabhu (1987), Skehan (1996), Willis (1996), Nunan (2004) and Ellis (2006) about Task-Based Language Teaching, and Jenkins (2012) and Neves (2014) about fanfictions. In relation to Corpus Linguistics, this study is based on Sinclair (1991), Berber Sardinha (2000) and Viana (2011). Granger (1998, 2012, 2013) is referenced to define learner corpora, and Johns (1991, 1994), Berber Sardinha (2011) and Boulton (2010) to discuss Data-Driven Learning. The methodological approach involved the collection of the compositions from Language Teaching undergraduate students who developed a writing task in which they had to write a fanfiction. These texts composed two learner corpora, which were analyzed with the AntConc tool (ANTHONY, 2018) with the purpose of observing the occurrence of prepositions in English and whether they were accurately ... (Complete abstract click electronic access below)
Mestre
Gentilini, Livia. "La terminologia della sicurezza informatica nella banca dati FranceTerme: un'analisi corpus-based." Master's thesis, Alma Mater Studiorum - Università di Bologna, 2019. http://amslaurea.unibo.it/17696/.
Full textGhisi, Daniele. "Music across music : towards a corpus-based, interactive computer-aided composition." Thesis, Paris 6, 2017. http://www.theses.fr/2017PA066561/document.
Full textThe reworking of existing music in order to build new one is a quintessential characteristic of the Western musical tradition. This thesis proposes and discusses my personal approach to the subject: the borrowing of music fragments from large-scale corpora (containing audio samples as well as symbolic scores) in order to build a low-level, descriptor-based palette of grains. Parameters are handled via digital hybrid scores, in order to equip corpus-based composition with the control of notational practices. This thesis also introduces the dada library, providing Max with the ability to organize, select and generate musical content via a set of graphical interfaces manifesting an exploratory approach towards music composition. Its modules address a range of scenarios, including, but not limited to, database visualization, score segmentation and analysis, concatenative synthesis, music generation via physical or geometrical modelling, wave terrain synthesis, graph exploration, cellular automata, swarm intelligence, and videogames. The library is open-source and it fosters a performative approach to computer-aided composition. Finally, this thesis addresses the issue of whether classical representation of music, disentangled in the standard set of traditional parameters, is optimal. Two possible alternatives to orthogonal decompositions are presented: grain-based score representations, inheriting techniques from corpus-based composition, and unsupervised machine learning models, providing entangled, `agnostic' representations of music. The thesis also details my first experience of collaborative writing within the /nu/thing collective
Books on the topic "Corpus-based data"
Müller-Landmann, Sonja. Corpus-based parse pruning: Applying empirical data to symbolic knowledge. Saarbrücken: DFKI, 2000.
Find full textCorpus-based studies of lesser-described languages: The CorpAfroAs corpus of spoken AfroAsiatic languages. Amsterdam: John Benjamins Publishing Company, 2015.
Find full textStudies in authorship recognition: A corpus-based approach. Frankfurt am Main: P. Lang, 1999.
Find full textCorpus-based analyses of the problem-solution pattern: A phraseological approach. Amsterdam: John Benjamins Pub., 2008.
Find full textPostmodifying clauses in the English noun phrase: A corpus-based study. Amsterdam: Rodopi, 1989.
Find full textBasciano, Bianca, Franco Gatti, and Anna Morbiato. Corpus-Based Research on Chinese Language and Linguistics. Venice: Fondazione Università Ca’ Foscari, 2020. http://dx.doi.org/10.30687/978-88-6969-406-6.
Full textWohlgenannt, Gerhard. Learning ontology relations by combining corpus-based techniques and reasoning on data from semantic web sources. Frankfurt am Main: P. Lang, 2011.
Find full textWohlgenannt, Gerhard. Learning Ontology Relations by Combining Corpus-Based Techniques and Reasoning on Data from Semantic Web Sources. Bern: Peter Lang International Academic Publishers, 2018.
Find full textHundt, Marianne. English mediopassive constructions: A cognitive, corpus-based study of their origin, spread, and current status. Amsterdam: Rodopi, 2004.
Find full textBook chapters on the topic "Corpus-based data"
Gries, Stefan Th. "Corpus data in usage-based linguistics." In Human Cognitive Processing, 237–56. Amsterdam: John Benjamins Publishing Company, 2011. http://dx.doi.org/10.1075/hcp.32.15gri.
Full textViereck, Wolfgang. "The Atlas Linguarum Europae: A diachronic analysis of its data." In Corpus-based Analysis and Diachronic Linguistics, 21–36. Amsterdam: John Benjamins Publishing Company, 2011. http://dx.doi.org/10.1075/tufs.3.04vie.
Full textYaneva, Victoria, Shiva Taslimipoor, Omid Rohanian, and Le An Ha. "Cognitive Processing of Multiword Expressions in Native and Non-native Speakers of English: Evidence from Gaze Data." In Computational and Corpus-Based Phraseology, 363–79. Cham: Springer International Publishing, 2017. http://dx.doi.org/10.1007/978-3-319-69805-2_26.
Full textPooley, Tim. "The uneasy interface: Methodological issues in using data from traditional and urban dialectology in (re-)constructing sociolinguistic history." In Corpus-Based Perspectives in Linguistics, 169–89. Amsterdam: John Benjamins Publishing Company, 2007. http://dx.doi.org/10.1075/ubli.6.13poo.
Full textYoshitomi, Asako. "Testing the primacy of aspect and reverse order hypothesis in Japanese returnees: Towards constructing a corpus of second language attrition data." In Corpus-Based Perspectives in Linguistics, 371–89. Amsterdam: John Benjamins Publishing Company, 2007. http://dx.doi.org/10.1075/ubli.6.25yos.
Full textWang, Xingfu, Zhongfu Wu, Yan Li, Qian Huang, and Jinglu Hui. "Corpus-Based Analysis of the Co-occurrence of Chinese Antonym Pairs." In Advanced Data Mining and Applications, 500–507. Berlin, Heidelberg: Springer Berlin Heidelberg, 2010. http://dx.doi.org/10.1007/978-3-642-17313-4_50.
Full textMaes, Francis, Ludovic Denoyer, and Patrick Gallinari. "Corpus-Based Structure Mapping of XML Document Corpora: A Reinforcement Learning Based Model." In Modeling, Learning, and Processing of Text Technological Data Structures, 249–66. Berlin, Heidelberg: Springer Berlin Heidelberg, 2011. http://dx.doi.org/10.1007/978-3-642-22613-7_13.
Full textOliveira, Francisco, Fai Wong, Anna Ho, Yiping Li, and Mingchui Dong. "Overcoming Data Sparseness Problem in Statistical Corpus Based Sense Disambiguation." In Computational Methods in Engineering & Science, 314. Berlin, Heidelberg: Springer Berlin Heidelberg, 2006. http://dx.doi.org/10.1007/978-3-540-48260-4_160.
Full textPimentel, Braulio Andres Soncco, and Roxana L. Q. Portugal. "Fake News in Spanish: Towards the Building of a Corpus Based on Twitter." In Information Management and Big Data, 333–39. Cham: Springer International Publishing, 2020. http://dx.doi.org/10.1007/978-3-030-46140-9_32.
Full textSun, Jiawen. "A Corpus-Based Multi-dimensional Study of Tourism English Register Features." In Lecture Notes on Data Engineering and Communications Technologies, 262–68. Singapore: Springer Singapore, 2021. http://dx.doi.org/10.1007/978-981-16-5854-9_33.
Full textConference papers on the topic "Corpus-based data"
Tian, Xueqin. "Foreign Language Writing Based on Corpus-based Data-driven." In 4th International Conference on Management Science, Education Technology, Arts, Social Science and Economics 2016. Paris, France: Atlantis Press, 2016. http://dx.doi.org/10.2991/msetasse-16.2016.110.
Full textWawer, Aleksander, and Dominika Rogozinska. "How Much Supervision? Corpus-Based Lexeme Sentiment Estimation." In 2012 IEEE 12th International Conference on Data Mining Workshops. IEEE, 2012. http://dx.doi.org/10.1109/icdmw.2012.119.
Full textYANG, Yanyu. "A Corpus-Based Study on Oral Language Education of Police English." In DSDE '21: 2021 4th International Conference on Data Storage and Data Engineering. New York, NY, USA: ACM, 2021. http://dx.doi.org/10.1145/3456146.3456164.
Full textWu, Yaguang, Haichun Sun, and Chungang Yan. "An event timeline extraction method based on news corpus." In 2017 IEEE 2nd International Conference on Big Data Analysis (ICBDA). IEEE, 2017. http://dx.doi.org/10.1109/icbda.2017.8078725.
Full textLarsen-Walker, Melissa. "How Does Data Driven Learning Affect the Production of Multi-Word Sequences in EAP Students’ Academic Writing?" In EUROPHRAS 2017 - Computational and Corpus-based Phraseology: Recent Advances and Interdisciplinary Approaches. Editions Tradulex, Geneva, Switzerland, 2017. http://dx.doi.org/10.26615/978-2-9701095-2-5_010.
Full textGuo, Siqiao, Xianbo Li, and Zhixin Ma. "Association Rule Mining of Anaphora Based on ParCorFull Corpus." In ICCDE 2020: 2020 The 6th International Conference on Computing and Data Engineering. New York, NY, USA: ACM, 2020. http://dx.doi.org/10.1145/3379247.3379277.
Full textLiu, Xuanjun, Zheyu Zhu, Tengyan Fu, Jiaxuan Chen, and Ying Jiang. "Corpus Annotation System Based on HanLP Chinese Word Segmentation." In CONF-CDS 2021: The 2nd International Conference on Computing and Data Science. New York, NY, USA: ACM, 2021. http://dx.doi.org/10.1145/3448734.3450845.
Full textQingzhi, Sun, Du Qingfeng, Zhang Chenxi, and Li Jun. "Chinese News Event Corpus Construction Method Based on Syntax Tree." In ICBDT 2020: 2020 3rd International Conference on Big Data Technologies. New York, NY, USA: ACM, 2020. http://dx.doi.org/10.1145/3422713.3422741.
Full textZhu, Ying, and Eric Friginal. "Interactive Visual Text Analysis for Corpus-Based Language Learning." In 2015 IEEE First International Conference on Big Data Computing Service and Applications (BigDataService). IEEE, 2015. http://dx.doi.org/10.1109/bigdataservice.2015.55.
Full textRybka, Roman, Alexander Sboev, Ivan Moloshnikov, and Dmitry Gudovskikh. "Morpho-syntactic parsing based on neural networks and corpus data." In 2015 Artificial Intelligence and Natural Language and Information Extraction, Social Media and Web Search FRUCT Conference (AINL-ISMW FRUCT). IEEE, 2015. http://dx.doi.org/10.1109/ainl-ismw-fruct.2015.7382975.
Full text