Academic literature on the topic 'Corpus comparable'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the lists of relevant articles, books, theses, conference reports, and other scholarly sources on the topic 'Corpus comparable.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Journal articles on the topic "Corpus comparable"
Laviosa, Sara. "How Comparable Can 'Comparable Corpora' Be?" Target. International Journal of Translation Studies 9, no. 2 (January 1, 1997): 287–317. http://dx.doi.org/10.1075/target.9.2.05lav.
Full textLópez Arroyo, Belén. "Can comparable corpora be compared?" Ibérica, no. 39 (January 2, 2020): 43–68. http://dx.doi.org/10.17398/2340-2784.39.43.
Full textČermáková, Ann, Jarmo Jantunen, Tommi Jauhiainen, John Kirk, Michal Křen, Marc Kupietz, and Elaine Uí Dhonnchadha. "The International Comparable Corpus: Challenges in building multilingual spoken and written comparable corpora." Research in Corpus Linguistics 10, no. 1 (2021): 89–103. http://dx.doi.org/10.32714/ricl.09.01.06.
Full textAwal, Norsimah Mat, Intan Safinaz Zainuddin, and Imran Ho-Abdullah. "Use of Comparable Corpus in Teaching Translation." Procedia - Social and Behavioral Sciences 18 (2011): 638–42. http://dx.doi.org/10.1016/j.sbspro.2011.05.094.
Full textLópez Arroyo, Belén, and Roda P. Roberts. "Genre and Register in Comparable Corpora: An English/Spanish Contrastive Analysis." Meta 62, no. 1 (July 6, 2017): 114–36. http://dx.doi.org/10.7202/1040469ar.
Full textWeng, Yu, Shumin Dong, and Chaomurilige Chaomurilige. "A Privacy-Preserving Multilingual Comparable Corpus Construction Method in Internet of Things." Mathematics 12, no. 4 (February 17, 2024): 598. http://dx.doi.org/10.3390/math12040598.
Full textSun, Yuan, and Qian Zhao. "Tibetan-Chinese Named Entity Extraction Based on Comparable Corpus." Applied Mechanics and Materials 571-572 (June 2014): 1202–5. http://dx.doi.org/10.4028/www.scientific.net/amm.571-572.1202.
Full textLI, BO, ERIC GAUSSIER, and DAN YANG. "Measuring bilingual corpus comparability." Natural Language Engineering 24, no. 4 (January 15, 2018): 523–49. http://dx.doi.org/10.1017/s1351324917000481.
Full textForchini, Pierfranca, and Amanda Murphy. "N-grams in comparable specialized corpora." Patterns, meaningful units and specialized discourses 13, no. 3 (September 17, 2008): 351–67. http://dx.doi.org/10.1075/ijcl.13.3.06for.
Full textVisky, Mihaela. "L’UTILISATION DU CORPUS COMPARABLE DANS L’ENSEIGNEMENT DE LA TRADUCTION." Professional Communication and Translation Studies 6 (2013): 165–76. http://dx.doi.org/10.59168/mojx8412.
Full textDissertations / Theses on the topic "Corpus comparable"
Shrestha, Prajol. "Alignement inter-modalités de corpus comparable monolingue." Phd thesis, Université de Nantes, 2013. http://tel.archives-ouvertes.fr/tel-00909179.
Full textProchasson, Emmanuel. "Alignement multilingue en corpus comparables spécialisés." Phd thesis, Université de Nantes, 2009. http://tel.archives-ouvertes.fr/tel-00462248.
Full textLi, Bo. "Mesurer et améliorer la qualité des corpus comparables." Thesis, Grenoble, 2012. http://www.theses.fr/2012GRENM069.
Full textBilingual corpora are an essential resource used to cross the language barrier in multilingual Natural Language Processing (NLP) tasks. Most of the current work makes use of parallel corpora that are mainly available for major languages and constrained areas. Comparable corpora, text collections comprised of documents covering overlapping information, are however less expensive to obtain in high volume. Previous work has shown that using comparable corpora is beneficent for several NLP tasks. Apart from those studies, we will try in this thesis to improve the quality of comparable corpora so as to improve the performance of applications exploiting them. The idea is advantageous since it can work with any existing method making use of comparable corpora. We first discuss in the thesis the notion of comparability inspired from the usage experience of bilingual corpora. The notion motivates several implementations of the comparability measure under the probabilistic framework, as well as a methodology to evaluate the ability of comparability measures to capture gold-standard comparability levels. The comparability measures are also examined in terms of robustness to dictionary changes. The experiments show that a symmetric measure relying on vocabulary overlapping can correlate very well with gold-standard comparability levels and is robust to dictionary changes. Based on the comparability measure, two methods, namely the greedy approach and the clustering approach, are then developed to improve the quality of any given comparable corpus. The general idea of these two methods is to choose the highquality subpart from the original corpus and to enrich the low-quality subpart with external resources. The experiments show that one can improve the quality, in terms of comparability scores, of the given comparable corpus by these two methods, with the clustering approach being more efficient than the greedy approach. The enhanced comparable corpus further results in better bilingual lexicons extracted with the standard extraction algorithm. Lastly, we investigate the task of Cross-Language Information Retrieval (CLIR) and the application of comparable corpora in CLIR. We develop novel CLIR models extending the recently proposed information-based models in monolingual IR. The information-based CLIR model is shown to give the best performance overall. Bilingual lexicons extracted from comparable corpora are then combined with the existing bilingual dictionary and used in CLIR experiments, which results in significant improvement of the CLIR system
Abdul, Rauf Sadaf. "Sélection de corpus en traduction automatique statistique." Phd thesis, Université du Maine, 2012. http://tel.archives-ouvertes.fr/tel-00732984.
Full textBouamor, Dhouha. "Constitution de ressources linguistiques multilingues à partir de corpus de textes parallèles et comparables." Phd thesis, Université Paris Sud - Paris XI, 2014. http://tel.archives-ouvertes.fr/tel-00994222.
Full textAl-Qaisi, Fu'ad. "Apport de la linguistique de corpus à la lexicographie bilingue (français-arabe) : macrostructure et microstructure d'un dictionnaire de collocations." Thesis, Lyon 2, 2015. http://www.theses.fr/2015LYO20115.
Full textThe aim of this study is to examine the contribution of corpus linguistics to bilingual French-Arabic lexicography. We particularly focus on collocations, as our research begins with the compilation of a bilingual corpus leading up to the integration of collocations in the lexicon. Fundamentals such as corpus linguistics, corpora and collocation are examined. Our research then takes an empirical turn that is based on the use of our corpus. To overcome the unavailability of corpus processing tools in Arabic, an approach was developed in this study that we called the footbridge strategy. The idea is to start from a French-Arabic (translated) parallel corpus. This corpus consists of the French version of Le Monde Diplomatique, and its translation. Using a parallel corpus aims to facilitate the identification of contrastive phenomena. The results obtained in the translated corpus (in its Arabic component) will be subsequently checked in an Arabic monolingual corpus. The latter is a corpus consisting of three newspapers: Alrai, Alayyam, Algouhouria. Throughout the exploitation of the corpus, results are compared first between corpora and dictionaries, secondly between corpus types (parallel and comparable), and thirdly between newspapers (Alrai, Alayyam, Algouhouria). Then a number of collocations are subjected to semantic and structural review and consideration. This review process not only brings some clarifications on the environment of collocations between language and speech but also about a possible approach for their integration in the dictionary. Legitimate questions gradually arise regarding the resemblance of collocations in French and Arabic. The results highlight phenomena such as collocational chains (clusters), collocational synonyms, etc. The study culminates in the design of a computer dictionary of collocations, i.e. an active bilingual dictionary aimed at Arabic language specialists and translators
Hoddinott, Simon Matthew. "Web mining for translators: automatic construction of comparable, genre-driven corpora." Bachelor's thesis, Alma Mater Studiorum - Università di Bologna, 2016. http://amslaurea.unibo.it/10775/.
Full textLaviosa-Braithwaite, S. "The English Comparable Corpus (ECC) : a resource and a methodology for the empirical study of translation." Thesis, University of Manchester, 1996. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.488308.
Full textChiao, Yun-Chuang. "Extraction lexicale bilingue à partir de textes médicaux comparable : application à la recherche d'information translangue." Paris 6, 2004. https://tel.archives-ouvertes.fr/tel-00007704.
Full textZennaki, Othman. "Construction automatique d'outils et de ressources linguistiques à partir de corpus parallèles." Thesis, Université Grenoble Alpes (ComUE), 2019. http://www.theses.fr/2019GREAM006/document.
Full textThis thesis focuses on the automatic construction of linguistic tools and resources for analyzing texts of low-resource languages. We propose an approach using Recurrent Neural Networks (RNN) and requiring only a parallel or multi-parallel corpus between a well-resourced language and one or more low-resource languages. This parallel or multi-parallel corpus is used to construct a multilingual representation of words of the source and target languages. We used this multilingual representation to train our neural models and we investigated both uni and bidirectional RNN models. We also proposed a method to include external information (for instance, low-level information from Part-Of-Speech tags) in the RNN to train higher level taggers (for instance, SuperSenses taggers and Syntactic dependency parsers). We demonstrated the validity and genericity of our approach on several languages and we conducted experiments on various NLP tasks: Part-Of-Speech tagging, SuperSenses tagging and Dependency parsing. The obtained results are very satisfactory. Our approach has the following characteristics and advantages: (a) it does not use word alignment information, (b) it does not assume any knowledge about target languages (one requirement is that the two languages (source and target) are not too syntactically divergent), which makes it applicable to a wide range of low-resource languages, (c) it provides authentic multilingual taggers (one tagger for N languages)
Books on the topic "Corpus comparable"
Laviosa-Braithwaite, S. The English comparable corpus(ECC): A resource and a methodology for the empirical studyof translation. Manchester: UMIST, 1996.
Find full textMcEnery, Tony. Corpus Linguistics. Edited by Ruslan Mitkov. Oxford University Press, 2012. http://dx.doi.org/10.1093/oxfordhb/9780199276349.013.0024.
Full textCermáková, Anna, Hilde Hasselgård, Markéta Malá, and Denisa Šebestová, eds. Contrastive Corpus Linguistics. Bloomsbury Publishing Plc, 2024. http://dx.doi.org/10.5040/9781350385962.
Full textBook chapters on the topic "Corpus comparable"
Shultz, Thomas R., Scott E. Fahlman, Susan Craw, Periklis Andritsos, Panayiotis Tsaparas, Ricardo Silva, Chris Drummond, et al. "Comparable Corpus." In Encyclopedia of Machine Learning, 194. Boston, MA: Springer US, 2011. http://dx.doi.org/10.1007/978-0-387-30164-8_144.
Full textBaradaran Hashemi, Homa, Azadeh Shakery, and Heshaam Faili. "Creating a Persian-English Comparable Corpus." In Multilingual and Multimodal Information Access Evaluation, 27–39. Berlin, Heidelberg: Springer Berlin Heidelberg, 2010. http://dx.doi.org/10.1007/978-3-642-15998-5_5.
Full textEckart, Thomas, and Uwe Quasthoff. "Statistical Corpus and Language Comparison on Comparable Corpora." In Building and Using Comparable Corpora, 151–65. Berlin, Heidelberg: Springer Berlin Heidelberg, 2013. http://dx.doi.org/10.1007/978-3-642-20128-8_8.
Full textMcEnery, Tony, and Richard Xiao. "Parallel and comparable corpora: The state of play." In Corpus-Based Perspectives in Linguistics, 131–45. Amsterdam: John Benjamins Publishing Company, 2007. http://dx.doi.org/10.1075/ubli.6.11mce.
Full textLi, Bo, and Eric Gaussier. "Exploiting Comparable Corpora for Lexicon Extraction: Measuring and Improving Corpus Quality." In Building and Using Comparable Corpora, 131–49. Berlin, Heidelberg: Springer Berlin Heidelberg, 2013. http://dx.doi.org/10.1007/978-3-642-20128-8_7.
Full textPhilip, Gill. "Arriving at equivalence: Making a case for comparable general reference corpora in translation studies." In Corpus Use and Translating, 59–73. Amsterdam: John Benjamins Publishing Company, 2009. http://dx.doi.org/10.1075/btl.82.06phi.
Full textRahimi, Zahra, and Azadeh Shakery. "Topic Based Creation of a Persian-English Comparable Corpus." In Information Retrieval Technology, 458–69. Berlin, Heidelberg: Springer Berlin Heidelberg, 2011. http://dx.doi.org/10.1007/978-3-642-25631-8_41.
Full textRogati, Monica, and Yiming Yang. "Cross-Lingual Pseudo-Relevance Feedback Using a Comparable Corpus." In Lecture Notes in Computer Science, 151–57. Berlin, Heidelberg: Springer Berlin Heidelberg, 2002. http://dx.doi.org/10.1007/3-540-45691-0_12.
Full textKruijt, Anne, Patrizia Cordin, and Stefan Rabanus. "On the validity of crowdsourced data." In Studies in Corpus Linguistics, 10–33. Amsterdam: John Benjamins Publishing Company, 2023. http://dx.doi.org/10.1075/scl.110.01kru.
Full textPicchi, Eugenio, and Carol Peters. "Cross-Language Information Retrieval: A System for Comparable Corpus Querying." In Cross-Language Information Retrieval, 81–92. Boston, MA: Springer US, 1998. http://dx.doi.org/10.1007/978-1-4615-5661-9_7.
Full textConference papers on the topic "Corpus comparable"
Huidrom, Rudali, Yves Lepage, and Khogendra Khomdram. "EM Corpus: a comparable corpus for a less-resourced language pair Manipuri-English." In The 14th Workshop on Building and Using Comparable Corpora. INCOMA Ltd. Shoumen, BULGARIA, 2021. http://dx.doi.org/10.26615/978-954-452-076-2_008.
Full textYouliang, Zhou, Gong Zhengxian, and Zhou Guodong. "Complement the comparable corpus obtained from websites." In 2010 2nd International Conference on Future Computer and Communication. IEEE, 2010. http://dx.doi.org/10.1109/icfcc.2010.5497762.
Full textLupu, Mihai. "Bootstrapping a Comparable Corpus from Patent Family Members." In 2012 23rd International Workshop on Database and Expert Systems Applications (DEXA). IEEE, 2012. http://dx.doi.org/10.1109/dexa.2012.60.
Full textSun, Y., and W. B. Guo. "Tibetan–Chinese cross-language-network comparable corpus extraction." In International Conference on Computer Science and Technology. Southampton, UK: WIT Press, 2014. http://dx.doi.org/10.2495/iccst140311.
Full textLv, Fei, and Zede Zhu. "A Summary of Studies on Bilingual Comparable Corpus." In 2019 International Conference on Smart Grid and Electrical Automation (ICSGEA). IEEE, 2019. http://dx.doi.org/10.1109/icsgea.2019.00138.
Full textJP, Sanjanasri, Vijay Krishna Menon, Soman KP, and Krzysztof Wolk. "Mining BilingualWord Pairs from Comparable Corpus using Apache Spark Framework." In The 14th Workshop on Building and Using Comparable Corpora. INCOMA Ltd. Shoumen, BULGARIA, 2021. http://dx.doi.org/10.26615/978-954-452-076-2_002.
Full textAbidi, K., M. A. Menacer, and Kamel Smaïli. "CALYOU: A Comparable Spoken Algerian Corpus Harvested from YouTube." In Interspeech 2017. ISCA: ISCA, 2017. http://dx.doi.org/10.21437/interspeech.2017-1305.
Full textKatsumata, Satoru, and Mamoru Komachi. "(Almost) Unsupervised Grammatical Error Correction using Synthetic Comparable Corpus." In Proceedings of the Fourteenth Workshop on Innovative Use of NLP for Building Educational Applications. Stroudsburg, PA, USA: Association for Computational Linguistics, 2019. http://dx.doi.org/10.18653/v1/w19-4413.
Full textMilajevs, Dmitrijs. "Toward a Comparable Corpus of Latvian, Russian and English Tweets." In Proceedings of the 10th Workshop on Building and Using Comparable Corpora. Stroudsburg, PA, USA: Association for Computational Linguistics, 2017. http://dx.doi.org/10.18653/v1/w17-2505.
Full textYamamoto, Yuki, Tomoyoshi Akiba, and Hajime Tsukada. "Training Neural Machine Translation Models by Using an Entire Comparable Corpus." In 2023 10th International Conference on Advanced Informatics: Concept, Theory and Application (ICAICTA). IEEE, 2023. http://dx.doi.org/10.1109/icaicta59291.2023.10390174.
Full textReports on the topic "Corpus comparable"
Wolfenson, David, William W. Thatcher, and James E. Kinder. Regulation of LH Secretion in the Periovulatory Period as a Strategy to Enhance Ovarian Function and Fertility in Dairy and Beef Cows. United States Department of Agriculture, December 2003. http://dx.doi.org/10.32747/2003.7586458.bard.
Full textSeiple, Jacqueline, Luis Santiago, Christopher Spaur, Safra Altman, Matthew Balazik, Thomas Laczo, Daniel Mensah, Warunika Amarasingha, Andrew Payson, and Danielle Szimanski. Two years of post-project monitoring of a navigation solution in a dynamic coastal environment, Smith Island, Maryland. Engineer Research and Development Center (U.S.), June 2022. http://dx.doi.org/10.21079/11681/44620.
Full textChidsey, Thomas C., David E. Eby, Michael D. Vanden Berg, and Douglas A. Sprinkel. Microbial Carbonate Reservoirs and Analogs from Utah. Utah Geological Survey, July 2021. http://dx.doi.org/10.34191/ss-168.
Full text