Добірка наукової літератури з теми "Low resource language"
Оформте джерело за APA, MLA, Chicago, Harvard та іншими стилями
Ознайомтеся зі списками актуальних статей, книг, дисертацій, тез та інших наукових джерел на тему "Low resource language".
Біля кожної праці в переліку літератури доступна кнопка «Додати до бібліографії». Скористайтеся нею – і ми автоматично оформимо бібліографічне посилання на обрану працю в потрібному вам стилі цитування: APA, MLA, «Гарвард», «Чикаго», «Ванкувер» тощо.
Також ви можете завантажити повний текст наукової публікації у форматі «.pdf» та прочитати онлайн анотацію до роботи, якщо відповідні параметри наявні в метаданих.
Статті в журналах з теми "Low resource language":
Lin, Donghui, Yohei Murakami, and Toru Ishida. "Towards Language Service Creation and Customization for Low-Resource Languages." Information 11, no. 2 (January 27, 2020): 67. http://dx.doi.org/10.3390/info11020067.
Zhou, Shuyan, Shruti Rijhwani, John Wieting, Jaime Carbonell, and Graham Neubig. "Improving Candidate Generation for Low-resource Cross-lingual Entity Linking." Transactions of the Association for Computational Linguistics 8 (July 2020): 109–24. http://dx.doi.org/10.1162/tacl_a_00303.
Mati, Diellza Nagavci, Mentor Hamiti, Arsim Susuri, Besnik Selimi, and Jaumin Ajdari. "Building Dictionaries for Low Resource Languages: Challenges of Unsupervised Learning." Annals of Emerging Technologies in Computing 5, no. 3 (July 1, 2021): 52–58. http://dx.doi.org/10.33166/aetic.2021.03.005.
Shikali, Casper S., and Refuoe Mokhosi. "Enhancing African low-resource languages: Swahili data for language modelling." Data in Brief 31 (August 2020): 105951. http://dx.doi.org/10.1016/j.dib.2020.105951.
Chen, Siqi, Yijie Pei, Zunwang Ke, and Wushour Silamu. "Low-Resource Named Entity Recognition via the Pre-Training Model." Symmetry 13, no. 5 (May 2, 2021): 786. http://dx.doi.org/10.3390/sym13050786.
Rijhwani, Shruti, Jiateng Xie, Graham Neubig, and Jaime Carbonell. "Zero-Shot Neural Transfer for Cross-Lingual Entity Linking." Proceedings of the AAAI Conference on Artificial Intelligence 33 (July 17, 2019): 6924–31. http://dx.doi.org/10.1609/aaai.v33i01.33016924.
Mi, Chenggang, Shaolin Zhu, and Rui Nie. "Improving Loanword Identification in Low-Resource Language with Data Augmentation and Multiple Feature Fusion." Computational Intelligence and Neuroscience 2021 (April 8, 2021): 1–9. http://dx.doi.org/10.1155/2021/9975078.
Lee, Chanhee, Kisu Yang, Taesun Whang, Chanjun Park, Andrew Matteson, and Heuiseok Lim. "Exploring the Data Efficiency of Cross-Lingual Post-Training in Pretrained Language Models." Applied Sciences 11, no. 5 (February 24, 2021): 1974. http://dx.doi.org/10.3390/app11051974.
Et. al., Syed Abdul Basit Andrabi,. "A Review of Machine Translation for South Asian Low Resource Languages." Turkish Journal of Computer and Mathematics Education (TURCOMAT) 12, no. 5 (April 10, 2021): 1134–47. http://dx.doi.org/10.17762/turcomat.v12i5.1777.
Zhang, Mozhi, Yoshinari Fujinuma, and Jordan Boyd-Graber. "Exploiting Cross-Lingual Subword Similarities in Low-Resource Document Classification." Proceedings of the AAAI Conference on Artificial Intelligence 34, no. 05 (April 3, 2020): 9547–54. http://dx.doi.org/10.1609/aaai.v34i05.6500.
Дисертації з теми "Low resource language":
Jansson, Herman. "Low-resource Language Question Answering Systemwith BERT." Thesis, Mittuniversitetet, Institutionen för informationssystem och –teknologi, 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:miun:diva-42317.
Zhang, Yuan Ph D. Massachusetts Institute of Technology. "Transfer learning for low-resource natural language analysis." Thesis, Massachusetts Institute of Technology, 2017. http://hdl.handle.net/1721.1/108847.
This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections.
Cataloged from student-submitted PDF version of thesis.
Includes bibliographical references (pages 131-142).
Expressive machine learning models such as deep neural networks are highly effective when they can be trained with large amounts of in-domain labeled training data. While such annotations may not be readily available for the target task, it is often possible to find labeled data for another related task. The goal of this thesis is to develop novel transfer learning techniques that can effectively leverage annotations in source tasks to improve performance of the target low-resource task. In particular, we focus on two transfer learning scenarios: (1) transfer across languages and (2) transfer across tasks or domains in the same language. In multilingual transfer, we tackle challenges from two perspectives. First, we show that linguistic prior knowledge can be utilized to guide syntactic parsing with little human intervention, by using a hierarchical low-rank tensor method. In both unsupervised and semi-supervised transfer scenarios, this method consistently outperforms state-of-the-art multilingual transfer parsers and the traditional tensor model across more than ten languages. Second, we study lexical-level multilingual transfer in low-resource settings. We demonstrate that only a few (e.g., ten) word translation pairs suffice for an accurate transfer for part-of-speech (POS) tagging. Averaged across six languages, our approach achieves a 37.5% improvement over the monolingual top-performing method when using a comparable amount of supervision. In the second monolingual transfer scenario, we propose an aspect-augmented adversarial network that allows aspect transfer over the same domain. We use this method to transfer across different aspects in the same pathology reports, where traditional domain adaptation approaches commonly fail. Experimental results demonstrate that our approach outperforms different baselines and model variants, yielding a 24% gain on this pathology dataset.
by Yuan Zhang.
Ph. D.
Zouhair, Taha. "Automatic Speech Recognition for low-resource languages using Wav2Vec2 : Modern Standard Arabic (MSA) as an example of a low-resource language." Thesis, Högskolan Dalarna, Institutionen för information och teknik, 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:du-37702.
Packham, Sean. "Crowdsourcing a text corpus for a low resource language." Master's thesis, University of Cape Town, 2016. http://hdl.handle.net/11427/20436.
Lakew, Surafel Melaku. "Multilingual Neural Machine Translation for Low Resource Languages." Doctoral thesis, Università degli studi di Trento, 2020. http://hdl.handle.net/11572/257906.
Mairidan, Wushouer. "Pivot-Based Bilingual Dictionary Creation for Low-Resource Languages." 京都大学 (Kyoto University), 2015. http://hdl.handle.net/2433/199441.
Samson, Juan Sarah Flora. "Exploiting resources from closely-related languages for automatic speech recognition in low-resource languages from Malaysia." Thesis, Université Grenoble Alpes (ComUE), 2015. http://www.theses.fr/2015GREAM061/document.
Languages in Malaysia are dying in an alarming rate. As of today, 15 languages are in danger while two languages are extinct. One of the methods to save languages is by documenting languages, but it is a tedious task when performed manually.Automatic Speech Recognition (ASR) system could be a tool to help speed up the process of documenting speeches from the native speakers. However, building ASR systems for a target language requires a large amount of training data as current state-of-the-art techniques are based on empirical approach. Hence, there are many challenges in building ASR for languages that have limited data available.The main aim of this thesis is to investigate the effects of using data from closely-related languages to build ASR for low-resource languages in Malaysia. Past studies have shown that cross-lingual and multilingual methods could improve performance of low-resource ASR. In this thesis, we try to answer several questions concerning these approaches: How do we know which language is beneficial for our low-resource language? How does the relationship between source and target languages influence speech recognition performance? Is pooling language data an optimal approach for multilingual strategy?Our case study is Iban, an under-resourced language spoken in Borneo island. We study the effects of using data from Malay, a local dominant language which is close to Iban, for developing Iban ASR under different resource constraints. We have proposed several approaches to adapt Malay data to obtain pronunciation and acoustic models for Iban speech.Building a pronunciation dictionary from scratch is time consuming, as one needs to properly define the sound units of each word in a vocabulary. We developed a semi-supervised approach to quickly build a pronunciation dictionary for Iban. It was based on bootstrapping techniques for improving Malay data to match Iban pronunciations.To increase the performance of low-resource acoustic models we explored two acoustic modelling techniques, the Subspace Gaussian Mixture Models (SGMM) and Deep Neural Networks (DNN). We performed cross-lingual strategies using both frameworks for adapting out-of-language data to Iban speech. Results show that using Malay data is beneficial for increasing the performance of Iban ASR. We also tested SGMM and DNN to improve low-resource non-native ASR. We proposed a fine merging strategy for obtaining an optimal multi-accent SGMM. In addition, we developed an accent-specific DNN using native speech data. After applying both methods, we obtained significant improvements in ASR accuracy. From our study, we observe that using SGMM and DNN for cross-lingual strategy is effective when training data is very limited
Tafreshi, Shabnam. "Cross-Genre, Cross-Lingual, and Low-Resource Emotion Classification." Thesis, The George Washington University, 2021. http://pqdtopen.proquest.com/#viewpdf?dispub=28088437.
Singh, Mittul [Verfasser], and Dietrich [Akademischer Betreuer] Klakow. "Handling long-term dependencies and rare words in low-resource language modelling / Mittul Singh ; Betreuer: Dietrich Klakow." Saarbrücken : Saarländische Universitäts- und Landesbibliothek, 2017. http://d-nb.info/1141677962/34.
Dyer, Andrew. "Low Supervision, Low Corpus size, Low Similarity! Challenges in cross-lingual alignment of word embeddings : An exploration of the limitations of cross-lingual word embedding alignment in truly low resource scenarios." Thesis, Uppsala universitet, Institutionen för lingvistik och filologi, 2019. http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-395946.
Книги з теми "Low resource language":
Carver, Tina Kasloff. A writing book: English in everyday life : a teacher's resource book. 2nd ed. Upper Saddle River, NJ: Prentice Hall Regents, 1998.
Shefner, Jon. The illusion of civil society: Democratization and community mobilization in low-income Mexico. University Park, Pa: Pennsylvania State University Press, 2008.
Canadian Legal Information Centre. Plain Language Centre. Plain Language Resource Centre catalogue. [Ottawa: Multiculturalism and Citizenship Canada], 1992.
Hahn, Walther von, and Cristina Vertan. Multilingual processing in eastern and southern EU languages: Low-resourced technologies and translation. Newcastle upon Tyne, UK: Cambridge Scholars Publishing, 2012.
Corson, David. Language policy in schools: A resource for teachers and administrators. Mahwah, NJ: Lawrence Erlbaum Associates, 1999.
Library, Canada Multiculturalism and Citizenship Canada Departmental. Plain Language Resource Centre catalogue =: Catalogue du centre de ressources sur le langage clair et simple. [Ottawa: Multiculturalism and Citizenship Canada], 1992.
Library, Canada Multiculturalism and Citizenship Canada Departmental. Plain Language Resource Centre catalogue =: Catalogue du centre de ressources sur le langage clair et simple. [Ottawa: Multiculturalism and Citizenship Canada], 1992.
Megías, José Manuel Lucía. Literatura románica en Internet: Los textos. Madrid: Editorial Castalia, 2002.
Franklin, Kristine L. El aullido de los monos. New York: Atheneum, 1994.
United States. Congress. Senate. Committee on Labor and Human Resources. Subcommittee on Education, Arts, and Humanities. Foreign Language Competence for the Future Act of 1989: Hearing before the Subcommittee on Education, Arts, and Humanities of the Committee on Labor and Human Resources, United States Senate, One Hundred First Congress, first session, on S. 1690, to establish programs to improve foreign language instruction, and for other purposes, S. 1540, to establish a critical languages and area studies program, October 31, 1989. Washington: U.S. G.P.O., 1990.
Частини книг з теми "Low resource language":
Grießhaber, Daniel, Ngoc Thang Vu, and Johannes Maucher. "Low-Resource Text Classification Using Domain-Adversarial Learning." In Statistical Language and Speech Processing, 129–39. Cham: Springer International Publishing, 2018. http://dx.doi.org/10.1007/978-3-030-00810-9_12.
Juan, Sarah Samson, Muhamad Fikri Che Ismail, Hamimah Ujir, and Irwandi Hipiny. "Language Modelling for a Low-Resource Language in Sarawak, Malaysia." In Lecture Notes in Electrical Engineering, 147–58. Singapore: Springer Singapore, 2019. http://dx.doi.org/10.1007/978-981-15-1289-6_14.
Zhu, ShaoLin, Xiao Li, YaTing Yang, Lei Wang, and ChengGang Mi. "Learning Bilingual Lexicon for Low-Resource Language Pairs." In Natural Language Processing and Chinese Computing, 760–70. Cham: Springer International Publishing, 2018. http://dx.doi.org/10.1007/978-3-319-73618-1_66.
Xu, Nuo, Yinqiao Li, Chen Xu, Yanyang Li, Bei Li, Tong Xiao, and Jingbo Zhu. "Analysis of Back-Translation Methods for Low-Resource Neural Machine Translation." In Natural Language Processing and Chinese Computing, 466–75. Cham: Springer International Publishing, 2019. http://dx.doi.org/10.1007/978-3-030-32236-6_42.
Hu, Yong, Heyan Huang, Tian Lan, Xiaochi Wei, Yuxiang Nie, Jiarui Qi, Liner Yang, and Xian-Ling Mao. "Multi-task Learning for Low-Resource Second Language Acquisition Modeling." In Web and Big Data, 603–11. Cham: Springer International Publishing, 2020. http://dx.doi.org/10.1007/978-3-030-60259-8_44.
Srivastava, Brij Mohan Lal, and Manish Shrivastava. "Articulatory Gesture Rich Representation Learning of Phonological Units in Low Resource Settings." In Statistical Language and Speech Processing, 80–95. Cham: Springer International Publishing, 2016. http://dx.doi.org/10.1007/978-3-319-45925-7_7.
Juan, Sarah Samson, Laurent Besacier, Benjamin Lecouteux, and Tien-Ping Tan. "Merging of Native and Non-native Speech for Low-resource Accented ASR." In Statistical Language and Speech Processing, 255–66. Cham: Springer International Publishing, 2015. http://dx.doi.org/10.1007/978-3-319-25789-1_24.
James, Cynthia C., and Kean Wah Lee. "Narrative Inquiry into Teacher Identity, Context, and Technology Integration in Low-Resource ESL Classrooms." In Language Learning with Technology, 65–76. Singapore: Springer Singapore, 2021. http://dx.doi.org/10.1007/978-981-16-2697-5_5.
Wu, Jing, Hongxu Hou, Zhipeng Shen, Jian Du, and Jinting Li. "Adapting Attention-Based Neural Network to Low-Resource Mongolian-Chinese Machine Translation." In Natural Language Understanding and Intelligent Applications, 470–80. Cham: Springer International Publishing, 2016. http://dx.doi.org/10.1007/978-3-319-50496-4_39.
Kunchukuttan, Anoop, and Pushpak Bhattacharyya. "A Case Study on Indic Language Translation." In Machine Translation and Transliteration Involving Related and Low-resource Languages, 93–112. Boca Raton: Chapman and Hall/CRC, 2021. http://dx.doi.org/10.1201/9781003096771-7.
Тези доповідей конференцій з теми "Low resource language":
Gandhe, Ankur, Florian Metze, and Ian Lane. "Neural network language models for low resource languages." In Interspeech 2014. ISCA: ISCA, 2014. http://dx.doi.org/10.21437/interspeech.2014-560.
Feng, Xiaocheng, Xiachong Feng, Bing Qin, Zhangyin Feng, and Ting Liu. "Improving Low Resource Named Entity Recognition using Cross-lingual Knowledge Transfer." In Twenty-Seventh International Joint Conference on Artificial Intelligence {IJCAI-18}. California: International Joint Conferences on Artificial Intelligence Organization, 2018. http://dx.doi.org/10.24963/ijcai.2018/566.
Khemchandani, Yash, Sarvesh Mehtani, Vaidehi Patil, Abhijeet Awasthi, Partha Talukdar, and Sunita Sarawagi. "Exploiting Language Relatedness for Low Web-Resource Language Model Adaptation: An Indic Languages Study." In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). Stroudsburg, PA, USA: Association for Computational Linguistics, 2021. http://dx.doi.org/10.18653/v1/2021.acl-long.105.
Joshi, Ishani, Purvi Koringa, and Suman Mitra. "Word Embeddings in Low Resource Gujarati Language." In 2019 International Conference on Document Analysis and Recognition Workshops (ICDARW). IEEE, 2019. http://dx.doi.org/10.1109/icdarw.2019.40090.
Wong, Tak-sum, and John Lee. "Character Profiling in Low-Resource Language Documents." In ADCS '19: Australasian Document Computing Symposium. New York, NY, USA: ACM, 2019. http://dx.doi.org/10.1145/3372124.3372129.
Qi, Zhaodi, Yong Ma, and Mingliang Gu. "A Study on Low-resource Language Identification." In 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC). IEEE, 2019. http://dx.doi.org/10.1109/apsipaasc47483.2019.9023075.
Kumar, Sachin, Antonios Anastasopoulos, Shuly Wintner, and Yulia Tsvetkov. "Machine Translation into Low-resource Language Varieties." In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 2: Short Papers). Stroudsburg, PA, USA: Association for Computational Linguistics, 2021. http://dx.doi.org/10.18653/v1/2021.acl-short.16.
Sailor, Hardik, Ankur Patil, and Hemant Patil. "Advances in Low Resource ASR: A Deep Learning Perspective." In The 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages. ISCA: ISCA, 2018. http://dx.doi.org/10.21437/sltu.2018-4.
Liu, Chunxi, Matthew Wiesner, Shinji Watanabe, Craig Harman, Jan Trmal, Najim Dehak, and Sanjeev Khudanpur. "Low-Resource Contextual Topic Identification on Speech." In 2018 IEEE Spoken Language Technology Workshop (SLT). IEEE, 2018. http://dx.doi.org/10.1109/slt.2018.8639544.
Beloucif, Meriem, Ana Valeria Gonzalez, Marcel Bollmann, and Anders Søgaard. "Naive Regularizers for Low-Resource Neural Machine Translation." In Recent Advances in Natural Language Processing. Incoma Ltd., Shoumen, Bulgaria, 2019. http://dx.doi.org/10.26615/978-954-452-056-4_013.