Academic literature on the topic 'Text tokenization'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the lists of relevant articles, books, theses, conference reports, and other scholarly sources on the topic 'Text tokenization.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Journal articles on the topic "Text tokenization"
Sekhar, Sowmik. "Tokenization for Text Analysis." International Journal of Scientific Research and Engineering Trends 10, no. 1 (2024): 149–52. http://dx.doi.org/10.61137/ijsret.vol.10.issue1.127.
Full textNnaemeka, M. Oparauwah, N. Odii Juliet, I. Ayogu Ikechukwu, and C. Iwuchukwu Vitalis. "A boundary-based tokenization technique for extractive text summarization." World Journal of Advanced Research and Reviews 11, no. 2 (2021): 303–12. https://doi.org/10.5281/zenodo.5336977.
Full textNnaemeka M Oparauwah, Juliet N Odii, Ikechukwu I Ayogu, and Vitalis C Iwuchukwu. "A boundary-based tokenization technique for extractive text summarization." World Journal of Advanced Research and Reviews 11, no. 2 (2021): 303–12. http://dx.doi.org/10.30574/wjarr.2021.11.2.0351.
Full textNazir, Shahzad, Muhammad Asif, Mariam Rehman, and Shahbaz Ahmad. "Machine learning based framework for fine-grained word segmentation and enhanced text normalization for low resourced language." PeerJ Computer Science 10 (January 31, 2024): e1704. http://dx.doi.org/10.7717/peerj-cs.1704.
Full textBAR-HAIM, ROY, KHALIL SIMA'AN, and YOAD WINTER. "Part-of-speech tagging of Modern Hebrew text." Natural Language Engineering 14, no. 2 (2008): 223–51. http://dx.doi.org/10.1017/s135132490700455x.
Full textVadlapati, Praneeth. "TokEncryption: Enhanced Hashing of Text using Tokenization." INTERANTIONAL JOURNAL OF SCIENTIFIC RESEARCH IN ENGINEERING AND MANAGEMENT 08, no. 12 (2024): 1–8. https://doi.org/10.55041/ijsrem20280.
Full textA. Mullen, Lincoln, Kenneth Benoit, Os Keyes, Dmitry Selivanov, and Jeffrey Arnold. "Fast, Consistent Tokenization of Natural Language Text." Journal of Open Source Software 3, no. 23 (2018): 655. http://dx.doi.org/10.21105/joss.00655.
Full textBartenyev, Oleg. "Evaluating the Effectiveness of Text Tokenization Methods." Vestnik MEI, no. 6 (December 25, 2023): 144–56. http://dx.doi.org/10.24160/1993-6982-2023-6-144-156.
Full textS, Vijayarani, and Janani R. "Text Mining: open Source Tokenization Tools – An Analysis." Advanced Computational Intelligence: An International Journal (ACII) 3, no. 1 (2016): 37–47. http://dx.doi.org/10.5121/acii.2016.3104.
Full textA. Hosni Mahmoud, Hanan, Alaaeldin M. Hafez, and Eatedal Alabdulkreem. "Language-Independent Text Tokenization Using Unsupervised Deep Learning." Intelligent Automation & Soft Computing 35, no. 1 (2023): 321–34. http://dx.doi.org/10.32604/iasc.2023.026235.
Full textDissertations / Theses on the topic "Text tokenization"
Aliwy, Ahmed Hussein. "Arabic Morphosyntactic Raw Text Part of Speech Tagging System." Doctoral thesis, 2013. http://depotuw.ceon.pl/handle/item/241.
Full textBooks on the topic "Text tokenization"
Mikheev, Andrei. Text Segmentation. Edited by Ruslan Mitkov. Oxford University Press, 2012. http://dx.doi.org/10.1093/oxfordhb/9780199276349.013.0010.
Full textBook chapters on the topic "Text tokenization"
Grefenstette, Gregory. "Tokenization." In Text, Speech and Language Technology. Springer Netherlands, 1999. http://dx.doi.org/10.1007/978-94-015-9273-4_9.
Full textHvitfeldt, Emil, and Julia Silge. "Tokenization." In Supervised Machine Learning for Text Analysis in R. Chapman and Hall/CRC, 2021. http://dx.doi.org/10.1201/9781003093459-3.
Full textFares, Murhaf, Stephan Oepen, and Yi Zhang. "Machine Learning for High-Quality Tokenization Replicating Variable Tokenization Schemes." In Computational Linguistics and Intelligent Text Processing. Springer Berlin Heidelberg, 2013. http://dx.doi.org/10.1007/978-3-642-37247-6_19.
Full textDomingo, Miguel, Mercedes García-Martínez, Alexandre Helle, Francisco Casacuberta, and Manuel Herranz. "How Much Does Tokenization Affect Neural Machine Translation?" In Computational Linguistics and Intelligent Text Processing. Springer Nature Switzerland, 2023. http://dx.doi.org/10.1007/978-3-031-24337-0_38.
Full textGraña, Jorge, Miguel A. Alonso, and Manuel Vilares. "A Common Solution for Tokenization and Part-of-Speech Tagging." In Text, Speech and Dialogue. Springer Berlin Heidelberg, 2002. http://dx.doi.org/10.1007/3-540-46154-x_1.
Full textGraña, Jorge, Fco Mario Barcala, and Jesús Vilares. "Formal Methods of Tokenization for Part-of-Speech Tagging." In Computational Linguistics and Intelligent Text Processing. Springer Berlin Heidelberg, 2002. http://dx.doi.org/10.1007/3-540-45715-1_22.
Full textPiergiovanni, AJ, Kairo Morton, Weicheng Kuo, Michael S. Ryoo, and Anelia Angelova. "Video Question Answering with Iterative Video-Text Co-tokenization." In Lecture Notes in Computer Science. Springer Nature Switzerland, 2022. http://dx.doi.org/10.1007/978-3-031-20059-5_5.
Full textKamps, Jaap, Sisay Fissaha Adafre, and Maarten de Rijke. "Effective Translation, Tokenization and Combination for Cross-Lingual Retrieval." In Multilingual Information Access for Text, Speech and Images. Springer Berlin Heidelberg, 2005. http://dx.doi.org/10.1007/11519645_12.
Full textTailor, Chetana, and Bankim Patel. "Sentence Tokenization Using Statistical Unsupervised Machine Learning and Rule-Based Approach for Running Text in Gujarati Language." In Advances in Intelligent Systems and Computing. Springer Singapore, 2018. http://dx.doi.org/10.1007/978-981-13-2285-3_38.
Full textNuankaew, Wongpanya S., Ronnachai Thipmontha, Phaisarn Jeefoo, Patchara Nasa-ngium, and Pratya Nuankaew. "Using Text Mining and Tokenization Analysis to Identify Job Performance for Human Resource Management at the University of Phayao." In Recent Challenges in Intelligent Information and Database Systems. Springer Nature Switzerland, 2023. http://dx.doi.org/10.1007/978-3-031-42430-4_47.
Full textConference papers on the topic "Text tokenization"
Kayalı, Nihal Zuhal, and Sevinç İlhan Omurca. "Hybrid Tokenization Strategy for Turkish Abstractive Text Summarization." In 2024 8th International Artificial Intelligence and Data Processing Symposium (IDAP). IEEE, 2024. http://dx.doi.org/10.1109/idap64064.2024.10711036.
Full textGoldman, Omer, Avi Caciularu, Matan Eyal, Kris Cao, Idan Szpektor, and Reut Tsarfaty. "Unpacking Tokenization: Evaluating Text Compression and its Correlation with Model Performance." In Findings of the Association for Computational Linguistics ACL 2024. Association for Computational Linguistics, 2024. http://dx.doi.org/10.18653/v1/2024.findings-acl.134.
Full textHassler, M., and G. Fliedl. "Text preparation through extended tokenization." In DATA MINING AND MIS 2006. WIT Press, 2006. http://dx.doi.org/10.2495/data060021.
Full textPrakrankamanant, Patawee, and Ekapol Chuangsuwanich. "Tokenization-based data augmentation for text classification." In 2022 19th International Joint Conference on Computer Science and Software Engineering (JCSSE). IEEE, 2022. http://dx.doi.org/10.1109/jcsse54890.2022.9836268.
Full textCruz Diaz, Noa P., and Manuel Maña López. "An Analysis of Biomedical Tokenization: Problems and Strategies." In Proceedings of the Sixth International Workshop on Health Text Mining and Information Analysis. Association for Computational Linguistics, 2015. http://dx.doi.org/10.18653/v1/w15-2605.
Full textHiraoka, Tatsuya, Hiroyuki Shindo, and Yuji Matsumoto. "Stochastic Tokenization with a Language Model for Neural Text Classification." In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, 2019. http://dx.doi.org/10.18653/v1/p19-1158.
Full textIslam, Tanzirul, Mofazzal Hossain, and MD Fahim Arefin. "Comparative Analysis of Different Text Summarization Techniques Using Enhanced Tokenization." In 2021 3rd International Conference on Sustainable Technologies for Industry 4.0 (STI). IEEE, 2021. http://dx.doi.org/10.1109/sti53101.2021.9732589.
Full textPosokhov, P. A., S. S. Skrylnikov, and O. V. Makhnytkina. "Artificial text detection in Russian language: a BERT-based Approach." In Dialogue. RSUH, 2022. http://dx.doi.org/10.28995/2075-7182-2022-21-470-476.
Full textHorsmann, Tobias, and Torsten Zesch. "LTL-UDE $@$ EmpiriST 2015: Tokenization and PoS Tagging of Social Media Text." In Proceedings of the 10th Web as Corpus Workshop. Association for Computational Linguistics, 2016. http://dx.doi.org/10.18653/v1/w16-2615.
Full textHuang, Zien. "An Ensemble LLM Framework of Text Recognition Based on BERT and BPE Tokenization." In 2024 5th International Seminar on Artificial Intelligence, Networking and Information Technology (AINIT). IEEE, 2024. http://dx.doi.org/10.1109/ainit61980.2024.10581466.
Full text