Academic literature on the topic 'Multimodal NLP'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the lists of relevant articles, books, theses, conference reports, and other scholarly sources on the topic 'Multimodal NLP.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Journal articles on the topic "Multimodal NLP"
Tiwari, Manisha, Pragati Khare, Ishani Saha, and Mahesh Mali. "Multimodal NLP for image captioning : Fusing text and image modalities for accurate and informative descriptions." Journal of Information and Optimization Sciences 45, no. 4 (2024): 1041–49. http://dx.doi.org/10.47974/jios-1626.
Full textZhang, Yingjie. "The current status and prospects of transformer in multimodality." Applied and Computational Engineering 11, no. 1 (2023): 224–30. http://dx.doi.org/10.54254/2755-2721/11/20230240.
Full textManish Kumar Keshri. "The Integration of NLP and Computer Vision: Advanced Frameworks for Multi-Modal Content Understanding." International Journal of Scientific Research in Computer Science, Engineering and Information Technology 11, no. 2 (2025): 2788–98. https://doi.org/10.32628/cseit25112708.
Full textHu, Qinrui. "Sentiment Analysis and Facial Expression Recognition in Customer Service Interactions." Frontiers in Business, Economics and Management 16, no. 3 (2024): 72–75. http://dx.doi.org/10.54097/tx980862.
Full textResearcher. "UNDERSTANDING NATURAL LANGUAGE PROCESSING (NLP) TECHNIQUES." International Journal of Computer Engineering and Technology (IJCET) 15, no. 4 (2024): 527–36. https://doi.org/10.5281/zenodo.13311223.
Full textResearcher. "UNDERSTANDING NATURAL LANGUAGE PROCESSING (NLP) TECHNIQUES." International Journal of Research In Computer Applications and Information Technology (IJRCAIT) 15, no. 6 (2024): 1221–31. https://doi.org/10.5281/zenodo.14359554.
Full textFan, Yuhan. "Research progress and challenges of deep learning in Natural Language Processing." Advances in Engineering Innovation 16, no. 6 (2025): None. https://doi.org/10.54254/2977-3903/2025.24550.
Full textWang, Bin, Chunyu Xie, Dawei Leng, and Yuhui Yin. "IAA: Inner-Adaptor Architecture Empowers Frozen Large Language Model with Multimodal Capabilities." Proceedings of the AAAI Conference on Artificial Intelligence 39, no. 20 (2025): 21035–43. https://doi.org/10.1609/aaai.v39i20.35400.
Full textMrs., Nagarathnamma S. M. "The Future of Natural Language Processing: A Survey of Recent Advances and Emerging Trends." Journal of Scholastic Engineering Science and Management 2, no. 6 (2023): 26–35. https://doi.org/10.5281/zenodo.8243058.
Full textSingh, Ankit Kumar. "Desktop Assistant Based on NLP." INTERANTIONAL JOURNAL OF SCIENTIFIC RESEARCH IN ENGINEERING AND MANAGEMENT 08, no. 05 (2024): 1–5. http://dx.doi.org/10.55041/ijsrem34539.
Full textDissertations / Theses on the topic "Multimodal NLP"
Nouri, Golmaei Sara. "Improving the Performance of Clinical Prediction Tasks by using Structured and Unstructured Data combined with a Patient Network." Thesis, 2021. http://dx.doi.org/10.7912/C2/41.
Full textBaria, Enrico. "Multimodal imaging for tissue diagnostics by combined two-photon and Raman microscopy." Doctoral thesis, 2018. http://hdl.handle.net/2158/1129455.
Full textBook chapters on the topic "Multimodal NLP"
Kubsch, Marcus, Daniela Caballero, and Pablo Uribe. "Once More with Feeling: Emotions in Multimodal Learning Analytics." In The Multimodal Learning Analytics Handbook. Springer International Publishing, 2022. http://dx.doi.org/10.1007/978-3-031-08076-0_11.
Full textJohnson, David, Nick Dragojlovic, Nicola Kopac, et al. "EXPECT-NLP: An Integrated Pipeline and User Interface for Exploring Patient Preferences Directly from Patient-Generated Text." In Multimodal AI in Healthcare. Springer International Publishing, 2022. http://dx.doi.org/10.1007/978-3-031-14771-5_6.
Full textLiu, Yicheng. "Multimodal NLP and Artificial Intelligence: Cross-Media Information Understanding and Generation." In Advances in Social Science, Education and Humanities Research. Atlantis Press SARL, 2024. https://doi.org/10.2991/978-2-38476-327-6_24.
Full textModi, Sangita S., and Sudhir B. Jagtap. "Multimodal Web Content Mining to Filter Non-learning Sites Using NLP." In Lecture Notes on Data Engineering and Communications Technologies. Springer International Publishing, 2019. http://dx.doi.org/10.1007/978-3-030-24643-3_3.
Full textSaba, N. S., Kumari Anjali, Akansha Tanu, Aryan Porwal, Ahan Tejaswi, and R. Sindhu Rajendran. "NLP in Social Media Data Processing." In Advances in Computational Intelligence and Robotics. IGI Global, 2025. https://doi.org/10.4018/979-8-3693-2935-1.ch007.
Full textPaolozzi, Stefano, Fernando Ferri, and Patrizia Grifoni. "Improving Multimedia Digital Libraries Usability Applying NLP Sentence Similarity to Multimodal Sentences." In Handbook of Research on Digital Libraries. IGI Global, 2009. http://dx.doi.org/10.4018/978-1-59904-879-6.ch022.
Full textde Hond, Anne, Marieke van Buchem, Claudio Fanconi, et al. "Predicting Depression Risk in Patients with Cancer Using Multimodal Data." In Caring is Sharing – Exploiting the Value in Data for Health and Innovation. IOS Press, 2023. http://dx.doi.org/10.3233/shti230274.
Full textSamuthira Pandi, V., and Shobana D. "Incorporation of NLP techniques to facilitate intuitive user interactions with prosthetic devices." In The Role of Artificial Intelligence in Advanced Prosthetics and Implantable Devices. RADemics Research Institute, 2025. https://doi.org/10.71443/9789349552975-06.
Full textJain, Raghav, Tulika Saha, and Sriparna Saha. "T-VAKS: A Tutoring-Based Multimodal Dialog System via Knowledge Selection." In Frontiers in Artificial Intelligence and Applications. IOS Press, 2023. http://dx.doi.org/10.3233/faia230388.
Full textHidayatullah, Ahmad Fathan, Kassim Kalinaki, Haji Gul, Rufai Zakari Yusuf, and Wasswa Shafik. "Leveraging Natural Language Processing for Enhanced Text Analysis in Business Intelligence." In Advances in Computational Intelligence and Robotics. IGI Global, 2024. http://dx.doi.org/10.4018/979-8-3693-5288-5.ch006.
Full textConference papers on the topic "Multimodal NLP"
Bonnier, Thomas. "Error Detection for Multimodal Classification." In Proceedings of the 5th Workshop on Trustworthy NLP (TrustNLP 2025). Association for Computational Linguistics, 2025. https://doi.org/10.18653/v1/2025.trustnlp-main.6.
Full textChiguru, Aparna, and Rajchandar K. "Revolutionizing NLP: Multimodal Integration for Enhanced Image-to-Text Extraction." In 2024 3rd International Conference on Computational Modelling, Simulation and Optimization (ICCMSO). IEEE, 2024. http://dx.doi.org/10.1109/iccmso61761.2024.00093.
Full textBaghalizadeh-Moghadam, Neda, Frédéric Cuppens, and Nora Boulahia-Cuppens. "An NLP-Based Framework Leveraging Email and Multimodal User Data." In 22nd International Conference on Security and Cryptography. SCITEPRESS - Science and Technology Publications, 2025. https://doi.org/10.5220/0013524000003979.
Full textWang, Jiawen, Longfei Zuo, Siyao Peng, and Barbara Plank. "MultiClimate: Multimodal Stance Detection on Climate Change Videos." In Proceedings of the Third Workshop on NLP for Positive Impact. Association for Computational Linguistics, 2024. http://dx.doi.org/10.18653/v1/2024.nlp4pi-1.27.
Full textHorawalavithana, Sameera, Sai Munikoti, Ian Stewart, Henry Kvinge, and Karl Pazdernik. "SCITUNE: Aligning Large Language Models with Human-Curated Scientific Multimodal Instructions." In Proceedings of the 1st Workshop on NLP for Science (NLP4Science). Association for Computational Linguistics, 2024. http://dx.doi.org/10.18653/v1/2024.nlp4science-1.7.
Full textGupta, Tanay, Tushar Goel, and Ishan Verma. "Exploring Multimodal Language Models for Sustainability Disclosure Extraction: A Comparative Study." In The Sixth Workshop on Insights from Negative Results in NLP. Association for Computational Linguistics, 2025. https://doi.org/10.18653/v1/2025.insights-1.13.
Full textRawte, Vipula, Sarthak Jain, Aarush Sinha, et al. "ViBe: A Text-to-Video Benchmark for Evaluating Hallucination in Large Multimodal Models." In Proceedings of the 5th Workshop on Trustworthy NLP (TrustNLP 2025). Association for Computational Linguistics, 2025. https://doi.org/10.18653/v1/2025.trustnlp-main.15.
Full textRazzhigaev, Anton, Maxim Kurkin, Elizaveta Goncharova, et al. "OmniDialog: A Multimodal Benchmark for Generalization Across Text, Visual, and Audio Modalities." In Proceedings of the 2nd GenBench Workshop on Generalisation (Benchmarking) in NLP. Association for Computational Linguistics, 2024. http://dx.doi.org/10.18653/v1/2024.genbench-1.12.
Full textBabu, Alby, Dharshini T, Gayathry Krishnan V.S, Ummu Haiman V.P, Annie Julie Joseph, and Rajesh K.R. "Multimodal Emotion Analysis Using Integrating NLP, AI, and Facial Expression Recognition for Enhanced Emotion Detection." In 2024 IEEE International Conference on Signal Processing, Informatics, Communication and Energy Systems (SPICES). IEEE, 2024. https://doi.org/10.1109/spices62143.2024.10779754.
Full textP, Priyanka, and Balachander T. "Multimodal Fusion for Coherent Description Generation: A System Integrating NLP, Computer Vision, and Speech Recognition." In 2025 International Conference on Computational Robotics, Testing and Engineering Evaluation (ICCRTEE). IEEE, 2025. https://doi.org/10.1109/iccrtee64519.2025.11053045.
Full textReports on the topic "Multimodal NLP"
Pande, Vikram. Looking to Collaborate on ML Research (NLP / Multimodal AI). ResearchHub Technologies, Inc., 2025. https://doi.org/10.55277/researchhub.tr6juazr.
Full text