Journal articles on the topic 'Visual question generation'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the top 50 journal articles for your research on the topic 'Visual question generation.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Browse journal articles on a wide variety of disciplines and organise your bibliography correctly.
Patil, Charulata, and Manasi Patwardhan. "Visual Question Generation." ACM Computing Surveys 53, no. 3 (2020): 1–22. http://dx.doi.org/10.1145/3383465.
Full textLiu, Hongfei, Jiali Chen, Wenhao Fang, Jiayuan Xie, and Yi Cai. "Category-Guided Visual Question Generation (Student Abstract)." Proceedings of the AAAI Conference on Artificial Intelligence 37, no. 13 (2023): 16262–63. http://dx.doi.org/10.1609/aaai.v37i13.26991.
Full textXie, Jiayuan, Mengqiu Cheng, Xinting Zhang, et al. "Explicitly Guided Difficulty-Controllable Visual Question Generation." Proceedings of the AAAI Conference on Artificial Intelligence 39, no. 24 (2025): 25552–60. https://doi.org/10.1609/aaai.v39i24.34745.
Full textMi, Li, Syrielle Montariol, Javiera Castillo Navarro, Xianjie Dai, Antoine Bosselut, and Devis Tuia. "ConVQG: Contrastive Visual Question Generation with Multimodal Guidance." Proceedings of the AAAI Conference on Artificial Intelligence 38, no. 5 (2024): 4207–15. http://dx.doi.org/10.1609/aaai.v38i5.28216.
Full textSarrouti, Mourad, Asma Ben Abacha, and Dina Demner-Fushman. "Goal-Driven Visual Question Generation from Radiology Images." Information 12, no. 8 (2021): 334. http://dx.doi.org/10.3390/info12080334.
Full textPang, Wei, and Xiaojie Wang. "Visual Dialogue State Tracking for Question Generation." Proceedings of the AAAI Conference on Artificial Intelligence 34, no. 07 (2020): 11831–38. http://dx.doi.org/10.1609/aaai.v34i07.6856.
Full textSrinivas, Dr Rhea. "VISUAL QUESTION ANSWERING." International Scientific Journal of Engineering and Management 04, no. 04 (2025): 1–7. https://doi.org/10.55041/isjem03029.
Full textKamala, M. "Visual Question Generation from Remote Sensing Images Using Gemini API." International Journal for Research in Applied Science and Engineering Technology 12, no. 3 (2024): 2924–29. http://dx.doi.org/10.22214/ijraset.2024.59537.
Full textKachare, Atul, Mukesh Kalla, and Ashutosh Gupta. "Visual Question Generation Answering (VQG-VQA) using Machine Learning Models." WSEAS TRANSACTIONS ON SYSTEMS 22 (June 28, 2023): 663–70. http://dx.doi.org/10.37394/23202.2023.22.67.
Full textSandhya, Vidyashankar, Vahi Rakshit, Karkhanis Yash, and Srinivasa Gowri. "Vis Quelle: Visual Question-based Elementary Learning Companion a system to Facilitate Learning Word-Object Associations." International Journal of Innovative Technology and Exploring Engineering (IJITEE) 11, no. 1 (2021): 41–49. https://doi.org/10.35940/ijitee.A9599.1111121.
Full textZhu, He, Ren Togo, Takahiro Ogawa, and Miki Haseyama. "Diversity Learning Based on Multi-Latent Space for Medical Image Visual Question Generation." Sensors 23, no. 3 (2023): 1057. http://dx.doi.org/10.3390/s23031057.
Full textBoukhers, Zeyd, Timo Hartmann, and Jan Jürjens. "COIN: Counterfactual Image Generation for Visual Question Answering Interpretation." Sensors 22, no. 6 (2022): 2245. http://dx.doi.org/10.3390/s22062245.
Full textYu, Ting, Zixuan Tong, Jun Yu, and Ke Zhang. "Fine-grained Adaptive Visual Prompt for Generative Medical Visual Question Answering." Proceedings of the AAAI Conference on Artificial Intelligence 39, no. 9 (2025): 9662–70. https://doi.org/10.1609/aaai.v39i9.33047.
Full textCai, Shuo, Xinzhe Han, and Shuhui Wang. "Divide-and-Conquer: Tree-structured Strategy with Answer Distribution Estimator for Goal-Oriented Visual Dialogue." Proceedings of the AAAI Conference on Artificial Intelligence 39, no. 2 (2025): 1917–25. https://doi.org/10.1609/aaai.v39i2.32187.
Full textShridhar, Mohit, Dixant Mittal, and David Hsu. "INGRESS: Interactive visual grounding of referring expressions." International Journal of Robotics Research 39, no. 2-3 (2020): 217–32. http://dx.doi.org/10.1177/0278364919897133.
Full textKim, Incheol. "Visual Experience-Based Question Answering with Complex Multimodal Environments." Mathematical Problems in Engineering 2020 (November 19, 2020): 1–18. http://dx.doi.org/10.1155/2020/8567271.
Full textGuo, Zihan, Dezhi Han, and Kuan-Ching Li. "Double-layer affective visual question answering network." Computer Science and Information Systems, no. 00 (2020): 38. http://dx.doi.org/10.2298/csis200515038g.
Full textSingh, Anjali, Ruhi Sharma Mittal, Shubham Atreja, et al. "Automatic Generation of Leveled Visual Assessments for Young Learners." Proceedings of the AAAI Conference on Artificial Intelligence 33 (July 17, 2019): 9713–20. http://dx.doi.org/10.1609/aaai.v33i01.33019713.
Full textLong, Xinwei, Zhiyuan Ma, Ermo Hua, Kaiyan Zhang, Biqing Qi, and Bowen Zhou. "Retrieval-Augmented Visual Question Answering via Built-in Autoregressive Search Engines." Proceedings of the AAAI Conference on Artificial Intelligence 39, no. 23 (2025): 24723–31. https://doi.org/10.1609/aaai.v39i23.34653.
Full textKim, Jung-Jun, Dong-Gyu Lee, Jialin Wu, Hong-Gyu Jung, and Seong-Whan Lee. "Visual question answering based on local-scene-aware referring expression generation." Neural Networks 139 (July 2021): 158–67. http://dx.doi.org/10.1016/j.neunet.2021.02.001.
Full textLiu, Yuhang, Daowan Peng, Wei Wei, Yuanyuan Fu, Wenfeng Xie, and Dangyang Chen. "Detection-Based Intermediate Supervision for Visual Question Answering." Proceedings of the AAAI Conference on Artificial Intelligence 38, no. 12 (2024): 14061–68. http://dx.doi.org/10.1609/aaai.v38i12.29315.
Full textFeng, Chun-Mei, Yang Bai, Tao Luo, et al. "VQA4CIR: Boosting Composed Image Retrieval with Visual Question Answering." Proceedings of the AAAI Conference on Artificial Intelligence 39, no. 3 (2025): 2942–50. https://doi.org/10.1609/aaai.v39i3.32301.
Full textGhosh, Akash, Arkadeep Acharya, Raghav Jain, Sriparna Saha, Aman Chadha, and Setu Sinha. "CLIPSyntel: CLIP and LLM Synergy for Multimodal Question Summarization in Healthcare." Proceedings of the AAAI Conference on Artificial Intelligence 38, no. 20 (2024): 22031–39. http://dx.doi.org/10.1609/aaai.v38i20.30206.
Full text闫, 婧昕. "Sub-Med VQA: A Medical Visual Question Answering Model Integrating Sub-Question Generation and Multimodal Reasoning." Statistics and Application 14, no. 02 (2025): 115–25. https://doi.org/10.12677/sa.2025.142041.
Full textZhang, Lizong, Haojun Yin, Bei Hui, Sijuan Liu, and Wei Zhang. "Knowledge-Based Scene Graph Generation with Visual Contextual Dependency." Mathematics 10, no. 14 (2022): 2525. http://dx.doi.org/10.3390/math10142525.
Full textZhang, Weifeng, Jing Yu, Wenhong Zhao, and Chuan Ran. "DMRFNet: Deep Multimodal Reasoning and Fusion for Visual Question Answering and explanation generation." Information Fusion 72 (August 2021): 70–79. http://dx.doi.org/10.1016/j.inffus.2021.02.006.
Full textLim, Youngsun, Hojun Choi, and Hyunjung Shim. "Evaluating Image Hallucination in Text-to-Image Generation with Question-Answering." Proceedings of the AAAI Conference on Artificial Intelligence 39, no. 25 (2025): 26290–98. https://doi.org/10.1609/aaai.v39i25.34827.
Full textZhu, He, Ren Togo, Takahiro Ogawa, and Miki Haseyama. "Multimodal Natural Language Explanation Generation for Visual Question Answering Based on Multiple Reference Data." Electronics 12, no. 10 (2023): 2183. http://dx.doi.org/10.3390/electronics12102183.
Full textYi, Ziruo, Ting Xiao, and Mark V. Albert. "A Survey on Multimodal Large Language Models in Radiology for Report Generation and Visual Question Answering." Information 16, no. 2 (2025): 136. https://doi.org/10.3390/info16020136.
Full textKruchinin, Vladimir, and Vladimir Kuzovkin. "Overview of Existing Methods for Automatic Generation of Tasks with Conditions in Natural Language." Computer tools in education, no. 1 (March 28, 2022): 85–96. http://dx.doi.org/10.32603/2071-2340-2022-1-85-96.
Full textELSHAMY, Ghada, Marco ALFONSE, Islam HEGAZY, and Mostafa AREF. "A multi-modal transformer-based model for generative visual dialog system." Applied Computer Science 21, no. 1 (2025): 1–17. https://doi.org/10.35784/acs_6856.
Full textLi, Xiaochuan, Baoyu Fan, Runze Zhang, et al. "Image Content Generation with Causal Reasoning." Proceedings of the AAAI Conference on Artificial Intelligence 38, no. 12 (2024): 13646–54. http://dx.doi.org/10.1609/aaai.v38i12.29269.
Full textTanaka, Ryota, Kyosuke Nishida, and Sen Yoshida. "VisualMRC: Machine Reading Comprehension on Document Images." Proceedings of the AAAI Conference on Artificial Intelligence 35, no. 15 (2021): 13878–88. http://dx.doi.org/10.1609/aaai.v35i15.17635.
Full textKamala Mekala. "Enhancing VQA with SELM: A Multi-Model Approach Using SBERT." Journal of Information Systems Engineering and Management 10, no. 41s (2025): 795–808. https://doi.org/10.52783/jisem.v10i41s.8003.
Full textWörgötter, Florentin, Ernst Niebur, and Christof Koch. "Generation of Direction Selectivity by Isotropic Intracortical Connections." Neural Computation 4, no. 3 (1992): 332–40. http://dx.doi.org/10.1162/neco.1992.4.3.332.
Full textZhu, Qiaoyi. "A Study of the Aesthetic Art of New Patriotism in Red Film and Television Drama." International Journal of Education, Humanities and Social Sciences 1, no. 1 (2024): 16–22. http://dx.doi.org/10.70088/zb1sr964.
Full textWang, Junjue, Zhuo Zheng, Zihang Chen, Ailong Ma, and Yanfei Zhong. "EarthVQA: Towards Queryable Earth via Relational Reasoning-Based Remote Sensing Visual Question Answering." Proceedings of the AAAI Conference on Artificial Intelligence 38, no. 6 (2024): 5481–89. http://dx.doi.org/10.1609/aaai.v38i6.28357.
Full textBELZ, A., T. L. BERG, and L. YU. "From image to language and back again." Natural Language Engineering 24, no. 3 (2018): 325–62. http://dx.doi.org/10.1017/s1351324918000086.
Full textAbrecht, Stephanie, Lydia Gauerhof, Christoph Gladisch, Konrad Groh, Christian Heinzemann, and Matthias Woehrle. "Testing Deep Learning-based Visual Perception for Automated Driving." ACM Transactions on Cyber-Physical Systems 5, no. 4 (2021): 1–28. http://dx.doi.org/10.1145/3450356.
Full textCheng, Zesen, Kehan Li, Peng Jin, et al. "Parallel Vertex Diffusion for Unified Visual Grounding." Proceedings of the AAAI Conference on Artificial Intelligence 38, no. 2 (2024): 1326–34. http://dx.doi.org/10.1609/aaai.v38i2.27896.
Full textPettitt, Joanne. "Visual-Textual Encounters with a German Grandfather: The Work of Angela Findlay." Jewish Film & New Media: An International Journal 11, no. 1 (2023): 90–115. http://dx.doi.org/10.1353/jfn.2023.a937530.
Full textKhademi, Mahmoud, and Oliver Schulte. "Deep Generative Probabilistic Graph Neural Networks for Scene Graph Generation." Proceedings of the AAAI Conference on Artificial Intelligence 34, no. 07 (2020): 11237–45. http://dx.doi.org/10.1609/aaai.v34i07.6783.
Full textLiu, Xiulong, Sudipta Paul, Moitreya Chatterjee, and Anoop Cherian. "CAVEN: An Embodied Conversational Agent for Efficient Audio-Visual Navigation in Noisy Environments." Proceedings of the AAAI Conference on Artificial Intelligence 38, no. 4 (2024): 3765–73. http://dx.doi.org/10.1609/aaai.v38i4.28167.
Full textZhao, Chengfang, Mingwei Tang, Yanxi Zheng, and Chaocong Ran. "An Adaptive Multimodal Fusion Network Based on Multilinear Gradients for Visual Question Answering." Electronics 14, no. 1 (2024): 9. https://doi.org/10.3390/electronics14010009.
Full textZhou, Luowei, Hamid Palangi, Lei Zhang, Houdong Hu, Jason Corso, and Jianfeng Gao. "Unified Vision-Language Pre-Training for Image Captioning and VQA." Proceedings of the AAAI Conference on Artificial Intelligence 34, no. 07 (2020): 13041–49. http://dx.doi.org/10.1609/aaai.v34i07.7005.
Full textKatz, Chaim N., Kramay Patel, Omid Talakoub, David Groppe, Kari Hoffman, and Taufik A. Valiante. "Differential Generation of Saccade, Fixation, and Image-Onset Event-Related Potentials in the Human Mesial Temporal Lobe." Cerebral Cortex 30, no. 10 (2020): 5502–16. http://dx.doi.org/10.1093/cercor/bhaa132.
Full textKumar Singh, Ashutosh, Anish Khobragade, and Vikas Kanake. "IMAGE STORY: ENHANCED COGNITIVE VISUAL NARRATIVE SYSTEM." International Journal of Advanced Research 13, no. 06 (2025): 1218–31. https://doi.org/10.21474/ijar01/21185.
Full textReddy, Revant Gangi, Xilin Rui, Manling Li, et al. "MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding." Proceedings of the AAAI Conference on Artificial Intelligence 36, no. 10 (2022): 11200–11208. http://dx.doi.org/10.1609/aaai.v36i10.21370.
Full textSejati, Sadewa Purba, and Ifnu Rifki Nurhidayanto. "Peningkatan Literasi Sumber Daya Air Tanah Menggunakan Media Interaktif Berbasis Android." Dinamisia : Jurnal Pengabdian Kepada Masyarakat 6, no. 6 (2022): 1454–60. http://dx.doi.org/10.31849/dinamisia.v6i6.11118.
Full textRestrepo, David, Chenwei Wu, Zhengxu Tang, et al. "Multi-OphthaLingua: A Multilingual Benchmark for Assessing and Debiasing LLM Ophthalmological QA in LMICs." Proceedings of the AAAI Conference on Artificial Intelligence 39, no. 27 (2025): 28321–30. https://doi.org/10.1609/aaai.v39i27.35053.
Full text