Academic literature on the topic 'AI video synthesis'

Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles

Select a source type:

Consult the lists of relevant articles, books, theses, conference reports, and other scholarly sources on the topic 'AI video synthesis.'

Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.

You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.

Journal articles on the topic "AI video synthesis"

1

A P, Aiswarya. "Text to Video Generation Using Generative AI for Interior Design Visualization." INTERANTIONAL JOURNAL OF SCIENTIFIC RESEARCH IN ENGINEERING AND MANAGEMENT 09, no. 03 (2025): 1–9. https://doi.org/10.55041/ijsrem42124.

Full text
Abstract:
The emerging discipline of text-to-video synthesis combines computer vision and natural language understanding to create coherent, realistic videos that are based on written descriptions. The research is an endeavour to provide a bridge between the fields of computer vision and natural language processing by using a robust text-to-video production system. The system's main goal is to convert text prompts into visually appealing videos using pre-trained models and style transfer techniques, providing a fresh approach to content development. The method demonstrates flexibility and effectiveness
APA, Harvard, Vancouver, ISO, and other styles
2

Rana, Jignesh. "Ai-Studios." International Journal for Research in Applied Science and Engineering Technology 13, no. 2 (2025): 563–68. https://doi.org/10.22214/ijraset.2025.66893.

Full text
Abstract:
Ai-Studios, a system that combines large language models with Stable Diffusion techniques to craft captivating poems and stories based on user prompts. This innovative system begins with user-provided prompts and offers the choice between poetry and narratives. Advanced language models generate rich textual content, forming the foundation of our creative journey. To translate this text into visually stunning experiences, Stable Diffusion models transform each sentence into vivid images with high accuracy. By using cross-attention layers, these models offer flexibility in responding to differen
APA, Harvard, Vancouver, ISO, and other styles
3

Li, Yaowei, Xintao Wang, Zhaoyang Zhang, et al. "Image Conductor: Precision Control for Interactive Video Synthesis." Proceedings of the AAAI Conference on Artificial Intelligence 39, no. 5 (2025): 5031–38. https://doi.org/10.1609/aaai.v39i5.32533.

Full text
Abstract:
Filmmaking and animation production often require sophisticated techniques for coordinating camera transitions and object movements, typically involving labor-intensive real-world capturing. Despite advancements in generative AI for video creation, achieving precise control over motion for interactive video asset generation remains challenging. To this end, we propose Image Conductor, a method for precise control of camera transitions and object movements to generate video assets from a single image. An well-cultivated training strategy is proposed to separate distinct camera and object motion
APA, Harvard, Vancouver, ISO, and other styles
4

KJ, Karthik. "A SURVEY ON AI-CONTENT GENERATOR." INTERANTIONAL JOURNAL OF SCIENTIFIC RESEARCH IN ENGINEERING AND MANAGEMENT 09, no. 01 (2025): 1–9. https://doi.org/10.55041/ijsrem41078.

Full text
Abstract:
This survey provides an in-depth exploration of AI-driven content generation, covering key areas such as text creation, image synthesis, video generation, and automated coding. By examining advancements in technologies like Large Language Models (LLMs), GANs, and diffusion models, the study highlights AI's role in transforming diverse fields. Text generation technologies are enabling structured, creative, and conversational outputs, while image and video synthesis models like Imagen and Phenaki are setting new benchmarks for visual quality and realism. In code generation, tools like ChatGPT an
APA, Harvard, Vancouver, ISO, and other styles
5

Azieiev, Serhii. "ARTIFICIAL INTELLIGENCE TOOLS IN JOURNALISTS’ WORK WITH AUDIOVISUAL CONTENT." Dialog: media studios, no. 30 (December 13, 2024): 7–22. https://doi.org/10.18524/2308-3255.2024.30.318416.

Full text
Abstract:
The rapid development of artificial intelligence (AI) is significantly transforming the creation, processing, and analysis of audiovisual content in journalism. Neural networks, machine learning algorithms, and other intelligent technologies enable the automation of many routine tasks – from information gathering and processing to video editing and voice synthesis – enhancing the efficiency of journalists’ work. This article examines key AI tools used in journalistic activities, including automatic speech transcription systems, AI-driven text and image generation, facial and object recognition
APA, Harvard, Vancouver, ISO, and other styles
6

Dey, Biswajit, and Rajdeep paul. "Integrating Indian Sign Language Recognition with Real-Time Speech Synthesis for video conferences." INTERANTIONAL JOURNAL OF SCIENTIFIC RESEARCH IN ENGINEERING AND MANAGEMENT 09, no. 03 (2025): 1–9. https://doi.org/10.55041/ijsrem42865.

Full text
Abstract:
Both hearing and deaf people commonly face major communication hurdles in their daily lives. To solve, this study presents a real-time video calling system that uses ai model to recognize Indian Sign Language (ISL). Peers are connected via WebSockets, and video data is shared with the AI model for identification. Our approach captures 30 frames a second and buffers them as groups of 3 seconds that a backend AI model interprets. The application using grid fragmentation-based splitting and k-NN prediction detects the hand movements very accurately and translates these movements to textual equiva
APA, Harvard, Vancouver, ISO, and other styles
7

Luo, Ziqian, Feiyang Chen, Xiaoyang Chen, and Xueting Pan. "A Novel Framework for Text-Image Pair to Video Generation in Music Anime Douga (MAD) Production." Artificial Intelligence Advances 6, no. 1 (2024): 25–33. http://dx.doi.org/10.30564/aia.v6i1.6848.

Full text
Abstract:
The rapid growth of digital media has driven advancements in multimedia generation, notably in Music Anime Douga (MAD), which blends animation with music. Creating MADs currently requires extensive manual labor, particularly for designing critical frames. Existing methods like GANs and transformers excel at text-to-video synthesis but lack the precision needed for artistic control in MADs. They often neglect the crucial hand-drawn frames that form the visual foundation of these videos. This paper introduces a novel framework for generating high-quality videos from text-image pairs, addressing
APA, Harvard, Vancouver, ISO, and other styles
8

Meshram, Sahil. "Genius AI A Unified Platform for Text, Image, Audio, Video, and Code AI." International Journal for Research in Applied Science and Engineering Technology 13, no. 6 (2025): 825–29. https://doi.org/10.22214/ijraset.2025.71461.

Full text
Abstract:
The rapid evolution of artificial intelligence (AI) has led to the development of specialized models across different modalities such as text, image, video, audio, and program code. This paper presents the design and conceptual framework for a multimodal AI platform that harmoniously brings together multiple AI systems into a single, user-friendly. The proposed platform leverages state-of-the-art AI models, each tailored for a specific modality—Natural Language Processing (NLP) models for text understanding and generation, Computer Vision models for image analysis and synthesis, Generative Vid
APA, Harvard, Vancouver, ISO, and other styles
9

Donika, Valcheva, Kalushkov Teodor, and Shipkovenski Georgi. "Research on Motion Capture Technologies and AI Video Synthesis for Creating Digital Bulgarian Folk Choreographies." BRAIN. Broad Research in Artificial Intelligence and Neuroscience 16, Special Issue 1 (2025): 117–26. https://doi.org/10.70594/brain/16.S1/10.

Full text
Abstract:
The article is focused on the following three scientific and applied activities: research into motion capture technologies, analysis of applications for AI video synthesis, and developing a methodology for creating digital choreographies of Bulgarian folk dances. The possibilities for automation through AI and MoCap were assessed, including extraction, adaptation, and synchronisation of choreographic movements with virtual avatars using retargeting technologies. When analysing the leading AI-based applications for markerless MoCap, it was found that they differ in the detail of motion capture,
APA, Harvard, Vancouver, ISO, and other styles
10

Uppin, Mr Rohit B. "Introduction to Generative AI and its application in Education." International Journal for Research in Applied Science and Engineering Technology 12, no. 1 (2024): 861–66. http://dx.doi.org/10.22214/ijraset.2024.57563.

Full text
Abstract:
Abstract: Generative AI has made significant progress in re- cent years, with a growing range of applications in a variety of fields. Generative AI applications have catalyzed a new erain the synthesis and manipulation of digital content. Genera- tive AIis very recent technology which changed the way tradi-tional search engines work. The search engines work on the principles of information retrieval. However, openGL came up with use of Artificial Intelligence (AI) for synthesis of digital content and launched well known asChatGPT. The GenerativeAI differsfrom traditional AL as it takes text ,a
APA, Harvard, Vancouver, ISO, and other styles
More sources

Dissertations / Theses on the topic "AI video synthesis"

1

Boutet, de Monvel Violaine. "Du feedback vidéo à l'IA générative : sur la récursivité dans les arts et médias." Electronic Thesis or Diss., Paris 3, 2025. http://www.theses.fr/2025PA030009.

Full text
Abstract:
Cette thèse érige, sous le prisme du feedback, un pont entre l’art vidéo pionnier des années 1960 à 1980 et les pratiques en lien avec l’IA générative, que les avancées phénoménales de l’apprentissage profond ont précipitées depuis le milieu des années 2010. La rétroaction renvoie en cybernétique à l’autorégulation par la boucle de systèmes naturels et technologiques. Appliqué à des dispositifs analogiques, numériques ou hybrides en circuit fermé, ce processus automatisé qualifie aussi les effets contingents qui en résultent à l'écran. La première partie revient sur l’influence colossale que l
APA, Harvard, Vancouver, ISO, and other styles

Book chapters on the topic "AI video synthesis"

1

Leiker, Daniel, Ashley Ricker Gyllen, Ismail Eldesouky, and Mutlu Cukurova. "Generative AI for Learning: Investigating the Potential of Learning Videos with Synthetic Virtual Instructors." In Artificial Intelligence in Education. Posters and Late Breaking Results, Workshops and Tutorials, Industry and Innovation Tracks, Practitioners, Doctoral Consortium and Blue Sky. Springer Nature Switzerland, 2023. http://dx.doi.org/10.1007/978-3-031-36336-8_81.

Full text
APA, Harvard, Vancouver, ISO, and other styles
2

Shah, Shrishti, Shubhasri Tadepalli, Lalitha Tanmai Vaddiparthi, Nishat Afshan Ansari, and Ankit A. Bhurane. "Generative AI for Text to Image." In Advances in Media, Entertainment, and the Arts. IGI Global, 2024. http://dx.doi.org/10.4018/979-8-3693-1950-5.ch002.

Full text
Abstract:
Text-to-image (TTI) synthesis models represent a creative approach in the realm of artificial intelligence, specifically designed to transform textual input into visually realistic images. The essence of TTI generation lies in its ability to harness the power of language and convert it seamlessly into visually compelling content, showcasing creative image synthesis. Initially using GANs and transformers, text-to-image generation evolved with diffusion models introducing noise. Integration with large models, TTI models now produce results near-real images. Breakthroughs like ControlNet and 3D o
APA, Harvard, Vancouver, ISO, and other styles
3

Wadood, Asim. "Generative AI From Theory to Model." In Deep Learning, Reinforcement Learning, and the Rise of Intelligent Systems. IGI Global, 2024. http://dx.doi.org/10.4018/979-8-3693-1738-9.ch004.

Full text
Abstract:
This book chapter provides a comprehensive overview of generative AI and its applications in computer vision. The introduction section elucidates the concept of generative AI and underscores its importance within the realm of artificial intelligence. The chapter also provides a deep dive into the various techniques used in generative AI, such as creative style transfer, forecasting subsequent video frames, enhancing image resolution, enabling interactive image generation, facilitating image-to-image translation, text-to-image synthesis, image inpainting, the generation of innovative animated c
APA, Harvard, Vancouver, ISO, and other styles
4

"Centaurs and Cyborgs." In Advances in Computational Intelligence and Robotics. IGI Global, 2025. https://doi.org/10.4018/979-8-3373-2518-7.ch003.

Full text
Abstract:
Generative AI has evolved digital content creation, yet its true potential emerges only through a deliberate synthesis of machine efficiency and human expertise. This chapter examines the imperative for hybridized skill sets through three key dimensions. First, it explains why hybridization is essential—illustrating that combining AI's rapid, scalable production with human oversight in technical literacy, creative intuition, and ethical judgment results in more nuanced, high-quality content than either element can produce alone. Second, the chapter explores practical applications and workflow
APA, Harvard, Vancouver, ISO, and other styles
5

Granville, Vincent. "Image and video generation." In Synthetic Data and Generative AI. Elsevier, 2024. http://dx.doi.org/10.1016/b978-0-44-321857-6.00008-4.

Full text
APA, Harvard, Vancouver, ISO, and other styles
6

Baskar, Sowmiya. "Generative AI." In Advances in Computational Intelligence and Robotics. IGI Global, 2025. https://doi.org/10.4018/979-8-3693-5623-4.ch006.

Full text
Abstract:
GenerativeAI (GenAI) is a buzz term in the field of Artificial Intelligence (AI). It is a branch of AI, which has the capability to create content in various forms such as text, audio and video, by leveraging patterns in existing data. GenAI utilizes variety of Machine Learning and Deep Learning algorithms. It offers an alternative to the traditional teaching-learning model by offering enhanced teaching and learning experiences. Learning is a challenge for special student categories like neurodiversity, where GenAI plays a vital role to achieve their academic goals by creating inclusive learni
APA, Harvard, Vancouver, ISO, and other styles
7

Vhatkar, Kapil, Yatri Davda, and Aman Prakash Singh. "Generative AI in Deepfakes." In Ecological and Human Dimensions of AI-Based Supply Chain. IGI Global, 2025. https://doi.org/10.4018/979-8-3693-7478-8.ch014.

Full text
Abstract:
Deepfakes, driven by advanced generative AI models like GANs and VAEs, create highly realistic synthetic media, ranging from manipulated faces and voices to entirely fabricated videos. Originally developed for creative expression and entertainment, deepfakes now raise significant concerns, including political manipulation, privacy violations, and the spread of false information. This chapter delves into the technology behind deepfakes, exploring the models, techniques, and tools used to produce them, while also examining emerging detection methods designed to identify such synthetic content. A
APA, Harvard, Vancouver, ISO, and other styles
8

Machado, Andreia Bem, João Rodrigues dos Santos, António Sacavém, Ramesh Sharma, and Rui Nunes Cruz. "Transforming Education." In Transforming Education With Generative AI. IGI Global, 2024. http://dx.doi.org/10.4018/979-8-3693-1351-0.ch018.

Full text
Abstract:
Generative AI systems are increasingly present in our daily lives, helping us make crucial decisions. They use machine learning algorithms and tools, fed with millions of data collected from the web, producing entirely new information and generating variations. And this is not just limited to texts — it can produce images, audio, videos, even code, or new programming languages. There are several fields where generative AI can have a considerable impact in the coming years. In this context, the issues proposed in this chapter are: What is generative AI? What is prompt engineering? How to transf
APA, Harvard, Vancouver, ISO, and other styles
9

Tribhuvan, Dr Padmapani P., and Amrapali P. Tribhuvan. "ARTIFICIAL INTELLIGENCE FOR DEEPFAKE CREATION AND DETECTION." In Futuristic Trends in Artificial Intelligence Volume 3 Book 11. Iterative International Publishers, Selfypage Developers Pvt Ltd, 2024. http://dx.doi.org/10.58532/v3bkai11p1ch2.

Full text
Abstract:
Deepfake technology involving AI has thrivingly emerged in recent years, enabling the creation of extremely realistic synthetic media, including forged audio and video content. While this technology promises several beneficial applications, it also poses significant challenges and concerns, particularly regarding its potential for misuse and manipulation. This research paper investigates the creation of deepfakes using artificial intelligence and explores various approaches for their detection and mitigation.
APA, Harvard, Vancouver, ISO, and other styles
10

Sicilia, María, Mariola Palazón, and María Jesús Acosta-López. "The Use of AI in Advertising Creativity." In Advances in Marketing, Customer Relationship Management, and E-Services. IGI Global, 2025. https://doi.org/10.4018/979-8-3693-3799-8.ch002.

Full text
Abstract:
The advances in technologies and the irruption of artificial intelligence (AI) represent a challenge for advertising agencies and for creative management. Generative AI may be used to enhance creativity and to create content that depicts a highly convincing version of reality. The use of AI in advertising has been related to terms such as synthetic advertising, computational creativity or intelligent advertising. This chapter aims to explain the use of artificial intelligence in the following areas of advertising creativity: text, image, voice, audio, music and video generation. It also addres
APA, Harvard, Vancouver, ISO, and other styles

Conference papers on the topic "AI video synthesis"

1

Wu, Xiaodong. "Influence of AI Technology on Speech Synthesis and Voice Cloning." In MHV '25: Mile-High Video Conference. ACM, 2025. https://doi.org/10.1145/3715675.3715806.

Full text
APA, Harvard, Vancouver, ISO, and other styles
2

Sugie, Satoko. "Exploring the potential of AI-supported instructional design and multimodal communication to promote a paradigm shift in Chinese language education." In XXnd International CALL Research Conference. Castledown Publishers, 2024. http://dx.doi.org/10.29140/9780648184485-41.

Full text
Abstract:
This study addresses a crucial issue in Chinese language education in Japanese universities, emphasizing the need to innovate from traditional teaching paradigm towards technology-enhanced approaches. By integrating AI-supported instructional designs grounded in Task-Based Language Teaching (TBLT) principles, the study aims to enhance the learning experience for novice Chinese learners. The incorporation of AI and ICT tools such as online whiteboards, ChatGPT for dialogue writing and translation, automatic speech synthesis tools for pronunciation practice, and video editing applications demons
APA, Harvard, Vancouver, ISO, and other styles
3

Weg, Joshua, Taehyung Wang, and Li Liu. "Interpretable AI-Generated Videos Detection using Deep Learning and Integrated Gradients." In 16th International Conference on Applied Human Factors and Ergonomics (AHFE 2025). AHFE International, 2025. https://doi.org/10.54941/ahfe1006041.

Full text
Abstract:
The rapid advancements in generative AI have led to text-to-video models creating highly realistic content, raising serious concerns about misinformation spread through synthetic videos. As these AI videos become more convincing, they threaten information integrity across social media, news, and digital communications. Using AI-generated videos, bad actors can now create false narratives, manipulate public opinion, and influence critical processes like elections. This technology's democratization means that sophisticated disinformation campaigns are no longer limited to well-resourced actors,
APA, Harvard, Vancouver, ISO, and other styles
4

Stöckl, Andreas, Tim Willaert, and Rimbert Rudisch-sommer. "Humans and AI writing lectures together." In 2025 Intelligent Human Systems Integration. AHFE International, 2025. https://doi.org/10.54941/ahfe1005809.

Full text
Abstract:
With the recent advancements in Generative Artificial Intelligence (GenAI) technologies, particularly Large Language Models (LLMs) like GPT4, there has been a significant shift in how information can be easily accessed, generated, and utilized. This study uses these advancements to create a tool where humans and AI generate complete lectures, encompassing the entire process from structure outlining and scriptwriting to slide creation and delivery via a digital avatar.The motivation behind this study comes from the challenges faced in the educational sector, including the time-consuming nature
APA, Harvard, Vancouver, ISO, and other styles
5

Breck, Dominik, Max Schlosser, Rico Thomanek, Christian Roschke, Matthias Vodel, and Marc Ritter. "Automated generation of synthetic person activity data for AI models training." In AHFE 2023 Hawaii Edition. AHFE International, 2023. http://dx.doi.org/10.54941/ahfe1004182.

Full text
Abstract:
Image and video analytic methods, such as the recognition of a person's activities on the basis of given image material, are of great importance both in research and in everyday life. For such complex methods, deep learning approaches are mostly used, which require training based on a high data foundation. The main problem with data sets used for these methods is the acquisition and complex annotation of video data suitable for training a model. Further problems arising from the use of real-world data lie in the non-compliance with basic data protection issues or the representation of one-side
APA, Harvard, Vancouver, ISO, and other styles
6

Rathee, Munish. "AI-Assisted Infrastructure Monitoring: Supplementing Human Inspections on Auckland Harbour Bridge." In 31st International Conference on Neural Information Processing. Tuwhera, 2025. https://doi.org/10.24135/iconip13.

Full text
Abstract:
Ensuring traffic safety on critical infrastructure such as the Auckland Harbour Bridge (AHB) is essential, particularly given its movable concrete barrier (MCB) system, which handles 154,000 vehicles daily on the AHB and helps manage traffic in similar scenarios in over 20 cities worldwide [1, 2]. The MCB system relies on connecting metal pins to secure 750 kg concrete segments together, and these pins can become dislodged due to external factors, posing significant safety risks. On-foot manual inspections are conducted to check for dislodged or at-risk pins, exposing workers to hazardous traf
APA, Harvard, Vancouver, ISO, and other styles
7

Cardoso Verhalen, Lívia Elias, Michele Marta Moraes Castro, and Cristiano Maciel. "Análise de ferramentas para geração de avatares artificiais a partir de pessoas reais." In Escola Regional de Informática de Goiás. Sociedade Brasileira de Computação, 2024. https://doi.org/10.5753/erigo.2024.4798.

Full text
Abstract:
Este artigo explora ferramentas atuais para a criação de avatares digitais feitos por Inteligências Artificiais generativas e suas diferentes funcionalidades e aspectos para a criação de um avatar a partir de uma das quatro ferramentas analisadas, sendo elas Synthesia AI Video Generator, FlexClip, Vidnoz e HeyGen. As ferramentas foram selecionadas a partir de buscas exploratórias na web e avaliadas pelos critérios: realismo, opções de personalização, idiomas e apoios visuais. O avatar criado é utilizado em um vídeo curto para demonstrar o funcionamento total da ferramenta.
APA, Harvard, Vancouver, ISO, and other styles
8

Albuquerque, Isabela, Joao Monteiro, and Tiago Falk. "Generating Videos by Traversing Image Manifolds Learned by GANs." In LatinX in AI at Neural Information Processing Systems Conference 2018. Journal of LatinX in AI Research, 2018. http://dx.doi.org/10.52591/lxai201812036.

Full text
Abstract:
In this work, we introduce a two-step framework for generative modeling of temporal data. Specifically, the generative adversarial networks (GANs) setting is employed to generate synthetic scenes of moving objects. To do so, we propose a two-step training scheme within which: a generator of static frames is trained first. Afterwards, a recurrent model is trained with the goal of providing a sequence of inputs to the previously trained frames generator, thus yielding scenes which look natural. The adversarial setting is employed in both training steps. However, with the aim of avoiding known tr
APA, Harvard, Vancouver, ISO, and other styles
9

Pataranutaporn, Pat, Chayapatr Archiwaranguprok, Samantha W. T. Chan, Elizabeth Loftus, and Pattie Maes. "Synthetic Human Memories: AI-Edited Images and Videos Can Implant False Memories and Distort Recollection." In CHI 2025: CHI Conference on Human Factors in Computing Systems. ACM, 2025. https://doi.org/10.1145/3706598.3713697.

Full text
APA, Harvard, Vancouver, ISO, and other styles
10

Gallen, R., L. Gambini, P. Gilligan, et al. "33 Novel use of a video frame interpolation algorithm for radiation dose reduction in the catheterisation laboratory using AI-generated synthetic angiographic images." In Irish Cardiac Society Annual Scientific Meeting & AGM, October 17th – 19th 2024, Europa Hotel, Belfast. BMJ Publishing Group Ltd and British Cardiovascular Society, 2024. http://dx.doi.org/10.1136/heartjnl-2024-ics.34.

Full text
APA, Harvard, Vancouver, ISO, and other styles

Reports on the topic "AI video synthesis"

1

Busch, Ella, and Jacob Ware. The Weaponization of Deepfakes: Digital Deception on the Far-Right. ICCT, 2023. http://dx.doi.org/10.19165/2023.2.07.

Full text
Abstract:
In an ever-evolving technological landscape, digital disinformation is on the rise, as are its political consequences. In this paper, we explore the creation and distribution of synthetic media by malign actors, specifically a form of artificial intelligence-machine learning (AI/ML) known as the deepfake. Individuals looking to incite political violence are increasingly turning to deepfakes–specifically deepfake video content–in order to create unrest, undermine trust in democratic institutions and authority figures, and elevate polarised political agendas. We present a new subset of individua
APA, Harvard, Vancouver, ISO, and other styles
We offer discounts on all premium plans for authors whose works are included in thematic literature selections. Contact us to get a unique promo code!