Academic literature on the topic 'Audio-visual scene analysis'

Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles

Select a source type:

Consult the lists of relevant articles, books, theses, conference reports, and other scholarly sources on the topic 'Audio-visual scene analysis.'

Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.

You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.

Journal articles on the topic "Audio-visual scene analysis"

1

Parekh, Sanjeel, Slim Essid, Alexey Ozerov, Ngoc Q. K. Duong, Patrick Perez, and Gael Richard. "Weakly Supervised Representation Learning for Audio-Visual Scene Analysis." IEEE/ACM Transactions on Audio, Speech, and Language Processing 28 (2020): 416–28. http://dx.doi.org/10.1109/taslp.2019.2957889.

Full text
APA, Harvard, Vancouver, ISO, and other styles
2

O’Donovan, Adam, Ramani Duraiswami, Dmitry Zotkin, and Nail Gumerov. "Audio visual scene analysis using spherical arrays and cameras." Journal of the Acoustical Society of America 127, no. 3 (2010): 1979. http://dx.doi.org/10.1121/1.3385079.

Full text
APA, Harvard, Vancouver, ISO, and other styles
3

Ahrens, Axel, and Kasper Duemose Lund. "Auditory spatial analysis in reverberant multi-talker environments with congruent and incongruent audio-visual room information." Journal of the Acoustical Society of America 152, no. 3 (2022): 1586–94. http://dx.doi.org/10.1121/10.0013991.

Full text
Abstract:
In a multi-talker situation, listeners have the challenge of identifying a target speech source out of a mixture of interfering background noises. In the current study, it was investigated how listeners analyze audio-visual scenes with varying complexity in terms of number of talkers and reverberation. The visual information of the room was either congruent with the acoustic room or incongruent. The listeners' task was to locate an ongoing speech source in a mixture of other speech sources. The three-dimensional audio-visual scenarios were presented using a loudspeaker array and virtual realit
APA, Harvard, Vancouver, ISO, and other styles
4

Motlicek, Petr, Stefan Duffner, Danil Korchagin, et al. "Real-Time Audio-Visual Analysis for Multiperson Videoconferencing." Advances in Multimedia 2013 (2013): 1–21. http://dx.doi.org/10.1155/2013/175745.

Full text
Abstract:
We describe the design of a system consisting of several state-of-the-art real-time audio and video processing components enabling multimodal stream manipulation (e.g., automatic online editing for multiparty videoconferencing applications) in open, unconstrained environments. The underlying algorithms are designed to allow multiple people to enter, interact, and leave the observable scene with no constraints. They comprise continuous localisation of audio objects and its application for spatial audio object coding, detection, and tracking of faces, estimation of head poses and visual focus of
APA, Harvard, Vancouver, ISO, and other styles
5

Gebru, Israel Dejene, Xavier Alameda-Pineda, Florence Forbes, and Radu Horaud. "EM Algorithms for Weighted-Data Clustering with Application to Audio-Visual Scene Analysis." IEEE Transactions on Pattern Analysis and Machine Intelligence 38, no. 12 (2016): 2402–15. http://dx.doi.org/10.1109/tpami.2016.2522425.

Full text
APA, Harvard, Vancouver, ISO, and other styles
6

Mulachela, Husen, Aurelius RL Teluma, and Eka Putri Paramita. "Gender Equality Messages in Film Marlina The Murderer In Four Acts." JCommsci - Journal of Media and Communication Science 2, no. 3 (2019): 136. http://dx.doi.org/10.29303/jcommsci.v2i3.57.

Full text
Abstract:
This research trying to analyze the meaning of symbols in the film Marlina The Murderer in Four Acts based on the indicators of gender equality namely, access, participation, control, and benefits. The unit of analysis in this study includes the audio and visual elements that exist in a selected scene for later analysis using the Roland Barthes semiotic method known as the "two order of signification" to find the meaning of denotation and connotation meanings and myths contained in both order systems. The whole series in this study refers to the framework of thinking with the aim of answering
APA, Harvard, Vancouver, ISO, and other styles
7

Xiao, Mei, May Wong, Michelle Umali, and Marc Pomplun. "Using Eye-Tracking to Study Audio — Visual Perceptual Integration." Perception 36, no. 9 (2007): 1391–95. http://dx.doi.org/10.1068/p5731.

Full text
Abstract:
Perceptual integration of audio—visual stimuli is fundamental to our everyday conscious experience. Eye-movement analysis may be a suitable tool for studying such integration, since eye movements respond to auditory as well as visual input. Previous studies have shown that additional auditory cues in visual-search tasks can guide eye movements more efficiently and reduce their latency. However, these auditory cues were task-relevant since they indicated the target position and onset time. Therefore, the observed effects may have been due to subjects using the cues as additional information to
APA, Harvard, Vancouver, ISO, and other styles
8

Nahorna, Olha, Frédéric Berthommier, and Jean-Luc Schwartz. "Audio-visual speech scene analysis: Characterization of the dynamics of unbinding and rebinding the McGurk effect." Journal of the Acoustical Society of America 137, no. 1 (2015): 362–77. http://dx.doi.org/10.1121/1.4904536.

Full text
APA, Harvard, Vancouver, ISO, and other styles
9

Habib, Muhammad Alhada Fuadilah, Asik Putri Ayusari Ratnaningsih, and Michael Jeffri Sinabutar. "SEMIOTICS ANALYSIS OF AHOK-DJAROT’S CAMPAIGN VIDEO ON YOUTUBE SOCIAL MEDIA FOR THE SECOND ROUND OF THE 2017 DKI JAKARTA GUBERNATORIAL ELECTION." Journal of Urban Sociology 4, no. 2 (2021): 76. http://dx.doi.org/10.30742/jus.v4i2.1772.

Full text
Abstract:
This study focuses on the messages conveyed in Ahok-Djarot’s campaign video on Youtube social media for the second round of the 2017 DKI Jakarta gubernatorial election by exploring and analyzing the elements of the icons, the indexes, the symbols, the lyrics, and the storyline using Peirce's semiotics based on the visual methodology. Various messages that have been conveyed through the video with the title “Video Kampanye Ahok-Djarot: Pastikan Pancasila Hadir di Jakarta” (Ahok-Djarot’s Campaign Video: Ensure Pancasila is Present in Jakarta) are very interesting to study because the video has b
APA, Harvard, Vancouver, ISO, and other styles
10

Ramenahalli, Sudarshan. "A Biologically Motivated, Proto-Object-Based Audiovisual Saliency Model." AI 1, no. 4 (2020): 487–509. http://dx.doi.org/10.3390/ai1040030.

Full text
Abstract:
The natural environment and our interaction with it are essentially multisensory, where we may deploy visual, tactile and/or auditory senses to perceive, learn and interact with our environment. Our objective in this study is to develop a scene analysis algorithm using multisensory information, specifically vision and audio. We develop a proto-object-based audiovisual saliency map (AVSM) for the analysis of dynamic natural scenes. A specialized audiovisual camera with 360∘ field of view, capable of locating sound direction, is used to collect spatiotemporally aligned audiovisual data. We demon
APA, Harvard, Vancouver, ISO, and other styles

Dissertations / Theses on the topic "Audio-visual scene analysis"

1

Parekh, Sanjeel. "Learning representations for robust audio-visual scene analysis." Thesis, Université Paris-Saclay (ComUE), 2019. http://www.theses.fr/2019SACLT015/document.

Full text
Abstract:
L'objectif de cette thèse est de concevoir des algorithmes qui permettent la détection robuste d’objets et d’événements dans des vidéos en s’appuyant sur une analyse conjointe de données audio et visuelle. Ceci est inspiré par la capacité remarquable des humains à intégrer les caractéristiques auditives et visuelles pour améliorer leur compréhension de scénarios bruités. À cette fin, nous nous appuyons sur deux types d'associations naturelles entre les modalités d'enregistrements audiovisuels (réalisés à l'aide d'un seul microphone et d'une seule caméra), à savoir la corrélation mouvement/audi
APA, Harvard, Vancouver, ISO, and other styles
2

Phillips, Nicola Jane. "Audio-visual scene analysis : attending to music in film." Thesis, University of Cambridge, 2000. https://www.repository.cam.ac.uk/handle/1810/251745.

Full text
APA, Harvard, Vancouver, ISO, and other styles
3

Alameda-Pineda, Xavier. "Egocentric Audio-Visual Scene Analysis : a machine learning and signal processing approach." Thesis, Grenoble, 2013. http://www.theses.fr/2013GRENM024/document.

Full text
Abstract:
Depuis les vingt dernières années, l'industrie a développé plusieurs produits commerciaux dotés de capacités auditives et visuelles. La grand majorité de ces produits est composée d'un caméscope et d'un microphone embarqué (téléphones portables, tablettes, etc). D'autres, comme la Kinect, sont équipés de capteurs de profondeur et/ou de petits réseaux de microphones. On trouve également des téléphones portables dotés d'un système de vision stéréo. En même temps, plusieurs systèmes orientés recherche sont apparus (par exemple, le robot humanoïde NAO). Du fait que ces systèmes sont compacts, leur
APA, Harvard, Vancouver, ISO, and other styles
4

Khalidov, Vasil. "Modèles de mélanges conjugués pour la modélisation de la perception visuelle et auditive." Grenoble, 2010. http://www.theses.fr/2010GRENM064.

Full text
Abstract:
Dans cette thèse, nous nous intéressons à la modélisation de la perception audio-visuelle avec une tête robotique. Les problèmes associés, notamment la calibration audio-visuelle, la détection, la localisation et le suivi d'objets audio-visuels sont étudiés. Une approche spatio-temporelle de calibration d'une tête robotique est proposée, basée sur une mise en correspondance probabiliste multimodale des trajectoires. Le formalisme de modèles de mélange conjugué est introduit ainsi qu'une famille d'algorithmes d'optimisation efficaces pour effectuer le regroupement multimodal. Un cas particulier
APA, Harvard, Vancouver, ISO, and other styles
5

Stauffer, Chris. "Automated Audio-visual Activity Analysis." 2005. http://hdl.handle.net/1721.1/30568.

Full text
Abstract:
Current computer vision techniques can effectively monitor gross activities in sparse environments. Unfortunately, visual stimulus is often not sufficient for reliably discriminating between many types of activity. In many cases where the visual information required for a particular task is extremely subtle or non-existent, there is often audio stimulus that is extremely salient for a particular classification or anomaly detection task. Unfortunately unlike visual events, independent sounds are often very ambiguous and not sufficient to define useful events themselves. Without an effective
APA, Harvard, Vancouver, ISO, and other styles

Book chapters on the topic "Audio-visual scene analysis"

1

Saraceno, Caterina, and Riccardo Leonardi. "Audio-visual processing for scene change detection." In Image Analysis and Processing. Springer Berlin Heidelberg, 1997. http://dx.doi.org/10.1007/3-540-63508-4_114.

Full text
APA, Harvard, Vancouver, ISO, and other styles
2

Tsekeridou, Sofia, Stelios Krinidis, and Ioannis Pitas. "Scene Change Detection Based on Audio-Visual Analysis and Interaction." In Multi-Image Analysis. Springer Berlin Heidelberg, 2001. http://dx.doi.org/10.1007/3-540-45134-x_16.

Full text
APA, Harvard, Vancouver, ISO, and other styles
3

Owens, Andrew, and Alexei A. Efros. "Audio-Visual Scene Analysis with Self-Supervised Multisensory Features." In Computer Vision – ECCV 2018. Springer International Publishing, 2018. http://dx.doi.org/10.1007/978-3-030-01231-1_39.

Full text
APA, Harvard, Vancouver, ISO, and other styles
4

Ganesh, Attigodu Chandrashekara, Frédéric Berthommier, and Jean-Luc Schwartz. "Audio Visual Integration with Competing Sources in the Framework of Audio Visual Speech Scene Analysis." In Advances in Experimental Medicine and Biology. Springer International Publishing, 2016. http://dx.doi.org/10.1007/978-3-319-25474-6_42.

Full text
APA, Harvard, Vancouver, ISO, and other styles
5

Gupta, Vaibhavi, Vinay Detani, Vivek Khokar, and Chiranjoy Chattopadhyay. "C2VNet: A Deep Learning Framework Towards Comic Strip to Audio-Visual Scene Synthesis." In Document Analysis and Recognition – ICDAR 2021. Springer International Publishing, 2021. http://dx.doi.org/10.1007/978-3-030-86331-9_11.

Full text
APA, Harvard, Vancouver, ISO, and other styles
6

Pham, Lam, Alexander Schindler, Mina Schutz, Jasmin Lampert, Sven Schlarb, and Ross King. "Deep Learning Frameworks Applied For Audio-Visual Scene Classification." In Data Science – Analytics and Applications. Springer Fachmedien Wiesbaden, 2022. http://dx.doi.org/10.1007/978-3-658-36295-9_6.

Full text
APA, Harvard, Vancouver, ISO, and other styles

Conference papers on the topic "Audio-visual scene analysis"

1

Wang, Shanshan, Annamaria Mesaros, Toni Heittola, and Tuomas Virtanen. "A Curated Dataset of Urban Scenes for Audio-Visual Scene Analysis." In ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2021. http://dx.doi.org/10.1109/icassp39728.2021.9415085.

Full text
APA, Harvard, Vancouver, ISO, and other styles
2

Schwartz, Jean-Luc, Frédéric Berthommier, and Christophe Savariaux. "Audio-visual scene analysis: evidence for a "very-early" integration process in audio-visual speech perception." In 7th International Conference on Spoken Language Processing (ICSLP 2002). ISCA, 2002. http://dx.doi.org/10.21437/icslp.2002-437.

Full text
APA, Harvard, Vancouver, ISO, and other styles
3

"ColEnViSon: Color Enhanced Visual Sonifier - A Polyphonic Audio Texture and Salient Scene Analysis." In International Conference on Computer Vision Theory and Applications. SciTePress - Science and and Technology Publications, 2009. http://dx.doi.org/10.5220/0001805105660572.

Full text
APA, Harvard, Vancouver, ISO, and other styles
4

Schott, Gareth, and Raphael Marczak. "Understanding game actions: The development of a post-processing method for audio-visual scene analysis." In 2016 Future Technologies Conference (FTC). IEEE, 2016. http://dx.doi.org/10.1109/ftc.2016.7821657.

Full text
APA, Harvard, Vancouver, ISO, and other styles
5

Fayek, Haytham M., and Anurag Kumar. "Large Scale Audiovisual Learning of Sounds with Weakly Labeled Data." In Twenty-Ninth International Joint Conference on Artificial Intelligence and Seventeenth Pacific Rim International Conference on Artificial Intelligence {IJCAI-PRICAI-20}. International Joint Conferences on Artificial Intelligence Organization, 2020. http://dx.doi.org/10.24963/ijcai.2020/78.

Full text
Abstract:
Recognizing sounds is a key aspect of computational audio scene analysis and machine perception. In this paper, we advocate that sound recognition is inherently a multi-modal audiovisual task in that it is easier to differentiate sounds using both the audio and visual modalities as opposed to one or the other. We present an audiovisual fusion model that learns to recognize sounds from weakly labeled video recordings. The proposed fusion model utilizes an attention mechanism to dynamically combine the outputs of the individual audio and visual models. Experiments on the large scale sound events
APA, Harvard, Vancouver, ISO, and other styles
6

YANG, LING, and SHENG-DONG YUE. "AN ANALYSIS OF THE CHARACTERISTICS OF MUSIC CREATION IN MEFISTOFELE." In 2021 International Conference on Education, Humanity and Language, Art. Destech Publications, Inc., 2021. http://dx.doi.org/10.12783/dtssehs/ehla2021/35726.

Full text
Abstract:
Successful opera art cannot be separated from literary elements, but also from the support of music. Opera scripts make up plots with words. Compared with emotional resonance directly from the senses, music can plasticize the abstract literary image from the perspective of sensibility. An excellent opera work can effectively promote the development of the drama plot through music design, and deepen the conflict of drama with the "ingenious leverage" of music. This article intends to analyze the music design of the famous opera, Mefistofele, and try to explore the fusion effect of music and dra
APA, Harvard, Vancouver, ISO, and other styles
We offer discounts on all premium plans for authors whose works are included in thematic literature selections. Contact us to get a unique promo code!