Academic literature on the topic 'OpenAI Whisper'

Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles

Select a source type:

Consult the lists of relevant articles, books, theses, conference reports, and other scholarly sources on the topic 'OpenAI Whisper.'

Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.

You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.

Journal articles on the topic "OpenAI Whisper"

1

Ghale, Akarsh, Janaki K, and Devaraj Verma C. "Instant Transcription and Translation Tool using OpenAI?s Whisper ASR Model." International Journal of Science and Research (IJSR) 11, no. 12 (2022): 185–88. http://dx.doi.org/10.21275/sr221203164929.

Full text
APA, Harvard, Vancouver, ISO, and other styles
2

Bhargavi, A. D. "Video Transcripts Summarization using OpenAI Whisper and GPT Model." International Journal for Research in Applied Science and Engineering Technology 12, no. 3 (2024): 2319–27. http://dx.doi.org/10.22214/ijraset.2024.59365.

Full text
Abstract:
Abstract: In today’s digital age, a vast amount of video content is generated and shared on the internet every minute. However, extracting relevant information from these videos can be time-consuming and challenging. This is where video transcript summarization comes in, providing a concise summary of video content without the need to watch the entire video. The video transcript summarization system aims to streamline the process of extracting key insights and information from video content by generating concise and informative summaries from their transcripts. In the dynamic landscape of vide
APA, Harvard, Vancouver, ISO, and other styles
3

Saraf, Aryan. "Multilingual Translation for Speech and Text using Whisper AI: A Deep Learning Approach." International Journal for Research in Applied Science and Engineering Technology 13, no. 7 (2025): 1895–901. https://doi.org/10.22214/ijraset.2025.73288.

Full text
Abstract:
In an increasingly interconnected world, the ability to accurately translate between multiple languages, both written and spoken, is essential for global communication. Traditional machine translation and speech recognition systems often operate as separate pipelines, leading to increased complexity and reduced efficiency, especially when dealing with low-resource languages or noisy audio environments. This research presents a comprehensive study of Whisper AI, a multilingual, multitask model developed by OpenAI for speech recognition and translation. Leveraging a transformer-based encoder-dec
APA, Harvard, Vancouver, ISO, and other styles
4

William, Ezra, and Amalia Zahra. "Speech Recognition Dengan Whisper Dalam Bahasa Indonesia." Action Research Literate 9, no. 2 (2025): 386–97. https://doi.org/10.46799/arl.v9i2.2573.

Full text
Abstract:
Perkembangan teknologi kecerdasan buatan telah mendorong kemajuan dalam pengenalan suara (speech recognition), terutama dalam mendukung komunikasi digital yang lebih efisien. Salah satu model terbaru yang banyak digunakan adalah Whisper, yang dikembangkan oleh OpenAI dengan kemampuan pengenalan suara multibahasa yang diklaim memiliki akurasi tinggi. Namun, tantangan utama dalam implementasi teknologi ini di Indonesia adalah keterbatasan sumber daya data dalam bahasa lokal serta variasi aksen yang signifikan. Oleh karena itu, penelitian ini dilakukan untuk mengevaluasi kinerja model Whisper dal
APA, Harvard, Vancouver, ISO, and other styles
5

Amudhiniyan, Amudhiniyan. "Enhancing Communication between Speech and Hearing Impaired People." INTERANTIONAL JOURNAL OF SCIENTIFIC RESEARCH IN ENGINEERING AND MANAGEMENT 09, no. 02 (2025): 1–9. https://doi.org/10.55041/ijsrem41922.

Full text
Abstract:
Mute Mate is a novel video conferencing system that uses Artificial Intelligence and Real-Time communication technologies to bridge the communication gap between sign language users and verbal communicators. The system uses YOLOv11 for sign language detection, OpenAI's Whisper model for speech-to-text translation, and WebRTC for real-time lag-free video communication. It ensures seamless communication between users of different modes of communication. Large-scale testing demonstrates the system's remarkable accuracy, low latency, and effectiveness, demonstrating its potential to revolutionize
APA, Harvard, Vancouver, ISO, and other styles
6

Małecki, Paweł, та Magdalena Piotrowska. "Нови тенденциї у розвою сучасней линґвистики у Сербї". Rocznik Ruskiej Bursy 20 (10 грудня 2024): 189–204. https://doi.org/10.12797/rrb.20.2024.20.10.

Full text
Abstract:
ANALIZA I KLASYFIKACJA JĘZYKA RUSIŃSKIEGO PRZY UŻYCIU MODELU SZTUCZNEJ SIECI NEURONOWEJ ASR OPENAI WHISPERArtykuł przedstawia analizę lingwistyczną języka rusińskiego, koncentrując się na jego złożonych i zmieniających się aspektach, takich jak wymowa oraz różnice indywidualne, regionalne i historyczne. Do przeprowadzenia badania wykorzystano sztuczną sieć neuronową opartą na modelu OpenAI Whisper. Model ten, choć szkolony na danych z większości państwowych języków urzędowych, nie był bezpośrednio trenowany na bazach próbek języka rusińskiego ze względu na jego lokalny i mniejszościowy/etniczn
APA, Harvard, Vancouver, ISO, and other styles
7

Bhute, Dr Harsha A. "MockMate: AI-Powered Online Mock Interview Assessment and Evaluation System." INTERNATIONAL JOURNAL OF SCIENTIFIC RESEARCH IN ENGINEERING AND MANAGEMENT 09, no. 04 (2025): 1–9. https://doi.org/10.55041/ijsrem45858.

Full text
Abstract:
Abstract: In the current competitive job market, being well-prepared for interviews is essential to landing a job. Traditional mock interviews, however, are not scalable and can call for a large human resource commitment. By providing a tailored, automated, and interactive online platform driven by artificial intelligence, MockMate tackles this problem. The system mimics actual interview situations, assesses candidate responses in real time, and provides thorough, data-driven feedback by utilizing cutting-edge Natural Language Processing (NLP) and speech-to-text technologies. In addition to cu
APA, Harvard, Vancouver, ISO, and other styles
8

Papala, Gowtham, Aniket Ransing, and Pooja Jain. "Sentiment Analysis and Speaker Diarization in Hindi and Marathi Using using Finetuned Whisper." Scalable Computing: Practice and Experience 24, no. 4 (2023): 835–46. http://dx.doi.org/10.12694/scpe.v24i4.2248.

Full text
Abstract:
Automatic Speech Recognition (ASR) is a crucial technology that enables machines to automatically recognize human voices based on audio signals. In recent years, there has been a rigorous growth in the development of ASR models with the emergence of new techniques and algorithms. One such model is the Whisper ASR model developed by OpenAI, which is based on a Transformer encoder-decoder architecture and can handle multiple tasks such as language identification, transcription, and translation. However, there are still limitations to the Whisper ASR model, such as speaker diarization, summarizat
APA, Harvard, Vancouver, ISO, and other styles
9

Ferdiansyah, Danny, and Christian Sri Kusuma Aditya. "Implementasi Automatic Speech Recognition Bacaan Al-Qur’an Menggunakan Metode Wav2Vec 2.0 dan OpenAI-Whisper." Jurnal Teknik Elektro dan Komputer TRIAC 11, no. 1 (2024): 11–16. http://dx.doi.org/10.21107/triac.v11i1.24332.

Full text
APA, Harvard, Vancouver, ISO, and other styles
10

Polepaka, Sanjeeva, Varikuppala Prashanth Kumar, S. Umesh Chandra, Hema Nagendra Sri Krishna, and Gaurav Thakur. "Automated Caption Generation for Video Call with Language Translation." E3S Web of Conferences 430 (2023): 01025. http://dx.doi.org/10.1051/e3sconf/202343001025.

Full text
Abstract:
In the modern era, virtual communication between individuals is common. Many people’s lives have been made simpler in a number of circumstances by providing subtitles, generating automated captions for social media videos, and language translation from a source language to a targeted language. Both are included, which offers face-to-face translated captions during video conversations. React is used for application development. To send the data, socket programming is utilized. Context is understood and translated using Google translate API and speech recognition modules. With OpenAI and Whisper
APA, Harvard, Vancouver, ISO, and other styles
More sources

Books on the topic "OpenAI Whisper"

1

Peterson, Tracie. Hidden in a whisper. Five Star, 2002.

Find full text
APA, Harvard, Vancouver, ISO, and other styles
2

Peterson, Tracie. Hidden in a whisper. Bethany House, 1999.

Find full text
APA, Harvard, Vancouver, ISO, and other styles
3

McNaught, Judith. Night whispers. Pocket Books, 1999.

Find full text
APA, Harvard, Vancouver, ISO, and other styles
4

McNaught, Judith. Night whispers. Pocket Books, 1998.

Find full text
APA, Harvard, Vancouver, ISO, and other styles
5

McNaught, Judith. Night whispers. Wheeler Pub., 1998.

Find full text
APA, Harvard, Vancouver, ISO, and other styles
6

McNaught, Judith. Night whispers. Pocket Books, 1998.

Find full text
APA, Harvard, Vancouver, ISO, and other styles
7

Copyright Paperback Collection (Library of Congress), ed. Sweet nothings: When Molly Wells showed up at Jake Coulter's ranch, she had nothing but some extra clothes, a stolen horse, and a fear of her ex-husband that threatened to rule her life. She'd heard Jake was a real-life horse whisperer-and perhaps the only man alive who could help the beautiful stallion that her ex-husband had so bitterly abused. But she had no idea that Jake's talents would work their magic on her, with the same power to gain her trust, give her strength, and whisper away her deepest fears. New American Library, 2002.

Find full text
APA, Harvard, Vancouver, ISO, and other styles
8

Batista, Josué R. Learn OpenAI Whisper: Transform Your Understanding of Gen AI Through Robust and Accurate Speech Processing Solutions. Packt Publishing, Limited, 2024.

Find full text
APA, Harvard, Vancouver, ISO, and other styles
9

(Narrator), Enid Graham, ed. Night Whispers. Simon & Schuster Audio, 2001.

Find full text
APA, Harvard, Vancouver, ISO, and other styles

Book chapters on the topic "OpenAI Whisper"

1

Ifrah, Shimon. "GPT-4o, DALL-E, and Whisper." In Getting Started with Azure OpenAI. Apress, 2024. http://dx.doi.org/10.1007/979-8-8688-0599-8_6.

Full text
APA, Harvard, Vancouver, ISO, and other styles
2

Yadav, Aman, Anita Shrotriya, and Amit Kumar Bairwa. "Fine-Tuning OpenAI Whisper and DistilWhisper: An In-Depth Analysis." In Smart Innovation, Systems and Technologies. Springer Nature Singapore, 2025. https://doi.org/10.1007/978-981-96-2182-8_44.

Full text
APA, Harvard, Vancouver, ISO, and other styles
3

Löw, Patrick, Marie Westerdick, Klara Groß-Elixmann, and Dirk Burdinski. "Diversitätsorientiert lehren mit einer Open Source- Lösung für Medientranskripte (astAV)." In Hochschulbildung: Lehre und Forschung. transcript Verlag, 2024. http://dx.doi.org/10.14361/9783839469385-014.

Full text
Abstract:
Das Programm astAV (automatic speech recognition toolkit for Audio and Video) ist eine Open Source-Lösung zur Transkript- und Untertitelerstellung für Audio- und Videodateien. astAV wurde im Jahr 2020 als studentisches Projekt erarbeitet und wird seit 2021 kooperativ weiterentwickelt. Im Beitrag wird diskutiert, wie eine umfassende Digitialisierungsinitiative für Untertitelung diversitätsgerecht sowie ressourceneffizient umgesetzt werden kann. Die Genese des Programms verdeutlicht, wie die Gestaltung digitaler Kulturen an Hochschulen maßgeblich auch durch globale Schlüsselereignisse beeinfluss
APA, Harvard, Vancouver, ISO, and other styles
4

An, Jing, Yanbing Bai, Jiyi Li, Lifei Wang, Yuyi Jiang, and Yikui Zhang. "Cantonese Dialect Transcription in Diverse Sophisticated Scenarios via the OpenAI Whisper Speech Recognition Model." In Communications in Computer and Information Science. Springer Nature Singapore, 2025. https://doi.org/10.1007/978-981-96-7008-6_23.

Full text
APA, Harvard, Vancouver, ISO, and other styles
5

Petrič, Teodor. "Jezikovni modeli za pripravo govornega korpusa: programi za prepoznavanje govora." In Stanje in perspektive uporabe govornih virov v raziskavah govora. Univerza v Mariboru, Univerzitetna založba, 2024. http://dx.doi.org/10.18690/um.ff.4.2024.9.

Full text
Abstract:
V preteklem desetletju, še posebej v zadnjih petih letih po uveljavljanju velikih jezikovnih modelov, ki temeljijo na arhitekturi transformerjev (pretvorbenih modelov), smo dobili vrsto programskih orodij, ki pospešujejo ustvarjanje večplastnih jezikovnih gradiv. Preizkušali smo programska orodja za prepoznavanje in pretvorbo govora v pisno obliko (tj. orodja Razpoznavalnik, Microsoft Word Prepiši, Vosk/Kaldi in OpenAI Whisper), ki so ključni za pospešeno ustvarjanje govornih korpusov. Uporabljali smo vrsto meril, ki zadevajo preprostost uporabe, časovni prihranek, morebitne stroške, zagotavlj
APA, Harvard, Vancouver, ISO, and other styles
6

a, Deepshikh, Ameena Naaz, and Gazy Abbas. "An Overview of Chatgpt: Current Trends and Future Possibilities." In Data Science and Intelligent Computing Techniques. Soft Computing Research Society, 2023. http://dx.doi.org/10.56155/978-81-955020-2-8-33.

Full text
Abstract:
This study focuses on the perspective of artificial intelligence with the automation of data processing, the production of fresh insights, and virtual assistance to obtain new information, as we are aware of the potential of artificial intelligence (AI), which has changed the overall way of conducting research. The paper is an overview of various OpenAI-related research articles along with other products and applications in which open AI is currently working, like Gym, Robosumo, Debate Game, Generative Models (GPT, GPT2, GPT3), ChatGPT, Music, Whisper, Codex, and more. This paper aims to summa
APA, Harvard, Vancouver, ISO, and other styles
7

Trollope, Anthony. "Chapter LIII Lady Ushant at Bragton." In The American Senator. Oxford University Press, 2008. http://dx.doi.org/10.1093/owc/9780199537631.003.0054.

Full text
Abstract:
ON the Sunday Larry came into Dillsborough and had ‘his gossip with the girls’ according to order;—but it was not very successful. Mrs. Masters, who opened the door for him, instructed him in a special whisper ‘to talk away just as though he did...
APA, Harvard, Vancouver, ISO, and other styles
8

Subburaj, Brindha, Uma Maheswari Jayachandran, Siya Bansal, and Vedansh Kumar. "Foul Language Censored Social Media Video Generation Using Audio Censoring Model." In Advances in Social Networking and Online Communities. IGI Global, 2025. https://doi.org/10.4018/979-8-3693-9904-0.ch016.

Full text
Abstract:
Advancements in communication medium and the proliferation of social media led to increasing amount of content shared often without moderation. The offensive are raising significantly, posing threat to mental health, and degrades the user experience. Current study focuses on identifying and censoring audio clips with offensive and foul words. We propose a novel audio censoring method using OpenAI's whisper model. The audio files are extracted from video and forwarded to whisper model. The text transcripts of the audio file is obtained through the whisper model. Keywords from the text are compa
APA, Harvard, Vancouver, ISO, and other styles
9

Hardy, Thomas. "III.–iv." In Jude the Obscure. Oxford University Press, 2008. http://dx.doi.org/10.1093/owc/9780199537020.003.0026.

Full text
Abstract:
Jude’s reverie was interrupted by the creak of footsteps ascending the stairs. He whisked Sue’s clothing from the chair where it was drying, thrust it under the bed, and sat down to his book. Somebody knocked and opened the door immediately. It was the landlady....
APA, Harvard, Vancouver, ISO, and other styles
10

Blackmore, R. D. "Chapter VIII: A Boy and a Girl." In Lorna Doone. Oxford University Press, 2008. http://dx.doi.org/10.1093/owc/9780199537594.003.0010.

Full text
Abstract:
WHEN I came to myself again, my hands were full of young grass and mould; and a little girl kneeling at my side was rubbing my forehead tenderly, with a dock-leaf and a handkerchief. ‘Oh, I am so glad,’ she whispered softly, as I opened...
APA, Harvard, Vancouver, ISO, and other styles

Conference papers on the topic "OpenAI Whisper"

1

Hlongwane, Khumbulani, Sthembiso Mthethwa, and Lindelweyizizwe Manqele. "Enhancing Diversity in Inclusive Learning Classroom Using OpenAI Whisper Model." In 2025 IST-Africa Conference (IST-Africa). IEEE, 2025. https://doi.org/10.23919/ist-africa67297.2025.11060520.

Full text
APA, Harvard, Vancouver, ISO, and other styles
2

R, Vinotha, Hepsiba D, and L. D. Vijay Anand. "Leveraging OpenAI Whisper Model to Improve Speech Recognition for Dysarthric Individuals." In 2024 Asia Pacific Conference on Innovation in Technology (APCIT). IEEE, 2024. http://dx.doi.org/10.1109/apcit62007.2024.10673628.

Full text
APA, Harvard, Vancouver, ISO, and other styles
3

Shah, Henil, Maahi Patel, and Rajeev Kumar Gupta. "Enhancing Multimedia Accessibility: Automated Video Captioning and Translation System Using OpenAI Whisper." In 2024 Asian Conference on Intelligent Technologies (ACOIT). IEEE, 2024. https://doi.org/10.1109/acoit62457.2024.10939144.

Full text
APA, Harvard, Vancouver, ISO, and other styles
4

Yang, Yuhang, Yizhou Peng, Hao Huang, Eng Siong Chng, and Xionghu Zhong. "Adapting OpenAI’s Whisper for Speech Recognition on Code-Switch Mandarin-English SEAME and ASRU2019 Datasets." In 2024 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC). IEEE, 2024. https://doi.org/10.1109/apsipaasc63619.2025.10849308.

Full text
APA, Harvard, Vancouver, ISO, and other styles
5

Siriket, Lattapon, Kulsawasd Jitkajornwanich, Saichon Jaiyen, and Sarun Intakosum. "Improving OpenAI’s Whisper Model for Transcribing Homophones in Legal News." In 2024 10th International Conference on Engineering, Applied Sciences, and Technology (ICEAST). IEEE, 2024. http://dx.doi.org/10.1109/iceast61342.2024.10554018.

Full text
APA, Harvard, Vancouver, ISO, and other styles
6

Dere, A., and A. Ajibade. "WhisperMed: Fine-Tuned ASR for Enhanced Medication Communication in Clinical Settings." In International Conference on Artificial Intelligence and Robotics. Machine Intelligence Research Group (MIRG), 2024. https://doi.org/10.52968/15065271.

Full text
Abstract:
Medication management is a critical aspect of patient safety that often faces significant communication challenges, particularly in resource-constrained environments. Errors in transcribing medication information can lead to adverse drug events, which are among the most preventable causes of patient harm. Automatic Speech Recognition (ASR) systems have shown promise in mitigating these communication issues, yet they frequently struggle with domain-specific vocabularies, especially complex medical and pharmaceutical terminology. To address these challenges, we present WhisperMed, a fine-tuned v
APA, Harvard, Vancouver, ISO, and other styles
7

Trong Nguyen, Khac, and Jan-torsten Milde. "An AI Powered Glasses Attachment for the Visually Impaired." In 16th International Conference on Applied Human Factors and Ergonomics (AHFE 2025). AHFE International, 2025. https://doi.org/10.54941/ahfe1006155.

Full text
Abstract:
This research presents the development and evaluation of a prototype AI-powered glasses attachment designed to enhance the daily mobility and independence of individuals with visual impairments. The work addresses the significant challenges faced by millions who experience limitations in navigating, recognising objects, and accessing information. It aims to contribute to the field of assistive technologies by creating a cost-effective and versatile solution. The project emphasises the importance of designing inclusive technology that is not only functional but also user-friendly and accessible
APA, Harvard, Vancouver, ISO, and other styles

Reports on the topic "OpenAI Whisper"

1

MacFarlane, Andrew. 2021 medical student essay prize winner - A case of grief. Society for Academic Primary Care, 2021. http://dx.doi.org/10.37361/medstudessay.2021.1.1.

Full text
Abstract:
As a student undertaking a Longitudinal Integrated Clerkship (LIC)1 based in a GP practice in a rural community in the North of Scotland, I have been lucky to be given responsibility and my own clinic lists. Every day I conduct consultations that change my practice: the challenge of clinically applying the theory I have studied, controlling a consultation and efficiently exploring a patient's problems, empathising with and empowering them to play a part in their own care2 – and most difficult I feel – dealing with the vast amount of uncertainty that medicine, and particularly primary care, pre
APA, Harvard, Vancouver, ISO, and other styles
We offer discounts on all premium plans for authors whose works are included in thematic literature selections. Contact us to get a unique promo code!