Journal articles on the topic 'Speaker diarization'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the top 50 journal articles for your research on the topic 'Speaker diarization.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Browse journal articles on a wide variety of disciplines and organise your bibliography correctly.
Karamyan, Davit S., and Grigor A. Kirakosyan. "Building a Speaker Diarization System: Lessons from VoxSRC 2023." Mathematical Problems of Computer Science 60 (November 30, 2023): 52–62. http://dx.doi.org/10.51408/1963-0109.
Full textIyer, Apoorva, Deepika Kini, and Shanthi Therese. "Speaker Diarization." International Journal of Computer Trends and Technology 67, no. 9 (2019): 50–54. http://dx.doi.org/10.14445/22312803/ijctt-v67i9p110.
Full textV., Subba Ramaiah, Srinivasa Rao S., and Devaraju V.S.N.Kumar. "Speaker Diarization based on Black-Hole Entropy Fuzzy Clustering using Cepstral Features." International Journal of Engineering and Advanced Technology (IJEAT) 9, no. 4 (2020): 1055–61. https://doi.org/10.35940/ijeat.D7832.049420.
Full textMr. Chaitanya Pampana, Dr. M. Vijay Reddy, and Dr. K. Jhansi Rani. "A Review on Speaker Diarization for Whispered Speech Audio." International Research Journal on Advanced Engineering and Management (IRJAEM) 3, no. 05 (2025): 1765–73. https://doi.org/10.47392/irjaem.2025.0279.
Full textPrabhala, Jagat Chaitanya, Venkatnareshbabu K, and Ragoju Ravi. "OPTIMIZING SIMILARITY THRESHOLD FOR ABSTRACT SIMILARITY METRIC IN SPEECH DIARIZATION SYSTEMS: A MATHEMATICAL FORMULATION." Applied Mathematics and Sciences An International Journal (MathSJ) 10, no. 1/2 (2023): 1–10. http://dx.doi.org/10.5121/mathsj.2023.10201.
Full textKshirod, Kshirod Sarmah. "Speaker Diarization with Deep Learning Techniques." Turkish Journal of Computer and Mathematics Education (TURCOMAT) 11, no. 3 (2020): 2570–82. http://dx.doi.org/10.61841/turcomat.v11i3.14309.
Full textPARK, KYUNG-MI, JEONG-SIK PARK, JAE-HYUN BAE, and YUNG-HWAN OH. "ONLINE SPEAKER DIARIZATION FOR MULTIMEDIA DATA RETRIEVAL ON MOBILE DEVICES." International Journal of Pattern Recognition and Artificial Intelligence 26, no. 08 (2012): 1260011. http://dx.doi.org/10.1142/s0218001412600117.
Full textV, Sethuram, Ande Prasad, and R. Rajeswara Rao. "Metaheuristic adapted convolutional neural network for Telugu speaker diarization." Intelligent Decision Technologies 15, no. 4 (2022): 561–77. http://dx.doi.org/10.3233/idt-211005.
Full textZaiets, I., V. Brydinskyi, D. Sabodashko, Yu Khoma, Kh Ruda, and M. Shved. "UTILIZATION OF VOICE EMBEDDINGS IN INTEGRATED SYSTEMS FOR SPEAKER DIARIZATION AND MALICIOUS ACTOR DETECTION." Computer systems and network 6, no. 1 (2024): 54–66. http://dx.doi.org/10.23939/csn2024.01.054.
Full textNoulas, A., G. Englebienne, and B. J. A. Krose. "Multimodal Speaker Diarization." IEEE Transactions on Pattern Analysis and Machine Intelligence 34, no. 1 (2012): 79–93. http://dx.doi.org/10.1109/tpami.2011.47.
Full textLyu, Ke-Ming, Ren-yuan Lyu, and Hsien-Tsung Chang. "Real-time multilingual speech recognition and speaker diarization system based on Whisper segmentation." PeerJ Computer Science 10 (March 29, 2024): e1973. http://dx.doi.org/10.7717/peerj-cs.1973.
Full textHsu, Yicheng, Ssuhan Chen, Yuhsin Lai, Chingyen Wang, and Mingsian R. Bai. "Spatial-temporal activity-informed diarization and separation." Journal of the Acoustical Society of America 157, no. 2 (2025): 1162–75. https://doi.org/10.1121/10.0035830.
Full textAstapov, Sergei, Aleksei Gusev, Marina Volkova, et al. "Application of Fusion of Various Spontaneous Speech Analytics Methods for Improving Far-Field Neural-Based Diarization." Mathematics 9, no. 23 (2021): 2998. http://dx.doi.org/10.3390/math9232998.
Full textTaha, Thaer Mufeed, Zaineb Ben Messaoud, and Mondher Frikha. "Convolutional Neural Network Architectures for Gender, Emotional Detection from Speech and Speaker Diarization." International Journal of Interactive Mobile Technologies (iJIM) 18, no. 03 (2024): 88–103. http://dx.doi.org/10.3991/ijim.v18i03.43013.
Full textKhoma, Volodymyr, Yuriy Khoma, Vitalii Brydinskyi, and Alexander Konovalov. "Development of Supervised Speaker Diarization System Based on the PyAnnote Audio Processing Library." Sensors 23, no. 4 (2023): 2082. http://dx.doi.org/10.3390/s23042082.
Full textViñals, Ignacio, Alfonso Ortega, Antonio Miguel, and Eduardo Lleida. "The Domain Mismatch Problem in the Broadcast Speaker Attribution Task." Applied Sciences 11, no. 18 (2021): 8521. http://dx.doi.org/10.3390/app11188521.
Full textIndu D. "A Methodology for Speaker Diazaration System Based on LSTM and MFCC Coefficients." Journal of Electrical Systems 20, no. 6s (2024): 2938–45. http://dx.doi.org/10.52783/jes.3299.
Full textMurali, Abhejay, Satwik Dutta, Meena Chandra Shekar, Dwight Irvin, Jay Buzhardt, and John H. Hansen. "Towards developing speaker diarization for parent-child interactions." Journal of the Acoustical Society of America 152, no. 4 (2022): A61. http://dx.doi.org/10.1121/10.0015551.
Full textAhmad, Zubair, Alquhayz, and Ditta. "Multimodal Speaker Diarization Using a Pre-Trained Audio-Visual Synchronization Model." Sensors 19, no. 23 (2019): 5163. http://dx.doi.org/10.3390/s19235163.
Full textJiao, Xiaolin, Yaqi Chen, Dan Qu, and Xukui Yang. "Blueprint Separable Subsampling and Aggregate Feature Conformer-Based End-to-End Neural Diarization." Electronics 12, no. 19 (2023): 4118. http://dx.doi.org/10.3390/electronics12194118.
Full textAronowitz, Hagai. "COMPENSATION OF INTRA-SPEAKER VARIABILITY IN SPEAKER DIARIZATION." Journal of the Acoustical Society of America 134, no. 5 (2013): 3967. http://dx.doi.org/10.1121/1.4828924.
Full textAlvarez-Trejos, Juan Ignacio, Alicia Lozano-Diez, and Daniel Ramos. "Feature Integration Strategies for Neural Speaker Diarization in Conversational Telephone Speech." Applied Sciences 15, no. 9 (2025): 4842. https://doi.org/10.3390/app15094842.
Full textDARGAHI, Fatemeh, Costin-Alexandru DEONISE, Constantin ANGHEL, Cătălin Negru, and Florin Pop. "Microphone Speaker Analysis: Audio Segmentation and Frequency Insights." Annals of the Academy of Romanian Scientists Series on Science and Technology of Information 17, no. 1 (2024): 5–14. https://doi.org/10.56082/annalsarsciinfo.2024.1.5.
Full textVryzas, Nikolaos, Nikolaos Tsipas, and Charalampos Dimoulas. "Web Radio Automation for Audio Stream Management in the Era of Big Data." Information 11, no. 4 (2020): 205. http://dx.doi.org/10.3390/info11040205.
Full textK. Pande, Vinod, Vijay K. Kale, and Sangramsing N. Kayte. "FEATURE EXTRACTION USING I-VECTOR AND X-VECTOR METHODS FOR SPEAKER DIARIZATION." ICTACT Journal on Soft Computing 15, no. 4 (2025): 3717–21. https://doi.org/10.21917/ijsc.2025.0515.
Full textWang, Jiani, Shiran Dudy, Xinlu Hu, Zhiyong Wang, Rosy Southwell, and Jacob Whitehill. "Optimizing Speaker Diarization for the Classroom: Applications in Timing Student Speech and Distinguishing Teachers from Children." Journal of Educational Data Mining 17, no. 1 (2025): 98–125. https://doi.org/10.5281/zenodo.14871875.
Full textBarras, C., Xuan Zhu, S. Meignier, and J. L. Gauvain. "Multistage speaker diarization of broadcast news." IEEE Transactions on Audio, Speech and Language Processing 14, no. 5 (2006): 1505–12. http://dx.doi.org/10.1109/tasl.2006.878261.
Full textJothilakshmi, S., V. Ramalingam, and S. Palanivel. "Speaker diarization using autoassociative neural networks." Engineering Applications of Artificial Intelligence 22, no. 4-5 (2009): 667–75. http://dx.doi.org/10.1016/j.engappai.2009.01.012.
Full textXylogiannis, Paris, Nikolaos Vryzas, Lazaros Vrysis, and Charalampos Dimoulas. "Multisensory Fusion for Unsupervised Spatiotemporal Speaker Diarization." Sensors 24, no. 13 (2024): 4229. http://dx.doi.org/10.3390/s24134229.
Full textMertens, Robert, Po-Sen Huang, Luke Gottlieb, Gerald Friedland, Ajay Divakaran, and Mark Hasegawa-Johnson. "On the Applicability of Speaker Diarization to Audio Indexing of Non-Speech and Mixed Non-Speech/Speech Video Soundtracks." International Journal of Multimedia Data Engineering and Management 3, no. 3 (2012): 1–19. http://dx.doi.org/10.4018/jmdem.2012070101.
Full textPan, Weijun, Yidi Wang, Yumei Zhang, and Boyuan Han. "ATC-SD Net: Radiotelephone Communications Speaker Diarization Network." Aerospace 11, no. 7 (2024): 599. http://dx.doi.org/10.3390/aerospace11070599.
Full textKone, Tenon Charly, Sebastian Ghinet, Sayed Ahmed Dana, and Anant Grewal. "Speech detection models for effective communicable disease risk assessment in air travel environments." Journal of the Acoustical Society of America 155, no. 3_Supplement (2024): A277. http://dx.doi.org/10.1121/10.0027492.
Full textZhou, Yu. "Harmonic Structure Features for Robust Speaker Diarization." ETRI Journal 34, no. 4 (2012): 583–90. http://dx.doi.org/10.4218/etrij.12.0111.0455.
Full textAhmad, Rehan, Syed Zubair, and Hani Alquhayz. "Speech Enhancement for Multimodal Speaker Diarization System." IEEE Access 8 (2020): 126671–80. http://dx.doi.org/10.1109/access.2020.3007312.
Full textFerras, Marc, Srikanth Madikeri, and Herve Bourlard. "Speaker Diarization and Linking of Meeting Data." IEEE/ACM Transactions on Audio, Speech, and Language Processing 24, no. 11 (2016): 1935–45. http://dx.doi.org/10.1109/taslp.2016.2590139.
Full textXu, Yan, Ian McLoughlin, Yan Song, and Kui Wu. "Improved i-Vector Representation for Speaker Diarization." Circuits, Systems, and Signal Processing 35, no. 9 (2015): 3393–404. http://dx.doi.org/10.1007/s00034-015-0206-2.
Full textAHMAD, Rehan, and Syed ZUBAIR. "Unsupervised deep feature embeddings for speaker diarization." TURKISH JOURNAL OF ELECTRICAL ENGINEERING & COMPUTER SCIENCES 27, no. 4 (2019): 3138–49. http://dx.doi.org/10.3906/elk-1901-125.
Full textTranter, S. E., and D. A. Reynolds. "An overview of automatic speaker diarization systems." IEEE Transactions on Audio, Speech and Language Processing 14, no. 5 (2006): 1557–65. http://dx.doi.org/10.1109/tasl.2006.878256.
Full textAnguera, Xavier, Chuck Wooters, and Javier Hernando. "Acoustic Beamforming for Speaker Diarization of Meetings." IEEE Transactions on Audio, Speech and Language Processing 15, no. 7 (2007): 2011–22. http://dx.doi.org/10.1109/tasl.2007.902460.
Full textImseng, David, and Gerald Friedland. "Tuning-Robust Initialization Methods for Speaker Diarization." IEEE Transactions on Audio, Speech, and Language Processing 18, no. 8 (2010): 2028–37. http://dx.doi.org/10.1109/tasl.2010.2040796.
Full textBarra-Chicote, R., J. M. Pardo, J. Ferreiros, and J. M. Montero. "Speaker Diarization Based on Intensity Channel Contribution." IEEE Transactions on Audio, Speech, and Language Processing 19, no. 4 (2011): 754–61. http://dx.doi.org/10.1109/tasl.2010.2062507.
Full textAnguera Miro, Xavier, S. Bozonnet, N. Evans, C. Fredouille, G. Friedland, and O. Vinyals. "Speaker Diarization: A Review of Recent Research." IEEE Transactions on Audio, Speech, and Language Processing 20, no. 2 (2012): 356–70. http://dx.doi.org/10.1109/tasl.2011.2125954.
Full textFriedland, G., A. Janin, D. Imseng, et al. "The ICSI RT-09 Speaker Diarization System." IEEE Transactions on Audio, Speech, and Language Processing 20, no. 2 (2012): 371–81. http://dx.doi.org/10.1109/tasl.2011.2158419.
Full textHuijbregts, Marijn, David A. van Leeuwen, and Chuck Wooters. "Speaker Diarization Error Analysis Using Oracle Components." IEEE Transactions on Audio, Speech, and Language Processing 20, no. 2 (2012): 393–403. http://dx.doi.org/10.1109/tasl.2011.2162318.
Full textO’Shaughnessy, Douglas. "Speaker Diarization: A Review of Objectives and Methods." Applied Sciences 15, no. 4 (2025): 2002. https://doi.org/10.3390/app15042002.
Full textVaquero, C., A. Ortega, A. Miguel, and Eduardo Lleida. "Quality Assessment for Speaker Diarization and Its Application in Speaker Characterization." IEEE Transactions on Audio, Speech, and Language Processing 21, no. 4 (2013): 816–27. http://dx.doi.org/10.1109/tasl.2012.2236317.
Full textKothalkar, Prasanna V., Dwight Irvin, Jay Buzhardt, and John H. Hansen. "End-to-end child-adult speech diarization in naturalistic conditions of preschool classrooms." Journal of the Acoustical Society of America 153, no. 3_supplement (2023): A174. http://dx.doi.org/10.1121/10.0018568.
Full textAhmed, Ahmed Isam, John P. Chiverton, David L. Ndzi, and Mahmoud M. Al-Faris. "Channel and channel subband selection for speaker diarization." Computer Speech & Language 75 (September 2022): 101367. http://dx.doi.org/10.1016/j.csl.2022.101367.
Full textRho, Jinsang, Suwon Shon, Sung Soo Kim, Jae-Won Lee, and Hanseok Ko. "Local Distribution Based Density Clustering for Speaker Diarization." Journal of the Acoustical Society of Korea 34, no. 4 (2015): 303–9. http://dx.doi.org/10.7776/ask.2015.34.4.303.
Full textSultan, Wael Ali, Mourad Samir Semary, and Sherif Mahdy Abdou. "An Efficient Speaker Diarization Pipeline for Conversational Speech." Benha Journal of Applied Sciences 9, no. 5 (2024): 141–46. http://dx.doi.org/10.21608/bjas.2024.284482.1414.
Full text