Journal articles on the topic 'Speaker embedding'
Mridha, Muhammad Firoz, Abu Quwsar Ohi, Muhammad Mostafa Monowar, Md Abdul Hamid, Md Rashedul Islam, and Yutaka Watanobe. "U-Vectors: Generating Clusterable Speaker Embedding from Unlabeled Data." Applied Sciences 11, no. 21 (2021): 10079. http://dx.doi.org/10.3390/app112110079.
Kim, Minsoo, and Gil-Jin Jang. "Speaker-Attributed Training for Multi-Speaker Speech Recognition Using Multi-Stage Encoders and Attention-Weighted Speaker Embedding." Applied Sciences 14, no. 18 (2024): 8138. http://dx.doi.org/10.3390/app14188138.
Liu, Elaine M., Jih-Wei Yeh, Jen-Hao Lu, and Yi-Wen Liu. "Speaker embedding space cosine similarity comparisons of singing voice conversion models and voice morphing." Journal of the Acoustical Society of America 154, no. 4_supplement (2023): A244. http://dx.doi.org/10.1121/10.0023424.
Pick, Ron Korenblum, Vladyslav Kozhukhov, Dan Vilenchik, and Oren Tsur. "STEM: Unsupervised STructural EMbedding for Stance Detection." Proceedings of the AAAI Conference on Artificial Intelligence 36, no. 10 (2022): 11174–82. http://dx.doi.org/10.1609/aaai.v36i10.21367.
Karamyan, Davit S., and Grigor A. Kirakosyan. "Building a Speaker Diarization System: Lessons from VoxSRC 2023." Mathematical Problems of Computer Science 60 (November 30, 2023): 52–62. http://dx.doi.org/10.51408/1963-0109.
Milewski, Krzysztof, Szymon Zaporowski, and Andrzej Czyżewski. "Comparison of the Ability of Neural Network Model and Humans to Detect a Cloned Voice." Electronics 12, no. 21 (2023): 4458. http://dx.doi.org/10.3390/electronics12214458.
Kang, Woo Hyun, Sung Hwan Mun, Min Hyun Han, and Nam Soo Kim. "Disentangled Speaker and Nuisance Attribute Embedding for Robust Speaker Verification." IEEE Access 8 (2020): 141838–49. http://dx.doi.org/10.1109/access.2020.3012893.
Poojary, Nigam R., and K. H. Ashish. "Text To Speech with Custom Voice." International Journal for Research in Applied Science and Engineering Technology 11, no. 4 (2023): 4523–30. http://dx.doi.org/10.22214/ijraset.2023.51217.
Lee, Kong Aik, Qiongqiong Wang, and Takafumi Koshinaka. "Xi-Vector Embedding for Speaker Recognition." IEEE Signal Processing Letters 28 (2021): 1385–89. http://dx.doi.org/10.1109/lsp.2021.3091932.
Sečujski, Milan, Darko Pekar, Siniša Suzić, Anton Smirnov, and Tijana Nosek. "Speaker/Style-Dependent Neural Network Speech Synthesis Based on Speaker/Style Embedding." JUCS - Journal of Universal Computer Science 26, no. 4 (2020): 434–53. http://dx.doi.org/10.3897/jucs.2020.023.
Chadchankar, Mrs Asharani. "Advancements in Speaker-Independent Speech Separation Using Deep Attractor Networks." International Journal for Research in Applied Science and Engineering Technology 13, no. 5 (2025): 4056–61. https://doi.org/10.22214/ijraset.2025.71160.
Bae, Ara, and Wooil Kim. "Speaker Verification Employing Combinations of Self-Attention Mechanisms." Electronics 9, no. 12 (2020): 2201. http://dx.doi.org/10.3390/electronics9122201.
Wirdiani, Ayu, Steven Ndung'u Machetho, I. Ketut Gede Darma Putra, Made Sudarma, Rukmi Sari Hartati, and Henrico Aldy Ferdian. "Improvement Model for Speaker Recognition using MFCC-CNN and Online Triplet Mining." International Journal on Advanced Science, Engineering and Information Technology 14, no. 2 (2024): 420–27. http://dx.doi.org/10.18517/ijaseit.14.2.19396.
Li, Xiao, Xiao Chen, Rui Fu, Xiao Hu, Mintong Chen, and Kun Niu. "Learning Deep Embedding with Acoustic and Phoneme Features for Speaker Recognition in FM Broadcasting." IET Biometrics 2024 (March 22, 2024): 1–10. http://dx.doi.org/10.1049/2024/6694481.
Pan, Weijun, Shenhao Chen, Yidi Wang, Sheng Chen, and Xuan Wang. "The Speaker Identification Model for Air-Ground Communication Based on a Parallel Branch Architecture." Applied Sciences 15, no. 6 (2025): 2994. https://doi.org/10.3390/app15062994.
Brydinskyi, Vitalii, Yuriy Khoma, Dmytro Sabodashko, et al. "Comparison of Modern Deep Learning Models for Speaker Verification." Applied Sciences 14, no. 4 (2024): 1329. http://dx.doi.org/10.3390/app14041329.
Lin, Weiwei, and Man-Wai Mak. "Mixture Representation Learning for Deep Speaker Embedding." IEEE/ACM Transactions on Audio, Speech, and Language Processing 30 (2022): 968–78. http://dx.doi.org/10.1109/taslp.2022.3153270.
Ghorbani, Shahram, and John H. L. Hansen. "Advanced accent/dialect identification and accentedness assessment with multi-embedding models and automatic speech recognition." Journal of the Acoustical Society of America 155, no. 6 (2024): 3848–60. http://dx.doi.org/10.1121/10.0026235.
Khoma, Volodymyr, Yuriy Khoma, Vitalii Brydinskyi, and Alexander Konovalov. "Development of Supervised Speaker Diarization System Based on the PyAnnote Audio Processing Library." Sensors 23, no. 4 (2023): 2082. http://dx.doi.org/10.3390/s23042082.
Bahmaninezhad, Fahimeh, Chunlei Zhang, and John H. L. Hansen. "An investigation of domain adaptation in speaker embedding space for speaker recognition." Speech Communication 129 (May 2021): 7–16. http://dx.doi.org/10.1016/j.specom.2021.01.001.
Zeng, Bang, and Ming Li. "Universal Speaker Embedding Free Target Speaker Extraction and Personal Voice Activity Detection." Computer Speech & Language 94 (November 2025): 101807. https://doi.org/10.1016/j.csl.2025.101807.
Xylogiannis, Paris, Nikolaos Vryzas, Lazaros Vrysis, and Charalampos Dimoulas. "Multisensory Fusion for Unsupervised Spatiotemporal Speaker Diarization." Sensors 24, no. 13 (2024): 4229. http://dx.doi.org/10.3390/s24134229.
Shahin Shamsabadi, Ali, Brij Mohan Lal Srivastava, Aurélien Bellet, et al. "Differentially Private Speaker Anonymization." Proceedings on Privacy Enhancing Technologies 2023, no. 1 (2023): 98–114. http://dx.doi.org/10.56553/popets-2023-0007.
Li, Wenjie, Pengyuan Zhang, and Yonghong Yan. "TEnet: target speaker extraction network with accumulated speaker embedding for automatic speech recognition." Electronics Letters 55, no. 14 (2019): 816–19. http://dx.doi.org/10.1049/el.2019.1228.
Xie, Fei, Dalong Zhang, and Chengming Liu. "Global–Local Self-Attention Based Transformer for Speaker Verification." Applied Sciences 12, no. 19 (2022): 10154. http://dx.doi.org/10.3390/app121910154.
Shim, Hye-jin, Jee-weon Jung, and Ha-Jin Yu. "Which to select?: Analysis of speaker representation with graph attention networks." Journal of the Acoustical Society of America 156, no. 4 (2024): 2701–8. http://dx.doi.org/10.1121/10.0032393.
Guo, Xin, Chengfang Luo, Aiwen Deng, and Feiqi Deng. "DeltaVLAD: An efficient optimization algorithm to discriminate speaker embedding for text-independent speaker verification." AIMS Mathematics 7, no. 4 (2022): 6381–95. http://dx.doi.org/10.3934/math.2022355.
Prabhala, Jagat Chaitanya, Venkatnareshbabu K, and Ragoju Ravi. "Optimizing Similarity Threshold for Abstract Similarity Metric in Speech Diarization Systems: A Mathematical Formulation." Applied Mathematics and Sciences An International Journal (MathSJ) 10, no. 1/2 (2023): 1–10. http://dx.doi.org/10.5121/mathsj.2023.10201.
Smith, Sierra Rose, Patricia Crist, Rebekah Givens, Taylor Stringer, and Adriana Macdonald. "Interviews Regarding Practice Scholar Engagement: Practitioners’ Descriptions of Their Research Motivations, Characteristics, Resources, & Outcomes." American Journal of Occupational Therapy 78, Supplement_2 (2024): 7811500214p1. http://dx.doi.org/10.5014/ajot.2024.78s2-po214.
Mingote, Victoria, Antonio Miguel, Alfonso Ortega, and Eduardo Lleida. "Supervector Extraction for Encoding Speaker and Phrase Information with Neural Networks for Text-Dependent Speaker Verification." Applied Sciences 9, no. 16 (2019): 3295. http://dx.doi.org/10.3390/app9163295.
Lyu, Ke-Ming, Ren-yuan Lyu, and Hsien-Tsung Chang. "Real-time multilingual speech recognition and speaker diarization system based on Whisper segmentation." PeerJ Computer Science 10 (March 29, 2024): e1973. http://dx.doi.org/10.7717/peerj-cs.1973.
Liang, Chunyan, Lin Yang, Qingwei Zhao, and Yonghong Yan. "Factor Analysis of Neighborhood-Preserving Embedding for Speaker Verification." IEICE Transactions on Information and Systems E95.D, no. 10 (2012): 2572–76. http://dx.doi.org/10.1587/transinf.e95.d.2572.
Byun, Jaeuk, and Jong Won Shin. "Monaural Speech Separation Using Speaker Embedding From Preliminary Separation." IEEE/ACM Transactions on Audio, Speech, and Language Processing 29 (2021): 2753–63. http://dx.doi.org/10.1109/taslp.2021.3101617.
Lin, Weiwei, Man-Wai Mak, Na Li, Dan Su, and Dong Yu. "A Framework for Adapting DNN Speaker Embedding Across Languages." IEEE/ACM Transactions on Audio, Speech, and Language Processing 28 (2020): 2810–22. http://dx.doi.org/10.1109/taslp.2020.3030499.
Misbullah, Alim, Muhammad Saifullah Sani, Husaini, Laina Farsiah, Zahnur, and Kikye Martiwi Sukiakhy. "Sistem Identifikasi Pembicara Berbahasa Indonesia Menggunakan X-Vector Embedding." Jurnal Teknologi Informasi dan Ilmu Komputer 11, no. 2 (2024): 369–76. http://dx.doi.org/10.25126/jtiik.20241127866.
Li, Yanxiong, Qisheng Huang, Xiaofen Xing, and Xiangmin Xu. "Low-complexity speaker embedding module with feature segmentation, transformation and reconstruction for few-shot speaker identification." Expert Systems with Applications 280 (June 2025): 127542. https://doi.org/10.1016/j.eswa.2025.127542.
Zhou, Yi, Xiaohai Tian, and Haizhou Li. "Language Agnostic Speaker Embedding for Cross-Lingual Personalized Speech Generation." IEEE/ACM Transactions on Audio, Speech, and Language Processing 29 (2021): 3427–39. http://dx.doi.org/10.1109/taslp.2021.3125142.
杨, 益灵. "Multi-Speaker Indonesian Speech Synthesis Based on Global Style Embedding." Computer Science and Application 13, no. 01 (2023): 126–35. http://dx.doi.org/10.12677/csa.2023.131013.
Kim, Ju-Ho, Hye-Jin Shim, Jee-Weon Jung, and Ha-Jin Yu. "A Supervised Learning Method for Improving the Generalization of Speaker Verification Systems by Learning Metrics from a Mean Teacher." Applied Sciences 12, no. 1 (2021): 76. http://dx.doi.org/10.3390/app12010076.
Seo, Soonshin, and Ji-Hwan Kim. "Self-Attentive Multi-Layer Aggregation with Feature Recalibration and Deep Length Normalization for Text-Independent Speaker Verification System." Electronics 9, no. 10 (2020): 1706. http://dx.doi.org/10.3390/electronics9101706.
Byun, Sung-Woo, and Seok-Pil Lee. "Design of a Multi-Condition Emotional Speech Synthesizer." Applied Sciences 11, no. 3 (2021): 1144. http://dx.doi.org/10.3390/app11031144.
Wang, Jiani, Shiran Dudy, Xinlu Hu, Zhiyong Wang, Rosy Southwell, and Jacob Whitehill. "Optimizing Speaker Diarization for the Classroom: Applications in Timing Student Speech and Distinguishing Teachers from Children." Journal of Educational Data Mining 17, no. 1 (2025): 98–125. https://doi.org/10.5281/zenodo.14871875.
Wang, Shuai, Zili Huang, Yanmin Qian, and Kai Yu. "Discriminative Neural Embedding Learning for Short-Duration Text-Independent Speaker Verification." IEEE/ACM Transactions on Audio, Speech, and Language Processing 27, no. 11 (2019): 1686–96. http://dx.doi.org/10.1109/taslp.2019.2928128.
Wang, Shuai, Yexin Yang, Zhanghao Wu, Yanmin Qian, and Kai Yu. "Data Augmentation Using Deep Generative Models for Embedding Based Speaker Recognition." IEEE/ACM Transactions on Audio, Speech, and Language Processing 28 (2020): 2598–609. http://dx.doi.org/10.1109/taslp.2020.3016498.
You, Mingyu, Guo-Zheng Li, Jack Y. Yang, and Mary Qu Yang. "An Enhanced Lipschitz Embedding Classifier for Multi-Emotion Speech Analysis." International Journal of Pattern Recognition and Artificial Intelligence 23, no. 08 (2009): 1685–700. http://dx.doi.org/10.1142/s0218001409007764.
Claridge, Claudia, Ewa Jonsson, and Merja Kytö. "Entirely innocent: a historical sociopragmatic analysis of maximizers in the Old Bailey Corpus." English Language and Linguistics 24, no. 4 (2019): 855–74. http://dx.doi.org/10.1017/s1360674319000388.
Viñals, Ignacio, Alfonso Ortega, Antonio Miguel, and Eduardo Lleida. "An Analysis of the Short Utterance Problem for Speaker Characterization." Applied Sciences 9, no. 18 (2019): 3697. http://dx.doi.org/10.3390/app9183697.
Kang, Woo Hyun, and Nam Soo Kim. "Unsupervised Learning of Total Variability Embedding for Speaker Verification with Random Digit Strings." Applied Sciences 9, no. 8 (2019): 1597. http://dx.doi.org/10.3390/app9081597.
Qiu, Zeyu, Jun Tang, Yaxin Zhang, Jiaxin Li, and Xishan Bai. "A Voice Cloning Method Based on the Improved HiFi-GAN Model." Computational Intelligence and Neuroscience 2022 (October 11, 2022): 1–12. http://dx.doi.org/10.1155/2022/6707304.