Journal articles on the topic "Speaker embedding"
Browse the top 50 journal articles for research on the topic "Speaker embedding".
Mridha, Muhammad Firoz, Abu Quwsar Ohi, Muhammad Mostafa Monowar, Md Abdul Hamid, Md Rashedul Islam, and Yutaka Watanobe. "U-Vectors: Generating Clusterable Speaker Embedding from Unlabeled Data." Applied Sciences 11, no. 21 (2021): 10079. http://dx.doi.org/10.3390/app112110079.
Kim, Minsoo, and Gil-Jin Jang. "Speaker-Attributed Training for Multi-Speaker Speech Recognition Using Multi-Stage Encoders and Attention-Weighted Speaker Embedding." Applied Sciences 14, no. 18 (2024): 8138. http://dx.doi.org/10.3390/app14188138.
Liu, Elaine M., Jih-Wei Yeh, Jen-Hao Lu, and Yi-Wen Liu. "Speaker embedding space cosine similarity comparisons of singing voice conversion models and voice morphing." Journal of the Acoustical Society of America 154, no. 4_supplement (2023): A244. http://dx.doi.org/10.1121/10.0023424.
Pick, Ron Korenblum, Vladyslav Kozhukhov, Dan Vilenchik, and Oren Tsur. "STEM: Unsupervised STructural EMbedding for Stance Detection." Proceedings of the AAAI Conference on Artificial Intelligence 36, no. 10 (2022): 11174–82. http://dx.doi.org/10.1609/aaai.v36i10.21367.
Karamyan, Davit S., and Grigor A. Kirakosyan. "Building a Speaker Diarization System: Lessons from VoxSRC 2023." Mathematical Problems of Computer Science 60 (November 30, 2023): 52–62. http://dx.doi.org/10.51408/1963-0109.
Milewski, Krzysztof, Szymon Zaporowski, and Andrzej Czyżewski. "Comparison of the Ability of Neural Network Model and Humans to Detect a Cloned Voice." Electronics 12, no. 21 (2023): 4458. http://dx.doi.org/10.3390/electronics12214458.
Kang, Woo Hyun, Sung Hwan Mun, Min Hyun Han, and Nam Soo Kim. "Disentangled Speaker and Nuisance Attribute Embedding for Robust Speaker Verification." IEEE Access 8 (2020): 141838–49. http://dx.doi.org/10.1109/access.2020.3012893.
Poojary, Nigam R., and K. H. Ashish. "Text To Speech with Custom Voice." International Journal for Research in Applied Science and Engineering Technology 11, no. 4 (2023): 4523–30. http://dx.doi.org/10.22214/ijraset.2023.51217.
Lee, Kong Aik, Qiongqiong Wang, and Takafumi Koshinaka. "Xi-Vector Embedding for Speaker Recognition." IEEE Signal Processing Letters 28 (2021): 1385–89. http://dx.doi.org/10.1109/lsp.2021.3091932.
Sečujski, Milan, Darko Pekar, Siniša Suzić, Anton Smirnov, and Tijana Nosek. "Speaker/Style-Dependent Neural Network Speech Synthesis Based on Speaker/Style Embedding." JUCS - Journal of Universal Computer Science 26, no. 4 (2020): 434–53. http://dx.doi.org/10.3897/jucs.2020.023.
Chadchankar, Mrs Asharani. "Advancements in Speaker-Independent Speech Separation Using Deep Attractor Networks." International Journal for Research in Applied Science and Engineering Technology 13, no. 5 (2025): 4056–61. https://doi.org/10.22214/ijraset.2025.71160.
Bae, Ara, and Wooil Kim. "Speaker Verification Employing Combinations of Self-Attention Mechanisms." Electronics 9, no. 12 (2020): 2201. http://dx.doi.org/10.3390/electronics9122201.
Wirdiani, Ayu, Steven Ndung'u Machetho, I. Ketut Gede Darma Putra, Made Sudarma, Rukmi Sari Hartati, and Henrico Aldy Ferdian. "Improvement Model for Speaker Recognition using MFCC-CNN and Online Triplet Mining." International Journal on Advanced Science, Engineering and Information Technology 14, no. 2 (2024): 420–27. http://dx.doi.org/10.18517/ijaseit.14.2.19396.
Li, Xiao, Xiao Chen, Rui Fu, Xiao Hu, Mintong Chen, and Kun Niu. "Learning Deep Embedding with Acoustic and Phoneme Features for Speaker Recognition in FM Broadcasting." IET Biometrics 2024 (March 22, 2024): 1–10. http://dx.doi.org/10.1049/2024/6694481.
Pan, Weijun, Shenhao Chen, Yidi Wang, Sheng Chen, and Xuan Wang. "The Speaker Identification Model for Air-Ground Communication Based on a Parallel Branch Architecture." Applied Sciences 15, no. 6 (2025): 2994. https://doi.org/10.3390/app15062994.
Brydinskyi, Vitalii, Yuriy Khoma, Dmytro Sabodashko, et al. "Comparison of Modern Deep Learning Models for Speaker Verification." Applied Sciences 14, no. 4 (2024): 1329. http://dx.doi.org/10.3390/app14041329.
Lin, Weiwei, and Man-Wai Mak. "Mixture Representation Learning for Deep Speaker Embedding." IEEE/ACM Transactions on Audio, Speech, and Language Processing 30 (2022): 968–78. http://dx.doi.org/10.1109/taslp.2022.3153270.
Ghorbani, Shahram, and John H. L. Hansen. "Advanced accent/dialect identification and accentedness assessment with multi-embedding models and automatic speech recognition." Journal of the Acoustical Society of America 155, no. 6 (2024): 3848–60. http://dx.doi.org/10.1121/10.0026235.
Khoma, Volodymyr, Yuriy Khoma, Vitalii Brydinskyi, and Alexander Konovalov. "Development of Supervised Speaker Diarization System Based on the PyAnnote Audio Processing Library." Sensors 23, no. 4 (2023): 2082. http://dx.doi.org/10.3390/s23042082.
Bahmaninezhad, Fahimeh, Chunlei Zhang, and John H. L. Hansen. "An investigation of domain adaptation in speaker embedding space for speaker recognition." Speech Communication 129 (May 2021): 7–16. http://dx.doi.org/10.1016/j.specom.2021.01.001.
Zeng, Bang, and Ming Li. "Universal Speaker Embedding Free Target Speaker Extraction and Personal Voice Activity Detection." Computer Speech & Language 94 (November 2025): 101807. https://doi.org/10.1016/j.csl.2025.101807.
Xylogiannis, Paris, Nikolaos Vryzas, Lazaros Vrysis, and Charalampos Dimoulas. "Multisensory Fusion for Unsupervised Spatiotemporal Speaker Diarization." Sensors 24, no. 13 (2024): 4229. http://dx.doi.org/10.3390/s24134229.
Shahin Shamsabadi, Ali, Brij Mohan Lal Srivastava, Aurélien Bellet, et al. "Differentially Private Speaker Anonymization." Proceedings on Privacy Enhancing Technologies 2023, no. 1 (2023): 98–114. http://dx.doi.org/10.56553/popets-2023-0007.
Li, Wenjie, Pengyuan Zhang, and Yonghong Yan. "TEnet: target speaker extraction network with accumulated speaker embedding for automatic speech recognition." Electronics Letters 55, no. 14 (2019): 816–19. http://dx.doi.org/10.1049/el.2019.1228.
Xie, Fei, Dalong Zhang, and Chengming Liu. "Global–Local Self-Attention Based Transformer for Speaker Verification." Applied Sciences 12, no. 19 (2022): 10154. http://dx.doi.org/10.3390/app121910154.
Shim, Hye-jin, Jee-weon Jung, and Ha-Jin Yu. "Which to select?: Analysis of speaker representation with graph attention networks." Journal of the Acoustical Society of America 156, no. 4 (2024): 2701–8. http://dx.doi.org/10.1121/10.0032393.
Guo, Xin, Chengfang Luo, Aiwen Deng, and Feiqi Deng. "DeltaVLAD: An efficient optimization algorithm to discriminate speaker embedding for text-independent speaker verification." AIMS Mathematics 7, no. 4 (2022): 6381–95. http://dx.doi.org/10.3934/math.2022355.
Prabhala, Jagat Chaitanya, Venkatnareshbabu K, and Ragoju Ravi. "Optimizing Similarity Threshold for Abstract Similarity Metric in Speech Diarization Systems: A Mathematical Formulation." Applied Mathematics and Sciences An International Journal (MathSJ) 10, no. 1/2 (2023): 1–10. http://dx.doi.org/10.5121/mathsj.2023.10201.
Smith, Sierra Rose, Patricia Crist, Rebekah Givens, Taylor Stringer, and Adriana Macdonald. "Interviews Regarding Practice Scholar Engagement: Practitioners’ Descriptions of Their Research Motivations, Characteristics, Resources, & Outcomes." American Journal of Occupational Therapy 78, Supplement_2 (2024): 7811500214p1. http://dx.doi.org/10.5014/ajot.2024.78s2-po214.
Mingote, Victoria, Antonio Miguel, Alfonso Ortega, and Eduardo Lleida. "Supervector Extraction for Encoding Speaker and Phrase Information with Neural Networks for Text-Dependent Speaker Verification." Applied Sciences 9, no. 16 (2019): 3295. http://dx.doi.org/10.3390/app9163295.
Lyu, Ke-Ming, Ren-yuan Lyu, and Hsien-Tsung Chang. "Real-time multilingual speech recognition and speaker diarization system based on Whisper segmentation." PeerJ Computer Science 10 (March 29, 2024): e1973. http://dx.doi.org/10.7717/peerj-cs.1973.
Liang, Chunyan, Lin Yang, Qingwei Zhao, and Yonghong Yan. "Factor Analysis of Neighborhood-Preserving Embedding for Speaker Verification." IEICE Transactions on Information and Systems E95.D, no. 10 (2012): 2572–76. http://dx.doi.org/10.1587/transinf.e95.d.2572.
Byun, Jaeuk, and Jong Won Shin. "Monaural Speech Separation Using Speaker Embedding From Preliminary Separation." IEEE/ACM Transactions on Audio, Speech, and Language Processing 29 (2021): 2753–63. http://dx.doi.org/10.1109/taslp.2021.3101617.
Lin, Weiwei, Man-Wai Mak, Na Li, Dan Su, and Dong Yu. "A Framework for Adapting DNN Speaker Embedding Across Languages." IEEE/ACM Transactions on Audio, Speech, and Language Processing 28 (2020): 2810–22. http://dx.doi.org/10.1109/taslp.2020.3030499.
Misbullah, Alim, Muhammad Saifullah Sani, Husaini, Laina Farsiah, Zahnur, and Kikye Martiwi Sukiakhy. "Sistem Identifikasi Pembicara Berbahasa Indonesia Menggunakan X-Vector Embedding." Jurnal Teknologi Informasi dan Ilmu Komputer 11, no. 2 (2024): 369–76. http://dx.doi.org/10.25126/jtiik.20241127866.
Li, Yanxiong, Qisheng Huang, Xiaofen Xing, and Xiangmin Xu. "Low-complexity speaker embedding module with feature segmentation, transformation and reconstruction for few-shot speaker identification." Expert Systems with Applications 280 (June 2025): 127542. https://doi.org/10.1016/j.eswa.2025.127542.
Zhou, Yi, Xiaohai Tian, and Haizhou Li. "Language Agnostic Speaker Embedding for Cross-Lingual Personalized Speech Generation." IEEE/ACM Transactions on Audio, Speech, and Language Processing 29 (2021): 3427–39. http://dx.doi.org/10.1109/taslp.2021.3125142.
杨, 益灵. "Multi-Speaker Indonesian Speech Synthesis Based on Global Style Embedding." Computer Science and Application 13, no. 01 (2023): 126–35. http://dx.doi.org/10.12677/csa.2023.131013.
Kim, Ju-Ho, Hye-Jin Shim, Jee-Weon Jung, and Ha-Jin Yu. "A Supervised Learning Method for Improving the Generalization of Speaker Verification Systems by Learning Metrics from a Mean Teacher." Applied Sciences 12, no. 1 (2021): 76. http://dx.doi.org/10.3390/app12010076.
Seo, Soonshin, and Ji-Hwan Kim. "Self-Attentive Multi-Layer Aggregation with Feature Recalibration and Deep Length Normalization for Text-Independent Speaker Verification System." Electronics 9, no. 10 (2020): 1706. http://dx.doi.org/10.3390/electronics9101706.
Byun, Sung-Woo, and Seok-Pil Lee. "Design of a Multi-Condition Emotional Speech Synthesizer." Applied Sciences 11, no. 3 (2021): 1144. http://dx.doi.org/10.3390/app11031144.
Wang, Jiani, Shiran Dudy, Xinlu Hu, Zhiyong Wang, Rosy Southwell, and Jacob Whitehill. "Optimizing Speaker Diarization for the Classroom: Applications in Timing Student Speech and Distinguishing Teachers from Children." Journal of Educational Data Mining 17, no. 1 (2025): 98–125. https://doi.org/10.5281/zenodo.14871875.
Wang, Shuai, Zili Huang, Yanmin Qian, and Kai Yu. "Discriminative Neural Embedding Learning for Short-Duration Text-Independent Speaker Verification." IEEE/ACM Transactions on Audio, Speech, and Language Processing 27, no. 11 (2019): 1686–96. http://dx.doi.org/10.1109/taslp.2019.2928128.
Wang, Shuai, Yexin Yang, Zhanghao Wu, Yanmin Qian, and Kai Yu. "Data Augmentation Using Deep Generative Models for Embedding Based Speaker Recognition." IEEE/ACM Transactions on Audio, Speech, and Language Processing 28 (2020): 2598–609. http://dx.doi.org/10.1109/taslp.2020.3016498.
You, Mingyu, Guo-Zheng Li, Jack Y. Yang, and Mary Qu Yang. "An Enhanced Lipschitz Embedding Classifier for Multi-Emotion Speech Analysis." International Journal of Pattern Recognition and Artificial Intelligence 23, no. 08 (2009): 1685–700. http://dx.doi.org/10.1142/s0218001409007764.
Claridge, Claudia, Ewa Jonsson, and Merja Kytö. "Entirely innocent: a historical sociopragmatic analysis of maximizers in the Old Bailey Corpus." English Language and Linguistics 24, no. 4 (2019): 855–74. http://dx.doi.org/10.1017/s1360674319000388.
Viñals, Ignacio, Alfonso Ortega, Antonio Miguel, and Eduardo Lleida. "An Analysis of the Short Utterance Problem for Speaker Characterization." Applied Sciences 9, no. 18 (2019): 3697. http://dx.doi.org/10.3390/app9183697.
Kang, Woo Hyun, and Nam Soo Kim. "Unsupervised Learning of Total Variability Embedding for Speaker Verification with Random Digit Strings." Applied Sciences 9, no. 8 (2019): 1597. http://dx.doi.org/10.3390/app9081597.
Qiu, Zeyu, Jun Tang, Yaxin Zhang, Jiaxin Li, and Xishan Bai. "A Voice Cloning Method Based on the Improved HiFi-GAN Model." Computational Intelligence and Neuroscience 2022 (October 11, 2022): 1–12. http://dx.doi.org/10.1155/2022/6707304.