References
- Y. Esteve, T. Bazillon, J. Y. Antoine, F. Bechet, and J. Farinas, "The EPAC corpus: manual and automatic annotations of conversational speech in French broadcast news," in LREC, 1686-1689 (2010).
- E. El-Khoury, C. Senac, and J. Pinquier, "Improved speaker diarization system for meetings," in IEEE ICASSP, 4097-4100 (2009).
- A. Vinciarelli, F. Fernandez, and S. Favre, "Semantic segmentation of radio programs using social network analysis and duration distribution modeling," in ICME, 779-782 (2007).
- H. Tang, S. M. Chu, M. Hasegawa-Johnson, and T.S. Huang, "Partially supervised speaker clustering," IEEE Transactions on Pattern Analysis and Machine Intelligence 34, 959-971 (2012). https://doi.org/10.1109/TPAMI.2011.174
- T. Pfau, D. P. W. Ellis, and A. Stolcke, "Multispeaker speech activity detection for the ICSI meeting recorder," in IEEE Workshop on Automatic Speech Recognition and Understanding, 107-110 (2001).
- D. Ying, Y. Yan, J. Dang, and F. K. Soong, "Voice activity detection based on an unsupervised learning framework," IEEE Transactions on Audio, Speech, and Language Processing 19, 2624-2633 (2011). https://doi.org/10.1109/TASL.2011.2125953
- F. G. Germain, D. L. Sun, and G. J. Mysore, "Speaker and noise independent voice activity detection," in Interspeech 2013, 732-736 (2013).
- R. Sinha, S. Tranter, M. Gales, and P. Woodland, "The Cambridge University March 2005 speaker diarisation system," in Interspeech, 2437-2440 (2005).
- P. Jain and H. Hermansky, "Improved mean and variance normalization for robust speech recognition," in IEEE ICASSP, 4015-4015 (2001).
- D. A. Reynolds, T. F. Quatieri, and R. B. Dunn, "Speaker verification using adapted Gaussian mixture models," Digital Signal Processing 10, 19-41 (2000).
- S. Meignier, D. Moraru, C. Fredouille, J. F. Bonastre, and L. Besacier, "Step-by-step and integrated approaches in broadcast news speaker diarization," Computer Speech & Language 20, 303-330 (2006). https://doi.org/10.1016/j.csl.2005.08.002
- X. Zhu, C. Barras, L. Lamel, and J-L. Gauvain, "Multi-stage speaker diarization for conference and lecture meetings," in Multimodal Technologies for Perception of Humans (Springer Berlin Heidelberg, Germany, 2008), pp. 533-542.
- S. Salvador and P. Chan, "Determining the number of clusters/segments in hierarchical clustering/segmentation algorithms," in 16th IEEE International Conference on Tools with Artificial Intelligence, 576-584 (2004).
- H-P. Kriegel, P. Kroger, J. Sander, and A. Zimek, "Density-based clustering," Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 1, 231-240 (2011).
- M. Ester, H-P. Kriegel, J. Sander, and X. Xu, "A density-based algorithm for discovering clusters in large spatial databases with noise," in Knowledge Discovery and Data Mining 96, 226-231 (1996).
- C. Braune, S. Besecke, and R. Kruse, "Density based clustering: alternatives to DBSCAN," in Partitional Clustering Algorithms (Springer International Publishing, Switzerland, 2015), pp. 193-213.
- Z. Aoying, Z. Shuigeng, C. Jing, F. Ye, and H. Yunfa, "Approaches for scaling DBSCAN algorithm to large spatial databases," Journal of Computer Science and Technology 15, 509-526 (2000).
- N. Dehak, P. J. Kenny, R. Dehak, P. Dumouchel, and P. Ouellet, "Front-end factor analysis for speaker verification," IEEE Transactions on Audio, Speech, and Language Processing 19, 788-798 (2011). https://doi.org/10.1109/TASL.2010.2064307