References
- Xinbo Gao, Xiaoou Tang, "Unsupervised Video-shot Segmentation and Model-free Anchorperson Detection for News Video Story Parsing," IEEE Trans. Circuits and Systems for Video Technology. 12(4), 765-776, 2002 https://doi.org/10.1109/TCSVT.2002.800510
- Alberto Albiol, Luis Torres, Edward J. Delp. "The Indexing of Persons in News Sequences using Audio-visual Data," in Proc. International Conference on Acoustics, Speech, and Signal Processing, 3, 137-140, 2003
- Yuya Akita, Masahiro Hasegawa, Tatsuya Kawahara, "Automatic Audio Archiving System for Panel Discussions," in Proc. International Conference on Multimedia & Expo, 3, 1895 -1862, 2004
- Alfred Dielmann, Steve Renals, "Automatic Meeting Segmentation Using Dynamic Bayesian Networks," IEEE Trans. Multimedia, 9(1), 25-36, 2007 https://doi.org/10.1109/TMM.2006.886337
- 한학용, 허강인, 김수훈, "오디오 데이터의 특징 파라메터 구성 에 따른 내용기반 분석," 한국음향학회지 21(2), 182-189, 2002
- 손종목, 배건성, 강경옥, 김재곤, "내용기반 비디오 색인 및 검색 을 위한 음성인식기술 이용에 관한 연구," 한국음향학회지,20(2), 16-20, 2001
- Soonil Kwon, Shrikanth Narayanan, "Unsupervised Speaker Indexing Using Generic Models," IEEE Trans. Speech and Audio Proc. 13(5), 1004-1013, 2005
- Sue E. Tranter, Douglas A. Reynolds, "An Overview of Automatic Speaker Diarization Systems," IEEE Trans. Audio, Speech and Language Proc. 14(5), 1557-1565, 2006
- Ying Li, Shrikanth Narayanan, C.-C. Jay Kuo, "Audiovisual -based Adaptive Speaker Identification," in Proc. International Conference on Acoustics, Speech, and Signal Processing, 5, 812-815, 2003
- Ki Tae Park, Doo Sun Hwang, Young Shik Moon, "Anchor Frame Detection in News Video Using Anchor Object Extraction," IEICE Trans. Fund., E88-A(6), 1525-1528, 2005 https://doi.org/10.1093/ietfec/e88-a.6.1525
- 금지수, 임성길, 이현수, "스펙트럼 분석과 신경망을 이용한 음성 /음악 분류," 한국음향학회지, 26(5), 207-213, 2007
- Scott Shaobing Chen, P.S. Gopalakrishnan, "Speaker, Environment and Channel Change Detection and Clustering via The Bayesian Information Criterion," DARPA Broadcast News Transcription & Understanding Workshop, 1998
- P. Delacourt, C. J. Wellekens, "DISTBIC: A Speaker Based Segmentation for Audio Data Indexing," Speech Communication, 32, 111-126, 2000 https://doi.org/10.1016/S0167-6393(00)00027-3
- Min Yang, Yingchun Yang, Zhaohui Wu, "A Pitch-based Rapid Speech Segmentation for Speaker Indexing," in Proc. IEEE International Symposium on Multimedia, 2005
- Xuejing, "Pitch Determination and Voice Quality Analysis using Subharmonic-to-Harmonic Ratio," in Proc. International Conference on Acoustics, Speech, and Signal Processing, 1, 333-336, 2002
- Maria Zapata Ferrer, Mauro Barbieri, Hans Weda, "Automatic Classification of Field of View in Video," in Proc. International Conference on Multimedia & Expo, 1609-1612, 2006