Performance Comparison of Feature Parameters and Classifiers for Speech/Music Discrimination

Kim Hyung Soon; Kim Su Mi

Journal of the Korean Society of Phonetic Sciences and Speech Technology: Malsori (MALSORI)

No. 46 / pp. 37-50 / 2003 / pISSN 1226-1173

The Korean Society of Phonetic Sciences and Speech Technology

Kim Hyung Soon (Pusan National University); Kim Su Mi (Pusan National University)

Published: 2003-06-01

Abstract

In this paper, we evaluate and compare the performance of speech/music discrimination based on various feature parameters and classifiers. The feature parameters considered are High Zero Crossing Rate Ratio (HZCRR), Low Short Time Energy Ratio (LSTER), Spectral Flux (SF), Line Spectral Pair (LSP) distance, entropy, and dynamism. We also examine three classifiers: k-Nearest Neighbor (k-NN), Gaussian Mixture Model (GMM), and Hidden Markov Model (HMM). In our experiments, LSP distance and the phoneme-recognizer-based feature set (entropy and dynamism) perform well, while performance differences among classifiers are not significant. When all six feature parameters are used, the average speech/music discrimination accuracy reaches up to 96.6%.
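
The abstract names HZCRR and LSTER without defining them. As a rough illustration only, the sketch below uses the definitions commonly found in the speech/music discrimination literature: HZCRR is the share of frames in a window whose zero crossing rate exceeds 1.5 times the window mean, and LSTER is the share of frames whose short-time energy falls below 0.5 times the window mean. These thresholds, the frame length, the sampling rate, and all function names are assumptions made for this example; the paper's own settings are not stated in the abstract.

```python
import math


def split_frames(samples, frame_len):
    """Cut samples into consecutive non-overlapping frames; drop any remainder."""
    count = len(samples) // frame_len
    return [samples[i * frame_len:(i + 1) * frame_len] for i in range(count)]


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ (frame needs >= 2 samples)."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)


def short_time_energy(frame):
    """Mean squared amplitude of one frame."""
    return sum(x * x for x in frame) / len(frame)


def hzcrr(frames, factor=1.5):
    """Share of frames whose ZCR exceeds `factor` times the window mean ZCR.

    The 1.5 factor is an assumed, commonly cited value, not taken from the paper.
    """
    zcrs = [zero_crossing_rate(f) for f in frames]
    mean_zcr = sum(zcrs) / len(zcrs)
    return sum(1 for z in zcrs if z > factor * mean_zcr) / len(zcrs)


def lster(frames, factor=0.5):
    """Share of frames whose energy is below `factor` times the window mean energy.

    The 0.5 factor is an assumed, commonly cited value, not taken from the paper.
    """
    energies = [short_time_energy(f) for f in frames]
    mean_energy = sum(energies) / len(energies)
    return sum(1 for e in energies if e < factor * mean_energy) / len(energies)


def tone(n_samples, freq=440.0, rate=16000):
    """Synthetic sine tone used only to exercise the functions above."""
    return [math.sin(2 * math.pi * freq * n / rate) for n in range(n_samples)]


if __name__ == "__main__":
    # One second of a steady tone, cut into 25 ms frames (400 samples at 16 kHz).
    steady = split_frames(tone(16000), 400)
    print("steady tone:", hzcrr(steady), lster(steady))
```

Under these assumptions, a steady tone has nearly identical frames, so both ratios stay at zero, whereas a signal that alternates between bursts and silence yields a high LSTER. That contrast is the intuition usually given for why such ratios help separate speech, with its frequent pauses, from music.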
