• Title/Abstract/Keywords: vocal region detection

5 search results (processing time: 0.018 s)

A Karaoke System Based on the Vocal Characteristics

  • 김유승;김인철
    • Journal of Broadcast Engineering / Vol. 13, No. 3 / pp. 380-387 / 2008
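  • This paper presents a karaoke system that applies a vocal region detection algorithm based on vocal characteristics. In the proposed system, the vocal region detection algorithm classifies the input music into vocal segments and accompaniment segments. A vocal removal technique is then applied only to the vocal regions. Vocal region detection performs the classification in the TICFT (twice iterated composite Fourier transform) domain, taking the characteristics of vocals into account. For vocal removal, the vocal component is extracted from the band-pass-filtered vocal region and subtracted from the original music, which yields music with the vocal component removed. The proposed technique is applied to four songs and its performance is evaluated.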
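The region-restricted subtraction described in this abstract can be sketched as follows. This is only an illustration under loud assumptions: the vocal regions are taken as given (the TICFT-based detector is not reproduced), the band-pass filter is a crude difference of two moving averages, and the band-passed signal itself stands in for the extracted vocal component, which the abstract does not specify. All function names and window widths are hypothetical.

```python
def moving_average(x, width):
    """Causal moving average; a crude low-pass filter."""
    out, acc = [], 0.0
    for i, v in enumerate(x):
        acc += v
        if i >= width:
            acc -= x[i - width]
        out.append(acc / min(i + 1, width))
    return out


def crude_bandpass(x, short=4, long=64):
    """Difference of two moving averages (assumed stand-in for a band-pass)."""
    fast = moving_average(x, short)
    slow = moving_average(x, long)
    return [f - s for f, s in zip(fast, slow)]


def remove_vocals(x, vocal_regions, short=4, long=64):
    """Subtract the band-passed estimate from x inside vocal regions only.

    vocal_regions: list of (start, end) sample ranges, end exclusive,
    assumed to come from a separate vocal region detector.
    """
    out = list(x)
    for start, end in vocal_regions:
        segment = x[start:end]
        vocal_estimate = crude_bandpass(segment, short, long)
        for i, v in enumerate(vocal_estimate):
            out[start + i] = segment[i] - v
    return out


# Toy signal: only samples 50..99 are treated as a vocal region.
signal = [float((n * 7) % 13) for n in range(200)]
processed = remove_vocals(signal, [(50, 100)])
```

Samples outside the listed regions pass through unchanged, which mirrors the abstract's point that removal is applied to vocal regions only.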

A Study on the Pitch Detection of Speech Harmonics by the Peak-Fitting

  • 김종국;조왕래;배명진
    • Speech Sciences / Vol. 10, No. 2 / pp. 85-95 / 2003
  • In speech signal processing, accurate pitch detection is very important for speech recognition, synthesis, and analysis. In analysis, an accurately detected pitch makes it possible to obtain the vocal tract parameters properly. In synthesis, it can be used to change or maintain the naturalness and intelligibility of the speech, and in recognition it can be used to remove speaker individuality for speaker independence. In this paper, we propose a new pitch detection algorithm. First, in the time domain, positive center clipping is applied using the slope of the speech signal in order to emphasize the pitch period with the glottal component, from which the vocal tract characteristic has been removed. Next, a rough formant envelope is computed by peak-fitting the spectrum of the original speech signal in the frequency domain. A smoothed formant envelope is then obtained from the rough envelope by linear interpolation. The flattened harmonics waveform is obtained as the algebraic difference between the spectrum of the original speech signal and the smoothed formant envelope. An inverse fast Fourier transform (IFFT) of these flattened harmonics finally yields a residual signal from which the vocal tract component has been removed. The performance was compared with the LPC, cepstrum, and ACF methods. With this algorithm, the accuracy of pitch detection improved and the gross error rate was reduced in voiced speech regions and in phoneme transition regions.
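To make the time-domain step concrete, the sketch below combines a simple positive center clipper with autocorrelation (ACF) pitch estimation, one of the baselines named in the abstract. It is not the authors' peak-fitting method: the clipper uses a fixed amplitude threshold rather than the slope of the speech signal, and the threshold ratio, pitch search range, and function names are all illustrative assumptions.

```python
import math


def positive_center_clip(x, ratio=0.3):
    """Keep only the part of x above a positive threshold.

    The threshold is `ratio` times the largest absolute sample; the
    value 0.3 is an illustrative assumption, not taken from the paper.
    """
    threshold = ratio * max(abs(v) for v in x)
    return [v - threshold if v > threshold else 0.0 for v in x]


def autocorrelation_pitch(x, fs, f_min=60.0, f_max=400.0):
    """Estimate pitch (Hz) as fs / lag at the autocorrelation maximum."""
    lag_min = int(fs / f_max)
    lag_max = int(fs / f_min)
    best_lag, best_value = lag_min, float("-inf")
    for lag in range(lag_min, lag_max + 1):
        n_terms = len(x) - lag
        # Mean product at this lag, so long and short lags are comparable.
        value = sum(x[n] * x[n + lag] for n in range(n_terms)) / n_terms
        if value > best_value:
            best_lag, best_value = lag, value
    return fs / best_lag


# Synthetic voiced frame: 100 Hz fundamental plus two harmonics.
fs = 8000
frame = [
    math.sin(2 * math.pi * 100 * n / fs)
    + 0.5 * math.sin(2 * math.pi * 200 * n / fs)
    + 0.3 * math.sin(2 * math.pi * 300 * n / fs)
    for n in range(800)
]
f0 = autocorrelation_pitch(positive_center_clip(frame), fs)
print(f"estimated pitch: {f0:.1f} Hz")
```

Clipping leaves one pulse per pitch period, which suppresses the formant-driven ripple that would otherwise create competing autocorrelation peaks.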


Flattening Techniques for Pitch Detection

  • 김종국;조왕래;배명진
    • Institute of Electronics Engineers of Korea (IEEK): Conference Proceedings / Proceedings of the 2002 IEEK Summer Conference (4) / pp. 381-384 / 2002
  • In speech signal processing, accurate pitch detection is very important for speech recognition, synthesis, and analysis. However, detecting the pitch of a speech signal is very difficult because of the effects of formants and transition amplitude. In this paper, we therefore propose a pitch detection method that uses spectrum flattening techniques. Spectrum flattening eliminates the effects of formants and transition amplitude. In the time domain, positive center clipping is applied in order to emphasize the pitch period with the glottal component, from which the vocal tract characteristic has been removed. A rough formant envelope is then computed by peak-fitting the spectrum of the original speech signal in the frequency domain. As a result, the flattened harmonics waveform is obtained as the algebraic difference between the spectrum of the original speech signal and the smoothed formant envelope. Finally, we obtain a residual signal from which the vocal tract component has been removed. The performance was compared with the LPC, cepstrum, and ACF methods. With this algorithm, the accuracy of pitch detection improved and the gross error rate was reduced in voiced speech regions and in phoneme transition regions.
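The flattening idea can be illustrated with a minimal sketch, assuming the envelope is a piecewise-linear curve through the local maxima of a log-magnitude spectrum and that flattening is a plain subtraction of that envelope. The paper's actual peak-fitting and smoothing rules are not given in the abstract, so the function names and the handling of the spectrum edges are hypothetical.

```python
def local_peaks(values):
    """Indices of local maxima (a plateau counts once, at its left edge)."""
    return [
        i for i in range(1, len(values) - 1)
        if values[i - 1] < values[i] >= values[i + 1]
    ]


def peak_envelope(values):
    """Piecewise-linear envelope through the local maxima of `values`."""
    peaks = local_peaks(values)
    if not peaks:
        return list(values)
    n = len(values)
    env = [0.0] * n
    # Assumption: hold the first and last peak level flat out to the edges.
    for i in range(0, peaks[0] + 1):
        env[i] = values[peaks[0]]
    for i in range(peaks[-1], n):
        env[i] = values[peaks[-1]]
    # Linear interpolation between each pair of adjacent peaks.
    for a, b in zip(peaks, peaks[1:]):
        for i in range(a, b + 1):
            t = (i - a) / (b - a)
            env[i] = (1 - t) * values[a] + t * values[b]
    return env


def flatten_spectrum(log_mag):
    """Subtract the peak envelope so that every peak sits at level 0."""
    env = peak_envelope(log_mag)
    return [v - e for v, e in zip(log_mag, env)]


# Toy log spectrum with three peaks of decreasing height.
toy = [0.0, 6.0, 1.0, 4.0, 0.5, 2.0, 0.0]
flat = flatten_spectrum(toy)
```

After subtraction the three peaks have equal height, which is the property that removes the envelope's influence on later pitch estimation.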


Region-of-Interest Detection using the Energy from Vocal Fold Image

  • 김엄준;성미영
    • Journal of KISS: Software and Applications / Vol. 27, No. 8 / pp. 804-814 / 2000
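  • This paper introduces an efficient method for extracting the region of interest (ROI) from images in a videostrobokymography system. Videostrobokymography is a medical imaging system that assesses irregular vocal fold movement and automatically computes diagnostic parameters. The ROI is extracted in three steps. First, the minimum energy is used to locate the central part of the ROI. After feature points within the ROI are extracted, the second step selects edges based on the mean value along the horizontal axis of a single line region. In the final step, this feature value is used as the threshold of a merge algorithm to extract the ROI. When the proposed algorithm was applied to the vocal fold images of 19 subjects, the ROI could be extracted from 95% of the images. Because the proposed method requires little computation, it can process 200 × 280 images at about 40 frames per second or more, which makes it very efficient.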
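The first two steps can be sketched roughly as below. The abstract does not define the energy measure or the edge rule, so this sketch assumes that energy is the sum of squared intensities per column and that edges bound the run of below-mean pixels around the center on one row; the merge step is omitted. Function names are hypothetical.

```python
def min_energy_column(image):
    """Index of the column whose summed squared intensity is smallest."""
    n_cols = len(image[0])
    energy = [sum(row[c] ** 2 for row in image) for c in range(n_cols)]
    return min(range(n_cols), key=energy.__getitem__)


def dark_run_edges(row, center):
    """Left and right ends of the below-mean run of `row` around `center`."""
    mean = sum(row) / len(row)
    left = right = center
    while left > 0 and row[left - 1] < mean:
        left -= 1
    while right < len(row) - 1 and row[right + 1] < mean:
        right += 1
    return left, right


# Toy grayscale image: a dark vertical band (the glottal gap) in the middle.
image = [
    [200, 190, 40, 20, 35, 180, 210],
    [200, 190, 40, 20, 35, 180, 210],
    [200, 190, 40, 20, 35, 180, 210],
]
center = min_energy_column(image)
edges = dark_run_edges(image[1], center)
```

Both steps are single passes over the pixels, which is in line with the low computational cost the abstract emphasizes.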


Usefulness of the Vibration Pick-Up in Detection of Pitch for Synchronization of Laryngeal Stroboscopy

  • 이진춘;이병주;왕수건;노정훈;권순복;조철우
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / Vol. 18, No. 1 / pp. 26-32 / 2007
  • Objective and Background: The laryngeal stroboscope is a useful instrument for evaluating vocal cord vibration and for the early detection of mucosal lesions, including invasive cancer of the vocal cord. Recently, Lee et al. (2006) developed a portable stroboscope that uses the voice as the synchronization signal. However, its ability to synchronize the flashes was frequently impaired, even in normal female subjects. The authors investigated various methods, including a vibration pick-up, microphone, laryngeal microphone, and contact microphone, to develop a simple method as accurate as the electroglottograph (EGG) signal. The purpose of this study was to assess whether the vibration pick-up is usable and consistent with the EGG signal. Subjects and Methods: The authors compared the EGG signal with signals from a noncontact method (voice) and from contact methods, including the vibration pick-up, laryngeal microphone, and contact microphone, in twenty normal adults (10 male and 10 female). The number of peaks in one cycle was compared with the number of peaks in the EGG, and the percentage phase difference of the peak was compared with the EGG. The authors also investigated which vibration pick-up site was most effective for synchronizing the strobe flashes. Three sites were analysed: the anterior neck below the cricoid cartilage, the thyroid ala, and the suprahyoid region. Results: Among the various methods for synchronizing the strobe flashes, the vibration pick-up was the most effective in peak detection, and the anterior neck below the cricoid cartilage was the most suitable site for the vibration pick-up. Conclusion: The authors suggest that the vibration pick-up is the most suitable and effective method for synchronizing the strobe flashes.
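The two comparison measures in the Methods (peaks per cycle and percentage phase difference relative to the EGG) can be sketched as follows. The abstract does not say how peaks or cycles were defined, so this assumes that a cycle runs from one EGG peak to the next and that phase difference is the offset of the first test-signal peak divided by the cycle length. The function names and the threshold-based peak picker are hypothetical.

```python
def find_peaks(signal, threshold):
    """Indices of local maxima above `threshold` (plateaus count once)."""
    return [
        i for i in range(1, len(signal) - 1)
        if signal[i] > threshold and signal[i - 1] < signal[i] >= signal[i + 1]
    ]


def compare_to_reference(ref_peaks, test_peaks):
    """Per reference cycle: (test peak count, phase difference in percent).

    A cycle runs from one reference peak to the next. The phase difference
    is the offset of the first test peak in the cycle divided by the cycle
    length; it is None when the cycle contains no test peak.
    """
    results = []
    for start, end in zip(ref_peaks, ref_peaks[1:]):
        inside = [p for p in test_peaks if start <= p < end]
        phase = 100.0 * (inside[0] - start) / (end - start) if inside else None
        results.append((len(inside), phase))
    return results


# Hypothetical peak positions (sample indices) for EGG and a pick-up signal.
egg_peaks = [10, 110, 210]
pickup_peaks = [20, 120, 220]
comparison = compare_to_reference(egg_peaks, pickup_peaks)
```

A sensor that yields exactly one peak per EGG cycle with a small, stable phase difference would be the easiest to use as a flash trigger.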
