통합 검색 | Korea Science

음악과 음성 판별을 위한 웨이브렛 영역에서의 특징 파라미터 (Feature Parameter Extraction and Analysis in the Wavelet Domain for Discrimination of Music and Speech)

김정민;배건성
- 대한음성학회지:말소리
- /
- 제61호
- /
- pp.63-74
- /
- 2007
Discrimination of music and speech from the multimedia signal is an important task in audio coding and broadcast monitoring systems. This paper deals with the problem of feature parameter extraction for discrimination of music and speech. The wavelet transform is a multi-resolution analysis method that is useful for analysis of temporal and spectral properties of non-stationary signals such as speech and audio signals. We propose new feature parameters extracted from the wavelet transformed signal for discrimination of music and speech. First, wavelet coefficients are obtained on the frame-by-frame basis. The analysis frame size is set to 20 ms. A parameter $E_{sum}$ is then defined by adding the difference of magnitude between adjacent wavelet coefficients in each scale. The maximum and minimum values of $E_{sum}$ for period of 2 seconds, which corresponds to the discrimination duration, are used as feature parameters for discrimination of music and speech. To evaluate the performance of the proposed feature parameters for music and speech discrimination, the accuracy of music and speech discrimination is measured for various types of music and speech signals. In the experiment every 2-second data is discriminated as music or speech, and about 93% of music and speech segments have been successfully detected.
PDF

오디오 인덱싱을 위한 음성/음악 분류 특징 비교 (A Comparison of Speech/Music Discrimination Features for Audio Indexing)

이경록;서봉수;김진영
- 한국음향학회지
- /
- 제20권2호
- /
- pp.10-15
- /
- 2001
본 논문에서 우리는 음향신호에서 음성과 음악을 분류하는 음성/음악 분류실험에 사용되는 특징들간의 상호조합을 비교하였다. 음향신호는 3가지 (음성, 음악, 음성+음악)와 2가지 (음성, 음악)로 분류하였다. 실험은 멜캡스트럼, 에너지, 영교차를 특징으로 사용하였고, 음성/음악 분류성능이 가장 좋은 특징간 상호조합을 모색하였다. 분류 알고리즘으로는 Gaussian Mixture Model (GMM)을 이용하였으며, GMM에 의한 데이터 모델링 전에 각기 다른 특징들을 하나의 특징공간에서 결합하였다. 실험결과 3가지 분류기준 적용시에는 멜캡스트럼, 영교차 조합이 가장 좋은 결과 (음성: 95.1％, 음악: 61.9％, 음성+음악: 55.5％)를 보였고, 2가지 분류기준 적용시에는 멜캡스트럼, 에너지 조합과 멜캡스트럼, 에너지, 영교차 조합이 가장 좋은 결과 (음성: 98.9％, 음악: 100％)를 보였다.
PDF

The Audio Signal Classification System Using Contents Based Analysis

Lee, Kwang-Seok;Kim, Young-Sub;Han, Hag-Yong;Hur, Kang-In
- Journal of information and communication convergence engineering
- /
- 제5권3호
- /
- pp.245-248
- /
- 2007
In this paper, we research the content-based analysis and classification according to the composition of the feature parameter data base for the audio data to implement the audio data index and searching system. Audio data is classified to the primitive various auditory types. We described the analysis and feature extraction method for the feature parameters available to the audio data classification. And we compose the feature parameters data base in the index group unit, then compare and analyze the audio data centering the including level around and index criterion into the audio categories. Based on this result, we compose feature vectors of audio data according to the classification categories, and simulate to classify using discrimination function.
PDF KSCI

오디오 딥러닝을 활용한 Al, Steel 소재의 절삭 깊이에 따른 오디오 판별 (A Study on Acoustic Signal Characterization for Al and Steel Machining by Audio Deep Learning)

김태원;이영민;최해운
- 한국기계가공학회지
- /
- 제20권7호
- /
- pp.72-79
- /
- 2021
This study reports on the experiment of using deep learning algorithms to determine the machining process of aluminium and steel. A face cutting milling tool was used for machining and the cutting speed was set between 3 and 4 mm/s. Both materials were machined with a depth to 0.5mm and 1.0mm. To demonstrate the developed deep learning algorithm, simulation experiments were performed using the VGGish algorithm in MATLAB toobox. Downcutting was used to cut aluminum and steel as a machining process for high quality and precise learning. As a result of learning algorithms using audio data, 61%-99% accuracy was obtained in four categories: Al 0.5mm, Al 1.0mm, Steel 0.5mm and Steel 1.0mm. Audio discrimination using deep learning is derived as a probabilistic result.
https://doi.org/10.14775/ksmpe.2021.20.07.072 인용 PDF KSCI

청각을 이용한 시각 재현 시스템의 개발 (Development of Processing System for Audio-vision System Based on Auditory Input)

김정훈;김덕규;원철호;이종민;이희중;이나희;윤수영
- 대한의용생체공학회:의공학회지
- /
- 제33권1호
- /
- pp.25-31
- /
- 2012
The audio vision system was developed for visually impaired people and usability was verified. In this study ten normal volunteers were included in the subject group and their mean age was 28.8 years old. Male and female ratio was 7:3. The usability of audio vision system was verified by as follows. First, volunteers learned distance of obstacles and up-down discrimination. After learning of audio vision system, indoor and outdoor walking examination was performed. The test was scored by ability of up-down and lateral discrimination, distance recognition and walking without collision. Each parameter was scored by 1 to 5. The results were 93.5 +- SD(ranges, 86 to 100) of 100. In this study, we could convert visual information to auditory information by audio-vision system and verified possibility of applying to daily life for visually impaired people.
https://doi.org/10.9718/JBER.2012.33.1.025 인용 PDF KSCI

방송 오디오 신호로부터 음악 신호 검출에 관한 연구 (A Study of Automatic Detection of Music Signal from Broadcasting Audio Signal)

윤원중;박규식
- 대한전자공학회논문지SP
- /
- 제47권5호
- /
- pp.81-88
- /
- 2010
본 논문에서는 실제 방송 환경에 적용 가능한 방송용 음원 모니터링 시스템을 구축하기 위한 사전연구로 방송 오디오 신호로부터 음악신호 구간을 자동으로 검출할 수 있는 시스템을 제안하였다. 음악구간과 비음악구간의 구분을 위한 특징으로는 사람의 음성 발화 특성을 반영하여 에너지 표준편차와 log 에너지 표준편차 그리고 log 에너지 평균 등 3개의 간단한 시간영역 특징들을 사용하였으며 최종 음악신호 구간 판별은 각 에너지 한계값(threshold)을 이용한 Rule-base 분류를 기반으로 하였다. 실제 FM 라디오 방송 신호를 24시간 녹음하여 진행한 모의실험에서 음악구간 인식률은 96%, 비-음악구간 인식률은 87%를 나타내어 방송용 음원 모니터링 시스템의 전처리기로 손색이 없음을 확인할 수 있었다.
PDF KSCI

실험에 의한 음성·음악 분류 특징의 비교 분석 (Comparison & Analysis of Speech/Music Discrimination Features through Experiments)

이경록;류시우;곽재영
- 한국콘텐츠학회:학술대회논문집
- /
- 한국콘텐츠학회 2004년도 추계 종합학술대회 논문집
- /
- pp.308-313
- /
- 2004
본 논문에서는 각 특징 파라미터 조합의 음성/음악 분류 성능을 비교 분석하였다. 음향신호는 3가지(음성, 음악, 음성+음악)로 분류하였다. 본 실험에서는 분류 특징으로 멜캡스트럼, 에너지, 영교차 3가지 형태가 사용되었다. 음성/음악 분류 성능이 가장 좋은 특징간의 상호 조합을 비교 분석하였다. 실험결과 멜캡스트럼, 영교차 조합이 가장 좋은 결과(음성: 95.1%, 음악: 61.9%, 음성+음악: 55.5%)를 보인다는 것을 확인할 수 있었다.
PDF

식별함수를 이용한 오디오신호의 내용기반 분류 (Content Based Classification of Audio Signal using Discriminant Function)

김영섭;이광석;고시영;허강인
- 한국정보통신학회:학술대회논문집
- /
- 한국해양정보통신학회 2007년도 춘계종합학술대회
- /
- pp.201-204
- /
- 2007
본 논문은 오디오 색인 검색 시스템을 구현하기 위하여 오디오 신호에 대한 특징 파라미터 풀(pool)을 구성하고, 구성되어진 특징 파라미터 풀을 이용한 오디오 데이터의 내용분석 및 분류에 관한 연구이다. 오디오 데이터는 기본적으로 다양한 형태의 오디오 신호로서 분류되어진다. 본 논문에서는 오디오 데이터의 분류에 이용 가능한 특징 파라미터를 분석하고 추출하는 방법에 대하여 논한다. 그리고 특징 파라미터 풀을 색인 그룹 단위로 구성하여 오디오 카테고리에 대한, 설정된 특징들의 포함 정도와 색인기준을 오디오 데이터의 내용을 중심으로 비교, 분석한다. 그리고 마지막으로 위의 결과를 바탕으로 분류카테고리 별로 오디오 데이터의 특징 벡터를 구성한 뒤 이를 이용하여 식별함수 분류기를 통한 분류를 실험한다.
PDF

FM 라디오 환경에서의 실시간 음악 판별 시스템 구현 (Implementation of Music Signals Discrimination System for FM Broadcasting)

강현우
- 정보처리학회논문지B
- /
- 제16B권2호
- /
- pp.151-156
- /
- 2009
본 연구에서는 GMM 기반의 음성/음악 판별 방법을 응용하여 FM 라디오 방송에서 순수한 음악 구간만을 판별하는 시스템을 구현하였다. 본 시스템에서는 음성, 음악, 광고 음악, 기타 여러 가지 사운드가 혼합되어 있는 오디오 방송 프로그램에서 순수한 음악만을 판별하여 자동으로 저장하고자 한다. 음악의 시작 부분과 끝 부분을 보다 정교하게 검출하고자 순수한 음악으로 판별된 구간의 시작 부분과 끝 부분에 대해 후처리 과정을 추가하였다. PC 환경에서 FM 라디오 방송을 이용하여 구현된 시스템을 실시간으로 테스트한 결과 우수한 성능을 보임을 확인하였다. 또한 SoC 구현을 고려하여 고정소수점 연산을 수행한 결과 3MIPS 이하의 적은 연산량으로 부동소수점 연산일 때와 동일한 결과를 얻을수 있었다.
https://doi.org/10.3745/KIPSTB.2009.16-B.2.151 인용 PDF KSCI

사운드 분류기를 이용한 영상검색에 관한 연구 (A Study on Image Retrieval Using Sound Classifier)

김승한;이명순;노승용
- 대한전기학회:학술대회논문집
- /
- 대한전기학회 2006년 학술대회 논문집 정보 및 제어부문
- /
- pp.419-421
- /
- 2006
The importance of automatic discrimination image data has evolved as a research topic over recent years. We have used forward neural network as a classifier using sound data features within image data, our initial tests have shown encouraging results that indicate the viability of our approach.
PDF

검색결과 23건 처리시간 0.029초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)