Search | Korea Science

Dimensionality Reduction Based Frequency Domain Audio Signal Compression Method (차원 축소를 이용한 주파수 영역 오디오 신호 압축)

Kim, Min-Je;Beack, Seung-Kwon;Lee, Tae-Jin;Jang, Dae-Young;Kang, Kyeong-Ok
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2008.02a
- /
- pp.179-182
- /
- 2008
본 논문은 오디오 부호화 및 복호화 과정에서, 주파수 영역에서 표현된 오디오 신호를 차원 축소 방법으로 압축하여 표현함으로서 오디오 부호화 효율을 증대시키고자 하는 방식에 관한 것이다. 차원 축소는 행렬을 특정한 조건을 바탕으로 두 개의 행렬의 곱으로 표현하는 방식으로, 특정 행렬로 표현된 데이터를 좀 더 작은 데이터량으로 표현하는 것뿐만 아니라 이 과정에서 데이터에 내재되어 있는 추상적인 정보까지도 함축적으로 얻어낼 수 있기 때문에, 일반적으로 데이터의 압축에 좋은 성능을 보인다. 주파수 영역으로 변환된 신호는 일반적으로 (주파수 밴드의 개수) $\times$ (전체 프레임의 개수)인 행렬로 볼 수 있으며, 이 전체 행렬을 입력으로 간주하고, 차원 축소를 수행하여 신호의 압축 효과를 얻을 수 있다. 그러나 이 경우, 행렬 전체를 입력 신호로 보아야 하기 때문에 실시간 부호화가 불가능하며, 신호 전체 길이만큼의 부호화 지연이 발생한다. 이를 해소하기 위해, 본 논문에서는 특정 개수만큼의 프레임을 묶어서 여러 번의 차원 축소를 순차적으로 수행함으로써 부호화 지연을 최소화하는 방식을 제안한다.
PDF

Analysis and Synthesis of Audio Signals using a Sinusoidal Model with Psychoacoustic Criteria (정현파 모델을 이용한 오디오 신호의 심리음향적 분석 및 합성)

남승현;강경옥;홍진우
- The Journal of the Acoustical Society of Korea
- /
- v.18 no.2
- /
- pp.77-82
- /
- 1999
A sinusoidal model has been widely used in the analysis and synthesis of speech and audio signals, and becomes one of the efficient candidates for high quality low bit rate audio coders. One of the crucial steps in the analysis and synthesis using a sinusoidal model is the detection of tonal components. This paper proposes an efficient method for the analysis and synthesis of audio signals using a sinusoidal model, which uses psychoacoustic criteria such as masking effect, masking index, and JNDf(Just Noticeable Difference in Frequency). Simulation results show that the proposed method reduces the number of sinusoids significantly without degrading the quality of the synthesized audio signals.
PDF

증강현실 오디오를 위한 마이크로폰 어레이 설계 및 오디오 객체 획득 기술

Gang, Jin-A;Jeon, Chan-Jun;Jeong, Seok-Hui;Kim, Hong-Guk
- Broadcasting and Media Magazine
- /
- v.19 no.1
- /
- pp.56-64
- /
- 2014
본 고에서는 증강현실 오디오 기술에 대한 연구동향을 소개한다. 특히 증강현실 오디오 실현을 위한 핵심기술이라고 할 수 있는 오디오 객체를 획득하기 위한 단일지향성 및 쌍지향성 마이크로폰을 이용한 마이크로폰 어레이 설계 기술과 오디오 채널간의 신호 강도차 분석을 통한 오디오 객체 획득 기술에 대해 설명한다.
PDF KSCI

An Implementation of Sound Enhanced MPEG-1 Audio Decoder on Embedded OS Platform (음질향상 알고리즘을 내장한 MPEG-1 오디오 디코더의 Embedded OS 플랫폼에의 구현)

Hong, Sung-Min;Park, Kyu-Sik
- Journal of Korea Multimedia Society
- /
- v.10 no.8
- /
- pp.958-966
- /
- 2007
In this paper, we implement a sound-enhanced MPEG-1 audio decoder on embedded OS Platform. Low bit rate lossy audio codecs such as MP3, OGG, and AAC for mitigating the problems in storage space and network bandwidth suffer a major common problem such as a loss of high frequency fidelity of audio signal. This high frequency loss will reproduce only a band-limited low-frequency part of audio in the standard CD-quality audio. In order to overcome this problem, we embedded a sound enhancement algorithm into the MPEG-1 audio decoder and then the algorithms optimized according to the characteristic of the MPEG-1 audio layer I, II, III were implemented on an embedded OS platform. From the experimental results with spectrum analysis and listening test, we confirm the superiority of the proposed system compared to the standard MPEG-1 audio decoder.
PDF

Unified coding scheme of speech and music (음악 및 음성 신호의 융합 압축 기술)

O, Eun-Mi
- Broadcasting and Media Magazine
- /
- v.16 no.4
- /
- pp.59-71
- /
- 2011
오디오와 음성 압축 기술적 근간은 서로 다르지만, 최근의 모바일 멀티미디어 기기 시장의 컨버전스 현상에 따라 압축하고자 하는 신호가 혼용되고 있으며, 비슷한 목표 전송률과 음질로 수렴하고 있다. 현재는 동일 기기에서 서로 다른 압축 기술을 적용하고 있으나, 음성과 음악이 동시에 서비스 되는 멀티미디어 기기에서는 단일 압축 방식으로 처리하고자 하는 이슈가 부각되고 있다. 특히, 스마트 폰 및 음악 콘텐츠 포탈 서비스의 대중화를 고려할 때, 음성 및 음악 신호 모두를 효율적으로 압축하는 음악 및 음성 신호의 융합 압축 기술이 더욱 필요해 보인다. 본 고에서는 MPEG 오디오 그룹에서 가장 최근 진행한 Unified Speech and Audio Coding(USAC)의 탄생 배경 및 표준화 현황을 소개한다. USAC는 64kbps 이하에서 기술적으로 최고 성능을 지닌 AMR-WB+ 및 HE-AAC v2보다도 우월한 음질을 보이며, 높은 비트율에서도 동등한 음질을 보장한다. 이런 우수한 음질에 기여한 USAC의 스위칭 구조와 더불어 기술적으로 향상된 주요 모듈인 파라미터 기반 스테레오 및 고주파 압축, 그리고 엔트로피 코딩 방식에 대해서 살펴 본다. 향후, 다양한 오디오 신호를 효율적으로 압축하는 USAC는 디지털 라디오, 모바일 TV, 그리고 오디오 북과 같은 사용자 시나리오에서 사용될 확률이 높아 보인다. 또한, USAC는 배경 잡음이나 배경 음악이 있는 경우에도 성능이 우수하기 때문에 YouTube 및 podcast 등과 같이 사용자가 콘텐츠를 생성할 때도 유용하게 사용 될 수 있다.
PDF KSCI

Detecting Prominent Content in Unstructured Audio using Intensity-based Attack/release Patterns (발생/소멸 패턴을 이용한 비정형 혼합 오디오의 주성분 검출)

Kim, Samuel
- Journal of the Institute of Electronics and Information Engineers
- /
- v.50 no.12
- /
- pp.224-231
- /
- 2013
Defining the concept of prominent audio content as the most informative audio content from the users' perspective within a given unstructured audio segment, we propose a simple but robust intensity-based attack/release pattern features to detect the prominent audio content. We also propose a web-based annotation procedure to retrieve users' subjective perception and annotated 18 hours of video clips across various genres, such as cartoon, movie, news, etc. The experiments with a linear classification method whose models are trained for speech, music, and sound effect demonstrate promising - but varying across the genres of programs - results (e.g., 86.7% weighted accuracy for speech-oriented talk shows and 49.3% weighted accuracy for {action movies}).
https://doi.org/10.5573/ieek.2013.50.12.224 인용 PDF KSCI

A Study on Real-time Discrimination of FM Radio Broadcast Speech/Music (실시간 FM 방송중 음악/음성 검출에 관한 연구)

황진만;강동욱;김기두
- Proceedings of the IEEK Conference
- /
- 2003.07e
- /
- pp.2136-2139
- /
- 2003
본 논문은 FM 라디오 방송중의 오디오 신호를 블록단위로 음악 및 음성을 검출하는 알고리즘에 대한 것으로, 이를 기반으로 방송중의 노래(가요, 팝, 클래식‥‥)만을 자동으로 인식하여 녹음하는 알고리즘을 개발한다. 본 논문에서는 기존에 제안되었던 것[1-4]과 같이 단지 음악과 음성을 구분함과 동시에 음악구간의 논리적 조합으로 이루어진 노래를 자동으로 인식하여 녹음하는 것을 알고리즘의 최종 목표로 한다. 알고리즘의 접근 역시 기존의 음소단위의 모델링을 거치는 GMM 기반의 접근이 아니기 때문에 모델링에 대한 훈련과정이 필요 없고, 시간영역에서의 오디오신호가 가지고 있는 직관적인 특징을 분석함으로써 비교적 적은 연산으로 실시간 구현이 가능하다.
PDF

A New Robust Acoustic Crosstalk Cancellation Method with Sum and Difference Filter in 3D Audio System (3차원 오디오 시스템에서 합과 차 여파기를 이용한 새로운 광대억 간섭신호 제거 방법)

김래훈;임준석;성굉모
- The Journal of the Acoustical Society of Korea
- /
- v.20 no.4
- /
- pp.17-21
- /
- 2001
There are some methods to enhance the ‘sweet spot’in loudspeaker-based 3D audio systems. Most of them can be only applied to narrow frequency band inherently. In this paper, we introduce the more robust 3D sound reproduction system which has far wider robust bandwidth. The system applies a sum and difference filter to the conventional three loudspeaker-based one.
PDF

Objective measurement of spatial auditory quality for multi channel audio codecs (멀티채널 오디오 압축 코덱 음질의 객관적인 측정방법)

Choi, In-Yong;Chon, Sang-Bae;Sung, Koeng-Mo
- Proceedings of the IEEK Conference
- /
- 2005.11a
- /
- pp.431-434
- /
- 2005
본 논문은 멀티채널 오디오 압축 코덱의 음질을 객관적으로 평가할 수 있는 시스템 및 파라메터에 관한 것으로, 멀티채널 오디오 신호로부터 양이입력신호(ear input signals)를 만들어내는 전처리 과정과 이 과정을 통해 출력되는 양이입력신호로부터 양이레벨차이왜곡(inter-aural level difference distortion)을 구하는 과정 및 양이레벨차이왜곡이 청취평가 결과와 일관적인 상관관계를 보임을 서술한다. 본 연구에 의하면 멀티채널 오디오 압축 코덱의 음질을 선별된 청취자에 의한 주관적인 평가와 통계처리 없이 객관적인 측정만을 통해 평가하는 것이 가능하며, 이를 사용하면 멀티채널 오디오 압축 코덱 개발자들이 시간, 경제적 부담 없이 자신이 개발한 압축 코덱의 음질을 간단하게 평가해볼 수 있다.
PDF

SW Signal Analysis of DRM Digital Radio Receiver (DRM 디지털라디오 수신기의 SW 신호분석)

Kang, M.G.;Sohn, S.I.;Lee, K.T.;You, Y.H.;Lee, M.S.
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2012.05a
- /
- pp.765-768
- /
- 2012
본 논문에서는 USB형 DRM(Digital Radio Mondiale) 디지털라디오 수신기의 수신신호 성능 검사를 위한 DRM 수신 모니터링의 응용 S/W의 설계를 통한 수신 시스템의 신호를 분석함으로서 DRM30/DRM+신호를 수신하여, RF 상태 분석, 복조와 관련된 파라메터 확인, 오디오 서비스 정보 확인 및 오디오 재생, 텍스트 활용위한 데이터 표현, GPS 위치 정보 수신이 가능한 통합모듈 응용 S/W를 제안하고자 한다.
PDF

Search Result 438, Processing Time 0.022 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)