Search | Korea Science

Audio Quality Enhancement using Perceptual Property at a Low-bitrate Compression (지각적 특성을 이용한 저 비트오율 압축 오디오 음질개선)

Cha Hyuk-Geun;Chae Byoung-Koog;Cha Hyung-Tai
- Proceedings of the Acoustical Society of Korea Conference / autumn / pp.275-278 / 2004
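This paper proposes an algorithm that uses human perceptual properties to improve audio quality degraded by the signal distortion introduced in low-bitrate compression. The high-frequency content lost during low-bitrate compression is restored without side information, using only information from the band that survived compression. Tonal and non-tonal components are detected in the preserved band, and the corresponding harmonic components are scaled by the auditory excitation energy and added to the lost band as a new signal. The improvement was confirmed by measuring the signal-to-noise ratio of the original signal, the signal distorted by low-bitrate compression, and the signal enhanced by the proposed algorithm, and by a listening test.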
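The signal-to-noise ratio comparison mentioned in this abstract can be sketched as follows. This is a minimal illustration only, assuming time-aligned sample sequences of equal length; the function name `snr_db` and the toy signals are hypothetical and are not taken from the paper.

```python
import math

def snr_db(reference, degraded):
    """Return the SNR in dB of `degraded` measured against `reference`.

    Assumes both sequences are time-aligned and have the same length.
    """
    if len(reference) != len(degraded):
        raise ValueError("signals must have the same length")
    signal_power = sum(x * x for x in reference)
    noise_power = sum((x - y) ** 2 for x, y in zip(reference, degraded))
    if noise_power == 0:
        return math.inf  # identical signals: no distortion at all
    return 10.0 * math.log10(signal_power / noise_power)

# Toy data (hypothetical): a sine wave and a copy with a constant error.
original = [math.sin(2 * math.pi * 5 * n / 100) for n in range(100)]
distorted = [x + 0.1 for x in original]
print(round(snr_db(original, distorted), 2))
```

The same function could be applied twice, once to the compressed signal and once to the enhanced signal, to compare the two against the original.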

New Separated Sound Source Synthesis based on ADRess Algorithm (ADRess 알고리즘 기반 새로운 분리음원 합성 기법)

Jeong, Youngho;Jang, Daeyoung;Lee, Taejin
- Proceedings of the Korean Society of Broadcast Engineers Conference / 2015.11a / pp.56-59 / 2015
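This paper proposes a new separated-source synthesis technique based on the ADRess algorithm, which separates sound sources from a stereo audio signal. The technique generates the separated source using the signal intensity ratio for the estimated source azimuth. Using the inter-channel intensity difference (IID) of the input stereo channels, a frequency-azimuth plane is built for each analysis frame according to an improved signal intensity ratio function. The separated source is then synthesized by applying a probability density function, expressed in terms of the intensity ratio corresponding to the estimated azimuth, to one main input signal chosen from the left and right channels. To verify its performance, the technique was measured with the test sources and objective evaluation metrics provided by SASSEC, and it was found to synthesize separated sources of better quality than the method presented with the original ADRess algorithm.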
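The idea of a frequency-azimuth plane can be sketched as below. This follows a commonly described form of the original ADRess approach, in which one channel is scaled by a range of gains and subtracted from the other so that a source produces a null at its own intensity ratio. The improved intensity ratio function and the probability density weighting proposed in this paper are not given in the abstract and are not reproduced here; the function names, the resolution `beta`, and the toy spectra are assumptions.

```python
def frequency_azimuth_plane(left_bins, right_bins, beta=10):
    """Build plane[k][i] = |L(k) - g(i) * R(k)| with g(i) = i / beta.

    left_bins and right_bins are complex spectra of one analysis frame.
    """
    return [
        [abs(l - (i / beta) * r) for i in range(beta + 1)]
        for l, r in zip(left_bins, right_bins)
    ]

def estimate_gain_index(plane_row):
    """Return the index of the null (minimum) along the azimuth axis."""
    return min(range(len(plane_row)), key=plane_row.__getitem__)

# Hypothetical single source panned so that L(k) = 0.3 * R(k) in every bin.
right = [1 + 2j, -0.5 + 0.25j, 3 - 1j]
left = [0.3 * r for r in right]

plane = frequency_azimuth_plane(left, right)
indices = [estimate_gain_index(row) for row in plane]
print(indices)  # each bin should point at the gain index for 0.3
```

With several sources present, different frequency bins would show nulls at different gain indices, which is what allows the sources to be grouped by azimuth.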

Content-based Music Information Retrieval using Pitch Histogram (Pitch 히스토그램을 이용한 내용기반 음악 정보 검색)

박만수;박철의;김회린;강경옥
- Journal of Broadcast Engineering / v.9 no.1 / pp.2-7 / 2004
In this paper, we propose a content-based music information retrieval technique using several MPEG-7 low-level descriptors. In particular, pitch information and timbral features can be applied to music genre classification, music retrieval, or QBH (Query By Humming) because they can model the stochastic pattern or timbral information of a music signal. In this work, we restrict the music domain to the O.S.T. of movies or soap operas for application to a broadcasting system. That is, the user can retrieve information about unknown music using only an audio clip of a few seconds extracted from video content when background music catches the user's ear. We propose an audio feature set organized from MPEG-7 descriptors and a distance function based on vector distance or ratio computation. We observed that the feature set organized from pitch information is superior to the timbral spectral feature set, and that IFCR (Intra-Feature Component Ratio) is better than ED (Euclidean Distance) as a vector distance function. To evaluate music recognition, k-NN is used as a classifier.
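The combination of a vector distance and a k-NN classifier named in this abstract can be illustrated with a short sketch. Only the Euclidean distance (ED) is shown, because the abstract does not define IFCR. The three-bin feature vectors and track labels are hypothetical placeholders, not MPEG-7 descriptor values from the paper.

```python
import math
from collections import Counter

def euclidean_distance(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def knn_classify(query, training_set, k=3):
    """Return the majority label among the k nearest training vectors.

    training_set is a list of (feature_vector, label) pairs.
    """
    neighbours = sorted(
        training_set, key=lambda item: euclidean_distance(query, item[0])
    )[:k]
    votes = Counter(label for _, label in neighbours)
    return votes.most_common(1)[0][0]

# Hypothetical 3-bin pitch-histogram features for two known tracks.
training = [
    ([0.7, 0.2, 0.1], "track_a"),
    ([0.6, 0.3, 0.1], "track_a"),
    ([0.1, 0.2, 0.7], "track_b"),
    ([0.2, 0.1, 0.7], "track_b"),
]
result = knn_classify([0.65, 0.25, 0.10], training, k=3)
print(result)
```

Swapping in a different distance function only requires changing the `key` used for sorting, which is how ED and a ratio-based measure could be compared on the same feature set.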

A Perceptual Audio Coder Based on Temporal-Spectral Structure (시간-주파수 구조에 근거한 지각적 오디오 부호화기)

김기수;서호선;이준용;윤대희
- Journal of Broadcast Engineering / v.1 no.1 / pp.67-73 / 1996
In general, high-quality audio coding (HQAC) combines conventional data compression techniques with models of human perception. The primary auditory characteristic applied in HQAC is the masking effect in the spectral domain, so spectral techniques such as subband coding or transform coding are widely used [1][2]. However, no effort has yet been made to apply the temporal masking effect and temporal redundancy removal in HQAC. The audio data compression method proposed in this paper eliminates statistical and perceptual redundancies in both the temporal and spectral domains. The transformed audio signal is divided into packets of 6 frames. A packet contains 1536 samples ($256 \times 6$), and the redundancies in a packet reside in both the temporal and spectral domains. Both redundancies are eliminated at the same time in each packet. The psychoacoustic model has been improved to give finer results by taking into account temporal masking as well as fine spectral masking. For quantization, each packet is divided into subblocks designed to be analogous to the nonlinear critical bands and to reflect the temporal auditory characteristics. Consequently, high quality of the reconstructed audio is preserved at low bit rates.
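The packet layout described in this abstract (6 frames of 256 samples, 1536 samples per packet) can be sketched as a simple grouping step. The frame and packet sizes come from the abstract; the function name, the handling of leftover samples, and the placeholder data are assumptions.

```python
FRAME_SIZE = 256        # samples per frame, as stated in the abstract
FRAMES_PER_PACKET = 6   # frames per packet, as stated in the abstract
PACKET_SIZE = FRAME_SIZE * FRAMES_PER_PACKET  # 1536 samples

def split_into_packets(samples):
    """Group samples into complete packets of 6 frames x 256 samples.

    Trailing samples that do not fill a whole packet are dropped,
    which is a simplifying assumption of this sketch.
    """
    packets = []
    for start in range(0, len(samples) - PACKET_SIZE + 1, PACKET_SIZE):
        packet = samples[start:start + PACKET_SIZE]
        frames = [
            packet[i:i + FRAME_SIZE] for i in range(0, PACKET_SIZE, FRAME_SIZE)
        ]
        packets.append(frames)
    return packets

# Placeholder data standing in for transformed audio samples.
samples = list(range(4000))
packets = split_into_packets(samples)
print(len(packets), len(packets[0]), len(packets[0][0]))
```

Keeping each packet as a list of frames gives a two-dimensional time-frequency block, which is the structure in which both temporal and spectral redundancy could be examined together.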

Noise filtering method based on voice frequency correlation to increase STT efficiency (STT 효율 증대를 위한 음성 주파수 correlation 기반 노이즈 필터링 방안)

Lim, Jiwon;Hwang, Yonghae;Kim, Kyuheon
- Proceedings of the Korean Society of Broadcast Engineers Conference / fall / pp.176-179 / 2021
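Speech recognition is now used in many areas, including AI assistants, automated telephone response systems, and navigation, and it requires Speech-To-Text (STT) technology, which converts speech signals into text so that human speech can be delivered to a device. Most early STT systems were based on the Hidden Markov Model (HMM), a probabilistic and statistical approach. With advances in deep learning, combining HMM with Recurrent Neural Network (RNN) and Deep Neural Network (DNN) techniques reduced word recognition errors and achieved a 20% performance improvement over earlier systems. However, recognition accuracy varies under interference from the surrounding environment, such as multiple speakers, everyday noise, or songs. To address this problem, this paper proposes a method that extracts the speech signal, analyzes its frequency components, and distinguishes speech from noise through a frequency-domain correlation between audio signals, thereby raising the STT recognition rate and feeding the voice signal to STT more efficiently.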
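A frequency-domain correlation between two audio signals can be sketched as below. The abstract does not specify how the correlation is computed, so this is one plausible reading: a Pearson correlation between magnitude spectra obtained with a naive DFT. All function names and the toy tones are hypothetical.

```python
import cmath
import math

def magnitude_spectrum(signal):
    """Naive O(N^2) DFT magnitudes for bins 0..N/2 of a real signal."""
    n = len(signal)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n)
                for t, x in enumerate(signal)))
        for k in range(n // 2 + 1)
    ]

def spectral_correlation(a, b):
    """Pearson correlation between the magnitude spectra of two signals."""
    sa, sb = magnitude_spectrum(a), magnitude_spectrum(b)
    mean_a, mean_b = sum(sa) / len(sa), sum(sb) / len(sb)
    da = [x - mean_a for x in sa]
    db = [y - mean_b for y in sb]
    denom = math.sqrt(sum(x * x for x in da) * sum(y * y for y in db))
    if denom == 0:
        return 0.0  # a flat spectrum carries no shape to correlate
    return sum(x * y for x, y in zip(da, db)) / denom

def tone(freq_bin, n=64, amplitude=1.0):
    """Sine wave with an integer number of cycles in n samples."""
    return [amplitude * math.sin(2 * math.pi * freq_bin * t / n) for t in range(n)]

reference = tone(4)                    # stands in for a voice template
same_band = tone(4, amplitude=0.5)     # same frequency content, quieter
other_band = tone(20)                  # energy in a different band

corr_same = spectral_correlation(reference, same_band)
corr_other = spectral_correlation(reference, other_band)
print(corr_same > corr_other)
```

A practical system would use an FFT and real speech frames; the naive DFT is used here only to keep the sketch dependency-free.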

Tonality Detection based on Spectrum Energy in Perceptual Audio Coder (지각 오디오 부호화기에서의 스펙트럼 에너지 기반 톤 성분 검출 알고리듬)

이근섭;연규철;박영철;윤대희
- The Journal of Korean Institute of Communications and Information Sciences / v.29 no.6C / pp.770-776 / 2004
The goal of a perceptual audio coder is to reduce the redundancy and irrelevancy of an audio signal based on the concept of masking. Several studies on the masking effect reveal that the masking threshold varies as a function of the noise-like or tone-like nature of the audio signal. Therefore, the tonality of the audio signal significantly influences the quality and efficiency of a perceptual audio coder. In this paper, we propose a new, effective algorithm for measuring tonality using spectrum energy. Since the proposed algorithm consists of a few transcendental functions and simple operations, it has lower complexity than MPEG psychoacoustic model II. The proposed algorithm was tested with several audio signals, and a DSP implementation showed that it could be implemented with 3 MIPS. These results illustrate the efficiency of the proposed algorithm in both performance and complexity.
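To illustrate what an energy-based tonality measure can look like, the sketch below uses the spectral flatness measure, a well-known indicator that separates tone-like from noise-like spectra. This is not the algorithm proposed in the paper, whose details are not given in the abstract; the function names, the floor value, and the -60 dB reference are assumptions of this sketch.

```python
import math

def spectral_flatness(power_spectrum, floor=1e-12):
    """Ratio of geometric mean to arithmetic mean of the power values.

    Close to 1 for flat (noise-like) spectra, close to 0 for peaky ones.
    """
    values = [max(p, floor) for p in power_spectrum]  # avoid log(0)
    log_mean = sum(math.log(v) for v in values) / len(values)
    arithmetic_mean = sum(values) / len(values)
    return math.exp(log_mean) / arithmetic_mean

def tonality_index(power_spectrum, sfm_db_max=-60.0):
    """Map flatness in dB to [0, 1]: 0 is noise-like, 1 is tone-like."""
    sfm_db = 10.0 * math.log10(spectral_flatness(power_spectrum))
    return max(0.0, min(sfm_db / sfm_db_max, 1.0))

flat = [1.0] * 32              # equal energy in every bin
peaky = [1e-9] * 31 + [1.0]    # nearly all energy in one bin
print(tonality_index(flat), tonality_index(peaky))
```

The measure needs one logarithm per bin plus simple sums, which gives a rough sense of why an energy-based tonality estimate can be cheap compared with a full psychoacoustic model.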

MPEG Surround for Multi-Channel Audio Coding-Part 2: Various Modes and Tools (다채널 오디오 코딩을 위한 MPEG Surround-2부: 다양한 모드 및 툴들)

Pang, Hee-Suk
- The Journal of the Acoustical Society of Korea / v.28 no.7 / pp.610-617 / 2009
An overview of the various modes and tools of MPEG Surround is provided. Because the binaural mode of MPEG Surround supports virtual 5.1-channel playback based on HRTFs, it can be played via headphones and earphones on portable audio devices. MPEG Surround also supports the enhanced matrix mode, which converts stereo signals to 5.1-channel signals without side information; the 3D stereo mode, which deals with 3D-coded signals; and the low power version, which greatly reduces the computational load of the decoding process. In addition, MPEG Surround provides the arbitrary downmix gains (ADGs) tool, which is applied to artistic downmix signals; the matrix compatibility tool, which is applied to signals downmixed by conventional matrix-based methods; the residual coding tool, which can be used at high bit rates; and the GES tool, which is applied to specific sounds such as applause. Listening test results from various companies and organizations are also presented for the important modes and tools.
https://doi.org/10.7776/ASK.2009.28.7.610

Standardization Trends of Codecs for Voice Communication Services (음성통신 서비스를 위한 코덱 표준화 동향)

Lee, Mi-Suk;Kim, Do-Yeong;Lee, Byeong-Seon
- Broadcasting and Media Magazine / v.16 no.4 / pp.46-58 / 2011
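This article reviews the features of codecs adopted as standards for voice communication services, focusing on ITU-T and 3GPP, and the standardization trends of the 3GPP EVS (Enhanced Voice Service) codec, which is currently being standardized. Since the mid-2000s, ITU-T has actively standardized wideband and super-wideband codecs, which can code signals with a wider frequency band than the conventional narrowband (telephone band). Since 2010, 3GPP has been standardizing codec technology that provides high quality not only for speech but also for mixed content and audio signals, in order to offer high-quality conversational services in fourth-generation mobile communications.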

Copyright Protection of Multimedia Contents Using Watermark (워터마크를 이용한 멀티미디어 컨텐츠의 저작권 보호)

Seok, J.W.;Hong, J.W.
- Electronics and Telecommunications Trends / v.14 no.6 s.60 / pp.64-73 / 1999
A digital watermark is a signal added to an original signal so that it can be detected or extracted after being embedded in digital data. Also called a digital signature, a watermark is a kind of pattern embedded in digital data, and it has recently been an active research area for protecting the copyright of digital multimedia works. This article reviews the history, definition, and application scope of watermarking technology that can protect the ownership of multimedia data, the requirements a watermark must satisfy, and watermarking techniques for text, image, and audio data.
https://doi.org/10.22648/ETRI.1999.J.140607

Building a UHD Multi Sub-Control Room with the KBS IP Production Workflow (KBS IP 제작 워크플로우 UHD 멀티 부조정실 구축)

신종섭
- Broadcasting and Media Magazine / v.28 no.2 / pp.75-86 / 2023
KBS decided to apply IP transmission, a next-generation technology, to UHD broadcasting and completed construction of an IP UHD sub-control room over about seven months, from September 2017 to March 2018. After a simulation period of about three months, the facility has been used through February 2023 to produce KBS 1TV's '아침마당', '무엇이든 물어보세요', and '더 라이브' and KBS 2TV's '해 볼만한 아침' as live broadcasts. This article introduces the standards referenced for transmitting UHD video and audio signals over IP and the construction details for each part. It also discusses the experience gained from this deployment so that efficient planning and response are possible for IP production systems, which will continue to evolve.

Search Result 435, Processing Time 0.031 seconds
