Search | Korea Science

Enhancement of SBR for Speech Signal Using Adaptive Noise Floor Level (가변 잡음 레벨을 이용한 음성신호에 대한 SBR 성능 항상 기술)

Lee, Se-Won;Oh, Seoung-Jun;Ahn, Chang-Beom;Lee, Tae-Jin;Kang, Kyoung-Ok;Park, Ho-Chong
- The Journal of the Acoustical Society of Korea
- v.28 no.2
- pp.148-154
- 2009
In audio coding, SBR technology synthesizes the high bands using patched time-frequency information from the low bands together with correction parameters. Since SBR transmits only the correction parameters for the high bands, it provides low-rate coding of the high bands and is used as a core module of MPEG-4 HE-AAC. SBR was originally designed for audio signals, and its performance tends to decrease for speech signals; the major reason is an excessive noise floor in the high bands caused by incorrect tonality computation. This paper proposes a new method that determines the noise floor level adaptively according to the speech characteristics in order to solve this problem. The proposed method maintains compatibility with standard SBR, and subjective performance evaluation shows that it improves SBR performance compared with standard SBR, especially for male speech signals.
https://doi.org/10.7776/ASK.2009.28.2.148

Microscopic DVS based Optimization Technique of Multimedia Algorithm (Microscopic DVS 기반의 멀티미디어 알고리즘 최적화 기법)

Lee Eun-Seo;Kim Byung-Il;Chang Tae-Gye
- Journal of the Institute of Electronics Engineers of Korea SP
- v.42 no.4 s.304
- pp.167-176
- 2005
This paper proposes a new power minimization technique for frame-based multimedia signal processing. The technique is derived from a newly proposed microscopic DVS (Dynamic Voltage Scaling) method, in which the operating frequency and supply voltage levels are dynamically controlled according to the processing requirement of each frame of multimedia data. The multimedia signal processing algorithms are also redesigned and optimized to maximize the power saving efficiency of the microscopic DVS technology. Characterizing the mean/variance distribution of the processing load in frame-based multimedia signal processing provides the major basis not only for the optimized application of the microscopic DVS technology but also for the optimization of the multimedia algorithms. The power saving efficiency of the proposed DVS approach is tested experimentally with an MPEG-2 video decoder and an MPEG-2 AAC audio encoder on an ARM9 RISC processor. The experimental results with diverse MPEG-2 video and audio files show average power saving efficiencies of 50% and 30%, respectively. The results also agree very well with the analytic derivations.
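The per-frame control described in this abstract can be illustrated with a minimal sketch. It assumes the usual CMOS dynamic-power model, in which the energy of a frame scales with the square of the supply voltage times the number of cycles. The operating points, the function names, and the cycle counts below are hypothetical and are not taken from the paper.

```python
# Hypothetical (frequency in Hz, supply voltage in V) pairs, sorted by frequency.
OPERATING_POINTS = [(50e6, 0.9), (100e6, 1.1), (200e6, 1.5)]


def pick_operating_point(cycles, period_s, points=OPERATING_POINTS):
    """Return the slowest (frequency, voltage) pair that still finishes
    `cycles` of work within one frame period; fall back to the fastest."""
    for freq, volt in points:
        if cycles / freq <= period_s:
            return freq, volt
    return points[-1]


def relative_energy(cycles, voltage):
    """Dynamic energy of one frame, up to a constant factor (V^2 * cycles)."""
    return voltage ** 2 * cycles


def energy_saving(frame_cycles, period_s, points=OPERATING_POINTS):
    """Fraction of energy saved versus always running at the fastest point."""
    v_max = points[-1][1]
    baseline = sum(relative_energy(c, v_max) for c in frame_cycles)
    scaled = sum(
        relative_energy(c, pick_operating_point(c, period_s, points)[1])
        for c in frame_cycles
    )
    return 1.0 - scaled / baseline


# Usage: three frames with different (made-up) processing loads, 20 ms period.
frames = [0.8e6, 1.5e6, 3.5e6]
print(energy_saving(frames, 0.02))
```

The saving in this sketch depends entirely on how the per-frame load is distributed, which is consistent with the abstract's emphasis on characterizing the mean and variance of the processing load.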

Audio Quality Enhancement at a Low-bit Rate Perceptual Audio Coding (저비트율로 압축된 오디오의 음질 개선 방법)

서정일;서진수;홍진우;강경옥
- The Journal of the Acoustical Society of Korea
- v.21 no.6
- pp.566-575
- 2002
Low-bitrate audio coding makes a number of Internet and mobile multimedia streaming services more efficient. With the help of next-generation mobile telephone technologies and digital audio/video compression algorithms, we can enjoy real-time multimedia content on mobile devices (cellular phone, PDA, notebook, etc.). However, the limited bandwidth of mobile communication networks prohibits transmitting high-quality AV content, and most of the bandwidth is assigned to video. In this paper, we design a novel and simple method for reproducing high-frequency components. The spectrum of the high-frequency components, which are lost by down-sampling, is modeled by its energy ratio to the low-frequency band on the Bark scale, and these values are multiplexed with the conventionally coded bitstream. At the decoder, the high-frequency components are reconstructed by duplicating the low-frequency band spectrum scaled by the decoded energy ratios. Segmental SNR and MOS tests confirm that the proposed method enhances subjective sound quality with only 10% to 20% additional bits. In addition, the proposed method can be applied to all kinds of frequency-domain audio compression algorithms, such as MPEG-1/2, AAC, and AC-3.
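The encode-ratio and copy-and-scale steps in this abstract can be sketched roughly as follows. This is an illustration under stated assumptions, not the authors' implementation: the bands here are simple index ranges rather than Bark-scale bands, each high band is filled from the low-band segment exactly `cutoff` bins below it, and the function names are hypothetical.

```python
import math


def encode_ratios(spectrum, cutoff, bands):
    """Encoder side: for each high band (lo, hi), compute the ratio of its
    energy to the energy of the low-band segment that will be copied into it."""
    ratios = []
    for lo, hi in bands:
        src = spectrum[lo - cutoff:hi - cutoff]      # low-band source segment
        e_src = sum(x * x for x in src)
        e_high = sum(x * x for x in spectrum[lo:hi])
        ratios.append(e_high / e_src if e_src > 0 else 0.0)
    return ratios


def reconstruct(low_spectrum, cutoff, bands, ratios):
    """Decoder side: duplicate the low-band spectrum into each high band,
    scaled so that the band energy matches the transmitted ratio."""
    out = list(low_spectrum[:cutoff]) + [0.0] * (bands[-1][1] - cutoff)
    for (lo, hi), ratio in zip(bands, ratios):
        gain = math.sqrt(ratio)                      # energy ratio -> amplitude
        for k in range(lo, hi):
            out[k] = gain * low_spectrum[k - cutoff]
    return out


# Usage: an 8-bin toy magnitude spectrum, low band = bins 0..3.
full = [1.0, 2.0, 3.0, 4.0, 0.5, 1.0, 1.5, 2.0]
bands = [(4, 6), (6, 8)]
ratios = encode_ratios(full, 4, bands)
rebuilt = reconstruct(full[:4], 4, bands, ratios)
```

Only one ratio per band is transmitted, which is why the side information stays small relative to the core bitstream; the fine structure of the high band is borrowed from the low band rather than coded.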

Huffman decoding method based on bit-wise comparison (Bit-wise comparison에 기초한 Huffman decoding 기법)

정종훈;김병일;장태규;장흥엽
- Proceedings of the IEEK Conference
- 2001.06d
- pp.131-134
- 2001
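This paper presents a bit-wise comparison method for efficient Huffman decoding. Based on the binary-tree construction that underlies Huffman coding, the method reorganizes the Huffman table to shorten decoding time and simplify the algorithm. Its performance was verified by applying it to the Huffman decoding stage of an MPEG-2 AAC decoder.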

An efficient search of binary tree for huffman decoding based on numeric interpretation of codewords

Kim, Byeong-Il;Chang, Tae-Gyu;Jeong, Jong-Hoon
- Proceedings of the IEEK Conference
- 2002.07a
- pp.133-136
- 2002
This paper presents a new method of Huffman decoding that significantly improves processing efficiency. It is based on reconstructing the Huffman table as an efficient one-dimensional array data structure that incorporates the numeric interpretation of the accrued codewords in the binary tree. In the proposed search method, the branching address is obtained directly by an arithmetic operation on the incoming digit value, eliminating the compare instruction needed in a binary tree search. The proposed search method improves processing efficiency by 30% and reduces the memory space of the reconstructed Huffman table to one third, compared with the ordinary 'compare and jump' based binary tree. Experimental results with six MPEG-2 AAC test files also show a performance improvement of about 198% compared with the widely used conventional sequential search method.
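The idea of obtaining the branching address by arithmetic on the incoming bit can be illustrated with a flat array in which each internal node owns a pair of adjacent entries, so the next entry is simply `base + bit`. This is a minimal sketch, not the paper's table layout: the encoding of leaves as negative values, the function names, and the toy code are all assumptions made for illustration.

```python
def build_table(codes):
    """Flatten a complete prefix code {symbol: bitstring} into a 1-D array.

    Assumed layout: entries come in pairs, one pair per internal node.
    A value >= 0 is the index of the child pair; a negative value
    -(k + 1) marks a leaf holding symbols[k].
    """
    symbols = list(codes)
    table = [None, None]                      # pair owned by the root
    for k, sym in enumerate(symbols):
        base = 0
        bits = codes[sym]
        for i, ch in enumerate(bits):
            slot = base + int(ch)             # address = base + bit value
            if i == len(bits) - 1:
                table[slot] = -(k + 1)        # leaf entry
            else:
                if table[slot] is None:       # allocate a new child pair
                    table[slot] = len(table)
                    table.extend([None, None])
                base = table[slot]
    return table, symbols


def decode(bits, table, symbols):
    """Decode a bitstring by walking the flat table."""
    out = []
    base = 0
    for ch in bits:
        entry = table[base + int(ch)]         # no branch on the bit itself
        if entry < 0:                         # leaf test is still needed
            out.append(symbols[-entry - 1])
            base = 0
        else:
            base = entry
    return out


# Usage with a toy prefix code.
codes = {"a": "0", "b": "10", "c": "110", "d": "111"}
table, symbols = build_table(codes)
print(decode("0101100111", table, symbols))
```

In this layout the only remaining test per bit is the leaf check; the choice between the left and right child is folded into the index computation, which is the property the abstract highlights.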

Unified coding scheme of speech and music (음악 및 음성 신호의 융합 압축 기술)

O, Eun-Mi
- Broadcasting and Media Magazine
- v.16 no.4
- pp.59-71
- 2011
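Audio coding and speech coding rest on different technical foundations, but with the recent convergence of the mobile multimedia device market the signals to be compressed are increasingly mixed, and the two are converging toward similar target bit rates and quality. At present, a single device applies different compression technologies, but for multimedia devices that deliver speech and music at the same time, handling both with a single coding scheme has become an important issue. In particular, given the spread of smartphones and music content portal services, a unified coding technology that compresses both speech and music signals efficiently appears increasingly necessary. This article introduces the background and standardization status of Unified Speech and Audio Coding (USAC), the most recent work of the MPEG audio group. At 64 kbps and below, USAC delivers better quality than AMR-WB+ and HE-AAC v2, the technically best-performing codecs, and it guarantees equivalent quality at higher bit rates. The article examines the switching structure of USAC that contributes to this quality, together with the main technically improved modules: parametric stereo, high-frequency coding, and entropy coding. USAC, which compresses diverse audio signals efficiently, is likely to be used in scenarios such as digital radio, mobile TV, and audiobooks. Because it also performs well in the presence of background noise or background music, it can be useful for user-generated content such as YouTube and podcasts.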

An Extension of Interactive Media System for Mobile Device (모바일 단말을 위한 인터렉티브 미디어 시스템의 확장)

Han, Seung-Jin;Ryu, Eun-Seok;Yoo, Hyuck
- Proceedings of the Korea Information Processing Society Conference
- 2005.05a
- pp.201-204
- 2005
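Interactive media, which reflects users' preferences and opinions, is an indispensable topic in HCI (Human Computer Interaction), a current worldwide trend. We designed IMS (Interactive Media System), which provides services through user interaction in a mobile device environment, and implemented it on a PDA. The approach shown in earlier work, which supports media only in the form of links, can be a burden in mobile environments where resources such as CPU are scarce. IMS instead supports media objects internally, which improves processing speed in a way suited to mobile environments. To overcome the limits on media format support that this approach can cause, IMS is designed with an extensible structure; a simple system that offered only images, text, and vector graphics has gained modules such as H.264 and MPEG4 AAC. In addition, with the OpenGL module added and 3D objects newly defined, IMS can support 3D graphics at the markup-language level through IML, enabling diverse visual compositions that use 2D and 3D together. This paper describes in detail the extensible structure of IMS and the process of adding OpenGL and defining new media objects.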


