Search | Korea Science

Unified Speech and Audio Coding Technology (통합 음성 오디오 부호화 기술)

Lee, Taejin;Beack, Seungkwon;Kang, Kyeongok;Kim, Whan-Woo
- Proceedings of the Korean Society of Broadcast Engineers Conference / 2011.07a / pp.264-267 / 2011
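As technology evolves toward the convergence of multi-function mobile devices into a single device, demand is growing for coding technology that delivers high sound quality for both speech and audio. MPEG began full-scale standardization of MPEG-D USAC with a CfP in October 2008 and approved the Study on DIS at the 96th meeting in March 2011. This paper proposes a method for improving USAC performance by modifying the TCX window of the LPD mode. Unlike the current scheme, which uses only a fixed-size overlap to connect TCX frames, the proposed method adjusts the TCX window overlap size according to the previous TCX mode, the next TCX mode, and the presence or absence of a transient, which improves the sound quality of the LPD mode for music-like signals.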

Spectral recovery method based on TCX mode using CNN (CNN을 이용한 TCX 모드 기반의 주파수 정보 복원 기술)

Kim, Jaewon;Shin, Seong-Hyeon;Han, Seokhyeon;Choi, Hyunkook;Kim, Sangmin;Park, Hochong
- Proceedings of the Korean Society of Broadcast Engineers Conference / 2020.07a / pp.340-342 / 2020
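This paper proposes a spectral recovery technique based on the TCX mode using a CNN. The TCX mode is a quantization technique for speech supported in USAC; during encoding it flattens the spectral envelope and then quantizes. This flattening raises the correlation among spectral components, which makes the network easier to train and improves prediction performance. The proposed method applies TCX-mode-based quantization to a spectral recovery method built on a psychoacoustic model, and recovers the lost spectral information using only part of the spectral information. The proposed method achieved a lower training error than the existing method and equivalent sound quality under non-optimized conditions.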

Improvement of the TCX Module in AMR-WB+ Codec Using Pyramid VQ (Pyramid VQ를 이용한 AMR-WB+ 코덱 내 TCX 모듈의 성능 개선)

Park, Sang-Kuk;Park, Jung-Eun;Baik, Seung-Kweon;Seo, Jung-Il;Kang, Sang-Won
- The Journal of the Acoustical Society of Korea / v.26 no.3 / pp.109-114 / 2007
In this paper, we propose a pyramid VQ to quantize the transform coefficients of the TCX module in order to improve the audio quality of the AMR-WB+ codec. The proposed pyramid VQ is compared to the $RE_8$ lattice VQ used in the AMR-WB+ standard codec, demonstrating improvements of 4% and 5.7%, respectively, in mean squared error (MSE) and 3.3% and 4.7%, respectively, in Perceptual Evaluation of Audio Quality (PEAQ) for the 8-dimensional and 16-dimensional pyramid VQ.
https://doi.org/10.7776/ASK.2007.26.3.109

Design of the TCX module transform coefficients quantizer in AMR-WB+ codec using PVQ (PVQ 방식을 이용한 AMR-WB+ 코덱의 TCX 모듈 변환계수 양자화기 설계)

Park, Sang-Kuk;Park, Jung-Eun;Kang, Sang-Won
- Proceedings of the IEEK Conference / 2007.07a / pp.345-346 / 2007
In this paper, we propose a pyramid VQ (PVQ) to quantize the transform coefficients of the TCX module in order to improve the music quality of the AMR-WB+ codec. The proposed PVQ is compared to the $RE_8$ lattice VQ used in the AMR-WB+ standard codec, demonstrating improvements of 4% and 5.7%, respectively, in mean squared error (MSE) and 3.3% and 4.7%, respectively, in Perceptual Evaluation of Audio Quality (PEAQ) for the 8-dimensional and 16-dimensional pyramid VQ.

Design of Low Bits Rate Transform Excitation Wide Band Speech and Audio Coder of Analysis-by-Synthesis Structure (분석/합성 구조의 저 전송률 변환여기 광대역 음성/오디오 부호화기 설계)

Jang, Sunghoon;Hong, Kibong;Lee, Insung
- The Journal of the Acoustical Society of Korea / v.31 no.7 / pp.472-479 / 2012
This paper presents the design of a 9.2 kbps low-bit-rate transform excitation coder that targets speech and audio signals. To achieve the low bit rate, we use band selection in the frequency domain, gain-shape quantization, and an analysis-by-synthesis (AbS) structure. To reduce the heavy computation of the AbS structure, we apply the IDFT and synthesis to each band separately. We also improve performance by inserting comfort noise into the non-transmitted bands. The proposed coder has a lower bit rate than the original 10.4 kbps AMR-WB+ TCX mode while delivering similar performance.
https://doi.org/10.7776/ASK.2012.31.7.472

Adaptive TCX Windowing Technology for Unified Structure MPEG-D USAC

Lee, Tae-Jin;Beack, Seung-Kwon;Kang, Kyeong-Ok;Kim, Whan-Woo
- ETRI Journal / v.34 no.3 / pp.474-477 / 2012
The MPEG-D unified speech and audio coding (USAC) standardization process was initiated by MPEG to develop an audio codec that is able to provide consistent quality for mixed speech and music content. The current USAC reference model structure consists of frequency domain (FD) and linear prediction domain (LPD) core modules and is controlled using a signal classifier tool. In this letter, we propose an LPD single-mode USAC structure using an adaptive windowing-based transform-coded excitation module. We tested our system using official test items for all mono-evaluation modes. The results of the experiment show that the objective and subjective performances of the proposed single-mode USAC system are better than those of the FD/LPD dual-mode USAC system.
https://doi.org/10.4218/etrij.12.0211.0404

Effects of Single and Repeated Electroconvulsive Shock on the Acetylcholine and Polyamine Contents in Temporal Cortex and Decorticated Cerebrum of Mice (경련성 전기충격에 의하여 나타나는 측뇌-피질과 피질을 제외한 대뇌의 Acetylcholine및 Polyamine 함량-변동에 관한 연구)

Choi, Sang-Hyun;Lee, Hak-Hee;Park, Chung-San;Chun, Boe-Gwun;Chun, Yeon-Sook
- The Korean Journal of Pharmacology / v.27 no.1 / pp.13-20 / 1991
There are some rather conflicting reports on ECS-induced changes of brain acetylcholine, and recently Zawia and Bondy (1990) proposed a biological role for the polyamine system in the long-term adaptive responses of the brain to electrical stimulation. This study was undertaken to evaluate the effects of a single or repeated ECS (10 mA, 100 cps, 1 sec; 5 ECS spread out over 9 days) on the brain acetylcholine (ACh) and polyamine contents of male mice. The ACh contents of temporal cortex (TCx) and decorticated cerebrum (dc-CB) were markedly increased by 79.9% and 49.4%, respectively, 10 and 30 min after ECS, and the increases were significantly attenuated with repeated 5 ECS, particularly in dc-CB. The putrescine concentrations of TCx and dc-CB differed little and were not affected by 1 ECS or 5 ECS. The spermidine (Sd) concentration, however, was higher in dc-CB and the spermine (Sm) concentration was higher in TCx. They were moderately decreased after 1 ECS, and the decreases were accentuated after 5 ECS, particularly in dc-CB. Sm (30 mg/kg, i.p., injected 30 min before ECS) did not affect the ECS-induced increase of ACh content. These results suggest that both brain ACh and polyamines may be implicated in the long-term adaptive responses to electrical stimulation.

MPEG Audio New Standard: USAC Technology (MPEG 오디오 최신 표준: USAC 기술)

Lee, Tae-Jin;Kang, Kyeong-Ok;Kim, Whan-Woo
- Journal of Broadcast Engineering / v.16 no.5 / pp.693-704 / 2011
As mobile devices become multi-functional and converge into a single platform, there is a strong need for a codec that is able to provide consistent quality for speech and music content. MPEG-D USAC standardization activities started at the 82nd MPEG meeting with a CfP, and the Study on DIS was approved at the 96th MPEG meeting. MPEG-D USAC is a converged technology of AMR-WB+ and HE-AAC V2. Specifically, USAC utilizes three core codecs (AAC, ACELP, and TCX) for low frequency regions, SBR for high frequency regions, MPEG Surround for stereo information, and window transition technology for smoothing transitions between the various core coders. USAC can provide consistent sound quality for both speech and music content and can be applied to various applications such as multimedia download to mobile devices, digital radio, mobile TV, and audio books.
https://doi.org/10.5909/JEB.2011.16.5.693

MPEG-D USAC: Unified Speech and Audio Coding Technology (MPEG-D USAC: 통합 음성 오디오 부호화 기술)

Lee, Tae-Jin;Kang, Kyeong-Ok;Kim, Whan-Woo
- The Journal of the Acoustical Society of Korea / v.28 no.7 / pp.589-598 / 2009
As mobile devices become multi-functional and converge into a single platform, there is a strong need for a codec that is able to provide consistent quality for speech and music content. MPEG-D USAC standardization activities started at the 82nd MPEG meeting with a CfP, and WD3 was approved at the 88th MPEG meeting. MPEG-D USAC is a converged technology of AMR-WB+ and HE-AAC V2. Specifically, USAC utilizes three core codecs (AAC, ACELP, and TCX) for low frequency regions, SBR for high frequency regions, and the MPEG Surround tool for stereo information. USAC can provide consistent sound quality for both speech and music content and can be applied to various applications such as multimedia download to mobile devices, digital radio, mobile TV, and audio books.
https://doi.org/10.7776/ASK.2009.28.7.589

Frequency Band Selection Exited Linear Prediction Wideband Speech/Audio Coding Using SBR (SBR을 이용한 주파수 밴드선택 여기 선형예측 광대역 음성/오디오 부호화)

Jang, Sunghoon;Lee, Insung
- The Journal of the Acoustical Society of Korea / v.32 no.6 / pp.556-562 / 2013
This paper aims to improve the performance of a band-selection speech/audio coder in which the spectrum of the bands that are not transmitted is reconstructed with comfort noise. To improve performance, we use the spectral band replication (SBR) technique instead of comfort-noise substitution. To synthesize the SBR signal, the SBR algorithm refers to the selected bands, and the spectrum synthesized by SBR is injected into the non-selected bands. Each sub-band spectrum is energy-weighted according to the real audio signal. We propose an enhanced band-selection coder that uses an SBR signal synthesized from the selected signal instead of comfort noise.
https://doi.org/10.7776/ASK.2013.32.6.556

