Search | Korea Science

An Efficient Computation of FFT for MPEG/Audio Psycho-Acoustic Model (MPEG 심리음향모델의 고속 구현을 위한 효율적 FFT 연산)

송건호;이근섭;박영철;윤대희
- Journal of the Institute of Electronics Engineers of Korea SP
- /
- v.41 no.6
- /
- pp.261-269
- /
- 2004
In this paper, an efficient algorithm for computing in the MPEG/audio Layer Ⅲ (MP3) encoder is proposed. The proposed algerian performs a full-band 1024-point FFT by computing 32-point FFT's of 32 subband outputs. To reduce the aliasing caused by the analysis filter bank, an aliasing cancellation butterfly is developed. A major benefit of the proposed algorithm is the computational saving. By using the proposed algorithm, it is possible to save 40~50% of computations for FFT, which results in about 20% reduction of the PAM-2 complexity.
PDF KSCI

Audio Watermark Using Psychoacoustic Model (심리음향 모델을 이용한 오디오 워터마킹)

이희숙;이우선
- Proceedings of the Korean Information Science Society Conference
- /
- 2001.04a
- /
- pp.859-861
- /
- 2001
본 논문은 오디오의 masking특성을 적용한 심리음향 모델을 이용하여 오디오의 고음질을 보장하면서 잡음과 압축 등의 공격에 강한 오디오 워터마킹 방법을 제안한다. 제안하는 워터마킹 방법은 심리음향 모델에 의해 생산되는 masking thresholds와 원신호의 power spectral density의 각 주파수별 차이 에너지를 이용하여 시간도메인에서 워터마크를 삽입하는 방법으로 오디오의 품질을 유지할 수 있다. 워터마크로는 자기상관성이 강한 PN-시퀀스를 이용하여 강인한 워터마킹을 구현한다. 그리고 PN-시퀀스와 같은 이진 시퀀스 워터마크의 검출을 위한 유사도 측정식을 제안한다.
PDF

MPEG Audio Layer-III Encoder Using Approximated Psy-choacoustic Model (간략화된 심리음향모델을 이용한 MPEG Audio Layer-III 부호화기)

송창준;오현오;박영철;윤대희
- Proceedings of the IEEK Conference
- /
- 2001.09a
- /
- pp.469-472
- /
- 2001
MPEC Audio Layer-III(MP3)알고리듬은 복호화기에 비해 부호화기가 월등히 많은 연산량을 가지고 있는 비대칭 구조를 가지고 있다. MP3 부호화기의 대부분의 연산량은 복잡한 초월함수 연산이 포함되는 심리음향모델과 반복 루프 과정을 수행하는 비선형 양자화와 비트 할당과정 이 차지한다. 본 논문에서는 MP3 부호화기의 실시간 구현을 위한 알고리듬 레벨의 최적화를 수행하였다. MP3 부호화기의 연산량을 줄이기 위해 심리음향모델을 간략화하고 반복 루프의 회수를 최소화할 수 있는 방법을 제안하였다. 프레임당 한 그래뉼의 심리음향모델 정보를 계산하여 한 프레임 내에서의 심리음향모델 정보를 추정함으로써 연산량을 45% 이상 감소시켰다. 또한 외부 반복 루프의 반복 회수를 줄이기 위하여 외부 반복 루프의 반복에 따른 스케일 팩터(Scale Factor) 및 양자화 스탭의 증가 패턴을 관찰하고 최적화된 스캐일 팩터 증가 방법을 제안하였다. 제안된 고속화 방법은 주관적 음질 평가를 통해 성능을 검증하였다.
PDF

Objectively Quantified Consonance of Complex Sounds (객관적으로 정량화된 복합 신호음의 조화도)

Chon, Sang-Bae;Choi, In-Yong;Lee, Min-Gu;Sung, Koeng-Mo
- The Journal of the Acoustical Society of Korea
- /
- v.26 no.7
- /
- pp.323-327
- /
- 2007
In this paper, objectively quantified consonance of complex sound is proposed as a new psychoacoustical parameter. Proposing algorithm quantifies consonance of complex sound after applying psycho acoustical models which are parts of human perception such as masking effect, equal loudness contour, and critical band. To verify proposing algorithm, experiments with 10 car horn signals which have different complex sound were performed. The experiments show cross correlation of 0.95 between objectively quantified consonance by proposing algorithm and subjectively assessed consonance by listening tests. Considering the fact that there are few psychoacoustical parameter except Zwicker parameter, proposing algorithm will help to quantify psychoacoustical effect of complex sounds objectively.
https://doi.org/10.7776/ASK.2007.26.7.323 인용 PDF KSCI

Effect of Fabric Sound of Vapor Permeable Water Repellent Fabrics for Sportswear on Psychoacoustic Properties (스포츠웨어용 투습발수직물 소리가 심리음향학적 특성에 미치는 영향)

Lee, Jee-Hyun;Lee, Kyu-Lin;Jin, Eun-Jung;Yang, Yoon-Jung;Cho, Gil-Soo
- Science of Emotion and Sensibility
- /
- v.15 no.2
- /
- pp.201-208
- /
- 2012
The objectives of this study were to investigate the psychoacoustic properties of PTFE(Poly tetra Fluoroethylene) laminated vapor permeable water repellent fabrics which are frequently used for sportswear, to examine the relationship among fabrics' basic characteristics, mechanical properties and the psychoacoustic properties, and finally to propose the predicting model to minimize the psychoacoustic fabric sound. A total of 8 specimens' frictional sound were recorded and Zwicker's psychoacoustic parameters such as loudness(Z), sharpness(Z), roughness(Z), and fluctuation strength(Z) were calculated using the Sound Quality Program. Mechanical properties of specimens were measured by KES-FB system. Loudness(Z) of specimen D-1 was the highest, which means the rustling sound of the specimen D-1 was the most noisy. Statistically significant difference among film type was observed only in loudness(Z) for fabric sound. Based on ANOVA and post-hoc test, specimens were classified into less loud PTFE film group (groupI) and loud PTFE film group (groupII). Loudness(Z) was higher when staple yarn was used compared when filament yarn was used. According to the correlation between the mechanical properties of fabrics and loudness(Z) in groupI, the shear properties, compression properties and weight showed positive correlation with loudness(Z). According to the regression equation predicting loudness(Z) of groupI, the layer variable was chosen. In groupII, variables explaining the loudness(Z) were yarn types and shear hysteresis(2HG5).
PDF

Adaptive Watermarking for MP3 Copyright Protections Using Psychological Acoustics (심리음향 분석을 이용한 MP3 저작권 보안을 위한 적응적 워터마킹)

Lee, Kyeong-Hwan
- The Journal of the Acoustical Society of Korea
- /
- v.32 no.1
- /
- pp.64-70
- /
- 2013
In this paper, we suggest a new audio watermarking method for audio contents copyrights that can efficiently provide protection from MP3 compression attacks. Watermarks were inserted at the coefficients repeatedly from low frequencies to high frequencies after DCT transform in commonly used Cox's spread spectrum method. Because the methods using arbitrary coefficients are not effective, we use the new weight functions that make small losses for the watermark coefficients during attacks, using psychological acoustics. In the results of various sound clips, the suggested method had overall better outcomes than the Cox's method by preserving watermarks and reducing distortions of the original sounds.
https://doi.org/10.7776/ASK.2013.32.1.064 인용 PDF KSCI

History of Evaluation of sound and noise in passenger cars (승용차의 내부소음평가에 대한 연구사고찰)

Schick, August
- Science of Emotion and Sensibility
- /
- v.1 no.2
- /
- pp.133-147
- /
- 1998
이 글은 영구의 자동차내부 소음연구와 보버르트(Bobbert)의 첫 번재 독일어로 쓰인 연구를 다루고 있다. 승용차내부소음은 높은 적외파음향(infra sound)으로 특징지어 진다. 그런 까닭으로 그러한 종류의 소음을 A-평가계측(A-rated measurement)으로 그 영향을 파알하는 것은 한계가 있다. 자동차음향공학은 특히 인공청각(artificial head)기술의 발전, 소리의 합성적 제조 및 다양한 자동차 내부 음색(timbre)에 대한 일본연구가들 (하쉬모토, 쿠와노, 남바)에 의해 상당산 성취를 경험하였다. 위의 연구들은 무엇보다도 의미분별기법(Semantic Differentials)과 다차원측정방법(Multidimen-sional Scaling)을 사용하고 있다. 그러한 기법의 사용은 심리측정방법을 심리음향학(Psycho-acoustics), 특히 츠뷔커학파(Zwicker school)의 심리음향학 방법론과 결합한 것으로 볼 수 있다.
PDF

MDCT/IMDCT (MPEG 오디오 신호처리를 위한 MDCT/IMDCT의 FPGA 구현)

노진수;이강현
- Proceedings of the Korea Multimedia Society Conference
- /
- 2003.05b
- /
- pp.69-73
- /
- 2003
음향압축에 있어서 인간의 청각신경의 특성을 이용하는 방식이 사용되고 있다. 이러한 방법은 심리음향모델(psychoacustical model)에서 도입되었다. 음향압축에서는 이러한 심리음향모델을 사용하여 인간이 지각할 수 없는 한도 내에서 부호화하지 않는 지각음향부호화(perceptual audio coding)사용한다. 지각음향부호화는 분석필터와 합성필터로 각각 부호화 복호화하는데 이것은 필터뱅크(filter bank)로 구현된 서브밴드코더(subband coder) 이다. 본 논문에서는 분석필터와 합성필터에 사용되는 MDCT(Modified Discrete Cosine Transform)와 IMDCT(Inverse Modified Discrete Cosine Transform)를 FPGA에 구현하였다.
PDF

Improved MPEG-Audio Coding Method (MPEG 오디오 부호화 바업의 성능 향상)

신종인
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1994.06c
- /
- pp.183-187
- /
- 1994
ISO/MPEG 에서는 스테레오 신호만을 부호화할 수 있는 MPEG-1 오디오 부호화 방법을 5.1 채널의 다채널 신호로 확장한 MPEG-2 오디오 방법을 제안하였다. 압축해야 될 신호가 증가하면서 MPEG에서는 채널 내의 부호화 방법으로는 MPEG-1에서 제안된 방법을 사용하고, 부가적으로 채널 간의 부호화 방법을 이용하여 MPEG-1과 호환이 가능하도록 하는 부호화 방법을 다방면에 걸쳐서 연구하여 표준화 작업을 진행하고 있다. 본 논문에서는 MPEG 오디오 부호화 방법을 두가지 측면에서 효율적으로 향상시키는 방법을 제안하고자 한다. 첫 번째는 MPEG에서 제안한 오디오 부호화 알고리듬을 개선하여 음질과 비트율에 있어 향상시키는 것으로 각 서브밴드의 비트 할당 방법과 시간 영역에서의 마스킹 효과 등을 사용한 심리음향 모델 등의 개선 방법이 제안되었다. 두 번째 방법은 부호화기의 계산량을 감소시키는 방법으로 심리음향 모델이나 비트 할당시의 계산과정에 있어 반복적인 과정은 시간 여역에서의 중복성을 이용하여 계산량에 대한 향상을 얻을 수 있었다.
PDF

Towards a better understanding of psychoacoustics in the future broadcasting (차세대 실감 방송의 구현을 위한 심리 음향의 이해)

Kim Sungyoung
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2004.11a
- /
- pp.215-219
- /
- 2004
차세대 실감 방송에서의 오디오 신호는 정보의 전달이라는 기존의 역할을 넘어서 현장감의 재현이라는 실감 방송의 목표를 구현하는 역할을 감당하게 될 것이다. 이 논문에서는 이러한 차세대 실감 방송에서 오디오 신호가 가지는 심리음향학적인(psychoacoustic) 특성을 방송현장의 운용자들을 위해 기존의 연구들에 근거하여 선명하였다. 차세대 방송은 첫째, 멀티채널 오디오 방송, 둘째, 고 해상도 데이터의 활용 그리고 셋째, 멀티 모달 전송로 특정지울 수 있는 새로운 오디오 산업의 기술진행 방향을 통해, 방송으로 전달되어지는 객체에 대하여 개선된 정위(localization), Envelopment 명료도(Clarity)등의 개선된 심리음향학적인 특성을 가지게 한다. 이와 같은 심리음향학적인 개선은 운용자의 올바른 개념적인 이해와 결합하여 보다 현장감 넘치는 방송을 청취자들에게 가져다 줄 것이다.
PDF

Search Result 201, Processing Time 0.025 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)