Search | Korea Science

A Perceptual Audio Coder Based on Temporal-Spectral Structure (시간-주파수 구조에 근거한 지각적 오디오 부호화기)

김기수;서호선;이준용;윤대희
- Journal of Broadcast Engineering
- /
- v.1 no.1
- /
- pp.67-73
- /
- 1996
In general, the high quality audio coding(HQAC) has the structure of the convertional data compression techniques combined with moodels of human perception. The primary auditory characteristic applied to HQAC is the masking effect in the spectral domain. Therefore spectral techniques such as the subband coding or the transform coding are widely used[1][2]. However no effort has yet been made to apply the temporal masking effect and temporal redundancy removing method in HQAC. The audio data compression method proposed in this paper eliminates statistical and perceptual redundancies in both temporal and spectral domain. Transformed audio signal is divided into packets, which consist of 6 frames. A packet contains 1536 samples($256{\times}6$) :nd redundancies in packet reside in both temporal and spectral domain. Both redundancies are elminated at the same time in each packet. The psychoacoustic model has been improved to give more delicate results by taking into account temporal masking as well as fine spectral masking. For quantization, each packet is divided into subblocks designed to have an analogy with the nonlinear critical bands and to reflect the temporal auditory characteristics. Consequently, high quality of reconstructed audio is conserved at low bit-rates.
PDF

Audio Enhancement Algorithm Using Adaptive Perceptual Filter (적응 지각 필터를 이용한 오디오 음질 개선 알고리즘)

엄혜영;한헌수;홍민철;차형태
- The Journal of the Acoustical Society of Korea
- /
- v.22 no.8
- /
- pp.687-693
- /
- 2003
In this paper, a new adaptive audio signal enhancement algorithm is proposed. In order to remove a broadband noise from a noisy signal, a filter is designed and applied adaptively to noisy audio signal. The noisy signal is first transformed to frequency domain and divided into bark domain to calculate excitation energy. A filter will be calculated to eliminate the noise by using the excitation energy and noisy energy which is obtained from a silent area. The filter is adaptively adjusted and continuously applied until the threshold point is met. The algorithm also works well even though the noise's energy change all of a sudden. SNR, NMR comparison and MOS Test are performed to show the effectiveness of the proposed algorithm.
PDF KSCI

Digital Audio Watermarking in The Cepstrum Domain (켑스트럼 영역에서의 오디오 워터마킹 방법)

이상광;호요성
- Journal of Broadcast Engineering
- /
- v.6 no.1
- /
- pp.13-20
- /
- 2001
In this paper, we propose a new digital audio watermarking scheme In the cepstrum domain. We insert a digital watermark signal Into the cepstral components of the audio signal using a technique analogous to spread spectrum Communications, hiding a narrow band signal in a wade band channel. In our proposed method, we use pseudo-random sequences to watermark the audio signal. The watermark Is then weighted in the cepstrum domain according to the distribution of cepstral coefficients and the frequency masking characteristics of the human auditory system. The proposed watermark embedding scheme minimizes audibility of the watermark signal. and the embedded watermark is robust to mu1tip1e watermarks, MPEG audio ceding and additive noose.
PDF

Functional MR Imaging of Language System : Comparative Study between Visual and Auditory Instructions in Word Generation Task (언어 중추 영역에 대한 기능적 자기공명영상: 시각적, 청각적 지시 과제에 관한 비교)

구은회;권대철;김동성;송인찬
- Journal of Biomedical Engineering Research
- /
- v.24 no.4
- /
- pp.241-246
- /
- 2003
To evaluate the usefulness if functional MR imaging(MRI) for the determination of language dominance system and to assess differences in the visual and auditory instrument language generation task according to activation task or activated area. Functional maps of the language area were obtained during visual and auditory instructions in word generation tasks in 6 healthy volunteer with right-handness were examined on a 1.5T scanner and the EPI BOLD technique, and three pulse sequence technique get of the true axial planes. Both task consisted of 96 phases including 6 activations and rests contents. Postprocessing were done on MRDx program by using cross correlation method. Two task compare the blain activation area surveyed of 1anguage lateralization index. To evaluated of the detection rates of Broca. Wernicke, pre-frontal lobe, Supplementary Motor Area (SMA) and pre-motor cortex areas and the differences of language lateraliaztion among two word generation task To lateralization index survey in 1anguage area on right and left in brain get to activation area pixel in brain. Compared to visual and auditory instrument task in the language areas get to the lateralization index. Two language generation task high detection rates of Broca and Wernicke areas. The visual instruction no detected in the auditory area, and auditory instruction no detected in the visual area. There was statistics significant different of them among language generation task. 1'his indicated that language area obtained image of the brain functional MR imaging usefulness in the visual and auditory task instrument.
PDF KSCI

A Sync Detect ion and Watermarking Method with the Wavelet Transform (왜이브릿 변환을 이용한 sync 탐지 기법과 워터마킹 기법)

황원영;염학송;강환일;한승수;김갑일
- Proceedings of the Korean Institute of Intelligent Systems Conference
- /
- 2002.12a
- /
- pp.309-312
- /
- 2002
본 논문에서는 오디오 워터 마킹 기법을 제안한다. 이 방법은 5차 웨이브릿 변환을 이용한sync 탐지 기법을 제안한다. 이 원리를 Zuicker의 인간청각모델의 한계 밴드이론을 이용한다. 그리고 워터마킹 검출에는 정점 탐지 기법에서 많이 이용하는 에너지와 제로통과 비율을 이용하여 워터마크를 검출한다 실험을 통하여 본 알고리즘이 mp3압축에 강인할 뿐 아니라 디지털에서 아날로그신호로 바꾸고 다시 디지털 신호로 바꾸는 아날로그 공격에 시간영역이나 DCT영역에서 워터마킹을 행하는 것보다 본 알고리즘이 강인함을 보인다 본 오디오 알고리즘은 음악에 연동하는 전기기기를 구성할 때 유용한 알고리즘이 될 수 있다. 즉 음악에 워터마크를 삽입하여 이 워터마크를 전기기기 동작제어 비트열로 이용할 수 있을 것이다.

Design of Audio Watermarks by Noise Shaping (잡음 형상화에 의한 오디오 워터마크 설계)

Lee, Jin-Geol
- Journal of Korea Multimedia Society
- /
- v.8 no.11
- /
- pp.1432-1438
- /
- 2005
A psychoacoustic model based noise shaping method is proposed. The method shapes the noise in the frequency domain such that its presence with a host signal will not be perceptually noticeable. The derivation of imperceptible noise levels from the masking thresholds of the signal involves deconvolution associated with the spreading function in the psychoacoustic model. It has been known as an ill-conditioned Problem. In this paper, a constrained optimization is applied such that the noise excitation level conforms to the masking thresholds of the signal. Thus, the noises embedded in the signal will not be perceived by human ear, and its performance is demonstrated experimentally.
PDF

Efficient Representation method of Spatial cues for audio coding (오디오 채널 신호의 압축을 위한 공간 큐의 효율적 표현 방법)

Beack, Seung-Kwon;Kim, Min-Je;Lee, Tae-Jin;Jang, Dae-Young;Kang, Kyeong-Ok
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2008.02a
- /
- pp.183-186
- /
- 2008
본 논문은 공간영역에서의 오디오 채널 신호의 압축 방법에 있어서, 공간 파라메터의 효율적인 표현 방법을 제안하려 한다. 대상이 되는 공간 파라메터는 인간청각의 ILD(Internaural Level Difference) 인지와 관련한 공간 파라메터에 관한 것으로 ICLD(Inter-Channel Level Difference) 파라메터의 표현방법 관한 것이다. 본 논문의 목적은, ICLD 의 통계적 특성을 분석하고 이에 충실한 표현방법을 제안함으로써, 양자화 시 기존 표현 방법보다 왜곡율을 개선시킴으로써 복원된 오디오 신호의 충실도를 높이는 것을 목적으로 한다. 따라서 본 논문에서는, 새로운 ICLD 표현 방법을 소개하고 이에 대한 이론적 통계적 근거를 제시하며, 실험결과로써 기존 방법과 비교된 왜곡율 측정(distortion measure) 결과를 제시하여 제안된 방법의 우수성을 입증한다.
PDF

Enhanced Adjustment Strategy of Masking Threshold for Speech Signals in Low Bit-Rate Audio Coding (저전송률 오디오 부호화에서 음성 신호의 성능 개선을 위한 마스킹 임계값 적응기법 향상)

Lee, Chang-Heon;Kang, Hong-Goo
- The Journal of the Acoustical Society of Korea
- /
- v.29 no.1
- /
- pp.62-68
- /
- 2010
This paper proposes a new masking threshold adjustment strategy to improve the performance for speech signals in low bit-rate audio coding. After determining formant regions, the masking threshold is adjusted by using the energy ratio of each sub-band to the average energy of each formant. More quantization noises are added to the bands that have relatively large energy, but less distortion is allowed in spectral valley regions by allocating more bits, which reflects the concept of perceptual weighting widely used in speech coding. From the results of objective speech quality measure, we verified that the proposed method improves quality for the speech input signals compared to the conventional one.
https://doi.org/10.7776/ASK.2010.29.1.062 인용 PDF KSCI

The Development and Verification of Balance Insole for Improving the Muscle Imbalance of Left and Right Leg Using based Sound Feedback (청각 피드백이 적용된 좌우 불균형 개선을 위한 밸런스 인솔 개발 및 검증)

Kang, Seung-Rok;Yoon, Young-Hwan;Yu, Chang-Ho;Nah, Jae-Wook;Hong, Chul-Un;Kwon, Tae-Kyu
- Journal of rehabilitation welfare engineering & assistive technology
- /
- v.11 no.2
- /
- pp.115-124
- /
- 2017
This study was to develop the balance insole system for detecting and improving the muscle imbalance of left and right side in lower limbs. We were to verify the validation of balance insole system by analyzing the strategy of muscular activities and foot pressure according to sound feedback. We developed the balance insole based FSR sensor modules for estimating the muscle imbalance using detecting foot pressure. The insole system was FPCB have 8-spot FSR sensor with sensitivity range of 64-level. The participants were twenty peoples who have muscle strength differences in left and right legs over 20%. We measured the muscular activity and foot pressure of left and right side of lower limbs in various gait environment for verifying the improvement effect of muscle imbalance according to sound feedback. They performed gait in slope at 0, 5, 10, 15% and velocity at 3, 4, 5km/h. The result showed that the level of muscle imbalance reduced within 30% for sound feedback of balance insole system contrast to high level of muscle imbalance at 169.9~246.8% during normal gait for increasing slope and velocity. This study found the validation of balance insole system with sound feedback stimulus. Also, we thought that it is necessary to research on the sensitivity of foot area, detection of muscle imbalance and processing algorithm of correction threshold spot.
https://doi.org/10.21288/resko.2017.11.2.115 인용 PDF KSCI

BLIND AUDIO WATERMARKING TECHNIQUE USING SPECIFIC FREQUENCY SIGNAL (특정된 주파수 신호를 이용한 오디오 워터마킹)

Piao, Cheng-Ri;Han, Seung-Soo;Choi, Jong-Uk
- Proceedings of the KIEE Conference
- /
- 2002.07d
- /
- pp.2368-2372
- /
- 2002
멀티미디어의 저작권 보호를 위한 기술로서 워터마킹 기술은 현재 멀티미디어의 여러 분야에서 많이 연구되며 사용되고 있다. 이 기술은 컨텐츠가 질적으로 소비자에게 인식되지 않으며, 그리고 컨텐츠자체에 다양한 정보를 은닉하기 때문에 컨텐츠에 항상 포함되어 있다는 장점이 있다. 현재 MP3등과 같은 압축기술이 발달되어 있기 때문에 네트웍에 의한 데이터 전송성능이 향상되었고, 그러므로 디지털 데이터들이 유통이 활성화되었다. 이것으로 인하여 불법적으로 복제된 다양한 컨텐츠의 유통이 생산자의 이익을 해치고 있다. 디지털 오디오 컨텐츠의 소유권을 위하여, 본 논문에서는 압축에 대한 견고성을 제고하기 위하여 청각시스템의 마스킹 효과를 이용하여 시간영역에서 오디오신호에 특정된 주파수를 가진 워터마크 정보를 삽입하였다. 이 특정된 주파수는 반드시 압축에 살아남는 주파수 대역이어야 하며, 음질을 동시에 고려하여야 한다. 그리고 추출할 때는 FFT변환을 하여 주파수 대역에서 추출한다. 저작권 정보를 쉽게 확인하기 위하여 2진 송상을 워터마크 정보로 삽입하였다.
PDF

Search Result 29, Processing Time 0.026 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)