Search | Korea Science

A Speech Enhancement Algorithm based on Human Psychoacoustic Property (심리음향 특성을 이용한 음성 향상 알고리즘)

Jeon, Yu-Yong;Lee, Sang-Min
- The Transactions of The Korean Institute of Electrical Engineers
- /
- v.59 no.6
- /
- pp.1120-1125
- /
- 2010
In the speech system, for example hearing aid as well as speech communication, speech quality is degraded by environmental noise. In this study, to enhance the speech quality which is degraded by environmental speech, we proposed an algorithm to reduce the noise and reinforce the speech. The minima controlled recursive averaging (MCRA) algorithm is used to estimate the noise spectrum and spectral weighting factor is used to reduce the noise. And partial masking effect which is one of the human hearing properties is introduced to reinforce the speech. Then we compared the waveform, spectrogram, Perceptual Evaluation of Speech Quality (PESQ) and segmental Signal to Noise Ratio (segSNR) between original speech, noisy speech, noise reduced speech and enhanced speech by proposed method. As a result, enhanced speech by proposed method is reinforced in high frequency which is degraded by noise, and PESQ, segSNR is enhanced. It means that the speech quality is enhanced.
https://doi.org/10.5370/KIEE.2010.59.6.1120 인용 PDF KSCI

Predicting the subjective loudness of floor impact noise in apartment buildings using neural network analysis (Neural Network Analysis를 이용한 공동주택 바닥충격음의 라우드니스 예측)

You, Byoung-Cheol;Jeon, Jin-Yong;Cho, Moon-Jae
- Proceedings of the Korean Society for Noise and Vibration Engineering Conference
- /
- 2002.11b
- /
- pp.474-479
- /
- 2002
In this research, the relationship between physical measurements and subjective evaluations of floor impact noise in apartment building was quantified by applying the neural network analysis due to its complex and nonlinear characteristics. The neural network analysis was undertaken by setting up L-value, inverse A index, Zwicker parameters and ACF/IACF factors, as input data, which came from the measurements at real suites of apartment building having various sound insulations. The subjective responses from the psychoacoustic experiments were extracted as output data. Then, the reliability of the quantitative prediction for the subjective loudness was evaluated.
PDF

Design of Audio Watermarks by Noise Shaping (잡음 형상화에 의한 오디오 워터마크 설계)

Lee, Jin-Geol
- Journal of Korea Multimedia Society
- /
- v.8 no.11
- /
- pp.1432-1438
- /
- 2005
A psychoacoustic model based noise shaping method is proposed. The method shapes the noise in the frequency domain such that its presence with a host signal will not be perceptually noticeable. The derivation of imperceptible noise levels from the masking thresholds of the signal involves deconvolution associated with the spreading function in the psychoacoustic model. It has been known as an ill-conditioned Problem. In this paper, a constrained optimization is applied such that the noise excitation level conforms to the masking thresholds of the signal. Thus, the noises embedded in the signal will not be perceived by human ear, and its performance is demonstrated experimentally.
PDF

Optimized Ambisonic Panning Algorithm Using Directional Psychoacoustic Criteria (방향심리인자를 이용한 최적 앰비소닉 패닝기법)

Lee, Sin-Lyul;Lee, Seung-Rae;Sung, Koeng-Mo
- The Journal of the Acoustical Society of Korea
- /
- v.25 no.1E
- /
- pp.8-13
- /
- 2006
In this paper, an Optimized Ambisonic Panning Algorithm (OAPA) which reduces sound localization error, is proposed. In the conventional Ambisonic Panning Algorithm (APA), sound localization is usually different from the panning angle, especially when listeners are not in an ideal listening position, because of low signal separation among other channels. To overcome this problem, an OAPA using window functions is proposed. A proper window function can be verified, comprising of higher harmonic components than 2M+1 and improved DPC and channel separation. Analysis results demonstrate that the proposed method results in higher signal separation among other channels and lower sound localization errors than the conventional APA.
PDF KSCI

Digital Watermarking Using Psychoacoustic Model

Poomdaeng, S.;Toomnark, S.;Amornraksa, T.
- Proceedings of the IEEK Conference
- /
- 2002.07b
- /
- pp.872-875
- /
- 2002
A digital watermarking technique applying psychoacoustic model for audio signal is proposed in this paper. In the watermarking scheme, the pseudo-random bit stream used as a watermark signal is embedded into the audio signal in both speech and music. The strength of the embedded signal is subject to the human auditory system in such a way that the disturbances on host audio signal are beyond the sensing of human ears. The experimental results show that the quality of the watermarked audio signal, in term of signal to noise ratio, can be improved up to 3.2 dB.
PDF

Noise suppressor Using Psychoacoustic Model and Wavelet Packet Transform (심리음향 모델과 웨이블릿 패킷 변환을 이용한 잡음제거기)

Kim, Mi-Seon;Kim, Young-Ju;Lee, In-Sung
- Proceedings of the IEEK Conference
- /
- 2006.06a
- /
- pp.345-346
- /
- 2006
In this paper, we propose the noise suppressor with the psychoacoustic model and wavelet packet transform. The objective of the scheme is to enhance speech corrupted by colored or non-stationary noise. If corrupted noise is colored, subband approach would be more efficient than whole band one. To avoid serious residual noise and speech distortion, we must adjust the Wavelet Coefficient threshold. In this paper, the subband is designed matching with the critical band. And WCT is adapted by noise masking threshold(NMT) and segmental signal to noise ratio(seg_SNR). Consequently this work improve the PESQ-MOS about 0.23 in the case of coded speech.
PDF

Wireless Speech Recognition System using Psychoacoustic Model (심리음향 모델을 이용한 무선 음성인식 시스템)

Noh, Jin-Soo;Rhee, Kang-Hyeon
- Journal of the Institute of Electronics Engineers of Korea CI
- /
- v.43 no.6 s.312
- /
- pp.110-116
- /
- 2006
In this paper, we implement a speech recognition system to support ubiquitous sensor network application services such as switch control, authentication, etc. using wireless audio sensors. The proposed system is consist of the wireless audio sensor, the speech recognition algorithm using psychoacoustic model and LDPC(low density parity check) for correcting errors. The proposed speech recognition system is inserted in a HOST PC to use the sensor energy effectively mil to improve the accuracy of speech recognition, a FEC(Forward Error Correction) system is used. Also, we optimized the simulation coefficient and test environment to effectively remove the wireless channel noises and correcting wireless channel errors. As a result, when the distance between sensor and the source of voice is less then 1.0m FAR and FRR are 0.126% and 7.5% respectively.
PDF KSCI

Sound Characteristics and Mechanical Properties of Taekwondo Uniform Fabrics (태권도 도복 직물의 소리 특성과 역학적 성질)

Jin, Eun-Jung;Cho, Gil-Soo
- Fashion & Textile Research Journal
- /
- v.14 no.3
- /
- pp.486-491
- /
- 2012
This study examined the sound characteristics of Taekwondo uniform fabrics to investigate the relationship between the sound parameters and the mechanical properties of the fabric as well as to provide the conditions to maximize the frictional sound of the uniform. Frictional sounds of 6 fabrics for Taekwondo uniforms were generated by the Simulator for Frictional Sound of Fabrics. The frictional speeds were controlled at low(0.62 m/s), at mid(1.21 m/s) and at high(2.25 m/s) speed, respectively. The frictional sounds were recorded using a Data Recorder and Sound Quality System subsequently, the physical sound properties such as SPL(Sound Pressure Level) and Zwicker's psychoacoustic parameters were calculated. Mechanical properties of specimens were measured by KES-FB. The SPL, Loudness(Z) values increased while Sharpness(Z) value decreased. In the physical sound parameter, specimen E had the highest SPL value at low speed and specimen B at high speed. In case of Zwicker's psychoacoustic parameters, the commercially available Taekwondo uniform fabrics(E, F) showed higher values of Loudness(Z), Sharpness(Z), and Roughness(Z), that indicates they can produce louder, shaper and rougher sounds than other fabrics for Taekwondo uniforms. The decisive factors that affected frictional sounds for Taekwondo uniforms were W(weight) as well as EM(elongation at maximum load) at low speed and WC(compressional energy) at high speed.
https://doi.org/10.5805/KSCI.2012.14.3.486 인용 PDF KSCI

Variable Bitrate MPEG Audio (가변 전송율 MPEG 오디오)

Nam, Seung-Hyon
- The Journal of Engineering Research
- /
- v.2 no.1
- /
- pp.57-62
- /
- 1997
Two psychoacoustic models used in MPEG-1 employ different masking patterns, different masking indexes, and different computational procedures. As a result, Model 1 is inferior to Model 2 due to its worst case approach in computing the SMR even though it determines tonality and masking levels accurately. In this study, we investigate the performances of psychoacoustic models when we modify the MPEG-1 audio coder for variable bitrates. Simulation results show that Model 2 has a gain of 30 kbps in the dual channel mode and 20 kbps in the joint stereo mode. It is generally known that the joint stereo mode has a gain in bitrate compare to the dual channel mode. For signals with frequent attacks, this gain becomes larger in Model 1 than in Model 2. This is due to the fact that Model 1 uses the worst case approach in computing the SMR to reduce pre-echo
PDF

Comparison of Noise Suppression Methods in Voice CODEC (음성부호화기에서의 잡음제거 방식 비교)

이진걸;기훈재
- Proceedings of the IEEK Conference
- /
- 1998.10a
- /
- pp.1203-1206
- /
- 1998
Considerable research in the last three decades has examined the problem of enhancement of speech degraded by additive background noise. We compare traditional methods such as spectral subtraction and Wiener filter, recently proposed psychoacoustic model based methods such as perceptual filter and noise suppression in EVRC in terms of performance and complexity.
PDF

Search Result 136, Processing Time 0.037 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)