Search | Korea Science

On the Performance of Sample-Adaptive Product Quantizer for Noisy Channels (표본적응 프러덕트 양자기의 전송로 잡음에서의 성능 분석에 관한 연구)

Kim Dong Sik
- Journal of the Institute of Electronics Engineers of Korea SP
- /
- v.42 no.3 s.303
- /
- pp.81-90
- /
- 2005
When we transmit signals, which are quantized by the vector quantizer (VQ), through noisy channels, the overall performance of the coding system is very dependent on the employed quantization scheme and the channel error effect. In order to design an optimal coding system, the source and channel coding scheme should be jointly optimized as in the channel-optimized VQ. As a suboptimal approach, we may consider the robust VQ (RVQ). In RVQ, we consider developing an index assignment function for mapping the output of quantizers to channel symbols so that the effect of the channel errors is minimized. Recently, a VQ, which can reduce the encoding complexity and is called the sample-adaptive product quantizer (SAPQ), has been proposed. SAPQ has very similar quantizer structure as to the product quantizer (PQ). However, the quantization performance can be better than PQ. Further, the encoding complexity and the memory requirement for the codebooks are lower than the regular full-search VQ case. In this paper, SAPQ is employed in order to design an RVQ to channel errors by reducing the vector dimension. Discussions on the codebook structure of SAPQ and experiments are introduced in an aspect of robustness to noisy channels.
PDF KSCI

Protection Algorithm of the Multimedia Contents in the Mobile Environment (모바일 환경하에서 멀티미디어 컨텐츠 보호 알고리즘)

Kim Hang-Rae;Park Young;Choi Nam-Hyung
- Journal of Digital Contents Society
- /
- v.5 no.1
- /
- pp.87-94
- /
- 2004
In this paper, the digital watermarking algorithm is proposed using CDMA technique for protection of the mobile contents in the mobile environment. The digital watermarking was designed to robust the errors in the mobile environment where pathloss, multipath fading, interference, and noise exist. In case of the multimedia content service in the mobile environment, the construction method of the watermark, the algorithm of insertion and detection are also proposed. The watermark consists of the information of the mobile user. Invisibility and robustness required in watermarking are etimated. It is observed that PSNR of the mobile content inserted the watermark is 90.31 dB, and the signal processing and noise attack are also robust. Especially, because random noise occurs in wireless transmission can overcome, the proposed watermarking algorithm is adequate for protection of the multimedia contents in the mobile environment.
PDF

A Noise-Tolerant Hierarchical Image Classification System based on Autoencoder Models (오토인코더 기반의 잡음에 강인한 계층적 이미지 분류 시스템)

Lee, Jong-kwan
- Journal of Internet Computing and Services
- /
- v.22 no.1
- /
- pp.23-30
- /
- 2021
This paper proposes a noise-tolerant image classification system using multiple autoencoders. The development of deep learning technology has dramatically improved the performance of image classifiers. However, if the images are contaminated by noise, the performance degrades rapidly. Noise added to the image is inevitably generated in the process of obtaining and transmitting the image. Therefore, in order to use the classifier in a real environment, we have to deal with the noise. On the other hand, the autoencoder is an artificial neural network model that is trained to have similar input and output values. If the input data is similar to the training data, the error between the input data and output data of the autoencoder will be small. However, if the input data is not similar to the training data, the error will be large. The proposed system uses the relationship between the input data and the output data of the autoencoder, and it has two phases to classify the images. In the first phase, the classes with the highest likelihood of classification are selected and subject to the procedure again in the second phase. For the performance analysis of the proposed system, classification accuracy was tested on a Gaussian noise-contaminated MNIST dataset. As a result of the experiment, it was confirmed that the proposed system in the noisy environment has higher accuracy than the CNN-based classification technique.
https://doi.org/10.7472/jksii.2021.22.1.23 인용 PDF KSCI HTML

Performance Analysis of Correntropy-Based Blind Algorithms Robust to Impulsive Noise (충격성 잡음에 강인한 코렌트로피 기반 블라인드 알고리듬의 성능분석)

Kim, Namyong
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.40 no.12
- /
- pp.2324-2330
- /
- 2015
In blind signal processing in impulsive noise environment the maximum cross-correntropy (MCC) algorithm shows superior performance compared to MSE-based algorithms. But optimum weight conditions of MCC algorithm and its properties related with robustness to impulsive noise have not been studied sufficiently. In this paper, through the analysis of the behavior of its optimum weight and the relationship with the MSE-based LMS algorithm, it is shown that the optimum weight of MCC and MSE-based LMS have an equal solution. Also the factor that keeps optimum weight of MCC undisturbed and stable under impulsive noise is proven to be the magnitude controlled input through simulation.
https://doi.org/10.7840/kics.2015.40.12.2324 인용 PDF KSCI

User Adjustment Post-Process Using Neural Network In Isolated Word Speech Recognition (고립단어 음성인식에서 신경망을 이용한 사용자 적응형 후처리)

Kim, Young-Jin;Kim, Eun-Ju;Kim, Myoung-Won
- Proceedings of the Korean Information Science Society Conference
- /
- 2005.11b
- /
- pp.736-738
- /
- 2005
최근 PDA나 PMP와 같은 개인용 모바일 기기의 인터페이스 개발로써 잡음환경에 강인한 음성인식 기술들이 연구되고 있으며 이러한 방법으로 오류패턴, 순차패턴, 의미정보, 문맥정보와 같이 인식기에 독립적인 정보를 이용하거나 영상 정보와 같이 언어와 성격이 다른 이질적인 정보를 이용하여 후처리를 하는 연구들이 진행되어 왔다. 그러나 인식기와 독립적인 정보로 후처리를 하는 방법들의 인식률은 인식기의 사전 인식률이 주변 잡음에 의해 떨어질 경우 후처리 인식률도 같이 떨어지는 현상이 벌어진다. 따라서 본 논문에서는 주변 잡음으로 인한 인식기의 사전 인식률에 저하를 줄이는 방법으로 사용자 적응형 후처리를 제안한다. 사용자 적응형 후처리에 사용되는 데이터는 사용자의 발화에 대한 인식기의 출력 값들이며, 출력 값들은 화자독립모델에 의해 계산되는 각 단어들의 유사도 들이다. 따라서 화자독립모델의 결과를 사용자 적응형 후처리에 적용한 결과 인식기의 오류를 $58.7\%$ 줄일 수 있었다.
PDF

Speech Enhancement Based on Improved Minima Controlled Recursive Averaging Incorporating GSAP (전역 음성 부재 확률 기반의 향상된 최소값 제어 재귀평균기법을 이용한 음성 향상 기법)

Song, Ji-Hyun;Bang, Dong-Hyeouck;Lee, Sang-Min
- Journal of the Institute of Electronics Engineers of Korea SP
- /
- v.49 no.1
- /
- pp.104-111
- /
- 2012
In this paper, we propose a novel method to improve the performance of the improved minima controlled recursive averaging (IMCRA). From an examination for various noise environment, it is shown that the IMCRA has a fundamental drawback for the noise power estimate at the offset region of continuity speech signals. Espectially, it is difficult to obtain the robust estimates of the noise power in non-stationary noisy environments that is rapidly changed the spectral characteristics such as babble noise. To overcome the drawback, we apply the global speech absence probability (GSAP) conditioned on both a priori SNR and a posteriori SNR to the speech detection algorithm of IMCRA. With the performance criteria of the ITU-T P.862 perceptual evaluation of speech quality (PESQ) and a composite measure test, we show that the proposed algorithm yields better results compared to the conventional IMCRA-based scheme under various noise environments. In particular, in the case of babble 5 dB, the proposed method produced a remarkable improvement compared to the IMCRA ( PESQ = 0.026, composite measure = 0.029 ).
PDF KSCI

Speech Recognition with Image Information (영상정보 보완에 의한 음성인식)

이천우;이상원;양근모;박인정
- Proceedings of the IEEK Conference
- /
- 1999.06a
- /
- pp.511-515
- /
- 1999
The main factor decreasing speech recognition rate is the surrounding noise. To lower the noise effect, we generally used the filter bank at preprocessing stage. But, in this paper, we tried to recognize the 10 numeral numbers using 2-D LPC to extract image feature. At first, we obtained the result of speech-only recognition using 13th-order LPC coefficients and then, for distorted speech recognition results of ‘0’, ‘4’, ‘5’, ‘6’ and 9’, we added image parameters such as 12th-order 2-D LPC coefficients. At each frame, we extracted the 2-D LPC coefficients, and simulated recognizer with two parameters such as speech and image. Finally, for the numbers, such as ‘4’and ‘9’, the better results were obtained.
PDF

Indoor Positioning using the Wavelet and Neural Network (WLAN 기반의 웨이블릿과 신경망을 이용한 실내위치인식 연구)

Kim, Jong-Bae
- Proceedings of the IEEK Conference
- /
- 2008.06a
- /
- pp.775-776
- /
- 2008
본 논문은 WLAN 환경에서 웨이블릿과 신경망을 사용한 실내 위치인식 방법을 제안한다. 제안한 방법의 기본적인 아이디어는 경제적이면서 효율적인 방법으로써 실내에 설치된 무선 AP들로부터 수신된 신호들의 수신세기로부터 비교적 정확하게 위치를 추정하는 연구이다. 일반적으로 무선 신호는 주위 환경 및 건물 구조적 요인에 의해 수신세기가 더해지거나 감해지는 현상이 발생함으로써 수신된 각 신호세기로부터 신호 잡음과 오류에 강인하고 시간과 주파수 도메인의 정보 표현이 가능하며 각 신호세기들간의 구별성을 갖는 특징값 획득이 필요하다. 제안 방법에서는 웨이블릿 변환을 이용하여 수신된 신호로부터 중복 데이터와 잡음에 민감하게 반응하지 않는 특징값을 획득하고, 수신신호 전역 및 지역적 특징의 표현이 가능한 신경망을 사용하여 실내위치인식 방법은 제안한다. 실험 결과 실내위치 인식률이 94 % 이상 제시하였다.
PDF

Robust Speech Enhancement Based on Soft Decision Employing Spectral Deviation (스펙트럼 변이를 이용한 Soft Decision 기반의 음성향상 기법)

Choi, Jae-Hun;Chang, Joon-Hyuk;Kim, Nam-Soo
- Journal of the Institute of Electronics Engineers of Korea SP
- /
- v.47 no.5
- /
- pp.222-228
- /
- 2010
In this paper, we propose a new approach to noise estimation incorporating spectral deviation with soft decision scheme to enhance the intelligibility of the degraded speech signal in non-stationary noisy environments. Since the conventional noise estimation technique based on soft decision scheme estimates and updates the noise power spectrum using a fixed smoothing parameter which was assumed in stationary noisy environments, it is difficult to obtain the robust estimates of noise power spectrum in non-stationary noisy environments that spectral characteristics of noise signal such as restaurant constantly change. In this paper, once we first classify the stationary noise and non-stationary noise environments based on the analysis of spectral deviation of noise signal, we adaptively estimate and update the noise power spectrum according to the classified noise types. The performances of the proposed algorithm are evaluated by ITU-T P. 862 perceptual evaluation of speech quality (PESQ) under various ambient noise environments and show better performances compared with the conventional method.
PDF KSCI

A Study on Power Variations of Magnitude Controlled Input of Algorithms based on Cross-Information Potential and Delta Functions (상호정보 에너지와 델타함수 기반의 알고리즘에서 크기 조절된 입력의 전력변화에 대한 연구)

Kim, Namyong
- Journal of Internet Computing and Services
- /
- v.18 no.6
- /
- pp.1-6
- /
- 2017
For the algorithm of cross-information potential with delta functions (CIPD) which has superior performance in impulsive noise environments, a new method of employing the information of power variations of magnitude controlled input (MCI) in the weight update equation of the CIPD is proposed in this paper where the input of CIPD is modified by the Gaussian kernel of error. To prove its effectiveness compared to the conventionalCIPD algorithm, the distance between the current weight vector and its previous one is analyzed and compared under impulsive noise. In the simulation results the proposed method shows a two-fold improvement in steady state stability, faster convergence speed by 1.8 times, and 2 dB - lower minimum MSE in the impulsive noise situation.
https://doi.org/10.7472/jksii.2017.18.6.1 인용 PDF KSCI

Search Result 230, Processing Time 0.028 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)