Search | Korea Science

Line feature extraction in a noisy image

Lee, Joon-Woong;Oh, Hak-Seo;Kweon, In-So
- 제어로봇시스템학회:학술대회논문집
- /
- 1996.10a
- /
- pp.137-140
- /
- 1996
Finding line segments in an intensity image has been one of the most fundamental issues in computer vision. In complex scenes, it is hard to detect the locations of point features. Line features are more robust in providing greater positional accuracy. In this paper we present a robust "line features extraction" algorithm which extracts line feature in a single pass without using any assumptions and constraints. Our algorithm consists of five steps: (1) edge scanning, (2) edge normalization, (3) line-blob extraction, (4) line-feature computation, and (5) line linking. By using edge scanning, the computational complexity due to too many edge pixels is drastically reduced. Edge normalization improves the local quantization error induced from the gradient space partitioning and minimizes perturbations on edge orientation. We also analyze the effects of edge processing, and the least squares-based method and the principal axis-based method on the computation of line orientation. We show its efficiency with some real images.al images.
PDF

Development of Real-Time Distributed Simulator and Controller Based on Virtual Machine (가상머신을 이용한 실시간 분산처리 시뮬레이터 및 제어기)

양광웅;박재현
- Journal of Institute of Control, Robotics and Systems
- /
- v.5 no.1
- /
- pp.115-121
- /
- 1999
Advanced digital computer technology enables the computer-based controllers to replace the traditional analog controllers used in factory automations. This replacement, however, brings up the side effects caused by the quantization error and non-real-time execution of control software. This paper describes the structure of real-time simulator and controller that can be used for design and verification of real-time digital controllers. The virtual machine concept adopted by the proposed real-time simulator makes the proposed simulator be independent from the specific hardware platforms. The proposed system can also be used in the loosely coupled distributed environments connected through local area network using real-time message passing algorithm and virtual data table based on the shared memory mechanism.
PDF

A Neural Network Based on Stochastic Computation using the Ratio of the Number of Ones and Zeros in the Pulse Stream (펄스열에서 1인 펄스수와 0인 펄스수의 비를 이용하여 확률연산을 하는 신경회로망)

민승재;채수익
- Journal of the Korean Institute of Telematics and Electronics B
- /
- v.31B no.7
- /
- pp.211-218
- /
- 1994
Stochastic computation employs random pulse streams to represent numbers. In this paper, we study a new method to implement the number system which uses the ratio of the numbers of ones and zeros in the pulse streams. In this number system. if P is the probability that a pulse is one in a pulse stream then the number X represented by the pulse stream is defined as P/(1-P). We propose circuits to implement the basic operations such as addition multiplication and sigmoid function with this number system and examine the error characteristics of such operations in stochastic computation. We also propose a neuron model and derive a learning algorithm based on backpropagation for the 3-layered feedforward neural networks. We apply this learning algorithm to a digit recognition problem. To analyze the results, we discuss the errors due to the variance of the random pulse streams and the quantization noise of finite length register.
PDF

The Effect of the Number of Clusters on Speech Recognition with Clustering by ART2/LBG

Lee, Chang-Young
- Phonetics and Speech Sciences
- /
- v.1 no.2
- /
- pp.3-8
- /
- 2009
In an effort to improve speech recognition, we investigated the effect of the number of clusters. In usual LBG clustering, the number of codebook clusters is doubled on each bifurcation and hence cannot be chosen arbitrarily in a natural way. To have the number of clusters at our control, we combined adaptive resonance theory (ART2) with LBG and perform the clustering in two stages. The codebook thus formed was used in subsequent processing of fuzzy vector quantization (FVQ) and HMM for speech recognition tests. Compared to conventional LBG, our method was shown to reduce the best recognition error rate by 0${\sim$}0.9% depending on the vocabulary size. The result also showed that between 400 and 800 would be the optimal number of clusters in the limit of small and large vocabulary speech recognitions of isolated words, respectively.
PDF

Automatic Clustering of Speech Data Using Modified MAP Adaptation Technique (수정된 MAP 적응 기법을 이용한 음성 데이터 자동 군집화)

Ban, Sung Min;Kang, Byung Ok;Kim, Hyung Soon
- Phonetics and Speech Sciences
- /
- v.6 no.1
- /
- pp.77-83
- /
- 2014
This paper proposes a speaker and environment clustering method in order to overcome the degradation of the speech recognition performance caused by various noise and speaker characteristics. In this paper, instead of using the distance between Gaussian mixture model (GMM) weight vectors as in the Google's approach, the distance between the adapted mean vectors based on the modified maximum a posteriori (MAP) adaptation is used as a distance measure for vector quantization (VQ) clustering. According to our experiments on the simulation data generated by adding noise to clean speech, the proposed clustering method yields error rate reduction of 10.6% compared with baseline speaker-independent (SI) model, which is slightly better performance than the Google's approach.
https://doi.org/10.13064/KSSS.2014.6.1.077 인용 PDF KSCI

Saliency Map Based Color Image Compression for Visual Quality Enhancement of Image (영상의 시각적 품질향상을 위한 Saliency 맵 기반의 컬러 영상압축)

Jung, Sung-Hwan
- Journal of Korea Multimedia Society
- /
- v.20 no.3
- /
- pp.446-455
- /
- 2017
A color image compression based on saliency map was proposed. The proposed method provides higher quality in saliency blocks on which people's attention focuses, compared with non-saliency blocks on which the attention less focuses at a given bitrate. The proposed method uses 3 different quantization tables according to each block's saliency level. In the experiment using 6 typical images, we compared the proposed method with JPEG and other conventional methods. As the result, it showed that the proposed method (Qup=0.5*Qx) is about 3.1 to 1.2 dB better than JPEG and others in saliency blocks in PSNR at the almost similar bitrate. In the comparison of result images, the proposed one also showed less error than others in saliency blocks.
https://doi.org/10.9717/kmms.2017.20.3.446 인용 PDF KSCI

Near Lossless Medical Image Compression using Wavelet Transform (웨이블릿변환을 이용한 무손실에 가까운 의료영상압축)

Yoon, Ki-Byung;Ahn, Chang-Beom
- Proceedings of the KOSOMBE Conference
- /
- v.1995 no.11
- /
- pp.113-116
- /
- 1995
Medical image compression using the wavelet transform has been tried. Due to the flexibility in representing nonstationary image signal in both time and frequency domains and its ability to adapt human visual characteristics, wavelet transform has unique advantage in images compression. In the proposed wavelet compression original image is decomposed into multi-scale bands. Different scale factors are employed in the quantization of wavelet decomposed images in different bands. For the lowest band, a predictor is designed and error signal is entropy coded. For high scale bands, runlength coding for toro run is used with Huffman coding. From simulation with magnetic resonance images($256\times256$ size, 256 graylevels) the proposed algorithm is superior to the JPEG by more than 2.5 dB in near lossless compression (CR = 8 - 10).
PDF

Lplacian Pyramid Coding Technique using a Finite State-Classified Vector Quantizer (유한상태 분류 벡터 양자기를 이용한 라플라시안 피라미드 부호화 기법)

박섭형;이상욱
- Journal of the Korean Institute of Telematics and Electronics
- /
- v.26 no.10
- /
- pp.1561-1570
- /
- 1989
In this paper, we propose an image coding scheme which combines the Laplacian pyramid structure and a hierarchical finite state classified vector quantizer in the DCT domain, namely FSDCT-CTQ. First, an optimal bit allocation problem for fixed rates DCT-CVQ on the Laplacian pyramid structure is described. In an asymptotic case, with an optimal bit allocation, a coding gain over scalar quantization of each Laplacian plane is derived. Second, it is experimentallhy shown that the Laplacian pyramid structure provides a considerable codng gain in the sense of total MMSE (minimum mean squared error). Finally, we propose an FS-DCT-CVQ which exploits the hierarchicla correlation between the Laplacian planes. Simulation results on real images show that the proposed coding scheme can reconstruct an image with 30.33 dB at 0.192 bpp, 32.45 dB at 0.385 bpp, respectively.
PDF

Adaptive Block Truncation Coding Based on Gradient Information (경사도를 이용한 적응 구획 절단 부호화)

신용달;이봉락;이건일
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.18 no.10
- /
- pp.1546-1552
- /
- 1993
We proposed an adaptive block truncation coding(BTC) using gradient and a new initial value. We used gradient of sobel operator as a new category classification coefficient to reduce Jagged appearance at edge part. We defined a new initial value to reduce large quantization error in the 4-level quantizer block including edge part. By computer simulations, we showed that the proposed method less computation load, reduced jagged appearance at edge part, also improved PSNR more than the conventional adaptive BTC.
PDF

A New Controller for Improvement of Response Time by Data Compression Using Color Space Conversion

Koo, Sung-Jo;Kim, Chang-Gon;An, Jong-Ki;Park, Man-Hyo;Yeo, Sang-Deog
- 한국정보디스플레이학회:학술대회논문집
- /
- 2005.07b
- /
- pp.863-864
- /
- 2005
In recent years, we use overdriving scheme to improve the response time of the liquid crystal. Since conventional overdriving scheme uses memory to perform ideal processing, it is desired to reduce system cost by decreasing the data stored in these SDRAMs. As a general compression method, quantization, sub-sampling and Block Truncation Coding ( BTC ) are used, which process data in block base and cause block effect. So we proposed new data compression method by color space conversion. Because this method compresses luminance and chrominance signal by different ratio, it can efficiently reduce error of block effect in decompression image.
PDF

Search Result 296, Processing Time 0.022 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)