Proceedings of the IEEK Conference (대한전자공학회:학술대회논문집)
The Institute of Electronics and Information Engineers (IEIE)
- 기타
2000.11d
-
In this paper, a technique to estimate the circular object's center and radius under noisy condition is described. The technique is based on Davies'Hough transform approach to circular object location but more robust to noise and faster to estimate the circle by using an area of cumulated points.
-
This paper proposes an improved image illumination estimation method based on the conventional color constancy algorithm. The most important process of color constancy algorithm is the estimation of the spectral distributions of illuminant of an input image. To estimate of the spectral distributions of illuminant of an input image, we use the brightest pixel values and the values of surface reflectance of an input image using a principal component analysis of the given munsell chips. We estimate a CIE tristimulus values of an input image using the estimated .spectral distribution of illuminant and recover an image by scaling it regularity. From the experimental results, the proposed method was effective in estimating the image illumination
-
This paper presents a robust method to combine a collection of images with small fields of view to obtain an image with a large field of view. In the previous works, there are two main areas which one is a cross correlation-based method and the other is a feature-based method. The former is based on motion estimation from video sequences. so there are a problem on rotating a camera about optical axis. In the latter method, it is difficult to match correspondence feature points correctly.'re find correct correspondences, we proposed the geometrical feature model and correspondence filters and the Gaussian distribution weight function to blend the images smoothly. The experiments show that our method is robust and effective.
-
This paper describes a shape-based object recognition algorithm using multiple distance images. For the images containing dense edge points and noise, previous Hausdorff distance (HD) measures yield a high ms error for object position and many false matchings for recognition. Extended version of HD measure considering edge position and orientation simultaneously is proposed for accurate matching. Multiple distance images are used to calculate proposed matching measure efficiently. Results are presented for visual images and infrared images.
-
기체에 대한 영상처리 기술은 그 응용 분야가 매우 넓고, 그에 따른 산업적 · 경제적 중요성도 증가되게된다 한 예로서 자동으로 공장오염 감시나 산불감시등에 사용되는 영상기기에 곧바로 기체에 대한 영상처리 기술들이 필요하다. 그러나 기체는 고체와는 다른 다음과 같은 특성들을 가지고 있다. 첫째, 고체의 경우 물체의 경계선이 비교적 분명하지만, 기체의 경우 하나의 기체 내에서도 밀도 분포가 다르기 때문에 그 경계선에서도 밀도가 불규칙하여서 기체의 경계선을 정확히 정의하기 힘들다. 둘째, 기체 분석을 위한 영상들은 대체로 잡음이 많고, 기체의 크기에 비하여 해상도가 낮다. 따라서 기체 영상은 픽셀(Pixel) 단위로 분석 처리하기가 어렵다. 위와 같은 기체가 가지는 특성 때문에 고체에 대한 영상 처리 기술을 기체에 직접적으로 적용하기는 불가능하다. 본 연구에서는 화상 데이터에서 기체를 감지하여 추적하는 시스템을 개발하고자 한다.
-
Gaze detection is to find out the position on a monitor screen where a user is looking at, using the computer vision processing. This System can help the handicapped to use a computer, substitute a touch screen which is expensive, and navigate the virtual reality. There are basically two main types of the study of gaze detection. The first is to find out the location by face movement, and the second is by eye movement. In the gaze detection by eye movement, we find out the position with special devices, or the methode of image processing. In this paper, we detect not the iris but the pupil from the image captured by Head-Mounted Camera with infra-red light, and accurately locate the position where a user looking at by A(fine Transform.
-
This paper proposes a motion segmentation algorithm for layer decomposition of image sequences. The proposed algorithm segments an image into initial regions by using its color and texture and computes a motion model of each initial region. Each pixel assigns one of the motion represented by the models or a motion except them, which segments the image into the motion regions. The proposed algorithm is app]ied image sequences and the segmented motion is shown.
-
To create a more realistic soccer game derived from TV images, we are developing an image synthesis system that generates 3D image sequence from TV images. We propose the method for the team and the pose recognition of players in TV images. The representation includes camera calibration method, team recognition method and pose recognition method. To find the location of a player on the field, a field model is constructed and a player's field position is transformed by 4-feature points. To recognize the team information of players, we compute RGB mean values and standard deviations of a player in TV images. Finally, to recognize pose of a player, this system computes the velocity and the ratio of player(height/width). Experimental results are included to evaluate the performance of the team and the pose recognition.
-
영상검색에서 컬러 히스토그램은 폭넓게 사용되고 있으며, 여러 가지 장점이 있음에도 불구하고 공간정보를 포함하지 못하는 문제점이 있다. 본 논문에서는 컬러 히스토그램에 공간정보를 포함시키기 위해 영상을 몇 개의 블록으로 나누어 계층적으로 컬러 히스토그램을 비교하는 방법을 제안하였다. 실험을 통하여 제안한 방법이 컬러 히스토그램을 이용한 방법보다 더 좋은 결과를 나타냄을 확인하였다.
-
In this paper, we present an efficient face detection algorithm for locating vertical views of human faces in complex scenes. The algorithm models the distribution of human skin color in YCbCr color space and find various ace candidate regions. Face candidate regions are found by thresholding with predetermined thresholds. For each of these face candidate regions, The sobel edge operator is used to find edge regions. For each edge region, we used an ellipse detection algorithm which is similar to hough transform to refine the candidate region. Finally if a substantial number of he facial features (eye, mouth) are found successfully in the candidate region, we determine he ace candidate region as a face region. e show empirically that the presented algorithm an find the face region very well in the complex scenes.
-
In this paper, we propose a region-based segmentation algorithm to extract human face area using a window function and neural networks. Furthermore, we apply the erosion and dilation to remove small error areas. By applying the window function, it is possible to reduce error. In particular, false segmentation of the eye and the lip can be considerably reduced. Experiments show promising results and it is expected that the Proposed method can be applied to video conference and still image compression.
-
This paper is to propose an algorithm to reduce the calculation time to perform the 2-dimensional Discrete Wavelet Transform(2DWT). We call this algorithm as Reduced 2-dimensional Discrete Wavelet Transformation(R2DWT). This algorithm uses a modified Mallat-tree such that in each level, the column transform is performed only with the low-pass filtered row transform result. The resulting number of sub-band regions is 2L+1, meanwhile the original(2DWT) has 3L+1 sub-regions, where L is the transform level. To show the proposed algorithm is useful without much loss in SNR(Signal-to-Noise Ratio), we performed experiments with various images. The results showed that above 5:1 in compression ratio, the proposed algorithm has less than 0.SdB difference in SNR from 2DWT with about 25% reduction in calculation time.
-
In progressive image coding, if object region that have main contents in image are transmitted prior to the remained region, this method will be very useful. In this paper, the progressive image coding based on SPIHT using object region transmission method by priority is proposed. First, an original image is transformed by wavelet. Median filtering is used about wavelet transformed coefficient region for extracting object region. This extracted object region encoded by SPIHT. Then encoded object region are transmitted in advance of the remained region. This method is good to a conventional progressive image coding about entire original image. Experimental results show that the proposed method can be very effectively used for image coding applications such as internet retrieval and database searching system.
-
This paper presents a new adaptive fast motion estimation algorithm along with its architecture. The conventional algorithm such as full - search algorithm, three step algorithm have some disadvantages which are related to the amount of computation, the quality of image and the implementation of hardware, the proposed algorithm uses spatial correlation and a slope of motion vector in order to reduce the amount of computation and preserve good image quality, The proposed algorithm is better than the conventional Block Matching Algorithm(BMA) with regard to the amount of computation and image quality. Also, we propose an efficient at chitecture to implement the proposed algorithm. It is suitable for real time processing application.
-
The purpose of this study is prove the effectiveness of an energy subtraction image for the detection of pulmonary nodules and the effectiveness of multi-resolutional filter on an energy subtraction image to detect pulmonary nodules. Also we examine influential factors to the accuracy of detection of pulmonary nodules from viewpoints of types of images and evaluation methods. As one type of images, we select energy subtraction X-ray images, at the same time is done ▽
$^2$ G filter and multi-resolutional filter. Here select two evaluation methods and make clear the effectiveness of multi-resolutional filter on an energy subtraction image. -
This paper describes an approach for extracting invariant features using a view-based representation and recognizing an object with a high speed search method in FLIR. In this paper, we use a reformulated eigenspace technique based on robust estimation for extracting features which are robust for outlier such as noise and clutter. After extracting feature, we recognize an object using a partial distance search method for calculating Euclidean distance. The experimental results show that the proposed method achieves the improvement of recognition rate compared with standard PCA.
-
This study proposes a new calculation method for generating real nighttime lamp-lit images. In order to improve the color appearance in the prediction of a nighttime lamp-lighted scene, the lamp-lit image is synthesized based on spectral distribution using the estimated local spectral distribution of the headlamps and the surface reflectance of every object. The principal component analysis method is introduced to estimate the surface color of an object, and the local spectral distribution of the headlamps is calculated based on the illuminance data and spectral distribution of the illuminating headlamps. HID and halogen lamps are utilized to create beam patterns and captured road scenes are used as background images to simulate actual headlamp-lit images on a monitor. As a result, the reproduced images presented a color appearance that was very close to a real nighttime road image illuminated by single and multiple headlamps.
-
A Scene Based Technique(SBT) that corrects linear array infrared detector's nonuniformity is proposed. Basically, this technique dispenses with using temperature references on a linear array infrared detector. To correct the nonuniformity of infrared images, we use three methods. Firstly, we detect bad channels by using the information which is cumulated all the same line pixels. Secondly, a variable window method is applied to compensate more accurately. Thirdly, an adaptive method which updates gain and offset coefficient is used only on a stationary region. These results are demonstrated on a computer simulation with various images. As a result, the nonuniformity is corrected completely, so that images are enhanced and PSNR(peak signal to noise ratio) is improved much.
-
In this paper, we present a new automatic thresholding algorithm based on maximum entropy of two-dimensional pixel histogram. While most of the previous algorithms select thresholds depending only on the histogram of gray level itself in the image, the presented algorithm considers 2D relational histogram of gray levels of two adjacent pixels in the image. Thus, the new algorithm tends to leave salient edge features on the image after thresholding. The experimental results show the good performance of the presented algorithm.
-
Cellular automata are discrete dynamical systems whose behaviour is completely specified in terms of a local relation. If cellular automata convergence to fixed points, then it can be used to image processing. From the generalized Potts automata point of view, we propose in this paper a cellular automata technique for reducing image noise. To minimize blurring effect, an algorithm based on neighborhood median computation is Preferred. Experimental results are reported.
-
We present a new approach to analyzing the dynamic false contour noise of AC plasma display panels (PDP), which is known to degrade the image quality severely. Compared with the existing methods that consider only the amount of light emission from PDP during 1 field time, the proposed approach uses the impulse response model of the human vision system and estimates how the human beings actually feel as the function of time. Experimental results using various benchmark sub-field scan algorithms are included.
-
In this paper, an effective image restoration using Genetic Algorithm(GA) in wavelet transform region is proposed. First, a wavelet transform is used for decomposition of a blurred image with white Gaussian noise as a preprocessing of the proposed method. The wavelet transform decomposes a degraded image into a wavelet subband coefficient planes. In this wavelet transformed subband coefficient planes, three highest subbands is composed entirely of noise elements on a degraded image. So, these subbands are removed. And remained subbands except for the lowest subband are individually applied to GA. For the performance evaluation, the proposed method is compared with a conventional single GA algorithm and a conventional hybrid method of wavelet transform and GA for a Lenna image and a boat image. As an experimental result, the proposed algorithm is prior to a conventional methods as each PSNR 3.4dB, 1.3dB.
-
In this paper, we present an improved novel interpolator that performs high quality interpolation on both synthetic and real world images. Its structure, which is based on a four directional linear predictor with equiripple windowed samples and phase matching equalizer, provides edge-directional data interpolation so that sharp and artifacts-free images are obtained at a reasonable computational cost.
-
This paper presents a new approach to the vertex based shape coding technique. The conventional approaches encode objects using a spline method with the same distortion coefficients. The proposed approach, however, classifies the objects based on the object's features, and then applies different distortion values depending on the classified object types. Using this pre-classifying technique, this paper reduces the bit rate and the computational complexity necessary for the encoding process. The performance of the proposed method has been proved by experiments on the various sample Images.
-
An efficient bit rate distribution technique that distributes available bits for multiple objects based on motion vector magnitude, size of object shape, and coding distortion is presented. This coding concept using the three parameters was exploited in MPEG-4 multiple object coding. But the scheme is likely to produce poor results such as allocating more bits to less important objects and degrading picture quality, due to the lack of analysis and research in view of human visual aspect. In this paper importance of each object is represented by the three parameters and visually analyzed. Target bits are distributed according to coding distortion using the pre-assigned shape and motion information.
-
본 논문에서 제안한 방법은 DCT 임베이디드 동영상 부호화기를 사용하여 부호화기의 레이트 디스토션 성능과 기존 프레임과 예측 프레임간의 의존성을 이용한 디스토션이 일정한 효율적인 비트율 제어 알고리즘을 제안한다. 다양한 표준 동영상에 대해 컴퓨터 모의 실험을 수행하고 기존 방법과의 비교를 통해 제안방법의 유효성을 검증하고 제안된 알고리즘의 부호화 효율을 확인했다.
-
Bitstrems corrupted by channel errors are not only difficult to be decoded but also propagate error to other part of the bitstreams when highly compressed video is transmitted over channels with noise such as mobile communication channels. In this paper, error concealment algorithm performed in decoder is proposed when errors occur for transmission. Proposed algorithm searches moving area with homogeneous movement in neighbored blocks when motion vectors are damaged, then recovers motion vectors of missing blocks considering where missing blocks are belong to. Experiment result shows that proposed algorithm exhibits better performance in PSNR than existing error concealment method.
-
We present a new variable step size LMS algorithm using the correlation between reference input and error signal of adaptive filter. The proposed algorithm updates each weight of filter by different step size at same sample time. We applied this algorithm to adaptive multip]e-notch filter. Simulation results are presented to compare the performance of the proposed algorithm with the usual LMS algorithm and another variable step algorithm.
-
Recently, many complex DSP (Digital Signal Processing) algorithms have being realized on RISC CPU due to good compilation, low power consumption and large memory space. But, real-time implementation of multiple DSP algorithms on RISC requires the minimum and efficient memory usage and the lower occupancy of CPU. In this thesis, the original floating-point code of MPEG-1 audio decoder is converted to the fixed-point code and then optimized to the efficient assembly code in time-consuming function in accord with RISC feature. Finally, compared with floating-point and fixed-point, about 30 and 3 times speed enhancements are achieved respectively. And 3~4 times memory spaces are spared.
-
In this paper, the portable game machine called W"alking Beat" is designed and implemented not only to propose the new possibilities for the peripheral equipment market of portable acoustic machine but also to overcome the limitation of the acoustic simulation game machine such as the existing Beat Mania. The old game machine can be used only in a particular place, where it is installed. However, in order to get over the constraint on this space problem "Walking Beat Game Machine" is designed to facilitate the portability. In addition, the frequency analysis method using FFT algorithm is employed by regarding the music data for the portable digital acoustic machine as source so that the limitation that the existing game machine depends heavily on the previously registered game contents can be overcome. By making it possible to play games for all the music and putting an emphasis on multimedia trend only to listen to the music, it is speculated that it can contribute to the development of the new culture in the near future.
-
This paper proposes a low-Power DDC(Digital Down Converters) architecture for IF(Intermediate frequency) signal processing. It is shown that concept of conventional interpolated FIR filters can be expanded to IIR filters for DDC applications. Also in the paper, power dissipations for the proposed architecture and conventional ones are estimated.
-
This paper presents the circuit design and implementation of a HomePNA (Home Phoneline Network Alliance) 1M8 PHY transceiver for specification ver1.1. This paper describes a physical medium interface, an Ethernet MAC controller unit interface, and a management interface of the HomePNA transceiver. The designed HomePNA transceiver can support any specifications having more than 32Mbits/sec(maximum in HomePNA ver2.0) transmission rate by changing physical medium interface, because Ethernet MAC controller unit interface has been designed by using MII.
-
The ASIC Design of the Adaptive De-interlacing Algorithm with Improved Horizontal and Vertical EdgesIn this paper, the ADI (Adaptive De-interlacing) algorithm is proposed, which improves visually and subjectively horizontal and vertical edges of the image processed by the ELA(Edge Line-based Average) method. This paper also proposes a VLSI architecture for the proposed algorithm and designed the architecture through the full custom CMOS layout process. The proposed algorithm is verified using C and Matlab and implemented using 0.6
$\mu\textrm{m}$ 2-poly 3-metal CMOS standard libraries. For the circuit and logic simulation, Cadence tool is used. -
In this paper, luminance mapping for uniform color distribution and gamut mapping for maximum chroma reproduction are proposed. In the conventional lightness mapping, the average lightness difference between the two gamut is increased and different color changes in bright and dark regions are also increased. To solve these problems, a lightness mapping is proposed that minimizes the lightness difference of the cusps at each hue angle and produces same color changes in bright and dark regions. Also, chroma mapping that utilize variable anchor point and an anchor point are proposed for maximum chroma reproduction and uniform color change. The proposed algorithm reduce a sudden color change on the gamut boundary of the printer and to maintain a uniform color change during the mapping process. Accordingly, the proposed algorithm can reproduce high quality images with low-cost color devices.
-
본 논문에서는 기존의 중간조 처리 방법들의 단점을 개선하고 원영상의 색을 충실히 재현하기 위해 도트 패턴 데이터베이스를 사용한 모델 기반의 중간조처리 방법을 제안한다. 제안한 방법은 우수한 화질의 풀력 영상을 얻기 위해 BNM을 기반으로 도트 패턴을 생성한 후 원형 도트 중첩 모델과 하드웨어의 점이득을 적용하여 도트 패턴 데이터베이스를 생성한다. 도트 패턴 데이터베이스는 하나의 밝기값에 도트 패턴각각 하나씩 구성되므로 출력 영상에서 원영상 화소의 색을 충실히 재현할 수 있다. 이 과정에서 인간 시각특성을 적용하여 현재 화소의 색에 대해 국부적으로 인간 시각에 적합한 도트 패턴을 선택한다.
-
We propose a statically motivated scene change detection algorithm. As the difference between the neighboring frames will generate peaks at scene boundaries, the problem of detecting fast scene changes is equivalent to detecting peaks in a given sequence. In this paper, the peak detection is performed via several statistics, namely the sample means and variances. For eliminating flash lights as well as detecting fast scene changes within a small number of frames, we have opted to use a two-stage process for computing the necessary statistics. The results indicate superiority of necessary statistics. The results indicate superiority of the proposed algorithm over the previously reported algorithm.
-
In this paper, the quantization noise in block-based video coding is analyzed, and a post-processing method based on the analysis is presented for reducing the quantization noise by using a wavelet transform(WT). In the proposed method, the quantization noise is considered as the sum of a blocking noise expressed as a deterministic profile and the random remainder noise. Each noise is removed in a viewpoint of image restoration using a 1-D WT, which yields a regularized differentiation. The blocking noise first is reduced by weakening the strength of each blocking noise component that appears as an impulse in the first scale wavelet domain. The impulse strength estimation is performed using median filter, quantization parameter(QP), and local activity. The remainder noise, which is considered as a white noise at non-edge pixels, then is reduced by soft-thresholding. The experimental results show that the proposed method yields better performance in terms if subjective quality as well as PSNR performance over VM post-filter in MPEG-4 for all test sequences of various compression ratios. We also present a fast post-processing in spatial domain equivalent to that in wavelet domain for real-time application.
-
In this paper, we propose an efficient method to detect shot changes in compressed MPEG video data by using reference features among video frames. The reference features among video frames imply the similarities among adjacent frames by prediction coded type of each frame. A shot change is detected if the similarity degrees of a frame and its adjacent frames are low. And the shot change detection algorithm is improved by using Fuzzy c-means (FCM) clustering algorithm. The FCM clustering algorithm uses the shot change probabilities evaluated in the mask matching of reference ratios and difference measure values based on frame reference ratios.
-
In this paper, we will present a lossy data compression method for coding multispectral images. The proposed method uses both spatial and spectra] correlation inherent in multispectral images. First, band 2 and band 6 are vector quantized. Secondly, band 4 is estimated with the quantized band 2 using the predictive coding. Errors of band 4 are encoded at a second stage based on the magnitude of the errors. Thirdly, remaining bands are calculated with the quantized band 2 and band 4. Errors of residual bands are wavelet transformed and then we apply the SPIHT coding on the transformed coefficients. We classify classes without extra information transmitting and then use linear predictor. And errors can be encoded by SPIHT coding at any target rate we are want. It is shown that this method has better performance than FPVQ. Average PSNR rises 0.645 dB at the same bit rate.
-
In this paper, we present a fast algorithm for the motion estimation using the efficient selection of an initial search position. In the method, we select the initial search position using the motion vector from the subsmpled images, the predicted motion vector from the neighbor blocks, and the (0,0) motion vector. While searching the candidate blocks, we use the spiral search pattern with the successive elimination algorithm(SEA) and the partial distortion elimination(PDE). The experiment results show that the complexity of the proposed algorithm is about 2∼3 times faster than the three-step search(TSS) with the PSNR loss of just 0.05[dB]∼0.1[dB] than the full search algorithm PSNR. The search complexity can be reduced with quite a few PSNR loss by controling the number of the depth in the spiral search pattern.
-
In this paper, We try to design combined source-channel coder that is compatible with video coding standards. This MAP decoder is proposed by adding semantic structure and semantic constraint of video coding standards to the method using redundnacy of the MAP decoders proposed previously. Then, We get the better performance than usual channel coder's.
-
The object level image compression is a useful technology for reducing the necessary data and manipulating individual objects. In this paper, we propose a new image object compression algorithm that uses the quadratic programming (QP) method to reduce the compressed data. The results indicate the superiority of the proposed QP based algorithm over the low pass extrapolation (LPE) method of MPEG-4.
-
논문은 웨이블릿(wavelet) 변환된 각 프레임의 모든 부대역의 블록들에 대해 계층적 움직임을 추정할때 고해상도 계층에서는 기저대역에서 추정된 전역 움직임 벡터를 기초로 하여 국부 움직임을 추정한다. 이때 복원 영상에 미치는 영향이 가장 큰 기저대역에 대하여 반화소를 사용하면 더욱 최적의 움직임 벡터를 추정할 수 있으나 계산량이 증가하는 단점이 있다. 블록내에 인접한 화소들 간에는 상관관계가 높다는 사실을 이용하여 오차가 최소가 되는 방향을 예측하여 선별적인 보간을 행하여 반화소 움직임을 탐색하여 계산량을 줄였다. 그리고 더욱 향상된 화질을 얻기 위해서 에지 성분이 많은 고해상도 계층에서 저해상도 계층으로의 선택적 국부 움직임을 추정하였다. 모의 실험 결과 기존의 웨이블릿 변환을 이용한 움직임 추정 및 보상 방법보다 향상된 화질을 나타내었다.
-
Motion estimation technique has been used to increase video compression rates in motion video applications. One of the important algorithms to implement the motion estimation technique is search algorithm. Among many search algorithms, the H.263 adopted the Nearest Neighbors algorithm for fast search. In this paper, motion estimation block for the Nearest Neighbors algorithm is designed on FPGA and coded using VHDL and simulated under the Xilinx foundation environments. In the experiment results, we verified that the algorithm was properly designed and performed on the Xilinx FPGA(XCV300Q240)
-
In this paper, implementation of speech Recognizer system, Separated from Personal computer. By using DSP, this intends to extend the voice recognizing, limited into PC because of amount of data and calculations. For this performance The thesis uses the real time End point detector and organizes no additional device between human and the system, characteristic vector are that detects End point and voice from absolute energy and ZCR, that uses 12 difference Cepstrum from LPC, that uses the method to compensate the process of pattern separating and pre-calculated standard pattern limitation.
-
Frequency domain adaptive filter is effective to communication fields of many computational requirements. In this paper we propose a new variable step size algorithms which improves the convergence speed and reduces computational complexity for frequency domain adaptive filter. we compared MSE of the proposed algorithms with one of normalized FLMS using computer simulation of adaptive noise canceler based on synthesis speech.
-
Fractal image compression can reduce the size of image data by contractive mapping of original image. The mapping is affine transformation to find the block(called range block) which is the most similar to the original image. Fractal is very efficient way to reduce the data size. However, it has high distortion rate and requires long encoding time. In this paper, we present the simulation result of fractal and VQ hybrid systems which use different clustering algorithms, normal and improved competitive learning SOFM. The simulation results showed that the VQ hybrid fractal using improved competitive learning SOFM has better distortion rate than the VQ hybrid fractal using normal SOFM.
-
Speaking Rate has variety depends on the situation and habit of speakers. It has been many studied about speaking rate In speaker recognition. The study of speaking rate in speech recognition is one of considerable matter when It is recognized the speakers and it is measured by many speech data base and complicate estimation for accuracy. In this paper, conventional vocoder process the speech signal when encoding and transmitting without regard to speaking rate so in order to apply the speaking rate for vocoder It should be considered the simpler algorithm and less computation amount than the conventional method of speaking rate used In speech recognition. We proposed the speaking rate algorithm which is used the simple parameter with Line Spectrum Pair (LSP). The proposed peaking rate method is measured by the information of LSP in speech. We measured the variety rate of phenomenon about utterances which have different velocity, respectively. As a result, It has distinct variation rate of phenomenon between utterances uttered fast and slow and the rate is 42.8% higher in case of uttered fast than in case of uttered slow.
-
Since the amplitude of voiced fall off at about -20dB/decade, dynamic range is often compressed prior to spectral analysis so that details at weak, high frequencies may be visible. Preemphasizing the speech, either by differentiating the analog speech s
$\sub$ a(t) prior to A/D conversion or by differencing the discrete-time s(n)=s$\sub$ a(nT), compensating for falloff at high frequencies. The most common form of preemphasis is y(n)=s(n)-As(n-1), where A typically lies between 0.9 and 1.0 and reflects the degree of pre-emphasis. In This paper, we proposed that A is adjusted at each time by measuring the slope of envelope in frequency domain. -
We propose a content based watermarking technique in multimedia management system. In the proposed technique, a content description technique of MPEG-7 for the multimedia database is adopted into a watermarking technique. With multimedia features described by MPEG-7 standard, we propose a novel watermarking technique where MPEG-7 descriptors are regarded as perceptually significant portions. The watermark is embedded in cooperating with multimedia features such as MPEG-7 descriptor. To verify the feasibility and performance of proposed watermarking technique, experiments with the MPEG-7 database are performed.
-
In this paper, we propose a regular-texture image retrieval approach relating In curvature. Maximum curvature and minimum curvature are computed from the query and each regular-texture image in the database. Seven features are computed from curvature characterizing statistical properties of the corresponding image. Each regular-texture image in the database is then represented as the seven CM (curvature measurement)-features. Query comparison and matching can be done using the corresponding CM-features. Experimental results on Brodatz texture show that the proposed approach is effective.
-
This paper describes a new algorithm for segmenting continuous handwritten signatures sampled by a digitizer. Signatures are segmented by three procedures. The first step is to calculate the pen tip speed. Then the Gabor wavelet is carried out on the acquired data from the first step. Finally, the local minima of the filtered output are selected as segmentation points of the signature. The proposed method is experimented with numerous signatures with various length and complexity.
-
Character recognition has already been studied in a lot of fields. But, if input-characters have noise in practical application system, the ability decreases markedly. Special consideration should be taken into account in the recognition of blurred data. This paper proposes low-quality printed character recognition methods that extracts blurred parts of the character image, deletes them and carry out accurate character recognition.
-
In this paper, we report detects system algorithm, adapted a perimetric mask and a generalized symmetry system, to detect a transformable material and find out a minute error cannot be noticed by a naked eye. In this thesis, supposed a stable detecting system applied a general image processing theory and perimetric mash algorithm to detect badness. And finally, detected some vague errors with the application of symmetry transform algorithm that accumulate a symmetry of minute error and put stress on it.
-
The sign-language can be used as an auxiliary communication means between avatars of different languages in cyberspace. At that time, an intelligent communication method can also be utilized to achieve real-time communication, where intelligently coded data (joint angles for arm gestures and action units for facial emotions) are transmitted instead of real pictures. In this paper, a method of generating the facial gesture CG animation on different avatar models is provided. At first, to edit emotional expressions efficiently, a comic-style facial model having only eyebrows, eyes, nose, and mouth is employed. Then generation of facial emotion animation with the parameters is also investigated. Experimental results show a possibility that the method could be used for the intelligent avatar communications between Korean and Japanese.
-
This paper has been studied a method that effectively displaying color image to monochromatic display such as PDA and movable-phone. Generally, the Floyd-Steinberg dithering algorithm has been used in this area and its' effectiveness were well known. But it shows some ugly patterns in white area and also shows some directionality in vertical and horizontal directions. To reduce those directionality, I suggest the error diffusion direction to be rotated randomly according to the bit value of the current position. This can also mitigate some ugly pattern in white area
-
This paper describes the implementation of document image restoration system for the geometric distortion using structured light. To get accurate document images, the bounded book must be flattened by pushing down the book with a class plate. However, most of ancient documents are too fragile to be pushed. The proposed system restores the distorted character image due to geometric distortion.
-
The color television signal and color receivers should be balanced for the same value of reference white to achieve colorimetric fidelity and to minimize interference. The NTSC signal is balanced for white at 6774 K and most existing receivers are balanced between 6500 K and 10000 K for many reasons. In this paper, we analyze beam current ratio, lightness, and channel gain ratio according to the color temperature for the three-tube projection HDTV. We also propose the brighter reference white for the three-tube projection HDTV based on the Helmholtz-Kohlrausch effect and the optical resolution of the image. In computer simulation we confirmed the most suitable reference white using the proposed analysis method.
-
Recently, PC monitor users have been replacing cathode ray tubes (CRT) with liquid crystal displays (LCD). But the chromaticity of the primaries are dependent on RGB input signals. And the colorimetry of LCD changes with gray scale and has a poor peformance in color reproduction. In this paper we propose the enhanced algorithm of color reproduction considering color leakage error and black subpixel error in LCD. In order to test peformance of this algorithm we use the colors of Macbeth colorcheck. As a result of experiments, it was confirmed that the color difference of the LCD using the proposed algorithm was considerably reduced.
-
This paper proposes a finger crease pattern identification algorithm utilizing a clustering method. The algorithms has been developed for the use of biometric person identification system. Since the finger crease pattern may be well-imaged utilizing low cost imaging devices such as low-end CCD camera with LED lighting, the feasibility of commercialization of the algorithm and the system utilizing the algorithm may be well justified if the finger crease pattern is a reasonable choice for the biometric feature. In this paper, we exploit this possibility and show the potential of using the finger crease pattern as a feature for biometric person identification.
-
This paper proposes an enhanced algorithm for person identification system utilizing hand vein pattern. The conventional algorithm does not cope with distortion caused by image rotation caused by misplaced hands on the imaging device. A straightforward approach to consider the rotaional compensation required too much computational load, thus, we devised an approach to expect the rotation direction along with image translation, reducing the compuational requirement dramatically In this paper, we present the details of the algorithm with experimental results with the new algorithm.
-
This thesis shows controlling the mobile robot with distance information gotten with ultrasonic sensors, and analysis of captured image. The ultrasonic sensors supplies more accurate distance data in limited area but shows unstable data unlimited area while image data generally shows stable data, but this requires so much time because of amounts of calculation. So this thesis considers the merits of ultrasonic sensors and image to implement robot system .
-
One of the major factors determining the printing quality of a color printer is the color matching that is performed inside the printer driver In this paper, the mini driver for the color printer is built using the Microsoft 98DDK. Also, the ICC profile proposed as the standard for the color management system is generated. The color matching capability of the mini driver with the ICC profile is examined and compared with that of the commercial printer driver.
-
We proposes a new fingerprint minutia matching algorithm which matches the fingerprint minutiae by using adaptive distance. In general, fingerprint is deformed by pressure and orientation when a user press his fingerprint to sensor. These nonlinear deformations change the distance between minutiae and reduce verification rate. We define the adaptive distance using ridge frequency. Adaptive distance normalizes the distance between minutiae and compensates for nonlinear deformation. Our algorithm can distinguish two different fingerprints better and is more robust. Experimental results show that the performance of the proposed algorithm is superior to using Euclidean distance.
-
In this paper, we describe implementation of efficient wavelet image compression and decompression system for DVR(Digital Video Recoder). We used various methods to remove time redundancy, spatial redundancy and statistical redundancy of video camera inputs. Motion detection, wavelet transform, RLC(Run Length Coding) and huffman coding techniques are combined for efficient compression / decompression.