• Title/Summary/Keyword: Mean and Variance Features

Wavelet-Based Face Recognition by Divided Area (웨이브렛을 이용한 공간적 영역분할에 의한 얼굴 인식)

  • 이성록;이상효;조창호;조도현;이상철
    • Proceedings of the IEEK Conference / 2003.07e / pp.2307-2310 / 2003
  • This paper proposes a face recognition method based on wavelet packet decomposition. The input image is decomposed by a 2-level wavelet packet transform, and the face areas are located by applying the integral projection technique to each of the 1-level subband images, HL and LH. The located face areas are divided into three areas, called top, bottom, and border. The feature vector of statistical measures consists of the mean and variance of these three areas in the approximation image, together with the variance of a single predetermined face area in each of the remaining 15 detail images. Wavelet packet decomposition, a generalization of classical wavelet decomposition, is used because it offers richer signal analysis and captures features such as discontinuities in higher derivatives and self-similarity. The results show that even very simple statistical features such as mean and variance form an excellent basis for face classification, provided an appropriate probability distance is used.
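
The feature construction described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a simple averaging Haar filter, skips the integral projection step and the top/bottom/border split, and therefore yields 17 features (mean and variance of the approximation band plus the variance of each of the 15 detail bands). The function names `haar_split`, `mean_var`, and `packet_features` are hypothetical.

```python
def haar_split(img):
    """One 2-D Haar analysis step on an image with even height and width.

    Returns four half-size sub-bands: the local average and three detail bands.
    Averaging (divide by 4) is used instead of orthonormal scaling.
    """
    avg, d1, d2, d3 = [], [], [], []
    for r in range(0, len(img), 2):
        rows = ([], [], [], [])
        for c in range(0, len(img[0]), 2):
            a, b = img[r][c], img[r][c + 1]
            e, f = img[r + 1][c], img[r + 1][c + 1]
            rows[0].append((a + b + e + f) / 4)  # local average
            rows[1].append((a - b + e - f) / 4)  # difference across columns
            rows[2].append((a + b - e - f) / 4)  # difference across rows
            rows[3].append((a - b - e + f) / 4)  # diagonal difference
        for band, row in zip((avg, d1, d2, d3), rows):
            band.append(row)
    return [avg, d1, d2, d3]


def mean_var(band):
    """Mean and (population) variance of all coefficients in a sub-band."""
    values = [x for row in band for x in row]
    m = sum(values) / len(values)
    return m, sum((x - m) ** 2 for x in values) / len(values)


def packet_features(img):
    """2-level wavelet packet: 16 sub-bands reduced to 17 statistics.

    Height and width of img must be divisible by 4.
    """
    # Splitting every level-1 band again gives the full packet tree.
    level2 = [sub for band in haar_split(img) for sub in haar_split(band)]
    approx_mean, approx_var = mean_var(level2[0])       # approximation band
    detail_vars = [mean_var(b)[1] for b in level2[1:]]  # 15 detail bands
    return [approx_mean, approx_var] + detail_vars


# Toy 8x8 "image" with made-up intensities.
toy_image = [[float(r * 8 + c) for c in range(8)] for r in range(8)]
features = packet_features(toy_image)
```

Unlike a classical wavelet transform, the packet transform splits the detail bands again at the second level, which is why 16 sub-bands result rather than 7.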

DETECTION AND CLASSIFICATION OF DEFECTS ON APPLE USING MACHINE VISION

  • Suh, Sang-Ryong;Sung, Je-Hoon
    • Proceedings of the Korean Society for Agricultural Machinery Conference / 1996.06c / pp.852-862 / 1996
  • This study was carried out to develop tools for detecting defects on apples using machine vision. First, six kinds of frames for color images (R, G, B, H, S, and I) and a frame for near-infrared images (the NIR frame) were tested to select those useful for segmenting defect areas from apple images. Then, several methods for classifying the kind of defect in the segmented areas were developed and tested. Five kinds of apple defect were investigated: bruise, decay, fleck, worm hole, and scar. The results are as follows. The NIR frame was selected as the best of the seven kinds of image frame, and the R, G, and I frames also gave favourable results for segmenting defect areas. Eight kinds of features of the segmented defect areas were measured in the four selected frames: size, roundness, axes length ratio, mean and variance of pixel values, variance of the real part of the spectrum, and mean and variance of the power spectrum obtained from the spatial Fourier transform. Procedures to classify defects using these features were then developed for the four frames and tested with 75-113 defects on apples. The NIR and I frames showed high accuracies in classifying the kind of defect, 77% and 76%, respectively.

Distorted Image Database Retrieval Using Low Frequency Sub-band of Wavelet Transform (웨이블릿 변환의 저주파수 부대역을 이용한 왜곡 영상 데이터베이스 검색)

  • Park, Ha-Joong;Kim, Kyeong-Jin;Jung, Ho-Youl
    • IEMEK Journal of Embedded Systems and Applications / v.3 no.1 / pp.8-18 / 2008
  • In this paper, we propose an efficient wavelet-based algorithm for still image database retrieval. It uses only the lowest-frequency sub-band of a multi-level wavelet transform, so the retrieval system needs less memory and less processing time. From the low-frequency sub-band we extract texture features in the form of statistical information: mean, variance, and histogram. We then measure the distances between the query image and the images in the database in terms of these features. To obtain good retrieval performance, we use the first feature (mean and variance of the wavelet coefficients) to filter out most of the unlikely images; the remaining images are treated as candidates. We then apply the second feature (histogram of the wavelet coefficients) to rank all the candidate images. To evaluate the algorithm, we create various distorted image databases using MIT VisTex texture images and PICS natural images. Simulations demonstrate that the method achieves satisfactory performance in retrieval accuracy as well as in memory requirement and computational complexity. It is therefore expected to provide a good retrieval solution for JPEG-2000, which uses the wavelet transform.
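
The two-stage scheme in this abstract (a cheap mean/variance filter followed by histogram ranking) might look roughly like the sketch below. The distance measures, the threshold, the bin count, and all function names are assumptions made for illustration; the abstract does not specify them. Inputs are taken to be flat lists of low-frequency sub-band coefficients.

```python
def mean_var(values):
    """Mean and (population) variance of a list of coefficients."""
    m = sum(values) / len(values)
    v = sum((x - m) ** 2 for x in values) / len(values)
    return m, v


def histogram(values, bins, lo, hi):
    """Normalized histogram of values over the range [lo, hi]."""
    counts = [0] * bins
    width = (hi - lo) / bins
    for x in values:
        idx = min(int((x - lo) / width), bins - 1)  # clamp the top edge
        counts[max(idx, 0)] += 1                    # clamp the bottom edge
    return [c / len(values) for c in counts]


def retrieve(query, database, threshold, bins=8, lo=0.0, hi=256.0):
    """Return candidate image names, best match first.

    database maps an image name to its list of sub-band coefficients.
    """
    qm, qv = mean_var(query)
    qh = histogram(query, bins, lo, hi)

    # Stage 1: keep only images whose mean and variance are near the query's.
    candidates = []
    for name, coeffs in database.items():
        m, v = mean_var(coeffs)
        if abs(m - qm) + abs(v - qv) <= threshold:
            candidates.append(name)

    # Stage 2: rank the survivors by L1 distance between histograms.
    def hist_dist(name):
        h = histogram(database[name], bins, lo, hi)
        return sum(abs(a - b) for a, b in zip(h, qh))

    return sorted(candidates, key=hist_dist)


# Made-up coefficient lists standing in for three database images.
toy_db = {
    "a": [10, 20, 30, 40],
    "b": [12, 18, 33, 41],
    "c": [200, 210, 220, 230],
}
ranking = retrieve([11, 19, 31, 41], toy_db, threshold=10.0)
```

The point of the ordering is cost: the first stage needs only two numbers per image, so the more expensive histogram comparison runs on a much smaller candidate set.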

Performance Improvements for Silence Feature Normalization Method by Using Filter Bank Energy Subtraction (필터 뱅크 에너지 차감을 이용한 묵음 특징 정규화 방법의 성능 향상)

  • Shen, Guanghu;Choi, Sook-Nam;Chung, Hyun-Yeol
    • The Journal of Korean Institute of Communications and Information Sciences / v.35 no.7C / pp.604-610 / 2010
  • In this paper, we propose the FSFN (Filter bank sub-band energy subtraction based CLSFN) method to improve the recognition performance of the existing CLSFN (Cepstral distance and Log-energy based Silence Feature Normalization). The proposed FSFN reduces the energy of noise components in the filter bank sub-band domain when extracting features from speech data. This yields enhanced cepstral features, which in turn improve the accuracy of speech/silence classification. It can therefore be expected to perform better than the existing CLSFN. Experiments on the Aurora 2.0 DB showed that the proposed FSFN method improves the average word accuracy by 2% compared with the conventional CLSFN method, and that FSFN combined with CMVN (Cepstral Mean and Variance Normalization) gave the best recognition performance among the methods compared.
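
For readers unfamiliar with CMVN, the sketch below shows the usual per-utterance form: each cepstral coefficient is shifted to zero mean and scaled to unit variance across the frames of one utterance. This illustrates only the normalization step mentioned at the end of the abstract, not FSFN itself; the function name `cmvn` and the small `eps` guard are assumptions.

```python
import math


def cmvn(frames, eps=1e-10):
    """Cepstral mean and variance normalization over one utterance.

    frames: list of feature vectors (one per time frame), all the same length.
    eps guards against division by zero for constant coefficients.
    """
    n = len(frames)
    dim = len(frames[0])
    # Statistics are computed per coefficient, across time.
    means = [sum(f[d] for f in frames) / n for d in range(dim)]
    stds = [
        math.sqrt(sum((f[d] - means[d]) ** 2 for f in frames) / n)
        for d in range(dim)
    ]
    return [
        [(f[d] - means[d]) / (stds[d] + eps) for d in range(dim)]
        for f in frames
    ]


# Toy utterance: 4 frames with 2 coefficients each (made-up numbers).
utterance = [[1.0, 10.0], [2.0, 14.0], [3.0, 18.0], [4.0, 22.0]]
normalized = cmvn(utterance)
```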

The Fast Search Algorithm for Raman Spectrum (라만 스펙트럼 고속 검색 알고리즘)

  • Ko, Dae-Young;Baek, Sung-June;Park, Jun-Kyu;Seo, Yu-Gyeong;Seo, Sung-Il
    • Journal of the Korea Academia-Industrial cooperation Society / v.16 no.5 / pp.3378-3384 / 2015
  • The problem of fast search for Raman spectra has attracted much attention recently. By far the simplest and most widely used method is to calculate and compare the Euclidean distance between the given spectrum and the spectra in a database. However, this is a non-trivial problem because of the inherent high dimensionality of the data. One of the most serious problems is the high computational complexity of searching for the closest codeword. To overcome this problem, the fast codeword search algorithm based on the mean pyramids of codewords is currently used in image coding applications. In this paper, we present three new methods for fast search for the closest codeword. The proposed algorithm uses two significant features of a vector, mean and variance, to reject many unlikely codewords and save a great deal of computation time. The experimental results show about 42.8-55.2% performance improvement for 1DMPS+PDS. The results confirm the effectiveness of the proposed algorithm.
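
The rejection idea can be sketched with a standard lower bound: for k-dimensional vectors x and c with means m_x, m_c and mean-removed norms s_x, s_c, the squared Euclidean distance is at least k(m_x - m_c)^2 + (s_x - s_c)^2. A codeword whose bound already exceeds the best distance found so far can be skipped without computing the full distance. The code below is a generic illustration of that principle, not one of the paper's three methods, and the function names are hypothetical.

```python
import math


def mean_and_spread(v):
    """Return the mean of v and the norm of v after removing its mean."""
    m = sum(v) / len(v)
    s = math.sqrt(sum((x - m) ** 2 for x in v))
    return m, s


def nearest_codeword(query, codebook):
    """Return (best_index, best_squared_distance, full_distance_count)."""
    k = len(query)
    qm, qs = mean_and_spread(query)
    # Codeword statistics do not depend on the query and could be cached.
    stats = [mean_and_spread(c) for c in codebook]
    best_i, best_d2, evaluated = -1, float("inf"), 0
    for i, (c, (cm, cs)) in enumerate(zip(codebook, stats)):
        # Lower bound on the squared distance from mean and spread alone.
        bound = k * (qm - cm) ** 2 + (qs - cs) ** 2
        if bound >= best_d2:
            continue  # cannot beat the current best; skip the full distance
        evaluated += 1
        d2 = sum((a - b) ** 2 for a, b in zip(query, c))
        if d2 < best_d2:
            best_i, best_d2 = i, d2
    return best_i, best_d2, evaluated


# Made-up 4-point "spectra" standing in for a codebook.
toy_codebook = [
    [1.0, 2.0, 3.0, 4.0],
    [10.0, 10.0, 10.0, 10.0],
    [0.0, 5.0, 0.0, 5.0],
    [2.0, 2.0, 3.0, 5.0],
]
toy_query = [1.2, 2.0, 3.0, 4.1]
index, dist2, n_full = nearest_codeword(toy_query, toy_codebook)
```

Because the bound never exceeds the true distance, the search returns the same nearest codeword as an exhaustive scan; only the number of full distance computations changes.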

A new approach for content-based video retrieval

  • Kim, Nac-Woo;Lee, Byung-Tak;Koh, Jai-Sang;Song, Ho-Young
    • International Journal of Contents / v.4 no.2 / pp.24-28 / 2008
  • In this paper, we propose a new approach for content-based video retrieval using non-parametric motion classification in a shot-based video indexing structure. The proposed system supports real-time video retrieval through spatio-temporal feature comparison: it extracts a representative frame and non-parametric motion information from shot-based video clips segmented by a scene change detection method, then measures the similarity between visual features and between motion features, respectively. To extract the non-parametric motion features, normalized motion vectors are created from an MPEG-compressed stream, each normalized motion vector is discretized into one of several angle bins, and the mean, variance, and direction of the motion vectors in these bins are considered. To obtain the visual feature of the representative frame, we use an edge-based spatial descriptor. Experimental results show that our approach is superior to conventional methods in video indexing and retrieval performance.
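
The angle-bin step in this abstract can be pictured with the sketch below. It assumes that motion vectors are given as (dx, dy) pairs, that bins are equal-width sectors of the full circle, and that the per-bin statistics are the mean and variance of vector magnitude; none of these details are stated in the abstract, and `motion_bin_features` is a hypothetical name.

```python
import math


def motion_bin_features(vectors, n_bins=8):
    """Per angle bin: (count, mean magnitude, variance of magnitude)."""
    bins = [[] for _ in range(n_bins)]
    width = 2 * math.pi / n_bins
    for dx, dy in vectors:
        mag = math.hypot(dx, dy)
        if mag == 0.0:
            continue  # a zero vector has no direction
        angle = math.atan2(dy, dx) % (2 * math.pi)  # map to [0, 2*pi)
        # The extra modulo guards against rounding onto the upper edge.
        bins[int(angle / width) % n_bins].append(mag)
    features = []
    for mags in bins:
        if not mags:
            features.append((0, 0.0, 0.0))
            continue
        m = sum(mags) / len(mags)
        v = sum((x - m) ** 2 for x in mags) / len(mags)
        features.append((len(mags), m, v))
    return features


# Made-up motion vectors for one shot.
toy_vectors = [(1.0, 0.1), (3.0, 0.3), (-0.1, 2.0), (-1.0, -0.1), (0.0, 0.0)]
bin_features = motion_bin_features(toy_vectors, n_bins=4)
```

The bin index itself carries the direction information, so the resulting list can be compared between shots as a fixed-length motion descriptor.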

Characterization of the Spatial Variability of Paper Formation Using a Continuous Wavelet Transform

  • Keller, D.Steven;Luner, Philip;Pawlak, Joel J.
    • Journal of Korea Technical Association of The Pulp and Paper Industry / v.32 no.5 / pp.14-25 / 2000
  • In this investigation, a wavelet transform analysis was used to decompose beta-radiographic formation images into spectral and spatial components. Conventional formation analysis may use spectral analysis, based on Fourier transformation or variance vs. zone size, to describe the grammage distribution of features such as flocs, streaks and mean fiber orientation. However, these methods have limited utility for the analysis of statistically stationary data sets where variance is not uniform with position, e.g. paper machine CD profiles (especially those that contain streaks). A continuous wavelet transform was used to analyze formation data arrays obtained from radiographic imaging of handsheets and cross machine paper samples. The response of the analytical method to grammage, floc size distribution, mean fiber orientation and sensitivity to feature localization were assessed. From wavelet analysis, the change in scale of grammage variation as a function of position was used to demonstrate regular and isolated differences in the formed structure.

Estimation of tomato maturity as a continuous index using deep neural networks

  • Taehyeong Kim;Dae-Hyun Lee;Seung-Woo Kang;Soo-Hyun Cho;Kyoung-Chul Kim
    • Korean Journal of Agricultural Science / v.49 no.4 / pp.785-793 / 2022
  • In this study, tomato maturity was estimated based on deep learning for a harvesting robot. Tomato images were obtained using an RGB camera installed on a previously developed monitoring robot, and the samples were cropped to 128 × 128 images to generate a dataset for training the classification model. The classification model was constructed based on convolutional neural networks, and the mean-variance loss was used to implicitly learn the distribution of the data features for each class. In the test stage, tomato maturity was estimated as a continuous index in the range 0 to 1 by calculating the expected class value. The results show that the F1-score of the classification was approximately 0.94, and the performance was similar to that of deep learning-based classification tasks in the agriculture field. In addition, it was possible to estimate the distribution in each maturity stage. The results show that our approach can not only classify the discrete maturation stages of tomatoes but also estimate continuous maturity.
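
The expected-class-value idea can be sketched as follows. The code uses one common formulation of the mean-variance loss (a squared error on the expected class plus the variance of the predicted distribution) and assumes the continuous index is the expected class divided by the highest class index; the abstract does not give the exact formulas or weights, so treat both as assumptions. Function names are hypothetical.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of class scores."""
    top = max(logits)
    exps = [math.exp(z - top) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def mean_variance_terms(logits, label):
    """Return (mean_loss, variance_loss, continuous_index) for one sample.

    Classes are assumed to be ordered maturity stages 0 .. K-1.
    """
    probs = softmax(logits)
    n_classes = len(probs)
    # Expected class value under the predicted distribution.
    expected = sum(i * p for i, p in enumerate(probs))
    # Spread of the predicted distribution around that expectation.
    variance = sum(p * (i - expected) ** 2 for i, p in enumerate(probs))
    mean_loss = 0.5 * (expected - label) ** 2
    index = expected / (n_classes - 1)  # assumed rescaling to [0, 1]
    return mean_loss, variance, index


# Made-up logits for a 4-stage maturity classifier; true stage is 2.
mean_loss, var_loss, maturity = mean_variance_terms([0.1, 0.5, 2.0, 0.7], 2)
```

In training, these two terms would typically be added to a cross-entropy loss with weighting factors; at test time only the index is needed.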

Use of Crown Feature Analysis to Separate the Two Pine Species in QuickBird Imagery

  • Kim, Cheon
    • Korean Journal of Remote Sensing / v.24 no.3 / pp.267-272 / 2008
  • Tree species-specific estimates from spaceborne high-resolution imagery improve the estimation of forest biomass, which is needed for long-term planning of sustainable forest management (SFM). This paper contributes to distinguishing the crowns of two coniferous species, Pinus densiflora and Pinus koraiensis, in QuickBird imagery. Feature analyses derived from shape parameters and from first- and second-order statistical texture features of the same test area were compared for separating and delineating the two species. As expected, initial studies showed that both the form factor and compactness shape parameters successfully differentiated the pine species within the compartment for single crown identification from spaceborne high-resolution imagery. Another result revealed that the selected texture parameters (mean, variance, and angular second moment (ASM)) in the infrared band image could produce a good subset of texture features for representing the detailed tree crown outline.

Super-resolution in Music Score Images by Instance Normalization

  • Tran, Minh-Trieu;Lee, Guee-Sang
    • Smart Media Journal / v.8 no.4 / pp.64-71 / 2019
  • The performance of an OMR (Optical Music Recognition) system is usually determined by the characterizing features of the input music score images. Low resolution is one of the main factors leading to degraded image quality. In this paper, we handle the low-resolution problem using a super-resolution technique. We propose a deep neural network with instance normalization to improve the quality of music score images. Instance normalization has proven beneficial in single image enhancement, and here it works better than batch normalization, which shows the effectiveness of shifting the mean and variance of deep features at the instance level. The proposed method provides an end-to-end mapping between low-resolution and high-resolution images. New images are then created whose resolution is four times that of the original images. Our model was evaluated on the DeepScores dataset and outperforms other existing methods.
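
Instance normalization, as referred to in this abstract, normalizes each channel of each image using that channel's own spatial mean and variance, rather than statistics pooled over a batch. The sketch below shows the core computation without the optional learnable scale and shift; the nested-list layout, the `eps` value, and the name `instance_norm` are assumptions for illustration.

```python
import math


def instance_norm(feature_maps, eps=1e-5):
    """Normalize each channel of ONE image to zero mean and unit variance.

    feature_maps: list of channels, each a 2-D list (height x width).
    """
    out = []
    for channel in feature_maps:
        values = [x for row in channel for x in row]
        mean = sum(values) / len(values)
        var = sum((x - mean) ** 2 for x in values) / len(values)
        scale = 1.0 / math.sqrt(var + eps)  # eps avoids division by zero
        out.append([[(x - mean) * scale for x in row] for row in channel])
    return out


# Made-up 2-channel, 2x2 feature map for a single image.
toy_features = [
    [[1.0, 2.0], [3.0, 4.0]],
    [[10.0, 10.0], [30.0, 30.0]],
]
normalized_maps = instance_norm(toy_features)
```

Batch normalization would instead compute one mean and variance per channel across all images in the batch, so the statistics of one score image would depend on the others.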