• Title/Summary/Keyword: Principal components analysis

Search Result 764, Processing Time 0.025 seconds

Color Component Analysis For Image Retrieval (이미지 검색을 위한 색상 성분 분석)

  • Choi, Young-Kwan;Choi, Chul;Park, Jang-Chun
    • The KIPS Transactions:PartB
    • /
    • v.11B no.4
    • /
    • pp.403-410
    • /
    • 2004
  • Recently, studies of image analysis, as the preprocessing stage for medical image analysis or image retrieval, are actively carried out. This paper intends to propose a way of utilizing color components for image retrieval. For image retrieval, it is based on color components, and for analysis of color, CLCM (Color Level Co-occurrence Matrix) and statistical techniques are used. CLCM proposed in this paper is to project color components on 3D space through geometric rotate transform and then, to interpret distribution that is made from the spatial relationship. CLCM is 2D histogram that is made in color model, which is created through geometric rotate transform of a color model. In order to analyze it, a statistical technique is used. Like CLCM, GLCM (Gray Level Co-occurrence Matrix)[1] and Invariant Moment [2,3] use 2D distribution chart, which use basic statistical techniques in order to interpret 2D data. However, even though GLCM and Invariant Moment are optimized in each domain, it is impossible to perfectly interpret irregular data available on the spatial coordinates. That is, GLCM and Invariant Moment use only the basic statistical techniques so reliability of the extracted features is low. In order to interpret the spatial relationship and weight of data, this study has used Principal Component Analysis [4,5] that is used in multivariate statistics. In order to increase accuracy of data, it has proposed a way to project color components on 3D space, to rotate it and then, to extract features of data from all angles.

A study on principal component analysis using penalty method (페널티 방법을 이용한 주성분분석 연구)

  • Park, Cheolyong
    • Journal of the Korean Data and Information Science Society
    • /
    • v.28 no.4
    • /
    • pp.721-731
    • /
    • 2017
  • In this study, principal component analysis methods using Lasso penalty are introduced. There are two popular methods that apply Lasso penalty to principal component analysis. The first method is to find an optimal vector of linear combination as the regression coefficient vector of regressing for each principal component on the original data matrix with Lasso penalty (elastic net penalty in general). The second method is to find an optimal vector of linear combination by minimizing the residual matrix obtained from approximating the original matrix by the singular value decomposition with Lasso penalty. In this study, we have reviewed two methods of principal components using Lasso penalty in detail, and shown that these methods have an advantage especially in applying to data sets that have more variables than cases. Also, these methods are compared in an application to a real data set using R program. More specifically, these methods are applied to the crime data in Ahamad (1967), which has more variables than cases.

Assessment of water quality variations under non-rainy and rainy conditions by principal component analysis techniques in Lake Doam watershed, Korea

  • Bhattrai, Bal Dev;Kwak, Sungjin;Heo, Woomyung
    • Journal of Ecology and Environment
    • /
    • v.38 no.2
    • /
    • pp.145-156
    • /
    • 2015
  • This study was based on water quality data of the Lake Doam watershed, monitored from 2010 to 2013 at eight different sites with multiple physiochemical parameters. The dataset was divided into two sub-datasets, namely, non-rainy and rainy. Principal component analysis (PCA) and factor analysis (FA) techniques were applied to evaluate seasonal correlations of water quality parameters and extract the most significant parameters influencing stream water quality. The first five principal components identified by PCA techniques explained greater than 80% of the total variance for both datasets. PCA and FA results indicated that total nitrogen, nitrate nitrogen, total phosphorus, and dissolved inorganic phosphorus were the most significant parameters under the non-rainy condition. This indicates that organic and inorganic pollutants loads in the streams can be related to discharges from point sources (domestic discharges) and non-point sources (agriculture, forest) of pollution. During the rainy period, turbidity, suspended solids, nitrate nitrogen, and dissolved inorganic phosphorus were identified as the most significant parameters. Physical parameters, suspended solids, and turbidity, are related to soil erosion and runoff from the basin. Organic and inorganic pollutants during the rainy period can be linked to decayed matters, manure, and inorganic fertilizers used in farming. Thus, the results of this study suggest that principal component analysis techniques are useful for analysis and interpretation of data and identification of pollution factors, which are valuable for understanding seasonal variations in water quality for effective management.

Data anomaly detection and Data fusion based on Incremental Principal Component Analysis in Fog Computing

  • Yu, Xue-Yong;Guo, Xin-Hui
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.14 no.10
    • /
    • pp.3989-4006
    • /
    • 2020
  • The intelligent agriculture monitoring is based on the perception and analysis of environmental data, which enables the monitoring of the production environment and the control of environmental regulation equipment. As the scale of the application continues to expand, a large amount of data will be generated from the perception layer and uploaded to the cloud service, which will bring challenges of insufficient bandwidth and processing capacity. A fog-based offline and real-time hybrid data analysis architecture was proposed in this paper, which combines offline and real-time analysis to enable real-time data processing on resource-constrained IoT devices. Furthermore, we propose a data process-ing algorithm based on the incremental principal component analysis, which can achieve data dimensionality reduction and update of principal components. We also introduce the concept of Squared Prediction Error (SPE) value and realize the abnormal detection of data through the combination of SPE value and data fusion algorithm. To ensure the accuracy and effectiveness of the algorithm, we design a regular-SPE hybrid model update strategy, which enables the principal component to be updated on demand when data anomalies are found. In addition, this strategy can significantly reduce resource consumption growth due to the data analysis architectures. Practical datasets-based simulations have confirmed that the proposed algorithm can perform data fusion and exception processing in real-time on resource-constrained devices; Our model update strategy can reduce the overall system resource consumption while ensuring the accuracy of the algorithm.

Wafer state prediction in 64M DRAM s-Poly etching process using real-time data (실시간 데이터를 위한 64M DRAM s-Poly 식각공정에서의 웨이퍼 상태 예측)

  • 이석주;차상엽;우광방
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 1997.10a
    • /
    • pp.664-667
    • /
    • 1997
  • For higher component density per chip, it is necessary to identify and control the semiconductor manufacturing process more stringently. Recently, neural networks have been identified as one of the most promising techniques for modeling and control of complicated processes such as plasma etching process. Since wafer states after each run using identical recipe may differ from each other, conventional neural network models utilizing input factors only cannot represent the actual state of process and equipment. In this paper, in addition to the input factors of the recipe, real-time tool data are utilized for modeling of 64M DRAM s-poly plasma etching process to reflect the actual state of process and equipment. For real-time tool data, we collect optical emission spectroscopy (OES) data. Through principal component analysis (PCA), we extract principal components from entire OES data. And then these principal components are included to input parameters of neural network model. Finally neural network model is trained using feed forward error back propagation (FFEBP) algorithm. As a results, simulation results exhibit good wafer state prediction capability after plasma etching process.

  • PDF

Morphological Variation of Berberis amurensis Complex (Berberis amurensis complex의 형태 변이 분석)

  • Hyun, Chang-Woo;Kim, Young-Dong
    • Korean Journal of Plant Taxonomy
    • /
    • v.38 no.2
    • /
    • pp.93-109
    • /
    • 2008
  • The morphological variation was analysed to examine previous hypotheses on the taxonomy of B. amurensis complex which includes B. amurensis Rupr. var. amurensis, B. amurensis var. quelpaertensis (Nakai) Nakai and B. amurensis var. latifolia Nakai. The results from the univariational and principal components analyses employing 22 putatively diagnostic characters indicate that B. amurensis var. quelpaertensis is distinct from var. amurensis in the length and width of leaves, angle of leaf apex, distance between spinose teeth, length of internode, number of flowers per inflorescence, whereas B. amurensis var. latifolia is different from other varieties in the angle of leaf apex and leaf length/width ratio. In principal component analysis, the characters of the leaf including leaf width and length were the main characteristics to distinguish those three taxa. The evidence both from the principal components analyses and current geographical distribution pattern suggest that retaining the varietal status for the two taxa, B. amurensis var. latifolia and B. amurensis var. quelpaertensis is reasonable.

Stability evaluation model for loess deposits based on PCA-PNN

  • Li, Guangkun;Su, Maoxin;Xue, Yiguo;Song, Qian;Qiu, Daohong;Fu, Kang;Wang, Peng
    • Geomechanics and Engineering
    • /
    • v.27 no.6
    • /
    • pp.551-560
    • /
    • 2021
  • Due to the low strength and high compressibility characteristics, the loess deposits tunnels are prone to large deformations and collapse. An accurate stability evaluation for loess deposits is of considerable significance in deformation control and safety work during tunnel construction. 37 groups of representative data based on real loess deposits cases were adopted to establish the stability evaluation model for the tunnel project in Yan'an, China. Physical and mechanical indices, including water content, cohesion, internal friction angle, elastic modulus, and poisson ratio are selected as index system on the stability level of loess. The data set is randomly divided into 80% as the training set and 20% as the test set. Firstly, principal component analysis (PCA) is used to convert the five index system to three linearly independent principal components X1, X2 and X3. Then, the principal components were used as input vectors for probabilistic neural network (PNN) to map the nonlinear relationship between the index system and stability level of loess. Furthermore, Leave-One-Out cross validation was applied for the training set to find the suitable smoothing factor. At last, the established model with the target smoothing factor 0.04 was applied for the test set, and a 100% prediction accuracy rate was obtained. This intelligent classification method for loess deposits can be easily conducted, which has wide potential applications in evaluating loess deposits.

Assessment of Hydrogeochemical Characteristics and Contaminant Dispersion of Aquifer around Keumsan Municipal Landfill (금산 매립장 주변 대수층의 수리지화학적 특성 및 오염 확산 평가)

  • Oh, In-Suk;Ko, Kyung-Seok;Kong, In-Chul;Ku, Min-Ho
    • Economic and Environmental Geology
    • /
    • v.41 no.6
    • /
    • pp.657-672
    • /
    • 2008
  • The purposes of this study are to investigate the hydrogeochemical characteristics of groundwaters around Keumsan municipal landfill, and to evaluate the contaminant dispersion from the landfill and its environmental impact. To achieve these goals, groundwater quality logging, hydrochemical analysis, multivariate statistical analysis, and contaminant transport modeling were performed. The water quality logging indicated a leaking from the landfill at the depth of 4-12m around a leachate sump. Electrical conductivity data indicated that groundwaters within 70-100m from landfill were affected by the landfill leakage. Principal components 1 and 2 obtained from principal components analysis (PCA) reflect the influence of leachate and the characteristics of aquifer media, respectively. The results of principal component analysis also indicated the natural attenuation processes such as cation exchange, sorption, and microbial biodegradation. The modeling results showed that groundwater flow westward along a valley from the landfill and contaminants transport accordingly.

A Study on the Ratio of Human and Dog Facial Components based on Principal Component Analysis (주성분 분석기반 인간과 개의 얼굴 비율 연구)

  • Lee, Young-suk;Ki, Dae Wook
    • Journal of Korea Multimedia Society
    • /
    • v.23 no.10
    • /
    • pp.1339-1347
    • /
    • 2020
  • This study is a preliminary study to design a character automation system that considers the facial characteristics of mammals. The experimental data of this study was conducted on dogs (dog breeds) and humans, which were designed to be used in many contents. First, data was extracted from 100 types of dogs and 100 human data. Second, the criteria for measuring the ratio of important parts of the dog and human face were suggested. In addition, a comparative analysis of the face of a dog and a human face is conducted. Lastly, by analyzing the main component(PCA), the most characteristic elements in the faces of dogs and humans were analyzed. As a result, it was confirmed that the length of the face, the size of the eyes, the length of the glabellar, and the length of the glabellar and other parts are important. Through this study, the features of the dog's face that are different from humans are expected to contribute to the animal character automation.

Modeling of individual head-related impulse responses using a set of general basis functions (보편적인 기저함수를 이용한 개인의 머리전달함수 모델링)

  • Hwang, Sung-Mok;Park, Young-Jin;Park, Youn-Sik
    • Proceedings of the Korean Society for Noise and Vibration Engineering Conference
    • /
    • 2007.11a
    • /
    • pp.1430-1436
    • /
    • 2007
  • A principal components analysis (PCA) of the median head-related impulse responses (HRIRs) in the CIPIC HRTF database reveals that the individual HRIRs can be adequately reconstructed by a linear combination of 12 orthonormal basis functions. These basis functions can be used generally to model arbitrary HRIRs, which are not included in the process to obtain the basis functions. To clarify whether these basis functions can be used to model other set of arbitrary HRIRs, an numerical error analysis for modeling and a series of subjective listening tests were carried out using the measured and modeled HRIRs. The results showed that the set of individual HRIRs, which were measured in our lab using different measurement conditions, techniques, and source positions, can be well modeled with reasonable accuracy. Furthermore, all subjects reported not only the accurate vertical perception but also the front-back discrimination with the modeled HRIRs based on 12 basis functions. However, as less basis functions were used for HRIR modeling, the modeling accuracy and localization performance deteriorated.

  • PDF