• Title/Summary/Keyword: Principal components analysis

Search Result 763, Processing Time 0.028 seconds

Analysis of biomarkers with tunable infrared gas sensors (가변 파장형 적외선 가스 센서에 의한 생체표지자 분석)

  • Yi, Seung Hwan
    • Journal of Sensor Science and Technology
    • /
    • v.30 no.5
    • /
    • pp.314-319
    • /
    • 2021
  • In this study, biomarkers were analyzed and segmented using tunable infrared gas sensors after performing the principal component analysis. The free spectral range of the device under test (DUT) was around 30 nm and DUT-5580 yielded the highest output voltage property among the others. The biomarkers (isoprophyl alcohol, ethanol, methanol, and acetone solutions) were sequentially mixed with deionized water and their mists were carried into the gas chamber using high-purity nitrogen gas. A total of 17 different mixed gases were tested with three tunable infrared gas sensors, namely DUT-3144, DUT-5580, and DUT-8010. DUT-8010 resolved the infrared absorption spectra of whole mixed gases. Based on the principal component analysis with each DUT and their combinations, each mixed gas and the trends in increasing gas concentration could be well analyzed when the contributions of the eigenvalues of the first and second were higher than 70% and 10%, respectively, and their sum was greater than 90%.

A Study on the Compression and Major Pattern Extraction Method of Origin-Destination Data with Principal Component Analysis (주성분분석을 이용한 기종점 데이터의 압축 및 주요 패턴 도출에 관한 연구)

  • Kim, Jeongyun;Tak, Sehyun;Yoon, Jinwon;Yeo, Hwasoo
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.19 no.4
    • /
    • pp.81-99
    • /
    • 2020
  • Origin-destination data have been collected and utilized for demand analysis and service design in various fields such as public transportation and traffic operation. As the utilization of big data becomes important, there are increasing needs to store raw origin-destination data for big data analysis. However, it is not practical to store and analyze the raw data for a long period of time since the size of the data increases by the power of the number of the collection points. To overcome this storage limitation and long-period pattern analysis, this study proposes a methodology for compression and origin-destination data analysis with the compressed data. The proposed methodology is applied to public transit data of Sejong and Seoul. We first measure the reconstruction error and the data size for each truncated matrix. Then, to determine a range of principal components for removing random data, we measure the level of the regularity based on covariance coefficients of the demand data reconstructed with each range of principal components. Based on the distribution of the covariance coefficients, we found the range of principal components that covers the regular demand. The ranges are determined as 1~60 and 1~80 for Sejong and Seoul respectively.

Multivariate Time Series Simulation With Component Analysis (독립성분분석을 이용한 다변량 시계열 모의)

  • Lee, Tae-Sam;Salas, Jose D.;Karvanen, Juha;Noh, Jae-Kyoung
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2008.05a
    • /
    • pp.694-698
    • /
    • 2008
  • In hydrology, it is a difficult task to deal with multivariate time series such as modeling streamflows of an entire complex river system. Normal distribution based model such as MARMA (Multivariate Autorgressive Moving average) has been a major approach for modeling the multivariate time series. There are some limitations for the normal based models. One of them might be the unfavorable data-transformation forcing that the data follow the normal distribution. Furthermore, the high dimension multivariate model requires the very large parameter matrix. As an alternative, one might be decomposing the multivariate data into independent components and modeling it individually. In 1985, Lins used Principal Component Analysis (PCA). The five scores, the decomposed data from the original data, were taken and were formulated individually. The one of the five scores were modeled with AR-2 while the others are modeled with AR-1 model. From the time series analysis using the scores of the five components, he noted "principal component time series might provide a relatively simple and meaningful alternative to conventional large MARMA models". This study is inspired from the researcher's quote to develop a multivariate simulation model. The multivariate simulation model is suggested here using Principal Component Analysis (PCA) and Independent Component Analysis (ICA). Three modeling step is applied for simulation. (1) PCA is used to decompose the correlated multivariate data into the uncorrelated data while ICA decomposes the data into independent components. Here, the autocorrelation structure of the decomposed data is still dominant, which is inherited from the data of the original domain. (2) Each component is resampled by block bootstrapping or K-nearest neighbor. (3) The resampled components bring back to original domain. From using the suggested approach one might expect that a) the simulated data are different with the historical data, b) no data transformation is required (in case of ICA), c) a complex system can be decomposed into independent component and modeled individually. The model with PCA and ICA are compared with the various statistics such as the basic statistics (mean, standard deviation, skewness, autocorrelation), and reservoir-related statistics, kernel density estimate.

  • PDF

Phylogenetic Analysis of Artemisia spp. by Morphological Characteristics of Reproductive Organs in Korea (화기형태에 의한 국내 자생쑥의 유연관계 분석)

  • Sung, Jung-Sook;Lee, Jeong-Hoon;Lee, Jei-Wan;Bang, Kyong-Hwan;Yeo, Jun-Hwan;Park, Chun-Geon;Park, Ho-Ki;Seong, Nak-Sul;Moon, Sung-Gi
    • Korean Journal of Medicinal Crop Science
    • /
    • v.16 no.4
    • /
    • pp.218-224
    • /
    • 2008
  • This study was conducted to obtain the basic data for using the Artemisia genetic resources as a medicinal crop. 24 taxa including Artemisia capillaris Thunb. were analyzed by principal component analysis of 25 characters and cluster analysis for classification. In Principal components analysis of individuals of taxa using 25 morphological characters of reproductive organ, the first, the second, the third and the fourth components contributed 44.73%, 16.86%, 8.88%, and 7.07% of the variations, respectively. The cumulative contribution from the first to the fourth principal components was 77.56%. In cluster analysis, taxa of Artemisia L. was seperated 3 group by 25 morphological characters of reproductive organ, but it didn't completely coincident with Kitamura classification.

A Study on the Characteristics of Traditionality Expression at Modernized Chinese Restaurants - Focused on MT(Modernized Traditional) Syle Restaurants in Hong Kong - (현대화 된 중국식 레스토랑에 나타난 전통성 표현 특성 연구 - 홍콩 소재 MT 유형(Modernized Traditional Style) 레스토랑을 중심으로 -)

  • Oh, Hye-Kyung
    • Korean Institute of Interior Design Journal
    • /
    • v.21 no.4
    • /
    • pp.163-171
    • /
    • 2012
  • The objective of this study was to analyze the characteristics of traditionality expressions at modernized Chinese restaurant in Hong Kong. As a case study, the study examined 12 modernized Chinese restaurants in Hong Kong. The gathered data were categorized and examined according to the ways of traditionality expressions, which included reproduction, transformation, and reinterpretation of traditional components. Each of the components was measured for the amount of traditional or modernity expression on a five-point scale. The five-point scoring system put an emphasis on heritage; 1 point was given to principal modernity(modernity: 90-100% + tradition: 0-10%), 2 points were given to principal modernity + auxiliary tradition(modernity: 70-90% + tradition: 10-30%), 3 points were given to the same ratio between tradition and modernity(modernity: 40-60% + tradition: 40-60%), 4 points were given to principal tradition + auxiliary modernity(modernity: 10-30% + tradition: 70-90%), and 5 points were given to principal tradition(modernity: 0-10% + tradition: 90-100%). The analysis performed according to those criteria and methodologies led to the following findings and conclusions: Traditional components were most reproduced in the ornaments placed all over the restaurant and applied to the chirography of the restaurant logos, walls, and windows/doors in a big number. The methodology of transforming tradition was evenly applied to each of the spatial components. With the most transformations occurring to the lattices, there were many different ways to transform tradition including the partition, chirography, pattern, red lantern, furniture and ornament, and traditional materials that were turned into modern ones. Few examples of reinterpreting tradition were observed in the restaurant titles, inside floors, and ceilings, but plenty of examples were found in the walls, windows/doors, lighting, and furniture in a range of ways. Most of them reinterpreted the traditional forms and added altered patterns to them to remind customers of tradition. In short, all of the three ways of expressing tradition were actively applied to each component in an array of ways.

  • PDF

A Method of Expressing Multivariate Representative Observations Based on the Self-Consistency of Principal Components (주성분의 자기일치성에 기초한 다변량 대표관찰치의 기하적 표현)

  • Kim KeeYoung;Park YongJu
    • The Korean Journal of Applied Statistics
    • /
    • v.18 no.1
    • /
    • pp.129-135
    • /
    • 2005
  • Representative observations are useful to express explicitly the distributional variation of the data by a few selected observations corresponding to the quantiles in the univariate situation. Jones and Rice(1992) extended it to the multidimensional case by the principal component based method. This study introduces a modified version of Jones and Rice exploiting the self-consistency of principal components in expressing the chosen observation vectors. Compared to that of Jones and Rice, the suggested method tends to provide with less susceptible representative observations to the sampling variation of the data and the resulted vectors benefits from the self-consistency.

Utilizing Principal Component Analysis in Unsupervised Classification Based on Remote Sensing Data

  • Lee, Byung-Gul;Kang, In-Joan
    • Proceedings of the Korean Environmental Sciences Society Conference
    • /
    • 2003.11a
    • /
    • pp.33-36
    • /
    • 2003
  • Principal component analysis (PCA) was used to improve image classification by the unsupervised classification techniques, the K-means. To do this, I selected a Landsat TM scene of Jeju Island, Korea and proposed two methods for PCA: unstandardized PCA (UPCA) and standardized PCA (SPCA). The estimated accuracy of the image classification of Jeju area was computed by error matrix. The error matrix was derived from three unsupervised classification methods. Error matrices indicated that classifications done on the first three principal components for UPCA and SPCA of the scene were more accurate than those done on the seven bands of TM data and that also the results of UPCA and SPCA were better than those of the raw Landsat TM data. The classification of TM data by the K-means algorithm was particularly poor at distinguishing different land covers on the island. From the classification results, we also found that the principal component based classifications had characteristics independent of the unsupervised techniques (numerical algorithms) while the TM data based classifications were very dependent upon the techniques. This means that PCA data has uniform characteristics for image classification that are less affected by choice of classification scheme. In the results, we also found that UPCA results are better than SPCA since UPCA has wider range of digital number of an image.

  • PDF

Simple Compromise Strategies in Multivariate Stratification

  • Park, Inho
    • Communications for Statistical Applications and Methods
    • /
    • v.20 no.2
    • /
    • pp.97-105
    • /
    • 2013
  • Stratification (among other applications) is a popular technique used in survey practice to improve the accuracy of estimators. Its full potential benefit can be gained by the effective use of auxiliary variables in stratification related to survey variables. This paper focuses on the problem of stratum formation when multiple stratification variables are available. We first review a variance reduction strategy in the case of univariate stratification. We then discuss its use for multivariate situations in convenient and efficient ways using three methods: compromised measures of size, principal components analysis and a K-means clustering algorithm. We also consider three types of compromising factors to data when using these three methods. Finally, we compare their efficiency using data from MU281 Swedish municipality population.

Network Anomaly Detection using Hybrid Feature Selection

  • Kim Eun-Hye;Kim Se-Hun
    • Proceedings of the Korea Institutes of Information Security and Cryptology Conference
    • /
    • 2006.06a
    • /
    • pp.649-653
    • /
    • 2006
  • In this paper, we propose a hybrid feature extraction method in which Principal Components Analysis is combined with optimized k-Means clustering technique. Our approach hierarchically reduces the redundancy of features with high explanation in principal components analysis for choosing a good subset of features critical to improve the performance of classifiers. Based on this result, we evaluate the performance of intrusion detection by using Support Vector Machine and a nonparametric approach based on k-Nearest Neighbor over data sets with reduced features. The Experiment results with KDD Cup 1999 dataset show several advantages in terms of computational complexity and our method achieves significant detection rate which shows possibility of detecting successfully attacks.

  • PDF

A Study to Calculate an Efficient Covariance Matrix of Non-local Means with Principal Components Analysis (주성분 분석을 활용한 Non-local means 에서의 효율적인 공분산 행렬 계산 연구)

  • Kim, Jeonghwan;Lee, Minjeong;Jeong, Jechang
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2015.07a
    • /
    • pp.205-207
    • /
    • 2015
  • 본 논문에서는 먼저 주성분 분석 (Principal components analysis, PCA) 을 활용한 Non-local means (NLM) 을 소개하고, 주성분 분석을 하기 위해 필수적인 공분산 행렬 계산을 효율적으로 하는 방법을 제안한다. NLM 에서의 Neighborhood patch 의 크기를 $S{\times}S=S^2$, 이미지 전체의 픽셀 수를 ${\mathcal{Q}}$ 일 때 공분한 행렬을 계산 하기 위해서는 $S^2{\times}{\mathcal{Q}}$ 크기를 가지는 행렬간의 곱 연산이 필요하다. 결론적으로 본 논문에서는 이 행렬의 크기를 줄임으로써 PSNR (Peak signal-to-noise ratio) 의 손실 없이 NLM 의 복잡도를 줄일 수 있음을 보여준다.

  • PDF