• Title/Summary/Keyword: Classification Analysis

Search Result 8,006, Processing Time 0.034 seconds

A Composite Cluster Analysis Approach for Component Classification (컴포넌트 분류를 위한 복합 클러스터 분석 방법)

  • Lee, Sung-Koo
    • The KIPS Transactions:PartD
    • /
    • v.14D no.1 s.111
    • /
    • pp.89-96
    • /
    • 2007
  • Various classification methods have been developed to reuse components. These classification methods enable the user to access the needed components quickly and easily. Conventional classification approaches include the following problems: a labor-intensive domain analysis effort to build a classification structure, the representation of the inter-component relationships, difficult to maintain as the domain evolves, and applied to a limited domain. In order to solve these problems, this paper describes a composite cluster analysis approach for component classification. The cluster analysis approach is a combination of a hierarchical cluster analysis method, which generates a stable clustering structure automatically, and a non-hierarchical cluster analysis concept, which classifies new components automatically. The clustering information generated from the proposed approach can support the domain analysis process.

A Study on Community Classification and Property Analysis for Space Planning of Elementary School -Focusing on the Seoul and Gyeonggi Province- (초등학교 공간계획을 위한 지역유형분류 및 특성분석 -서울·경기 지역을 중심으로-)

  • Lee, Sang Min
    • The Journal of Sustainable Design and Educational Environment Research
    • /
    • v.3 no.2
    • /
    • pp.21-37
    • /
    • 2003
  • This study has the purpose for analysis of each region's property in order to plan a elementary school's space according to community property. For this analysis. we used classification method through classification analysis. classification analysis is one of the useful statistical analysis methode for determining each region's policy through classifying regions which have a similar property. On this study, Seoul and Kyongkido is classified by 4 groups and each group has a different community property. Such a analysis is thought of helping establishing the objective. reasonable space-plan through comparative analysis between subjective claim and objective state indicator of each region.

  • PDF

Binary classification on compositional data

  • Joo, Jae Yun;Lee, Seokho
    • Communications for Statistical Applications and Methods
    • /
    • v.28 no.1
    • /
    • pp.89-97
    • /
    • 2021
  • Due to boundedness and sum constraint, compositional data are often transformed by logratio transformation and their transformed data are put into traditional binary classification or discriminant analysis. However, it may be problematic to directly apply traditional multivariate approaches to the transformed data because class distributions are not Gaussian and Bayes decision boundary are not polynomial on the transformed space. In this study, we propose to use flexible classification approaches to transformed data for compositional data classification. Empirical studies using synthetic and real examples demonstrate that flexible approaches outperform traditional multivariate classification or discriminant analysis.

Functional Data Classification of Variable Stars

  • Park, Minjeong;Kim, Donghoh;Cho, Sinsup;Oh, Hee-Seok
    • Communications for Statistical Applications and Methods
    • /
    • v.20 no.4
    • /
    • pp.271-281
    • /
    • 2013
  • This paper considers a problem of classification of variable stars based on functional data analysis. For a better understanding of galaxy structure and stellar evolution, various approaches for classification of variable stars have been studied. Several features that explain the characteristics of variable stars (such as color index, amplitude, period, and Fourier coefficients) were usually used to classify variable stars. Excluding other factors but focusing only on the curve shapes of variable stars, Deb and Singh (2009) proposed a classification procedure using multivariate principal component analysis. However, this approach is limited to accommodate some features of the light curve data that are unequally spaced in the phase domain and have some functional properties. In this paper, we propose a light curve estimation method that is suitable for functional data analysis, and provide a classification procedure for variable stars that combined the features of a light curve with existing functional data analysis methods. To evaluate its practical applicability, we apply the proposed classification procedure to the data sets of variable stars from the project STellar Astrophysics and Research on Exoplanets (STARE).

The Difference Analysis between Maturity Stages of Venture Firms by Classification Techniques of Big Data (빅데이터 분류 기법에 따른 벤처 기업의 성장 단계별 차이 분석)

  • Jung, Byoungho
    • Journal of Korea Society of Digital Industry and Information Management
    • /
    • v.15 no.4
    • /
    • pp.197-212
    • /
    • 2019
  • The purpose of this study is to identify the maturity stages of venture firms through classification analysis, which is widely used as a big data technique. Venture companies should develop a competitive advantage in the market. And the maturity stage of a company can be classified into five stages. I will analyze a difference in the growth stage of venture firms between the survey response and the statistical classification methods. The firm growth level distinguished five stages and was divided into the period of start-up and declines. A classification method of big data uses popularly k-mean cluster analysis, hierarchical cluster analysis, artificial neural network, and decision tree analysis. I used variables that asset increase, capital increase, sales increase, operating profit increase, R&D investment increase, operation period and retirement number. The research results, each big data analysis technique showed a large difference of samples sized in the group. In particular, the decision tree and neural networks' methods were classified as three groups rather than five groups. The groups size of all classification analysis was all different by the big data analysis methods. Furthermore, according to the variables' selection and the sample size may be dissimilar results. Also, each classed group showed a number of competitive differences. The research implication is that an analysts need to interpret statistics through management theory in order to interpret classification of big data results correctly. In addition, the choice of classification analysis should be determined by considering not only management theory but also practical experience. Finally, the growth of venture firms needs to be examined by time-series analysis and closely monitored by individual firms. And, future research will need to include significant variables of the company's maturity stages.

A Study of CPC-based Technology Classification Analysis Model of Patents (CPC 기반 특허 기술 분류 분석 모델)

  • Chae, Soo-Hyeon;Gim, Jangwon
    • The Journal of the Korea Contents Association
    • /
    • v.18 no.10
    • /
    • pp.443-452
    • /
    • 2018
  • With the explosively increasing intellectual property rights, securing technological competitiveness of companies is more and more important. In particular, since patents include core technologies and element technologies, patent analysis researches are actively conducted to measure the technological value of companies. Various patent analysis studies have been conducted by the International Patent Classification(IPC), which does not include the latest technical classification, and the technical classification accuracy is low. In order to overcome this problem, the Cooperative Patent Classification(CPC), which includes the latest technology classification and detailed technical classification, has been developed. In this paper, we propose a model to analyze the classification of the technologies included in the patent by using the detailed classification system of CPC. It is possible to analyze the inventor's patents in consideration of the relation, importance, and efficiency between the detailed classification schemes of the CPCs to extract the core technology fields and to analyze the details more accurately than the existing IPC-based methods. Also, we perform the comparative evaluation with the existing IPC based patent analysis method and confirm that the proposed model shows better performance in analyzing the inventor's core technology classification.

Development of a Compound Classification Process for Improving the Correctness of Land Information Analysis in Satellite Imagery - Using Principal Component Analysis, Canonical Correlation Classification Algorithm and Multitemporal Imagery - (위성영상의 토지정보 분석정확도 향상을 위한 응용체계의 개발 - 다중시기 영상과 주성분분석 및 정준상관분류 알고리즘을 이용하여 -)

  • Park, Min-Ho
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.28 no.4D
    • /
    • pp.569-577
    • /
    • 2008
  • The purpose of this study is focused on the development of compound classification process by mixing multitemporal data and annexing a specific image enhancement technique with a specific image classification algorithm, to gain more accurate land information from satellite imagery. That is, this study suggests the classification process using canonical correlation classification technique after principal component analysis for the mixed multitemporal data. The result of this proposed classification process is compared with the canonical correlation classification result of one date images, multitemporal imagery and a mixed image after principal component analysis for one date images. The satellite images which are used are the Landsat 5 TM images acquired on July 26, 1994 and September 1, 1996. Ground truth data for accuracy assessment is obtained from topographic map and aerial photograph, and all of the study area is used for accuracy assessment. The proposed compound classification process showed superior efficiency to appling canonical correlation classification technique for only one date image in classification accuracy by 8.2%. Especially, it was valid in classifying mixed urban area correctly. Conclusively, to improve the classification accuracy when extracting land cover information using Landsat TM image, appling canonical correlation classification technique after principal component analysis for multitemporal imagery is very useful.

Microphone Type Classification for Digital Audio Forgery Detection (디지털 오디오 위조검출을 위한 마이크로폰 타입 인식)

  • Seok, Jongwon
    • Journal of Korea Multimedia Society
    • /
    • v.18 no.3
    • /
    • pp.323-329
    • /
    • 2015
  • In this paper we applied pattern recognition approach to detect audio forgery. Classification of the microphone types and models can help determining the authenticity of the recordings. Canonical correlation analysis was applied to extract feature for microphone classification. We utilized the linear dependence between two near-silence regions. To utilize the advantage of multi-feature based canonical correlation analysis, we selected three commonly used features to capture the temporal and spectral characteristics. Using three different microphones, we tested the usefulness of multi-feature based characteristics of canonical correlation analysis and compared the results with single feature based method. The performance of classification rate was carried out using the backpropagation neural network. Experimental results show the promise of canonical correlation features for microphone classification.

The Effect of Motor Ability in Children with Cerebral Palsy on Mastery Motivation (뇌성마비 아동의 신체기능이 완수동기에 미치는 영향)

  • Lee, Na-Jung;Oh, Tae-Young
    • The Journal of Korean Physical Therapy
    • /
    • v.26 no.5
    • /
    • pp.315-323
    • /
    • 2014
  • Purpose: This study was conducted in order to investigate the effect of motor ability on mastery motivation in children with cerebral palsy. Methods: Sixty children with cerebral palsy (5~12 years) and their parents participated in the study. Data on general characteristics and disability condition, Gross Motor Functional Classification System, Manual Ability Classification System, and The Dimensions of Mastery questionnaire were collected for this study. Independent t-test, and ANOVA were used for analysis of the effect of The Dimensions of Mastery questionnaire according to general and disability condition, Gross Motor Functional Classification System, and Manual Ability Classification System. Linear regression analysis was performed to determine the effects of Gross Motor Functional Classification System and Manual Ability Classification System on The Dimensions of Mastery questionnaire. SPSS win. 22.0 was used and Tukey was used for post hoc analysis, level of statistical significance was less than 0.05. Results: The Dimensions of Mastery questionnaire score showed statistically significant difference according to gender, region, type, disability rating, Gross Motor Functional Classification System, and Manual Ability Classification System (p<0.05). Gross Motor Functional Classification System and Manual Ability Classification System were the effect factor on The Dimensions of Mastery questionnaire significantly (p<0.05). Conclusion: These results suggest that motor ability of children with cerebral palsy was an important factor having an effect on The Dimensions of Mastery questionnaire.

Utilizing Principal Component Analysis in Unsupervised Classification Based on Remote Sensing Data

  • Lee, Byung-Gul;Kang, In-Joan
    • Proceedings of the Korean Environmental Sciences Society Conference
    • /
    • 2003.11a
    • /
    • pp.33-36
    • /
    • 2003
  • Principal component analysis (PCA) was used to improve image classification by the unsupervised classification techniques, the K-means. To do this, I selected a Landsat TM scene of Jeju Island, Korea and proposed two methods for PCA: unstandardized PCA (UPCA) and standardized PCA (SPCA). The estimated accuracy of the image classification of Jeju area was computed by error matrix. The error matrix was derived from three unsupervised classification methods. Error matrices indicated that classifications done on the first three principal components for UPCA and SPCA of the scene were more accurate than those done on the seven bands of TM data and that also the results of UPCA and SPCA were better than those of the raw Landsat TM data. The classification of TM data by the K-means algorithm was particularly poor at distinguishing different land covers on the island. From the classification results, we also found that the principal component based classifications had characteristics independent of the unsupervised techniques (numerical algorithms) while the TM data based classifications were very dependent upon the techniques. This means that PCA data has uniform characteristics for image classification that are less affected by choice of classification scheme. In the results, we also found that UPCA results are better than SPCA since UPCA has wider range of digital number of an image.

  • PDF