• 제목/요약/키워드: method: data analysis

검색결과 22,105건 처리시간 0.052초

A Classification Method Using Data Reduction

  • Uhm, Daiho;Jun, Sung-Hae;Lee, Seung-Joo
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • 제12권1호
    • /
    • pp.1-5
    • /
    • 2012
  • Data reduction has been used widely in data mining for convenient analysis. Principal component analysis (PCA) and factor analysis (FA) methods are popular techniques. The PCA and FA reduce the number of variables to avoid the curse of dimensionality. The curse of dimensionality is to increase the computing time exponentially in proportion to the number of variables. So, many methods have been published for dimension reduction. Also, data augmentation is another approach to analyze data efficiently. Support vector machine (SVM) algorithm is a representative technique for dimension augmentation. The SVM maps original data to a feature space with high dimension to get the optimal decision plane. Both data reduction and augmentation have been used to solve diverse problems in data analysis. In this paper, we compare the strengths and weaknesses of dimension reduction and augmentation for classification and propose a classification method using data reduction for classification. We will carry out experiments for comparative studies to verify the performance of this research.

입자 성분분석을 통한 클린룸 오염제어 (Cleanroom Contamination Control using Particle Composition Analysis)

  • 이현철;김대영;이성훈;노광철;오명도
    • 대한기계학회:학술대회논문집
    • /
    • 대한기계학회 2007년도 춘계학술대회B
    • /
    • pp.2333-2337
    • /
    • 2007
  • The practical studies on the method of particle contamination control for yield enhancement in the cleanroom were carried out. The method of the contamination control was considered, which is composed of data collection, data analysis, improvement action, verification, and implement control. The composition analysis for data collection and data analysis was used in the cellular phone module packaging lines. And this method was evaluated by the variation of yield loss between before and after improvement action. In case that the composition analysis was applied, the critical sources were selected and yield loss reduction through improvement actions was also investigated. From these results, it is concluded that the composition analysis is effective solutions for particle contamination control in the cleanroom.

  • PDF

연속해석 데이터의 상호운용성을 지원하는 CAE 미들웨어와 가시화 시스템의 개발 (Development of a CAE Middleware and a Visualization System for Supporting Interoperability of Continuous CAE Analysis Data)

  • 송인호;양정삼;조현제;최상수
    • 한국CDE학회논문집
    • /
    • 제15권2호
    • /
    • pp.85-93
    • /
    • 2010
  • This paper proposes a CAE data translation and visualization technique that can verify time-varying continuous analysis simulation in a virtual reality (VR) environment. In previous research, the use of CAE analysis data has been problematic because of the lack of any interactive simulation controls for visualizing continuous simulation data. Moreover, the research on post-processing methods for real-time verification of CAE analysis data has not been sufficient. We therefore propose a scene graph based visualization method and a post-processing method for supporting interoperability of continuous CAE analysis data. These methods can continuously visualize static analysis data independently of any timeline; it can also continuously visualize dynamic analysis data that varies in relation to the timeline. The visualization system for continuous simulation data, which includes a CAE middleware that interfaces with various formats of CAE analysis data as well as functions for visualizing continuous simulation data and operational functions, enables users to verify simulation results with more realistic scenes. We also use the system to do a performance evaluation with regard to the visualization of continuous simulation data.

MODIS 손실 자료 복원을 위한 통계적 방법 개발: 평균 편차 방법, 회귀 분석 방법과 지역 변동 방법 (The development of statistical methods for retrieving MODIS missing data: Mean bias, regressions analysis and local variation method)

  • 김민욱;이종혁;박연구;송정현
    • 한국위성정보통신학회논문지
    • /
    • 제11권4호
    • /
    • pp.94-101
    • /
    • 2016
  • 원격 관측 자료인 위성 자료는 한계점이 있으며, 특히 광학 관측기를 활용하면 구름이나 기타 요인에 의해 손실 자료가 발생한다. 본 연구에서는 MODerate resolution Imaging Spectrometer(MODIS)의 관측 자료 중, 지표면 온도 자료를 대상으로 손실 자료를 복원하기 위한 방법인 평균 편차 방법, 회귀 분석 방법, 지역 변동 방법의 세 가지 복원 방법을 개발하였다. 검증을 위해 2014년과 2015년의 위성 자료에서 관측 비율을 근거로 사례를 선택하였다. 검증 자료에서 확인된 지역 변동 방법의 평균 제곱근 편차(RMSE)는 일부 사례에서 약 2 K 이상으로 다른 복원 방법에 비해 낮은 정확도를 보였으며, 회귀 분석 방법의 RMSE는 평균 약 1.13 K으로 대부분의 사례에서 가장 좋은 결과를 보였다. 평균 편차 방법 사용 시, RMSE는 회귀 분석 방법 시와 유사하게 약 1.32 K으로 나타났다.

펄스 분석 기법 및 데이터 마이닝 기법을 이용한 부분방전 패턴인식에 대한 연구 (A Study on the PD Pattern Recognition using the Pulse Analysis Method and Data Mining Methods)

  • 김정태;이욱;김지홍;구자윤
    • 대한전기학회:학술대회논문집
    • /
    • 대한전기학회 2006년도 제37회 하계학술대회 논문집 C
    • /
    • pp.1535-1537
    • /
    • 2006
  • Recently, the noise discrimination method using PD pulse waveshape analysis has been suggested to be very effective method which can improve the reliability of the on-site PD measurement. In this method, the data clusters due to PD pulses or noises can be distinguished on the PA map. And for the automatic recognition of the PD clusters, it is necessary to adopt the adaptable pattern recognition method. In this study, as for the algorithm which can recognize data clusters, the data mining method has been adopted and the result of the analysis has been reported.

  • PDF

국부 확률을 이용한 데이터 분류에 관한 연구 (A Study on Data Clustering Method Using Local Probability)

  • 손창호;최원호;이재국
    • 제어로봇시스템학회논문지
    • /
    • 제13권1호
    • /
    • pp.46-51
    • /
    • 2007
  • In this paper, we propose a new data clustering method using local probability and hypothesis theory. To cluster the test data set we analyze the local area of the test data set using local probability distribution and decide the candidate class of the data set using mean standard deviation and variance etc. To decide each class of the test data, statistical hypothesis theory is applied to the decided candidate class of the test data set. For evaluating, the proposed classification method is compared to the conventional fuzzy c-mean method, k-means algorithm and Discriminator analysis algorithm. The simulation results show more accuracy than results of fuzzy c-mean method, k-means algorithm and Discriminator analysis algorithm.

차량부품 문제에 대한 실험계획법과 Field Data 분석을 통한 신뢰성 평가연구 (Research of Reliability Assessment through the Analysis of Field Data and Taguchi Method about Vehicle Components Problem)

  • 강창학;유재복;이치우;김장수
    • 한국산업융합학회 논문집
    • /
    • 제13권4호
    • /
    • pp.211-217
    • /
    • 2010
  • As the vehicle components are various, we confront unexpected problems in the development and application of them. also warranty expenses occur in the result of unconfirmed warranty.in this paper, to solve the problems of disconnection of damper Strut cable, we applied the optimum conditions through taguchi method for improvement of durability. and we made standard of reliability by weibull analysis of the field data. we acquired reliability standard by correlation with lab data and confirmed improved components satisfying the target of reliability. The analysis of reliability by field data is very useful and we need to apply this method to other components, the correlation between field data and Lab Test has influence on satisfying the target of reliability.this method would be utilized for current mass production components and upcoming developed components. the reliability of durability should be continuously used in the basis of primary technique in cope with competitive automotive companies.

  • PDF

속성유사도에 따른 사회연결망 서브그룹의 군집유효성 (Clustering Validity of Social Network Subgroup Using Attribute Similarity)

  • 윤한성
    • 디지털산업정보학회논문지
    • /
    • 제17권1호
    • /
    • pp.75-84
    • /
    • 2021
  • For analyzing big data, the social network is increasingly being utilized through relational data, which means the connection characteristics between entities such as people and objects. When the relational data does not exist directly, a social network can be configured by calculating relational data such as attribute similarity from attribute data of entities and using it as links. In this paper, the composition method of the social network using the attribute similarity between entities as a connection relationship, and the clustering method using subgroups for the configured social network are suggested, and the clustering effectiveness of the clustering results is evaluated. The analysis results can vary depending on the type and characteristics of the data to be analyzed, the type of attribute similarity selected, and the criterion value. In addition, the clustering effectiveness may not be consistent depending on the its evaluation method. Therefore, selections and experiments are necessary for better analysis results. Since the analysis results may be different depending on the type and characteristics of the analysis target, options for clustering, etc., there is a limitation. In addition, for performance evaluation of clustering, a study is needed to compare the method of this paper with the conventional method such as k-means.

Bayesian analysis of longitudinal traits in the Korea Association Resource (KARE) cohort

  • Chung, Wonil;Hwang, Hyunji;Park, Taesung
    • Genomics & Informatics
    • /
    • 제20권2호
    • /
    • pp.16.1-16.12
    • /
    • 2022
  • Various methodologies for the genetic analysis of longitudinal data have been proposed and applied to data from large-scale genome-wide association studies (GWAS) to identify single nucleotide polymorphisms (SNPs) associated with traits of interest and to detect SNP-time interactions. We recently proposed a grid-based Bayesian mixed model for longitudinal genetic data and showed that our Bayesian method increased the statistical power compared to the corresponding univariate method and well detected SNP-time interactions. In this paper, we further analyze longitudinal obesity-related traits such as body mass index, hip circumference, waist circumference, and waist-hip ratio from Korea Association Resource data to evaluate the proposed Bayesian method. We first conducted GWAS analyses of cross-sectional traits and combined the results of GWAS analyses through a meta-analysis based on a trajectory model and a random-effects model. We then applied our Bayesian method to a subset of SNPs selected by meta-analysis to further discover SNPs associated with traits of interest and SNP-time interactions. The proposed Bayesian method identified several novel SNPs associated with longitudinal obesity-related traits, and almost 25% of the identified SNPs had significant p-values for SNP-time interactions.

저장된 정보를 이용한 활동분석방법 연구 -ABC구축을 위한 활동분석 (A Study on the Activity Analysis Method using existing Saved Data -Activity Analysis for ABC Implementation)

  • 박주식;오지영;임총규;강경식
    • 대한안전경영과학회지
    • /
    • 제3권2호
    • /
    • pp.145-154
    • /
    • 2001
  • This paper deals with Activity Analysis Method that is important procedure in Activity Based Costing System Implementation. There are many existing Activity Analysis Method, for example Interview, Questionnaire, Specialist Discussion and Work Measurement. Activity Analysis Data gained through this method has high reliability but this method bring about high cost. In case that certain company needs a strategic costing system, Activity Analysis Method which has high reliability will be need. But, if companies want the costing system as a internal decision making tool only, they need to design the ABC system fast and cheaply. This paper explains that Activity Analysis using existing finn material is good alternatives. So, this paper show the feasibility of Activity Analysis using existing firm material with comparing between job description, job specification information and Activity Analysis information.

  • PDF