• Title/Summary/Keyword: classification criterion

Search Result 284, Processing Time 0.024 seconds

Comparison of Classification Models for Sequential Flight Test Results (단계별 비행훈련 성패 예측 모형의 성능 비교 연구)

  • Sohn, So-Young;Cho, Yong-Kwan;Choi, Sung-Ok;Kim, Young-Joun
    • Journal of the Ergonomics Society of Korea
    • /
    • v.21 no.1
    • /
    • pp.1-14
    • /
    • 2002
  • The main purpose of this paper is to present selection criteria for ROK Airforce pilot training candidates in order to save costs involved in sequential pilot training. We use classification models such Decision Tree, Logistic Regression and Neural Network based on aptitude test results of 288 ROK Air Force applicants in 1994-1996. Different models are compared in terms of classification accuracy, ROC and Lift-value. Neural network is evaluated as the best model for each sequential flight test result while Logistic regression model outperforms the rest of them for discriminating the last flight test result. Therefore we suggest a pilot selection criterion based on this logistic regression. Overall. we find that the factors such as Attention Sharing, Speed Tracking, Machine Comprehension and Instrument Reading Ability having significant effects on the flight results. We expect that the use of our criteria can increase the effectiveness of flight resources.

A Geostatistical Study Using Qualitative Information for Multiple Rock Classification -1. Theory (다분적 암반분류를 위한 정성적 자료의 지구통계학적 연구 1.이론)

  • 유광호
    • Geotechnical Engineering
    • /
    • v.11 no.2
    • /
    • pp.71-78
    • /
    • 1995
  • In this paper, a study was performed on classifying a rock mass into multiple classes as in rock mass classification systems, such as RMR system and Q system etc. In a situation with only limited quantitative data available, it was sought to employ a way of incorporating qualitative data in a systematical and reasonable manner. It is based on the realm of Geostatistics. In particular, indicator kriging technique, which is one of non-parametric approaches, was used. As a selection criterion for an optimal classification, the cost of errors was adopted. As a result, the binary rock classification method developed before was extended and generalized for multiple rock classification with its total number of classes unlimited.

  • PDF

A novel reliability analysis method based on Gaussian process classification for structures with discontinuous response

  • Zhang, Yibo;Sun, Zhili;Yan, Yutao;Yu, Zhenliang;Wang, Jian
    • Structural Engineering and Mechanics
    • /
    • v.75 no.6
    • /
    • pp.771-784
    • /
    • 2020
  • Reliability analysis techniques combining with various surrogate models have attracted increasing attention because of their accuracy and great efficiency. However, they primarily focus on the structures with continuous response, while very rare researches on the reliability analysis for structures with discontinuous response are carried out. Furthermore, existing adaptive reliability analysis methods based on importance sampling (IS) still have some intractable defects when dealing with small failure probability, and there is no related research on reliability analysis for structures involving discontinuous response and small failure probability. Therefore, this paper proposes a novel reliability analysis method called AGPC-IS for such structures, which combines adaptive Gaussian process classification (GPC) and adaptive-kernel-density-estimation-based IS. In AGPC-IS, an efficient adaptive strategy for design of experiments (DoE), taking into consideration the classification uncertainty, the sampling uniformity and the regional classification accuracy improvement, is developed with the purpose of improving the accuracy of Gaussian process classifier. The adaptive kernel density estimation is introduced for constructing the quasi-optimal density function of IS. In addition, a novel and more precise stopping criterion is also developed from the perspective of the stability of failure probability estimation. The efficiency, superiority and practicability of AGPC-IS are verified by three examples.

Database Model of Subway Construction NAS Operating System for Scheduling Management Science (공정관리 과학화를 위한 지하철공사 NAS운영체계 데이터베이스 모델링 구축)

  • Choi, Jaejin;Cho, Byounghoo;Park, Hongtae
    • Journal of the Society of Disaster Information
    • /
    • v.13 no.3
    • /
    • pp.322-331
    • /
    • 2017
  • This study proposed subway construction information classification system based on civil engineering information classification system proposed by Korea Institute of Construction Technology. Also, Based on this criterion, This study established data modeling for NAS operating system Composed of construction information classification system - network - operation and presented an relational database integrated model. The data modeling method proposed in this study can be applied to other civil engineering facilities, so it can be operated as scientific NAS.

The Noise Influence Assessment according to the Change of the Offset Type Print Machine's Power (옵셋 인쇄기계 동력규모 변화에 따른 소음 영향 평가)

  • Gu, Jinhoi;Kwon, Myunghee;Lee, Wooseok;Lee, Jaewon;Park, Hyungkyu;Kim, Samsu;Yun, Heekyung;Lee, Kyumok;Jung, Daekwan;Seo, Chungyoul
    • Transactions of the Korean Society for Noise and Vibration Engineering
    • /
    • v.24 no.9
    • /
    • pp.682-686
    • /
    • 2014
  • Nowadays, the needs to revise the classification criteria for noise emission facilities have been suggested by the related industries. Because there existed many reasonable factors in the criteria regarding the noise emission facilities. And the noise emission facility classification criterion of the print machine changed from 50 HP to 100 HP in 2013. But the increasement of the noise emission facility classification criterion of the print machine can cause adverse effects like the bigger noise. So, in this paper, we measured the print machine's sound power level according to the changes of the print machine's power to assess the adverse effects. The measurement method applied with KS I ISO 9614-2(1996). The corelation between the sound power level and the power of print machines was analyzed by regression analysis. In this paper, we found that the sound power level of the print machines can increase about 1.3 dB in the condition of that the power of print machine increases from 50 HP to 100 HP. And we found that the sound power level of the print machines can increase about 1.0 dB for a increasement of 1,000 SPH(sheet per hour) of printing speed. The noise emission characteristics of print machine stuied in this paper will be useful to design the noise reduction plan in the future.

Odds curve for two classification distributions (두 분류 분포를 위한 오즈 곡선)

  • Hong, Chong Sun;Oh, Se Hyeon;Oh, Tae Gyu
    • The Korean Journal of Applied Statistics
    • /
    • v.34 no.2
    • /
    • pp.225-238
    • /
    • 2021
  • The ROC, TOC, and TROC curves, which are visually descriptive methods of exploring the performance of the binary classification model, are implemented with TP, TN, FP, FN which consist of the confusion matrix, as well as their ratios TPR, TNR, FPR, FNR. In this study, we consider two types odds and then propose an odds curve representing these odds. And show the relationship between the odds curve and ROC curve. Based on the odds curve, we propose not only two statistics that measure the discriminant power of the odds curve but also the criteria for validation ratings of the odds curve. According to the shape of the odds curves, two classification distributions can be estimated and a criterion for validation ratings can be determined. The odds curve can be meaningfully used like other visual methods, and two kinds of measures for the discriminant power can be also applied together as an alternative criterion.

An Efficient One Class Classifier Using Gaussian-based Hyper-Rectangle Generation (가우시안 기반 Hyper-Rectangle 생성을 이용한 효율적 단일 분류기)

  • Kim, Do Gyun;Choi, Jin Young;Ko, Jeonghan
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.41 no.2
    • /
    • pp.56-64
    • /
    • 2018
  • In recent years, imbalanced data is one of the most important and frequent issue for quality control in industrial field. As an example, defect rate has been drastically reduced thanks to highly developed technology and quality management, so that only few defective data can be obtained from production process. Therefore, quality classification should be performed under the condition that one class (defective dataset) is even smaller than the other class (good dataset). However, traditional multi-class classification methods are not appropriate to deal with such an imbalanced dataset, since they classify data from the difference between one class and the others that can hardly be found in imbalanced datasets. Thus, one-class classification that thoroughly learns patterns of target class is more suitable for imbalanced dataset since it only focuses on data in a target class. So far, several one-class classification methods such as one-class support vector machine, neural network and decision tree there have been suggested. One-class support vector machine and neural network can guarantee good classification rate, and decision tree can provide a set of rules that can be clearly interpreted. However, the classifiers obtained from the former two methods consist of complex mathematical functions and cannot be easily understood by users. In case of decision tree, the criterion for rule generation is ambiguous. Therefore, as an alternative, a new one-class classifier using hyper-rectangles was proposed, which performs precise classification compared to other methods and generates rules clearly understood by users as well. In this paper, we suggest an approach for improving the limitations of those previous one-class classification algorithms. Specifically, the suggested approach produces more improved one-class classifier using hyper-rectangles generated by using Gaussian function. The performance of the suggested algorithm is verified by a numerical experiment, which uses several datasets in UCI machine learning repository.

Semi-Supervised Learning by Gaussian Mixtures (정규 혼합분포를 이용한 준지도 학습)

  • Choi, Byoung-Jeong;Chae, Youn-Seok;Choi, Woo-Young;Park, Chang-Yi;Koo, Ja-Yong
    • The Korean Journal of Applied Statistics
    • /
    • v.21 no.5
    • /
    • pp.825-833
    • /
    • 2008
  • Discriminant analysis based on Gaussian mixture models, an useful tool for multi-class classifications, can be extended to semi-supervised learning. We consider a model selection problem for a Gaussian mixture model in semi-supervised learning. More specifically, we adopt Bayesian information criterion to determine the number of subclasses in the mixture model. Through simulations, we illustrate the usefulness of the criterion.

TIME-VARIANT OUTLIER DETECTION METHOD ON GEOSENSOR NETWORKS

  • Kim, Dong-Phil;I, Gyeong-Min;Lee, Dong-Gyu;Ryu, Keun-Ho
    • Proceedings of the KSRS Conference
    • /
    • 2008.10a
    • /
    • pp.410-413
    • /
    • 2008
  • Existing Outlier detections have been widely studied in geosensor networks. Recently, machine learning and data mining have been applied the outlier detection method to build a model that distinguishes outliers based on anchored criterion. However, it is difficult for the existing methods to detect outliers against incoming time-variant data, because outlier detection needs to monitor incoming data and classify irregular attacks. Therefore, in order to solve the problem, we propose a time-variant outlier detection using 2-dimensional grid method based on unanchored criterion. In the paper, outliers using geosensor data was performed to classify efficiently. The proposed method can be utilized applications such as network intrusion detection, stock market analysis, and error data detection in bank account.

  • PDF

Influence of Data Preprocessing

  • Zhu, Changming;Gao, Daqi
    • Journal of Computing Science and Engineering
    • /
    • v.10 no.2
    • /
    • pp.51-57
    • /
    • 2016
  • In this paper, we research the influence of data preprocessing. We conclude that using different preprocessing methods leads to different classification performances. Moreover, not all data preprocessing methods are necessary, and a criterion is given to make sure which data preprocessing is necessary and which one is effective. Experiments on some real-world data sets validate that different data preprocessing methods result in different effects. Furthermore, experiments about some algorithms with different preprocessing methods also confirm that preprocessing has a great influence on the performance of a classifier.