• Title/Summary/Keyword: 판별모델

Search Result 623, Processing Time 0.027 seconds

Evaluation of Classifiers Performance for Areal Features Matching (면 객체 매칭을 위한 판별모델의 성능 평가)

  • Kim, Jiyoung;Kim, Jung Ok;Yu, Kiyun;Huh, Yong
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.31 no.1
    • /
    • pp.49-55
    • /
    • 2013
  • In this paper, we proposed a good classifier to match different spatial data sets by applying evaluation of classifiers performance in data mining and biometrics. For this, we calculated distances between a pair of candidate features for matching criteria, and normalized the distances by Min-Max method and Tanh (TH) method. We defined classifiers that shape similarity is derived from fusion of these similarities by CRiteria Importance Through Intercriteria correlation (CRITIC) method, Matcher Weighting method and Simple Sum (SS) method. As results of evaluation of classifiers performance by Precision-Recall (PR) curve and area under the PR curve (AUC-PR), we confirmed that value of AUC-PR in a classifier of TH normalization and SS method is 0.893 and the value is the highest. Therefore, to match different spatial data sets, we thought that it is appropriate to a classifier that distances of matching criteria are normalized by TH method and shape similarity is calculated by SS method.

A Study on Discriminant.Classification Model of Impact Factors about Understanding of Traffic Accident Causes and Acknowledgement to Decrease Traffic Accidents (교통사고 발생원인 인식과 감소대책 인지 영향요인 판별.분류에 관한 연구)

  • 고상선;배기목;이원규;정헌영
    • Journal of Korean Society of Transportation
    • /
    • v.20 no.7
    • /
    • pp.143-153
    • /
    • 2002
  • 본 연구는 교통사고의 발생원인에 대한 인식유형과 감소대책에 대한 인지 유형별 영향요인의 정도를 분석하기 위하여 수량화이론 II류와 CHAID 분석법을 이용하여 분류모델과 판별모델을 구축하였다. 수량화이론 II류에 의한 교통사고 발생원인에 대한 인식 유형별 영향요인 판별모델은 전체 적중률이 78.4%로 매우 높게 나타났다. 편상관계수는 설명변수의 항목 중 학력, 성별, 운전경력 년 수, 소유 차종의 순으로 영향을 미치고 외적 변수인 교통사고 발생원인에 대한 유형에서는 기여 정도가 교통단속 부재 > 교통체계 미비 > 승용차 과다 사용 >잘못된 의식 때문의 순으로 나타났다. 교통사고 감소 대책에 대한 인지유형별 영향요인 판별모델은 전체 적중률이 59.9%로 높게 나타났으며, 편상관 계수는 학력, 성별, 운전경력 연수, 연령의 순으로 영향을 미치고 있고, 외적 변수인 교통사고 감소 대책에 대한 유형에서는 기여 정도가 교통단속 강화 > 대중교통수단 이용 유도 > 교통체계 개선 > 의식 개혁의 순으로 나타났다. 또한 CHAID 분석법에 의한 교통사고 발생원인에 대한 인식 유형별 영향요인 분류모델에 있어서는 예측변수로 학력, 연령, 성별, 통행수단의 네 가지 변수가, 교통사고의 감소 대책에 대한인지 유형별 영향요인 분류모델에 있어서는 학력, 운전경력 연수, 성별 그리고 통행수단의 네 가지 변수가 카이제곱 통계량 이 5%의 유의수준에서 유의한 것으로 판단되었다. 교통사고 발생원인 인식과 감소 대책의 인지 유형에 대한 빈도분석과 교차분석은 의식과 관련한 유형이 가장 높게 나타났으나 판별.분류모델에서는 교통단속과 관련한 유형이 기여 정도가 높고 의식 관련 유형이 상대적으로 낮게 나타나는 등 반대양상을 보이고 있어 심리적으로 내재되어 있고 표면에 잘 드러나지 않았던 의식 수준의 낮음이 분류모델을 통해서 명확하게 드러났다.

Identification of Foreign Objects in Soybeans Using Near-infrared Spectroscopy (근적외선 분광법을 이용한 콩과 이물질의 판별)

  • Lim, Jong-Guk;Kang, Sukwon;Lee, Kangjin;Mo, Changyeon;Son, Jaeyong
    • Food Engineering Progress
    • /
    • v.15 no.2
    • /
    • pp.136-142
    • /
    • 2011
  • The objective of this research was to classify intact soybeans and foreign objects using near-infrared (NIR) spectroscopy. Intact soybeans and foreign objects were scanned using a NIR spectrometer equipped with scanning monochromator. NIR spectra of intact soybeans and foreign objects in the wavelength range from 900 to 1800 nm were collected. The classification of intact soybeans and foreign objects were conducted by using partial least-square discriminant analysis (PLS-DA) and soft independent modelling of class analogy (SIMCA) multivariate methods. Various types of data pretreatments were tested to develop the classification models. Intact soybeans and foreign objects were successfully classified by the PLS-DA prediction model with mean normalization pretreatment. These results showed the potential of NIR spectroscopy combined with multivariate analysis as a method for classifying intact soybeans and foreign objects.

A study of methodology for identification models of cardiovascular diseases based on data mining (데이터마이닝을 이용한 심혈관질환 판별 모델 방법론 연구)

  • Lee, Bum Ju
    • The Journal of the Convergence on Culture Technology
    • /
    • v.8 no.4
    • /
    • pp.339-345
    • /
    • 2022
  • Cardiovascular diseases is one of the leading causes of death in the world. The objectives of this study were to build various models using sociodemographic variables based on three variable selection methods and seven machine learning algorithms for the identification of hypertension and dyslipidemia and to evaluate predictive powers of the models. In experiments based on full variables and correlation-based feature subset selection methods, our results showed that performance of models using naive Bayes was better than those of models using other machine learning algorithms in both two diseases. In wrapper-based feature subset selection method, performance of models using logistic regression was higher than those of models using other algorithms. Our finding may provide basic data for public health and machine learning fields.

Discrimination model of cultivation area of Corni Fructus using a GC-MS-Based metabolomics approach (GC-MS 기반 대사체학 기법을 이용한 산수유의 산지판별모델)

  • Leem, Jae-Yoon
    • Analytical Science and Technology
    • /
    • v.29 no.1
    • /
    • pp.1-9
    • /
    • 2016
  • It is believed that traditional Korean medicines can be managed more scientifically through the development of logical criteria to verify their region of cultivation, and that this could contribute to the advancement of the traditional herbal medicine industry. This study attempted to determine such criteria for Sansuyu. The volatile compounds were obtained from 20 samples of domestic Corni fructus (Sansuyu) and 45 samples of Chinese Sansuyu by steam distillation. The metabolites were identified in the NIST Mass Spectral Library via the obtained gas chromatography/mass spectrometer (GC/MS) data of 53 training samples. Data binning at 0.2 min intervals was performed to normalize the number of variables used in the statistical analysis. Multivariate statistical analyses, such as principle component analysis (PCA), partial least squares-discriminant analysis (PLS-DA), and orthogonal partial least squares-discriminant analysis (OPLS-DA) were performed using the SIMCA-P software package. Significant variables with a variable importance in the projection (VIP) score higher than 1.0 were obtained from OPLS-DA, and variables that resulted in a p-value of less than 0.05 through one-way ANOVA were selected to verify the marker compounds. Finally, among the 11 variables extracted, 1-ethylbutyl-hydroperoxide (9.089 min), nonadecane (20.170 min), butylated hydroxytoluene (25.319 min), 5β,7βH,10α-eudesm-11-en-1α-ol (25.921 min), 7,9-bis(2-methyl-2-propanyl)-1-oxaspiro[4.5]deca-6,9-diene-2,8-dione (34.257 min), and 2-decyldodecyl-benzene (54.717 min) were selected as markers to indicate the origin of Sansuyu. The statistical model developed was suitable for the determination of the geographical origin of Sansuyu. The cultivation areas of four Korean and eight Chinese Sansuyu samples were predicted via the established OPLS-DA model, and it was confirmed that 11 of the 12 samples were accurately classified.

Video Camera Model Identification System Using Deep Learning (딥 러닝을 이용한 비디오 카메라 모델 판별 시스템)

  • Kim, Dong-Hyun;Lee, Soo-Hyeon;Lee, Hae-Yeoun
    • The Journal of Korean Institute of Information Technology
    • /
    • v.17 no.8
    • /
    • pp.1-9
    • /
    • 2019
  • With the development of imaging information communication technology in modern society, imaging acquisition and mass production technology have developed rapidly. However, crime rates using these technology are increased and forensic studies are conducted to prevent it. Identification techniques for image acquisition devices are studied a lot, but the field is limited to images. In this paper, camera model identification technique for video, not image is proposed. We analyzed video frames using the trained model with images. Through training and analysis by considering the frame characteristics of video, we showed the superiority of the model using the P frame. Then, we presented a video camera model identification system by applying a majority-based decision algorithm. In the experiment using 5 video camera models, we obtained maximum 96.18% accuracy for each frame identification and the proposed video camera model identification system achieved 100% identification rate for each camera model.

Classification Model of Chronic Gastritis According to The Feature Extraction Method of Radial Artery Pulse Signal (맥파의 특징점 추출 방법에 따른 만성위염 판별 모형)

  • Choi, Sang-Ho;Shin, Ki-Young;Kim, Jeauk;Jin, Seung-Oh;Lee, Tea-Bum
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.51 no.1
    • /
    • pp.185-194
    • /
    • 2014
  • One in every 10 persons suffer from chronic gastritis in Korea. Endoscopy is most commonly used to diagnose the chronic gastritis. Endoscopic diagnosis is precise but it is accompanied with pain and high cost. According to pulse diagnosis in Traditional East Asian Medicine, health problems in stomach can be diagnosed with radial pulse signals in 'Guan' location in the right wrist, which are non-invasive and cost-effective. In this study, we developed a classification model of chronic gastritis using pulse signals in right 'Guan' location. We used both linear discrimination method and logistic regression model with respect to pulse features obtained with a peak-valley detection algorithm and a Gaussian model. As a result, we obtained sensitivity ranged between 77%~89% and specificity ranged between 72%~83% depending on classification models and feature extraction methods, and the average classification rates were approximately 80%, irrespective of the models. Specifically, the Gaussian model were featured by superior sensitivities (89.1% and 87.5%) while the peak-valley detection method showed superior specificities (82.8% and 81.3%), and the average classification rate (sensitivity + specificity) of the Gaussian model was 80.9% which was 1.2% ahead of the peak-valley method. In conclusion, we obtained a reliable classification model for the chronic gastritis based on the radial pulse feature extraction algorithms, where the Gaussian model was featured by outperformed sensitivity and the peak-valley method was featured by outperformed specificity.

A Study on the Analysis of Urban Highways Traffic Accident's Impact Factors Based on Building Discriminant Models - In Busan Metropolitan City - (판별모델 구축에 따른 도시고속도로의 교통사고 영향요인 분석에 관한 연구 - 부산지역 사례를 중심으로 -)

  • Jeong, Yong-Hwa;Choi, Yang-Won
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.34 no.4
    • /
    • pp.1269-1278
    • /
    • 2014
  • The urban highway, which is a motorway constructed to solve traffic issues, has the characteristic of extremely high damage to life during traffic accidents because the speed of vehicles is higher than typical roadways. In particular, because traffic accidents involving serious injuries hold a very important place among overall traffic accidents, analysis on factors affecting the occurrence of traffic accidents involving serious injuries must be considered with priority when establishing a reduction measure. Therefore, the study built a model that was capable of distinguishing the degree of the factors as part of microscopic analysis for investigating the complex effect of many elements concerning the occurrence of traffic accidents involving serious injuries in urban highways. The results are as follows. First, discriminant model showed a comparatively high level in overall accuracy rates, and, considering the correlation ratio, the models were determined to be valid, as all characteristics of the factors were clearly distinguished. Second, the problems of traffic accidents involving serious injuries on urban highways according to each factor, were clearly drawn out through the discriminant model. Third, the improvement measure for the problems drawn out from the discriminant models were clearly proposed.

Verification of Transliteration Pairs Using Distance LSTM-CNN with Layer Normalization (Distance LSTM-CNN with Layer Normalization을 이용한 음차 표기 대역 쌍 판별)

  • Lee, Changsu;Cheon, Juryong;Kim, Joogeun;Kim, Taeil;Kang, Inho
    • 한국어정보학회:학술대회논문집
    • /
    • 2017.10a
    • /
    • pp.76-81
    • /
    • 2017
  • 외국어로 구성된 용어를 발음에 기반하여 자국의 언어로 표기하는 것을 음차 표기라 한다. 국가 간의 경계가 허물어짐에 따라, 외국어에 기원을 두는 용어를 설명하기 위해 뉴스 등 다양한 웹 문서에서는 동일한 발음을 가지는 외국어 표기와 한국어 표기를 혼용하여 사용하고 있다. 이에 좋은 검색 결과를 가져오기 위해서는 외국어 표기와 더불어 사람들이 많이 사용하는 다양한 음차 표기를 함께 검색에 활용하는 것이 중요하다. 음차 표기 모델과 음차 표기 대역 쌍 추출을 통해 음차 표현을 생성하는 기존 방법 대신, 본 논문에서는 신뢰할 수 있는 다양한 음차 표현을 찾기 위해 문서에서 음차 표기 후보를 찾고, 이 음차 표기 후보가 정확한 표기인지 판별하는 방식을 제안한다. 다양한 딥러닝 모델을 비교, 검토하여 최종적으로 음차 표기 대역 쌍 판별에 특화된 모델인 Distance LSTM-CNN 모델을 제안하며, 제안하는 모델의 Batch Size 영향을 줄이고 학습 시 수렴 속도 개선을 위해 Layer Normalization을 적용하는 방법을 보인다.

  • PDF

Printer Identification Methods Using Global and Local Feature-Based Deep Learning (전역 및 지역 특징 기반 딥러닝을 이용한 프린터 장치 판별 기술)

  • Lee, Soo-Hyeon;Lee, Hae-Yeoun
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.8 no.1
    • /
    • pp.37-44
    • /
    • 2019
  • With the advance of digital IT technology, the performance of the printing and scanning devices is improved and their price becomes cheaper. As a result, the public can easily access these devices for crimes such as forgery of official and private documents. Therefore, if we can identify which printing device is used to print the documents, it would help to narrow the investigation and identify suspects. In this paper, we propose a deep learning model for printer identification. A convolutional neural network model based on local features which is widely used for identification in recent is presented. Then, another model including a step to calculate global features and hence improving the convergence speed and accuracy is presented. Using 8 printer models, the performance of the presented models was compared with previous feature-based identification methods. Experimental results show that the presented model using local feature and global feature achieved 97.23% and 99.98% accuracy respectively, which is much better than other previous methods in accuracy.