• 제목/요약/키워드: Classification Accuracy Test

검색결과 400건 처리시간 0.028초

베이지안 학습을 이용한 문서의 자동분류 (An Automatic Document Classification with Bayesian Learning)

  • 김진상;신양규
    • Journal of the Korean Data and Information Science Society
    • /
    • 제11권1호
    • /
    • pp.19-30
    • /
    • 2000
  • 정보통신기술의 비약적인 발전은 온라인으로 생성되는 전자문서의 양을 폭발적으로 증가시키고 있다. 따라서 수동으로 문서를 분류하던 종래의 방법 대신 문서의 자동분유 기술 개발이 특별히 요구되고 있다. 본 논문에서는 베이지안 학습 기법을 이용하여 문서를 자동으로 분류하는 방법을 연구하고, 20개의 유즈넷 뉴스그룹 문서들을 분류하도록 시험하였다. 사용한 알고리즘은 Naive Bayes Classifier이며, 구현한 시스템을 이용해 유즈넷 문서를 대상으로 자동분류를 실험한 결과 분류의 정확률이 약 77%로 나타났다.

  • PDF

Variations of AlexNet and GoogLeNet to Improve Korean Character Recognition Performance

  • Lee, Sang-Geol;Sung, Yunsick;Kim, Yeon-Gyu;Cha, Eui-Young
    • Journal of Information Processing Systems
    • /
    • 제14권1호
    • /
    • pp.205-217
    • /
    • 2018
  • Deep learning using convolutional neural networks (CNNs) is being studied in various fields of image recognition and these studies show excellent performance. In this paper, we compare the performance of CNN architectures, KCR-AlexNet and KCR-GoogLeNet. The experimental data used in this paper is obtained from PHD08, a large-scale Korean character database. It has 2,187 samples of each Korean character with 2,350 Korean character classes for a total of 5,139,450 data samples. In the training results, KCR-AlexNet showed an accuracy of over 98% for the top-1 test and KCR-GoogLeNet showed an accuracy of over 99% for the top-1 test after the final training iteration. We made an additional Korean character dataset with fonts that were not in PHD08 to compare the classification success rate with commercial optical character recognition (OCR) programs and ensure the objectivity of the experiment. While the commercial OCR programs showed 66.95% to 83.16% classification success rates, KCR-AlexNet and KCR-GoogLeNet showed average classification success rates of 90.12% and 89.14%, respectively, which are higher than the commercial OCR programs' rates. Considering the time factor, KCR-AlexNet was faster than KCR-GoogLeNet when they were trained using PHD08; otherwise, KCR-GoogLeNet had a faster classification speed.

Deep learning for the classification of cervical maturation degree and pubertal growth spurts: A pilot study

  • Mohammad-Rahimi, Hossein;Motamadian, Saeed Reza;Nadimi, Mohadeseh;Hassanzadeh-Samani, Sahel;Minabi, Mohammad A. S.;Mahmoudinia, Erfan;Lee, Victor Y.;Rohban, Mohammad Hossein
    • 대한치과교정학회지
    • /
    • 제52권2호
    • /
    • pp.112-122
    • /
    • 2022
  • Objective: This study aimed to present and evaluate a new deep learning model for determining cervical vertebral maturation (CVM) degree and growth spurts by analyzing lateral cephalometric radiographs. Methods: The study sample included 890 cephalograms. The images were classified into six cervical stages independently by two orthodontists. The images were also categorized into three degrees on the basis of the growth spurt: pre-pubertal, growth spurt, and post-pubertal. Subsequently, the samples were fed to a transfer learning model implemented using the Python programming language and PyTorch library. In the last step, the test set of cephalograms was randomly coded and provided to two new orthodontists in order to compare their diagnosis to the artificial intelligence (AI) model's performance using weighted kappa and Cohen's kappa statistical analyses. Results: The model's validation and test accuracy for the six-class CVM diagnosis were 62.63% and 61.62%, respectively. Moreover, the model's validation and test accuracy for the three-class classification were 75.76% and 82.83%, respectively. Furthermore, substantial agreements were observed between the two orthodontists as well as one of them and the AI model. Conclusions: The newly developed AI model had reasonable accuracy in detecting the CVM stage and high reliability in detecting the pubertal stage. However, its accuracy was still less than that of human observers. With further improvements in data quality, this model should be able to provide practical assistance to practicing dentists in the future.

POTENTIAL OF HYPERSPECTRAL DATA FOR THE CLASSIFICA TION OF VITD SOIL CLASSES

  • Kim Sun-Hwa;Ma Jung-Rim;Lee Kyu-Sung;Eo Yang-Dam;Lee Yong-Woong
    • 대한원격탐사학회:학술대회논문집
    • /
    • 대한원격탐사학회 2005년도 Proceedings of ISRS 2005
    • /
    • pp.221-224
    • /
    • 2005
  • Hyperspectral image data have great potential to depict more detailed information on biophysical characteristics of surface materials, which are not usually available with multispectral data. This study aims to test the potential of hyperspectral data for classifying five soil classes defined by the vector product interim terrain data (VITD). In this study, we try to classify surface materials of bare soil over the study area in Korea using both hyperspectral and multispectral image data. Training and test samples for classification are selected with using VITD vector map. The spectral angle mapper (SAM) method is applied to the EO-I Hyperion data and Landsat ETM+ data, that has been radiometrically corrected and geo-rectified. Higher classification accuracy is obtained with the hyperspectral data for classifying five soil classes of gravel, evaporites, inorganic silt and sand.

  • PDF

자동 고장 판별 및 거리 측정 기능을 갖는 휴대용 케이블 고장 검출 장치 개발 (Development of Portable Cable Fault Detection System with Automatic Fault Distinction and Distance Measurement)

  • 김재진;전정채
    • 전기학회논문지
    • /
    • 제65권10호
    • /
    • pp.1774-1779
    • /
    • 2016
  • This paper proposes a portable cable fault detection system with automatic fault distinction and distance measurement using time-frequency correlation and reference signal elimination method and automatic fault classification algorithm in order to have more accurate fault determination and location detection than conventional time domain refelectometry (TDR) system despite increased signal attenuation due to the long distance to cable fault location. The performance of the developed system method was validated via an experiment in the test field constructed for the standardized performance test of power cable fault location equipments. The performance evaluation showed that accuracy of the developed system is less than 1.34%. Also, an error of automatic fault type and location by detection of phase and peak value through elimination of the reference signal and normalization of correlation coefficient and automatic fault classification algorithm not occurred.

Transfer-learning-based classification of pathological brain magnetic resonance images

  • Serkan Savas;Cagri Damar
    • ETRI Journal
    • /
    • 제46권2호
    • /
    • pp.263-276
    • /
    • 2024
  • Different diseases occur in the brain. For instance, hereditary and progressive diseases affect and degenerate the white matter. Although addressing, diagnosing, and treating complex abnormalities in the brain is challenging, different strategies have been presented with significant advances in medical research. With state-of-art developments in artificial intelligence, new techniques are being applied to brain magnetic resonance images. Deep learning has been recently used for the segmentation and classification of brain images. In this study, we classified normal and pathological brain images using pretrained deep models through transfer learning. The EfficientNet-B5 model reached the highest accuracy of 98.39% on real data, 91.96% on augmented data, and 100% on pathological data. To verify the reliability of the model, fivefold cross-validation and a two-tier cross-test were applied. The results suggest that the proposed method performs reasonably on the classification of brain magnetic resonance images.

국부 확률을 이용한 데이터 분류에 관한 연구 (A Study on Data Clustering Method Using Local Probability)

  • 손창호;최원호;이재국
    • 제어로봇시스템학회논문지
    • /
    • 제13권1호
    • /
    • pp.46-51
    • /
    • 2007
  • In this paper, we propose a new data clustering method using local probability and hypothesis theory. To cluster the test data set we analyze the local area of the test data set using local probability distribution and decide the candidate class of the data set using mean standard deviation and variance etc. To decide each class of the test data, statistical hypothesis theory is applied to the decided candidate class of the test data set. For evaluating, the proposed classification method is compared to the conventional fuzzy c-mean method, k-means algorithm and Discriminator analysis algorithm. The simulation results show more accuracy than results of fuzzy c-mean method, k-means algorithm and Discriminator analysis algorithm.

뇨 스트립 분류에서 육안비색법과 신경회로망 알고리즘 비교 (Comparison of visual colorimetric Analysis and neural network algorithm in urine strip classification)

  • Eum, Sang-hee
    • 한국정보통신학회논문지
    • /
    • 제24권10호
    • /
    • pp.1394-1397
    • /
    • 2020
  • The urine test used as a basic test method of in vitro diagnosis for health care has been used for a long time to be simple and convenient. The urine test method is using a color that appears depending on the change in the ion concentration that reacts over time buried in the standard color test paper(Strips) with a urine sample applied to some reaction reagents. In this paper, it was proposed a neural network algorithm to obtain a suitable and reproducibility and accuracy classifier suitable for the urine analysis system. The experimental results were compared with the visual colorimetric analysis, and the neural network algorithm showed better results.

근전도 신호기반 손목 움직임의 추정을 위한 다중 특징점 추출 기법 알고리즘 (Improvements of Multi-features Extraction for EMG for Estimating Wrist Movements)

  • 김서준;정의철;이상민;송영록
    • 전기학회논문지
    • /
    • 제61권5호
    • /
    • pp.757-762
    • /
    • 2012
  • In this paper, the multi feature extraction algorithm for estimation of wrist movements based on Electromyogram(EMG) is proposed. For the extraction of precise features from the EMG signals, the difference absolute mean value(DAMV), the mean absolute value(MAV), the root mean square(RMS) and the difference absolute standard deviation value(DASDV) to consider amplitude characteristic of EMG signals are used. We figure out a more accurate feature-set by combination of two features out of these, because of multi feature extraction algorithm is more precise than single feature method. Also, for the motion classification based on EMG, the linear discriminant analysis(LDA), the quadratic discriminant analysis(QDA) and k-nearest neighbor(k-NN) are used. We implemented a test targeting twenty adult male to identify the accuracy of EMG pattern classification of wrist movements such as up, down, right, left and rest. As a result of our study, the LDA, QDA and k-NN classification method using feature-set with MAV and DASDV showed respectively 87.59%, 89.06%, 91.75% accuracy.

Linear Spectral Mixture Analysis of Landsat Imagery for Wetland land-Cover Classification in Paldang Reservoir and Vicinity

  • Kim, Sang-Wook;Park, Chong-Hwa
    • 대한원격탐사학회지
    • /
    • 제20권3호
    • /
    • pp.197-205
    • /
    • 2004
  • Wetlands are lands with a mixture of water, herbaceous or woody vegetation and wet soil. And linear spectral mixture analysis (LSMA) is one of the most often used methods in handling the spectral mixture problem. This study aims to test LSMA is an enhanced routine for classification of wetland land-covers in Paldang reservoir and vicinity (paldang Reservoir) using Landsat TM and ETM+ imagery. In the LSMA process, reference endmembers were driven from scatter-plots of Landsat bands 3, 4 and 5, and a series of endmember models were developed based on green vegetation (GV), soil and water endmembers which are the main indicators of wetlands. To consider phenological characteristics of Paldang Reservoir, a soil endmember was subdivided into bright and dark soil endmembers in spring and a green vegetation (GV) endmember was subdivided into GV tree and GV herbaceous endmembers in fall. We found that LSMA fractions improved the classification accuracy of the wetland land-cover. Four endmember models provided better GV and soil discrimination and the root mean squared (RMS) errors were 0.011 and 0.0039, in spring and fall respectively. Phenologically, a fall image is more appropriate to classify wetland land-cover than spring's. The classification result using 4 endmember fractions of a fall image reached 85.2 and 74.2 percent of the producer's and user's accuracy respectively. This study shows that this routine will be an useful tool for identifying and monitoring the status of wetlands in Paldang Reservoir.