• Title/Summary/Keyword: Classification Accuracy Test

Pest Control System using Deep Learning Image Classification Method

  • Moon, Backsan;Kim, Daewon
    • Journal of the Korea Society of Computer and Information / v.24 no.1 / pp.9-23 / 2019
  • In this paper, we propose a layer structure for a CNN (Convolutional Neural Network) pest image classifier and a background-removal image processing algorithm that improves classification accuracy, with the aim of building a smart monitoring system for pine wilt pest control. We constructed and trained a CNN classifier on collected images of pine wilt pest mediators, then ran experiments to verify the model's classification accuracy and the effect of the proposed classification algorithm. The proposed method accurately detected and preprocessed the object region in all test images, yielding a classification accuracy of about 98.91%. This shows that the layer structure of the proposed CNN classifier classified the targeted pest images effectively in various environments. In a field test using the Smart Trap for capturing pine wilt pest mediators, the proposed classification algorithm was also effective in a real environment: classification accuracy was 88.25%, an improvement of about 8.12% when the image-cropping preprocessing was applied. Ultimately, we plan to apply the techniques to field tests at various sites and verify their functionality.

Classification Accuracy Improvement for Decision Tree (의사결정트리의 분류 정확도 향상)

  • Rezene, Mehari Marta;Park, Sanghyun
    • Proceedings of the Korea Information Processing Society Conference / 2017.04a / pp.787-790 / 2017
  • Data quality is the main issue in classification problems; in general, noisy instances in the training dataset prevent robust classification performance. Such instances may cause the generated decision tree to over-fit and its accuracy to decrease. Decision trees are useful, efficient, and commonly used for solving various real-world classification problems in data mining. In this paper, we introduce a preprocessing technique to improve the classification accuracy of the C4.5 decision tree algorithm. In the proposed preprocessing method, a naive Bayes classifier removes the noisy instances from the training dataset. We applied the method to a real e-commerce sales dataset and compared its performance against the existing C4.5 decision tree classifier. In the experiments, the proposed method improved classification accuracy by 8.5% and 14.32% on the training dataset and under 10-fold cross-validation, respectively.
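
The noise-filtering step described in the abstract above can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: it assumes categorical features, Laplace smoothing, and a removal rule that drops any training instance whose naive Bayes prediction disagrees with its recorded label. The abstract does not state the exact removal criterion, and the function names and toy data are hypothetical.

```python
import math
from collections import Counter, defaultdict


def train_nb(rows, labels):
    """Count class and per-feature value frequencies for categorical naive Bayes."""
    class_counts = Counter(labels)
    value_counts = defaultdict(Counter)  # (feature index, label) -> value counts
    values_seen = defaultdict(set)       # feature index -> distinct values
    for row, label in zip(rows, labels):
        for i, value in enumerate(row):
            value_counts[(i, label)][value] += 1
            values_seen[i].add(value)
    return class_counts, value_counts, values_seen


def predict_nb(model, row):
    """Return the label with the highest log-posterior (Laplace smoothing)."""
    class_counts, value_counts, values_seen = model
    total = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for label, n in class_counts.items():
        score = math.log(n / total)
        for i, value in enumerate(row):
            count = value_counts[(i, label)][value]
            score += math.log((count + 1) / (n + len(values_seen[i])))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


def filter_noisy(rows, labels):
    """Assumed rule: keep only instances whose NB prediction matches the label."""
    model = train_nb(rows, labels)
    kept = [(r, y) for r, y in zip(rows, labels) if predict_nb(model, r) == y]
    return [r for r, _ in kept], [y for _, y in kept]


# Hypothetical toy data: the last instance contradicts the three identical "buy" rows.
rows = [("high", "yes")] * 3 + [("low", "no")] * 3 + [("high", "yes")]
labels = ["buy"] * 3 + ["skip"] * 3 + ["skip"]
clean_rows, clean_labels = filter_noisy(rows, labels)
```

The cleaned `clean_rows` and `clean_labels` would then be passed to the decision tree learner in place of the raw training set.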

Improving Urban Vegetation Classification by Including Height Information Derived from High-Spatial Resolution Stereo Imagery

  • Myeong, Soo-Jeong
    • Korean Journal of Remote Sensing / v.21 no.5 / pp.383-392 / 2005
  • Vegetation classes, especially grass and tree classes, are often confused in classification when conventional spectral pattern recognition techniques are used to classify urban areas. This paper reports on a study to improve the classification results by using an automated process of considering height information in separating urban vegetation classes, specifically tree and grass, using three-band, high-spatial resolution, digital aerial imagery. Height information was derived photogrammetrically from stereo pair imagery using cross correlation image matching to estimate differential parallax for vegetation pixels. A threshold value of differential parallax was used to assess whether the original class was correct. The average increase in overall accuracy for three test stereo pairs was 7.8%, and detailed examination showed that pixels reclassified as grass improved the overall accuracy more than pixels reclassified as tree. Visual examination and statistical accuracy assessment of four test areas showed improvement in vegetation classification with the increase in accuracy ranging from 3.7% to 18.1%. Vegetation classification can, in fact, be improved by adding height information to the classification procedure.
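
As a rough illustration of the thresholding step in the abstract above, the sketch below re-checks each vegetation pixel against a differential-parallax threshold. It assumes that larger differential parallax indicates a taller object, so pixels at or above the threshold are treated as tree and the rest as grass. The function name, the threshold value, and the sample pixels are hypothetical and not taken from the paper.

```python
def reclassify_vegetation(pixels, threshold):
    """Re-label tree/grass pixels from differential parallax; leave other classes alone.

    pixels: list of (spectral_label, differential_parallax) tuples.
    threshold: assumed cut-off separating tall (tree) from low (grass) vegetation.
    """
    result = []
    for label, parallax in pixels:
        if label in ("tree", "grass"):
            label = "tree" if parallax >= threshold else "grass"
        result.append(label)
    return result


# Hypothetical pixels: (label from spectral classification, differential parallax)
pixels = [("tree", 0.2), ("grass", 1.5), ("grass", 0.1), ("water", 2.0)]
relabeled = reclassify_vegetation(pixels, threshold=1.0)
```

Here the first two pixels switch class because their parallax contradicts the spectral label, while the non-vegetation pixel is left unchanged.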

A Study on the Land Cover Classification and Cross Validation of AI-based Aerial Photograph

  • Lee, Seong-Hyeok;Myeong, Soojeong;Yoon, Donghyeon;Lee, Moung-Jin
    • Korean Journal of Remote Sensing / v.38 no.4 / pp.395-409 / 2022
  • The purpose of this study is to evaluate classification performance and applicability when land cover datasets constructed for AI training are cross-validated on other areas. Gyeongsang-do and Jeolla-do in South Korea were selected as the cross-validation areas, and training datasets were obtained from AI-Hub. The datasets were used to train U-Net, a semantic segmentation algorithm, for each region, and accuracy was evaluated by applying the trained models to test areas in the same region and in the other region. Overall classification accuracy differed by about 13-15% between the same and other areas. For rice fields, fields, and buildings, accuracy was higher in the Jeolla-do test areas. For roads, accuracy was higher in the Gyeongsang-do test areas. In terms of the difference in accuracy by weights, applying the Gyeongsang-do weights gave high accuracy for forests, while applying the Jeolla-do weights gave high accuracy for dry fields. The land cover classification results show that the classification performance of existing datasets differs by area. When constructing land cover maps for AI training, higher-quality datasets can be expected if the characteristics of various areas are reflected. This study is highly scalable from two perspectives. First, it applies satellite images to AI research and to the field of land cover. Second, because it builds on satellite images, it can be extended to large-scale areas and to areas that are difficult to access.

Development of Personal-Credit Evaluation System Using Real-Time Neural Learning Mechanism

  • Park, Jong U.;Park, Hong Y.;Yoon Chung
    • The Journal of Information Technology and Database / v.2 no.2 / pp.71-85 / 1995
  • Many neural network researchers have claimed that the classification accuracy of neural networks is superior to, or at least equal to, that of conventional methods. However, a series of neural network classifications showed that classification accuracy depends strongly on the characteristics of the training data set. Although many research reports note that the classification accuracy of neural networks can differ depending on the composition and architecture of the networks, the training algorithm, and the test data set, very few studies have addressed classification accuracy when the basic assumption of data monotonicity is violated. This research describes a development project for an automated credit evaluation system. The finding was that the arrangement of training data, so as to maintain the monotonicity of the data set, is critical to the successful implementation of neural training and to enhancing the classification accuracy of neural networks.

Investigating the Regression Analysis Results for Classification in Test Case Prioritization: A Replicated Study

  • Hasnain, Muhammad;Ghani, Imran;Pasha, Muhammad Fermi;Malik, Ishrat Hayat;Malik, Shahzad
    • International Journal of Internet, Broadcasting and Communication / v.11 no.2 / pp.1-10 / 2019
  • Research on the classification of software modules has been done to validate approaches proposed for addressing limitations in existing classification approaches. The objective of this study was to replicate the experiments of a recently published research study and re-evaluate its results. The reason for repeating the experiments and re-evaluating the results was to verify the approach used in the original study to identify faulty and non-faulty modules for the prioritization of test cases. The results showed that binary logistic regression analysis remains helpful to researchers for prediction, as it provides an overall prediction accuracy as a percentage. Our study shows a prediction accuracy of 92.9% for the PureMVC Java open source program, while the original study showed an 82% prediction accuracy for the same Java program classes. The authors believe that future research can build on these results to refine the criteria used to classify classes of web systems written in various programming languages.

Feature Selection and Hyper-Parameter Tuning for Optimizing Decision Tree Algorithm on Heart Disease Classification

  • Tsehay Admassu Assegie;Sushma S.J;Bhavya B.G;Padmashree S
    • International Journal of Computer Science & Network Security / v.24 no.2 / pp.150-154 / 2024
  • In recent years, there has been extensive research on applying machine learning to automation and decision support for medical experts during disease detection. However, the performance of machine learning still needs improvement so that models produce more accurate and reliable results for disease detection. Selecting the hyper-parameters that yield the highest possible classification accuracy on a medical dataset is the most challenging task in developing machine learning decision support systems for medical dataset classification. Selecting the features that best characterize a disease is another challenge in developing a machine learning model with better classification accuracy. In this study, we propose an optimized decision tree model for heart disease classification using a heart disease dataset collected from the Kaggle data repository. The proposed model is evaluated, and the experimental test reveals that the performance of the decision tree improves when an optimal number of features is used for training. Overall, the accuracy of the proposed decision tree model is 98.2% for heart disease classification.

Classification of Forest Type Using High Resolution Imagery of Satellite IKONOS (고해상도 IKONOS 위성영상을 이용한 임상분류)

  • 정기현;이우균;이준학;김권혁;이승호
    • Korean Journal of Remote Sensing / v.17 no.3 / pp.275-284 / 2001
  • This study was carried out to evaluate high resolution IKONOS satellite imagery for classifying land cover, especially forest type. The IKONOS imagery, covering 11 km × 11 km, was taken on April 24, 2000 in Bong-pyoung Myun, Pyungchang-Gun, Kangwon Province. The land cover classes were water, coniferous evergreen, Larix leptolepis, broad-leaved tree, bare land, farm land, grassland, sandy soil and asphalted area. Supervised classification with the maximum likelihood algorithm was applied. A terrestrial survey was also carried out to collect reference data in this area. The accuracy of the classification was analyzed for the test area through the error matrix, in terms of overall accuracy, producer's accuracy, user's accuracy and the kappa coefficient (k). In the accuracy analysis of the test area, overall accuracy was 94.3%, producer's accuracy was 77.0-99.9%, user's accuracy was 71.9-100% and k was 0.93. The classes of bare land, sandy soil and farm land were less clearly separated than other classes, whereas the IKONOS classification result in forest areas showed higher performance than that of satellite data of other resolutions (5-30 m).
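
The accuracy measures named in the abstract above all derive from the error matrix. The sketch below computes them using the standard definitions, assuming rows hold the classified (map) labels and columns hold the reference labels, and that every class appears at least once in both. The matrix values are made up for illustration and are not the paper's data.

```python
def accuracy_metrics(matrix):
    """Compute overall, producer's and user's accuracy and kappa from an error matrix.

    matrix[i][j]: number of samples classified as class i whose reference class is j.
    """
    n = len(matrix)
    total = sum(sum(row) for row in matrix)
    row_totals = [sum(row) for row in matrix]                            # classified totals
    col_totals = [sum(matrix[i][j] for i in range(n)) for j in range(n)]  # reference totals
    diagonal = [matrix[i][i] for i in range(n)]

    overall = sum(diagonal) / total
    producers = [diagonal[j] / col_totals[j] for j in range(n)]  # 1 - omission error
    users = [diagonal[i] / row_totals[i] for i in range(n)]      # 1 - commission error

    # Chance agreement expected from the marginal totals alone.
    expected = sum(r * c for r, c in zip(row_totals, col_totals)) / total ** 2
    kappa = (overall - expected) / (1 - expected)
    return overall, producers, users, kappa


# Hypothetical two-class error matrix (100 samples).
overall, producers, users, kappa = accuracy_metrics([[50, 5], [10, 35]])
```

For this toy matrix the overall accuracy is 0.85 while kappa is about 0.69, which shows how kappa discounts the agreement that would be expected by chance.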

Classification in Different Genera by Cytochrome Oxidase Subunit I Gene Using CNN-LSTM Hybrid Model

  • Meijing Li;Dongkeun Kim
    • Journal of information and communication convergence engineering / v.21 no.2 / pp.159-166 / 2023
  • The COI gene is a sequence of approximately 650 bp at the 5' terminal of the mitochondrial Cytochrome c Oxidase subunit I (COI) gene. As an effective DeoxyriboNucleic Acid (DNA) barcode, it is widely used for the taxonomic identification and evolutionary analysis of species. We created a CNN-LSTM hybrid model by combining the gene features partially extracted by the Long Short-Term Memory (LSTM) network with the feature maps obtained by the CNN. Compared to K-Means Clustering, Support Vector Machines (SVM), and a single CNN classification model, after training on 278 samples in a training set that included 15 genera from two orders, the CNN-LSTM hybrid model achieved 94% accuracy on the test set, which contained 118 samples. We augmented the training set samples and four genera into four orders, and the classification accuracy of the test set reached 100%. This study also proposes calculating the cosine similarity between the training and test sets to initially assess the reliability of the predicted results and discover new species.
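
The cosine-similarity check mentioned at the end of the abstract above might look something like the following sketch. It assumes each sequence has already been turned into a numeric feature vector, and it flags a test sample as a possible new species when its best similarity to any training vector falls below a cut-off. The feature vectors, the 0.8 cut-off, and the function names are hypothetical; the abstract does not specify them.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length numeric vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat an all-zero vector as dissimilar to everything
    return dot / (norm_a * norm_b)


def max_similarity(test_vector, train_vectors):
    """Highest similarity between one test vector and any training vector."""
    return max(cosine_similarity(test_vector, t) for t in train_vectors)


def is_possibly_new(test_vector, train_vectors, cutoff=0.8):
    """Assumed rule: low similarity to all training data suggests an unseen taxon."""
    return max_similarity(test_vector, train_vectors) < cutoff


# Hypothetical feature vectors for two training samples and two test samples.
train = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
familiar = [2.0, 0.0, 0.0]    # same direction as the first training vector
unfamiliar = [0.0, 0.0, 3.0]  # orthogonal to every training vector
flags = [is_possibly_new(v, train) for v in (familiar, unfamiliar)]
```

A prediction for a sample flagged this way would be treated with caution, since the classifier can only output genera it saw during training.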

A Rule-based Urban Image Classification System for Time Series Landsat Data

  • Lee, Jin-A;Lee, Sung-Soon;Chi, Kwang-Hoon
    • Korean Journal of Remote Sensing / v.27 no.6 / pp.637-651 / 2011
  • This study presents a rule-based urban image classification method for time series analysis of changes in the vicinity of Asan-si and Cheonan-si in Chungcheongnam-do, using Landsat satellite images (1991-2006). The area has been highly developed through the relocation of industrial facilities, land development, construction of a high-speed railroad, and an extension of the subway. To determine the yearly changing pattern of the urban area, eleven classes were defined according to the trend of development. The rules were generalized into an algorithm that can be applied as an unsupervised classification, without the need for training areas. The analysis results show that the urban zone of the research area has increased by about 1.53 times, and each correlation graph confirmed the distribution of the Built Up Index (BUI) values for each class. To evaluate the rule-based classification, coverage and accuracy were assessed. With the optimal allowable factor set to 0.36, the coverage of the rule was 98.4%, and in the test using ground data from 1991 to 2006, overall accuracy was 99.49%. The proposed method for determining the maximum allowable factor was confirmed to correlate with the accuracy test results obtained using ground data. Among the multiple images, the available data were used as fully as possible, and classification accuracy could be improved because the classification could be optimized to suit the objectives. The rule-based urban image classification method is expected to be applied to time series image analyses such as thematic mapping for urban development and monitoring of environmental changes.