• Title/Summary/Keyword: Supervised Classification

Search Result 405, Processing Time 0.03 seconds

Damage classification of concrete structures based on grey level co-occurrence matrix using Haar's discrete wavelet transform

  • Kabir, Shahid;Rivard, Patrice
    • Computers and Concrete
    • /
    • v.4 no.3
    • /
    • pp.243-257
    • /
    • 2007
  • A novel method for recognition, characterization, and quantification of deterioration in bridge components and laboratory concrete samples is presented in this paper. The proposed scheme is based on grey level co-occurrence matrix texture analysis using Haar's discrete wavelet transform on concrete imagery. Each image is described by a subset of band-filtered images containing wavelet coefficients, and then reconstructed images are employed in characterizing the texture, using grey level co-occurrence matrices, of the different types and degrees of damage: map-cracking, spalling and steel corrosion. A comparative study was conducted to evaluate the efficiency of the supervised maximum likelihood and unsupervised K-means classification techniques, in order to classify and quantify the deterioration and its extent. Experimental results show both methods are relatively effective in characterizing and quantifying damage; however, the supervised technique produced more accurate results, with overall classification accuracies ranging from 76.8% to 79.1%.

A Study on Big-5 based Personality Analysis through Analysis and Comparison of Machine Learning Algorithm (머신러닝 알고리즘 분석 및 비교를 통한 Big-5 기반 성격 분석 연구)

  • Kim, Yong-Jun
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.19 no.4
    • /
    • pp.169-174
    • /
    • 2019
  • In this study, I use surveillance data collection and data mining, clustered by clustering method, and use supervised learning to judge similarity. I aim to use feature extraction algorithms and supervised learning to analyze the suitability of the correlations of personality. After conducting the questionnaire survey, the researchers refine the collected data based on the questionnaire, classify the data sets through the clustering techniques of WEKA, an open source data mining tool, and judge similarity using supervised learning. I then use feature extraction algorithms and supervised learning to determine the suitability of the results for personality. As a result, it was found that the highest degree of similarity classification was obtained by EM classification and supervised learning by Naïve Bayes. The results of feature classification and supervised learning were found to be useful for judging fitness. I found that the accuracy of each Big-5 personality was changed according to the addition and deletion of the items, and analyzed the differences for each personality.

Accuracy Evaluation of Supervised Classification by Using Morphological Attribute Profiles and Additional Band of Hyperspectral Imagery (초분광 영상의 Morphological Attribute Profiles와 추가 밴드를 이용한 감독분류의 정확도 평가)

  • Park, Hong Lyun;Choi, Jae Wan
    • Journal of Korean Society for Geospatial Information Science
    • /
    • v.25 no.1
    • /
    • pp.9-17
    • /
    • 2017
  • Hyperspectral imagery is used in the land cover classification with the principle component analysis and minimum noise fraction to reduce the data dimensionality and noise. Recently, studies on the supervised classification using various features having spectral information and spatial characteristic have been carried out. In this study, principle component bands and normalized difference vegetation index(NDVI) was utilized in the supervised classification for the land cover classification. To utilize additional information not included in the principle component bands by the hyperspectral imagery, we tried to increase the classification accuracy by using the NDVI. In addition, the extended attribute profiles(EAP) generated using the morphological filter was used as the input data. The random forest algorithm, which is one of the representative supervised classification, was used. The classification accuracy according to the application of various features based on EAP was compared. Two areas was selected in the experiments, and the quantitative evaluation was performed by using reference data. The classification accuracy of the proposed algorithm showed the highest classification accuracy of 85.72% and 91.14% compared with existing algorithms. Further research will need to develop a supervised classification algorithm and additional input datasets to improve the accuracy of land cover classification using hyperspectral imagery.

Object Classification based on Weakly Supervised E2LSH and Saliency map Weighting

  • Zhao, Yongwei;Li, Bicheng;Liu, Xin;Ke, Shengcai
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.10 no.1
    • /
    • pp.364-380
    • /
    • 2016
  • The most popular approach in object classification is based on the bag of visual-words model, which has several fundamental problems that restricting the performance of this method, such as low time efficiency, the synonym and polysemy of visual words, and the lack of spatial information between visual words. In view of this, an object classification based on weakly supervised E2LSH and saliency map weighting is proposed. Firstly, E2LSH (Exact Euclidean Locality Sensitive Hashing) is employed to generate a group of weakly randomized visual dictionary by clustering SIFT features of the training dataset, and the selecting process of hash functions is effectively supervised inspired by the random forest ideas to reduce the randomcity of E2LSH. Secondly, graph-based visual saliency (GBVS) algorithm is applied to detect the saliency map of different images and weight the visual words according to the saliency prior. Finally, saliency map weighted visual language model is carried out to accomplish object classification. Experimental results datasets of Pascal 2007 and Caltech-256 indicate that the distinguishability of objects is effectively improved and our method is superior to the state-of-the-art object classification methods.

Accuracy Assessment of Supervised Classification using Training Samples Acquired by a Field Spectroradiometer: A Case Study for Kumnam-myun, Sejong City (지상 분광반사자료를 훈련샘플로 이용한 감독분류의 정확도 평가: 세종시 금남면을 사례로)

  • Shin, Jung Il;Kim, Ik Jae;Kim, Dong Wook
    • Journal of Korean Society for Geospatial Information Science
    • /
    • v.24 no.1
    • /
    • pp.121-128
    • /
    • 2016
  • Many studies are focused on image data and classifier for comparison or improvement of classification accuracy. Therefore studies are needed aspect of the training samples on supervised classification which depend on reference data or skill of analyst. This study tries to assess usability of field spectra as training samples on supervised classification. Classification accuracies of hyperspectral and multispectral images were assessed using training samples from image itself and field spectra, respectively. The results shown about 90% accuracy with training sample collected from image. Using field spectra as training sample, accuracy was decreased 10%p for hyperspectral image, and 20%p for multispectral image. Especially, some classes shown very low accuracies due to similar spectral characteristics on multispectral image. Therefore, field spectra might be used as training samples on classification of hyperspectral image, although it has limitation for multispectral image.

Applicability of Supervised Classification for Subdividing Forested Areas Using SPOT-5 and KOMPSAT-2 Data (산림지역 분류를 위한 SPOT-5 및 KOMPSAT-2 영상의 감독분류 적용성)

  • Choi, Jaeyong;Lee, Sanghyuk;Lee, Sol Ae;Ji, Seung Yong;Lee, Peter Sang-Hoon
    • Journal of the Korean Society of Environmental Restoration Technology
    • /
    • v.18 no.2
    • /
    • pp.89-104
    • /
    • 2015
  • In order to effectively manage forested areas in South Korea on a national scale, using remotely sensed data is considered most suitable. In this study, utilizing Land coverage maps and Forest type maps of national geographic information instead of collecting field data was tested for conducting supervised classification on SPOT-5 and KOMPSAT-2 imagery focusing on forested areas. Supervised classification were conducted in two ways: analysing a whole area around the study site and/or only forested areas around the study site, using Support Vector Machine. The overall accuracy for the classification on the whole area ranged from 54.9% to 68.9% with kappa coefficients of over 0.4, which meant the supervised classification was in general considered moderate because of sub-classifying forested areas into three categories (i.e. hardwood, conifer, mixed forests). Compared to this, the overall accuracy for forested areas were better for sub-classification of forested areas probably due to less distraction in the classification. To further improve the overall accuracy, it is needed to gain individual imagery rather than mosaic imagery to use more spetral bands and select more suitable conditions such as seasonal timing. It is also necessary to obtain precise and accurate training data for sub-classifying forested areas. This new approach can be considered as a basis of developing an excellent analysis manner for understanding and managing forest landscape.

Issues and Empirical Results for Improving Text Classification

  • Ko, Young-Joong;Seo, Jung-Yun
    • Journal of Computing Science and Engineering
    • /
    • v.5 no.2
    • /
    • pp.150-160
    • /
    • 2011
  • Automatic text classification has a long history and many studies have been conducted in this field. In particular, many machine learning algorithms and information retrieval techniques have been applied to text classification tasks. Even though much technical progress has been made in text classification, there is still room for improvement in text classification. In this paper, we will discuss remaining issues in improving text classification. In this paper, three improvement issues are presented including automatic training data generation, noisy data treatment and term weighting and indexing, and four actual studies and their empirical results for those issues are introduced. First, the semi-supervised learning technique is applied to text classification to efficiently create training data. For effective noisy data treatment, a noisy data reduction method and a robust text classifier from noisy data are developed as a solution. Finally, the term weighting and indexing technique is revised by reflecting the importance of sentences into term weight calculation using summarization techniques.

A Clustering-based Semi-Supervised Learning through Initial Prediction of Unlabeled Data (미분류 데이터의 초기예측을 통한 군집기반의 부분지도 학습방법)

  • Kim, Eung-Ku;Jun, Chi-Hyuck
    • Journal of the Korean Operations Research and Management Science Society
    • /
    • v.33 no.3
    • /
    • pp.93-105
    • /
    • 2008
  • Semi-supervised learning uses a small amount of labeled data to predict labels of unlabeled data as well as to improve clustering performance, whereas unsupervised learning analyzes only unlabeled data for clustering purpose. We propose a new clustering-based semi-supervised learning method by reflecting the initial predicted labels of unlabeled data on the objective function. The initial prediction should be done in terms of a discrete probability distribution through a classification method using labeled data. As a result, clusters are formed and labels of unlabeled data are predicted according to the Information of labeled data in the same cluster. We evaluate and compare the performance of the proposed method in terms of classification errors through numerical experiments with blinded labeled data.

A Hybrid Selection Method of Helpful Unlabeled Data Applicable for Semi-Supervised Learning Algorithm

  • Le, Thanh-Binh;Kim, Sang-Woon
    • IEIE Transactions on Smart Processing and Computing
    • /
    • v.3 no.4
    • /
    • pp.234-239
    • /
    • 2014
  • This paper presents an empirical study on selecting a small amount of useful unlabeled data to improve the classification accuracy of semi-supervised learning algorithms. In particular, a hybrid method of unifying the simply recycled selection method and the incrementally-reinforced selection method was considered and evaluated empirically. The experimental results, which were obtained from well-known benchmark data sets using semi-supervised support vector machines, demonstrated that the hybrid method works better than the traditional ones in terms of the classification accuracy.

Comparative Analysis of Supervised and Phenology-Based Approaches for Crop Mapping: A Case Study in South Korea

  • Ehsan Rahimi;Chuleui Jung
    • Korean Journal of Remote Sensing
    • /
    • v.40 no.2
    • /
    • pp.179-190
    • /
    • 2024
  • This study aims to compare supervised classification methods with phenology-based approaches, specifically pixel-based and segment-based methods, for accurate crop mapping in agricultural landscapes. We utilized Sentinel-2A imagery, which provides multispectral data for accurate crop mapping. 31 normalized difference vegetation index (NDVI) images were calculated from the Sentinel-2A data. Next, we employed phenology-based approaches to extract valuable information from the NDVI time series. A set of 10 phenology metrics was extracted from the NDVI data. For the supervised classification, we employed the maximum likelihood (MaxLike) algorithm. For the phenology-based approaches, we implemented both pixel-based and segment-based methods. The results indicate that phenology-based approaches outperformed the MaxLike algorithm in regions with frequent rainfall and cloudy conditions. The segment-based phenology approach demonstrated the highest kappa coefficient of 0.85, indicating a high level of agreement with the ground truth data. The pixel-based phenology approach also achieved a commendable kappa coefficient of 0.81, indicating its effectiveness in accurately classifying the crop types. On the other hand, the supervised classification method (MaxLike) yielded a lower kappa coefficient of 0.74. Our study suggests that segment-based phenology mapping is a suitable approach for regions like South Korea, where continuous cloud-free satellite images are scarce. However, establishing precise classification thresholds remains challenging due to the lack of adequately sampled NDVI data. Despite this limitation, the phenology-based approach demonstrates its potential in crop classification, particularly in regions with varying weather patterns.