• Title/Summary/Keyword: Data classification

Search Result 7,933, Processing Time 0.032 seconds

Land Cover Classification Using Lidar and Optical Image (라이다와 광학영상을 이용한 토지피복분류)

  • Cho Woo-Sug;Chang Hwi-Jung;Kim Yu-Seok
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.24 no.1
    • /
    • pp.139-145
    • /
    • 2006
  • The advantage of the lidar data is in fast acquisition and process time as well as in high accuracy and high point density. However lidar data itself is difficult to classify the earth surface because lidar data is in the form of irregularly distributed point clouds. In this study, we investigated land cover classification using both lidar data and optical image through a supervised classification method. Firstly, we generated 1m grid DSM and DEM image and then nDSM was produced by using DSM and DEM. In addition, we had made intensity image using the intensity value of lidar data. As for optical images, the red, blue, green band of CCD image are used. Moreover, a NDVI image using a red band of the CCD image and infrared band of IKONOS image is generated. The experimental results showed that land cover classification with lidar data and optical image together could reach to the accuracy of 74.0%. To improve classification accuracy, we further performed re-classification of shadow area and water body as well as forest and building area. The final classification accuracy was 81.8%.

Multi-temporal Remote-Sensing Imag e ClassificationUsing Artificial Neural Networks (인공신경망 이론을 이용한 위성영상의 카테고리분류)

  • Kang, Moon-Seong;Park, Seung-Woo;Lim, Jae-Chon
    • Proceedings of the Korean Society of Agricultural Engineers Conference
    • /
    • 2001.10a
    • /
    • pp.59-64
    • /
    • 2001
  • The objectives of the thesis are to propose a pattern classification method for remote sensing data using artificial neural network. First, we apply the error back propagation algorithm to classify the remote sensing data. In this case, the classification performance depends on a training data set. Using the training data set and the error back propagation algorithm, a layered neural network is trained such that the training pattern are classified with a specified accuracy. After training the neural network, some pixels are deleted from the original training data set if they are incorrectly classified and a new training data set is built up. Once training is complete, a testing data set is classified by using the trained neural network. The classification results of Landsat TM data show that this approach produces excellent results which are more realistic and noiseless compared with a conventional Bayesian method.

  • PDF

Cross platform classification of microarrays by rank comparison

  • Lee, Sunho
    • Journal of the Korean Data and Information Science Society
    • /
    • v.26 no.2
    • /
    • pp.475-486
    • /
    • 2015
  • Mining the microarray data accumulated in the public data repositories can save experimental cost and time and provide valuable biomedical information. Big data analysis pooling multiple data sets increases statistical power, improves the reliability of the results, and reduces the specific bias of the individual study. However, integrating several data sets from different studies is needed to deal with many problems. In this study, I limited the focus to the cross platform classification that the platform of a testing sample is different from the platform of a training set, and suggested a simple classification method based on rank. This method is compared with the diagonal linear discriminant analysis, k nearest neighbor method and support vector machine using the cross platform real example data sets of two cancers.

Big Numeric Data Classification Using Grid-based Bayesian Inference in the MapReduce Framework

  • Kim, Young Joon;Lee, Keon Myung
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.14 no.4
    • /
    • pp.313-321
    • /
    • 2014
  • In the current era of data-intensive services, the handling of big data is a crucial issue that affects almost every discipline and industry. In this study, we propose a classification method for large volumes of numeric data, which is implemented in a distributed programming framework, i.e., MapReduce. The proposed method partitions the data space into a grid structure and it then models the probability distributions of classes for grid cells by collecting sufficient statistics using distributed MapReduce tasks. The class labeling of new data is achieved by k-nearest neighbor classification based on Bayesian inference.

Improving the Subject Independent Classification of Implicit Intention By Generating Additional Training Data with PCA and ICA

  • Oh, Sang-Hoon
    • International Journal of Contents
    • /
    • v.14 no.4
    • /
    • pp.24-29
    • /
    • 2018
  • EEG-based brain-computer interfaces has focused on explicitly expressed intentions to assist physically impaired patients. For EEG-based-computer interfaces to function effectively, it should be able to understand users' implicit information. Since it is hard to gather EEG signals of human brains, we do not have enough training data which are essential for proper classification performance of implicit intention. In this paper, we improve the subject independent classification of implicit intention through the generation of additional training data. In the first stage, we perform the PCA (principal component analysis) of training data in a bid to remove redundant components in the components within the input data. After the dimension reduction by PCA, we train ICA (independent component analysis) network whose outputs are statistically independent. We can get additional training data by adding Gaussian noises to ICA outputs and projecting them to input data domain. Through simulations with EEG data provided by CNSL, KAIST, we improve the classification performance from 65.05% to 66.69% with Gamma components. The proposed sample generation method can be applied to any machine learning problem with fewer samples.

A Implementation of Optimal Multiple Classification System using Data Mining for Genome Analysis

  • Jeong, Yu-Jeong;Choi, Gwang-Mi
    • Journal of the Korea Society of Computer and Information
    • /
    • v.23 no.12
    • /
    • pp.43-48
    • /
    • 2018
  • In this paper, more efficient classification result could be obtained by applying the combination of the Hidden Markov Model and SVM Model to HMSV algorithm gene expression data which simulated the stochastic flow of gene data and clustering it. In this paper, we verified the HMSV algorithm that combines independently learned algorithms. To prove that this paper is superior to other papers, we tested the sensitivity and specificity of the most commonly used classification criteria. As a result, the K-means is 71% and the SOM is 68%. The proposed HMSV algorithm is 85%. These results are stable and high. It can be seen that this is better classified than using a general classification algorithm. The algorithm proposed in this paper is a stochastic modeling of the generation process of the characteristics included in the signal, and a good recognition rate can be obtained with a small amount of calculation, so it will be useful to study the relationship with diseases by showing fast and effective performance improvement with an algorithm that clusters nodes by simulating the stochastic flow of Gene Data through data mining of BigData.

Automatic Extraction of Training Data Based on Semi-supervised Learning for Time-series Land-cover Mapping (시계열 토지피복도 제작을 위한 준감독학습 기반의 훈련자료 자동 추출)

  • Kwak, Geun-Ho;Park, No-Wook
    • Korean Journal of Remote Sensing
    • /
    • v.38 no.5_1
    • /
    • pp.461-469
    • /
    • 2022
  • This paper presents a novel training data extraction approach using semi-supervised learning (SSL)-based classification without the analyst intervention for time-series land-cover mapping. The SSL-based approach first performs initial classification using initial training data obtained from past images including land-cover characteristics similar to the image to be classified. Reliable training data from the initial classification result are then extracted from SSL-based iterative classification using classification uncertainty information and class labels of neighboring pixels as constraints. The potential of the SSL-based training data extraction approach was evaluated from a classification experiment using unmanned aerial vehicle images in croplands. The use of new training data automatically extracted by the proposed SSL approach could significantly alleviate the misclassification in the initial classification result. In particular, isolated pixels were substantially reduced by considering spatial contextual information from adjacent pixels. Consequently, the classification accuracy of the proposed approach was similar to that of classification using manually extracted training data. These results indicate that the SSL-based iterative classification presented in this study could be effectively applied to automatically extract reliable training data for time-series land-cover mapping.

Sparse kernel classication using IRWLS procedure

  • Kim, Dae-Hak
    • Journal of the Korean Data and Information Science Society
    • /
    • v.20 no.4
    • /
    • pp.749-755
    • /
    • 2009
  • Support vector classification (SVC) provides more complete description of the lin-ear and nonlinear relationships between input vectors and classifiers. In this paper. we propose the sparse kernel classifier to solve the optimization problem of classification with a modified hinge loss function and absolute loss function, which provides the efficient computation and the sparsity. We also introduce the generalized cross validation function to select the hyper-parameters which affects the classification performance of the proposed method. Experimental results are then presented which illustrate the performance of the proposed procedure for classification.

  • PDF

Objective Cloud Type Classification of Meteorological Satellite Data Using Linear Discriminant Analysis (선형판별법에 의한 GMS 영상의 객관적 운형분류)

  • 서애숙;김금란
    • Korean Journal of Remote Sensing
    • /
    • v.6 no.1
    • /
    • pp.11-24
    • /
    • 1990
  • This is the study about the meteorological satellite cloud image classification by objective methods. For objective cloud classification, linear discriminant analysis was tried. In the linear discriminant analysis 27 cloud characteristic parameters were retrieved from GMS infrared image data. And, linear cloud classification model was developed from major parameters and cloud type coefficients. The model was applied to GMS IR image for weather forecasting operation and cloud image was classified into 5 types such as Sc, Cu, CiT, CiM and Cb. The classification results were reasonably compared with real image.

A study on classification accuracy improvements using orthogonal summation of posterior probabilities (사후확률 결합에 의한 분류정확도 향상에 관한 연구)

  • 정재준
    • Spatial Information Research
    • /
    • v.12 no.1
    • /
    • pp.111-125
    • /
    • 2004
  • Improvements of classification accuracy are main issues in satellite image classification. Considering the facts that multiple images in the same area are available, there are needs on researches aiming improvements of classification accuracy using multiple data sets. In this study, orthogonal summation method of Dempster-Shafer theory (theory of evidence) is proposed as a multiple imagery classification method and posterior probabilities and classification uncertainty are used in calculation process. Accuracies of the proposed method are higher than conventional classification methods, maximum likelihood classification(MLC) of each data and MLC of merged data sets, which can be certified through statistical tests of mean difference.

  • PDF