• Title/Summary/Keyword: Data classification

Search Result 7,933, Processing Time 0.035 seconds

Comparison of Hyperspectral and Multispectral Sensor Data for Land Use Classification

  • Kim, Dae-Sung;Han, Dong-Yeob;Yun, Ki;Kim, Yong-Il
    • Proceedings of the KSRS Conference
    • /
    • 2002.10a
    • /
    • pp.388-393
    • /
    • 2002
  • Remote sensing data is collected and analyzed to enhance understanding of the terrestrial surface. Since Landsat satellite was launched in 1972, many researches using multispectral data has been achieved. Recently, with the availability of airborne and satellite hyperspectral data, the study on hyperspectral data are being increased. It is known that as the number of spectral bands of high-spectral resolution data increases, the ability to detect more detailed cases should also increase, and the classification accuracy should increase as well. In this paper, we classified the hyperspectral and multispectral data and tested the classification accuracy. The MASTER(MODIS/ASTER Airborne Simulator, 50channels, 0.4~13$\mu$m) and Landsat TM(7channels) imagery including Yeong-Gwang area were used and we adjusted the classification items in several cases and tested their classification accuracy through statistical comparison. As a result of this study, it is shown that hyperspectral data offer more information than multispectral data.

  • PDF

A Preliminary Study on Clinical Decision Support System based on Classification Learning of Electronic Medical Records

  • Shin, Yang-Kyu
    • Journal of the Korean Data and Information Science Society
    • /
    • v.14 no.4
    • /
    • pp.817-824
    • /
    • 2003
  • We employed a hierarchical document classification method to classify a massive collection of electronic medical records(EMR) written in both Korean and English. Our experimental system has been learned from 5,000 records of EMR text data and predicted a newly given set of EMR text data over 68% correctly. We expect the accuracy rate can be improved greatly provided a dictionary of medical terms or a suitable medical thesaurus. The classification system might play a key role in some clinical decision support systems and various interpretation systems for clinical data.

  • PDF

Classification of Multi Spectral Image Data using Rough Sets (러프 집합을 이용한 다중 분광 이미지 데이터의 분류)

  • 원성현;이병성;정환묵
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 1997.11a
    • /
    • pp.205-208
    • /
    • 1997
  • Traditionally, classification of remote sensed image data is one of the important works for image data analysis procedure. So, many researchers devote their endeavor to increasing accuracy of analysis, also, many classification algorithms have been proposed. In this paper, we propose new classification method for remote sensed image data that use rough set theory. Using indiscernibility relation of rough sets, we show that can classify image data very easily.

  • PDF

Vegetation Classification Using Seasonal Variation MODIS Data

  • Choi, Hyun-Ah;Lee, Woo-Kyun;Son, Yo-Whan;Kojima, Toshiharu;Muraoka, Hiroyuki
    • Korean Journal of Remote Sensing
    • /
    • v.26 no.6
    • /
    • pp.665-673
    • /
    • 2010
  • The role of remote sensing in phenological studies is increasingly regarded as a key in understanding large area seasonal phenomena. This paper describes the application of Moderate Resolution Imaging Spectroradiometer (MODIS) time series data for vegetation classification using seasonal variation patterns. The vegetation seasonal variation phase of Seoul and provinces in Korea was inferred using 8 day composite MODIS NDVI (Normalized Difference Vegetation Index) dataset of 2006. The seasonal vegetation classification approach is performed with reclassification of 4 categories as urban, crop land, broad-leaf and needle-leaf forest area. The BISE (Best Index Slope Extraction) filtering algorithm was applied for a smoothing processing of MODIS NDVI time series data and fuzzy classification method was used for vegetation classification. The overall accuracy of classification was 77.5% and the kappa coefficient was 0.61%, thus suggesting overall high classification accuracy.

Movie Popularity Classification Based on Support Vector Machine Combined with Social Network Analysis

  • Dorjmaa, Tserendulam;Shin, Taeksoo
    • Journal of Information Technology Services
    • /
    • v.16 no.3
    • /
    • pp.167-183
    • /
    • 2017
  • The rapid growth of information technology and mobile service platforms, i.e., internet, google, and facebook, etc. has led the abundance of data. Due to this environment, the world is now facing a revolution in the process that data is searched, collected, stored, and shared. Abundance of data gives us several opportunities to knowledge discovery and data mining techniques. In recent years, data mining methods as a solution to discovery and extraction of available knowledge in database has been more popular in e-commerce service fields such as, in particular, movie recommendation. However, most of the classification approaches for predicting the movie popularity have used only several types of information of the movie such as actor, director, rating score, language and countries etc. In this study, we propose a classification-based support vector machine (SVM) model for predicting the movie popularity based on movie's genre data and social network data. Social network analysis (SNA) is used for improving the classification accuracy. This study builds the movies' network (one mode network) based on initial data which is a two mode network as user-to-movie network. For the proposed method we computed degree centrality, betweenness centrality, closeness centrality, and eigenvector centrality as centrality measures in movie's network. Those four centrality values and movies' genre data were used to classify the movie popularity in this study. The logistic regression, neural network, $na{\ddot{i}}ve$ Bayes classifier, and decision tree as benchmarking models for movie popularity classification were also used for comparison with the performance of our proposed model. To assess the classifier's performance accuracy this study used MovieLens data as an open database. Our empirical results indicate that our proposed model with movie's genre and centrality data has by approximately 0% higher accuracy than other classification models with only movie's genre data. The implications of our results show that our proposed model can be used for improving movie popularity classification accuracy.

Functional Data Classification of Variable Stars

  • Park, Minjeong;Kim, Donghoh;Cho, Sinsup;Oh, Hee-Seok
    • Communications for Statistical Applications and Methods
    • /
    • v.20 no.4
    • /
    • pp.271-281
    • /
    • 2013
  • This paper considers a problem of classification of variable stars based on functional data analysis. For a better understanding of galaxy structure and stellar evolution, various approaches for classification of variable stars have been studied. Several features that explain the characteristics of variable stars (such as color index, amplitude, period, and Fourier coefficients) were usually used to classify variable stars. Excluding other factors but focusing only on the curve shapes of variable stars, Deb and Singh (2009) proposed a classification procedure using multivariate principal component analysis. However, this approach is limited to accommodate some features of the light curve data that are unequally spaced in the phase domain and have some functional properties. In this paper, we propose a light curve estimation method that is suitable for functional data analysis, and provide a classification procedure for variable stars that combined the features of a light curve with existing functional data analysis methods. To evaluate its practical applicability, we apply the proposed classification procedure to the data sets of variable stars from the project STellar Astrophysics and Research on Exoplanets (STARE).

Development of the ISO 15926-based Classification Structure for Nuclear Plant Equipment (ISO 15926 국제 표준을 이용한 원자력 플랜트 기자재 분류체계)

  • Yun, J.;Mun, D.;Han, S.;Cho, K.
    • Korean Journal of Computational Design and Engineering
    • /
    • v.12 no.3
    • /
    • pp.191-199
    • /
    • 2007
  • In order to construct a data warehouse of process plant equipment, a classification structure should be defined first, identifying not only the equipment categories but also attributes of an each equipment to represent the specifications of equipment. ISO 15926 Process Plants is an international standard dealing with the life-cycle data of process plant facilities. From the viewpoints of defining classification structure, Part 2 data model and Reference Data Library (RDL) of ISO 15926 are seen to respectively provide standard syntactic structure and semantic vocabulary, facilitating the exchange and sharing of plant equipment's life-cycle data. Therefore, the equipment data warehouse with an ISO 15926-based classification structure has the advantage of easy integration among different engineering systems. This paper introduces ISO 15926 and then discusses how to define a classification structure with ISO 15926 Part 2 data model and RDL. Finally, we describe the development result of an ISO 15926-based classification structure for a variety of equipment consisting in the reactor coolant system (RCS) of APR 1400 nuclear plant.

A Comparative Study of Medical Data Classification Methods Based on Decision Tree and System Reconstruction Analysis

  • Tang, Tzung-I;Zheng, Gang;Huang, Yalou;Shu, Guangfu;Wang, Pengtao
    • Industrial Engineering and Management Systems
    • /
    • v.4 no.1
    • /
    • pp.102-108
    • /
    • 2005
  • This paper studies medical data classification methods, comparing decision tree and system reconstruction analysis as applied to heart disease medical data mining. The data we study is collected from patients with coronary heart disease. It has 1,723 records of 71 attributes each. We use the system-reconstruction method to weight it. We use decision tree algorithms, such as induction of decision trees (ID3), classification and regression tree (C4.5), classification and regression tree (CART), Chi-square automatic interaction detector (CHAID), and exhausted CHAID. We use the results to compare the correction rate, leaf number, and tree depth of different decision-tree algorithms. According to the experiments, we know that weighted data can improve the correction rate of coronary heart disease data but has little effect on the tree depth and leaf number.

Active Sonar Target/Nontarget Classification Using Real Sea-trial Data (실제 해상 실험 데이터를 이용한 능동소나 표적/비표적 식별)

  • Seok, J.W.
    • Journal of Korea Multimedia Society
    • /
    • v.20 no.10
    • /
    • pp.1637-1645
    • /
    • 2017
  • Target/Nontarget classification can be divided into the study of shape estimation of the target analysing reflected echo signal and of type classification of the target using acoustical features. In active sonar system, the feature vectors are extracted from the signal reflected from the target, and an classification algorithm is applied to determine whether the received signal is a target or not. However, received sonar signals can be distorted in the underwater environments, and the spatio-temporal characteristics of active sonar signals change according to the aspect of the target. In addition, it is very difficult to collect real sea-trial data for research. In this paper, target/non-target classification were performed using real sea-trial data. Feature vectors are extracted using MFCC(Mel-Frequency Cepstral Coefficients), filterbank energy in the Fourier spectrum and wavelet domain. For the performance verification, classification experiments were performed using backpropagation neural network classifiers.

Land cover classification using LiDAR intensity data and neural network

  • Minh, Nguyen Quang;Hien, La Phu
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.29 no.4
    • /
    • pp.429-438
    • /
    • 2011
  • LiDAR technology is a combination of laser ranging, satellite positioning technology and digital image technology for study and determination with high accuracy of the true earth surface features in 3 D. Laser scanning data is typically a points cloud on the ground, including coordinates, altitude and intensity of laser from the object on the ground to the sensor (Wehr & Lohr, 1999). Data from laser scanning can produce products such as digital elevation model (DEM), digital surface model (DSM) and the intensity data. In Vietnam, the LiDAR technology has been applied since 2005. However, the application of LiDAR in Vietnam is mostly for topological mapping and DEM establishment using point cloud 3D coordinate. In this study, another application of LiDAR data are present. The study use the intensity image combine with some other data sets (elevation data, Panchromatic image, RGB image) in Bacgiang City to perform land cover classification using neural network method. The results show that it is possible to obtain land cover classes from LiDAR data. However, the highest accurate classification can be obtained using LiDAR data with other data set and the neural network classification is more appropriate approach to conventional method such as maximum likelyhood classification.