• Title/Summary/Keyword: Data classification

Search Result 7,984, Processing Time 0.043 seconds

Pattern Classification of Load Demand for Distribution Transformer (배전용 변압기 부하사용 패턴분류)

  • Yun, Sang-Yun;Kim, Jae-Chul;Lee, Young-Suk
    • Proceedings of the KIEE Conference
    • /
    • 2001.05a
    • /
    • pp.89-91
    • /
    • 2001
  • This paper presents the result of pattern classification of load demand for distribution transformer in domestic. The field data of load demand is measured using the load acquisition device and the measurement data is used for the database system for load management of distribution transformed. For the pattern classification, the load data and the customer information data are also used. The K-MEAN method is used for the pattern classification algorithm. The result of pattern classification is used for the 2-step format of load demand curve.

  • PDF

Analysis on classification item and data display format of newspaper article database (기사데이터베이스의 분류항목과 데이터표시형식에 관한 비교분석)

  • 한상길
    • Journal of Korean Library and Information Science Society
    • /
    • v.23
    • /
    • pp.329-362
    • /
    • 1995
  • Newspaper Article Information is an important source of information on social phenomenon with historical value. The development of computer and technology of information communication enables the construction of Newspaper Article Database by CTS and service through computer communication. It made it possible for the peoples to utilize the Newspaper Article Information easily. However, it is very difficult to utilize the currently prevailing system. There are differences in classification system of Newspaper Article Database and the Data Display Format. This survey aims to review the characteristics of Newspaper Article Database and current domestic computer communication service system. By comparing the classification system of Retrieval Menu and Data Display Format, I intended to suggest the standardized way of utilization which enables the users utilize them more easily and conveniently. The results of this survey is as follows : 1. More sub-divided distinction of classification item is required. 2. Separate classification item should be established for the distinction of article form which is very difficult to classify the subject. 3. Data Display Format should be equi n.0, pped with standardized format and signal which enables the users recognize it easily.

  • PDF

Fuzzy Classification Method for Processing Incomplete Dataset

  • Woo, Young-Woon;Lee, Kwang-Eui;Han, Soo-Whan
    • Journal of information and communication convergence engineering
    • /
    • v.8 no.4
    • /
    • pp.383-386
    • /
    • 2010
  • Pattern classification is one of the most important topics for machine learning research fields. However incomplete data appear frequently in real world problems and also show low learning rate in classification models. There have been many researches for handling such incomplete data, but most of the researches are focusing on training stages. In this paper, we proposed two classification methods for incomplete data using triangular shaped fuzzy membership functions. In the proposed methods, missing data in incomplete feature vectors are inferred, learned and applied to the proposed classifier using triangular shaped fuzzy membership functions. In the experiment, we verified that the proposed methods show higher classification rate than a conventional method.

Classification of High Dimensionality Data through Feature Selection Using Markov Blanket

  • Lee, Junghye;Jun, Chi-Hyuck
    • Industrial Engineering and Management Systems
    • /
    • v.14 no.2
    • /
    • pp.210-219
    • /
    • 2015
  • A classification task requires an exponentially growing amount of computation time and number of observations as the variable dimensionality increases. Thus, reducing the dimensionality of the data is essential when the number of observations is limited. Often, dimensionality reduction or feature selection leads to better classification performance than using the whole number of features. In this paper, we study the possibility of utilizing the Markov blanket discovery algorithm as a new feature selection method. The Markov blanket of a target variable is the minimal variable set for explaining the target variable on the basis of conditional independence of all the variables to be connected in a Bayesian network. We apply several Markov blanket discovery algorithms to some high-dimensional categorical and continuous data sets, and compare their classification performance with other feature selection methods using well-known classifiers.

A Study on the Necessity for the Standardization of Information Classification System about Construction Products

  • Hong, Simhee;Yu, Jung-ho
    • International conference on construction engineering and project management
    • /
    • 2017.10a
    • /
    • pp.121-123
    • /
    • 2017
  • The widespread dissemination of the green building certification system has led to the ongoing development of information management technologies with the aim to effectively utilize construction product information. Among them, a data crawling technology enables to collect the data conveniently and to manage large volumes of construction product information in Korea and overseas. However, without a standardized classification system, it is difficult to efficiently utilize information, and problems such as an additional work for classifying information or information-sharing errors. Therefore, this study suggests to present a necessity for the standardization of the information classification system through expert interviews, and to compare construction product classification systems in Korea and overseas. This study is expected to present a necessity for the effective management of construction product information and the standardization of information-sharing with regard to various construction certifications.

  • PDF

Local Linear Logistic Classification of Microarray Data Using Orthogonal Components (직교요인을 이용한 국소선형 로지스틱 마이크로어레이 자료의 판별분석)

  • Baek, Jang-Sun;Son, Young-Sook
    • The Korean Journal of Applied Statistics
    • /
    • v.19 no.3
    • /
    • pp.587-598
    • /
    • 2006
  • The number of variables exceeds the number of samples in microarray data. We propose a nonparametric local linear logistic classification procedure using orthogonal components for classifying high-dimensional microarray data. The proposed method is based on the local likelihood and can be applied to multi-class classification. We applied the local linear logistic classification method using PCA, PLS, and factor analysis components as new features to Leukemia data and colon data, and compare the performance of the proposed method with the conventional statistical classification procedures. The proposed method outperforms the conventional ones for each component, and PLS has shown best performance when it is embedded in the proposed method among the three orthogonal components.

Bathymetric mapping in Dong-Sha Atoll using SPOT data

  • Huang, Shih-Jen;Wen, Yao-Chung
    • Proceedings of the KSRS Conference
    • /
    • v.2
    • /
    • pp.525-528
    • /
    • 2006
  • The remote sensing data can be used to calculate the water depth especially in the clear and shallow water area. In this study, the SPOT data was used for bathymetric mapping in Dong-Sha atoll, located in northern South China Sea. The in situ sea depth was collected by echo sounder as well. A global positioning system was employed to locate the accurate sampling points for sea depth. An empirical model between measurement sea depth and band digital count was determined and based on least squares regression analysis. Both non-classification and unsupervised classification were used in this study. The results show that the standard error is less than 0.9m for non-classification. Besides, the 10% error related to the measurement water depth can be satisfied for more than 85% in situ data points. Otherwise, the 10% relative error can reach more than 97%, 69%, and 51% data points at class 4, 5, and 6 respectively if supervised classification is applied. Meanwhile, we also find that the unsupervised classification can get more accuracy to estimate water depth with standard error less than 0.63, 0.93, and 0.68m at class 4, 5, and 6 respectively.

  • PDF

Comparison Studies of Classification Methods based on L1-Distance and L1-Data Depth (L1-거리와 L1-데이터뎁스를 이용한 분류방법의 비교연구)

  • Baek Soo-Jin;Hwang Jin-Soo;Kim Jean-Kyung
    • The Korean Journal of Applied Statistics
    • /
    • v.19 no.1
    • /
    • pp.183-193
    • /
    • 2006
  • We consider a new classification method(DnDclass) combining two classification rules based on $L_1$-distance(L1DISTclass) and $L_1$-data depth(L1DDclass). To investigate characteristics and to evaluate the performance of these classification methods, we use simulation data in various settings. Through this simulation study, we can confirm that the new method, DnDclass, performs relatively well in many cases.

Building a Classification Scheme of Soil and Groundwater Contamination Sources in Korea: 2. Construction of Classification System and Applications of Attribute Data (토양.지하수오염원 분류체계 구축방안: 2. 분류체계 구축 및 속성자료 활용방안)

  • An, Jeong-Yi;Shin, Kyung-Hee;Hwang, Sang-Il
    • Journal of Soil and Groundwater Environment
    • /
    • v.15 no.6
    • /
    • pp.122-127
    • /
    • 2010
  • Constructing the national inventory that can be used as a tool to identify and assess existing or potential contamination is necessary for efficiently managing the soil and groundwater contamination. In order to start this construction, the first step is how we define and classify potential contamination sources of soil and groundwater. After selecting the basic classification model of contamination sources from developed countries, we suggested the classification and list of the potential contamination sources of soil and groundwater which are appropriate for specific conditions of South Korea. In addition, we investigated several databases to confirm the existence of available data sources and then examined established attribute data through chemical accident response information system (CARIS) and water information system (WIS) in National Institute of Environmental Research and mine geographic information system (MGIS) in Mine Reclamation Corporation. All sorts of attribute data in the existing databases can be utilized as significant assessment factors for determining the management priority of potential contamination sources in the future. Therefore, it is required the expanded investigation of additional database sources and the continual modification so that the classification system of potential contamination sources can be improved.

Ensemble Modulation Pattern based Paddy Crop Assist for Atmospheric Data

  • Sampath Kumar, S.;Manjunatha Reddy, B.N.;Nataraju, M.
    • International Journal of Computer Science & Network Security
    • /
    • v.22 no.9
    • /
    • pp.403-413
    • /
    • 2022
  • Classification and analysis are improved factors for the realtime automation system. In the field of agriculture, the cultivation of different paddy crop depends on the atmosphere and the soil nature. We need to analyze the moisture level in the area to predict the type of paddy that can be cultivated. For this process, Ensemble Modulation Pattern system and Block Probability Neural Network based classification models are used to analyze the moisture and temperature of land area. The dataset consists of the collections of moisture and temperature at various data samples for a land. The Ensemble Modulation Pattern based feature analysis method, the extract of the moisture and temperature in various day patterns are analyzed and framed as the pattern for given dataset. Then from that, an improved neural network architecture based on the block probability analysis are used to classify the data pattern to predict the class of paddy crop according to the features of dataset. From that classification result, the measurement of data represents the type of paddy according to the weather condition and other features. This type of classification model assists where to plant the crop and also prevents the damage to crop due to the excess of water or excess of temperature. The result analysis presents the comparison result of proposed work with the other state-of-art methods of data classification.