• Title/Summary/Keyword: data classification

Search Result 8,004, Processing Time 0.04 seconds

배전용 변압기 부하사용 패턴분류 (Pattern Classification of Load Demand for Distribution Transformer)

  • 윤상윤;김재철;이영석
    • 대한전기학회:학술대회논문집
    • /
    • 대한전기학회 2001년도 춘계학술대회 논문집 전력기술부문
    • /
    • pp.89-91
    • /
    • 2001
  • This paper presents the result of pattern classification of load demand for distribution transformer in domestic. The field data of load demand is measured using the load acquisition device and the measurement data is used for the database system for load management of distribution transformed. For the pattern classification, the load data and the customer information data are also used. The K-MEAN method is used for the pattern classification algorithm. The result of pattern classification is used for the 2-step format of load demand curve.

  • PDF

기사데이터베이스의 분류항목과 데이터표시형식에 관한 비교분석 (Analysis on classification item and data display format of newspaper article database)

  • 한상길
    • 한국도서관정보학회지
    • /
    • 제23권
    • /
    • pp.329-362
    • /
    • 1995
  • Newspaper Article Information is an important source of information on social phenomenon with historical value. The development of computer and technology of information communication enables the construction of Newspaper Article Database by CTS and service through computer communication. It made it possible for the peoples to utilize the Newspaper Article Information easily. However, it is very difficult to utilize the currently prevailing system. There are differences in classification system of Newspaper Article Database and the Data Display Format. This survey aims to review the characteristics of Newspaper Article Database and current domestic computer communication service system. By comparing the classification system of Retrieval Menu and Data Display Format, I intended to suggest the standardized way of utilization which enables the users utilize them more easily and conveniently. The results of this survey is as follows : 1. More sub-divided distinction of classification item is required. 2. Separate classification item should be established for the distinction of article form which is very difficult to classify the subject. 3. Data Display Format should be equi n.0, pped with standardized format and signal which enables the users recognize it easily.

  • PDF

Fuzzy Classification Method for Processing Incomplete Dataset

  • Woo, Young-Woon;Lee, Kwang-Eui;Han, Soo-Whan
    • Journal of information and communication convergence engineering
    • /
    • 제8권4호
    • /
    • pp.383-386
    • /
    • 2010
  • Pattern classification is one of the most important topics for machine learning research fields. However incomplete data appear frequently in real world problems and also show low learning rate in classification models. There have been many researches for handling such incomplete data, but most of the researches are focusing on training stages. In this paper, we proposed two classification methods for incomplete data using triangular shaped fuzzy membership functions. In the proposed methods, missing data in incomplete feature vectors are inferred, learned and applied to the proposed classifier using triangular shaped fuzzy membership functions. In the experiment, we verified that the proposed methods show higher classification rate than a conventional method.

Classification of High Dimensionality Data through Feature Selection Using Markov Blanket

  • Lee, Junghye;Jun, Chi-Hyuck
    • Industrial Engineering and Management Systems
    • /
    • 제14권2호
    • /
    • pp.210-219
    • /
    • 2015
  • A classification task requires an exponentially growing amount of computation time and number of observations as the variable dimensionality increases. Thus, reducing the dimensionality of the data is essential when the number of observations is limited. Often, dimensionality reduction or feature selection leads to better classification performance than using the whole number of features. In this paper, we study the possibility of utilizing the Markov blanket discovery algorithm as a new feature selection method. The Markov blanket of a target variable is the minimal variable set for explaining the target variable on the basis of conditional independence of all the variables to be connected in a Bayesian network. We apply several Markov blanket discovery algorithms to some high-dimensional categorical and continuous data sets, and compare their classification performance with other feature selection methods using well-known classifiers.

A Study on the Necessity for the Standardization of Information Classification System about Construction Products

  • Hong, Simhee;Yu, Jung-ho
    • 국제학술발표논문집
    • /
    • The 7th International Conference on Construction Engineering and Project Management Summit Forum on Sustainable Construction and Management
    • /
    • pp.121-123
    • /
    • 2017
  • The widespread dissemination of the green building certification system has led to the ongoing development of information management technologies with the aim to effectively utilize construction product information. Among them, a data crawling technology enables to collect the data conveniently and to manage large volumes of construction product information in Korea and overseas. However, without a standardized classification system, it is difficult to efficiently utilize information, and problems such as an additional work for classifying information or information-sharing errors. Therefore, this study suggests to present a necessity for the standardization of the information classification system through expert interviews, and to compare construction product classification systems in Korea and overseas. This study is expected to present a necessity for the effective management of construction product information and the standardization of information-sharing with regard to various construction certifications.

  • PDF

직교요인을 이용한 국소선형 로지스틱 마이크로어레이 자료의 판별분석 (Local Linear Logistic Classification of Microarray Data Using Orthogonal Components)

  • 백장선;손영숙
    • 응용통계연구
    • /
    • 제19권3호
    • /
    • pp.587-598
    • /
    • 2006
  • 본 논문에서는 마이크로어레이 (microarray) 자료에 판별분석을 적용 시 나타나는 고차원 및 소표본 문제의 해결방법으로서 직교요인을 새로운 특징변수로 사용한 비모수적 국소선형 로지스틱 판별분석을 제안한다. 제안된 방법은 국소우도에 기반한 것으로서 다범주 판별분석에 적용될 수 있으며, 고려된 직교인자는 주성분 요인, 부분최소제곱 요인, 인자분석 요인 등이다. 대표적인 두 가지 실제 마이크로어레이 자료에 적용한 결과 직교요인들 중에서 부분최소제곱 요인을 특징변수로 사용한 경우 고전적인 통계적 판별분석보다 향상된 분류 능력을 나타내고 있음을 확인하였다.

Bathymetric mapping in Dong-Sha Atoll using SPOT data

  • Huang, Shih-Jen;Wen, Yao-Chung
    • 대한원격탐사학회:학술대회논문집
    • /
    • 대한원격탐사학회 2006년도 Proceedings of ISRS 2006 PORSEC Volume II
    • /
    • pp.525-528
    • /
    • 2006
  • The remote sensing data can be used to calculate the water depth especially in the clear and shallow water area. In this study, the SPOT data was used for bathymetric mapping in Dong-Sha atoll, located in northern South China Sea. The in situ sea depth was collected by echo sounder as well. A global positioning system was employed to locate the accurate sampling points for sea depth. An empirical model between measurement sea depth and band digital count was determined and based on least squares regression analysis. Both non-classification and unsupervised classification were used in this study. The results show that the standard error is less than 0.9m for non-classification. Besides, the 10% error related to the measurement water depth can be satisfied for more than 85% in situ data points. Otherwise, the 10% relative error can reach more than 97%, 69%, and 51% data points at class 4, 5, and 6 respectively if supervised classification is applied. Meanwhile, we also find that the unsupervised classification can get more accuracy to estimate water depth with standard error less than 0.63, 0.93, and 0.68m at class 4, 5, and 6 respectively.

  • PDF

L1-거리와 L1-데이터뎁스를 이용한 분류방법의 비교연구 (Comparison Studies of Classification Methods based on L1-Distance and L1-Data Depth)

  • 백수진;황진수;김진경
    • 응용통계연구
    • /
    • 제19권1호
    • /
    • pp.183-193
    • /
    • 2006
  • $L_1$-데이터뎁스를 이용한 분류방법(L1DDclass)과 관측치들 사이의 $L_1$-거리를 이용한 분류방법(L1DISTclass)의 특징을 살펴보고, 이 두 방법을 결합한 새로운 분류방법 (DnDclass: Distance and Data-depth based classification)의 효용성을 소개하고자 한다. 모의실험을 통해 세가지 분류방법의 결과를 비교하고 제안된 분류방법이 다양한 경우에 더 효과적일 수 있다는 사실을 확인한다.

토양.지하수오염원 분류체계 구축방안: 2. 분류체계 구축 및 속성자료 활용방안 (Building a Classification Scheme of Soil and Groundwater Contamination Sources in Korea: 2. Construction of Classification System and Applications of Attribute Data)

  • 안정이;신경희;황상일
    • 한국지하수토양환경학회지:지하수토양환경
    • /
    • 제15권6호
    • /
    • pp.122-127
    • /
    • 2010
  • Constructing the national inventory that can be used as a tool to identify and assess existing or potential contamination is necessary for efficiently managing the soil and groundwater contamination. In order to start this construction, the first step is how we define and classify potential contamination sources of soil and groundwater. After selecting the basic classification model of contamination sources from developed countries, we suggested the classification and list of the potential contamination sources of soil and groundwater which are appropriate for specific conditions of South Korea. In addition, we investigated several databases to confirm the existence of available data sources and then examined established attribute data through chemical accident response information system (CARIS) and water information system (WIS) in National Institute of Environmental Research and mine geographic information system (MGIS) in Mine Reclamation Corporation. All sorts of attribute data in the existing databases can be utilized as significant assessment factors for determining the management priority of potential contamination sources in the future. Therefore, it is required the expanded investigation of additional database sources and the continual modification so that the classification system of potential contamination sources can be improved.

Ensemble Modulation Pattern based Paddy Crop Assist for Atmospheric Data

  • Sampath Kumar, S.;Manjunatha Reddy, B.N.;Nataraju, M.
    • International Journal of Computer Science & Network Security
    • /
    • 제22권9호
    • /
    • pp.403-413
    • /
    • 2022
  • Classification and analysis are improved factors for the realtime automation system. In the field of agriculture, the cultivation of different paddy crop depends on the atmosphere and the soil nature. We need to analyze the moisture level in the area to predict the type of paddy that can be cultivated. For this process, Ensemble Modulation Pattern system and Block Probability Neural Network based classification models are used to analyze the moisture and temperature of land area. The dataset consists of the collections of moisture and temperature at various data samples for a land. The Ensemble Modulation Pattern based feature analysis method, the extract of the moisture and temperature in various day patterns are analyzed and framed as the pattern for given dataset. Then from that, an improved neural network architecture based on the block probability analysis are used to classify the data pattern to predict the class of paddy crop according to the features of dataset. From that classification result, the measurement of data represents the type of paddy according to the weather condition and other features. This type of classification model assists where to plant the crop and also prevents the damage to crop due to the excess of water or excess of temperature. The result analysis presents the comparison result of proposed work with the other state-of-art methods of data classification.