• Title/Summary/Keyword: Data Classification Systems


Semi-Supervised SAR Image Classification via Adaptive Threshold Selection (선별적인 임계값 선택을 이용한 준지도 학습의 SAR 분류 기술)

  • Jaejun Do;Minjung Yoo;Jaeseok Lee;Hyoi Moon;Sunok Kim
    • Journal of the Korea Institute of Military Science and Technology / v.27 no.3 / pp.319-328 / 2024
  • Semi-supervised learning is an effective way to train a classification model using a small amount of labeled data and a large amount of unlabeled data. We applied semi-supervised learning to a synthetic aperture radar (SAR) image classification model, for which datasets are limited and difficult to create. To address this difficulty, semi-supervised learning uses a model trained with a small amount of labeled data to generate pseudo labels and then learns from them. Many papers use a single fixed threshold to create pseudo labels. In this paper, we present a semi-supervised SAR image classification method that applies a different threshold to each class, instead of having all classes share one fixed threshold, to improve SAR classification performance with a small number of labeled datasets.
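The contrast between a single fixed threshold and per-class thresholds can be illustrated with a minimal sketch. The abstract does not say how the thresholds are chosen, so the function name `select_pseudo_labels`, the probability values, and the threshold values below are all hypothetical.

```python
def select_pseudo_labels(probs, class_thresholds):
    """Return (sample_index, class_index) pairs for confident predictions.

    probs: one list of class probabilities per unlabeled sample.
    class_thresholds: one confidence threshold per class (assumed given;
    how the paper adapts them is not described in the abstract).
    """
    selected = []
    for i, p in enumerate(probs):
        k = max(range(len(p)), key=p.__getitem__)  # predicted class
        if p[k] >= class_thresholds[k]:
            selected.append((i, k))
    return selected


# Hypothetical softmax outputs for four unlabeled samples, three classes.
probs = [
    [0.97, 0.02, 0.01],
    [0.10, 0.85, 0.05],
    [0.40, 0.35, 0.25],
    [0.05, 0.15, 0.80],
]

fixed = select_pseudo_labels(probs, [0.95, 0.95, 0.95])
per_class = select_pseudo_labels(probs, [0.95, 0.80, 0.75])
print(fixed)      # only sample 0 passes the shared threshold
print(per_class)  # samples 0, 1 and 3 pass their class thresholds
```

With a shared threshold, classes the model predicts less confidently contribute few or no pseudo labels; lowering the threshold for those classes alone admits more of their samples.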

Design of Fuzzy System with Hierarchical Classifying Structures and its Application to Time Series Prediction (계층적 분류구조의 퍼지시스템 설계 및 시계열 예측 응용)

  • Bang, Young-Keun;Lee, Chul-Heui
    • Journal of the Korean Institute of Intelligent Systems / v.19 no.5 / pp.595-602 / 2009
  • Fuzzy rules, which represent the behavior of a fuzzy system, are sensitive to the fuzzy clustering technique used to build them. If the classification ability of the clustering technique is improved, the system can serve its purpose more accurately, because the clustering technique enhances the quality of the fuzzy rules and parameters. Thus, this paper proposes a new hierarchically structured clustering algorithm that enhances classification ability. The proposed clustering technique consists of two clusters, based on the correlation and the statistical characteristics of the data, which allows more accurate classification. In addition, this paper uses difference data sets to reflect the patterns and regularities of the original data clearly, and constructs multiple fuzzy systems to account for the various characteristics of the differences. To verify the effectiveness of the proposed techniques, this paper applies the constructed fuzzy systems to time series prediction and performs prediction on nonlinear time series examples.
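The idea of predicting on difference data rather than on the raw series can be sketched as follows. This is only an illustration of differencing. The abstract does not specify which differences the authors use, so the `lag` parameter, the helper names, and the sample values are assumptions.

```python
def difference(series, lag=1):
    """Return the lag-differenced series: x[t] - x[t - lag]."""
    return [series[i] - series[i - lag] for i in range(lag, len(series))]


def restore_next(series, predicted_diff, lag=1):
    """Turn a predicted difference back into a predicted raw value."""
    return series[-lag] + predicted_diff


# Hypothetical time series values.
series = [10, 12, 15, 14, 18]

d1 = difference(series, lag=1)
d2 = difference(series, lag=2)
next_value = restore_next(series, predicted_diff=2, lag=1)
print(d1, d2, next_value)
```

A predictor trained on `d1` or `d2` outputs a difference, and `restore_next` maps that output back to the scale of the original series.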

PCA-based Linear Dynamical Systems for Multichannel EEG Classification (다채널 뇌파 분류를 위한 주성분 분석 기반 선형동적시스템)

  • Lee, Hyekyoung;Park, Seungjin
    • Proceedings of the Korean Information Science Society Conference / 2002.10d / pp.232-234 / 2002
  • An EEG-based brain-computer interface (BCI) provides a new communication channel between the human brain and a computer. The classification of EEG data is an important task in EEG-based BCI. In this paper we present methods that jointly employ principal component analysis (PCA) and linear dynamical system (LDS) modeling for EEG classification. An experimental study on classifying EEG data recorded during imagined left or right hand movement confirms the validity of the proposed methods.


A Study on the Landcover Classification using Band Ratioing Data of Landsat-TM (Landsat-TM의 밴드비 연산데이터를 이용한 토지피복분류에 관한 연구)

  • Kwon, Bong-Kyum;Yamada, Kiyoshi;Niren, Takaaki;Jo, Myung-Hee
    • Journal of the Korean Association of Geographic Information Studies / v.6 no.2 / pp.80-91 / 2003
  • In this research, reusing band ratio data was proposed and examined as a way to raise the accuracy of landcover classification from satellite data. To determine which bands to use in the ratio calculation, the six bands other than band 6 were combined in every possible combination, and landcover classification was carried out for each by maximum likelihood classification (MLC), a supervised classification method. From the classification results of the forty-nine combinations, the two bands that appeared most often among the ten most accurate combinations were selected and used in the calculation. Landcover classification was then performed again after the calculation result had been recombined with the data, and the new result was compared with the classification obtained from the original data. The accuracy of the new classification decreased for the ground class but improved over all classes. In particular, the accuracy for urban areas improved considerably. It was therefore concluded that reusing band ratio data is very useful for landcover classification and for analyzing land information in urban areas.
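A band ratio and its recombination with the original bands can be sketched in a few lines. The abstract does not name the two selected bands or give the exact formula, so the plain pixel-wise quotient, the zero-division rule, the helper names, and the pixel values below are assumptions.

```python
def band_ratio(band_a, band_b):
    """Pixel-wise ratio band_a / band_b for two equally sized 2-D grids.

    Pixels where band_b is zero are set to 0.0 (an assumed convention).
    """
    return [
        [a / b if b != 0 else 0.0 for a, b in zip(row_a, row_b)]
        for row_a, row_b in zip(band_a, band_b)
    ]


def append_feature(pixel_features, extra_band):
    """Append one extra band value to every pixel's feature vector."""
    return [
        [feats + [value] for feats, value in zip(row_f, row_v)]
        for row_f, row_v in zip(pixel_features, extra_band)
    ]


# Hypothetical 2x2 digital numbers for two bands.
band_x = [[80, 60], [20, 0]]
band_y = [[40, 30], [40, 0]]

ratio = band_ratio(band_x, band_y)

# Original per-pixel features [band_x, band_y], extended with the ratio.
features = [[[x, y] for x, y in zip(rx, ry)] for rx, ry in zip(band_x, band_y)]
extended = append_feature(features, ratio)
print(ratio)
print(extended[0][0])
```

The extended feature vectors are what a supervised classifier would then be trained on in place of the original bands alone.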


Analysis and Subclass Classification of Microarray Gene Expression Data Using Computational Biology (전산생물학을 이용한 마이크로어레이의 유전자 발현 데이터 분석 및 유형 분류 기법)

  • Yoo, Chang-Kyoo;Lee, Min-Young;Kim, Young-Hwang;Lee, In-Beum
    • Journal of Institute of Control, Robotics and Systems / v.11 no.10 / pp.830-836 / 2005
  • The application of microarray technologies, which simultaneously monitor the expression patterns of thousands of individual genes in different biological systems, has greatly increased the amount of available gene expression data and has provided new insights into gene expression during drug development, within disease processes, and across species. There is a great need for data mining methods that allow straightforward interpretation, visualization and analysis of the relevant information contained in gene expression profiles. In particular, classifying biological samples into known classes or phenotypes is an important practical application of microarray gene expression profiles. Gene expression profiles obtained from tissue samples of patients thus allow cancer classification. In this research, molecular classification of microarray gene expression data is applied to multi-class cancer using computational biology techniques such as gene selection, principal component analysis and fuzzy clustering. The proposed method was applied to microarray data from leukemia patients; specifically, it was used to interpret the gene expression pattern and analyze the leukemia subtype whose expression profiles correlated with four cases of acute leukemia gene expression. A basic introduction to microarray data analysis is also given.

A Construction of Fuzzy Model for Data Mining (데이터 마이닝을 위한 퍼지 모델 동정)

  • Kim, Do-Wan;Park, Jin-Bae;Kim, Jung-Chan;Joo, Young-Hoon
    • Proceedings of the Korean Institute of Intelligent Systems Conference / 2002.12a / pp.191-194 / 2002
  • In this paper, a new GA-based methodology with information granules is suggested for constructing a fuzzy classifier. We deal with the selection of the fuzzy region as well as two major classification problems: feature selection and pattern classification. The proposed method consists of three steps: the selection of the fuzzy region, the construction of the fuzzy sets, and the tuning of the fuzzy rules. Genetic algorithms (GAs) are applied to the development of the information granules in order to determine satisfactory fuzzy regions. Finally, GAs are also applied to the tuning of the fuzzy rules to handle misclassified data (e.g., data with unusual patterns or on the boundaries between classes). To show the effectiveness of the proposed method, an example, the classification of the Iris data, is provided.

Robust Real-time Intrusion Detection System

  • Kim, Byung-Joo;Kim, Il-Kon
    • Journal of Information Processing Systems / v.1 no.1 s.1 / pp.9-13 / 2005
  • Computer security has become a critical issue with the rapid development of business and other transaction systems over the Internet. The application of artificial intelligence, machine learning and data mining techniques to intrusion detection systems has been increasing recently, but most research focuses on improving the classification performance of a classifier. Selecting important features from the input data simplifies the problem and leads to faster and more accurate detection. Thus, feature selection is an important issue in intrusion detection. Another issue is that most intrusion detection systems operate off-line, which is not suitable for real-time intrusion detection. In this paper, we develop a real-time intrusion detection system that combines an on-line feature extraction method with the Least Squares Support Vector Machine classifier. Applying the proposed system to the KDD CUP 99 data, experimental results show that it has remarkable feature extraction and classification performance compared to existing off-line intrusion detection systems.

Labeling Big Spatial Data: A Case Study of New York Taxi Limousine Dataset

  • AlBatati, Fawaz;Alarabi, Louai
    • International Journal of Computer Science & Network Security / v.21 no.6 / pp.207-212 / 2021
  • Clustering unlabeled spatial datasets to convert them into labeled spatial datasets is a challenging task, especially for geographical information systems. In this research study we investigated the NYC Taxi Limousine Commission dataset and found that all of the spatial-temporal trajectories are unlabeled spatial data, which makes them unsuitable for data mining tasks such as classification and regression. It is therefore necessary to convert the unlabeled spatial datasets into labeled spatial datasets. In this study we use a clustering technique to do this for all the trajectory datasets. A key difficulty in applying machine learning classification algorithms to many applications is that they require large amounts of labeled data, and labeling big data is in many cases a costly process. In this paper, we show the effectiveness of using a clustering technique to label spatial data, which leads to a high-accuracy classifier.
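Labeling by clustering can be sketched with a small k-means routine, where each point's cluster index serves as its label for a later classifier. The abstract does not name the clustering technique, so the use of k-means, the deterministic initialization, and the coordinates below (not real taxi data) are all assumptions.

```python
def squared_distance(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans_labels(points, k, iters=20):
    """Cluster points with plain k-means and return (labels, centers).

    Centers start at the first k points so the result is deterministic.
    """
    centers = [tuple(points[i]) for i in range(k)]
    labels = [0] * len(points)
    for _ in range(iters):
        # Assignment step: each point takes the index of its nearest center.
        labels = [
            min(range(k), key=lambda c: squared_distance(p, centers[c]))
            for p in points
        ]
        # Update step: each center moves to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centers[c] = tuple(sum(x) / len(members) for x in zip(*members))
    return labels, centers


# Hypothetical 2-D locations forming two well-separated groups.
points = [(0.0, 0.0), (5.0, 5.0), (0.1, 0.2), (5.1, 4.9), (0.2, 0.1), (4.9, 5.2)]

labels, centers = kmeans_labels(points, k=2)
labeled = list(zip(points, labels))  # (point, label) pairs for a classifier
print(labels)
```

The `labeled` pairs can then be split into training and test sets for any supervised classifier, which is the role the abstract assigns to the clustering step.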

Development of the ISO 15926-based Classification Structure for Nuclear Plant Equipment (ISO 15926 국제 표준을 이용한 원자력 플랜트 기자재 분류체계)

  • Yun, J.;Mun, D.;Han, S.;Cho, K.
    • Korean Journal of Computational Design and Engineering / v.12 no.3 / pp.191-199 / 2007
  • To construct a data warehouse of process plant equipment, a classification structure should be defined first, identifying not only the equipment categories but also the attributes of each piece of equipment that represent its specifications. ISO 15926 Process Plants is an international standard dealing with the life-cycle data of process plant facilities. From the viewpoint of defining a classification structure, the Part 2 data model and the Reference Data Library (RDL) of ISO 15926 respectively provide a standard syntactic structure and a semantic vocabulary, facilitating the exchange and sharing of plant equipment life-cycle data. Therefore, an equipment data warehouse with an ISO 15926-based classification structure has the advantage of easy integration among different engineering systems. This paper introduces ISO 15926 and then discusses how to define a classification structure with the ISO 15926 Part 2 data model and RDL. Finally, we describe the resulting ISO 15926-based classification structure for a variety of equipment in the reactor coolant system (RCS) of the APR 1400 nuclear plant.

The development of web based power plant maintenance management system (Web기반 발전설비 정비관리시스템 개발)

  • Kim, Bum-Shin;Kim, Eui-Hyun;Jang, Don-Sik;Cho, Jae-Min;Chae, Gil-Seok;Jung, Gyu-Chol
    • Proceedings of the KSME Conference / 2004.04a / pp.2059-2063 / 2004
  • Most power plants have operated many independent computerized systems for maintenance. The independence of these systems has made business processes complex and computer system management inconvenient. Because the equipment and material master data are not standardized and structured, it is difficult to manage equipment maintenance history and material delivery. Equipment classification criteria are especially important for standardizing all maintenance information. It is necessary to integrate the functions of the independent systems to simplify business processes and speed up work flow. This paper presents the design of equipment classification criteria and a system integration method, using the development of a live system as a case study.
