• 제목/요약/키워드: Data Classification Scheme

검색결과 282건 처리시간 0.032초

SEMISUPERVISED CLASSIFICATION FOR FAULT DIAGNOSIS IN NUCLEAR POWER PLANTS

  • MA, JIANPING;JIANG, JIN
    • Nuclear Engineering and Technology
    • /
    • 제47권2호
    • /
    • pp.176-186
    • /
    • 2015
  • Pattern classifications have become important tools for fault diagnosis in nuclear power plants (NPP). However, it is often difficult to obtain training data under fault conditions to train a supervised classification model. By contrast, normal plant operating data can be easily made available through increased deployment of supervisory, control, and data acquisition systems. Such data can also be used to train classification models to improve the performance of fault diagnosis scheme. In this paper, a fault diagnosis scheme based on semisupervised classification (SSC) scheme is developed. In this scheme, new measurements collected from the plant are integrated with data observed under fault conditions to train the SSC models. The trained models are subsequently applied to new measurements for fault diagnosis. In comparison with supervised classifiers, the proposed scheme requires significantly fewer data collected under fault conditions to train the classifier. The developed scheme has been validated using different fault scenarios on a desktop NPP simulator as well as on a physical NPP simulator using a graph-based SSC algorithm. All the considered faults have been successfully diagnosed. The results have demonstrated that SSC is a promising tool for fault diagnosis in NPPs.

IEEE 802.11ac 변조 방식의 딥러닝 기반 분류 (Deep learning-based classification for IEEE 802.11ac modulation scheme detection)

  • 강석원;김민재;최승원
    • 디지털산업정보학회논문지
    • /
    • 제16권2호
    • /
    • pp.45-52
    • /
    • 2020
  • This paper is focused on the modulation scheme detection of the IEEE 802.11 standard. In the IEEE 802.11ac standard, the information of the modulation scheme is indicated by the modulation coding scheme (MCS) included in the VHT-SIG-A of the preamble field. Transmitting end determines the MCS index suitable for the low signal to noise ratio (SNR) situation and transmits the data accordingly. Since data field decoding can take place only when the receiving end acquires the MCS index information of the frame. Therefore, accurate MCS detection must be guaranteed before data field decoding. However, since the MCS index information is the information obtained through preamble field decoding, the detection rate can be affected significantly in a low SNR situation. In this paper, we propose a relatively robust modulation classification method based on deep learning to solve the low detection rate problem with a conventional method caused by a low SNR.

토지피복분류에 관한 이론적 연구 - 자연환경관리를 중심으로 - (A Theoretical Study on Land Cover Classification - Focused on Natural Environment Management -)

  • 전성우;김귀곤;박종화;이동근
    • 한국환경복원기술학회지
    • /
    • 제2권1호
    • /
    • pp.29-37
    • /
    • 1999
  • Land cover classification is an essential basic information in natural environment management; however, land cover classification studies in Korea have not yet been proceeded to a sufficient level. At the present, only a limited number of the precedent studies that only cover definite city area has been conducted. Furthermore, there is almost no research conducted on the land cover classification schemes that could accurately classify the Korea's land cover conditions. This study primarily focuses on the land cover classification scheme which carries the most urgent priority in order to classify and to map out the Korean land cover conditions. In order to develop the most suitable land cover classification scheme, many foreign land cover classification cases and projects that are being carried out were reviewed in depth. The land cover classification scheme this study proposes comprises 3 levels : The first level consists of 7 different classes; the second level consists of 22 different classes; and the third level is made up of 50 classes. The land cover classification map will serve many important roles in natural environment management, such as the conjecture of natural habitats and estimation of oxygen production or carbon dioxide absorption capability of a forest. In water pollution modelling, the land cover classification data can be used to estimate and locate non-point sources of water pollution. If applied to a watershed, modelling it will allow to estimate the total amount of pollution from non-point sources of pollution in the water shed. The land cover classification data will also be good as a barometer data that determines defusion of air pollutants in air pollution modelling.

  • PDF

Application of Bayesian Statistical Analysis to Multisource Data Integration

  • Hong, Sa-Hyun;Moon, Wooil-M.
    • 대한원격탐사학회:학술대회논문집
    • /
    • 대한원격탐사학회 2002년도 Proceedings of International Symposium on Remote Sensing
    • /
    • pp.394-399
    • /
    • 2002
  • In this paper, Multisource data classification methods based on Bayesian formula are considered. For this decision fusion scheme, the individual data sources are handled separately by statistical classification algorithms and then Bayesian fusion method is applied to integrate from the available data sources. This method includes the combination of each expert decisions where the weights of the individual experts represent the reliability of the sources. The reliability measure used in the statistical approach is common to all pixels in previous work. In this experiment, the weight factors have been assigned to have different value for all pixels in order to improve the integrated classification accuracies. Although most implementations of Bayesian classification approaches assume fixed a priori probabilities, we have used adaptive a priori probabilities by iteratively calculating the local a priori probabilities so as to maximize the posteriori probabilities. The effectiveness of the proposed method is at first demonstrated on simulations with artificial and evaluated in terms of real-world data sets. As a result, we have shown that Bayesian statistical fusion scheme performs well on multispectral data classification.

  • PDF

A Model-based Collaborative Filtering Through Regularized Discriminant Analysis Using Market Basket Data

  • Lee, Jong-Seok;Jun, Chi-Hyuck;Lee, Jae-Wook;Kim, Soo-Young
    • Management Science and Financial Engineering
    • /
    • 제12권2호
    • /
    • pp.71-85
    • /
    • 2006
  • Collaborative filtering, among other recommender systems, has been known as the most successful recommendation technique. However, it requires the user-item rating data, which may not be easily available. As an alternative, some collaborative filtering algorithms have been developed recently by utilizing the market basket data in the form of the binary user-item matrix. Viewing the recommendation scheme as a two-class classification problem, we proposed a new collaborative filtering scheme using a regularized discriminant analysis applied to the binary user-item data. The proposed discriminant model was built in terms of the major principal components and was used for predicting the probability of purchasing a particular item by an active user. The proposed scheme was illustrated with two modified real data sets and its performance was compared with the existing user-based approach in terms of the recommendation precision.

텍스트 분석을 통한 제품 분류 체계 수립방안: 관광분야 App을 중심으로 (Building a Hierarchy of Product Categories through Text Analysis of Product Description)

  • 임현아;최재원;이홍주
    • 지식경영연구
    • /
    • 제20권3호
    • /
    • pp.139-154
    • /
    • 2019
  • With the increasing use of smartphone apps, many apps are coming out in various fields. In order to analyze the current status and trends of apps in a specific field, it is necessary to establish a classification scheme. Various schemes considering users' behavior and characteristics of apps have been proposed, but there is a problem in that many apps are released and a fixed classification scheme must be updated according to the passage of time. Although it is necessary to consider many aspects in establishing classification scheme, it is possible to grasp the trend of the app through the proposal of a classification scheme according to the characteristic of the app. This research proposes a method of establishing an app classification scheme through the description of the app written by the app developers. For this purpose, we collected explanations about apps in the tourism field and identified major categories through topic modeling. Using only the apps corresponding to the topic, we construct a network of words contained in the explanatory text and identify subcategories based on the networks of words. Six topics were selected, and Clauset Newman Moore algorithm was applied to each topic to identify subcategories. Four or five subcategories were identified for each topic.

슈퍼스칼라 프로세서에서 동적 분류 능력을 갖는 혼합형 데이타 값 예측기의 설계 (Design of a Hybrid Data Value Predictor with Dynamic Classification Capability in Superscalar Processors)

  • 박희룡;이상정
    • 한국정보과학회논문지:시스템및이론
    • /
    • 제27권8호
    • /
    • pp.741-751
    • /
    • 2000
  • 슈퍼스칼라 프로세서에서 명령어 수준 병렬성(Instruction Level Parallelism)을 적극적으로 활용하기 위해서는 명령들 사이에 존재하는 제어 종속관계 및 데이타 종속관계를 극복하는 것이 필수적이다. 데이타 값 예측은 하나의 명령 결과가 생성되기 전에 미리 결과 값을 예측하고 이 예측된 결과를 사용하여 데이타 종속관계가 있는 명령들을 투기적으로 실행(speculative execution)하는 기법이다. 본 논문에서는 동적 분류 능력을 갖는 혼합형 데이타 값 예측기를 제안한다. 제안된 예측기는 최근 값 예측기, 스트라이드 예측기 및 2 단계 예측기를 결합한 혼합형으로 구성되며, 예측되는 명령은 하드웨어에 의한 동적 분류에 의해 각 예측기로 할당된다. 각 명령들의 특성에 따라 각 예측기로 실행 시에 동적 분류됨으로써 각 예측기는 기존의 혼합형 방식보다도 더욱 효과적으로 활용될 수 있다. 제안된 방식의 타당성 검증을 위해 실행구동방식(execution-driven) 시뮬레이터를 사용하여 SPECint95 벤치마크를 시뮬레이션하여 비교한다. 실험 결과 Instruction Per Cycle 비교실험에서 2 단계 예측기 보다 0.36, 혼합형 예측기 보다 0.0l8의 성능을 보였고, 제안된 방식이 기존의 혼합형 방식보다 예측 정확도가 평균 16%가 향상되었고, 하드웨어 비용을 측정한 결과 45%의 감소효과를 얻었다.

  • PDF

Data-Adaptive ECOC for Multicategory Classification

  • Seok, Kyung-Ha
    • Journal of the Korean Data and Information Science Society
    • /
    • 제19권1호
    • /
    • pp.25-36
    • /
    • 2008
  • Error Correcting Output Codes (ECOC) can improve generalization performance when applied to multicategory classification problem. In this study we propose a new criterion to select hyperparameters included in ECOC scheme. Instead of margins of a data we propose to use the probability of misclassification error since it makes the criterion simple. Using this we obtain an upper bound of leave-one-out error of OVA(one vs all) method. Our experiments from real and synthetic data indicate that the bound leads to good estimates of parameters.

  • PDF

시계열 식생지수와 과거 작물 재배 패턴을 이용한 대규모 작물 분류도의 조기 제작 - 미국 아이오와 주 사례연구 - (Early Production of Large-area Crop Classification Map using Time-series Vegetation Index and Past Crop Cultivation Patterns - A Case Study in Iowa State, USA -)

  • 김예슬;박노욱;홍석영;이경도;유희영
    • 대한원격탐사학회지
    • /
    • 제30권4호
    • /
    • pp.493-503
    • /
    • 2014
  • 이 논문에서는 대규모 작물 재배 지역의 작물 분류도의 조기 제작을 목적으로 분광학적 혼재를 줄이고, 과거 토지피복도의 작물 재배 패턴을 반영할 수 있는 계층적 분류 방법론을 제안하였다. 특히 작물 생육 주기로부터 다른 분광 특성을 고려한 계층적 분류 접근을 적용하고, 과거 작물 재배 패턴으로부터 추출된 시간적 문맥 정보를 함께 고려함으로써 분광 혼재가 두드러진 화소의 영향을 줄일 수 있다. 제안 분류 기법의 적용성을 평가하기 위해 미국 아이오와 주 전체를 대상으로 시계열 MODIS 250 m 정규식생지수 자료와 과거 crop data layer를 사용하는 사례 연구를 수행하였다. 사례 연구를 통해 다른 분류 단계와 과거 작물 재배 패턴을 고려함으로써 대상 지역의 주요 재배 작물이면서 분광학적 유사도가 두드러진 콩과 옥수수를 효과적으로 구분할 수 있었다. 그리고 분광 정보만을 이용한 분류 결과에 비해 제안 기법이 최소 7.68%p에서 최대 20.96%p의 향상된 분류 정확도를 보였다. 또한 분류 단계에서 시간적 문맥 정보를 결합함으로써 사용 NDVI 자료의 수에 영향을 덜 받는 가장 높은 분류 정확도(최대 전체 정확도: 86.63%)를 얻을 수 있었다. 따라서 제안 분류 기법은 주요 곡물 수입국의 대규모 작물 구분도의 조기 제작에 유용하게 사용될 수 있을 것으로 기대된다.

데이터 마이닝의 분류화와 연관 규칙을 이용한 네트워크 트래픽 분석 (Analysis of Network Traffic using Classification and Association Rule)

  • 이창언;김응모
    • 한국시뮬레이션학회논문지
    • /
    • 제11권4호
    • /
    • pp.15-23
    • /
    • 2002
  • As recently the network environment and application services have been more complex and diverse, there has. In this paper we introduce a scheme the extract useful information for network management by analyzing traffic data in user login file. For this purpose we use classification and association rule based on episode concept in data mining. Since login data has inherently time series characterization, convertible data mining algorithms cannot directly applied. We generate virtual transaction, classify transactions above threshold value in time window, and simulate the classification algorithm.

  • PDF