• Title/Summary/Keyword: Rough set theory

Search Result 86, Processing Time 0.033 seconds

Intelligent information filtering using rough sets

  • Ratanapakdee, Tithiwat;Pinngern, Ouen
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2004.08a
    • /
    • pp.1302-1306
    • /
    • 2004
  • This paper proposes a model for information filtering (IF) on the Web. The user information need is described into two levels in this model: profiles on category level, and Boolean queries on document level. To efficiently estimate the relevance between the user information need and documents by fuzzy, the user information need is treated as a rough set on the space of documents. The rough set decision theory is used to classify the new documents according to the user information need. In return for this, the new documents are divided into three parts: positive region, boundary region, and negative region. We modified user profile by the user's relevance feedback and discerning words in the documents. In experimental we compared the results of three methods, firstly is to search documents that are not passed the filtering system. Second, search documents that passed the filtering system. Lastly, search documents after modified user profile. The result from using these techniques can obtain higher precision.

  • PDF

Design of a Hierarchically Structured Gas Identification System Using Fuzzy Sets and Rough Sets (퍼지집합과 러프집합을 이용한 계층 구조 가스 식별 시스템의 설계)

  • Bang, Young-Keun;Lee, Chul-Heui
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.67 no.3
    • /
    • pp.419-426
    • /
    • 2018
  • An useful and effective design method for the gas identification system is presented in this paper. The proposed gas identification system adopts hierarchical structure with two level rule base combining fuzzy sets with rough sets. At first, a hybrid genetic algorithm is used in grouping the array sensors of which the measured patterns are similar in order to reduce the dimensionality of patterns to be analyzed and to make rule construction easy and simple. Next, for low level identification, fuzzy inference systems for each divided group are designed by using TSK fuzzy rule, which allow handling the drift and the uncertainty of sensor data effectively. Finally, rough set theory is applied to derive the identification rules at high level which reflect the identification characteristics of each divided group. Thus, the proposed method is able to accomplish effectively dimensionality reduction as well as accurate gas identification. In simulation, we demonstrated the effectiveness of the proposed methods by identifying five types of gases.

Rough Set-Based Approach for Automatic Emotion Classification of Music

  • Baniya, Babu Kaji;Lee, Joonwhoan
    • Journal of Information Processing Systems
    • /
    • v.13 no.2
    • /
    • pp.400-416
    • /
    • 2017
  • Music emotion is an important component in the field of music information retrieval and computational musicology. This paper proposes an approach for automatic emotion classification, based on rough set (RS) theory. In the proposed approach, four different sets of music features are extracted, representing dynamics, rhythm, spectral, and harmony. From the features, five different statistical parameters are considered as attributes, including up to the $4^{th}$ order central moments of each feature, and covariance components of mutual ones. The large number of attributes is controlled by RS-based approach, in which superfluous features are removed, to obtain indispensable ones. In addition, RS-based approach makes it possible to visualize which attributes play a significant role in the generated rules, and also determine the strength of each rule for classification. The experiments have been performed to find out which audio features and which of the different statistical parameters derived from them are important for emotion classification. Also, the resulting indispensable attributes and the usefulness of covariance components have been discussed. The overall classification accuracy with all statistical parameters has recorded comparatively better than currently existing methods on a pair of datasets.

Uncertainty Improvement of Incomplete Decision System using Bayesian Conditional Information Entropy (베이지언 정보엔트로피에 의한 불완전 의사결정 시스템의 불확실성 향상)

  • Choi, Gyoo-Seok;Park, In-Kyu
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.14 no.6
    • /
    • pp.47-54
    • /
    • 2014
  • Based on the indiscernible relation of rough set, the inevitability of superposition and inconsistency of data makes the reduction of attributes very important in information system. Rough set has difficulty in the difference of attribute reduction between consistent and inconsistent information system. In this paper, we propose the new uncertainty measure and attribute reduction algorithm by Bayesian posterior probability for correlation analysis between condition and decision attributes. We compare the proposed method and the conditional information entropy to address the uncertainty of inconsistent information system. As the result, our method has more accuracy than conditional information entropy in dealing with uncertainty via mutual information of condition and decision attributes of information system.

The Improvement of Rough- set Theory Histogram in Color- image Segmentation

  • Zheng, Qi;Lee, Hyo Jong
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2011.11a
    • /
    • pp.429-430
    • /
    • 2011
  • Roughness set theory is a popular topic to use in color-image segmentation. A new popular color image segmentation algorithm is proposed by scientists with the point using traditional histogram and Histon construct roughness set histogram. But, there is still a problem about that is the correlativity of color vector in roughness set histogram, which take an inactive effect in the process of color-image segmentation. Therefore, this paper represents further research based on this and proposed an improved method proved through lot of experiments. The experimental result reduces the correlativity of color vector in roughness set histogram and calculation time remarkably.

The Integrated Methodology of Rough Set Theory and Artificial Neural Network for Business Failure Prediction (도산 예측을 위한 러프집합이론과 인공신경망 통합방법론)

  • Kim, Chang-Yun;Ahn, Byeong-Seok;Cho, Sung-Sik;Kim, Soung-Hie
    • Asia pacific journal of information systems
    • /
    • v.9 no.4
    • /
    • pp.23-40
    • /
    • 1999
  • This paper proposes a hybrid intelligent system that predicts the failure of firms based on the past financial performance data, combining neural network and rough set approach, We can get reduced information table, which implies that the number of evaluation criteria such as financial ratios and qualitative variables and objects (i.e., firms) is reduced with no information loss through rough set approach. And then, this reduced information is used to develop classification rules and train neural network to infer appropriate parameters. Through the reduction of information table, it is expected that the performance of the neural network improve. The rules developed by rough sets show the best prediction accuracy if a case does match any of the rules. The rationale of our hybrid system is using rules developed by rough sets for an object that matches any of the rules and neural network for one that does not match any of them. The effectiveness of our methodology was verified by experiments comparing traditional discriminant analysis and neural network approach with our hybrid approach. For the experiment, the financial data of 2,400 Korean firms during the period 1994-1996 were selected, and for the validation, k-fold validation was used.

  • PDF

Sensibility Evaluation of Components of Middle and High-rise Apartment Facade in Aesthetic Old Town Districts of Kyoto - Extraction of Component Combinations Using Rough Set Theory - (쿄토시 구시가지형미관지구에서 중고층 집합주택 입면의 구성요소에 대한 감성평가 - 러프 집합을 이용한 구성요소 조합의 추출 -)

  • Shon, Dong-Hwa
    • Journal of the Korean housing association
    • /
    • v.25 no.3
    • /
    • pp.105-114
    • /
    • 2014
  • Landscape zones have been designated as aesthetic old town districts across a wide range of Nakakyo-Ku and Shimokyo-Ku, city center of Kyoto, Japan. In these districts in which traditional structures and new buildings coexist, regulations of restriction on acts such as new building's heights, shapes, materials, and colors are carried out according to local governmental landscape ordinance based on Scenic Conservation Act. And yet, minimal fulfillment of the regulations according to different designer's subjective interpretation and principle of economy is rather creating abnormal shapes not harmonized with the traditional landscape. Thus, this study aims to extract combinations between form elements of middle and high rise apartment facade that affects 'harmony' and 'mismatch' in the districts by clarifying the social rules commonly implied based on intuitive judgments (sensibility evaluation) in which human experiential knowledge is involved. As research methods, the study first analyzes the form elements of the facade through a field survey, sets up a standard model through tasks of classification and segmentation and draws computer graphic images with 99 different patterns based on it. Based on these images, this study carries out sensibility evaluation and analyzes experimental data applying the rough set theory. As a result of the analysis, the combinations of form elements that affect harmony or mismatch act greatly when the colors and shapes of the pillars, positions and the patterns of the use of the first floor are combined.

Development of Automatic Rule Extraction Method in Data Mining : An Approach based on Hierarchical Clustering Algorithm and Rough Set Theory (데이터마이닝의 자동 데이터 규칙 추출 방법론 개발 : 계층적 클러스터링 알고리듬과 러프 셋 이론을 중심으로)

  • Oh, Seung-Joon;Park, Chan-Woong
    • Journal of the Korea Society of Computer and Information
    • /
    • v.14 no.6
    • /
    • pp.135-142
    • /
    • 2009
  • Data mining is an emerging area of computational intelligence that offers new theories, techniques, and tools for analysis of large data sets. The major techniques used in data mining are mining association rules, classification and clustering. Since these techniques are used individually, it is necessary to develop the methodology for rule extraction using a process of integrating these techniques. Rule extraction techniques assist humans in analyzing of large data sets and to turn the meaningful information contained in the data sets into successful decision making. This paper proposes an autonomous method of rule extraction using clustering and rough set theory. The experiments are carried out on data sets of UCI KDD archive and present decision rules from the proposed method. These rules can be successfully used for making decisions.

Using genetic algorithm to optimize rough set strategy in KOSPI200 futures market (선물시장에서 러프집합 기반의 유전자 알고리즘을 이용한 최적화 거래전략 개발)

  • Chung, Seung Hwan;Oh, Kyong Joo
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.2
    • /
    • pp.281-292
    • /
    • 2014
  • As the importance of algorithm trading is getting stronger, researches for artificial intelligence (AI) based trading strategy is also being more important. However, there are not enough studies about using more than two AI methodologies in one trading system. The main aim of this study is development of algorithm trading strategy based on the rough set theory that is one of rule-based AI methodologies. Especially, this study used genetic algorithm for optimizing profit of rough set based strategy rule. The most important contribution of this study is proposing efficient convergence of two different AI methodology in algorithm trading system. Target of purposed trading system is KOPSI200 futures market. In empirical study, we prove that purposed trading system earns significant profit from 2009 to 2012. Moreover, our system is evaluated higher shape ratio than buy-and-hold strategy.

Extraction Method of Significant Clinical Tests Based on Data Discretization and Rough Set Approximation Techniques: Application to Differential Diagnosis of Cholecystitis and Cholelithiasis Diseases (데이터 이산화와 러프 근사화 기술에 기반한 중요 임상검사항목의 추출방법: 담낭 및 담석증 질환의 감별진단에의 응용)

  • Son, Chang-Sik;Kim, Min-Soo;Seo, Suk-Tae;Cho, Yun-Kyeong;Kim, Yoon-Nyun
    • Journal of Biomedical Engineering Research
    • /
    • v.32 no.2
    • /
    • pp.134-143
    • /
    • 2011
  • The selection of meaningful clinical tests and its reference values from a high-dimensional clinical data with imbalanced class distribution, one class is represented by a large number of examples while the other is represented by only a few, is an important issue for differential diagnosis between similar diseases, but difficult. For this purpose, this study introduces methods based on the concepts of both discernibility matrix and function in rough set theory (RST) with two discretization approaches, equal width and frequency discretization. Here these discretization approaches are used to define the reference values for clinical tests, and the discernibility matrix and function are used to extract a subset of significant clinical tests from the translated nominal attribute values. To show its applicability in the differential diagnosis problem, we have applied it to extract the significant clinical tests and its reference values between normal (N = 351) and abnormal group (N = 101) with either cholecystitis or cholelithiasis disease. In addition, we investigated not only the selected significant clinical tests and the variations of its reference values, but also the average predictive accuracies on four evaluation criteria, i.e., accuracy, sensitivity, specificity, and geometric mean, during l0-fold cross validation. From the experimental results, we confirmed that two discretization approaches based rough set approximation methods with relative frequency give better results than those with absolute frequency, in the evaluation criteria (i.e., average geometric mean). Thus it shows that the prediction model using relative frequency can be used effectively in classification and prediction problems of the clinical data with imbalanced class distribution.