The Generation of Control Rules for Data Mining

Park, In-Kyoo

doi:10.14400/JDPM.2013.11.11.343

Journal of Digital Convergence

Vol. 11, No. 11 / pp. 343-349 / 2013 / 2713-6434 (pISSN) / 2713-6442 (eISSN)

The Society of Digital Policy and Management



Park, In-Kyoo (Dept. of Computer Science, Joongbu University)

Received: 2013.11.01
Reviewed: 2013.11.20
Published: 2013.11.28

https://doi.org/10.14400/JDPM.2013.11.11.343


Abstract

Rough set theory uses the concepts of equivalence classes and approximation spaces to extract features efficiently from redundant information in data mining and to derive optimized control rules. The most important consideration in this extraction process is the reduction of attributes. This paper proposes an information-theoretic measure, based on rough entropy over the relationships among attributes, for determining the most important attribute. The proposed method uses rough entropy to eliminate redundant attributes, generating useful reducts and forming their core. As a result, simplified control rules can be built from a subset of features that, through knowledge reduction, preserves the information content of the original data.
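The abstract does not give the paper's exact rough entropy measure, so the following is only a minimal sketch of the general idea under stated assumptions: Shannon conditional entropy over the indiscernibility partition stands in for rough entropy, the reduct is found by a simple greedy elimination, and the decision table (temp, pressure, flow, action) is a made-up toy example. All function and attribute names are hypothetical and are not taken from the paper.

```python
from collections import Counter, defaultdict
from math import log2

def partition(rows, attrs):
    """Equivalence classes: row indices that agree on every attribute in attrs."""
    blocks = defaultdict(list)
    for i, row in enumerate(rows):
        blocks[tuple(row[a] for a in attrs)].append(i)
    return list(blocks.values())

def conditional_entropy(rows, attrs, decision):
    """H(decision | attrs) over the partition induced by attrs.
    Assumption: used here as a stand-in for the paper's rough entropy."""
    n = len(rows)
    h = 0.0
    for block in partition(rows, attrs):
        counts = Counter(rows[i][decision] for i in block)
        for c in counts.values():
            p = c / len(block)
            h -= (len(block) / n) * p * log2(p)
    return h

def core(rows, attrs, decision, eps=1e-12):
    """Attributes whose individual removal raises the entropy of the full set."""
    base = conditional_entropy(rows, attrs, decision)
    return [a for a in attrs
            if conditional_entropy(rows, [b for b in attrs if b != a], decision) > base + eps]

def reduct(rows, attrs, decision, eps=1e-12):
    """Greedy elimination: drop an attribute if the entropy does not increase.
    This yields one reduct, not necessarily every reduct."""
    base = conditional_entropy(rows, attrs, decision)
    kept = list(attrs)
    for a in attrs:
        trial = [b for b in kept if b != a]
        if conditional_entropy(rows, trial, decision) <= base + eps:
            kept = trial
    return kept

def rules(rows, attrs, decision):
    """One rule per equivalence class: condition values -> observed decisions."""
    out = {}
    for block in partition(rows, attrs):
        key = tuple(rows[block[0]][a] for a in attrs)
        out[key] = sorted({rows[i][decision] for i in block})
    return out

# Hypothetical toy decision table; "flow" is redundant by construction.
ATTRS = ["temp", "pressure", "flow"]
table = [
    {"temp": "high", "pressure": "high", "flow": "low",  "action": "open"},
    {"temp": "high", "pressure": "low",  "flow": "low",  "action": "hold"},
    {"temp": "low",  "pressure": "high", "flow": "high", "action": "hold"},
    {"temp": "low",  "pressure": "low",  "flow": "high", "action": "close"},
    {"temp": "high", "pressure": "high", "flow": "high", "action": "open"},
    {"temp": "low",  "pressure": "low",  "flow": "low",  "action": "close"},
]

selected = reduct(table, ATTRS, "action")
for condition, decisions in rules(table, selected, "action").items():
    print(dict(zip(selected, condition)), "->", decisions)
```

In this toy table, removing flow leaves the conditional entropy of the decision unchanged, while removing temp or pressure makes some equivalence classes inconsistent. The reduced rule set therefore uses only temp and pressure and still classifies every row as the full table does.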


