DOI QR코드

DOI QR Code

Improvement of Accuracy of Decision Tree By Reprocessing

재처리를 통한 결정트리의 정확도 개선

  • Published : 2003.10.01

Abstract

Machine learning organizes knowledge for efficient and accurate reuse. This paper is concerned with methods of concept learning from examples, which glean knowledge from a training set of preclassified ‘objects’. Ideally, training facilitates classification of novel, previously unseen objects. However, every learning system relies on processing and representation assumptions that may be detrimental under certain circumstances. We explore the biases of a well-known learning system, ID3, review improvements, and introduce some improvements of our own, each designed to yield accurate and pedagogically sound classification.

기계학습은 효율적이고 정확한 재사용을 위해 지식을 재구성한다. 본 논문은 이미 알려진 학습 객체들로부터 지식을 추출하는 '예제에 의한 개념학습 방법에 관한 연구이다. 대부분 학습 시스템은 처리와 표현에 대한 제약으로 인해 학습 결과를 새로운 객체에 적용할 때 효율성과 정확도가 기대에 못 미치는 경우가 있다. 본 논문에서는 ID3의 바이어스에 대해 조사하고, 다양한 표현 양식을 통해 보다 정확하고 학습적으로 이해하기 쉬운 분류 방법을 제안한다.

Keywords

References

  1. Quinlan, J.R., 'Induction of Decision Trees,' Machine Learning, 1, pp.81-106, 1986 https://doi.org/10.1007/BF00116251
  2. Utgoff, P., 'Decision tree induction based on sfficient tree restructuring,' Tech. Report 95-18, Dept. of Comp. Sci. Univ. of Mass., Amherst, 1995
  3. J.R.Quinlan, C4.5 : Programs for Machine Learning. Morgan Kauffmann, Los Altos, CA, 1993
  4. Jensen, D., and Schmill, M., 'Adjusting for multiple comparisions in decision tree pruning,' Proc. Of 3rd Int. conf. On KDD, 1997
  5. Frank. E., 'Pruning Decision Tress and Lists,' Ph.D. Dissertation, The University of Waikato, 2000
  6. Elomaa,T. and Kaariainen, M., 'An analysis of reduced error pruning,' Journal of AI Research 15, 2001
  7. Oates, T. and Jensen, D., 'The effects of training set size on ducision tree complexity,' Proc. 14th Inter. Conf. On Machine Learning, 1997
  8. Michalski, R.S. & Stepp, R.E., 'Learning from observation : Conceptual clustering,' In R. S. Michaiski, J. G. Caebonell, & T. M. Mitchell (Eds.), Machine Learning : An artificial intelligence approach. Los Altos, CA : Morgan Kaufmann, 1983
  9. Lee, G. & Wu, X., 'Ordering Data Set for Incremental Clustering,' submitted to Int. Conf. On Machine Learning, 2003, Boston, USA
  10. Fisher, D., 'Iterative Optimization and Simplification of Hierarchical Clustering,' Journal of AI Research, 4, pp.147-179, 1996
  11. Blake, C.L. & Merz, C.J., UCI Repository of Machine Learning Databases, http://www.ics.uci.edu/~mlearn/MLR epository.html]. Irvine, CA : University of California, Department of Information and Computer Science, 1998