Search | Korea Science

Hybridized Decision Tree methods for Detecting Generic Attack on Ciphertext

Alsariera, Yazan Ahmad
- International Journal of Computer Science & Network Security
- /
- v.21 no.7
- /
- pp.56-62
- /
- 2021
The surge in generic attacks execution against cipher text on the computer network has led to the continuous advancement of the mechanisms to protect information integrity and confidentiality. The implementation of explicit decision tree machine learning algorithm is reported to accurately classifier generic attacks better than some multi-classification algorithms as the multi-classification method suffers from detection oversight. However, there is a need to improve the accuracy and reduce the false alarm rate. Therefore, this study aims to improve generic attack classification by implementing two hybridized decision tree algorithms namely Naïve Bayes Decision tree (NBTree) and Logistic Model tree (LMT). The proposed hybridized methods were developed using the 10-fold cross-validation technique to avoid overfitting. The generic attack detector produced a 99.8% accuracy, an FPR score of 0.002 and an MCC score of 0.995. The performances of the proposed methods were better than the existing decision tree method. Similarly, the proposed method outperformed multi-classification methods for detecting generic attacks. Hence, it is recommended to implement hybridized decision tree method for detecting generic attacks on a computer network.
https://doi.org/10.22937/IJCSNS.2021.21.7.6 인용 PDF KSCI

Classification Accuracy Improvement for Decision Tree (의사결정트리의 분류 정확도 향상)

Rezene, Mehari Marta;Park, Sanghyun
- Proceedings of the Korea Information Processing Society Conference
- /
- 2017.04a
- /
- pp.787-790
- /
- 2017
Data quality is the main issue in the classification problems; generally, the presence of noisy instances in the training dataset will not lead to robust classification performance. Such instances may cause the generated decision tree to suffer from over-fitting and its accuracy may decrease. Decision trees are useful, efficient, and commonly used for solving various real world classification problems in data mining. In this paper, we introduce a preprocessing technique to improve the classification accuracy rates of the C4.5 decision tree algorithm. In the proposed preprocessing method, we applied the naive Bayes classifier to remove the noisy instances from the training dataset. We applied our proposed method to a real e-commerce sales dataset to test the performance of the proposed algorithm against the existing C4.5 decision tree classifier. As the experimental results, the proposed method improved the classification accuracy by 8.5% and 14.32% using training dataset and 10-fold crossvalidation, respectively.
https://doi.org/10.3745/PKIPS.y2017m04a.787 인용 PDF

A Comparative Study of Medical Data Classification Methods Based on Decision Tree and System Reconstruction Analysis

Tang, Tzung-I;Zheng, Gang;Huang, Yalou;Shu, Guangfu;Wang, Pengtao
- Industrial Engineering and Management Systems
- /
- v.4 no.1
- /
- pp.102-108
- /
- 2005
This paper studies medical data classification methods, comparing decision tree and system reconstruction analysis as applied to heart disease medical data mining. The data we study is collected from patients with coronary heart disease. It has 1,723 records of 71 attributes each. We use the system-reconstruction method to weight it. We use decision tree algorithms, such as induction of decision trees (ID3), classification and regression tree (C4.5), classification and regression tree (CART), Chi-square automatic interaction detector (CHAID), and exhausted CHAID. We use the results to compare the correction rate, leaf number, and tree depth of different decision-tree algorithms. According to the experiments, we know that weighted data can improve the correction rate of coronary heart disease data but has little effect on the tree depth and leaf number.
PDF KSCI

Tree-structured Classification based on Variable Splitting

Ahn, Sung-Jin
- Communications for Statistical Applications and Methods
- /
- v.2 no.1
- /
- pp.74-88
- /
- 1995
This article introduces a unified method of choosing the most explanatory and significant multiway partitions for classification tree design and analysis. The method is derived on the impurity reduction (IR) measure of divergence, which is proposed to extend the proportional-reduction-in-error (PRE) measure in the decision-theory context. For the method derivation, the IR measure is analyzed to characterize its statistical properties which are used to consistently handle the subjects of feature formation, feature selection, and feature deletion required in the associated classification tree construction. A numerical example is considered to illustrate the proposed approach.
PDF

Fuzzy Classification Rule Learning by Decision Tree Induction

Lee, Keon-Myung;Kim, Hak-Joon
- International Journal of Fuzzy Logic and Intelligent Systems
- /
- v.3 no.1
- /
- pp.44-51
- /
- 2003
Knowledge acquisition is a bottleneck in knowledge-based system implementation. Decision tree induction is a useful machine learning approach for extracting classification knowledge from a set of training examples. Many real-world data contain fuzziness due to observation error, uncertainty, subjective judgement, and so on. To cope with this problem of real-world data, there have been some works on fuzzy classification rule learning. This paper makes a survey for the kinds of fuzzy classification rules. In addition, it presents a fuzzy classification rule learning method based on decision tree induction, and shows some experiment results for the method.
https://doi.org/10.5391/IJFIS.2003.3.1.044 인용 PDF KSCI

Note on classification and regression tree analysis (분류와 회귀나무분석에 관한 소고)

임용빈;오만숙
- Journal of Korean Society for Quality Management
- /
- v.30 no.1
- /
- pp.152-161
- /
- 2002
The analysis of large data sets with hundreds of thousands observations and thousands of independent variables is a formidable computational task. A less parametric method, capable of identifying important independent variables and their interactions, is a tree structured approach to regression and classification. It gives a graphical and often illuminating way of looking at data in classification and regression problems. In this paper, we have reviewed and summarized tile methodology used to construct a tree, multiple trees and the sequential strategy for identifying active compounds in large chemical databases.
PDF KSCI

A Study of Pathogenesis Classification using Decision Tree Method (의사결정나무법을 이이용한 병인(病因)분류에 관한 연구)

Lee, Hyuk-Jae;Kim, Min-Yong;Oh, Hwan-Sup;Park, Young-Bae
- The Journal of the Society of Korean Medicine Diagnostics
- /
- v.12 no.2
- /
- pp.27-40
- /
- 2008
Background : In spite of the predominant of the theory of Pathogenesis, the method of Pathogenesis classification is depending on the doctor's clinical trials because od the lack of the objective test criteria. Methods and Results : This study is trying to improve the objectiveness of classification using a new statistical method, decision tree. Decision tree method -a classification technique in the statistical analysis- was used to analyze the result of pathogenesis questionnaire instead of using discriminant analysis. As a result, 10 among 38 pathogenesis questionnaire was selected as important questions and 12 terminal nodes was built to classify the pathogenesis. Conclusions : Using only 10 questions shown in the result of decision tree, we can classify and interpret the pathogenesis easily and effectively.
PDF

A study of constitution diagnosis using decision tree method (의사결정나무법을 이용한 체질진단에 관한 연구)

Lee, Yong-Seop;Park, Seong-Sik;Park, Eun-Kyung
- Journal of Sasang Constitutional Medicine
- /
- v.13 no.2
- /
- pp.144-155
- /
- 2001
By the increasing concern about Sasang Constitution Medicine, its practical use is considered very important in disease prevention and medical treatment. However, the method of constitution classification is depending on the doctor's clinical trials because of the lack of the objective test criteria. This study is trying to improve the objectiveness of diagnosis using a new statistical method, decision tree. Decision tree method-a classification technique in the statistical analysis- was used to analyze the result of QSCCII instead of using discriminant analysis. As a result, 16 among 121 QSCCII questions was selected as important questions and 21 terminal nodes was built to classify the constitution. Using only 16 questions shown in the result of decision tree, we can diagnose and interpret the constitution easily and effectively.
PDF

Rule Selection Method in Decision Tree Models (의사결정나무 모델에서의 중요 룰 선택기법)

Son, Jieun;Kim, Seoung Bum
- Journal of Korean Institute of Industrial Engineers
- /
- v.40 no.4
- /
- pp.375-381
- /
- 2014
Data mining is a process of discovering useful patterns or information from large amount of data. Decision tree is one of the data mining algorithms that can be used for both classification and prediction and has been widely used for various applications because of its flexibility and interpretability. Decision trees for classification generally generate a number of rules that belong to one of the predefined category and some rules may belong to the same category. In this case, it is necessary to determine the significance of each rule so as to provide the priority of the rule with users. The purpose of this paper is to propose a rule selection method in classification tree models that accommodate the umber of observation, accuracy, and effectiveness in each rule. Our experiments demonstrate that the proposed method produce better performance compared to other existing rule selection methods.
https://doi.org/10.7232/JKIIE.2014.40.4.375 인용 PDF KSCI

Object Classification Method Using Dynamic Random Forests and Genetic Optimization

Kim, Jae Hyup;Kim, Hun Ki;Jang, Kyung Hyun;Lee, Jong Min;Moon, Young Shik
- Journal of the Korea Society of Computer and Information
- /
- v.21 no.5
- /
- pp.79-89
- /
- 2016
In this paper, we proposed the object classification method using genetic and dynamic random forest consisting of optimal combination of unit tree. The random forest can ensure good generalization performance in combination of large amount of trees by assigning the randomization to the training samples and feature selection, etc. allocated to the decision tree as an ensemble classification model which combines with the unit decision tree based on the bagging. However, the random forest is composed of unit trees randomly, so it can show the excellent classification performance only when the sufficient amounts of trees are combined. There is no quantitative measurement method for the number of trees, and there is no choice but to repeat random tree structure continuously. The proposed algorithm is composed of random forest with a combination of optimal tree while maintaining the generalization performance of random forest. To achieve this, the problem of improving the classification performance was assigned to the optimization problem which found the optimal tree combination. For this end, the genetic algorithm methodology was applied. As a result of experiment, we had found out that the proposed algorithm could improve about 3~5% of classification performance in specific cases like common database and self infrared database compare with the existing random forest. In addition, we had shown that the optimal tree combination was decided at 55~60% level from the maximum trees.
https://doi.org/10.9708/jksci.2016.21.5.079 인용 PDF KSCI

Search Result 353, Processing Time 0.027 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)