• Title/Summary/Keyword: tree based learning

Search Result 429, Processing Time 0.034 seconds

A Study on Methodology for Air Target Dynamic Targeting Applying Machine Learning (기계학습을 활용한 항공표적 긴급표적처리 발전방안 연구)

  • Kang, Junghyun;Yim, Dongsoon;Choi, Bongwan
    • Journal of the Korea Institute of Military Science and Technology
    • /
    • v.22 no.4
    • /
    • pp.555-566
    • /
    • 2019
  • In order to prepare for the future warfare environment, which requires a faster operational tempo, it is necessary to utilize the fourth industrial revolution technology in the field of military operations. This study propose a methodology, 'machine learning based dynamic targeting', which can contribute to reduce required man-hour for dynamic targeting. Specifically, a decision tree algorithm is considered to apply to dynamic targeting process. The algorithm learns target prioritization patterns from JIPTL(Joint Integrated Prioritized Target List) which is the result of the deliberate targeting, and then learned algorithm rapidly(almost real-time) determines priorities for new targets that occur during ATO(Air Tasking Order) execution. An experiment is performed with artificially generated data to demonstrate the applicability of the methodology.

Evaluations of AI-based malicious PowerShell detection with feature optimizations

  • Song, Jihyeon;Kim, Jungtae;Choi, Sunoh;Kim, Jonghyun;Kim, Ikkyun
    • ETRI Journal
    • /
    • v.43 no.3
    • /
    • pp.549-560
    • /
    • 2021
  • Cyberattacks are often difficult to identify with traditional signature-based detection, because attackers continually find ways to bypass the detection methods. Therefore, researchers have introduced artificial intelligence (AI) technology for cybersecurity analysis to detect malicious PowerShell scripts. In this paper, we propose a feature optimization technique for AI-based approaches to enhance the accuracy of malicious PowerShell script detection. We statically analyze the PowerShell script and preprocess it with a method based on the tokens and abstract syntax tree (AST) for feature selection. Here, tokens and AST represent the vocabulary and structure of the PowerShell script, respectively. Performance evaluations with optimized features yield detection rates of 98% in both machine learning (ML) and deep learning (DL) experiments. Among them, the ML model with the 3-gram of selected five tokens and the DL model with experiments based on the AST 3-gram deliver the best performance.

Machine Learning Based Automatic Categorization Model for Text Lines in Invoice Documents

  • Shin, Hyun-Kyung
    • Journal of Korea Multimedia Society
    • /
    • v.13 no.12
    • /
    • pp.1786-1797
    • /
    • 2010
  • Automatic understanding of contents in document image is a very hard problem due to involvement with mathematically challenging problems originated mainly from the over-determined system induced by document segmentation process. In both academic and industrial areas, there have been incessant and various efforts to improve core parts of content retrieval technologies by the means of separating out segmentation related issues using semi-structured document, e.g., invoice,. In this paper we proposed classification models for text lines on invoice document in which text lines were clustered into the five categories in accordance with their contents: purchase order header, invoice header, summary header, surcharge header, purchase items. Our investigation was concentrated on the performance of machine learning based models in aspect of linear-discriminant-analysis (LDA) and non-LDA (logic based). In the group of LDA, na$\"{\i}$ve baysian, k-nearest neighbor, and SVM were used, in the group of non LDA, decision tree, random forest, and boost were used. We described the details of feature vector construction and the selection processes of the model and the parameter including training and validation. We also presented the experimental results of comparison on training/classification error levels for the models employed.

An Application of Decision Tree Method for Fault Diagnosis of Induction Motors

  • Tran, Van Tung;Yang, Bo-Suk;Oh, Myung-Suck
    • Proceedings of the Korea Committee for Ocean Resources and Engineering Conference
    • /
    • 2006.11a
    • /
    • pp.54-59
    • /
    • 2006
  • Decision tree is one of the most effective and widely used methods for building classification model. Researchers from various disciplines such as statistics, machine learning, pattern recognition, and data mining have considered the decision tree method as an effective solution to their field problems. In this paper, an application of decision tree method to classify the faults of induction motors is proposed. The original data from experiment is dealt with feature calculation to get the useful information as attributes. These data are then assigned the classes which are based on our experience before becoming data inputs for decision tree. The total 9 classes are defined. An implementation of decision tree written in Matlab is used for these data.

  • PDF

Enhancement of Text Classification Method (텍스트 분류 기법의 발전)

  • Shin, Kwang-Seong;Shin, Seong-Yoon
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2019.05a
    • /
    • pp.155-156
    • /
    • 2019
  • Traditional machine learning based emotion analysis methods such as Classification and Regression Tree (CART), Support Vector Machine (SVM), and k-nearest neighbor classification (kNN) are less accurate. In this paper, we propose an improved kNN classification method. Improved methods and data normalization achieve the goal of improving accuracy. Then, three classification algorithms and an improved algorithm were compared based on experimental data.

  • PDF

Machine Learning Based Hybrid Approach to Detect Intrusion in Cyber Communication

  • Neha Pathak;Bobby Sharma
    • International Journal of Computer Science & Network Security
    • /
    • v.23 no.11
    • /
    • pp.190-194
    • /
    • 2023
  • By looking the importance of communication, data delivery and access in various sectors including governmental, business and individual for any kind of data, it becomes mandatory to identify faults and flaws during cyber communication. To protect personal, governmental and business data from being misused from numerous advanced attacks, there is the need of cyber security. The information security provides massive protection to both the host machine as well as network. The learning methods are used for analyzing as well as preventing various attacks. Machine learning is one of the branch of Artificial Intelligence that plays a potential learning techniques to detect the cyber-attacks. In the proposed methodology, the Decision Tree (DT) which is also a kind of supervised learning model, is combined with the different cross-validation method to determine the accuracy and the execution time to identify the cyber-attacks from a very recent dataset of different network attack activities of network traffic in the UNSW-NB15 dataset. It is a hybrid method in which different types of attributes including Gini Index and Entropy of DT model has been implemented separately to identify the most accurate procedure to detect intrusion with respect to the execution time. The different DT methodologies including DT using Gini Index, DT using train-split method and DT using information entropy along with their respective subdivision such as using K-Fold validation, using Stratified K-Fold validation are implemented.

Development and Effect Analysis of a Learning Support System for Underachievers Using Psychological Learning Style Tests (학습 스타일 심리검사를 이용한 부진아 학습 지원 시스템의 개발 및 효과 분석)

  • Lee, Jong-Suk;Jang, Eun-Sill;Lee, Yong-Kyu
    • Journal of The Korean Association of Information Education
    • /
    • v.11 no.3
    • /
    • pp.299-306
    • /
    • 2007
  • It is urgent to have learning support for children with learning disability according to the survey made by the government educational organization. To this end, we developed a learning support system for children with learning disability. First, the system diagnoses the children with learning disability using a decision tree based on the pre-test results. Secondly, it supports for children with learning disability one of audio-, vision- and tactility-oriented learning types according to the results from the psychological learning style test. Thirdly, one-to-one study is supported for failed students at the achievement test. For the evaluation of the system, the children with disability were divided into an experimental group and a control group and the educational achievement was evaluated. We found that 10% on the average was improved in case that learning was made after the psychological test for learning styles.

  • PDF

A Machine Learning-based Customer Classification Model for Effective Online Free Sample Promotions (온라인 무료 샘플 판촉의 효과적 활용을 위한 기계학습 기반 고객분류예측 모형)

  • Won, Ha-Ram;Kim, Moo-Jeon;Ahn, Hyunchul
    • The Journal of Information Systems
    • /
    • v.27 no.3
    • /
    • pp.63-80
    • /
    • 2018
  • Purpose The purpose of this study is to build a machine learning-based customer classification model to promote customer expansion effect of the free sample promotion. Specifically, the proposed model classifies potential target customers who are expected to purchase the products included in the free sample promotion after receiving the free samples. Design/methodology/approach This study proposes to build a customer classification model for determining customers suitable for providing free samples by using various machine learning techniques such as logistic regression, multiple discriminant analysis, case-based reasoning, decision tree, artificial neural network, and support vector machine. To validate the usefulness of the proposed model, we apply it to a real-world free sample-based target marketing case of a Korean major cosmetic retail company. Findings Experimental results show that a machine learning-based customer classification model presents satisfactory accuracy ranging from 70% to 75%. In particular, support vector machine is found to be the most effective machine learning technique for free sample-based target marketing model. Our study sheds a light on customer relationship management strategies using free sample promotions.

Output Power Prediction of Combined Cycle Power Plant using Logic-based Tree Structured Fuzzy Neural Networks (로직에 기반 한 트리 구조의 퍼지 뉴럴 네트워크를 이용한 복합 화력 발전소의 출력 예측)

  • Han, Chang-Wook;Lee, Don-Kyu
    • Journal of IKEEE
    • /
    • v.23 no.2
    • /
    • pp.529-533
    • /
    • 2019
  • Combined cycle power plants are often used to produce power. These days prediction of power plant output based on operating parameters is a major concern. This paper presents an approach to using computational intelligence technique to predict the output power of combined cycle power plant. Computational intelligence techniques have been developed and applied to many real world problems. In this paper, tree architectures of fuzzy neural networks are considered to predict the output power. Tree architectures of fuzzy neural networks have an advantage of reducing the number of rules by selecting fuzzy neurons as nodes and relevant inputs as leaves optimally. For the optimization of the networks, two-step optimization method is used. Genetic algorithms optimize the binary structure of the networks by selecting the nodes and leaves as binary, and followed by random signal-based learning further refines the optimized binary connections in the unit interval. To verify the effectiveness of the proposed method, combined cycle power plant dataset obtained from the UCI Machine Learning Repository Database is considered.

A System for Determining the Growth Stage of Fruit Tree Using a Deep Learning-Based Object Detection Model (딥러닝 기반의 객체 탐지 모델을 활용한 과수 생육 단계 판별 시스템)

  • Bang, Ji-Hyeon;Park, Jun;Park, Sung-Wook;Kim, Jun-Yung;Jung, Se-Hoon;Sim, Chun-Bo
    • Smart Media Journal
    • /
    • v.11 no.4
    • /
    • pp.9-18
    • /
    • 2022
  • Recently, research and system using AI is rapidly increasing in various fields. Smart farm using artificial intelligence and information communication technology is also being studied in agriculture. In addition, data-based precision agriculture is being commercialized by convergence various advanced technology such as autonomous driving, satellites, and big data. In Korea, the number of commercialization cases of facility agriculture among smart agriculture is increasing. However, research and investment are being biased in the field of facility agriculture. The gap between research and investment in facility agriculture and open-air agriculture continues to increase. The fields of fruit trees and plant factories have low research and investment. There is a problem that the big data collection and utilization system is insufficient. In this paper, we are proposed the system for determining the fruit tree growth stage using a deep learning-based object detection model. The system was proposed as a hybrid app for use in agricultural sites. In addition, we are implemented an object detection function for the fruit tree growth stage determine.