• Title/Summary/Keyword: 랜덤 포레스트 알고리즘

Search Result 86, Processing Time 0.031 seconds

Classification Model of Types of Crime based on Random-Forest Algorithms and Monitoring Interface Design Factors for Real-time Crime Prediction (실시간 범죄 예측을 위한 랜덤포레스트 알고리즘 기반의 범죄 유형 분류모델 및 모니터링 인터페이스 디자인 요소 제안)

  • Park, Joonyoung;Chae, Myungsu;Jung, Sungkwan
    • KIISE Transactions on Computing Practices
    • /
    • v.22 no.9
    • /
    • pp.455-460
    • /
    • 2016
  • Recently, with more severe types felonies such as robbery and sexual violence, the importance of crime prediction and prevention is emphasized. For accurate and prompt crime prediction and prevention, both a classification model of crime with high accuracy based on past criminal records and well-designed system interface are required. However previous studies on the analysis of crime factors have limitations in terms of accuracy due to the difficulty of data preprocessing. In addition, existing crime monitoring systems merely offer a vast amount of crime analysis results, thereby they fail to provide users with functions for more effective monitoring. In this paper, we propose a classification model for types of crime based on random-forest algorithms and system design factors for real-time crime prediction. From our experiments, we proved that our proposed classification model is superior to others that only use criminal records in terms of accuracy. Through the analysis of existing crime monitoring systems, we also designed and developed a system for real-time crime monitoring.

A Study on Random Forest-based Estimation Model for Changing the Automatic Walking Mode of Above Knee Prosthesis (대퇴의족의 자동 보행 모드 변경을 위한 랜덤 포레스트 기반 추정 모델 개발에 관한 연구)

  • Na, Sun-Jong;Shin, Jin-Woo;Eom, Su-Hong;Lee, Eung-Hyuk
    • Journal of IKEEE
    • /
    • v.24 no.1
    • /
    • pp.9-18
    • /
    • 2020
  • The pattern recognition or fuzzy inference, which is mainly used for the development of the automatic walking mode change of the above knee prosthesis, has a disadvantage in that it is difficult to estimate with the immediate change of the walking environment. In order to solve a disadvantage, this paper developed an algorithm that automatically converts the walking mode of the next step by estimating the walking environment at a specific gait phase. Since the proposed algorithm should be implanted and operated in the microcontroller, it is developed using the random forest base in consideration of calculation amount and estimated time. The developed random forest based gait and environmental estimation model were implanted in the microcontroller and evaluated for validity.

Design and Implementation of Indoor Location Recognition System based on Fingerprint and Random Forest (핑거프린트와 랜덤포레스트 기반 실내 위치 인식 시스템 설계와 구현)

  • Lee, Sunmin;Moon, Nammee
    • Journal of Broadcast Engineering
    • /
    • v.23 no.1
    • /
    • pp.154-161
    • /
    • 2018
  • As the number of smartphone users increases, research on indoor location recognition service is necessary. Access to indoor locations is predominantly WiFi, Bluetooth, etc., but in most quarters, WiFi is equipped with WiFi functionality, which uses WiFi features to provide WiFi functionality. The study uses the random forest algorithm, which employs the fingerprint index of the acquired WiFi and the use of the multI-value classification method, which employs the receiver signal strength of the acquired WiFi. As the data of the fingerprint, a total of 4 radio maps using the Mac address together with the received signal strength were used. The experiment was conducted in a limited indoor space and compared to an indoor location recognition system using an existing random forest, similar to the method proposed in this study for experimental analysis. Experiments have shown that the system's positioning accuracy as suggested by this study is approximately 5.8 % higher than that of a conventional indoor location recognition system using a random forest, and that its location recognition speed is consistent and faster than that of a study.

Feature selection and prediction modeling of drug responsiveness in Pharmacogenomics (약물유전체학에서 약물반응 예측모형과 변수선택 방법)

  • Kim, Kyuhwan;Kim, Wonkuk
    • The Korean Journal of Applied Statistics
    • /
    • v.34 no.2
    • /
    • pp.153-166
    • /
    • 2021
  • A main goal of pharmacogenomics studies is to predict individual's drug responsiveness based on high dimensional genetic variables. Due to a large number of variables, feature selection is required in order to reduce the number of variables. The selected features are used to construct a predictive model using machine learning algorithms. In the present study, we applied several hybrid feature selection methods such as combinations of logistic regression, ReliefF, TurF, random forest, and LASSO to a next generation sequencing data set of 400 epilepsy patients. We then applied the selected features to machine learning methods including random forest, gradient boosting, and support vector machine as well as a stacking ensemble method. Our results showed that the stacking model with a hybrid feature selection of random forest and ReliefF performs better than with other combinations of approaches. Based on a 5-fold cross validation partition, the mean test accuracy value of the best model was 0.727 and the mean test AUC value of the best model was 0.761. It also appeared that the stacking models outperform than single machine learning predictive models when using the same selected features.

Real time speed-limit sign recognition invariant to image scale (영상 크기변화에 강인한 실시간 속도표지판 인식)

  • Hwang, MinCheol;Ko, ByoungChul;Nam, Jae-Yeal
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2015.10a
    • /
    • pp.1358-1360
    • /
    • 2015
  • 본 논문에서는 MB-LBP(Multi-scale Block Local Binary Patterns)와 공간피라미드를 이용하여 생성된 특징을 랜덤 포레스트(Random Forest) 분류기에 적용하여 영상내의 표지판 속도를 인식하는 알고리즘을 제안한다. 입력 영상에서 표지판 영역은 다양한 위치와 크기를 가지며 주위 배경이 후보 영역에 포함되므로 먼저 입력 영상에 원형 Hough Transform을 적용하여 원형의 표지판 후보 영역만을 검출한다. 그 후 영상의 화질을 향상시키기 위해 히스토그램 평활화와 모폴로지 연산을 적용하여 표지판의 숫자 영역과 배경 영역의 대비를 높이도록 한다. 표지판의 크기 변화에 강건한 시스템의 구현을 위해 후보 영역에서 LBP(Local Binary Patterns)보다 우수한 성능을 보이는 MB-LBP를 적용하고, 다양한 크기의 속도 표지판을 인식하기 위해 공간 피라미드를 사용하여 지역적 특징과 전역적 특징 모두를 추출하였다. 추출된 특징은 랜덤 포레스트(Random Forest)를 이용하여 각 9개의 속도 표지판으로 분류, 각 속도별 클래스에 대한 인식 성능을 측정하였다.

Random Forest Based Intrusion Detection Method using Activity Data in Smart Home Environment (스마트홈 환경에서 활동 데이터를 활용한 랜덤포레스트 기반 침입탐지 기법)

  • Lee, Pil-Won;Shin, Yong-Tae
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2020.11a
    • /
    • pp.193-195
    • /
    • 2020
  • 최근 IoT 기술의 발전을 통해 스마트홈 서비스가 사용자에게 활발하게 보급이 되고 있다. 스마트홈 서비스에서 발생하는 데이터는 개인정보를 내포하고 있으므로 보안이 매우 중요한 요소이다. 그러나 매해 스마트홈 해킹 신고가 증가하고 있으며 기존 네트워크 침입탐지 시스템은 관리자 계정을 탈취 당했을 경우 대응할 방법이 미비하다. 본 논문에서는 스마트홈 환경에서 발생하는 활동 데이터를 인공지능 알고리즘의 종류 중 하나인 랜덤포레스트를 통해 학습하고 분류모델을 구현했다. 구현한 모델은 87%이상의 높은 정확도로 측정되었다. 따라서 활동 데이터를 통해 분류를 시행하므로 네트워크에 이미 침입한 사용자를 탐지하여 대응할 수 있다.

Machine learning model for residual chlorine prediction in sediment basin to control pre-chlorination in water treatment plant (정수장 전염소 공정제어를 위한 침전지 잔류염소농도 예측 머신러닝 모형)

  • Kim, Juhwan;Lee, Kyunghyuk;Kim, Soojun;Kim, Kyunghun
    • Journal of Korea Water Resources Association
    • /
    • v.55 no.spc1
    • /
    • pp.1283-1293
    • /
    • 2022
  • The purpose of this study is to predict residual chlorine in order to maintain stable residual chlorine concentration in sedimentation basin by using artificial intelligence algorithms in water treatment process employing pre-chlorination. Available water quantity and quality data are collected and analyzed statistically to apply into mathematical multiple regression and artificial intelligence models including multi-layer perceptron neural network, random forest, long short term memory (LSTM) algorithms. Water temperature, turbidity, pH, conductivity, flow rate, alkalinity and pre-chlorination dosage data are used as the input parameters to develop prediction models. As results, it is presented that the random forest algorithm shows the most moderate prediction result among four cases, which are long short term memory, multi-layer perceptron, multiple regression including random forest. Especially, it is result that the multiple regression model can not represent the residual chlorine with the input parameters which varies independently with seasonal change, numerical scale and dimension difference between quantity and quality. For this reason, random forest model is more appropriate for predict water qualities than other algorithms, which is classified into decision tree type algorithm. Also, it is expected that real time prediction by artificial intelligence models can play role of the stable operation of residual chlorine in water treatment plant including pre-chlorination process.

Automatic scoring of mathematics descriptive assessment using random forest algorithm (랜덤 포레스트 알고리즘을 활용한 수학 서술형 자동 채점)

  • Inyong Choi;Hwa Kyung Kim;In Woo Chung;Min Ho Song
    • The Mathematical Education
    • /
    • v.63 no.2
    • /
    • pp.165-186
    • /
    • 2024
  • Despite the growing attention on artificial intelligence-based automated scoring technology as a support method for the introduction of descriptive items in school environments and large-scale assessments, there is a noticeable lack of foundational research in mathematics compared to other subjects. This study developed an automated scoring model for two descriptive items in first-year middle school mathematics using the Random Forest algorithm, evaluated its performance, and explored ways to enhance this performance. The accuracy of the final models for the two items was found to be between 0.95 to 1.00 and 0.73 to 0.89, respectively, which is relatively high compared to automated scoring models in other subjects. We discovered that the strategic selection of the number of evaluation categories, taking into account the amount of data, is crucial for the effective development and performance of automated scoring models. Additionally, text preprocessing by mathematics education experts proved effective in improving both the performance and interpretability of the automated scoring model. Selecting a vectorization method that matches the characteristics of the items and data was identified as one way to enhance model performance. Furthermore, we confirmed that oversampling is a useful method to supplement performance in situations where practical limitations hinder balanced data collection. To enhance educational utility, further research is needed on how to utilize feature importance derived from the Random Forest-based automated scoring model to generate useful information for teaching and learning, such as feedback. This study is significant as foundational research in the field of mathematics descriptive automatic scoring, and there is a need for various subsequent studies through close collaboration between AI experts and math education experts.

Exploring the Feature Selection Method for Effective Opinion Mining: Emphasis on Particle Swarm Optimization Algorithms

  • Eo, Kyun Sun;Lee, Kun Chang
    • Journal of the Korea Society of Computer and Information
    • /
    • v.25 no.11
    • /
    • pp.41-50
    • /
    • 2020
  • Sentimental analysis begins with the search for words that determine the sentimentality inherent in data. Managers can understand market sentimentality by analyzing a number of relevant sentiment words which consumers usually tend to use. In this study, we propose exploring performance of feature selection methods embedded with Particle Swarm Optimization Multi Objectives Evolutionary Algorithms. The performance of the feature selection methods was benchmarked with machine learning classifiers such as Decision Tree, Naive Bayesian Network, Support Vector Machine, Random Forest, Bagging, Random Subspace, and Rotation Forest. Our empirical results of opinion mining revealed that the number of features was significantly reduced and the performance was not hurt. In specific, the Support Vector Machine showed the highest accuracy. Random subspace produced the best AUC results.

3축 가속도 센서 기반 인간 행동 인식을 위한 기계학습 분석

  • Lee, Song-Mi;Jo, Hui-Ryeon;Yun, Sang-Min
    • Information and Communications Magazine
    • /
    • v.33 no.10
    • /
    • pp.65-70
    • /
    • 2016
  • 최근 스마트폰의 이용 사례가 증가함에 따라, 스마트폰에 내장되어 있는 다양한 센서를 이용하여 인간의 행동을 인식하기 위한 연구가 많은 각광을 받고 있다. 본고에서는 인간의 기본적인 행동 중에 앉기, 걷기, 달리기 등의 행동 특성을 스마트폰에 내장되어 있는 3축 가속도 센서를 통하여 분석하고 인간의 기본적 행동을 자동으로 인식하기 위한 방법에 대하여 비교 분석하는 것을 목적으로 한다. 구체적으로는 스마트폰에 내장되어 있는 3차원 가속도 센서로부터 추출된 데이터를 시간축에서 샘플링하여 인간의 행동을 인식하기 위한 기댓값 최대화 알고리즘, 랜덤 포레스트, 딥러닝 기반의 기계학습 방법을 비교하여 각 기계학습 알고리즘의 장단점을 분석한다.