• Title/Summary/Keyword: Regression Algorithms

Search Result 401, Processing Time 0.04 seconds

Adaptive Obstacle Avoidance Algorithm using Classification of 2D LiDAR Data (2차원 라이다 센서 데이터 분류를 이용한 적응형 장애물 회피 알고리즘)

  • Lee, Nara;Kwon, Soonhwan;Ryu, Hyejeong
    • Journal of Sensor Science and Technology
    • /
    • v.29 no.5
    • /
    • pp.348-353
    • /
    • 2020
  • This paper presents an adaptive method to avoid obstacles in various environmental settings, using a two-dimensional (2D) LiDAR sensor for mobile robots. While the conventional reaction based smooth nearness diagram (SND) algorithms use a fixed safety distance criterion, the proposed algorithm autonomously changes the safety criterion considering the obstacle density around a robot. The fixed safety criterion for the whole SND obstacle avoidance process can induce inefficient motion controls in terms of the travel distance and action smoothness. We applied a multinomial logistic regression algorithm, softmax regression, to classify 2D LiDAR point clouds into seven obstacle structure classes. The trained model was used to recognize a current obstacle density situation using newly obtained 2D LiDAR data. Through the classification, the robot adaptively modifies the safety distance criterion according to the change in its environment. We experimentally verified that the motion controls generated by the proposed adaptive algorithm were smoother and more efficient compared to those of the conventional SND algorithms.

Screening Vital Few Variables and Development of Logistic Regression Model on a Large Data Set (대용량 자료에서 핵심적인 소수의 변수들의 선별과 로지스틱 회귀 모형의 전개)

  • Lim, Yong-B.;Cho, J.;Um, Kyung-A;Lee, Sun-Ah
    • Journal of Korean Society for Quality Management
    • /
    • v.34 no.2
    • /
    • pp.129-135
    • /
    • 2006
  • In the advance of computer technology, it is possible to keep all the related informations for monitoring equipments in control and huge amount of real time manufacturing data in a data base. Thus, the statistical analysis of large data sets with hundreds of thousands observations and hundred of independent variables whose some of values are missing at many observations is needed even though it is a formidable computational task. A tree structured approach to classification is capable of screening important independent variables and their interactions. In a Six Sigma project handling large amount of manufacturing data, one of the goals is to screen vital few variables among trivial many variables. In this paper we have reviewed and summarized CART, C4.5 and CHAID algorithms and proposed a simple method of screening vital few variables by selecting common variables screened by all the three algorithms. Also how to develop a logistics regression model on a large data set is discussed and illustrated through a large finance data set collected by a credit bureau for th purpose of predicting the bankruptcy of the company.

Reversible Data Hiding Using a Piecewise Autoregressive Predictor Based on Two-stage Embedding

  • Lee, Byeong Yong;Hwang, Hee Joon;Kim, Hyoung Joong
    • Journal of Electrical Engineering and Technology
    • /
    • v.11 no.4
    • /
    • pp.974-986
    • /
    • 2016
  • Reversible image watermarking, a type of digital data hiding, is capable of recovering the original image and extracting the hidden message with precision. A number of reversible algorithms have been proposed to achieve a high embedding capacity and a low distortion. While numerous algorithms for the achievement of a favorable performance regarding a small embedding capacity exist, the main goal of this paper is the achievement of a more favorable performance regarding a larger embedding capacity and a lower distortion. This paper therefore proposes a reversible data hiding algorithm for which a novel piecewise 2D auto-regression (P2AR) predictor that is based on a rhombus-embedding scheme is used. In addition, a minimum description length (MDL) approach is applied to remove the outlier pixels from a training set so that the effect of a multiple linear regression can be maximized. The experiment results demonstrate that the performance of the proposed method is superior to those of previous methods.

Wave Information Estimation and Revision Using Linear Regression Model (선형회귀모델을 이용한 파랑 정보 예측 및 보정)

  • Lim, Dong-hee;Kim, Jin-soo;Lee, Byung-Gil
    • Journal of Korea Multimedia Society
    • /
    • v.19 no.8
    • /
    • pp.1377-1385
    • /
    • 2016
  • Conventional X-band marine radar has been used as one of the effective tools for collecting and retrieving ocean surface information parameters for three decades. Several wave information extracting algorithms have been designed in such a way that they can be utilized for efficiently estimating sea surface wave parameters such as current velocities, wave direction, significant wave heights in VTS (Vessel Traffic Service). However, their performances are still restricted. For the purpose of overcoming the performance limits, in this paper, first the conventional algorithms are analyzed and their performances are compared, and then a new control algorithm is proposed. Furthermore, we try to improve the estimation performances of typical wave parameters including wave directions and significant wave heights by introducing linear regression model in the process of computing wave information extraction. Through several simulations with the X-band radar images, it is shown that the proposed method is very effective in estimating the wave information compared to the real measured buoy data.

Developing a Pedestrian Satisfaction Prediction Model Based on Machine Learning Algorithms (기계학습 알고리즘을 이용한 보행만족도 예측모형 개발)

  • Lee, Jae Seung;Lee, Hyunhee
    • Journal of Korea Planning Association
    • /
    • v.54 no.3
    • /
    • pp.106-118
    • /
    • 2019
  • In order to develop pedestrian navigation service that provides optimal pedestrian routes based on pedestrian satisfaction levels, it is required to develop a prediction model that can estimate a pedestrian's satisfaction level given a certain condition. Thus, the aim of the present study is to develop a pedestrian satisfaction prediction model based on three machine learning algorithms: Logistic Regression, Random Forest, and Artificial Neural Network models. The 2009, 2012, 2013, 2014, and 2015 Pedestrian Satisfaction Survey Data in Seoul, Korea are used to train and test the machine learning models. As a result, the Random Forest model shows the best prediction performance among the three (Accuracy: 0.798, Recall: 0.906, Precision: 0.842, F1 Score: 0.873, AUC: 0.795). The performance of Artificial Neural Network is the second (Accuracy: 0.773, Recall: 0.917, Precision: 0.811, F1 Score: 0.868, AUC: 0.738) and Logistic Regression model's performance follows the second (Accuracy: 0.764, Recall: 1.000, Precision: 0.764, F1 Score: 0.868, AUC: 0.575). The precision score of the Random Forest model implies that approximately 84.2% of pedestrians may be satisfied if they walk the areas, suggested by the Random Forest model.

Machine learning-based analysis and prediction model on the strengthening mechanism of biopolymer-based soil treatment

  • Haejin Lee;Jaemin Lee;Seunghwa Ryu;Ilhan Chang
    • Geomechanics and Engineering
    • /
    • v.36 no.4
    • /
    • pp.381-390
    • /
    • 2024
  • The introduction of bio-based materials has been recommended in the geotechnical engineering field to reduce environmental pollutants such as heavy metals and greenhouse gases. However, bio-treated soil methods face limitations in field application due to short research periods and insufficient verification of engineering performance, especially when compared to conventional materials like cement. Therefore, this study aimed to develop a machine learning model for predicting the unconfined compressive strength, a representative soil property, of biopolymer-based soil treatment (BPST). Four machine learning algorithms were compared to determine a suitable model, including linear regression (LR), support vector regression (SVR), random forest (RF), and neural network (NN). Except for LR, the SVR, RF, and NN algorithms exhibited high predictive performance with an R2 value of 0.98 or higher. The permutation feature importance technique was used to identify the main factors affecting the strength enhancement of BPST. The results indicated that the unconfined compressive strength of BPST is affected by mean particle size, followed by biopolymer content and water content. With a reliable prediction model, the proposed model can present guidelines prior to laboratory testing and field application, thereby saving a significant amount of time and money.

Machine Learning Approach to Classifying Fatal and Non-Fatal Accidents in Industries (사망사고와 부상사고의 산업재해분류를 위한 기계학습 접근법)

  • Kang, Sungsik;Chang, Seong Rok;Suh, Yongyoon
    • Journal of the Korean Society of Safety
    • /
    • v.36 no.5
    • /
    • pp.52-60
    • /
    • 2021
  • As the prevention of fatal accidents is considered an essential part of social responsibilities, both government and individual have devoted efforts to mitigate the unsafe conditions and behaviors that facilitate accidents. Several studies have analyzed the factors that cause fatal accidents and compared them to those of non-fatal accidents. However, studies on mathematical and systematic analysis techniques for identifying the features of fatal accidents are rare. Recently, various industrial fields have employed machine learning algorithms. This study aimed to apply machine learning algorithms for the classification of fatal and non-fatal accidents based on the features of each accident. These features were obtained by text mining literature on accidents. The classification was performed using four machine learning algorithms, which are widely used in industrial fields, including logistic regression, decision tree, neural network, and support vector machine algorithms. The results revealed that the machine learning algorithms exhibited a high accuracy for the classification of accidents into the two categories. In addition, the importance of comparing similar cases between fatal and non-fatal accidents was discussed. This study presented a method for classifying accidents using machine learning algorithms based on the reports on previous studies on accidents.

A Quality Forecasting System in Glass Melting Processes using Genetic Algorithms (유전 알고리즘을 이용한 유리 용해 공정에서의 불량예측 시스템)

  • Jung, Ho-Sang;Jeong, Bong-Ju
    • IE interfaces
    • /
    • v.13 no.1
    • /
    • pp.78-91
    • /
    • 2000
  • This paper presents a computerized quality forecasting system for glass manufacturing. In forecasting the molten glass quality, we are concerned with three major issues : (1) to find the reasonable time lags between a set of process conditions and the quality measurement of glass products, (2) to find the most significant process variables affecting the quality, and (3) to construct the appropriate causal forecasting models using genetic algorithms. The experimental results show the proposed model results in better forecasting than linear regression model. The suggested forecasting model was implemented successfully and is being currently used in a real manufacturing line.

  • PDF

The Study for Improvement of Data-Quality of Cut-Slope Management System Using Machine Learning (기계학습을 활용한 도로비탈면관리시스템 데이터 품질강화에 관한 연구)

  • Lee, Se-Hyeok;Kim, Seung-Hyun;Woo, Yonghoon;Moon, Jae-Pil;Yang, Inchul
    • The Journal of Engineering Geology
    • /
    • v.31 no.1
    • /
    • pp.31-42
    • /
    • 2021
  • Database of Cut-slope management system (CSMS) has been constructed based on investigations of all slopes on the roads of the whole country. The investigation data is documented by human, so it is inevitable to avoid human-error such as missing-data and incorrect entering data into computer. The goal of this paper is constructing a prediction model based on several machine-learning algorithms to solve those imperfection problems of the CSMS data. First of all, the character-type data in CSMS data must be transformed to numeric data. After then, two algorithms, i.g., multinomial logistic regression and deep-neural-network (DNN), are performed, and those prediction models from two algorithms are compared. Finally, it is identified that the accuracy of DNN-model is better than logistic model, and the DNN-model will be utilized to improve data-quality.

Object Tracking using Color Histogram and CNN Model (컬러 히스토그램과 CNN 모델을 이용한 객체 추적)

  • Park, Sung-Jun;Baek, Joong-Hwan
    • Journal of Advanced Navigation Technology
    • /
    • v.23 no.1
    • /
    • pp.77-83
    • /
    • 2019
  • In this paper, we propose an object tracking algorithm based on color histogram and convolutional neural network model. In order to increase the tracking accuracy, we synthesize generic object tracking using regression network algorithm which is one of the convolutional neural network model-based tracking algorithms and a mean-shift tracking algorithm which is a color histogram-based algorithm. Both algorithms are classified through support vector machine and designed to select an algorithm with higher tracking accuracy. The mean-shift tracking algorithm tends to move the bounding box to a large range when the object tracking fails, thus we improve the accuracy by limiting the movement distance of the bounding box. Also, we improve the performance by initializing the tracking start positions of the two algorithms based on the average brightness and the histogram similarity. As a result, the overall accuracy of the proposed algorithm is 1.6% better than the existing generic object tracking using regression network algorithm.