• Title/Summary/Keyword: Machine Learning Procedure

Search Result 116, Processing Time 0.032 seconds

Learning algorithms for big data logistic regression on RHIPE platform (RHIPE 플랫폼에서 빅데이터 로지스틱 회귀를 위한 학습 알고리즘)

  • Jung, Byung Ho;Lim, Dong Hoon
    • Journal of the Korean Data and Information Science Society
    • /
    • v.27 no.4
    • /
    • pp.911-923
    • /
    • 2016
  • Machine learning becomes increasingly important in the big data era. Logistic regression is a type of classification in machine leaning, and has been widely used in various fields, including medicine, economics, marketing, and social sciences. Rhipe that integrates R and Hadoop environment, has not been discussed by many researchers owing to the difficulty of its installation and MapReduce implementation. In this paper, we present the MapReduce implementation of Gradient Descent algorithm and Newton-Raphson algorithm for logistic regression using Rhipe. The Newton-Raphson algorithm does not require a learning rate, while Gradient Descent algorithm needs to manually pick a learning rate. We choose the learning rate by performing the mixed procedure of grid search and binary search for processing big data efficiently. In the performance study, our Newton-Raphson algorithm outpeforms Gradient Descent algorithm in all the tested data.

Normal data based rotating machine anomaly detection using CNN with self-labeling

  • Bae, Jaewoong;Jung, Wonho;Park, Yong-Hwa
    • Smart Structures and Systems
    • /
    • v.29 no.6
    • /
    • pp.757-766
    • /
    • 2022
  • To train deep learning algorithms, a sufficient number of data are required. However, in most engineering systems, the acquisition of fault data is difficult or sometimes not feasible, while normal data are secured. The dearth of data is one of the major challenges to developing deep learning models, and fault diagnosis in particular cannot be made in the absence of fault data. With this context, this paper proposes an anomaly detection methodology for rotating machines using only normal data with self-labeling. Since only normal data are used for anomaly detection, a self-labeling method is used to generate a new labeled dataset. The overall procedure includes the following three steps: (1) transformation of normal data to self-labeled data based on a pretext task, (2) training the convolutional neural networks (CNN), and (3) anomaly detection using defined anomaly score based on the softmax output of the trained CNN. The softmax value of the abnormal sample shows different behavior from the normal softmax values. To verify the proposed method, four case studies were conducted, on the Case Western Reserve University (CWRU) bearing dataset, IEEE PHM 2012 data challenge dataset, PHMAP 2021 data challenge dataset, and laboratory bearing testbed; and the results were compared to those of existing machine learning and deep learning methods. The results showed that the proposed algorithm could detect faults in the bearing testbed and compressor with over 99.7% accuracy. In particular, it was possible to detect not only bearing faults but also structural faults such as unbalance and belt looseness with very high accuracy. Compared with the existing GAN, the autoencoder-based anomaly detection algorithm, the proposed method showed high anomaly detection performance.

Reconstruction of High-Resolution Facial Image Based on Recursive Error Back-Projection of Top-Down Machine Learning (하향식 기계학습의 반복적 오차 역투영에 기반한 고해상도 얼굴 영상의 복원)

  • Park, Jeong-Seon;Lee, Seong-Whan
    • Journal of KIISE:Software and Applications
    • /
    • v.34 no.3
    • /
    • pp.266-274
    • /
    • 2007
  • This paper proposes a new reconstruction method of high-resolution facial image from a low-resolution facial image based on top-down machine learning and recursive error back-projection. A face is represented by a linear combination of prototypes of shape and that of texture. With the shape and texture information of each pixel in a given low-resolution facial image, we can estimate optimal coefficients for a linear combination of prototypes of shape and those that of texture by solving least square minimizations. Then high-resolution facial image can be obtained by using the optimal coefficients for linear combination of the high-resolution prototypes. In addition, a recursive error back-projection procedure is applied to improve the reconstruction accuracy of high-resolution facial image. The encouraging results of the proposed method show that our method can be used to improve the performance of the face recognition by applying our method to reconstruct high-resolution facial images from low-resolution images captured at a distance.

Application of Artificial Neural Network to Flamelet Library for Gaseous Hydrogen/Liquid Oxygen Combustion at Supercritical Pressure (초임계 압력조건에서 기체수소-액체산소 연소해석의 층류화염편 라이브러리에 대한 인공신경망 학습 적용)

  • Jeon, Tae Jun;Park, Tae Seon
    • Journal of the Korean Society of Propulsion Engineers
    • /
    • v.25 no.6
    • /
    • pp.1-11
    • /
    • 2021
  • To develop an efficient procedure related to the flamelet library, the machine learning process based on artificial neural network(ANN) is applied for the gaseous hydrogen/liquid oxygen combustor under a supercritical pressure condition. For hidden layers, 25 combinations based on Rectified Linear Unit(ReLU) and hyperbolic tangent are adopted to find an optimum architecture in terms of the computational efficiency and the training performance. For activation functions, the hyperbolic tangent is proper to get the high learning performance for accurate properties. A transformation learning data is proposed to improve the training performance. When the optimal node is arranged for the 4 hidden layers, it is found to be the most efficient in terms of training performance and computational cost. Compared to the interpolation procedure, the ANN procedure reduces computational time and system memory by 37% and 99.98%, respectively.

Advanced performance evaluation system for existing concrete bridges

  • Miyamoto, Ayaho;Emoto, Hisao;Asano, Hiroyoshi
    • Computers and Concrete
    • /
    • v.14 no.6
    • /
    • pp.727-743
    • /
    • 2014
  • The management of existing concrete bridges has become a major social concern in many developed countries due to the large number of bridges exhibiting signs of significant deterioration. This problem has increased the demand for effective maintenance and renewal planning. In order to implement an appropriate management procedure for a structure, a wide array of corrective strategies must be evaluated with respect to not only the condition state of each defect but also safety, economy and sustainability. This paper describes a new performance evaluation system for existing concrete bridges. The system evaluates performance based on load carrying capability and durability from the results of a visual inspection and specification data, and describes the necessity of maintenance. It categorizes all girders and slabs as either unsafe, severe deterioration, moderate deterioration, mild deterioration, or safe. The technique employs an expert system with an appropriate knowledge base in the evaluation. A characteristic feature of the system is the use of neural networks to evaluate the performance and facilitate refinement of the knowledge base. The neural network proposed in the present study has the capability to prevent an inference process and knowledge base from becoming a black box. It is very important that the system is capable of detailing how the performance is calculated since the road network represents a huge investment. The effectiveness of the neural network and machine learning method is verified by comparing diagnostic results by bridge experts.

Effective machine learning-based haze removal technique using haze-related features (안개관련 특징을 이용한 효과적인 머신러닝 기반 안개제거 기법)

  • Lee, Ju-Hee;Kang, Bong-Soon
    • Journal of IKEEE
    • /
    • v.25 no.1
    • /
    • pp.83-87
    • /
    • 2021
  • In harsh environments such as fog or fine dust, the cameras' detection ability for object recognition may significantly decrease. In order to accurately obtain important information even in bad weather, fog removal algorithms are necessarily required. Research has been conducted in various ways, such as computer vision/data-based fog removal technology. In those techniques, estimating the amount of fog through the input image's depth information is an important procedure. In this paper, a linear model is presented under the assumption that the image dark channel dictionary, saturation ∗ value, and sharpness characteristics are linearly related to depth information. The proposed method of haze removal through a linear model shows the superiority of algorithm performance in quantitative numerical evaluation.

A Fault Prognostic System for the Logistics Rotational Equipment (물류 회전설비 고장예지 시스템)

  • Soo Hyung Kim;Berdibayev Yergali;Hyeongki Jo;Kyu Ik Kim;Jin Suk Kim
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.46 no.2
    • /
    • pp.168-175
    • /
    • 2023
  • In the era of the 4th Industrial Revolution, Logistic 4.0 using data-based technologies such as IoT, Bigdata, and AI is a keystone to logistics intelligence. In particular, the AI technology such as prognostics and health management for the maintenance of logistics facilities is being in the spotlight. In order to ensure the reliability of the facilities, Time-Based Maintenance (TBM) can be performed in every certain period of time, but this causes excessive maintenance costs and has limitations in preventing sudden failures and accidents. On the other hand, the predictive maintenance using AI fault diagnosis model can do not only overcome the limitation of TBM by automatically detecting abnormalities in logistics facilities, but also offer more advantages by predicting future failures and allowing proactive measures to ensure stable and reliable system management. In order to train and predict with AI machine learning model, data needs to be collected, processed, and analyzed. In this study, we have develop a system that utilizes an AI detection model that can detect abnormalities of logistics rotational equipment and diagnose their fault types. In the discussion, we will explain the entire experimental processes : experimental design, data collection procedure, signal processing methods, feature analysis methods, and the model development.

A Study on Training Ensembles of Neural Networks - A Case of Stock Price Prediction (신경망 학습앙상블에 관한 연구 - 주가예측을 중심으로 -)

  • 이영찬;곽수환
    • Journal of Intelligence and Information Systems
    • /
    • v.5 no.1
    • /
    • pp.95-101
    • /
    • 1999
  • In this paper, a comparison between different methods to combine predictions from neural networks will be given. These methods are bagging, bumping, and balancing. Those are based on the analysis of the ensemble generalization error into an ambiguity term and a term incorporating generalization performances of individual networks. Neural Networks and AI machine learning models are prone to overfitting. A strategy to prevent a neural network from overfitting, is to stop training in early stage of the learning process. The complete data set is spilt up into a training set and a validation set. Training is stopped when the error on the validation set starts increasing. The stability of the networks is highly dependent on the division in training and validation set, and also on the random initial weights and the chosen minimization procedure. This causes early stopped networks to be rather unstable: a small change in the data or different initial conditions can produce large changes in the prediction. Therefore, it is advisable to apply the same procedure several times starting from different initial weights. This technique is often referred to as training ensembles of neural networks. In this paper, we presented a comparison of three statistical methods to prevent overfitting of neural network.

  • PDF

Development of Induction Motor Diagnosis Method by Variance Based Feature Selection and PCA-ELM (분산정보를 이용한 특징 선택과 PCA-ELM 기반의 유도전동기 고장진단 기법 개발)

  • Lee, Dae-Jong;Chun, Myung-Geun
    • Journal of the Korean Institute of Illuminating and Electrical Installation Engineers
    • /
    • v.24 no.8
    • /
    • pp.55-61
    • /
    • 2010
  • In this paper, we proposed selective extraction method of frequency information and PCA-ELM based diagnosis system for three-phase induction motors. As the first step for diagnosis procedure, DFT is performed to transform the acquired current signal into frequency domain. And then, frequency components are selected according to discriminate order calculated by variance As the next step, feature extraction is performed by principal component analysis (PCA). Finally, we used the classifier based on Extreme Learning Machine (ELM) with fast learning procedure. To show the effectiveness, the proposed diagnostic system has been intensively tested with the various data acquired under different electrical and mechanical faults with varying load.

An Automatic Method for Selecting Comparative Standard Land Parcels in Land Price Appraisal Using a Decision Tree (의사결정트리를 이용한 개별 공시지가 비교표준지의 자동 선정)

  • Kim, Jong-Yoon;Park, Soo-Hong
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.7 no.1
    • /
    • pp.9-19
    • /
    • 2004
  • The selection of comparative standard parcels should be objective and reasonable, which is an important task in the individual land price appraisal procedure. However, the current procedure is mainly done manually by government officials. Therefore, the efficiency and objectiveness of this selection procedure is not guaranteed and questionable. In this study, we first defined the problem by analyzing the current comparative standard land parcel selection method. In addition, we devised a decision tree-based method using a machine learning algorithm that is considered to be efficient and objective compared to the current selection procedure. Finally the proposed method is then applied to the study area for evaluating the appropriateness and accuracy.

  • PDF