• Title/Summary/Keyword: Prediction of variables

Search Result 1,810, Processing Time 0.033 seconds

Framework for improving the prediction rate with respect to outdoor thermal comfort using machine learning

  • Jeong, Jaemin;Jeong, Jaewook;Lee, Minsu;Lee, Jaehyun
    • International conference on construction engineering and project management
    • /
    • 2022.06a
    • /
    • pp.119-127
    • /
    • 2022
  • Most of the construction works are conducted outdoors, so the construction workers are affected by weather conditions such as temperature, humidity, and wind velocity which can be evaluated the thermal comfort as environmental factors. In our previous researches, it was found that construction accidents are usually occurred in the discomfort ranges. The safety management, therefore, should be planned in consideration of the thermal comfort and measured by a specialized simulation tool. However, it is very complex, time-consuming, and difficult to model. To address this issue, this study is aimed to develop a framework of a prediction model for improving the prediction accuracy about outdoor thermal comfort considering environmental factors using machine learning algorithms with hyperparameter tuning. This study is done in four steps: i) Establishment of database, ii) Selection of variables to develop prediction model, iii) Development of prediction model; iv) Conducting of hyperparameter tuning. The tree type algorithm is used to develop the prediction model. The results of this study are as follows. First, considering three variables related to environmental factor, the prediction accuracy was 85.74%. Second, the prediction accuracy was 86.55% when considering four environmental factors. Third, after conducting hyperparameter tuning, the prediction accuracy was increased up to 87.28%. This study has several contributions. First, using this prediction model, the thermal comfort can be calculated easily and quickly. Second, using this prediction model, the safety management can be utilized to manage the construction accident considering weather conditions.

  • PDF

A Study on the Prediction of Welding Flaw Using Neural Network (인공 신경망을 이용한 실시간 용접품질 예측에 관한 연구)

  • Cho, Jae Hyung;Ko, Sang Hyun
    • Journal of Digital Convergence
    • /
    • v.17 no.5
    • /
    • pp.217-223
    • /
    • 2019
  • A study in predicting defects of spot welding in real time in automotive field is essential for cost reduction and high quality production. Welding quality is determined by shear strength and the size of the nugget, and results depend on different independent variables. In order to develop the real-time prediction system, multiple regression analyses were conducted and the two dependent variables were obtained with sufficient statistical results with three independent variables, however, the quality prediction by the regression formula could not ensure accuracy. In this study, a multi-layer neural network circuit was constructed. The neural network by 10 dynamic resistance variables was constructed with three hidden layers to obtain execution functions and weighting matrix. In this case, the neural network was established with three independent variables based on regression analysis, as there could be difficulties in real-time control due to too many input variables. As a result, all test data were divided into poor, partial, and modalities. Therefore, a real-time welding quality determination system by three independent variables obtained by multiple regression analysis was completed.

A Prediction of Stock Price Movements Using Support Vector Machines in Indonesia

  • ARDYANTA, Ervandio Irzky;SARI, Hasrini
    • The Journal of Asian Finance, Economics and Business
    • /
    • v.8 no.8
    • /
    • pp.399-407
    • /
    • 2021
  • Stock movement is difficult to predict because it has dynamic characteristics and is influenced by many factors. Even so, there are some approaches to predict stock price movements, namely technical analysis, fundamental analysis, and sentiment analysis. Many researches have tried to predict stock price movement by utilizing these analysis techniques. However, the results obtained are varied and inconsistent depending on the variables and object used. This is because stock price movement is influenced by a variety of factors, and it is likely that those studies did not cover all of them. One of which is that no research considers the use of fundamental analysis in terms of currency exchange rates and the use of foreign stock price index movement related to the technical analysis. This research aims to predict stock price movements in Indonesia based on sentiment analysis, technical analysis, and fundamental analysis using Support Vector Machine. The result obtained has a prediction accuracy rate of 65,33% on an average. The inclusion of currency exchange rate and foreign stock price index movement as a predictor in this research which can increase average prediction accuracy rate by 11.78% compared to the prediction without using these two variables which only results in average prediction accuracy rate of 53.55%.

Development of Traffic Accident Prediction Model Based on Traffic Node and Link Using XGBoost (XGBoost를 이용한 교통노드 및 교통링크 기반의 교통사고 예측모델 개발)

  • Kim, Un-Sik;Kim, Young-Gyu;Ko, Joong-Hoon
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.45 no.2
    • /
    • pp.20-29
    • /
    • 2022
  • This study intends to present a traffic node-based and link-based accident prediction models using XGBoost which is very excellent in performance among machine learning models, and to develop those models with sustainability and scalability. Also, we intend to present those models which predict the number of annual traffic accidents based on road types, weather conditions, and traffic information using XGBoost. To this end, data sets were constructed by collecting and preprocessing traffic accident information, road information, weather information, and traffic information. The SHAP method was used to identify the variables affecting the number of traffic accidents. The five main variables of the traffic node-based accident prediction model were snow cover, precipitation, the number of entering lanes and connected links, and slow speed. Otherwise, those of the traffic link-based accident prediction model were snow cover, precipitation, the number of lanes, road length, and slow speed. As the evaluation results of those models, the RMSE values of those models were each 0.2035 and 0.2107. In this study, only data from Sejong City were used to our models, but ours can be applied to all regions where traffic nodes and links are constructed. Therefore, our prediction models can be extended to a wider range.

Elman ANNs along with two different sets of inputs for predicting the properties of SCCs

  • Gholamzadeh-Chitgar, Atefeh;Berenjian, Javad
    • Computers and Concrete
    • /
    • v.24 no.5
    • /
    • pp.399-412
    • /
    • 2019
  • In this investigation, Elman neural networks were utilized for predicting the mechanical properties of Self-Compacting Concretes (SCCs). Elman models were designed by using experimental data of many different concrete mixdesigns of various types of SCC that were collected from the literature. In order to investigate the effectiveness of the selected input variables on the network performance in predicting intended properties, utilized data in artificial neural networks were considered in two sets of 8 and 140 input variables. The obtained outcomes showed that not only can the developed Elman ANNs predict the mechanical properties of SCCs with high accuracy, but also for all of the desired outputs, networks with 140 inputs, compared to ones with 8, have a remarkable percent improvement in the obtained prediction results. The prediction accuracy can significantly be improved by using a more complete and accurate set of key factors affecting the desired outputs, as input variables, in the networks, which is leading to more similarity of the predicted results gained from networks to experimental results.

Fatigue Life Prediction of FRP Composites under Uniaxial Tension and Pure Torsion Loadings (인장-비틀림 하중에 의한 섬유강화 복합재료의 피로수명 예측)

  • Park Sung-Oan
    • Transactions of the Korean Society of Machine Tool Engineers
    • /
    • v.13 no.6
    • /
    • pp.64-73
    • /
    • 2004
  • A fatigue damage accumulation model based on the continuum damage mechanics theory was developed where modulus decay ratios in tension and shear were used as indicators for damage variables D. In the model, the damage variables are considered to be second-order tensors. Then, the maximum principal damage variable, $D^*$ is introduced. According to the similarity to the principal stress, $D^*$ is obtained as the maximum eigen value of damage tensor [D]. Under proportional tension and torsion loadings, fatigue lives were satisfactorily predicted at any combined stress ratios using the present model in which the Fatigue characteristics only under uniaxial tension and pure torsion loadings were needed. Fatigue life prediction under uniaxial tension and pure torsion loadings, was performed based on the damage mechanics using boundary element method.

Design of Regression Model and Pattern Classifier by Using Principal Component Analysis (주성분 분석법을 이용한 회귀다항식 기반 모델 및 패턴 분류기 설계)

  • Roh, Seok-Beom;Lee, Dong-Yoon
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.10 no.6
    • /
    • pp.594-600
    • /
    • 2017
  • The new design methodology of prediction model and pattern classification, which is based on the dimension reduction algorithm called principal component analysis, is introduced in this paper. Principal component analysis is one of dimension reduction techniques which are used to reduce the dimension of the input space and extract some good features from the original input variables. The extracted input variables are applied to the prediction model and pattern classifier as the input variables. The introduced prediction model and pattern classifier are based on the very simple regression which is the key point of the paper. The structural simplicity of the prediction model and pattern classifier leads to reducing the over-fitting problem. In order to validate the proposed prediction model and pattern classifier, several machine learning data sets are used.

Prediction on the Economic Activity Level of the Elderly in South Korea - Focusing on Machine Learning Method Combined with Forecast Combination - (우리나라 고령층의 경제활동 수준 예측 - 머신러닝 기법과 연계한 예측조합법을 중심으로 -)

  • Kim, Jeong-Woo
    • Journal of the Korea Convergence Society
    • /
    • v.13 no.5
    • /
    • pp.237-247
    • /
    • 2022
  • This study predicts the economic activity level of the elderly in Korea using various machine learning methods. While the previous studies mainly focused on testing the relationship between the economic activity level and the life satisfaction or the social security system, this study aims at the accurate prediction on the economic activity level of the elderly using various machine learning methods and the forecast combination. Dependent variables such as the activity rate, employment rate, etc and independent variables such as the income, average wage, etc compose the dataset in this study. Five different machine learning methods and two forecast combinations are applied to the given dataset. The prediction performances of the machine learning method and the forecast combination varied across the dependent variables and prediction intervals, but it was found that the forecast combination was relatively superior to other methods in terms of the stability of prediction. This study has significance in that it accurately predicted the economic activity level of the elderly and achieved the stability of the prediction, raising practicality from a policy perspective.

Machine Learning-Based Prediction of COVID-19 Severity and Progression to Critical Illness Using CT Imaging and Clinical Data

  • Subhanik Purkayastha;Yanhe Xiao;Zhicheng Jiao;Rujapa Thepumnoeysuk;Kasey Halsey;Jing Wu;Thi My Linh Tran;Ben Hsieh;Ji Whae Choi;Dongcui Wang;Martin Vallieres;Robin Wang;Scott Collins;Xue Feng;Michael Feldman;Paul J. Zhang;Michael Atalay;Ronnie Sebro;Li Yang;Yong Fan;Wei-hua Liao;Harrison X. Bai
    • Korean Journal of Radiology
    • /
    • v.22 no.7
    • /
    • pp.1213-1224
    • /
    • 2021
  • Objective: To develop a machine learning (ML) pipeline based on radiomics to predict Coronavirus Disease 2019 (COVID-19) severity and the future deterioration to critical illness using CT and clinical variables. Materials and Methods: Clinical data were collected from 981 patients from a multi-institutional international cohort with real-time polymerase chain reaction-confirmed COVID-19. Radiomics features were extracted from chest CT of the patients. The data of the cohort were randomly divided into training, validation, and test sets using a 7:1:2 ratio. A ML pipeline consisting of a model to predict severity and time-to-event model to predict progression to critical illness were trained on radiomics features and clinical variables. The receiver operating characteristic area under the curve (ROC-AUC), concordance index (C-index), and time-dependent ROC-AUC were calculated to determine model performance, which was compared with consensus CT severity scores obtained by visual interpretation by radiologists. Results: Among 981 patients with confirmed COVID-19, 274 patients developed critical illness. Radiomics features and clinical variables resulted in the best performance for the prediction of disease severity with a highest test ROC-AUC of 0.76 compared with 0.70 (0.76 vs. 0.70, p = 0.023) for visual CT severity score and clinical variables. The progression prediction model achieved a test C-index of 0.868 when it was based on the combination of CT radiomics and clinical variables compared with 0.767 when based on CT radiomics features alone (p < 0.001), 0.847 when based on clinical variables alone (p = 0.110), and 0.860 when based on the combination of visual CT severity scores and clinical variables (p = 0.549). Furthermore, the model based on the combination of CT radiomics and clinical variables achieved time-dependent ROC-AUCs of 0.897, 0.933, and 0.927 for the prediction of progression risks at 3, 5 and 7 days, respectively. Conclusion: CT radiomics features combined with clinical variables were predictive of COVID-19 severity and progression to critical illness with fairly high accuracy.

Corporate Default Prediction Model Using Deep Learning Time Series Algorithm, RNN and LSTM (딥러닝 시계열 알고리즘 적용한 기업부도예측모형 유용성 검증)

  • Cha, Sungjae;Kang, Jungseok
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.4
    • /
    • pp.1-32
    • /
    • 2018
  • In addition to stakeholders including managers, employees, creditors, and investors of bankrupt companies, corporate defaults have a ripple effect on the local and national economy. Before the Asian financial crisis, the Korean government only analyzed SMEs and tried to improve the forecasting power of a default prediction model, rather than developing various corporate default models. As a result, even large corporations called 'chaebol enterprises' become bankrupt. Even after that, the analysis of past corporate defaults has been focused on specific variables, and when the government restructured immediately after the global financial crisis, they only focused on certain main variables such as 'debt ratio'. A multifaceted study of corporate default prediction models is essential to ensure diverse interests, to avoid situations like the 'Lehman Brothers Case' of the global financial crisis, to avoid total collapse in a single moment. The key variables used in corporate defaults vary over time. This is confirmed by Beaver (1967, 1968) and Altman's (1968) analysis that Deakins'(1972) study shows that the major factors affecting corporate failure have changed. In Grice's (2001) study, the importance of predictive variables was also found through Zmijewski's (1984) and Ohlson's (1980) models. However, the studies that have been carried out in the past use static models. Most of them do not consider the changes that occur in the course of time. Therefore, in order to construct consistent prediction models, it is necessary to compensate the time-dependent bias by means of a time series analysis algorithm reflecting dynamic change. Based on the global financial crisis, which has had a significant impact on Korea, this study is conducted using 10 years of annual corporate data from 2000 to 2009. Data are divided into training data, validation data, and test data respectively, and are divided into 7, 2, and 1 years respectively. In order to construct a consistent bankruptcy model in the flow of time change, we first train a time series deep learning algorithm model using the data before the financial crisis (2000~2006). The parameter tuning of the existing model and the deep learning time series algorithm is conducted with validation data including the financial crisis period (2007~2008). As a result, we construct a model that shows similar pattern to the results of the learning data and shows excellent prediction power. After that, each bankruptcy prediction model is restructured by integrating the learning data and validation data again (2000 ~ 2008), applying the optimal parameters as in the previous validation. Finally, each corporate default prediction model is evaluated and compared using test data (2009) based on the trained models over nine years. Then, the usefulness of the corporate default prediction model based on the deep learning time series algorithm is proved. In addition, by adding the Lasso regression analysis to the existing methods (multiple discriminant analysis, logit model) which select the variables, it is proved that the deep learning time series algorithm model based on the three bundles of variables is useful for robust corporate default prediction. The definition of bankruptcy used is the same as that of Lee (2015). Independent variables include financial information such as financial ratios used in previous studies. Multivariate discriminant analysis, logit model, and Lasso regression model are used to select the optimal variable group. The influence of the Multivariate discriminant analysis model proposed by Altman (1968), the Logit model proposed by Ohlson (1980), the non-time series machine learning algorithms, and the deep learning time series algorithms are compared. In the case of corporate data, there are limitations of 'nonlinear variables', 'multi-collinearity' of variables, and 'lack of data'. While the logit model is nonlinear, the Lasso regression model solves the multi-collinearity problem, and the deep learning time series algorithm using the variable data generation method complements the lack of data. Big Data Technology, a leading technology in the future, is moving from simple human analysis, to automated AI analysis, and finally towards future intertwined AI applications. Although the study of the corporate default prediction model using the time series algorithm is still in its early stages, deep learning algorithm is much faster than regression analysis at corporate default prediction modeling. Also, it is more effective on prediction power. Through the Fourth Industrial Revolution, the current government and other overseas governments are working hard to integrate the system in everyday life of their nation and society. Yet the field of deep learning time series research for the financial industry is still insufficient. This is an initial study on deep learning time series algorithm analysis of corporate defaults. Therefore it is hoped that it will be used as a comparative analysis data for non-specialists who start a study combining financial data and deep learning time series algorithm.