• Title/Summary/Keyword: HyperParameter

Search Result 111, Processing Time 0.032 seconds

Exploring performance improvement through split prediction in stock price prediction model (주가 예측 모델에서의 분할 예측을 통한 성능향상 탐구)

  • Yeo, Tae Geon Woo;Ryu, Dohui;Nam, Jungwon;Oh, Hayoung
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.26 no.4
    • /
    • pp.503-509
    • /
    • 2022
  • The purpose of this study is to set the rate of change between the market price of the next day and the previous day to be predicted as the predicted value, and the market price for each section is generated by dividing the stock price ranking of the next day to be predicted at regular intervals, which is different from the previous papers that predict the market price. We would like to propose a new time series data prediction method that predicts the market price change rate of the final next day through a model using the rate of change as the predicted value. The change in the performance of the model according to the degree of subdivision of the predicted value and the type of input data was analyzed.

Forecasting Baltic Dry Index by Implementing Time-Series Decomposition and Data Augmentation Techniques (시계열 분해 및 데이터 증강 기법 활용 건화물운임지수 예측)

  • Han, Min Soo;Yu, Song Jin
    • Journal of Korean Society for Quality Management
    • /
    • v.50 no.4
    • /
    • pp.701-716
    • /
    • 2022
  • Purpose: This study aims to predict the dry cargo transportation market economy. The subject of this study is the BDI (Baltic Dry Index) time-series, an index representing the dry cargo transport market. Methods: In order to increase the accuracy of the BDI time-series, we have pre-processed the original time-series via time-series decomposition and data augmentation techniques and have used them for ANN learning. The ANN algorithms used are Multi-Layer Perceptron (MLP), Recurrent Neural Network (RNN), and Long Short-Term Memory (LSTM) to compare and analyze the case of learning and predicting by applying time-series decomposition and data augmentation techniques. The forecast period aims to make short-term predictions at the time of t+1. The period to be studied is from '22. 01. 07 to '22. 08. 26. Results: Only for the case of the MAPE (Mean Absolute Percentage Error) indicator, all ANN models used in the research has resulted in higher accuracy (1.422% on average) in multivariate prediction. Although it is not a remarkable improvement in prediction accuracy compared to uni-variate prediction results, it can be said that the improvement in ANN prediction performance has been achieved by utilizing time-series decomposition and data augmentation techniques that were significant and targeted throughout this study. Conclusion: Nevertheless, due to the nature of ANN, additional performance improvements can be expected according to the adjustment of the hyper-parameter. Therefore, it is necessary to try various applications of multiple learning algorithms and ANN optimization techniques. Such an approach would help solve problems with a small number of available data, such as the rapidly changing business environment or the current shipping market.

A Study of AI-based Monitoring Techniques for Land-based Debris in Stream (AI기반 하천 부유쓰레기 모니터링 기술 연구)

  • Kyungsu Lee;Haein Yoon;Jonghwa Won;Sang Hwa Jung
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2023.05a
    • /
    • pp.137-137
    • /
    • 2023
  • 해양쓰레기는 해안의 심미적 가치 저하뿐만 아니라 생태계 파괴, 유령 어업에 따른 수산업 피해 등의 사회적·환경적 문제를 발생시키며, 그중 70% 이상은 육상 기인으로 플라스틱 및 기타 쓰레기가 주를 이루는 해외와 달리 국내의 경우 다량의 초목류를 포함하고 있다. 다양한 부유쓰레기에 대한 기존의 해양쓰레기량 추정의 한계와 하천·하구 쓰레기 수거의 효율화를 위해 해양으로 유입되는 부유쓰레기 방지를 위한 실효성 있는 대책 수립이 필요한 실정이다. 본 연구는 해양 유입 전 하천의 차단시설에 차집된 부유쓰레기의 수거 효율화 및 지속가능한 해양쓰레기 데이터 구축을 위해 AI기반의 기술을 통해 부유쓰레기 성상 분석 기법(Object Detection)과 차집량 분석 기법(Semantic Segmentation)을 활용하였다. 실제와 유사한 데이터 수집을 위해 다양한 하천 환경(정수조, 소하천, 급경사수로)에 대해 탁도(녹조, 유사), 광량, 쓰레기형상, 초목류 함량, 날씨(소하천), 유속(급경사수로) 등의 실험조건에 대하여 해양쓰레기 분류 기준 및 통계를 바탕으로 부유쓰레기 종류 선정하여 학습을 위한 데이터를 수집하였다. 학습 목적에 따라 구분하여 라벨링(Bounding box, Polygon)을 수행하고, 각 분석 기법별 전이학습을 통해 Phase 1(정수조), Phase 2(소하천), Phase 3(급경사수로) 순서로 모델을 고도화하였다. 성상 분석을 위해 YOLO v4를 활용하여 Train, Test DataSet(9:1)을 구성하고 학습 및 평가는 Iteration마다의 mAP, loss 값을 통해 비교하였으며, 학습 Phase에 따라 모델 고도화로 Test Set의 mAP 값이 성상별로 높아짐을 확인하였으며, 차집량 분석을 위해 Unet을 활용하여 Train, Test, Validation DataSet(8.5:1:0.5)을 구성하고 epoch별 IoU(intersection over Union), F1-score, loss 값을 비교하여 정성적, 정량적 평가 모두 Phase 3에서 가장 높은 성능을 확인하였다. 향후 하천 환경에서의 다양한 영양인자별 분석을 통해 주요 영향인자 도출 및 Hyper Parameter 최적화를 통한 모델 고도화로 인해 활용성이 높아질 것으로 판단된다.

  • PDF

QoS-Aware Optimal SNN Model Parameter Generation Method in Neuromorphic Environment (뉴로모픽 환경에서 QoS를 고려한 최적의 SNN 모델 파라미터 생성 기법)

  • Seoyeon Kim;Bongjae Kim;Jinman Jung
    • Smart Media Journal
    • /
    • v.12 no.4
    • /
    • pp.19-26
    • /
    • 2023
  • IoT edge services utilizing neuromorphic hardware architectures are suitable for autonomous IoT applications as they perform intelligent processing on the device itself. However, spiking neural networks applied to neuromorphic hardware are difficult for IoT developers to comprehend due to their complex structures and various hyper-parameters. In this paper, we propose a method for generating spiking neural network (SNN) models that satisfy user performance requirements while considering the constraints of neuromorphic hardware. Our proposed method utilizes previously trained models from pre-processed data to find optimal SNN model parameters from profiling data. Comparing our method to a naive search method, both methods satisfy user requirements, but our proposed method shows better performance in terms of runtime. Additionally, even if the constraints of new hardware are not clearly known, the proposed method can provide high scalability by utilizing the profiled data of the hardware.

AutoFe-Sel: A Meta-learning based methodology for Recommending Feature Subset Selection Algorithms

  • Irfan Khan;Xianchao Zhang;Ramesh Kumar Ayyasam;Rahman Ali
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.17 no.7
    • /
    • pp.1773-1793
    • /
    • 2023
  • Automated machine learning, often referred to as "AutoML," is the process of automating the time-consuming and iterative procedures that are associated with the building of machine learning models. There have been significant contributions in this area across a number of different stages of accomplishing a data-mining task, including model selection, hyper-parameter optimization, and preprocessing method selection. Among them, preprocessing method selection is a relatively new and fast growing research area. The current work is focused on the recommendation of preprocessing methods, i.e., feature subset selection (FSS) algorithms. One limitation in the existing studies regarding FSS algorithm recommendation is the use of a single learner for meta-modeling, which restricts its capabilities in the metamodeling. Moreover, the meta-modeling in the existing studies is typically based on a single group of data characterization measures (DCMs). Nonetheless, there are a number of complementary DCM groups, and their combination will allow them to leverage their diversity, resulting in improved meta-modeling. This study aims to address these limitations by proposing an architecture for preprocess method selection that uses ensemble learning for meta-modeling, namely AutoFE-Sel. To evaluate the proposed method, we performed an extensive experimental evaluation involving 8 FSS algorithms, 3 groups of DCMs, and 125 datasets. Results show that the proposed method achieves better performance compared to three baseline methods. The proposed architecture can also be easily extended to other preprocessing method selections, e.g., noise-filter selection and imbalance handling method selection.

An Extended Function Point Model for Estimating the Implementing Cost of Machine Learning Applications (머신러닝 애플리케이션 구현 비용 평가를 위한 확장형 기능 포인트 모델)

  • Seokjin Im
    • The Journal of the Convergence on Culture Technology
    • /
    • v.9 no.2
    • /
    • pp.475-481
    • /
    • 2023
  • Softwares, especially like machine learning applications, affect human's life style tremendously. Accordingly, the importance of the cost model for softwares increases rapidly. As cost models, LOC(Line of Code) and M/M(Man-Month) estimates the quantitative aspects of the software. Differently from them, FP(Function Point) focuses on estimating the functional characteristics of software. FP is efficient in the aspect that it estimates qualitative characteristics. FP, however, has a limit for evaluating machine learning softwares because FP does not evaluate the critical factors of machine learning software. In this paper, we propose an extended function point(ExFP) that extends FP to adopt hyper parameter and the complexity of its optimization as the characteristics of the machine learning applications. In the evaluation reflecting the characteristics of machine learning applications. we reveals the effectiveness of the proposed ExFP.

A vibration-based approach for detecting arch dam damage using RBF neural networks and Jaya algorithms

  • Ali Zar;Zahoor Hussain;Muhammad Akbar;Bassam A. Tayeh;Zhibin Lin
    • Smart Structures and Systems
    • /
    • v.32 no.5
    • /
    • pp.319-338
    • /
    • 2023
  • The study presents a new hybrid data-driven method by combining radial basis functions neural networks (RBF-NN) with the Jaya algorithm (JA) to provide effective structural health monitoring of arch dams. The novelty of this approach lies in that only one user-defined parameter is required and thus can increase its effectiveness and efficiency, as compared to other machine learning techniques that often require processing a large amount of training and testing model parameters and hyper-parameters, with high time-consuming. This approach seeks rapid damage detection in arch dams under dynamic conditions, to prevent potential disasters, by utilizing the RBF-NNN to seamlessly integrate the dynamic elastic modulus (DEM) and modal parameters (such as natural frequency and mode shape) as damage indicators. To determine the dynamic characteristics of the arch dam, the JA sequentially optimizes an objective function rooted in vibration-based data sets. Two case studies of hyperbolic concrete arch dams were carefully designed using finite element simulation to demonstrate the effectiveness of the RBF-NN model, in conjunction with the Jaya algorithm. The testing results demonstrated that the proposed methods could exhibit significant computational time-savings, while effectively detecting damage in arch dam structures with complex nonlinearities. Furthermore, despite training data contaminated with a high level of noise, the RBF-NN and JA fusion remained the robustness, with high accuracy.

Prediction of Uniaxial Compressive Strength of Rock using Shield TBM Machine Data and Machine Learning Technique (쉴드 TBM 기계 데이터 및 머신러닝 기법을 이용한 암석의 일축압축강도 예측)

  • Kim, Tae-Hwan;Ko, Tae Young;Park, Yang Soo;Kim, Taek Kon;Lee, Dae Hyuk
    • Tunnel and Underground Space
    • /
    • v.30 no.3
    • /
    • pp.214-225
    • /
    • 2020
  • Uniaxial compressive strength (UCS) of rock is one of the important factors to determine the advance speed during shield TBM tunnel excavation. UCS can be obtained through the Geotechnical Data Report (GDR), and it is difficult to measure UCS for all tunneling alignment. Therefore, the purpose of this study is to predict UCS by utilizing TBM machine driving data and machine learning technique. Several machine learning techniques were compared to predict UCS, and it was confirmed the stacking model has the most successful prediction performance. TBM machine data and UCS used in the analysis were obtained from the excavation of rock strata with slurry shield TBMs. The data were divided into 8:2 for training and test and pre-processed including feature selection, scaling, and outlier removal. After completing the hyper-parameter tuning, the stacking model was evaluated with the root-mean-square error (RMSE) and the determination coefficient (R2), and it was found to be 5.556 and 0.943, respectively. Based on the results, the sacking models are considered useful in predicting rock strength with TBM excavation data.

Predicting blast-induced ground vibrations at limestone quarry from artificial neural network optimized by randomized and grid search cross-validation, and comparative analyses with blast vibration predictor models

  • Salman Ihsan;Shahab Saqib;Hafiz Muhammad Awais Rashid;Fawad S. Niazi;Mohsin Usman Qureshi
    • Geomechanics and Engineering
    • /
    • v.35 no.2
    • /
    • pp.121-133
    • /
    • 2023
  • The demand for cement and limestone crushed materials has increased many folds due to the tremendous increase in construction activities in Pakistan during the past few decades. The number of cement production industries has increased correspondingly, and so the rock-blasting operations at the limestone quarry sites. However, the safety procedures warranted at these sites for the blast-induced ground vibrations (BIGV) have not been adequately developed and/or implemented. Proper prediction and monitoring of BIGV are necessary to ensure the safety of structures in the vicinity of these quarry sites. In this paper, an attempt has been made to predict BIGV using artificial neural network (ANN) at three selected limestone quarries of Pakistan. The ANN has been developed in Python using Keras with sequential model and dense layers. The hyper parameters and neurons in each of the activation layers has been optimized using randomized and grid search method. The input parameters for the model include distance, a maximum charge per delay (MCPD), depth of hole, burden, spacing, and number of blast holes, whereas, peak particle velocity (PPV) is taken as the only output parameter. A total of 110 blast vibrations datasets were recorded from three different limestone quarries. The dataset has been divided into 85% for neural network training, and 15% for testing of the network. A five-layer ANN is trained with Rectified Linear Unit (ReLU) activation function, Adam optimization algorithm with a learning rate of 0.001, and batch size of 32 with the topology of 6-32-32-256-1. The blast datasets were utilized to compare the performance of ANN, multivariate regression analysis (MVRA), and empirical predictors. The performance was evaluated using the coefficient of determination (R2), mean absolute error (MAE), mean squared error (MSE), mean absolute percentage error (MAPE), and root mean squared error (RMSE)for predicted and measured PPV. To determine the relative influence of each parameter on the PPV, sensitivity analyses were performed for all input parameters. The analyses reveal that ANN performs superior than MVRA and other empirical predictors, andthat83% PPV is affected by distance and MCPD while hole depth, number of blast holes, burden and spacing contribute for the remaining 17%. This research provides valuable insights into improving safety measures and ensuring the structural integrity of buildings near limestone quarry sites.

Robust Semi-auto Calibration Method for Various Cameras and Illumination Changes (다양한 카메라와 조명의 변화에 강건한 반자동 카메라 캘리브레이션 방법)

  • Shin, Dong-Won;Ho, Yo-Sung
    • Journal of Broadcast Engineering
    • /
    • v.21 no.1
    • /
    • pp.36-42
    • /
    • 2016
  • Recently, many 3D contents have been produced through the multiview camera system. In this system, since a difference of the viewpoint between color and depth cameras is inevitable, the camera parameter plays the important role to adjust the viewpoint as a preprocessing step. The conventional camera calibration method is inconvenient to users since we need to choose pattern features manually after capturing a planar chessboard with various poses. Therefore, we propose a semi-auto camera calibration method using a circular sampling and an homography estimation. Firstly, The proposed method extracts the candidates of the pattern features from the images by FAST corner detector. Next, we reduce the amount of the candidates by the circular sampling and obtain the complete point cloud by the homography estimation. Lastly, we compute the accurate position having the sub-pixel accuracy of the pattern features by the approximation of the hyper parabola surface. We investigated which factor affects the result of the pattern feature detection at each step. Compared to the conventional method, we found the proposed method released the inconvenience of the manual operation but maintained the accuracy of the camera parameters.