• Title/Summary/Keyword: training models

Search Result 1,531, Processing Time 0.033 seconds

Machine Learning-based Detection of HTTP DoS Attacks for Cloud Web Applications (머신러닝 기반 클라우드 웹 애플리케이션 HTTP DoS 공격 탐지)

  • Jae Han Cho;Jae Min Park;Tae Hyeop Kim;Seung Wook Lee;Jiyeon Kim
    • Smart Media Journal
    • /
    • v.12 no.2
    • /
    • pp.66-75
    • /
    • 2023
  • Recently, the number of cloud web applications is increasing owing to the accelerated migration of enterprises and public sector information systems to the cloud. Traditional network attacks on cloud web applications are characterized by Denial of Service (DoS) attacks, which consume network resources with a large number of packets. However, HTTP DoS attacks, which consume application resources, are also increasing recently; as such, developing security technologies to prevent them is necessary. In particular, since low-bandwidth HTTP DoS attacks do not consume network resources, they are difficult to identify using traditional security solutions that monitor network metrics. In this paper, we propose a new detection model for detecting HTTP DoS attacks on cloud web applications by collecting the application metrics of web servers and learning them using machine learning. We collected 18 types of application metrics from an Apache web server and used five machine learning and two deep learning models to train the collected data. Further, we confirmed the superiority of the application metrics-based machine learning model by collecting and training 6 additional network metrics and comparing their performance with the proposed models. Among HTTP DoS attacks, we injected the RUDY and HULK attacks, which are low- and high-bandwidth attacks, respectively. As a result of detecting these two attacks using the proposed model, we found out that the F1 scores of the application metrics-based machine learning model were about 0.3 and 0.1 higher than that of the network metrics-based model, respectively.

Deletion-Based Sentence Compression Using Sentence Scoring Reflecting Linguistic Information (언어 정보가 반영된 문장 점수를 활용하는 삭제 기반 문장 압축)

  • Lee, Jun-Beom;Kim, So-Eon;Park, Seong-Bae
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.11 no.3
    • /
    • pp.125-132
    • /
    • 2022
  • Sentence compression is a natural language processing task that generates concise sentences that preserves the important meaning of the original sentence. For grammatically appropriate sentence compression, early studies utilized human-defined linguistic rules. Furthermore, while the sequence-to-sequence models perform well on various natural language processing tasks, such as machine translation, there have been studies that utilize it for sentence compression. However, for the linguistic rule-based studies, all rules have to be defined by human, and for the sequence-to-sequence model based studies require a large amount of parallel data for model training. In order to address these challenges, Deleter, a sentence compression model that leverages a pre-trained language model BERT, is proposed. Because the Deleter utilizes perplexity based score computed over BERT to compress sentences, any linguistic rules and parallel dataset is not required for sentence compression. However, because Deleter compresses sentences only considering perplexity, it does not compress sentences by reflecting the linguistic information of the words in the sentences. Furthermore, since the dataset used for pre-learning BERT are far from compressed sentences, there is a problem that this can lad to incorrect sentence compression. In order to address these problems, this paper proposes a method to quantify the importance of linguistic information and reflect it in perplexity-based sentence scoring. Furthermore, by fine-tuning BERT with a corpus of news articles that often contain proper nouns and often omit the unnecessary modifiers, we allow BERT to measure the perplexity appropriate for sentence compression. The evaluations on the English and Korean dataset confirm that the sentence compression performance of sentence-scoring based models can be improved by utilizing the proposed method.

Development of Steel Composite Cable Stayed Bridge Weigh-in-Motion System using Artificial Neural Network (인공신경망을 이용한 강합성 사장교 차량하중분석시스템 개발)

  • Park, Min-Seok;Jo, Byung-Wan;Lee, Jungwhee;Kim, Sungkon
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.28 no.6A
    • /
    • pp.799-808
    • /
    • 2008
  • The analysis of vehicular loads reflecting the domestic traffic circumstances is necessary for the development of adequate design live load models in the analysis and design of cable-supported bridges or the development of fatigue load models to predict the remaining lifespan of the bridges. This study intends to develop an ANN(artificial neural network)-based Bridge WIM system and Influence line-based Bridge WIM system for obtaining information concerning the loads conditions of vehicles crossing bridge structures by exploiting the signals measured by strain gauges installed at the bottom surface of the bridge superstructure. This study relies on experimental data corresponding to the travelling of hundreds of random vehicles rather than on theoretical data generated through numerical simulations to secure data sets for the training and test of the ANN. In addition, data acquired from 3 types of vehicles weighed statically at measurement station and then crossing the bridge repeatedly are also exploited to examine the accuracy of the trained ANN. The results obtained through the proposed ANN-based analysis method, the influence line analysis method considering the local behavior of the bridge are compared for an example cable-stayed bridge. In view of the results related to the cable-stayed bridge, the cross beam ANN analysis method appears to provide more remarkable load analysis results than the cross beam influence line method.

Development of Cloud-based VTS Integration Platform for IVEF Service Implementation (IVEF 서비스 구현을 위한 클라우드 기반 VTS 통합 플랫폼 개발)

  • Yunja Yoo;Dae-Won Kim;Chae-Uk Song;Jung-Jin Lee;Sang-Gil Lee
    • Journal of the Korean Society of Marine Environment & Safety
    • /
    • v.29 no.7
    • /
    • pp.893-901
    • /
    • 2023
  • The International Association Marine Aids to Navigation and Lighthouse Authorities (IALA) proposed guidelines for VTS manual operation in 2016 for safe and efficient operation of ship. The Korea Coast Guard (KCG) established and operated 19 VTS centers in ports and coastal waters across the country by 2022 based on the IALA VTS manual and VTS operator's education and training guidelines. In addition, IALA proposed the Inter-VTS Exchange Format (IVEF) Service recommendation (V-145), a standard for data exchange between VTS, in 2011 for efficient e-Navigation system services and safe and efficient VTS service support by VTS authorities. The IVEF service in a common framework for ship information exchange, and it presents seven basic IVEF service (BISs) models. VTS service providers can provide safer and more efficient VTS services by sharing VTS information on joint area using IVEF standards. Based on the BIS data, interaction, and interfacing models, this paper introduced the development of the cloud-based VTS integration services performed by the KCG and the results of the VTS integration platform test-bed for IVEF service implementation. In addition, the results of establishing a cloud VTS integrated platform test-bed for the implementation of IVEF service and implementing the main functions of IVEF service were presented.

Transfer Learning based DNN-SVM Hybrid Model for Breast Cancer Classification

  • Gui Rae Jo;Beomsu Baek;Young Soon Kim;Dong Hoon Lim
    • Journal of the Korea Society of Computer and Information
    • /
    • v.28 no.11
    • /
    • pp.1-11
    • /
    • 2023
  • Breast cancer is the disease that affects women the most worldwide. Due to the development of computer technology, the efficiency of machine learning has increased, and thus plays an important role in cancer detection and diagnosis. Deep learning is a field of machine learning technology based on an artificial neural network, and its performance has been rapidly improved in recent years, and its application range is expanding. In this paper, we propose a DNN-SVM hybrid model that combines the structure of a deep neural network (DNN) based on transfer learning and a support vector machine (SVM) for breast cancer classification. The transfer learning-based proposed model is effective for small training data, has a fast learning speed, and can improve model performance by combining all the advantages of a single model, that is, DNN and SVM. To evaluate the performance of the proposed DNN-SVM Hybrid model, the performance test results with WOBC and WDBC breast cancer data provided by the UCI machine learning repository showed that the proposed model is superior to single models such as logistic regression, DNN, and SVM, and ensemble models such as random forest in various performance measures.

Application of Multiple Linear Regression Analysis and Tree-Based Machine Learning Techniques for Cutter Life Index(CLI) Prediction (커터수명지수 예측을 위한 다중선형회귀분석과 트리 기반 머신러닝 기법 적용)

  • Ju-Pyo Hong;Tae Young Ko
    • Tunnel and Underground Space
    • /
    • v.33 no.6
    • /
    • pp.594-609
    • /
    • 2023
  • TBM (Tunnel Boring Machine) method is gaining popularity in urban and underwater tunneling projects due to its ability to ensure excavation face stability and minimize environmental impact. Among the prominent models for predicting disc cutter life, the NTNU model uses the Cutter Life Index(CLI) as a key parameter, but the complexity of testing procedures and rarity of equipment make measurement challenging. In this study, CLI was predicted using multiple linear regression analysis and tree-based machine learning techniques, utilizing rock properties. Through literature review, a database including rock uniaxial compressive strength, Brazilian tensile strength, equivalent quartz content, and Cerchar abrasivity index was built, and derived variables were added. The multiple linear regression analysis selected input variables based on statistical significance and multicollinearity, while the machine learning prediction model chose variables based on their importance. Dividing the data into 80% for training and 20% for testing, a comparative analysis of the predictive performance was conducted, and XGBoost was identified as the optimal model. The validity of the multiple linear regression and XGBoost models derived in this study was confirmed by comparing their predictive performance with prior research.

A Study on Real-time Autonomous Driving Simulation System Construction based on Digital Twin - Focused on Busan EDC - (디지털트윈 기반 실시간 자율주행 시뮬레이션 시스템 구축 방안 연구 - 부산 EDC 중심으로 -)

  • Kim, Min-Soo;Park, Jong-Hyun;Sim, Min-Seok
    • Journal of Cadastre & Land InformatiX
    • /
    • v.53 no.2
    • /
    • pp.53-66
    • /
    • 2023
  • Recently, there has been a significant interest in the development of autonomous driving simulation environment based on digital twin. In the development of such digital twin-based simulation environment, many researches has been conducted not only performance and functionality validation of autonomous driving, but also generation of virtual training data for deep learning. However, such digital twin-based autonomous driving simulation system has the problem of requiring a significant amount of time and cost for the system development and the data construction. Therefore, in this research, we aim to propose a method for rapidly designing and implementing a digital twin-based autonomous driving simulation system, using only the existing 3D models and high-definition map. Specifically, we propose a method for integrating 3D model of FBX and NGII HD Map for the Busan EDC area into CARLA, and a method for adding and modifying CARLA functions. The results of this research show that it is possible to rapidly design and implement the simulation system at a low cost by using the existing 3D models and NGII HD map. Also, the results show that our system can support various functions such as simulation scenario configuration, user-defined driving, and real-time simulation of traffic light states. We expect that usability of the system will be significantly improved when it is applied to broader geographical area in the future.

Three-Dimensional Printing of Congenital Heart Disease Models for Cardiac Surgery Simulation: Evaluation of Surgical Skill Improvement among Inexperienced Cardiothoracic Surgeons

  • Ju Gang Nam;Whal Lee;Baren Jeong;Eun-Ah Park;Ji Yeon Lim;Yujin Kwak;Hong-Gook Lim
    • Korean Journal of Radiology
    • /
    • v.22 no.5
    • /
    • pp.706-713
    • /
    • 2021
  • Objective: To evaluate the impact of surgical simulation training using a three-dimensional (3D)-printed model of tetralogy of Fallot (TOF) on surgical skill development. Materials and Methods: A life-size congenital heart disease model was printed using a Stratasys Object500 Connex2 printer from preoperative electrocardiography-gated CT scans of a 6-month-old patient with TOF with complex pulmonary stenosis. Eleven cardiothoracic surgeons independently evaluated the suitability of four 3D-printed models using composite Tango 27, 40, 50, and 60 in terms of palpation, resistance, extensibility, gap, cut-through ability, and reusability of. Among these, Tango 27 was selected as the final model. Six attendees (two junior cardiothoracic surgery residents, two senior residents, and two clinical fellows) independently performed simulation surgeries three times each. Surgical proficiency was evaluated by an experienced cardiothoracic surgeon on a 1-10 scale for each of the 10 surgical procedures. The times required for each surgical procedure were also measured. Results: In the simulation surgeries, six surgeons required a median of 34.4 (range 32.5-43.5) and 21.4 (17.9-192.7) minutes to apply the ventricular septal defect (VSD) and right ventricular outflow tract (RVOT) patches, respectively, on their first simulation surgery. These times had significantly reduced to 17.3 (16.2-29.5) and 13.6 (10.3-30.0) minutes, respectively, in the third simulation surgery (p = 0.03 and p = 0.01, respectively). The decreases in the median patch appliance time among the six surgeons were 16.2 (range 13.6-17.7) and 8.0 (1.8-170.3) minutes for the VSD and RVOT patches, respectively. Summing the scores for the 10 procedures showed that the attendees scored an average of 28.58 ± 7.89 points on the first simulation surgery and improved their average score to 67.33 ± 15.10 on the third simulation surgery (p = 0.008). Conclusion: Inexperienced cardiothoracic surgeons improved their performance in terms of surgical proficiency and operation time during the experience of three simulation surgeries using a 3D-printed TOF model using Tango 27 composite.

A Study on Machine Learning-Based Real-Time Gesture Classification Using EMG Data (EMG 데이터를 이용한 머신러닝 기반 실시간 제스처 분류 연구)

  • Ha-Je Park;Hee-Young Yang;So-Jin Choi;Dae-Yeon Kim;Choon-Sung Nam
    • Journal of Internet Computing and Services
    • /
    • v.25 no.2
    • /
    • pp.57-67
    • /
    • 2024
  • This paper explores the potential of electromyography (EMG) as a means of gesture recognition for user input in gesture-based interaction. EMG utilizes small electrodes within muscles to detect and interpret user movements, presenting a viable input method. To classify user gestures based on EMG data, machine learning techniques are employed, necessitating the preprocessing of raw EMG data to extract relevant features. EMG characteristics can be expressed through formulas such as Integrated EMG (IEMG), Mean Absolute Value (MAV), Simple Square Integral (SSI), Variance (VAR), and Root Mean Square (RMS). Additionally, determining the suitable time for gesture classification is crucial, considering the perceptual, cognitive, and response times required for user input. To address this, segment sizes ranging from a minimum of 100ms to a maximum of 1,000ms are varied, and feature extraction is performed to identify the optimal segment size for gesture classification. Notably, data learning employs overlapped segmentation to reduce the interval between data points, thereby increasing the quantity of training data. Using this approach, the paper employs four machine learning models (KNN, SVC, RF, XGBoost) to train and evaluate the system, achieving accuracy rates exceeding 96% for all models in real-time gesture input scenarios with a maximum segment size of 200ms.

Performance Evaluation of LSTM-based PM2.5 Prediction Model for Learning Seasonal and Concentration-specific Data (계절별 데이터와 농도별 데이터의 학습에 대한 LSTM 기반의 PM2.5 예측 모델 성능 평가)

  • Yong-jin Jung;Chang-Heon Oh
    • Journal of Advanced Navigation Technology
    • /
    • v.28 no.1
    • /
    • pp.149-154
    • /
    • 2024
  • Research on particulate matter is advancing in real-time, and various methods are being studied to improve the accuracy of prediction models. Furthermore, studies that take into account various factors to understand the precise causes and impacts of particulate matter are actively being pursued. This paper trains an LSTM model using seasonal data and another LSTM model using concentration-based data. It compares and analyzes the PM2.5 prediction performance of the two models. To train the model, weather data and air pollutant data were collected. The collected data was then used to confirm the correlation with PM2.5. Based on the results of the correlation analysis, the data was structured for training and evaluation. The seasonal prediction model and the concentration-specific prediction model were designed using the LSTM algorithm. The performance of the prediction model was evaluated using accuracy, RMSE, and MAPE. As a result of the performance evaluation, the prediction model learned by concentration had an accuracy of 91.02% in the "bad" range of AQI. And overall, it performed better than the prediction model trained by season.