• Title/Summary/Keyword: Early prediction

Search Result 884, Processing Time 0.027 seconds

A Comparative Study of Machine Learning Algorithms Using LID-DS DataSet (LID-DS 데이터 세트를 사용한 기계학습 알고리즘 비교 연구)

  • Park, DaeKyeong;Ryu, KyungJoon;Shin, DongIl;Shin, DongKyoo;Park, JeongChan;Kim, JinGoog
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.10 no.3
    • /
    • pp.91-98
    • /
    • 2021
  • Today's information and communication technology is rapidly developing, the security of IT infrastructure is becoming more important, and at the same time, cyber attacks of various forms are becoming more advanced and sophisticated like intelligent persistent attacks (Advanced Persistent Threat). Early defense or prediction of increasingly sophisticated cyber attacks is extremely important, and in many cases, the analysis of network-based intrusion detection systems (NIDS) related data alone cannot prevent rapidly changing cyber attacks. Therefore, we are currently using data generated by intrusion detection systems to protect against cyber attacks described above through Host-based Intrusion Detection System (HIDS) data analysis. In this paper, we conducted a comparative study on machine learning algorithms using LID-DS (Leipzig Intrusion Detection-Data Set) host-based intrusion detection data including thread information, metadata, and buffer data missing from previously used data sets. The algorithms used were Decision Tree, Naive Bayes, MLP (Multi-Layer Perceptron), Logistic Regression, LSTM (Long Short-Term Memory model), and RNN (Recurrent Neural Network). Accuracy, accuracy, recall, F1-Score indicators and error rates were measured for evaluation. As a result, the LSTM algorithm had the highest accuracy.

Establishment of a Nondestructive Analysis Method for Lignan Content in Sesame using Near Infrared Reflectance Spectroscopy (근적외선분광(NIRS)을 이용한 참깨의 lignan 함량 비파괴 분석 방법 확립)

  • Lee, Jeongeun;Kim, Sung-Up;Lee, Myoung-Hee;Kim, Jung-In;Oh, Eun-Young;Kim, Sang-Woo;Kim, MinYoung;Park, Jae-Eun;Cho, Kwang-Soo;Oh, Ki-Won
    • KOREAN JOURNAL OF CROP SCIENCE
    • /
    • v.67 no.1
    • /
    • pp.61-66
    • /
    • 2022
  • Sesamin and sesamolin are major lignan components with a wide range of potential biological activities of sesame seeds. Near infrared reflectance spectroscopy (NIRS) is a rapid and non-destructive analysis method widely used for the quantitative determination of major components in many agricultural products. This study was conducted to develop a screening method to determine the lignan contents for sesame breeding. Sesamin and sesamolin contents of 482 sesame samples ranged from 0.03-14.40 mg/g and 0.10-3.79 mg/g with an average of 4.93 mg/g and 1.74 mg/g, respectively. Each sample was scanned using NIRS and calculated for the calibration and validation equations. The optimal performance calibration model was obtained from the original spectra using partial least squares (PLS). The coefficient of determination in calibration (R2) and standard error of calibration (SEC) were 0.963 and 0.861 for sesamin and 0.875 and 0.292 for sesamolin, respectively. Cross-validation results of the NIRS equation showed an R2 of 0.889 in the prediction for sesamin and 0.781 for sesamolin and a standard error of cross-validation (SECV) of 1.163 for sesamin and 0.417 for sesamolin. The results showed that the NIRS equation for sesamin and sesamolin could be effective in selecting high lignan sesame lines in early generations of sesame breeding.

Analysis of ROX Index, ROX-HR Index, and SpO2/FIO2 Ratio in Patients Who Received High-Flow Nasal Cannula Oxygen Therapy in Pediatric Intensive Care Unit (고유량 비강 캐뉼라 산소요법을 받은 소아중환자실 환아의 ROX Index와 ROX-HR Index 및 SpO2/FIO2 Ratio분석)

  • Choi, Sun Hee;Kim, Dong Yeon;Song, Byung Yun;Yoo, Yang Sook
    • Journal of Korean Academy of Nursing
    • /
    • v.53 no.4
    • /
    • pp.468-479
    • /
    • 2023
  • Purpose: This study aimed to evaluate the use of the respiratory rate oxygenation (ROX) index, ROX-heart rate (ROX-HR) index, and saturation of percutaneous oxygen/fraction of inspired oxygen ratio (SF ratio) to predict weaning from high-flow nasal cannula (HFNC) in patients with respiratory distress in a pediatric intensive care unit. Methods: A total of 107 children admitted to the pediatric intensive care unit were enrolled in the study between January 1, 2017, and December 31, 2021. Data on clinical and personal information, ROX index, ROX-HR index, and SF ratio were collected from nursing records. The data were analyzed using an independent t-test, χ2 test, Mann-Whitney U test, and area under the curve (AUC). Results: Seventy-five (70.1%) patients were successfully weaned from HFNC, while 32 (29.9%) failed. Considering specificity and sensitivity, the optimal cut off points for predicting treatment success and failure of HFNC oxygen therapy were 6.88 and 10.16 (ROX index), 5.23 and 8.61 (ROX-HR index), and 198.75 and 353.15 (SF ratio), respectively. The measurement of time showed that the most significant AUC was 1 hour before HFNC interruption. Conclusion: The ROX index, ROX-HR index, and SF ratio appear to be promising tools for the early prediction of treatment success or failure in patients initiated on HFNC for acute hypoxemic respiratory failure. Nurses caring for critically ill pediatric patients should closely observe and periodically check their breathing patterns. It is important to continuously monitor three indexes to ensure that ventilation assistance therapy is started at the right time.

Analysis on Strategies for Modeling the Wave Equation with Physics-Informed Neural Networks (물리정보신경망을 이용한 파동방정식 모델링 전략 분석)

  • Sangin Cho;Woochang Choi;Jun Ji;Sukjoon Pyun
    • Geophysics and Geophysical Exploration
    • /
    • v.26 no.3
    • /
    • pp.114-125
    • /
    • 2023
  • The physics-informed neural network (PINN) has been proposed to overcome the limitations of various numerical methods used to solve partial differential equations (PDEs) and the drawbacks of purely data-driven machine learning. The PINN directly applies PDEs to the construction of the loss function, introducing physical constraints to machine learning training. This technique can also be applied to wave equation modeling. However, to solve the wave equation using the PINN, second-order differentiations with respect to input data must be performed during neural network training, and the resulting wavefields contain complex dynamical phenomena, requiring careful strategies. This tutorial elucidates the fundamental concepts of the PINN and discusses considerations for wave equation modeling using the PINN approach. These considerations include spatial coordinate normalization, the selection of activation functions, and strategies for incorporating physics loss. Our experimental results demonstrated that normalizing the spatial coordinates of the training data leads to a more accurate reflection of initial conditions in neural network training for wave equation modeling. Furthermore, the characteristics of various functions were compared to select an appropriate activation function for wavefield prediction using neural networks. These comparisons focused on their differentiation with respect to input data and their convergence properties. Finally, the results of two scenarios for incorporating physics loss into the loss function during neural network training were compared. Through numerical experiments, a curriculum-based learning strategy, applying physics loss after the initial training steps, was more effective than utilizing physics loss from the early training steps. In addition, the effectiveness of the PINN technique was confirmed by comparing these results with those of training without any use of physics loss.

Comparison of Effective Soil Depth Classification Methods Using Topographic Information (지형정보를 이용한 유효토심 분류방법비교)

  • Byung-Soo Kim;Ju-Sung Choi;Ja-Kyung Lee;Na-Young Jung;Tae-Hyung Kim
    • Journal of the Korean Geosynthetics Society
    • /
    • v.22 no.2
    • /
    • pp.1-12
    • /
    • 2023
  • Research on the causes of landslides and prediction of vulnerable areas is being conducted globally. This study aims to predict the effective soil depth, a critical element in analyzing and forecasting landslide disasters, using topographic information. Topographic data from various institutions were collected and assigned as attribute information to a 100 m × 100 m grid, which was then reduced through data grading. The study predicted effective soil depth for two cases: three depths (shallow, normal, deep) and five depths (very shallow, shallow, normal, deep, very deep). Three classification models, including K-Nearest Neighbor, Random Forest, and Deep Artificial Neural Network, were used, and their performance was evaluated by calculating accuracy, precision, recall, and F1-score. Results showed that the performance was in the high 50% to early 70% range, with the accuracy of the three classification criteria being about 5% higher than the five criteria. Although the grading criteria and classification model's performance presented in this study are still insufficient, the application of the classification model is possible in predicting the effective soil depth. This study suggests the possibility of predicting more reliable values than the current effective soil depth, which assumes a large area uniformly.

A Review of Seismic Full Waveform Inversion Based on Deep Learning (딥러닝 기반 탄성파 전파형 역산 연구 개관)

  • Sukjoon, Pyun;Yunhui, Park
    • Geophysics and Geophysical Exploration
    • /
    • v.25 no.4
    • /
    • pp.227-241
    • /
    • 2022
  • Full waveform inversion (FWI) in the field of seismic data processing is an inversion technique that is used to estimate the velocity model of the subsurface for oil and gas exploration. Recently, deep learning (DL) technology has been increasingly used for seismic data processing, and its combination with FWI has attracted remarkable research efforts. For example, DL-based data processing techniques have been utilized for preprocessing input data for FWI, enabling the direct implementation of FWI through DL technology. DL-based FWI can be divided into the following methods: pure data-based, physics-based neural network, encoder-decoder, reparameterized FWI, and physics-informed neural network. In this review, we describe the theory and characteristics of the methods by systematizing them in the order of advancements. In the early days of DL-based FWI, the DL model predicted the velocity model by preparing a large training data set to adopt faithfully the basic principles of data science and apply a pure data-based prediction model. The current research trend is to supplement the shortcomings of the pure data-based approach using the loss function consisting of seismic data or physical information from the wave equation itself in deep neural networks. Based on these developments, DL-based FWI has evolved to not require a large amount of learning data, alleviating the cycle-skipping problem, which is an intrinsic limitation of FWI, and reducing computation times dramatically. The value of DL-based FWI is expected to increase continually in the processing of seismic data.

Prediction of Postoperative Lung Function in Lung Cancer Patients Using Machine Learning Models

  • Oh Beom Kwon;Solji Han;Hwa Young Lee;Hye Seon Kang;Sung Kyoung Kim;Ju Sang Kim;Chan Kwon Park;Sang Haak Lee;Seung Joon Kim;Jin Woo Kim;Chang Dong Yeo
    • Tuberculosis and Respiratory Diseases
    • /
    • v.86 no.3
    • /
    • pp.203-215
    • /
    • 2023
  • Background: Surgical resection is the standard treatment for early-stage lung cancer. Since postoperative lung function is related to mortality, predicted postoperative lung function is used to determine the treatment modality. The aim of this study was to evaluate the predictive performance of linear regression and machine learning models. Methods: We extracted data from the Clinical Data Warehouse and developed three sets: set I, the linear regression model; set II, machine learning models omitting the missing data: and set III, machine learning models imputing the missing data. Six machine learning models, the least absolute shrinkage and selection operator (LASSO), Ridge regression, ElasticNet, Random Forest, eXtreme gradient boosting (XGBoost), and the light gradient boosting machine (LightGBM) were implemented. The forced expiratory volume in 1 second measured 6 months after surgery was defined as the outcome. Five-fold cross-validation was performed for hyperparameter tuning of the machine learning models. The dataset was split into training and test datasets at a 70:30 ratio. Implementation was done after dataset splitting in set III. Predictive performance was evaluated by R2 and mean squared error (MSE) in the three sets. Results: A total of 1,487 patients were included in sets I and III and 896 patients were included in set II. In set I, the R2 value was 0.27 and in set II, LightGBM was the best model with the highest R2 value of 0.5 and the lowest MSE of 154.95. In set III, LightGBM was the best model with the highest R2 value of 0.56 and the lowest MSE of 174.07. Conclusion: The LightGBM model showed the best performance in predicting postoperative lung function.

Determinants of Willingness to Undergo Lung Cancer Screening among High-Risk Current and Ex-smokers in Sabah, Malaysia: A Cross-Sectional Pilot Study

  • Larry Ellee Nyanti;Chia Zhen Chua;Han Chuan Loo;Cheng Zhi Khor;Emilia Sheau Yuin Toh;Rasvinder Singh Gill;Eng Tat Chan;Ker Yin Tan;Taufiq Rosli;Muhammad Aklil Abd Rahim;Arfian Ibrahim;Nai Chien Huan;Hema Yamini Devi Ramarmuty;Kunji Kannan Sivaraman Kannan
    • Tuberculosis and Respiratory Diseases
    • /
    • v.86 no.4
    • /
    • pp.284-293
    • /
    • 2023
  • Background: Attitudes towards smoking, lung cancer screening, and perceived risk of lung cancer have not been widely studied in Malaysia. The primary objective of this study was to describe the factors affecting the willingness of high-risk current smokers and ex-smokers to undergo low-dose computed tomography (LDCT) screening for lung cancer. Methods: A prospective, cross-sectional questionnaire study was conducted in current smokers or ex-smokers aged between 55 and 80 years at three hospitals in Kota Kinabalu, Sabah, Malaysia. The questionnaire recorded the following parameters: perceived lung cancer risk; Prostate Lung Colon Ovarian Cancer 2012 risk prediction model excluding race and ethnicity predictor (PLCOm2012norace); demographic characteristics; psychosocial characteristics; and attitudes towards lung cancer and lung cancer screening. Results: A vast majority of the 95 respondents (94.7%) indicated their willingness to undergo screening. Stigma of lung cancer, low levels of knowledge about lung cancer symptoms, concerns about financial constraints, and a preference for traditional medication were still prevalent among the respondents, and they may represent potential barriers to lung cancer screening uptake. A desire to have an early diagnosis (odds ratio [OR], 11.33; 95% confidence interval [CI], 1.53 to 84.05; p=0.02), perceived time constraints (OR, 3.94; 95% CI, 1.32 to 11.73; p=0.01), and proximity of LDCT screening facilities (OR, 14.33; 95% CI, 1.84 to 111.4; p=0.01) had significantly higher odds of willingness to undergo screening. Conclusion: Although high-risk current smokers and ex-smokers are likely to undergo screening for lung cancer, several psychosocial barriers persist. The results of this study may guide the policymakers and clinicians regarding the need to improve lung cancer awareness in our population.

Development of smart car intelligent wheel hub bearing embedded system using predictive diagnosis algorithm

  • Sam-Taek Kim
    • Journal of the Korea Society of Computer and Information
    • /
    • v.28 no.10
    • /
    • pp.1-8
    • /
    • 2023
  • If there is a defect in the wheel bearing, which is a major part of the car, it can cause problems such as traffic accidents. In order to solve this problem, big data is collected and monitoring is conducted to provide early information on the presence or absence of wheel bearing failure and type of failure through predictive diagnosis and management technology. System development is needed. In this paper, to implement such an intelligent wheel hub bearing maintenance system, we develop an embedded system equipped with sensors for monitoring reliability and soundness and algorithms for predictive diagnosis. The algorithm used acquires vibration signals from acceleration sensors installed in wheel bearings and can predict and diagnose failures through big data technology through signal processing techniques, fault frequency analysis, and health characteristic parameter definition. The implemented algorithm applies a stable signal extraction algorithm that can minimize vibration frequency components and maximize vibration components occurring in wheel bearings. In noise removal using a filter, an artificial intelligence-based soundness extraction algorithm is applied, and FFT is applied. The fault frequency was analyzed and the fault was diagnosed by extracting fault characteristic factors. The performance target of this system was over 12,800 ODR, and the target was met through test results.

Real-Time Flood Forecasting by Using a Measured Data Based Nomograph for Small Streams (계측자료 기반 Nomograph를 이용한 실시간 소하천 홍수량 산정 연구)

  • Tae Sung Cheong;Changwon Choi;Sung Je Yei;Kang Min Koo
    • Ecology and Resilient Infrastructure
    • /
    • v.10 no.4
    • /
    • pp.116-124
    • /
    • 2023
  • As the flood damage on small streams increase due to the increase in frequency of extreme climate events, the need to measure hydraulic data of them has increased for disaster risk management. National Disaster Management Institute, Ministry of Interior and Safety develops CADMT, a CCTV-based automatic discharge measurement technology, and operates pilot small streams to verify its performance and develop disaster risk management technology. The research selects two small streams such as the Neungmac and the Jungsunpil streams to develop the Nomograph by using the 4-Parameter Logistic method using only the observed rainfall data from the Automatic Weather System operated by the Korea Meteorological Agency closest to the small streams and discharge data collected by using the CADMT. To evaluate developed Nomograph, the research forecasts floods discharges in each small stream and compares the result with the observed discharges. As a result of the evaluations, the forecasted value is found to represent the observed value well, so if more accurate observed data are collected and the Nomograph based on it is developed in the future, the high-accuracy flood prediction and warning will be possible.