• Title/Summary/Keyword: machine learning

A Study on XAI-based Clinical Decision Support System (XAI 기반의 임상의사결정시스템에 관한 연구)

  • Ahn, Yoon-Ae;Cho, Han-Jin
    • The Journal of the Korea Contents Association / v.21 no.12 / pp.13-22 / 2021
  • A clinical decision support system applies an AI model, trained by machine learning on accumulated medical data, to patient diagnosis and treatment prediction. However, existing black-box AI applications do not give a valid reason for the results they predict, so they lack explainability. To address this problem, this paper proposes a system model that applies explainable AI (XAI) in the development stage of a clinical decision support system. The proposed model supplements the limitations of the black box by adding a specific XAI technique to the existing AI model. To show how the proposed model is applied, we present an example of XAI application using LIME and SHAP. Testing shows that it is possible to explain, from various perspectives, how the data affect the model's prediction results. The proposed model has the advantage of increasing user trust by presenting a specific reason for each prediction. In addition, the active use of XAI is expected to overcome the limitations of existing clinical decision support systems and to enable better diagnosis and decision support.
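
The kind of attribution SHAP reports can be illustrated with an exact Shapley value computation on a tiny model. This is a minimal sketch, not the paper's system: the `toy_risk` function, its three features, and the reference patient are all hypothetical, and real SHAP implementations approximate this sum rather than enumerating every subset.

```python
from itertools import combinations
from math import factorial


def shapley_values(predict, x, baseline):
    """Exact Shapley attributions of predict(x) relative to predict(baseline)."""
    n = len(x)
    features = range(n)

    def value(subset):
        # Features in `subset` take the explained values; the rest keep the baseline.
        mixed = [x[i] if i in subset else baseline[i] for i in features]
        return predict(mixed)

    phi = []
    for i in features:
        others = [j for j in features if j != i]
        total = 0.0
        for size in range(n):
            # Shapley weight for coalitions of this size that exclude feature i.
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                s = set(subset)
                total += weight * (value(s | {i}) - value(s))
        phi.append(total)
    return phi


# Hypothetical risk score with an interaction term (illustrative only).
def toy_risk(v):
    age, blood_pressure, smoker = v
    return 0.02 * age + 0.01 * blood_pressure + 0.5 * smoker * (age > 50)


patient = [60, 140, 1]
reference = [40, 120, 0]
attributions = shapley_values(toy_risk, patient, reference)
```

The attributions sum to the difference between the patient's score and the reference score, which is what lets each one be read as that feature's share of the prediction.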

Behavior and Script Similarity-Based Cryptojacking Detection Framework Using Machine Learning (머신러닝을 활용한 행위 및 스크립트 유사도 기반 크립토재킹 탐지 프레임워크)

  • Lim, EunJi;Lee, EunYoung;Lee, IlGu
    • Journal of the Korea Institute of Information Security & Cryptology / v.31 no.6 / pp.1105-1114 / 2021
  • Due to the recent surge in the popularity of cryptocurrency, the threat of cryptojacking, in which malicious code mines cryptocurrency, is increasing. In particular, web-based cryptojacking is easy to carry out because the attacker can mine cryptocurrency using the victim's PC resources simply by adding mining scripts to a website that the victim accesses. The attack causes poor performance and malfunction, and it can also cause hardware failure due to overheating and aging caused by mining. Because it is difficult for victims to recognize the damage, research is needed to efficiently detect and block cryptojacking. In this work, we take representative, distinct symptoms of cryptojacking as indicators and propose a new architecture. For behavior-based dynamic analysis, we used a K-Nearest Neighbors (KNN) model trained on computer performance indicators. For script similarity-based static analysis, we used a K-means model trained on the frequency of malicious script words. The KNN model achieved 99.6% accuracy, and the K-means model had a silhouette coefficient of 0.61 for normal clusters.
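
The behavior-based step can be pictured as a nearest-neighbor vote over performance indicators. The sketch below is an assumption-laden illustration: the two indicators (CPU usage and temperature), the sample values, and `k=3` are made up, and the abstract does not say which indicators or settings the authors used.

```python
from collections import Counter
from math import dist


def knn_predict(train_points, train_labels, query, k=3):
    """Label `query` by majority vote among its k nearest training points."""
    ranked = sorted(
        zip(train_points, train_labels),
        key=lambda pair: dist(pair[0], query),  # Euclidean distance
    )
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


# Made-up samples: (CPU usage in %, CPU temperature in degrees Celsius).
samples = [(12, 45), (20, 50), (15, 47), (95, 88), (90, 85), (98, 90)]
labels = ["normal", "normal", "normal", "mining", "mining", "mining"]

verdict = knn_predict(samples, labels, (93, 86))
```

In practice the indicators would be scaled to comparable ranges first, since Euclidean distance otherwise favors whichever indicator has the largest numeric spread.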

Comparison of survival prediction models for pancreatic cancer: Cox model versus machine learning models

  • Kim, Hyunsuk;Park, Taesung;Jang, Jinyoung;Lee, Seungyeoun
    • Genomics & Informatics / v.20 no.2 / pp.23.1-23.9 / 2022
  • A survival prediction model has recently been developed to evaluate the prognosis of resected nonmetastatic pancreatic ductal adenocarcinoma based on a Cox model using two nationwide databases: Surveillance, Epidemiology and End Results (SEER) and Korea Tumor Registry System-Biliary Pancreas (KOTUS-BP). In this study, we applied two machine learning methods, random survival forests (RSF) and support vector machines (SVM), to survival analysis and compared their prediction performance using the SEER and KOTUS-BP datasets. Three schemes were used for model development and evaluation. First, we used data from SEER for model development and data from KOTUS-BP for external evaluation. Second, the two datasets were swapped, with data from KOTUS-BP used for model development and data from SEER for external evaluation. Finally, we mixed the two datasets half and half and used the mixed datasets for model development and validation. We used 9,624 patients from SEER and 3,281 patients from KOTUS-BP to construct a prediction model with seven covariates: age, sex, histologic differentiation, adjuvant treatment, resection margin status, and the American Joint Committee on Cancer 8th edition T-stage and N-stage. Comparing the three schemes, the performance of the Cox model, RSF, and SVM was better with the mixed datasets than with the unmixed datasets. With the mixed datasets, the C-index and the 1-year, 2-year, and 3-year time-dependent areas under the curve for the Cox model were 0.644, 0.698, 0.680, and 0.687, respectively. The Cox model performed slightly better than RSF and SVM.
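
The C-index reported above measures how often a model ranks two patients' risks in the same order as their observed survival. The following is a minimal sketch of Harrell's C-index under simple assumptions: the four-patient data are invented, tied event times are ignored, and this is not the authors' evaluation code.

```python
def concordance_index(times, events, risk_scores):
    """Harrell's C-index: share of comparable pairs ordered correctly by risk."""
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        for j in range(n):
            # A pair is comparable when subject i had the event before time j.
            if events[i] and times[i] < times[j]:
                comparable += 1
                if risk_scores[i] > risk_scores[j]:
                    concordant += 1.0  # earlier event had the higher risk
                elif risk_scores[i] == risk_scores[j]:
                    concordant += 0.5  # tied risk counts as half
    return concordant / comparable


# Invented follow-up data: months observed, event flag (0 = censored), risk.
months = [5, 10, 12, 20]
died = [1, 1, 0, 1]
risk = [0.9, 0.4, 0.6, 0.1]

c_index = concordance_index(months, died, risk)
```

A value of 0.5 corresponds to random ordering and 1.0 to perfect ordering, which puts the reported 0.644 in context.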

Machine learning based anti-cancer drug response prediction and search for predictor genes using cancer cell line gene expression

  • Qiu, Kexin;Lee, JoongHo;Kim, HanByeol;Yoon, Seokhyun;Kang, Keunsoo
    • Genomics & Informatics / v.19 no.1 / pp.10.1-10.7 / 2021
  • Although many models have been proposed in recent years to accurately predict drug response in cell lines, understanding the genes related to drug response is also key to completing precision medicine in oncology. In this paper, based on cancer cell line gene expression and drug response data, we established a reliable and accurate drug response prediction model and found predictor genes for some drugs of interest. To this end, we first pre-selected genes based on the Pearson correlation coefficient and then used an ElasticNet regression model for drug response prediction and fine gene selection. To find a more reliable set of predictor genes, we performed regression twice for each drug, once with IC50 and once with area under the curve (AUC) (or activity area). For the 12 drugs we tested, the predictive performance in terms of the Pearson correlation coefficient exceeded 0.6, and the highest was for 17-AAG, for which the Pearson correlation coefficient was 0.811 for IC50 and 0.81 for AUC. We identified common predictor genes for IC50 and AUC, with which the performance was similar to that obtained with genes found separately for IC50 and AUC, but with a much smaller number of predictor genes. Using only common predictor genes, the highest performance was for AZD6244 (0.8016 for IC50, 0.7945 for AUC) with 321 predictor genes.
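
The pre-selection step can be sketched as a correlation filter applied before any regression is fitted. In this illustration the gene names, expression values, and the 0.5 threshold are hypothetical (the abstract gives no cutoff), and the subsequent ElasticNet fit is not shown.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


def preselect_genes(expression, response, threshold):
    """Keep genes whose absolute correlation with the response meets the threshold."""
    return [
        gene
        for gene, values in expression.items()
        if abs(pearson(values, response)) >= threshold
    ]


# Hypothetical expression levels across five cell lines.
expression = {
    "GENE_A": [1.0, 2.0, 3.0, 4.0, 5.0],
    "GENE_B": [5.0, 3.9, 3.1, 2.0, 1.2],
    "GENE_C": [2.0, 5.0, 1.0, 4.0, 3.0],
}
ic50 = [0.9, 2.1, 2.9, 4.2, 5.0]  # hypothetical drug response per cell line

selected = preselect_genes(expression, ic50, threshold=0.5)
```

Taking the absolute value keeps genes that are negatively correlated with response as well as positively correlated ones.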

Analysis and Design of Cattle Management System based on IoT (사물인터넷 기반 소관리 시스템의 분석 및 설계)

  • Cho, Byung-Ho
    • The Journal of the Institute of Internet, Broadcasting and Communication / v.21 no.2 / pp.125-130 / 2021
  • A livestock smart farm can be implemented more effectively as IoT technology develops. A useful livestock management system becomes possible if cattle illness and estrus are detected using IoT technology and the results are sent as push notifications to a smartphone. Illness and estrus can be detected by gathering live data from the cattle with a temperature sensor and a 3-axis acceleration sensor, sending these data to a server over the IoT and internet network, and training an AI machine learning model on them. In this paper, we present an effective overall system architecture for building this IoT-based cattle management system. We also present an analysis and design method for developing the system software, showing user requirement analysis using the object-oriented method, flowcharts, and screen designs.

Evaluating flexural strength of concrete with steel fibre by using machine learning techniques

  • Sharma, Nitisha;Thakur, Mohindra S.;Upadhya, Ankita;Sihag, Parveen
    • Composite Materials and Engineering / v.3 no.3 / pp.201-220 / 2021
  • In this study, the potential of three machine learning techniques, namely M5P, support vector machines, and Gaussian processes, was evaluated to find the best algorithm for predicting the flexural strength of concrete mix with steel fibre. The study compares the results obtained from these techniques on a given dataset. The dataset consists of 124 observations from past research studies and is randomly divided into training and testing subsets in a 70:30 proportion. Cement, fine aggregates, coarse aggregates, water, super plasticizer/high-range water reducer, steel fibre, fibre length, and curing days were taken as input parameters, whereas the flexural strength of the concrete mix was taken as the output parameter. The performance of the techniques was checked with statistical evaluation parameters. Results show that the Gaussian process technique works better than the other techniques, with the minimum error bandwidth. Statistical analysis shows that the Gaussian process gives better predictions, with a higher coefficient of correlation (CC) of 0.9138, the minimum mean absolute error (1.2954), and the minimum root mean square error (1.9672). Sensitivity analysis indicates that steel fibre is the most significant parameter for predicting the flexural strength of concrete mix. With respect to fibre shape, the mixed type performs better on this dataset than the hooked steel fibre, with a higher CC of 0.9649, which shows that fibre shape does affect the flexural strength of the concrete. However, the intricacy of the mixed fibres needs further investigation. For future mixes, the most favorable range for increasing the flexural strength of concrete mix was found to be 1-3%.
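
Two of the error measures named above, mean absolute error and root mean square error, are straightforward to compute from paired measurements and predictions. This is a minimal sketch with invented strength values, not the study's data.

```python
from math import sqrt


def mean_absolute_error(actual, predicted):
    """Average magnitude of the prediction errors."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def root_mean_square_error(actual, predicted):
    """Square root of the average squared error; penalizes large misses more."""
    return sqrt(sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual))


# Invented flexural strengths: measured versus model estimate.
measured = [5.0, 6.5, 8.0, 7.0]
estimated = [5.5, 6.0, 9.0, 7.0]

mae = mean_absolute_error(measured, estimated)
rmse = root_mean_square_error(measured, estimated)
```

RMSE is never smaller than MAE on the same errors, so a large gap between the two, as in the reported 1.2954 versus 1.9672, suggests a few comparatively large misses.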

Water consumption prediction based on machine learning methods and public data

  • Kesornsit, Witwisit;Sirisathitkul, Yaowarat
    • Advances in Computational Design / v.7 no.2 / pp.113-128 / 2022
  • Water consumption is strongly affected by numerous factors, such as population, climatic, geographic, and socio-economic factors. Therefore, implementing a reliable predictive model of water consumption patterns is a challenging task. This study investigates the performance of predictive models based on multi-layer perceptron (MLP), multiple linear regression (MLR), and support vector regression (SVR). To identify the significant factors affecting water consumption, the stepwise regression (SW) procedure is used in MLR to obtain suitable variables. The study then implements three further predictive models based on these significant variables (SWMLR, SWMLP, and SWSVR). Annual water consumption data for Thailand from 2006 to 2015 were compiled and categorized by province and distributor. Comparing the models that use all variables, the results show that the MLP models outperformed the MLR and SVR models. Among the models with selected variables, the predictive capability of SWMLP was superior to that of SWMLR and SWSVR. The SWMLP thus still provided satisfactory results with the minimum number of explanatory variables, which in turn reduced the computation time and other resources required for the predictive task. It can be concluded that the MLP gave the best results and can be used as a reliable water demand predictive model in both the all-variables and selected-variables cases. These findings provide a feasible water consumption predictive model that can be used in water resources management to produce sufficient tap water to meet the demand in each province of Thailand.

2-Stage Detection and Classification Network for Kiosk User Analysis (디스플레이형 자판기 사용자 분석을 위한 이중 단계 검출 및 분류 망)

  • Seo, Ji-Won;Kim, Mi-Kyung
    • Journal of the Korea Institute of Information and Communication Engineering / v.26 no.5 / pp.668-674 / 2022
  • Machine learning techniques using visual data are highly usable in industry and service fields such as scene recognition, fault detection, security, and user analysis. Among these, user analysis of CCTV video is one of the practical ways of using vision data. Many studies on lightweight artificial neural networks have also been published to improve usability in mobile and embedded environments. In this study, we propose a network that combines object detection and classification for mobile graphics processing units. The network detects pedestrians and faces, then classifies age and gender from each detected face. The proposed network is built on MobileNet, YOLOv2, and skip connections. The detection and classification models are trained individually and combined into a 2-stage structure. An attention mechanism is also used to improve detection and classification ability. An Nvidia Jetson Nano is used to run and evaluate the proposed system.

A Study on the Development of Model for Estimating the Thickness of Clay Layer of Soft Ground in the Nakdong River Estuary (낙동강 조간대 연약지반의 지역별 점성토층 두께 추정 모델 개발에 관한 연구)

  • Ahn, Seongin;Ryu, Dong-Woo
    • Tunnel and Underground Space / v.32 no.6 / pp.586-597 / 2022
  • In this study, a model was developed to estimate the location-specific thickness of the upper clay layer, for use in consolidation vulnerability evaluation in the Nakdong River estuary. To estimate layer thickness, we developed four spatial estimation models: three using the machine learning algorithms RF (Random Forest), SVR (Support Vector Regression), and GPR (Gaussian Process Regression), and one using the geostatistical technique Ordinary Kriging. Of the 4,712 borehole records collected in the study area for model development, the 2,948 records with an upper clay layer were used, and the Pearson correlation coefficient and mean squared error were used to quantitatively evaluate the performance of the developed models. In addition, for qualitative evaluation, each model was used to estimate the upper clay layer over the entire study area, and the resulting thickness distributions were compared with each other.

Thermoluminescence Kinetics of LYGBO Crystal (LYGBO 단결정의 열형광 전자포획준위 인자)

  • Kim, Sunghwan
    • Journal of the Korean Society of Radiology / v.17 no.1 / pp.17-23 / 2023
  • In this study, we investigated the thermoluminescence kinetics of the electron trap in the Li6Y0.5Gd0.5(BO3)3 (LY0.5G0.5BO) scintillator for neutron detection, which is composed of Li, Gd, and B, elements with high neutron response cross-sections. The thermoluminescence glow curve of the LY0.5G0.5BO scintillation single crystal was measured and analyzed using the peak shape method, the initial rise method, and a machine learning algorithm to evaluate the physical parameters of the electron trap. The glow curve of the LY0.5G0.5BO scintillation single crystal consisted of a single peak. Analysis of this peak gave an activation energy, emission order, and frequency factor of the electron trap of 0.61 eV, 1.1, and 1.7×10^7 s^-1, respectively. In addition, the possibility of thermoluminescence analysis of scintillators using machine learning was confirmed.
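
The initial rise method mentioned above rests on the fact that, on the low-temperature tail of a glow peak, intensity is approximately proportional to exp(-E/(kT)), so the slope of ln(I) against 1/T gives -E/k. The sketch below fits that slope by least squares. The glow-curve points are synthetic, generated from the reported 0.61 eV purely as a self-check, and the temperature range and prefactor are arbitrary assumptions.

```python
from math import exp, log

BOLTZMANN_EV_PER_K = 8.617333e-5  # Boltzmann constant in eV/K


def initial_rise_energy(temperatures_k, intensities):
    """Estimate activation energy (eV) from the slope of ln(I) versus 1/T."""
    xs = [1.0 / t for t in temperatures_k]
    ys = [log(i) for i in intensities]
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    # Ordinary least-squares slope of ys against xs.
    slope = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / sum(
        (x - mean_x) ** 2 for x in xs
    )
    return -slope * BOLTZMANN_EV_PER_K


# Synthetic low-temperature tail (not measured data), built with E = 0.61 eV.
temps = [300.0, 305.0, 310.0, 315.0, 320.0]
signal = [1e12 * exp(-0.61 / (BOLTZMANN_EV_PER_K * t)) for t in temps]

energy = initial_rise_energy(temps, signal)
```

With real data only the first part of the rising edge should be fitted, since the exponential approximation breaks down as the trap starts to empty.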