• Title/Summary/Keyword: 알고리즘 기반

Search Result 13,954, Processing Time 0.039 seconds

The Prediction of Survival of Breast Cancer Patients Based on Machine Learning Using Health Insurance Claim Data (건강보험 청구 데이터를 활용한 머신러닝 기반유방암 환자의 생존 여부 예측)

  • Doeggyu Lee;Kyungkeun Byun;Hyungdong Lee;Sunhee Shin
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.28 no.2
    • /
    • pp.1-9
    • /
    • 2023
  • Research using AI and big data is also being actively conducted in the health and medical fields such as disease diagnosis and treatment. Most of the existing research data used cohort data from research institutes or some patient data. In this paper, the difference in the prediction rate of survival and the factors affecting survival between breast cancer patients in their 40~50s and other age groups was revealed using health insurance review claim data held by the HIRA. As a result, the accuracy of predicting patients' survival was 0.93 on average in their 40~50s, higher than 0.86 in their 60~80s. In terms of that factor, the number of treatments was high for those in their 40~50s, and age was high for those in their 60~80s. Performance comparison with previous studies, the average precision was 0.90, which was higher than 0.81 of the existing paper. As a result of performance comparison by applied algorithm, the overall average precision of Decision Tree, Random Forest, and Gradient Boosting was 0.90, and the recall was 1.0, and the precision of multi-layer perceptrons was 0.89, and the recall was 1.0. I hope that more research will be conducted using machine learning automation(Auto ML) tools for non-professionals to enhance the use of the value for health insurance review claim data held by the HIRA.

Deep Learning based Estimation of Depth to Bearing Layer from In-situ Data (딥러닝 기반 국내 지반의 지지층 깊이 예측)

  • Jang, Young-Eun;Jung, Jaeho;Han, Jin-Tae;Yu, Yonggyun
    • Journal of the Korean Geotechnical Society
    • /
    • v.38 no.3
    • /
    • pp.35-42
    • /
    • 2022
  • The N-value from the Standard Penetration Test (SPT), which is one of the representative in-situ test, is an important index that provides basic geological information and the depth of the bearing layer for the design of geotechnical structures. In the aspect of time and cost-effectiveness, there is a need to carry out a representative sampling test. However, the various variability and uncertainty are existing in the soil layer, so it is difficult to grasp the characteristics of the entire field from the limited test results. Thus the spatial interpolation techniques such as Kriging and IDW (inverse distance weighted) have been used for predicting unknown point from existing data. Recently, in order to increase the accuracy of interpolation results, studies that combine the geotechnics and deep learning method have been conducted. In this study, based on the SPT results of about 22,000 holes of ground survey, a comparative study was conducted to predict the depth of the bearing layer using deep learning methods and IDW. The average error among the prediction results of the bearing layer of each analysis model was 3.01 m for IDW, 3.22 m and 2.46 m for fully connected network and PointNet, respectively. The standard deviation was 3.99 for IDW, 3.95 and 3.54 for fully connected network and PointNet. As a result, the point net deep learing algorithm showed improved results compared to IDW and other deep learning method.

Analysis of Transfer Learning Effect for Automatic Dog Breed Classification (반려견 자동 품종 분류를 위한 전이학습 효과 분석)

  • Lee, Dongsu;Park, Gooman
    • Journal of Broadcast Engineering
    • /
    • v.27 no.1
    • /
    • pp.133-145
    • /
    • 2022
  • Compared to the continuously increasing dog population and industry size in Korea, systematic analysis of related data and research on breed classification methods are very insufficient. In this paper, an automatic breed classification method is proposed using deep learning technology for 14 major dog breeds domestically raised. To do this, dog images are collected for deep learning training and a dataset is built, and a breed classification algorithm is created by performing transfer learning based on VGG-16 and Resnet-34 as backbone networks. In order to check the transfer learning effect of the two models on dog images, we compared the use of pre-trained weights and the experiment of updating the weights. When fine tuning was performed based on VGG-16 backbone network, in the final model, the accuracy of Top 1 was about 89% and that of Top 3 was about 94%, respectively. The domestic dog breed classification method and data construction proposed in this paper have the potential to be used for various application purposes, such as classification of abandoned and lost dog breeds in animal protection centers or utilization in pet-feed industry.

KHistory: A System for Automatic Generation of Multiple Choice Questions on the History of Korea (KHistory: 한국사 객관식 문제 자동 생성 시스템)

  • Kim, Seong-Won;Jung, Hae-Seong;Jin, Jae-Hwan;Lee, Myung-Joon
    • Asia-pacific Journal of Multimedia Services Convergent with Art, Humanities, and Sociology
    • /
    • v.7 no.1
    • /
    • pp.253-263
    • /
    • 2017
  • As needs for knowledge on Korean history and the attention of the people are rapidly increasing, various smartphone applications for learning the history have appeared during recent years. These applications provide multiple choice questions to users through their own problem banks. But, since these questions are selected from the fixed set of problems that are stored previously, the learning efficiency of users is inevitably decreased when they use the applications repeatedly. In this paper, we present a question generation system named K-History which generates multiple choice questions in an automatic way using the database on the history of Korea. In addition, we also describe the development of the application Korean History Infinite Challenge as a learning application for Korean history. To develop K-History, we classify typical types of learning problems through various problems based on Korean history learning materials, proposing algorithms to generate problems according to the types found. Through the developed techniques, various learning systems can reduce the cost for creating questions, while increasing the learning efficiency of users.

Data analysis by Integrating statistics and visualization: Visual verification for the prediction model (통계와 시각화를 결합한 데이터 분석: 예측모형 대한 시각화 검증)

  • Mun, Seong Min;Lee, Kyung Won
    • Design Convergence Study
    • /
    • v.15 no.6
    • /
    • pp.195-214
    • /
    • 2016
  • Predictive analysis is based on a probabilistic learning algorithm called pattern recognition or machine learning. Therefore, if users want to extract more information from the data, they are required high statistical knowledge. In addition, it is difficult to find out data pattern and characteristics of the data. This study conducted statistical data analyses and visual data analyses to supplement prediction analysis's weakness. Through this study, we could find some implications that haven't been found in the previous studies. First, we could find data pattern when adjust data selection according as splitting criteria for the decision tree method. Second, we could find what type of data included in the final prediction model. We found some implications that haven't been found in the previous studies from the results of statistical and visual analyses. In statistical analysis we found relation among the multivariable and deducted prediction model to predict high box office performance. In visualization analysis we proposed visual analysis method with various interactive functions. Finally through this study we verified final prediction model and suggested analysis method extract variety of information from the data.

Obstacle Avoidance of Unmanned Surface Vehicle based on 3D Lidar for VFH Algorithm (무인수상정의 장애물 회피를 위한 3차원 라이다 기반 VFH 알고리즘 연구)

  • Weon, Ihn-Sik;Lee, Soon-Geul;Ryu, Jae-Kwan
    • Asia-pacific Journal of Multimedia Services Convergent with Art, Humanities, and Sociology
    • /
    • v.8 no.3
    • /
    • pp.945-953
    • /
    • 2018
  • In this paper, we use 3-D LIDAR for obstacle detection and avoidance maneuver for autonomous unmanned operation. It is aimed to avoid obstacle avoidance in unmanned water under marine condition using only single sensor. 3D lidar uses Quanergy's M8 sensor to collect surrounding obstacle data and includes layer information and intensity information in obstacle information. The collected data is converted into a three-dimensional Cartesian coordinate system, which is then mapped to a two-dimensional coordinate system. The data including the obstacle information converted into the two-dimensional coordinate system includes noise data on the water surface. So, basically, the noise data generated regularly is defined by defining a hypothetical region of interest based on the assumption of unmanned water. The noise data generated thereafter are set to a threshold value in the histogram data calculated by the Vector Field Histogram, And the noise data is removed in proportion to the amount of noise. Using the removed data, the relative object was searched according to the unmanned averaging motion, and the density map of the data was made while keeping one cell on the virtual grid map. A polar histogram was generated for the generated obstacle map, and the avoidance direction was selected using the boundary value.

Predicting the Fetotoxicity of Drugs Using Machine Learning (기계학습 기반 약물의 태아 독성 예측 연구)

  • Myeonghyeon Jeong;Sunyong Yoo
    • Journal of Life Science
    • /
    • v.33 no.6
    • /
    • pp.490-497
    • /
    • 2023
  • Pregnant women may need to take medications to treat preexisting diseases or diseases that develop during pregnancy. However, some drugs may be fetotoxic and lead to, for example, teratogenicity and growth retardation. Predicting the fetotoxicity of drugs is thus important for the health of the mother and fetus. The fetotoxicity of many drugs has not been established because various challenges hinder the ability of researchers to determine their fetotoxicity. The need exists for in silico-based fetotoxicity assessment models, as they can modernize the testing paradigm, improve predictability, and reduce the use of animals and the costs of fetotoxicity testing. In this study, we collected data on the fetotoxicity of drugs and constructed fetotoxicity prediction models based on various machine learning algorithms. We optimized the models for more precise predictions by tuning the hyperparameters. We then performed quantitative performance evaluations. The results indicated that the constructed machine learning-based models had high performance (AUROC >0.85, AUPR >0.9) in fetotoxicity prediction. We also analyzed the feature importance of our model's predictions, which could be leveraged to identify the specific features of drugs that are strongly associated with fetotoxicity. The proposed model can be used to prescreen drugs and drug candidates at a lower cost and in less time. It provides a predictive score for fetotoxicity risk, which may be beneficial in the design of studies on fetotoxicity in human pregnancy.

Toward understanding learning patterns in an open online learning platform using process mining (프로세스 마이닝을 활용한 온라인 교육 오픈 플랫폼 내 학습 패턴 분석 방법 개발)

  • Taeyoung Kim;Hyomin Kim;Minsu Cho
    • Journal of Intelligence and Information Systems
    • /
    • v.29 no.2
    • /
    • pp.285-301
    • /
    • 2023
  • Due to the increasing demand and importance of non-face-to-face education, open online learning platforms are getting interests both domestically and internationally. These platforms exhibit different characteristics from online courses by universities and other educational institutions. In particular, students engaged in these platforms can receive more learner autonomy, and the development of tools to assist learning is required. From the past, researchers have attempted to utilize process mining to understand realistic study behaviors and derive learning patterns. However, it has a deficiency to employ it to the open online learning platforms. Moreover, existing research has primarily focused on the process model perspective, including process model discovery, but lacks a method for the process pattern and instance perspectives. In this study, we propose a method to identify learning patterns within an open online learning platform using process mining techniques. To achieve this, we suggest three different viewpoints, e.g., model-level, variant-level, and instance-level, to comprehend the learning patterns, and various techniques are employed, such as process discovery, conformance checking, autoencoder-based clustering, and predictive approaches. To validate this method, we collected a learning log of machine learning-related courses on a domestic open education platform. The results unveiled a spaghetti-like process model that can be differentiated into a standard learning pattern and three abnormal patterns. Furthermore, as a result of deriving a pattern classification model, our model achieved a high accuracy of 0.86 when predicting the pattern of instances based on the initial 30% of the entire flow. This study contributes to systematically analyze learners' patterns using process mining.

Estimating Travel Frequency of Public Bikes in Seoul Considering Intermediate Stops (경유지를 고려한 서울시 공공자전거 통행발생량 추정 모형 개발)

  • Jonghan Park;Joonho Ko
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.22 no.3
    • /
    • pp.1-19
    • /
    • 2023
  • Bikes have recently emerged as an alternative to carbon neutrality. To understand the demand for public bikes, we endeavored to estimate travel frequency of public bike by considering the intermediate stops. Using the GPS trajectory data of 'Ttareungyi', a public bike service in Seoul, we identified a stay point and estimated travel frequency reflecting population, land use, and physical characteristics. Application of map matching and a stay point detection algorithm revealed that stay point appeared in about 12.1% of the total trips. Compared to a trip without stay point, the trip with stay point has a longer average travel distance and travel time and a higher occurrence rate during off-peak hours. According to visualization analysis, the stay points are mainly found in parks, leisure facilities, and business facilities. To consider the stay point, the unit of analysis was set as a hexagonal grid rather than the existing rental station base. Travel frequency considering the stay point were analyzed using the Zero-Inflated Negative Binomial (ZINB) model. Results of our analysis revealed that the travel frequency were higher in bike infrastructure where the safety of bike users was secured, such as 'Bikepath' and 'Bike and pedestrian path'. Also, public bikes play a role as first & last mile means of access to public transportation. The measure of travel frequency was also observed to increase in life and employment centers. Considering the results of this analysis, securing safety facilities and space for users should be given priority when planning any additional expansion of bike infrastructure. Moreover, there is a necessity to establish a plan to supply bike infrastructure facilities linked to public transportation, especially the subway.

Implementation of Plastic Bottle Classification System for Recycling (분리수거를 위한 페트병 분리시스템의 구현)

  • Park, Yongha;Park, Jihoon;Chung, Hoyeong;Lee, Joosang;Lee, Jungyeop
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2021.05a
    • /
    • pp.365-368
    • /
    • 2021
  • In this study, a plastic bottle recycling bin system that utilizes an infrared sensor was implemented. The proposed system consists of a recognition unit, a control unit, an alarm unit, and a driving unit. The recognition unit detects the plastic bottle, measures the distance between the plastic bottle and the infrared sensor, extracts the value of the bottle, compares the extracted value with a standard range, and then transmits the control value to the control unit if the extracted value of the bottle is outside the standard range. In this case, the result of the presence or absence of a brand label or bottle cap is transmitted to the controller. The control unit opens the entrance of the recycling bin or alerts the alarm unit according to the result value transmitted from the sensor unit. In order to implement the proposed system, the recognition unit was implemented with an infrared sensor, and the control unit was made with an Arduino IDE controller, based on the C programming language. Additionally, the recognition unit and the control unit are able to communicate using analog signals. The proposed system accurately judges the presence or absence of a brand label and bottle cap of plastic bottles according to a predetermined algorithm. It then blocks the entrance of the recycling bin when a brand label or bottle cap is still attached. As the amount of waste discharged per person is relatively high and the majority of such waste is incinerated rather than recycled, the system proposed in this study is expected to increase the recycling rate of plastic bottles.

  • PDF