• Title/Abstract/Keywords: Machine Learning & Training

Search results: 819 items, processing time 0.027 s

Artificial Neural Network Supported Prediction of Magnetic Properties of Bulk Metallic Glasses

  • 남충희
    • 한국재료학회지 (Korean Journal of Materials Research)
    • /
    • Vol. 33, No. 7
    • /
    • pp.273-278
    • /
    • 2023
  • In this study, regression models based on artificial neural networks (ANN) were applied to predict the saturation magnetic flux density (Bs) from experimental Bs values of 622 Fe-based bulk metallic glasses (BMGs), and their prediction performance was evaluated. Model performance was evaluated using the F1 score together with the coefficient of determination (R2 score), the metric most commonly used for regression models. The coefficient of determination is a suitable performance indicator because it reflects, in a balanced way, how well the saturation magnetic flux density is predicted across the full material dataset. However, Fe-based BMGs need a high saturation magnetic flux density to be well suited for use as soft magnetic materials, so this study also used the F1 score as a performance indicator to better predict Bs above a threshold value of 1.4 T. After obtaining two ANN models optimized for the R2 and F1 score conditions, respectively, their prediction performance was compared on the test data. As a case study, new Fe-based BMG datasets that were not included in the training and test datasets were predicted using the two ANN models. The results showed that the model with the better F1 score predicted materials with a high saturation magnetic flux density more accurately.
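
The two indicators named in this abstract can be computed from the same regression output. The sketch below is a minimal illustration, not the authors' code: the Bs values are made up, and treating "above the threshold" as `>= 1.4` is an assumption.

```python
def r2_score(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def f1_above_threshold(y_true, y_pred, threshold=1.4):
    """F1 score after turning the regression into 'high Bs' vs 'not high Bs'.

    Assumption: a sample counts as 'high' when its value is >= threshold.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t >= threshold and p >= threshold)
    fp = sum(1 for t, p in pairs if t < threshold and p >= threshold)
    fn = sum(1 for t, p in pairs if t >= threshold and p < threshold)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical measured and predicted Bs values in tesla (illustration only).
bs_true = [1.20, 1.35, 1.45, 1.60, 1.52, 1.10]
bs_pred = [1.25, 1.42, 1.38, 1.58, 1.50, 1.15]

r2 = r2_score(bs_true, bs_pred)
f1 = f1_above_threshold(bs_true, bs_pred)
print(f"R2 = {r2:.3f}, F1 (Bs >= 1.4 T) = {f1:.3f}")
```

In this toy data the R2 score is high even though two samples near 1.4 T land on the wrong side of the threshold, which is the kind of gap the F1 indicator is meant to expose.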

Prediction of Global Industrial Water Demand using Machine Learning

  • Panda, Manas Ranjan;Kim, Yeonjoo
    • 한국수자원학회:학술대회논문집 (Proceedings of the Korea Water Resources Association Conference)
    • /
    • 한국수자원학회 2022년도 학술발표회 (Korea Water Resources Association 2022 Conference)
    • /
    • pp.156-156
    • /
    • 2022
  • Spatially explicit and reliable data on industrial water demand are important for both policy makers and researchers who need to carry out region-specific analyses of water resources management. However, such data remain scarce, particularly in underdeveloped and developing countries. Existing research has made limited use of spatially available socio-economic, climate, and geographical data from different sources to predict industrial water demand at finer resolution. This study proposes a random forest regression (RFR) model to predict industrial water demand at 0.5° × 0.5° spatial resolution by combining various features extracted from multiple data sources. The datasets used here include National Polar-orbiting Partnership (NPP)/Visible Infrared Imaging Radiometer Suite (VIIRS) night-time light (NTL), the Global Power Plant database, AQUASTAT country-wise industrial water use data, elevation data, Gross Domestic Product (GDP), road density, cropland, population, precipitation, temperature, and aridity. Compared with traditional regression algorithms, RF offers high prediction accuracy, requires no assumptions about a prior probability distribution, and can analyze variable importance. The final RF model was fitted using the parameter settings ntree = 300 and mtry = 2. As a result, a coefficient of determination of 0.547 was achieved. Among the independent variables used to train the RF model, night-time light, elevation, GDP, and population play the major role in predicting industrial water demand.

Application of ChatGPT text extraction model in analyzing rhetorical principles of COVID-19 pandemic information on a question-and-answer community

  • Hyunwoo Moon;Beom Jun Bae;Sangwon Bae
    • International journal of advanced smart convergence
    • /
    • Vol. 13, No. 2
    • /
    • pp.205-213
    • /
    • 2024
  • This study uses a large language model (LLM) to identify Aristotle's rhetorical principles (ethos, pathos, and logos) in COVID-19 information on Naver Knowledge-iN, South Korea's leading question-and-answer community. The research compared these rhetorical elements between the most upvoted answers and random answers. A total of 193 answer pairs were randomly selected, with 135 pairs for training and 58 for testing. These answers were then coded in line with the rhetorical principles to refine GPT-3.5-based models. The models achieved F1 scores of .88 (ethos), .81 (pathos), and .69 (logos). Subsequent analysis of 128 new answer pairs revealed that logos, particularly factual information and logical reasoning, was used more frequently in the most upvoted answers than in the random answers, whereas ethos and pathos did not differ between the answer groups. The results suggest that health information consumers value information that includes logos, while ethos and pathos were not associated with consumers' preference for health information. By using an LLM to analyze persuasive content, a task typically done manually at considerable cost in labor and time, this study demonstrates the feasibility of using an LLM for latent content and contributes to expanding the field of AI text extraction.

Prediction of karst sinkhole collapse using a decision-tree (DT) classifier

  • Boo Hyun Nam;Kyungwon Park;Yong Je Kim
    • Geomechanics and Engineering
    • /
    • Vol. 36, No. 5
    • /
    • pp.441-453
    • /
    • 2024
  • Sinkhole subsidence and collapse are common geohazards in karst areas such as the state of Florida, United States of America. To predict sinkhole occurrence, we need to understand the formation mechanism of sinkholes and the karst hydrogeology. For this purpose, investigating the factors affecting sinkholes is an essential step. The main objectives of the present study are (1) the development of a machine learning (ML)-based model, namely the C5.0 decision tree (C5.0 DT), for the prediction of sinkhole susceptibility, which accounts for the sinkhole/subsidence inventory and sinkhole contributing factors (e.g., geological/hydrogeological) and (2) the construction of a regional-scale sinkhole susceptibility map. The study area is east central Florida (ECF), where the cover-collapse type is commonly reported. The C5.0 DT algorithm was used to account for twelve (12) identified hydrogeological factors. In this study, a total of 1,113 sinkholes in ECF were identified, and the dataset was then randomly divided into 70% and 30% subsets for training and testing, respectively. The performance of the sinkhole susceptibility model was evaluated using a receiver operating characteristic (ROC) curve, particularly the area under the curve (AUC). The C5.0 model showed a high prediction accuracy of 83.52%. It is concluded that a decision tree is a promising tool and classifier for spatial prediction of karst sinkholes and subsidence in the ECF area.
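
The 70%/30% random split and the AUC evaluation described in this abstract can be sketched as follows. This is an illustrative sketch only: the helper names, the random seed, and the example labels and scores are assumptions, and the C5.0 tree itself is not reproduced here.

```python
import random


def train_test_split(items, train_fraction=0.7, seed=0):
    """Shuffle a copy of the records and cut it into training and test subsets."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    cut = int(round(len(shuffled) * train_fraction))
    return shuffled[:cut], shuffled[cut:]


def roc_auc(labels, scores):
    """AUC as the probability that a positive outranks a negative (ties count 0.5)."""
    positives = [s for label, s in zip(labels, scores) if label == 1]
    negatives = [s for label, s in zip(labels, scores) if label == 0]
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in positives
        for n in negatives
    )
    return wins / (len(positives) * len(negatives))


# 1,113 record ids stand in for the inventory; only the split sizes matter here.
train_ids, test_ids = train_test_split(range(1113))
print(len(train_ids), len(test_ids))

# Hypothetical susceptibility scores for six test locations (1 = sinkhole).
example_labels = [1, 1, 0, 1, 0, 0]
example_scores = [0.9, 0.8, 0.7, 0.6, 0.3, 0.2]
auc = roc_auc(example_labels, example_scores)
print(f"AUC = {auc:.3f}")
```

The pairwise definition used in `roc_auc` gives the same value as the area under the ROC curve and avoids having to sweep thresholds explicitly.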

Severity Analysis for Occupational Heat-related Injury Using the Multinomial Logit Model

  • Peiyi Lyu;Siyuan Song
    • Safety and Health at Work
    • /
    • Vol. 15, No. 2
    • /
    • pp.200-207
    • /
    • 2024
  • Background: Workers are often exposed to hazardous heat due to their work environment, leading to various injuries. As a result of climate change, heat-related injuries (HRIs) are becoming more problematic. This study aims to identify critical contributing factors to the severity of occupational HRIs. Methods: This study analyzed historical injury reports from the Occupational Safety and Health Administration (OSHA). Contributing factors to the severity of HRIs were identified using text mining and model-free machine learning methods. The Multinomial Logit Model (MNL) was applied to explore the relationship between impact factors and the severity of HRIs. Results: The results indicated a higher risk of fatal HRIs among middle-aged, older, and male workers, particularly in the construction, service, manufacturing, and agriculture industries. In addition, a higher heat index, collapses, heart attacks, and fall accidents increased the severity of HRIs, while symptoms such as dehydration, dizziness, cramps, faintness, and vomiting reduced the likelihood of fatal HRIs. Conclusions: The severity of HRIs was significantly influenced by factors such as workers' age, gender, industry type, heat index, symptoms, and secondary injuries. The findings underscore the need for tailored preventive strategies and training across different worker groups to mitigate HRI risks.
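
A multinomial logit model turns one linear score per severity level into probabilities through a softmax. The sketch below shows only that mechanism; the three severity levels, the two features, and every coefficient are hypothetical and are not taken from the study.

```python
import math

# Hypothetical coefficients per severity level; "non_hospitalized" is the
# reference level, so all of its coefficients are fixed at zero.
COEFFICIENTS = {
    "non_hospitalized": {"intercept": 0.0, "heat_index": 0.0, "older_worker": 0.0},
    "hospitalized": {"intercept": -1.0, "heat_index": 0.02, "older_worker": 0.3},
    "fatal": {"intercept": -4.0, "heat_index": 0.04, "older_worker": 0.6},
}


def severity_probabilities(features):
    """Return P(level | features) under the multinomial logit (softmax) form."""
    utilities = {
        level: beta["intercept"]
        + sum(beta[name] * value for name, value in features.items())
        for level, beta in COEFFICIENTS.items()
    }
    top = max(utilities.values())  # subtract the maximum for numerical stability
    exp_u = {level: math.exp(u - top) for level, u in utilities.items()}
    total = sum(exp_u.values())
    return {level: e / total for level, e in exp_u.items()}


mild_day = severity_probabilities({"heat_index": 85.0, "older_worker": 0.0})
hot_day = severity_probabilities({"heat_index": 105.0, "older_worker": 1.0})
print(mild_day)
print(hot_day)
```

With positive coefficients on the fatal level, raising the heat index shifts probability mass toward the fatal outcome, which mirrors the direction of the effect reported in the abstract.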

STEAM education cases study analysis using Open Source: Focusing on the use of the Knitting Machine

  • 박지훈;남원석;장중식
    • 한국융합학회논문지 (Journal of the Korea Convergence Society)
    • /
    • Vol. 10, No. 12
    • /
    • pp.199-204
    • /
    • 2019
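  • This study sought to identify the expected effects and implications of STEAM education using an open-source-based knitting machine, to be designed as a future research task, by analyzing the current status, trends, and cases of domestic and international STEAM education that uses open source, which is widely used in various fields. As the research method, a theoretical review was conducted based on a literature study; the current status and trends of STEAM education using open source in Korea and abroad were then identified, and cases were surveyed and analyzed. The results confirmed that social interest in STEAM education using open source is increasing, that such education is designed to stimulate interest and to cultivate self-directed learning ability and creative thinking, and that the resulting positive effects are becoming visible. Based on these implications, we present the expected effects of using a knitting machine built with open source and review the design direction and significance of STEAM education to be designed in the future.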

Prediction of squeezing phenomenon in tunneling projects: Application of Gaussian process regression

  • Mirzaeiabdolyousefi, Majid;Mahmoodzadeh, Arsalan;Ibrahim, Hawkar Hashim;Rashidi, Shima;Majeed, Mohammed Kamal;Mohammed, Adil Hussein
    • Geomechanics and Engineering
    • /
    • Vol. 30, No. 1
    • /
    • pp.11-26
    • /
    • 2022
  • One of the most important issues in tunneling is the squeezing phenomenon. Squeezing can occur during excavation or after the construction of tunnels, and in both cases it can lead to significant damage. Therefore, it is important to predict squeezing and consider it in the early design stage of tunnel construction. Different empirical, semi-empirical and theoretical-analytical methods have been presented to determine squeezing. Therefore, it is necessary to examine the ability of each of these methods and identify the best method among them. In this study, squeezing in a part of the Alborz service tunnel in Iran was estimated through a number of empirical, semi-empirical and theoretical-analytical methods. Among these methods, the most robust model was used to obtain a database of 300 data points for training and 33 data points for testing in order to develop a machine learning (ML) method. To this end, three ML models, Gaussian process regression (GPR), artificial neural network (ANN) and support vector regression (SVR), were trained and tested to propose a robust model to predict the squeezing phenomenon. A comparative analysis between the conventional and the ML methods used in this study showed that the GPR model is the most robust model for predicting the squeezing phenomenon. The sensitivity analysis of the input parameters using the mutual information test (MIT) method showed that the parameter with the greatest influence on the squeezing phenomenon is the tangential strain (ε_θ^α), with a sensitivity score of 2.18. Finally, the GPR model was recommended for predicting the squeezing phenomenon in tunneling projects. The significance of this work is that it can provide a good estimate of the squeezing phenomenon in tunneling projects, based on which geotechnical engineers can take the necessary actions to deal with it in pre-construction designs.
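
For readers unfamiliar with Gaussian process regression, the posterior mean can be sketched in a few lines. The example below assumes a zero-mean prior, an RBF kernel with unit length scale, and four made-up one-dimensional data points; none of these choices come from the study, which used 300 training points and several input parameters.

```python
import math


def rbf(a, b, length_scale=1.0):
    """Squared-exponential (RBF) kernel between two scalar inputs."""
    return math.exp(-0.5 * ((a - b) / length_scale) ** 2)


def solve(matrix, rhs):
    """Solve matrix @ x = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    aug = [row[:] + [rhs[i]] for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(aug[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (aug[i][n] - tail) / aug[i][i]
    return x


def gpr_fit(xs, ys, noise=1e-6):
    """Return alpha = (K + noise * I)^-1 y for a zero-mean GP."""
    kernel = [
        [rbf(a, b) + (noise if i == j else 0.0) for j, b in enumerate(xs)]
        for i, a in enumerate(xs)
    ]
    return solve(kernel, ys)


def gpr_mean(xs, alpha, query):
    """Posterior mean at a query point: k(query, X) @ alpha."""
    return sum(rbf(query, x) * a for x, a in zip(xs, alpha))


# Hypothetical 1-D inputs and targets (illustration only).
train_x = [0.0, 1.0, 2.0, 3.0]
train_y = [0.1, 0.4, 0.9, 1.2]
alpha = gpr_fit(train_x, train_y)
print(gpr_mean(train_x, alpha, 1.0), gpr_mean(train_x, alpha, 1.5))
```

With a very small noise term the posterior mean passes almost exactly through the training targets and interpolates smoothly between them.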

Evaluation of Classification Models of Mild Left Ventricular Diastolic Dysfunction by Tei Index

  • 김수민;예수영
    • 한국방사선학회논문지 (Journal of the Korean Society of Radiology)
    • /
    • Vol. 17, No. 5
    • /
    • pp.761-766
    • /
    • 2023
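  • In this paper, the Tei index (TI) was measured to classify the presence or absence of mild left ventricular diastolic dysfunction. The machine learning models used for classification were SVM and KNN. Of a total of 306 data points, 206 were used as training data and 100 as test data. As a result, SVM showed relatively higher accuracy than KNN, confirming that it is more useful for diagnosing the presence or absence of left ventricular diastolic dysfunction. In future work, classification performance is expected to improve further if various indices that evaluate cardiac function are added alongside TI and more data are secured. Furthermore, the results are expected to serve as basic data for the prediction and classification of other diseases and for addressing the shortage of medical personnel relative to the increasing number of examinations.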
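
A k-nearest-neighbours classifier on a single feature such as the Tei index can be sketched as below. The TI values, the labels, and the choice of k = 3 are hypothetical and serve only to show the voting mechanism; the SVM counterpart is not reproduced.

```python
from collections import Counter


def knn_predict(train_x, train_y, query, k=3):
    """Label a query by majority vote among its k nearest training values."""
    nearest = sorted(zip(train_x, train_y), key=lambda pair: abs(pair[0] - query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


def accuracy(train_x, train_y, test_x, test_y, k=3):
    """Fraction of test samples whose predicted label matches the true label."""
    hits = sum(
        1 for x, y in zip(test_x, test_y) if knn_predict(train_x, train_y, x, k) == y
    )
    return hits / len(test_y)


# Hypothetical Tei index values; label 1 = dysfunction present, 0 = absent.
ti_train = [0.35, 0.38, 0.40, 0.42, 0.55, 0.58, 0.60, 0.65]
label_train = [0, 0, 0, 0, 1, 1, 1, 1]
ti_test = [0.37, 0.45, 0.57, 0.62]
label_test = [0, 0, 1, 1]

knn_accuracy = accuracy(ti_train, label_train, ti_test, label_test)
print(f"KNN accuracy on the toy test set: {knn_accuracy:.2f}")
```

Because KNN depends directly on distances, adding further cardiac indices as the abstract suggests would normally call for scaling each feature before computing neighbours.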

A Unicode based Deep Handwritten Character Recognition model for Telugu to English Language Translation

  • BV Subba Rao;J. Nageswara Rao;Bandi Vamsi;Venkata Nagaraju Thatha;Katta Subba Rao
    • International Journal of Computer Science & Network Security
    • /
    • Vol. 24, No. 2
    • /
    • pp.101-112
    • /
    • 2024
  • Telugu is considered the fourth most used language in India, especially in regions such as Andhra Pradesh, Telangana, and Karnataka. Internationally, too, Telugu is a widely spoken and growing language. The language comprises different dependent and independent vowels, consonants and digits. Even so, Telugu Handwritten Character Recognition (HCR) has not been widely developed. HCR is a neural network technique for converting a document image into editable text that can be used in many other applications. This saves time and effort because the text does not have to be re-entered from scratch every time. In this work, a Unicode-based Handwritten Character Recognition (U-HCR) model is developed for translating handwritten Telugu characters into the English language. By using the Centre of Gravity (CG) in our model, we can easily divide a compound character into individual characters with the help of Unicode values. To train this model, we used both online and offline Telugu character datasets. To extract features from the scanned image, we used a convolutional neural network (CNN) along with machine learning classifiers such as Random Forest and Support Vector Machine. Stochastic Gradient Descent (SGD), Root Mean Square Propagation (RMS-P) and Adaptive Moment Estimation (ADAM) optimizers are used in this work to enhance the performance of U-HCR and to reduce the loss function value. The optimizers make this loss reduction possible when training the CNN. On both online and offline datasets, the proposed model showed promising results, with accuracies of 90.28% for SGD, 96.97% for RMS-P and 93.57% for ADAM.
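
The three optimizers named in this abstract differ only in how they turn a gradient into a parameter update. The sketch below applies their standard textbook update rules to a toy one-dimensional loss, (w - 3)^2; the loss, learning rates, and step count are assumptions chosen for illustration and say nothing about the U-HCR training setup.

```python
import math


def grad(w):
    """Gradient of the toy loss (w - 3)^2, whose minimum is at w = 3."""
    return 2.0 * (w - 3.0)


def sgd(w, steps, lr=0.1):
    for _ in range(steps):
        w -= lr * grad(w)
    return w


def rmsprop(w, steps, lr=0.05, decay=0.9, eps=1e-8):
    avg_sq = 0.0  # running average of squared gradients
    for _ in range(steps):
        g = grad(w)
        avg_sq = decay * avg_sq + (1 - decay) * g * g
        w -= lr * g / (math.sqrt(avg_sq) + eps)
    return w


def adam(w, steps, lr=0.05, beta1=0.9, beta2=0.999, eps=1e-8):
    m = v = 0.0  # first and second moment estimates
    for t in range(1, steps + 1):
        g = grad(w)
        m = beta1 * m + (1 - beta1) * g
        v = beta2 * v + (1 - beta2) * g * g
        m_hat = m / (1 - beta1 ** t)  # bias correction
        v_hat = v / (1 - beta2 ** t)
        w -= lr * m_hat / (math.sqrt(v_hat) + eps)
    return w


w_sgd = sgd(0.0, 500)
w_rmsprop = rmsprop(0.0, 500)
w_adam = adam(0.0, 500)
print(w_sgd, w_rmsprop, w_adam)
```

On this toy loss, RMSProp with a constant learning rate tends to hover within a small band around the minimum instead of settling exactly on it, which is one reason results can differ between optimizers on the same network.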

Regeneration of a defective Railroad Surface for defect detection with Deep Convolution Neural Networks

  • 김현호;한석민
    • 인터넷정보학회논문지 (Journal of Internet Computing and Services)
    • /
    • Vol. 21, No. 6
    • /
    • pp.23-31
    • /
    • 2020
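  • This study was conducted to obtain higher scores from a defect detection model by generating training data for detecting defects, one of the aging phenomena that occur on railroad surfaces. Defects on railroad surfaces arise from various causes, such as rail fastening devices and friction between the rail and vehicles, and can lead to accidents such as rail breakage, so railroad maintenance for defects is necessary. Accordingly, to automate railroad maintenance and reduce its cost, various studies are under way on defect detection and inspection that apply image processing or machine learning to railroad surface images. In general, the performance of image processing analysis methods and machine learning techniques depends on the quantity and quality of data. For this reason, some studies have required a device that photographs the rail surface at regular intervals, or a vehicle equipped with one, to secure a database of general and diverse railroad surface images. To reduce the operating cost of such mechanical image acquisition devices and to complement them, this study starts from the basic structure of the generative adversarial network (GAN), a representative deep learning model for image generation, and applies methods proposed in several related studies to build a regeneration model for defective railroad surfaces, so that defect detection can be carried out even on railroad surface images for which no dedicated database has been built. The model was designed to learn to generate railroad surfaces that reflect different surface textures and to regenerate diverse defects that satisfy ground-truth maps for several arbitrary defect locations. The regenerated railroad surface images are used as training data for a deep learning defect detection model. To validate the regeneration model, the railroad surface data were clustered into three subsets; one subset was defined as the original images, and several rail surface images from the other two subsets were used as texture images to generate new railroad surface images. The defect detection model was then validated separately for two cases: training with the newly generated railroad surface images, and training with the original images alone, without any generated images. The two subsets other than the one used as the original images were evaluated with the defect detection model trained under each condition, yielding pixel-wise classification maps as output. Performance was computed by evaluating these pixel-wise classification maps against the ground-truth defect maps, which mark the actual defect locations, using IoU (Intersection over Union) and F1-score. As a result, the defect detection model trained on regenerated training data made with texture images from the two subsets scored about 10 to 15% higher in IoU and F1-score than the model trained on the original images alone. This demonstrates that defect detection is largely feasible using existing data, even for railroad surface images for which no dedicated training data have been built.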
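
The pixel-wise IoU and F1-score used to compare a predicted defect map with its ground-truth map can be computed as sketched below. The two small binary masks are made up for illustration, and returning 1.0 when both masks are empty is an assumed convention.

```python
def iou_and_f1(pred_mask, truth_mask):
    """Pixel-wise IoU and F1 between two binary masks given as lists of rows."""
    tp = fp = fn = 0
    for pred_row, truth_row in zip(pred_mask, truth_mask):
        for p, t in zip(pred_row, truth_row):
            if p and t:
                tp += 1  # defect pixel found
            elif p and not t:
                fp += 1  # false alarm
            elif t and not p:
                fn += 1  # missed defect pixel
    union = tp + fp + fn
    # Assumed convention: two empty masks agree perfectly.
    iou = tp / union if union else 1.0
    f1 = 2 * tp / (2 * tp + fp + fn) if union else 1.0
    return iou, f1


# Hypothetical 3x4 masks: 1 = defect pixel, 0 = background.
truth = [
    [0, 1, 1, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
]
pred = [
    [0, 1, 1, 1],
    [0, 0, 1, 0],
    [0, 0, 0, 0],
]

iou, f1 = iou_and_f1(pred, truth)
print(f"IoU = {iou:.2f}, F1 = {f1:.2f}")
```

For a single pair of masks the two scores are tied by F1 = 2 * IoU / (1 + IoU), so F1 is never lower than IoU.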