• 제목/요약/키워드: machine learning (ML)

검색결과 291건 처리시간 0.027초

머신러닝 기법을 활용한 낙동강 하구 염분농도 예측 (Nakdong River Estuary Salinity Prediction Using Machine Learning Methods)

  • 이호준;조민규;천세진;한정규
    • 스마트미디어저널
    • /
    • 제11권2호
    • /
    • pp.31-38
    • /
    • 2022
  • 하천의 염분 변화를 신속히 예측하는 것은 염분 침투로 인한 농업, 생태계의 피해를 예측하고 재해 방지 대책을 수립하기 위해서 중요한 작업이다. 머신러닝 기법은 물리 기반 수리 모델에 비해 계산량이 훨씬 적기 때문에, 비교적 짧은 시간에 염분농도를 예측 가능하여 물리 기반 수리 모델의 보완 기법으로 연구되고 있다. 해외에서는 머신러닝 기법 기반 염분 예측 연구들이 활발히 연구되고 있으나, 대한민국의 공공데이터에 머신러닝 기법을 적용한 연구는 충분치 않다. 낙동강 하구의 환경 정보에 관한 공공데이터와 함께, 본 연구는 여러 종류의 머신러닝 기법의 염분농도에 대한 예측 성능을 측정하였다. 실험 결과에서, 결정 트리 기반의 LightGBM 알고리즘은 평균 RMSE 0.37의 예측 정확도와 타 알고리즘 대비 2-20배 빠른 학습 속도를 보여주었다. 따라서 국내 하천의 염분농도 예측에도 머신러닝 기법을 적용할 수 있다고 판단된다.

Finding Unexpected Test Accuracy by Cross Validation in Machine Learning

  • Yoon, Hoijin
    • International Journal of Computer Science & Network Security
    • /
    • 제21권12spc호
    • /
    • pp.549-555
    • /
    • 2021
  • Machine Learning(ML) splits data into 3 parts, which are usually 60% for training, 20% for validation, and 20% for testing. It just splits quantitatively instead of selecting each set of data by a criterion, which is very important concept for the adequacy of test data. ML measures a model's accuracy by applying a set of validation data, and revises the model until the validation accuracy reaches on a certain level. After the validation process, the complete model is tested with the set of test data, which are not seen by the model yet. If the set of test data covers the model's attributes well, the test accuracy will be close to the validation accuracy of the model. To make sure that ML's set of test data works adequately, we design an experiment and see if the test accuracy of model is always close to its validation adequacy as expected. The experiment builds 100 different SVM models for each of six data sets published in UCI ML repository. From the test accuracy and its validation accuracy of 600 cases, we find some unexpected cases, where the test accuracy is very different from its validation accuracy. Consequently, it is not always true that ML's set of test data is adequate to assure a model's quality.

Scoping Review of Machine Learning and Deep Learning Algorithm Applications in Veterinary Clinics: Situation Analysis and Suggestions for Further Studies

  • Kyung-Duk Min
    • 한국임상수의학회지
    • /
    • 제40권4호
    • /
    • pp.243-259
    • /
    • 2023
  • Machine learning and deep learning (ML/DL) algorithms have been successfully applied in medical practice. However, their application in veterinary medicine is relatively limited, possibly due to a lack in the quantity and quality of relevant research. Because the potential demands for ML/DL applications in veterinary clinics are significant, it is important to note the current gaps in the literature and explore the possible directions for advancement in this field. Thus, a scoping review was conducted as a situation analysis. We developed a search strategy following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses guidelines. PubMed and Embase databases were used in the initial search. The identified items were screened based on predefined inclusion and exclusion criteria. Information regarding model development, quality of validation, and model performance was extracted from the included studies. The current review found 55 studies that passed the criteria. In terms of target animals, the number of studies on industrial animals was similar to that on companion animals. Quantitative scarcity of prediction studies (n = 11, including duplications) was revealed in both industrial and non-industrial animal studies compared to diagnostic studies (n = 45, including duplications). Qualitative limitations were also identified, especially regarding validation methodologies. Considering these gaps in the literature, future studies examining the prediction and validation processes, which employ a prospective and multi-center approach, are highly recommended. Veterinary practitioners should acknowledge the current limitations in this field and adopt a receptive and critical attitude towards these new technologies to avoid their abuse.

기계학습 옵티마이저 성능 평가 (Performance Evaluation of Machine Learning Optimizers)

  • 주기훈;박치현;임현승
    • 전기전자학회논문지
    • /
    • 제24권3호
    • /
    • pp.766-776
    • /
    • 2020
  • 최근 기계학습에 대한 관심이 높아지고 연구가 활성화됨에 따라 다양한 기계학습 모델에서 최적의 하이퍼 파라미터 조합을 찾는 것이 중요해지고 있다. 본 논문에서는 다양한 하이퍼 파라미터 중에서 옵티마이저에 중점을 두고, 다양한 데이터에서 주요 옵티마이저들의 성능을 측정하고 비교하였다. 특히, 가장 기본이 되는 SGD부터 Momentum, NAG, AdaGrad, RMSProp, AdaDelta, Adam, AdaMax, Nadam까지 총 9개의 옵티마이저의 성능을 MNIST, CIFAR-10, IRIS, TITANIC, Boston Housing Price 데이터를 이용하여 비교하였다. 실험 결과, 전체적으로 Adam과 Nadam을 사용하였을 때 기계학습 모델의 손실 함숫값이 가장 빠르게 감소하는 것을 확인할 수 있었으며, F1 score 또한 높아짐을 확인할 수 있었다. 한편, AdaMax는 학습 중에 불안정한 모습을 많이 보여주었으며, AdaDelta는 다른 옵티마이저들에 비하여 수렴 속도가 느리며 성능이 낮은 것을 확인할 수 있었다.

Lessons Learned from Institutionalization of ML (Machine Learning) Supported HR Services in the Existence of Multiple Institutional Logics

  • Gyeung-min Kim;Heesun Kim
    • Asia pacific journal of information systems
    • /
    • 제33권4호
    • /
    • pp.1171-1187
    • /
    • 2023
  • This study explores how an organization has successfully implemented ML-supported HR services to resolve high employee turnover problems in the IT sector. The empirical setting of the research is where contradicting institutional logics exist among technical, HR, and business groups regarding the ML model development and use of the model predictions in HR services. Institutional framework is used to identify the roles of organizational actors and the legitimacy structures in the organizational environments that can shape or constrain the ML led organizational changes. In institutional theories, technology adoption and organizational change are not only constrained by organizational context, but also fostered through organizational actors' roles and efforts to increase the legitimacy for the change. This research found that when multiple contradicting institutional logics exist, legitimizing the establishment of an enabling environment for multiple logics to reconcile and for the project to move forward is critical. Industry-wide conditions, previous experiences with the pilot ML project, forming a TFT with clearly defined roles and responsibilities, and relevant KPIs are found to legitimize the HR team and the business division to collaborate with the technical personnel to launch ML-supported HR services.

ML-based Interactive Data Visualization System for Diversity and Fairness Issues

  • Min, Sey;Kim, Jusub
    • International Journal of Contents
    • /
    • 제15권4호
    • /
    • pp.1-7
    • /
    • 2019
  • As the recent developments of artificial intelligence, particularly machine-learning, impact every aspect of society, they are also increasingly influencing creative fields manifested as new artistic tools and inspirational sources. However, as more artists integrate the technology into their creative works, the issues of diversity and fairness are also emerging in the AI-based creative practice. The data dependency of machine-learning algorithms can amplify the social injustice existing in the real world. In this paper, we present an interactive visualization system for raising the awareness of the diversity and fairness issues. Rather than resorting to education, campaign, or laws on those issues, we have developed a web & ML-based interactive data visualization system. By providing the interactive visual experience on the issues in interesting ways as the form of web content which anyone can access from anywhere, we strive to raise the public awareness of the issues and alleviate the important ethical problems. In this paper, we present the process of developing the ML-based interactive visualization system and discuss the results of this project. The proposed approach can be applied to other areas requiring attention to the issues.

Wellness Prediction in Diabetes Mellitus Risks Via Machine Learning Classifiers

  • Saravanakumar M, Venkatesh;Sabibullah, M.
    • International Journal of Computer Science & Network Security
    • /
    • 제22권4호
    • /
    • pp.203-208
    • /
    • 2022
  • The occurrence of Type 2 Diabetes Mellitus (T2DM) is hoarding globally. All kinds of Diabetes Mellitus is controlled to disrupt over 415 million grownups worldwide. It was the seventh prime cause of demise widespread with a measured 1.6 million deaths right prompted by diabetes during 2016. Over 90% of diabetes cases are T2DM, with the utmost persons having at smallest one other chronic condition in UK. In valuation of contemporary applications of Big Data (BD) to Diabetes Medicare by sighted its upcoming abilities, it is compulsory to transmit out a bottomless revision over foremost theoretical literatures. The long-term growth in medicine and, in explicit, in the field of "Diabetology", is powerfully encroached to a sequence of differences and inventions. The medical and healthcare data from varied bases like analysis and treatment tactics which assistances healthcare workers to guess the actual perceptions about the development of Diabetes Medicare measures accessible by them. Apache Spark extracts "Resilient Distributed Dataset (RDD)", a vital data structure distributed finished a cluster on machines. Machine Learning (ML) deals a note-worthy method for building elegant and automatic algorithms. ML library involving of communal ML algorithms like Support Vector Classification and Random Forest are investigated in this projected work by using Jupiter Notebook - Python code, where significant quantity of result (Accuracy) is carried out by the models.

Evolution of the Stethoscope: Advances with the Adoption of Machine Learning and Development of Wearable Devices

  • Yoonjoo Kim;YunKyong Hyon;Seong-Dae Woo;Sunju Lee;Song-I Lee;Taeyoung Ha;Chaeuk Chung
    • Tuberculosis and Respiratory Diseases
    • /
    • 제86권4호
    • /
    • pp.251-263
    • /
    • 2023
  • The stethoscope has long been used for the examination of patients, but the importance of auscultation has declined due to its several limitations and the development of other diagnostic tools. However, auscultation is still recognized as a primary diagnostic device because it is non-invasive and provides valuable information in real-time. To supplement the limitations of existing stethoscopes, digital stethoscopes with machine learning (ML) algorithms have been developed. Thus, now we can record and share respiratory sounds and artificial intelligence (AI)-assisted auscultation using ML algorithms distinguishes the type of sounds. Recently, the demands for remote care and non-face-to-face treatment diseases requiring isolation such as coronavirus disease 2019 (COVID-19) infection increased. To address these problems, wireless and wearable stethoscopes are being developed with the advances in battery technology and integrated sensors. This review provides the history of the stethoscope and classification of respiratory sounds, describes ML algorithms, and introduces new auscultation methods based on AI-assisted analysis and wireless or wearable stethoscopes.

A machine learning informed prediction of severe accident progressions in nuclear power plants

  • JinHo Song;SungJoong Kim
    • Nuclear Engineering and Technology
    • /
    • 제56권6호
    • /
    • pp.2266-2273
    • /
    • 2024
  • A machine learning platform is proposed for the diagnosis of a severe accident progression in a nuclear power plant. To predict the key parameters for accident management including lost signals, a long short term memory (LSTM) network is proposed, where multiple accident scenarios are used for training. Training and test data were produced by MELCOR simulation of the Fukushima Daiichi Nuclear Power Plant (FDNPP) accident at unit 3. Feature variables were selected among plant parameters, where the importance ranking was determined by a recursive feature elimination technique using RandomForestRegressor. To answer the question of whether a reduced order ML model could predict the complex transient response, we performed a systematic sensitivity study for the choices of target variables, the combination of training and test data, the number of feature variables, and the number of neurons to evaluate the performance of the proposed ML platform. The number of sensitivity cases was chosen to guarantee a 95 % tolerance limit with a 95 % confidence level based on Wilks' formula to quantify the uncertainty of predictions. The results of investigations indicate that the proposed ML platform consistently predicts the target variable. The median and mean predictions were close to the true value.

머신러닝 플랫폼을 활용한 소프트웨어 교수-학습 모형 개발 (The Development of Software Teaching-Learning Model based on Machine Learning Platform)

  • 박대륜;안중민;장준혁;유원진;김우열;배영권;유인환
    • 정보교육학회논문지
    • /
    • 제24권1호
    • /
    • pp.49-57
    • /
    • 2020
  • 현대사회는 21세기 초반 지식정보사회를 지나 지능정보사회로 바뀌어 가고 있다. 본 연구에서는 지능정보사회에서 요구되는 학습자의 핵심역량을 신장시키기 위하여 인공지능의 한 분야인 머신러닝을 기반으로 소프트웨어 교육 교수-학습 모형을 개발하였다. 본 모형은 인공지능 자체에 대한 학습의 부담감을 줄이고, 머신러닝을 활용하여 문제를 해결하는 과정에서 핵심역량을 신장시키는 것에 중점을 두었다. 개발된 모형의 구체적인 단계는 문제인식 및 분석, 데이터 수집, 데이터 가공 및 선별, ML모델 훈련 및 평가, ML프로그래밍, 적용 및 해결, 공유 및 환류의 7단계로 구성되어 있다. 본 연구에서 개발한 모형을 학생과 학부모를 대상으로 적용한 결과 긍정적인 반응을 얻을 수 있었으며, 이를 통해 머신러닝 기반의 소프트웨어 교육 프로그램의 개발 및 운영에 작은 밑거름을 제시할 수 있을 것으로 기대한다.