• Title/Summary/Keyword: AI. Big data

Search Result 501, Processing Time 0.024 seconds

Development of Type 2 Prediction Prediction Based on Big Data (빅데이터 기반 2형 당뇨 예측 알고리즘 개발)

  • Hyun Sim;HyunWook Kim
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.18 no.5
    • /
    • pp.999-1008
    • /
    • 2023
  • Early prediction of chronic diseases such as diabetes is an important issue, and improving the accuracy of diabetes prediction is especially important. Various machine learning and deep learning-based methodologies are being introduced for diabetes prediction, but these technologies require large amounts of data for better performance than other methodologies, and the learning cost is high due to complex data models. In this study, we aim to verify the claim that DNN using the pima dataset and k-fold cross-validation reduces the efficiency of diabetes diagnosis models. Machine learning classification methods such as decision trees, SVM, random forests, logistic regression, KNN, and various ensemble techniques were used to determine which algorithm produces the best prediction results. After training and testing all classification models, the proposed system provided the best results on XGBoost classifier with ADASYN method, with accuracy of 81%, F1 coefficient of 0.81, and AUC of 0.84. Additionally, a domain adaptation method was implemented to demonstrate the versatility of the proposed system. An explainable AI approach using the LIME and SHAP frameworks was implemented to understand how the model predicts the final outcome.

A Quantitative Analysis on Machine Learning and Smart Farm with Bibliographic Data from 2013 to 2023

  • Yong Sauk Hau
    • International Journal of Internet, Broadcasting and Communication
    • /
    • v.16 no.3
    • /
    • pp.388-393
    • /
    • 2024
  • The convergence of machine learning and smart farm is becoming more and more important. The purpose of this research is to quantitatively analyze machine learning and smart farm with bibliographic data from 2013 to 2023. This study analyzed the 251 articles, filtered from the Web of Science, with regard to the article publication trend, the article citation trend, the top 10 research area, and the top 10 keywords representing the articles. The quantitative analysis results reveal the four points: First, the number of article publications in machine learning and smart farm continued growing from 2016. Second, the article citations in machine learning and smart farm drastically increased since 2018. Third, Computer Science, Engineering, Agriculture, Telecommunications, Chemistry, Environmental Sciences Ecology, Material Science, Instruments Instrumentation, Science Technology Other Topics, and Physics are top 10 research areas. Fourth, it is 'machine learning', 'smart farming', 'internet of things', 'precision agriculture', 'deep learning', 'agriculture', 'big data', 'machine', 'smart' and 'smart agriculture' that are the top 10 keywords composing authors' keywords in the articles in machine learning and smart farm from 2013 to 2023.

Finding the Optimal Data Classification Method Using LDA and QDA Discriminant Analysis

  • Kim, SeungJae;Kim, SungHwan
    • Journal of Integrative Natural Science
    • /
    • v.13 no.4
    • /
    • pp.132-140
    • /
    • 2020
  • With the recent introduction of artificial intelligence (AI) technology, the use of data is rapidly increasing, and newly generated data is also rapidly increasing. In order to obtain the results to be analyzed based on these data, the first thing to do is to classify the data well. However, when classifying data, if only one classification technique belonging to the machine learning technique is applied to classify and analyze it, an error of overfitting can be accompanied. In order to reduce or minimize the problems caused by misclassification of the classification system such as overfitting, it is necessary to derive an optimal classification by comparing the results of each classification by applying several classification techniques. If you try to interpret the data with only one classification technique, you will have poor reasoning and poor predictions of results. This study seeks to find a method for optimally classifying data by looking at data from various perspectives and applying various classification techniques such as LDA and QDA, such as linear or nonlinear classification, as a process before data analysis in data analysis. In order to obtain the reliability and sophistication of statistics as a result of big data analysis, it is necessary to analyze the meaning of each variable and the correlation between the variables. If the data is classified differently from the hypothesis test from the beginning, even if the analysis is performed well, unreliable results will be obtained. In other words, prior to big data analysis, it is necessary to ensure that data is well classified to suit the purpose of analysis. This is a process that must be performed before reaching the result by analyzing the data, and it may be a method of optimal data classification.

Valid Data Conditions and Discrimination for Machine Learning: Case study on Dataset in the Public Data Portal (기계학습에 유효한 데이터 요건 및 선별: 공공데이터포털 제공 데이터 사례를 통해)

  • Oh, Hyo-Jung;Yun, Bo-Hyun
    • Journal of Internet of Things and Convergence
    • /
    • v.8 no.1
    • /
    • pp.37-43
    • /
    • 2022
  • The fundamental basis of AI technology is learningable data. Recently, the types and amounts of data collected and produced by the government or private companies are increasing exponentially, however, verified data that can be used for actual machine learning has not yet led to it. This study discusses the conditions that data actually can be used for machine learning should meet, and identifies factors that degrade data quality through case studies. To this end, two representative cases of developing a prediction model using public big data was selected, and data for actual problem solving was collected from the public data portal. Through this, there is a difference from the results of applying valid data screening criteria and post-processing. The ultimate purpose of this study is to argue the importance of data quality management that must be most fundamentally preceded before the development of machine learning technology, which is the core of artificial intelligence, and accumulating valid data.

Blockchain-based safety MyData Service Model (블록체인 기반 안전한 마이데이터 서비스 모델)

  • Lee, Kwang Hyoung;Jung, Young Hoon
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.21 no.12
    • /
    • pp.873-879
    • /
    • 2020
  • The importance of data as a core resource of the 4th industrial revolution is emerging, and companies illegally collect and use personal data. In the financial sector, active research is conducted to safely manage personal data and provide better services using blockchain, big data, and AI technology. In this paper, we propose a system that can safely manage personal data by using blockchain technology, which can be used without changing the existing system. The composition of this system consists of a blockchain, blockchain linkages, a service provider, and a user (i.e., an app). Blockchain can be used regardless of its type and form, and services are provided by classifying blockchains and services in the blockchain linkages. Service providers can access personal data only after requesting and receiving delegated permission from users. Existent MyData services store all data in a user's mobile phone, so information may get leaked due to jailbreaks or rooting. But in the proposed system, personal data are stored in blockchain so information leakage can be prevented. In the future, we will study ways to provide customized services using personal data stored in blockchain.

The Analysis of Patent Trends and Radiation Convergence Technology (방사선 융합기술과 특허 동향 분석)

  • Park, Jang-Hoon;Ock, Young Seok
    • Journal of the Korean Society of Radiology
    • /
    • v.13 no.5
    • /
    • pp.785-790
    • /
    • 2019
  • Convergence and advancement between technologies such as Artificial Intelligence, Big Data, and the Internet of Things have a significant impact on the regional flagship industry. All technical fields are used as a converged technology by connecting between technology and industry. In order to understanding the recent technical trend, it is possible to easily realized the technical trend research and analysis through keyword search using patent information. The purpose of this study is to identify patent trends applied to convergence technology in the 4th Industrial Revolution age in radiation technology development and to present patent trends and analysis for strengthening and utilizing radiation-related industrial technology competitiveness and to apply them to demand technology and forecast future promising technologies.

A study on the smart band, technologies, and case studies for the vulnerable group. - The Digital Age and the Fourth Industrial Revolution.

  • YU, Kyoungsung;SHIN, Seung-Jung
    • International Journal of Internet, Broadcasting and Communication
    • /
    • v.14 no.1
    • /
    • pp.182-187
    • /
    • 2022
  • This study aims to study non-rechargeable wrist-type smart bands for those vulnerable to the digital environment. The transition to the digital age means improving the efficiency of human life and the convenience of management. In the digital age, it can be a very convenient infrastructure for the digital generation, but otherwise, it can cause inconvenience. COVID-19 is spreading non-face-to-face culture. The reality is that the vulnerable are complaining of discomfort in non-face-to-face culture. The core of the digital environment is smartphones. Digital life is spreading around smartphones. Technology that drives the digital environment is the core technology of the Fourth Industrial Revolution. The technologies are lot, big data, Blockchain, Smart Mobility, and AI. Related technologies based on these technologies include digital ID cards, digital keys, and nfc technologies. Non-rechargeable wrist-type smart bands based on related technologies can be conceptualized. Through these technologies, blind people can easily access books and manage their ID cards conveniently and efficiently. In particular, access authentication is required wherever you go due to COVID-19, which can be used as a useful tool for the elderly who feel uncomfortable using smartphones. It can also eliminate the inconvenience of the elderly finding or losing their keys.

A Study on the Web Building Assistant System Using GUI Object Detection and Large Language Model (웹 구축 보조 시스템에 대한 GUI 객체 감지 및 대규모 언어 모델 활용 연구)

  • Hyun-Cheol Jang;Hyungkuk Jang
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2024.05a
    • /
    • pp.830-833
    • /
    • 2024
  • As Large Language Models (LLM) like OpenAI's ChatGPT[1] continue to grow in popularity, new applications and services are expected to emerge. This paper introduces an experimental study on a smart web-builder application assistance system that combines Computer Vision with GUI object recognition and the ChatGPT (LLM). First of all, the research strategy employed computer vision technology in conjunction with Microsoft's "ChatGPT for Robotics: Design Principles and Model Abilities"[2] design strategy. Additionally, this research explores the capabilities of Large Language Model like ChatGPT in various application design tasks, specifically in assisting with web-builder tasks. The study examines the ability of ChatGPT to synthesize code through both directed prompts and free-form conversation strategies. The researchers also explored ChatGPT's ability to perform various tasks within the builder domain, including functions and closure loop inferences, basic logical and mathematical reasoning. Overall, this research proposes an efficient way to perform various application system tasks by combining natural language commands with computer vision technology and LLM (ChatGPT). This approach allows for user interaction through natural language commands while building applications.

Quality management direction in the 4th industrial revolution era (제4차 산업혁명시대에서의 품질경영 방향)

  • Baik, Jaiwook
    • Industry Promotion Research
    • /
    • v.5 no.4
    • /
    • pp.1-13
    • /
    • 2020
  • Since the 4th industrial revolution was thrown into the world at the Davos World Economic Forum in January 2016, the world has been undergoing major social and economic changes. In this study, the direction of quality management in the 4th industrial revolution era was examined. First, in all the major countries the industrial structural changes and smart business models were confirmed due to the convergence of new ICT such as IoT, robotics, 3D printing, big data, and AI with the existing technologies and industries. Second, we found that although the core technology level of the 4th industrial revolution in Korea is not as good as that of advanced countries, we have been working on expanding smart production methods and creating new industries by utilizing new ICT. Finally, it was confirmed that quality management is a real-time implementation of new ICT that reflects the needs of the market in real time based on big data from the planning and design stage of products or services.

Development of data collection education programs for lower grades in elementary school students (초등학교 저학년을 위한 데이터 수집 교육 프로그램 개발)

  • Yi, Seul;Ma, Daisung
    • 한국정보교육학회:학술대회논문집
    • /
    • 2021.08a
    • /
    • pp.275-281
    • /
    • 2021
  • Much of our lives are closely related to artificial intelligence, and society is changing more rapidly. Reflecting this era, the need for artificial intelligence education has emerged and various learning methods have been proposed, but guidance on artificial intelligence teaching and learning activities for lower grades elementary school students is insufficient. Therefore, in this study, the data collection education program for the lower grades of elementary school was developed based on the contents standards of the Korea Foundation for the Advancement of Science & Creativity. Focusing on the principles of artificial intelligence and the detailed data area of the utilization area, the focus was on expressing numbers and letters in various ways, such as colors and pictures, and finding various types of data in life to learn the principles of artificial intelligence. Through this program, it is expected that lower-grade elementary school students will be able to understand the importance of data collection in artificial intelligence through the process of knowing about data and collecting sound, picture, and text data.

  • PDF