• Title/Summary/Keyword: 빅데이터 기반 모델링

Search Result 103, Processing Time 0.027 seconds

Internet of Things (IoT) Based Modeling for Dynamic Security in Nuclear Systems with Data Mining Strategy (데이터 마이닝 전략을 사용하여 원자력 시스템의 동적 보안을 위한 사물 인터넷 (IoT) 기반 모델링)

  • Jang, Kyung Bae;Baek, Chang Hyun;Kim, Jong Min;Baek, Hyung Ho;Woo, Tae Ho
    • Journal of Internet of Things and Convergence
    • /
    • v.7 no.1
    • /
    • pp.9-19
    • /
    • 2021
  • The data mining design incorporated with big data based cloud computing system is investigated for the nuclear terrorism prevention where the conventional physical protection system (PPS) is modified. The networking of terror related bodies is modeled by simulation study for nuclear forensic incidents. It is needed for the government to detect the terrorism and any attempts to attack to innocent people without illegal tapping. Although the mathematical algorithm of the study can't give the exact result of the terror incident, the potential possibility could be obtained by the simulations. The result shows the shape oscillation by time. In addition, the integration of the frequency of each value can show the degree of the transitions of the results. The value increases to -2.61741 in 63.125th hour. So, the terror possibility is highest in later time.

Socio-National Issues Detection Modeling based on Domain Knowledge - Focusing on the Issue of Increase in Domestic Inflow Infectious Diseases (도메인 지식 기반 이슈 탐지 모델링 - 해외 발생 감염병 국내 유입 이슈를 중심으로)

  • Hwang, Mi-Nyeong;Lee, Seungwoo
    • The Journal of the Korea Contents Association
    • /
    • v.17 no.12
    • /
    • pp.158-168
    • /
    • 2017
  • As the big data technologies advance, there is an increasing interest in systematic methodologies for data-based policy determination especially in the public health area. This study proposes a method to develop an issue detection model through the collaboration with domain experts in order to intelligently detect major socio-national issues on infectious diseases based on data. At first, the factors influencing the 'domestic inflow of foreign infectious diseases' are determined and variables representing the factors are set. Thereafter, by using system dynamics methods, the causal analysis is made to find causal map indicating main influential factors. In this process, an empirical modeling is conducted through collaboration between data analysts and experts in the infectious disease domain. The proposed issue detection approach based on domain knowledges will make it possible to make a decision on policies more efficiently if the detection system is capable of continuos monitoring of the related issues.

An Empirical Study on Hybrid Recommendation System Using Movie Lens Data (무비렌즈 데이터를 이용한 하이브리드 추천 시스템에 대한 실증 연구)

  • Kim, Dong-Wook;Kim, Sung-Geun;Kang, Juyoung
    • The Journal of Bigdata
    • /
    • v.2 no.1
    • /
    • pp.41-48
    • /
    • 2017
  • Recently, the popularity of the recommendation system and the evaluation of the performance of the algorithm of the recommendation system have become important. In this study, we used modeling and RMSE to verify the effectiveness of various algorithms in movie data. The data of this study is based on user-based collaborative filtering using Pearson correlation coefficient, item-based collaborative filtering using cosine correlation coefficient, and item-based collaborative filtering model using singular value decomposition. As a result of evaluating the scores with three recommendation models, we found that item-based collaborative filtering accuracy is much higher than user-based collaborative filtering, and it is found that matrix recommendation is better when using matrix decomposition.

  • PDF

Topic Modeling on Research Trends of Industry 4.0 Using Text Mining (텍스트 마이닝을 이용한 4차 산업 연구 동향 토픽 모델링)

  • Cho, Kyoung Won;Woo, Young Woon
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.23 no.7
    • /
    • pp.764-770
    • /
    • 2019
  • In this research, text mining techniques were used to analyze the papers related to the "4th Industry". In order to analyze the papers, total of 685 papers were collected by searching with the keyword "4th industry" in Korea Journal Index(KCI) from 2016 to 2019. We used Python-based web scraping program to collect papers and use topic modeling techniques based on LDA algorithm implemented in R language for data analysis. As a result of perplexity analysis on the collected papers, nine topics were determined optimally and nine representative topics of the collected papers were extracted using the Gibbs sampling method. As a result, it was confirmed that artificial intelligence, big data, Internet of things(IoT), digital, network and so on have emerged as the major technologies, and it was confirmed that research has been conducted on the changes due to the major technologies in various fields related to the 4th industry such as industry, government, education field, and job.

Analysis of domestic and foreign future automobile research trends based on topic modeling (토픽모델링 기반의 국내외 미래 자동차 연구동향 비교 분석: CASE 키워드 중심으로)

  • Jeong, Ho Jeong;Kim, Keun-Wook;Kim, Na-Gyeong;Chang, Won-Jun;Jeong, Won-Oong;Park, Dae-Yeong
    • Journal of Digital Convergence
    • /
    • v.20 no.5
    • /
    • pp.463-476
    • /
    • 2022
  • After industrialization in the past, the automobile industry has continued to grow centered on internal combustion engines, but is facing a major change with the recent 4th industrial revolution. Most companies are preparing for the transition to electric vehicles and autonomous driving. Therefore, in this study, topic modeling was performed based on LDA algorithm by collecting 4,002 domestic papers and 68,372 overseas papers that contain keywords related to CASE (Connectivity, Autonomous, Sharing, Electrification), which represent future automobile trends. As a result of the analysis, it was found that domestic research mainly focuses on macroscopic aspects such as traffic infrastructure, urban traffic efficiency, and traffic policy. Through this, the government's technical support for MaaS (Mobility-as-a-Service) is required in the domestic shared car sector, and the need for data opening by means of transportation was presented. It is judged that these analysis results can be used as basic data for the future automobile industry.

Performance Comparison of Traffic-Dependent Displacement Estimation Model of Gwangan Bridge by Improvement Technique (개선 기법에 따른 광안대교의 교통량 의존 변위 추정 모델 성능 비교)

  • Kim, Soo-Yong;Shin, Sung-Woo;Park, Ji-Hyun
    • Journal of the Korea institute for structural maintenance and inspection
    • /
    • v.23 no.4
    • /
    • pp.120-130
    • /
    • 2019
  • In this study, based on the correlation between traffic volume data and vertical displacement data developed in previous research using the bridge maintenance big data of 2006, the vertical displacement estimation model using the traffic volume data of Gwangan Bridge for 10 years A comparison of the performance of the developed model with the current applicability is presented. The present applicability of the developed model is analyzed that the estimated displacement is similar to the actual displacement and that the displacement estimation performance of the model based on the structured regression analysis and the principal component analysis is not significantly different from each other. In conclusion, the vertical displacement estimation model using the traffic volume data developed by this study can be effectively used for the analysis of the behavior according to the traffic load of Gwangan Bridge.

Modeling Framework for Continuous Dynamic Systems Using Machine Learning of Hypothetical Model (가설적 모델의 기계학습을 이용한 연속시간 동적시스템 모델링 프레임워크)

  • Hae Sang Song;Tag Gon Kim
    • Journal of the Korea Society for Simulation
    • /
    • v.32 no.1
    • /
    • pp.13-21
    • /
    • 2023
  • This paper proposes a method of automatically generating a model through a machine learning technique by setting a hypothetical model in the form of a gray box or black box with unknown parameters, when the big data of the actual system is given. We implements the proposed framework and conducts experiments to find an appropriate model among various hypothesis models and compares the cost and fitness of them. As a result we find that the proposed framework works well with continuous systems that could be modeled with ordinary differential equation. This technique is expected to be used well for the purpose of automatically updating the consistency of the digital twin model or predicting the output for new inputs using recently generated big data.

Topic Modeling-Based Domestic and Foreign Public Data Research Trends Comparative Analysis (토픽 모델링 기반의 국내외 공공데이터 연구 동향 비교 분석)

  • Park, Dae-Yeong;Kim, Deok-Hyeon;Kim, Keun-Wook
    • Journal of Digital Convergence
    • /
    • v.19 no.2
    • /
    • pp.1-12
    • /
    • 2021
  • With the recent 4th Industrial Revolution, the growth and value of big data are continuously increasing, and the government is also actively making efforts to open and utilize public data. However, the situation still does not reach the level of demand for public data use by citizens, At this point, it is necessary to identify research trends in the public data field and seek directions for development. In this study, in order to understand the research trends related to public data, the analysis was performed using topic modeling, which is mainly used in text mining techniques. To this end, we collected papers containing keywords of 'Public data' among domestic and foreign research papers (1,437 domestically, 9,607 overseas) and performed topic modeling based on the LDA algorithm, and compared domestic and foreign public data research trends. After analysis, policy implications were presented. Looking at the time series by topic, research in the fields of 'personal information protection', 'public data management', and 'urban environment' has increased in Korea. Overseas, it was confirmed that research in the fields of 'urban policy', 'cell biology', 'deep learning', and 'cloud·security' is active.

Study on the Modeling of Health Medical Examination Knowledge Base Construction using Data Analysis based on AI (인공지능 기반의 데이터 분석을 적용한 건강검진 지식 베이스 구축 모델링 연구)

  • Kim, Bong-Hyun
    • Journal of Convergence for Information Technology
    • /
    • v.10 no.6
    • /
    • pp.35-40
    • /
    • 2020
  • As we enter the society of the future, efforts to increase healthy living are a major area of concern for modern people. In particular, the development of technology for a healthy life that combines ICT technology with a competitive healthcare industry environment is becoming the next growth engine. Therefore, in this paper, artificial intelligence-based data analysis of the examination results was applied in the health examination process. Through this, a research was conducted to build a knowledge base modeling that can improve the reliability of the overall judgment. To this end, an algorithm was designed through deep learning analysis to calculate and verify the test result index. Then, the modeling that provides comprehensive examination information through judgment knowledge was studied. Through the application of the proposed modeling, it is possible to analyze and utilize big data on national health, so it can be expected to reduce medical expenses and increase health.

Prediction of Agricultural Purchases Using Structured and Unstructured Data: Focusing on Paprika (정형 및 비정형 데이터를 이용한 농산물 구매량 예측: 파프리카를 중심으로)

  • Somakhamixay Oui;Kyung-Hee Lee;HyungChul Rah;Eun-Seon Choi;Wan-Sup Cho
    • The Journal of Bigdata
    • /
    • v.6 no.2
    • /
    • pp.169-179
    • /
    • 2021
  • Consumers' food consumption behavior is likely to be affected not only by structured data such as consumer panel data but also by unstructured data such as mass media and social media. In this study, a deep learning-based consumption prediction model is generated and verified for the fusion data set linking structured data and unstructured data related to food consumption. The results of the study showed that model accuracy was improved when combining structured data and unstructured data. In addition, unstructured data were found to improve model predictability. As a result of using the SHAP technique to identify the importance of variables, it was found that variables related to blog and video data were on the top list and had a positive correlation with the amount of paprika purchased. In addition, according to the experimental results, it was confirmed that the machine learning model showed higher accuracy than the deep learning model and could be an efficient alternative to the existing time series analysis modeling.