• Title/Summary/Keyword: large Dataset

Big Data Management in Structured Storage Based on Fintech Models for IoMT using Machine Learning Techniques (기계학습법을 이용한 IoMT 핀테크 모델을 기반으로 한 구조화 스토리지에서의 빅데이터 관리 연구)

  • Kim, Kyung-Sil
    • Advanced Industrial SCIence / v.1 no.1 / pp.7-15 / 2022
  • To bring IoT into medical settings, the Internet of Medical Things (IoMT) has emerged to process large amounts of medical data. The wide range of collected medical data is stored in the cloud in a structured manner for processing. However, the huge volume of healthcare data is difficult to handle, so an appropriate scheme for structured healthcare data is needed. This paper proposes a machine learning model, MTGPLSTM, for processing structured healthcare data collected from the IoMT. The proposed model integrates a linear regression model for processing healthcare information. With the developed model, an outlier model based on the FinTech model is implemented for the evaluation and prediction of a COVID-19 healthcare dataset collected from the IoMT. The proposed MTGPLSTM model includes a regression model to predict and evaluate planning schemes for preventing the spread of infection. Performance is evaluated against several models (LR, SVR, RFR, and LSTM) alongside the proposed MTGPLSTM model, on data sizes of 1 GB, 2 GB, and 3 GB. The comparative analysis shows that the proposed MTGPLSTM model achieves ~4% lower MAPE and RMSE values for the worldwide data; for China, a minimal MAPE of 0.97 is achieved, which is ~6% lower than that of the existing models.
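
The abstract above compares models by MAPE and RMSE. As a minimal sketch of how these two error metrics are commonly computed (the function names and sample values are illustrative assumptions, not taken from the paper):

```python
import math

def mape(actual, predicted):
    """Mean absolute percentage error, in percent.

    Assumes no actual value is zero (the ratio would be undefined).
    """
    ratios = [abs((a - p) / a) for a, p in zip(actual, predicted)]
    return 100.0 * sum(ratios) / len(ratios)

def rmse(actual, predicted):
    """Root mean square error, in the same units as the data."""
    squared = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared) / len(squared))

# Hypothetical values for illustration only; not data from the paper.
actual = [100.0, 200.0, 400.0]
predicted = [110.0, 190.0, 400.0]
print(mape(actual, predicted), rmse(actual, predicted))
```

MAPE is scale-free, which makes it convenient for comparing regions with very different case counts, while RMSE penalizes large individual errors more heavily.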

Ground Subsidence Risk Grade Prediction Model Based on Machine Learning According to the Underground Facility Properties and Density (기계학습 기반 지하매설물 속성 및 밀집도를 활용한 지반함몰 위험도 예측 모델)

  • Sungyeol Lee;Jaemo Kang;Jinyoung Kim
    • Journal of the Korean GEO-environmental Society / v.24 no.4 / pp.23-29 / 2023
  • Ground subsidence occurs when damage to water supply or sewer pipes forms a waterway in the ground, soil particles move along it, a cavity forms, and the ground above collapses. Ground subsidence therefore occurs frequently in downtown areas where many underground facilities are buried, and research to predict its risk is ongoing. This study presents a ground subsidence risk prediction model for two districts of ○○ city. A dataset was constructed and preprocessed using property data of underground facilities in the target area (years in service, pipe diameter), the density of underground facilities, and ground subsidence history data. The dataset was applied to machine learning models to evaluate the reliability of the selected model, and the importance of the influencing factors used in predicting ground subsidence risk, as derived from the model, is presented.

Performance Evaluation of Object Detection Deep Learning Model for Paralichthys olivaceus Disease Symptoms Classification (넙치 질병 증상 분류를 위한 객체 탐지 딥러닝 모델 성능 평가)

  • Kyung won Cho;Ran Baik;Jong Ho Jeong;Chan Jin Kim;Han Suk Choi;Seok Won Jung;Hyun Seung Son
    • Smart Media Journal / v.12 no.10 / pp.71-84 / 2023
  • Paralichthys olivaceus accounts for more than half of Korea's aquaculture industry. However, about 25-30% of the total annual breeding volume is lost to diseases, which severely harms the economic feasibility of fish farms. For the economic growth of Paralichthys olivaceus farms, disease symptoms must be diagnosed quickly and accurately by automating diagnosis. In this study, we create training data using innovative data collection methods, data refinement algorithms, and dataset partitioning techniques, and compare the Paralichthys olivaceus disease symptom detection performance of four object detection deep learning models (YOLOv8, Swin, ViTDet, and MViTv2). The experimental findings indicate that the YOLOv8 model is superior in terms of mean average precision (mAP) and estimated time of arrival (ETA). If the performance of the AI model proposed in this study is verified, Paralichthys olivaceus farms will be able to diagnose disease symptoms in real time, and farm productivity is expected to improve greatly through rapid preventive measures based on the diagnosis results.

Dust Prediction System based on Incremental Deep Learning (증강형 딥러닝 기반 미세먼지 예측 시스템)

  • Sung-Bong Jang
    • The Journal of the Convergence on Culture Technology / v.9 no.6 / pp.301-307 / 2023
  • Deep learning requires building a deep neural network, collecting a large amount of training data, and then training the network for a long time. If training does not proceed properly or overfitting occurs, training fails. With the deep learning tools developed so far, collecting training data and training the network take a lot of time. However, with the rapid spread of mobile environments and the growth of sensor data, demand is rising quickly for real-time deep learning technology that can dramatically reduce training time. In this study, a real-time deep learning system was implemented using an Arduino system equipped with a fine dust sensor. In the implemented system, fine dust data is measured every 30 seconds, and once 120 measurements have accumulated, training is performed using the previously accumulated data together with the newly accumulated data as the dataset. The neural network consisted of one input layer, one hidden layer, and one output layer. To evaluate the performance of the implemented system, training time and root mean square error (RMSE) were measured. In the experiment, the average training error was 0.04053796, and the average training time for one epoch was about 3,447 seconds.
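
The accumulate-then-retrain loop described above can be sketched as follows. This is only an illustration under stated assumptions: the class name `IncrementalTrainer`, the `train_fn` callback, and the stand-in training function are hypothetical, and the paper's actual Arduino and neural network code is not shown in the abstract.

```python
class IncrementalTrainer:
    """Buffers sensor readings and triggers training once enough are collected."""

    def __init__(self, train_fn, batch_size=120):
        self.train_fn = train_fn      # hypothetical callback that trains on a list of readings
        self.batch_size = batch_size  # 120 measurements per round, as stated in the abstract
        self.history = []             # data already used in earlier training rounds
        self.pending = []             # new readings not yet trained on

    def add_reading(self, value):
        """Store one reading; return True if this reading triggered a training round."""
        self.pending.append(value)
        if len(self.pending) < self.batch_size:
            return False
        # Train on the previously accumulated data plus the new batch.
        dataset = self.history + self.pending
        self.train_fn(dataset)
        self.history = dataset
        self.pending = []
        return True


# Stand-in for real training: just record the dataset size of each round.
sizes = []
trainer = IncrementalTrainer(train_fn=lambda data: sizes.append(len(data)))
for i in range(250):  # 250 simulated readings (one every 30 seconds in the real system)
    trainer.add_reading(float(i))
```

Because each round reuses all earlier data, the training set grows by one batch per round, which is one plausible reason per-epoch training time matters in such a design.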

Machine-learning-based out-of-hospital cardiac arrest (OHCA) detection in emergency calls using speech recognition (119 응급신고에서 수보요원과 신고자의 통화분석을 활용한 머신 러닝 기반의 심정지 탐지 모델)

  • Jong In Kim;Joo Young Lee;Jio Chung;Dae Jin Shin;Dong Hyun Choi;Ki Hong Kim;Ki Jeong Hong;Sunhee Kim;Minhwa Chung
    • Phonetics and Speech Sciences / v.15 no.4 / pp.109-118 / 2023
  • Cardiac arrest is a critical medical emergency where immediate response is essential for patient survival. This is especially true for out-of-hospital cardiac arrest (OHCA), for which the early actions of emergency medical services significantly affect outcomes. In Korea, however, a shortage of dispatchers, who must handle a large volume of emergency calls, poses a challenge. In such situations, a machine learning-based OHCA detection program can assist responders and improve patient survival rates. In this study, we address this challenge by developing such a program. It analyzes transcripts of conversations between responders and callers to identify instances of cardiac arrest. The proposed system includes an automatic transcription module for these conversations, a text-based cardiac arrest detection model, and the server and client components needed for deployment. The experimental results demonstrate the model's effectiveness: it achieves an F1 score of 79.49% and reduces the time needed for cardiac arrest detection by 15 seconds compared to dispatchers. Despite working with a limited dataset, this research highlights the potential of a cardiac arrest detection program as a valuable tool for responders, ultimately enhancing cardiac arrest survival rates.
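
The F1 metric cited above is the harmonic mean of precision and recall. A minimal sketch of the calculation from confusion-matrix counts (the function name and the counts are hypothetical, not results from the paper):

```python
def f1_score(tp, fp, fn):
    """F1 from true positives, false positives, and false negatives.

    Returns 0.0 when precision and recall are both undefined or zero.
    """
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall == 0.0:
        return 0.0
    return 2.0 * precision * recall / (precision + recall)


# Hypothetical counts for illustration: 8 arrests detected, 2 false alarms, 2 missed.
example = f1_score(tp=8, fp=2, fn=2)
```

F1 is a common choice when positive cases such as cardiac arrest calls are rare, because plain accuracy would reward a model that never raises an alert.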

Empirical Analysis of the Influence of ICT SMEs' R&D Resources on Corporate Performance (ICT 중소기업의 연구개발 자원이 기업성과에 미치는 영향에 관한 실증연구)

  • Jong Yoon Won;Kun Chang Lee
    • Information Systems Review / v.23 no.3 / pp.1-23 / 2021
  • The national economic policy paradigm is constantly changing with the global business environment. Within it, fostering SMEs is a core policy of many developed countries. The growth of SMEs contributes to job creation and the development of local communities in an era of jobless growth. In particular, the growth of SMEs is the foundation for their growth into mid-sized and large enterprises. The growth of SMEs therefore plays an important role in the national economy. Information and communication technology (ICT) has become much more important with the emergence of the 4th industrial revolution, and the growth of ICT SMEs is an asset for the nation's future. This study therefore examines and verifies the main factors affecting the performance of ICT SMEs from the perspective of their R&D resources. Using a dataset of 1,999 SMEs, an empirical analysis was performed to investigate the influence of R&D resources on corporate performance. The results are as follows. First, based on the resource-based theory, ICT SMEs' R&D investment, R&D manpower, and government support policies were found to have a positive effect on securing a company's competitive advantage. Second, the level of the product was found to have a positive effect on the company's performance. Finally, M&A and technology acquisition strategies were found to differ according to the growth stage of the company. Therefore, government support policy and investment in internal R&D personnel are the main factors in achieving technological innovation and corporate performance for ICT SMEs. In addition, technology acquisition strategies were found to differ depending on the growth stage of the company.

Research on Characterizing Urban Color Analysis based on Tourists-Shared Photos and Machine Learning - Focused on Dali City, China - (관광객 공유한 사진 및 머신 러닝을 활용한 도시 색채 특성 분석 연구 - 중국 대리시를 대상으로 -)

  • Yin, Xiaoyan;Jung, Taeyeol
    • Journal of the Korean Institute of Landscape Architecture / v.52 no.2 / pp.39-50 / 2024
  • Color is an essential visual element that significantly affects the formation of a city's image and people's perceptions. Quantitative analysis of color in urban environments is a complex process that has been difficult to implement in the past. However, with recent rapid advances in machine learning, it has become possible to analyze city colors using photos shared by tourists. This study selected Dali City, a popular tourist destination in China, as a case study. Photos of Dali City shared by tourists were collected, and a method for measuring city colors at large scale was explored by combining machine learning techniques. Specifically, the DeepLabv3+ model was first applied to perform semantic segmentation of the tourist-shared photos based on the ADE20K dataset, thereby separating the artificial elements in the photos. Next, the K-means clustering algorithm was used to extract colors from the artificial elements in Dali City, and an adjacency matrix was constructed to analyze the correlations between the dominant colors. The results indicate that orange-gray accounts for the highest percentage of the main colors of the artificial elements in Dali City. Furthermore, gray tones are often used in combination with other colors. The results also indicate that local ethnic and Buddhist cultures influence the color characteristics of artificial elements in Dali City. This research provides a new method of color analysis, and the results not only help Dali City shape an urban color image that meets the expectations of tourists but also provide reference material for future urban color planning in Dali City.
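
The K-means color extraction step mentioned above can be illustrated with a small pure-Python sketch. It is an assumption-laden illustration, not the authors' pipeline: the `kmeans` function, its deterministic initialization, and the six RGB pixels are all hypothetical, and a real analysis would run on segmented image pixels with a library implementation.

```python
def kmeans(points, k, iterations=20):
    """Cluster RGB tuples into k groups; returns (centers, clusters)."""
    # Deterministic initialization for reproducibility: take k points
    # spread evenly through the input list (an illustrative choice).
    step = len(points) // k
    centers = [points[i * step] for i in range(k)]
    clusters = [[] for _ in range(k)]
    for _ in range(iterations):
        # Assignment step: each pixel joins its nearest center (squared Euclidean distance).
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(
                range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centers[c])),
            )
            clusters[nearest].append(p)
        # Update step: move each center to the mean of its cluster.
        centers = [
            tuple(sum(channel) / len(cluster) for channel in zip(*cluster))
            if cluster else centers[i]
            for i, cluster in enumerate(clusters)
        ]
    return centers, clusters


# Hypothetical pixels: three orange-ish and three gray, for illustration only.
pixels = [(200, 120, 60), (210, 130, 70), (190, 110, 50),
          (128, 128, 128), (120, 120, 120), (136, 136, 136)]
centers, clusters = kmeans(pixels, k=2)
# Share of pixels per dominant color.
shares = [len(c) / len(pixels) for c in clusters]
```

The cluster centers serve as the dominant colors, and the per-cluster pixel shares give the kind of percentage the abstract reports for orange-gray.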

Using noise filtering and sufficient dimension reduction method on unstructured economic data (노이즈 필터링과 충분차원축소를 이용한 비정형 경제 데이터 활용에 대한 연구)

  • Jae Keun Yoo;Yujin Park;Beomseok Seo
    • The Korean Journal of Applied Statistics / v.37 no.2 / pp.119-138 / 2024
  • Text indicators are increasingly valuable in economic forecasting, but are often hindered by noise and high dimensionality. This study aims to explore post-processing techniques, specifically noise filtering and dimensionality reduction, to normalize text indicators and enhance their utility through empirical analysis. Predictive target variables for the empirical analysis include monthly leading index cyclical variations, BSI (business survey index) all-industry sales performance, BSI all-industry sales outlook, as well as quarterly real GDP SA (seasonally adjusted) growth rate and real GDP YoY (year-on-year) growth rate. This study explores the Hodrick-Prescott filter, which is widely used in econometrics for noise filtering, and employs sufficient dimension reduction, a nonparametric dimensionality reduction methodology, in conjunction with unstructured text data. The analysis results reveal that noise filtering of text indicators significantly improves predictive accuracy for both monthly and quarterly variables, particularly when the dataset is large. Moreover, this study demonstrates that applying dimensionality reduction further enhances predictive performance. These findings imply that post-processing techniques, such as noise filtering and dimensionality reduction, are crucial for enhancing the utility of text indicators and can contribute to improving the accuracy of economic forecasts.
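
For readers unfamiliar with the Hodrick-Prescott filter named above, the trend is the series that minimizes the squared deviation from the data plus a penalty `lamb` on squared second differences, which reduces to solving a linear system. The sketch below is a plain-Python illustration under assumptions: the function name `hp_filter` and the default `lamb=1600.0` (a conventional choice for quarterly data) are not taken from the paper, and the dense solver is only suitable for short series.

```python
def hp_filter(y, lamb=1600.0):
    """Split a series into (trend, cycle) by solving (I + lamb * D^T D) trend = y,
    where D is the second-difference operator."""
    n = len(y)
    # Start from the identity matrix, then add lamb * D^T D row by row.
    A = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    for r in range(n - 2):
        coeffs = {r: 1.0, r + 1: -2.0, r + 2: 1.0}  # one row of D
        for i, ci in coeffs.items():
            for j, cj in coeffs.items():
                A[i][j] += lamb * ci * cj
    # Gaussian elimination with partial pivoting.
    b = [float(v) for v in y]
    for col in range(n):
        pivot = max(range(col, n), key=lambda row: abs(A[row][col]))
        A[col], A[pivot] = A[pivot], A[col]
        b[col], b[pivot] = b[pivot], b[col]
        for row in range(col + 1, n):
            factor = A[row][col] / A[col][col]
            if factor:
                for c in range(col, n):
                    A[row][c] -= factor * A[col][c]
                b[row] -= factor * b[col]
    # Back substitution.
    trend = [0.0] * n
    for row in range(n - 1, -1, -1):
        tail = sum(A[row][c] * trend[c] for c in range(row + 1, n))
        trend[row] = (b[row] - tail) / A[row][row]
    cycle = [yi - ti for yi, ti in zip(y, trend)]
    return trend, cycle


# Hypothetical noisy indicator values, for illustration only.
trend, cycle = hp_filter([1.0, 3.0, 2.0, 5.0, 4.0, 6.0, 5.0, 8.0], lamb=10.0)
```

A larger `lamb` gives a smoother trend; a perfectly linear series passes through unchanged because its second differences are zero.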

A Basic Study on User Experience Evaluation Based on User Experience Hierarchy Using ChatGPT 4.0 (챗지피티 4.0을 활용한 사용자 경험 계층 기반 사용자 경험 평가에 관한 기초적 연구)

  • Soomin Han;Jae Wan Park
    • The Journal of the Convergence on Culture Technology / v.10 no.2 / pp.493-498 / 2024
  • With the rapid advancement of generative artificial intelligence technology, there is growing interest in how to use it in practical applications. The importance of prompt engineering for generating results that meet user demands is also being newly highlighted. Exploring the new possibilities of generative AI can hold significant value. This study aims to use ChatGPT 4.0, a leading generative AI, to propose an effective method for evaluating user experience through the analysis of online customer review data. The user experience evaluation method was based on the six-layer elements of user experience: 'functionality', 'reliability', 'usability', 'convenience', 'emotion', and 'significance'. For this study, a literature review was conducted to deepen the understanding of prompt engineering and to clarify the concept of the user experience hierarchy. Based on this, prompts were crafted, and experiments on the user experience evaluation method were carried out by analyzing collected online customer review data. We show that, when provided with accurate definitions and descriptions of the classification processes for user experience factors, ChatGPT demonstrated excellent performance in evaluating user experience. However, it was also found that, due to time constraints, there were limitations in analyzing large volumes of data. By introducing and proposing a method to use ChatGPT 4.0 for user experience evaluation, we expect to contribute to the advancement of the UX field.

Textbook Outcome of Delta-Shaped Anastomosis in Minimally Invasive Distal Gastrectomy for Gastric Cancer in 4,505 Consecutive Patients

  • Seul-Gi Oh;Suin Lee;Ba Ool Seong;Chang Seok Ko;Sa-Hong Min;Chung Sik Gong;Beom Su Kim;Moon-Won Yoo;Jeong Hwan Yook;In-Seob Lee
    • Journal of Gastric Cancer / v.24 no.3 / pp.341-352 / 2024
  • Purpose: Textbook outcome is a comprehensive measure used to assess surgical quality and is increasingly being recognized as a valuable evaluation tool. Delta-shaped anastomosis (DA), an intracorporeal gastroduodenostomy, is a viable option for minimally invasive distal gastrectomy in patients with gastric cancer. This study aims to evaluate the surgical outcomes and calculate the textbook outcome of DA. Materials and Methods: In this retrospective study, the records of 4,902 patients who underwent minimally invasive distal gastrectomy with DA between 2009 and 2020 were reviewed. The data were categorized into three phases to analyze the trends over time. Surgical outcomes, including the operation time, length of post-operative hospital stay, and complication rates, were assessed, and the textbook outcome was calculated. Results: Among 4,505 patients, the textbook outcome is achieved in 3,736 (82.9%). Post-operative complications affect the textbook outcome most significantly (91.9%). The highest textbook outcome is achieved in phase 2 (85.0%), which surpasses the rates in phase 1 (81.7%) and phase 3 (82.3%). The post-operative complication rate within 30 days after surgery is 8.7%, and the rate of major complications exceeding Clavien-Dindo classification grade 3 is 2.4%. Conclusions: Based on the outcomes of a large dataset, DA can be considered safe and feasible for gastric cancer.