• Title/Summary/Keyword: Classification accuracy

Search Result 3,065, Processing Time 0.029 seconds

Performance comparison on vocal cords disordered voice discrimination via machine learning methods (기계학습에 의한 후두 장애음성 식별기의 성능 비교)

  • Cheolwoo Jo;Soo-Geun Wang;Ickhwan Kwon
    • Phonetics and Speech Sciences
    • /
    • v.14 no.4
    • /
    • pp.35-43
    • /
    • 2022
  • This paper studies how to improve the identification rate of laryngeal disability speech data by convolutional neural network (CNN) and machine learning ensemble learning methods. In general, the number of laryngeal dysfunction speech data is small, so even if identifiers are constructed by statistical methods, the phenomenon caused by overfitting depending on the training method can lead to a decrease the identification rate when exposed to external data. In this work, we try to combine results derived from CNN models and machine learning models with various accuracy in a multi-voting manner to ensure improved classification efficiency compared to the original trained models. The Pusan National University Hospital (PNUH) dataset was used to train and validate algorithms. The dataset contains normal voice and voice data of benign and malignant tumors. In the experiment, an attempt was made to distinguish between normal and benign tumors and malignant tumors. As a result of the experiment, the random forest method was found to be the best ensemble method and showed an identification rate of 85%.

Development of Holter ECG Monitor with Improved ECG R-peak Detection Accuracy (R 피크 검출 정확도를 개선한 홀터 심전도 모니터의 개발)

  • Junghyeon Choi;Minho Kang;Junho Park;Keekoo Kwon;Taewuk Bae;Jun-Mo Park
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.23 no.2
    • /
    • pp.62-69
    • /
    • 2022
  • An electrocardiogram (ECG) is one of the most important biosignals, and in particular, continuous ECG monitoring is very important in patients with arrhythmia. There are many different types of arrhythmia (sinus node, sinus tachycardia, atrial premature beat (APB), and ventricular fibrillation) depending on the cause, and continuous ECG monitoring during daily life is very important for early diagnosis of arrhythmias and setting treatment directions. The ECG signal of arrhythmia patients is very unstable, and it is difficult to detect the R-peak point, which is a key feature for automatic arrhythmias detection. In this study, we develped a continuous measuring Holter ECG monitoring device and software for analysis and confirmed the utility of R-peak of the ECG signal with MIT-BIH arrhythmia database. In future studies, it needs the validation of algorithms and clinical data for morphological classification and prediction of arrhythmias due to various etiologies.

Development of machine learning model for reefer container failure determination and cause analysis with unbalanced data (불균형 데이터를 갖는 냉동 컨테이너 고장 판별 및 원인 분석을 위한 기계학습 모형 개발)

  • Lee, Huiwon;Park, Sungho;Lee, Seunghyun;Lee, Seungjae;Lee, Kangbae
    • Journal of the Korea Convergence Society
    • /
    • v.13 no.1
    • /
    • pp.23-30
    • /
    • 2022
  • The failure of the reefer container causes a great loss of cost, but the current reefer container alarm system is inefficient. Existing studies using simulation data of refrigeration systems exist, but studies using actual operation data of refrigeration containers are lacking. Therefore, this study classified the causes of failure using actual refrigerated container operation data. Data imbalance occurred in the actual data, and the data imbalance problem was solved by comparing the logistic regression analysis with ENN-SMOTE and class weight with the 2-stage algorithm developed in this study. The 2-stage algorithm uses XGboost, LGBoost, and DNN to classify faults and normalities in the first step, and to classify the causes of faults in the second step. The model using LGBoost in the 2-stage algorithm was the best with 99.16% accuracy. This study proposes a final model using a two-stage algorithm to solve data imbalance, which is thought to be applicable to other industries.

A Study on National R&D Report Reference Technological Improvement (국가R&D보고서 참고문헌 기술항목 구축 개선방안 연구)

  • Lee, Kangsandajeong;Lee, Hyejin;Hyun, Mihwan
    • Journal of the Korea Convergence Society
    • /
    • v.13 no.1
    • /
    • pp.31-42
    • /
    • 2022
  • The purpose of this study is to analyze the reference data of the National R&D Report database built by the Korea Institute of Science and Technology Information(KISTI) to derive problems and to find complementary points for the construction technical items in the system. To this end, we investigated the descriptions of references in the National R&D Report constructed in the KISTI DB and analyzed the construction process, guidelines, types, items, etc. Based on this, the appropriateness and accuracy of the classification results of the reference types constructed with the DB were checked. As a result, the problems of the items built into the system and corrections to the construction guidelines were derived. Through this, database construction can proceed efficiently and data quality is expected to improve. In addition, a follow-up study was proposed on the convergence service plan that can be provided in the future through standardization of technical regulations.

A study on the Improvement of the Food Waste Discharge System through the Classification on Foreign Substances (이물질 구별을 통한 음식물쓰레기 배출시스템 개선에 관한 연구)

  • Kim, Yongil;Kim, Seungcheon
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.22 no.6
    • /
    • pp.51-56
    • /
    • 2022
  • With the development of industrialization, the amount of food and waste is rapidly increasing. Accordingly, the government is aware of the seriousness and is making efforts in various ways to reduce it. As a part of that, the volume-based food system was introduced, and although there were several trials and errors at the beginning of the introduction, it shows a reduction effect of 20 to 30%. These results suggest that the volume-based food system is being established. However, the waste is caused by foreign substances in the process of recycling resources by collecting them from the 1st collection to the 2nd collection process. Therefore, in this study, to solve these problems fundamentally, artificial intelligence is applied to classify foreign substances and improve them. Due to the nature of food waste, there is a limit to obtaining many images, so we compare several models based on CNNs and classify them as abnormal data, that is, CNN-based models are trained on various types of foreign substances, and then models with high accuracy are selected. We intend to prepare improvement measures for maintenance, such as manpower input to protect equipment and classify foreign substances by applying it.

Stock Market Prediction Using Sentiment on YouTube Channels (유튜브 주식채널의 감성을 활용한 코스피 수익률 등락 예측)

  • Su-Ji, Cho;Cheol-Won Yang;Ki-Kwang Lee
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.46 no.2
    • /
    • pp.102-108
    • /
    • 2023
  • Recently in Korea, YouTube stock channels increased rapidly due to the high social interest in the stock market during the COVID-19 period. Accordingly, the role of new media channels such as YouTube is attracting attention in the process of generating and disseminating market information. Nevertheless, prior studies on the market forecasting power of YouTube stock channels remain insignificant. In this study, the market forecasting power of the information from the YouTube stock channel was examined and compared with traditional news media. To measure information from each YouTube stock channel and news media, positive and negative opinions were extracted. As a result of the analysis, opinion in channels operated by media outlets were found to be leading indicators of KOSPI market returns among YouTube stock channels. The prediction accuracy by using logistic regression model show 74%. On the other hand, Sampro TV, a popular YouTube stock channel, and the traditional news media simply reported the market situation of the day or instead showed a tendency to lag behind the market. This study is differentiated from previous studies in that it verified the market predictive power of the information provided by the YouTube stock channel, which has recently shown a growing trend in Korea. In the future, the results of advanced analysis can be confirmed by expanding the research results for individual stocks.

Development and its APPLIcation of Computer Program for Slope Hazards Prediction using Decision Tree Model (의사결정나무모형을 이용한 급경사지재해 예측프로그램 개발 및 적용)

  • Song, Young-Suk;Cho, Yong-Chan;Seo, Yong-Seok;Ahn, Sang-Ro
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.29 no.2C
    • /
    • pp.59-69
    • /
    • 2009
  • Based on the data obtained from field investigation and soil testing to slope hazards occurrence section and non-occurrence section in crystalline rocks like gneiss, granite, and so on, a prediction model was developed by the use of a decision tree model. The classification standard of the selected prediction model is composed of the slope angle, the coefficient of permeability and the void ratio in the order. The computer program, SHAPP ver. 1.0 for prediction of slope hazards around an important national facilities using GIS technique and the developed model. To prove the developed prediction model and the computer program, the field data surveyed from Jumunjin, Gangneung city were compared with the prediction result in the same site. As the result of comparison, the real occurrence location of slope hazards was similar to the predicted section. Through the continuous study, the accuracy about prediction result of slope hazards will be upgraded and the computer program will be commonly used in practical.

A Taxonomy of Geriatric Hospitals Using National Health Insurance Claim Data (건강보험청구자료로 본 요양병원의 기능 유형)

  • Min Kyoung Lim;Sun-Jea Kim;Jeong-Yeon Seon
    • Korea Journal of Hospital Management
    • /
    • v.28 no.2
    • /
    • pp.9-20
    • /
    • 2023
  • Purpose: This study classified the actual functions of geriatric hospitals and examined the differences in their characteristics, in order to provide a basis for discussions on defining the functions of geriatric hospitals and how to pay for care. Methodology: This study used various administrative data such as health insurance data and long-term care insurance data. Cluster analysis was used to categorize geriatric hospitals. To examine the validity of the cluster analysis results, we conducted a discriminant analysis to calculate the accuracy of the classification. To examine cluster characteristics, we examined structure, process, and outcome indicators for each cluster. Findings: The cluster analysis identified five clusters. They were geriatric hospitals with relatively short stays for cancer patients(cluster 1; cancer patient-centered), geriatric hospitals with relatively large numbers of patients using rehabilitation services(cluster 2; rehabilitation patient-centered), geriatric hospitals with a high proportion of relatively severe elderly patients(cluster 3; severe elderly patient-centered), geriatric hospitals with a high proportion of mildly ill elderly patients with various conditions(cluster 4; mildly ill elderly patient-centered), and geriatric hospitals with a significantly higher proportion of dementia patients(cluster 5; dementia patient-centered). The largest number of geriatric hospitals were categorized in clusters 4 and 5, and the structure and process indicators for these clusters were generally lower than for the other clusters. Practical Implications: We have confirmed the existence of geriatric hospitals where the medical function, which is the original purpose of a geriatric hospital, has been weakened. It has been observed that the quality level of these geriatric hospitals is likely to be lower compared to hospitals that prioritize enhanced medical functions. Therefore, it is suggested to consider the conversion of these geriatric hospitals into long-term care facilities, and careful consideration should be given to the review of care-giver payment coverage.

  • PDF

A Comparative Study of Predictive Factors for Hypertension using Logistic Regression Analysis and Decision Tree Analysis

  • SoHyun Kim;SungHyoun Cho
    • Physical Therapy Rehabilitation Science
    • /
    • v.12 no.2
    • /
    • pp.80-91
    • /
    • 2023
  • Objective: The purpose of this study is to identify factors that affect the incidence of hypertension using logistic regression and decision tree analysis, and to build and compare predictive models. Design: Secondary data analysis study Methods: We analyzed 9,859 subjects from the Korean health panel annual 2019 data provided by the Korea Institute for Health and Social Affairs and National Health Insurance Service. Frequency analysis, chi-square test, binary logistic regression, and decision tree analysis were performed on the data. Results: In logistic regression analysis, those who were 60 years of age or older (Odds ratio, OR=68.801, p<0.001), those who were divorced/widowhood/separated (OR=1.377, p<0.001), those who graduated from middle school or younger (OR=1, reference), those who did not walk at all (OR=1, reference), those who were obese (OR=5.109, p<0.001), and those who had poor subjective health status (OR=2.163, p<0.001) were more likely to develop hypertension. In the decision tree, those over 60 years of age, overweight or obese, and those who graduated from middle school or younger had the highest probability of developing hypertension at 83.3%. Logistic regression analysis showed a specificity of 85.3% and sensitivity of 47.9%; while decision tree analysis showed a specificity of 81.9% and sensitivity of 52.9%. In classification accuracy, logistic regression and decision tree analysis showed 73.6% and 72.6% prediction, respectively. Conclusions: Both logistic regression and decision tree analysis were adequate to explain the predictive model. It is thought that both analysis methods can be used as useful data for constructing a predictive model for hypertension.

Development of Methodology for Measuring Water Level in Agricultural Water Reservoir through Deep Learning anlaysis of CCTV Images (딥러닝 기법을 이용한 농업용저수지 CCTV 영상 기반의 수위계측 방법 개발)

  • Joo, Donghyuk;Lee, Sang-Hyun;Choi, Gyu-Hoon;Yoo, Seung-Hwan;Na, Ra;Kim, Hayoung;Oh, Chang-Jo;Yoon, Kwang-Sik
    • Journal of The Korean Society of Agricultural Engineers
    • /
    • v.65 no.1
    • /
    • pp.15-26
    • /
    • 2023
  • This study aimed to evaluate the performance of water level classification from CCTV images in agricultural facilities such as reservoirs. Recently, the CCTV system, widely used for facility monitor or disaster detection, can automatically detect and identify people and objects from the images by developing new technologies such as a deep learning system. Accordingly, we applied the ResNet-50 deep learning system based on Convolutional Neural Network and analyzed the water level of the agricultural reservoir from CCTV images obtained from TOMS (Total Operation Management System) of the Korea Rural Community Corporation. As a result, the accuracy of water level detection was improved by excluding night and rainfall CCTV images and applying measures. For example, the error rate significantly decreased from 24.39 % to 1.43 % in the Bakseok reservoir. We believe that the utilization of CCTVs should be further improved when calculating the amount of water supply and establishing a supply plan according to the integrated water management policy.