• Title/Summary/Keyword: cross-subject cross-validation


Machine Learning Approach to Blood Stasis Pattern Identification Based on Self-reported Symptoms (기계학습을 적용한 자기보고 증상 기반의 어혈 변증 모델 구축)

  • Kim, Hyunho;Yang, Seung-Bum;Kang, Yeonseok;Park, Young-Bae;Kim, Jae-Hyo
    • Korean Journal of Acupuncture / v.33 no.3 / pp.102-113 / 2016
  • Objectives: This study aims to develop and discuss a prediction model for the blood stasis pattern of traditional Korean medicine (TKM) using two machine learning algorithms: multiple logistic regression and decision trees. Methods: First, we reviewed the Korean, Chinese, and Japanese versions of the blood stasis (BS) questionnaire to build an integrated BS questionnaire of patient-reported outcomes. Patient-reported BS symptom data were then acquired through a human subject study. Next, the expert decisions of 5 Korean medicine doctors were collected, and supervised learning models were developed using multiple logistic regression and decision trees. Results: An integrated BS questionnaire with 24 items was developed. Multiple logistic regression models with accuracies of 0.92 (male) and 0.95 (female), validated by 10-fold cross-validation, were constructed. Decision tree modeling produced a male model with 8 decision nodes and a female model with 6 decision nodes. In both models, the symptoms 'recent physical trauma', 'chest pain', 'numbness', and 'menstrual disorder' (female only) were identified as important factors. Conclusions: Because machine learning, especially supervised learning, can reveal and suggest the important or essential factors among the many symptoms that make up a pattern identification, it can be a very useful tool for research on TKM diagnostics. With proper patient-reported outcomes or a well-structured database, it can also be applied to pre-screening solutions for the healthcare system at the Mibyoung stage.
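
The 10-fold cross-validation mentioned in this abstract can be illustrated with a minimal sketch. It is not the authors' code: the function names are hypothetical, and a majority-class rule stands in for the logistic regression model so that the example needs only the Python standard library. It assumes at least as many samples as folds.

```python
import random


def k_fold_indices(n_samples, k=10, seed=0):
    """Shuffle indices 0..n_samples-1 and split them into k near-equal folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    # Fold i takes every k-th shuffled index starting at position i.
    return [indices[i::k] for i in range(k)]


def cross_val_accuracy(fit, predict, X, y, k=10, seed=0):
    """Mean held-out accuracy over k folds (assumes len(X) >= k).

    fit(X_train, y_train) returns a model; predict(model, x) returns a label.
    """
    scores = []
    for test_idx in k_fold_indices(len(X), k, seed):
        held_out = set(test_idx)
        train_idx = [i for i in range(len(X)) if i not in held_out]
        model = fit([X[i] for i in train_idx], [y[i] for i in train_idx])
        correct = sum(predict(model, X[i]) == y[i] for i in test_idx)
        scores.append(correct / len(test_idx))
    return sum(scores) / len(scores)


# Hypothetical stand-in for a real classifier: always predict the majority class.
def fit_majority(X_train, y_train):
    return max(set(y_train), key=y_train.count)


def predict_majority(model, x):
    return model
```

Any classifier can be plugged in through the `fit` and `predict` callables; each sample is used for testing exactly once, and the model never sees its own test fold during fitting.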

Hierarchical Smoothing Technique by Empirical Mode Decomposition (경험적 모드분해법에 기초한 계층적 평활방법)

  • Kim Dong-Hoh;Oh Hee-Seok
    • The Korean Journal of Applied Statistics / v.19 no.2 / pp.319-330 / 2006
  • A real-world signal is usually composed of multiple signals with different frequency scales. For example, sunspot data fluctuate over 11-year and 85-year periods. Economic data are assumed to be composed of a seasonal component, a cyclic component, and a long-term trend. Decomposing such signals is one of the main topics in time series analysis. However, when the signal is nonstationary, traditional time series methods such as spectral analysis are not suitable. Huang et al. (1998) proposed a data-adaptive method called empirical mode decomposition (EMD). Owing to its robustness to nonstationarity, EMD has been applied in various fields. Huang et al., however, did not consider denoising when the data are contaminated by error. In this paper, we propose an efficient denoising method that utilizes cross-validation.

Feasibility of Using Similar Electrocardiography Measured around the Ears to Develop a Personal Authentication System (귀 주변에서 측정한 유사 심전도 기반 개인 인증 시스템 개발 가능성)

  • Choi, Ga-Young;Park, Jong-Yoon;Kim, Da-Yeong;Kim, Yeonu;Lim, Ji-Heon;Hwang, Han-Jeong
    • Journal of Biomedical Engineering Research / v.41 no.1 / pp.42-47 / 2020
  • Personal authentication systems based on biosignals have received increasing attention because of their relatively high security compared with traditional authentication systems based on keys and passwords. Electrocardiography (ECG) measured from the chest or wrist is one of the most widely used biosignals for developing such systems. In this study, we investigated the feasibility of using similar ECG measured behind the ears to develop a personal authentication system. To this end, similar ECGs were measured from thirty subjects in the resting state using a pair of three electrodes attached behind each ear, while the standard Lead-I ECG was simultaneously measured from both wrists as the baseline ECG. Three ECG components, Q, R, and S, were extracted for each subject as classification features, and authentication accuracy was estimated using a support vector machine (SVM) with 5×5-fold cross-validation. The mean authentication accuracies of Lead-I ECG and similar ECG were 90.41 ± 8.26% and 81.15 ± 7.54%, respectively. Considering the chance level of 3.33% (= 1/30), the mean authentication performance of similar ECG demonstrates the feasibility of using similar ECG measured behind the ears to develop a personal authentication system.
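
The evaluation protocol in this abstract can be sketched as follows. The sketch assumes that "5×5-fold" means five independent repetitions of 5-fold cross-validation, which the abstract does not spell out; all function names are hypothetical, and the SVM itself is omitted.

```python
import random
import statistics


def repeated_k_fold(n_samples, k=5, repeats=5, seed=0):
    """Yield (train_idx, test_idx) pairs for `repeats` reshuffled runs of k-fold CV."""
    rng = random.Random(seed)
    for _ in range(repeats):
        indices = list(range(n_samples))
        rng.shuffle(indices)
        for fold in range(k):
            test_idx = indices[fold::k]
            held_out = set(test_idx)
            train_idx = [i for i in indices if i not in held_out]
            yield train_idx, test_idx


def chance_level(n_classes):
    """Accuracy of uniform random guessing among equally likely classes."""
    return 1.0 / n_classes


def summarize(accuracies):
    """Mean and sample standard deviation of a list of accuracies."""
    return statistics.mean(accuracies), statistics.stdev(accuracies)
```

With 30 subjects as the classes, `chance_level(30)` reproduces the 1/30 baseline quoted in the abstract, and `summarize` gives the "mean ± standard deviation" form in which the accuracies are reported.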

Soil Depth Information DB Construction Methods for Liquefaction Assessment (액상화 평가를 위한 지층심도DB 구축 방안)

  • Gang, ByeongJu;Hwang, Bumsik;Kim, Hansam;Cho, Wanjei
    • Journal of the Korean GEO-environmental Society / v.20 no.3 / pp.39-46 / 2019
  • Liquefaction is a phenomenon in which the effective stress drops to zero because of rapidly accumulated excess pore water pressure when a strong load, such as an earthquake or pile driving, acts on the ground for a short period of time, resulting in loss of the ground's shear strength. Since the Gyeongju and Pohang earthquakes, liquefaction has drawn increasing domestic attention. Liquefaction can be assessed mainly through the semi-empirical procedures proposed by Seed and Idriss (1982) and through the liquefaction risk based on the penetration resistance obtained from the borehole DB and SPT. However, geotechnical data obtained from in-situ tests or boring information fundamentally raise the question of how well they represent the target area. Therefore, this study constructed a ground information database by classifying and reviewing the ground information required for liquefaction assessment, and sought to address the representativeness problem of the soil layers subject to liquefaction evaluation by performing spatial interpolation using GIS.
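
The abstract does not say which spatial interpolation method was used. As an illustration only, the sketch below uses inverse distance weighting (IDW), a common option in GIS software, to estimate a value such as layer depth between boreholes. The function name, the tuple layout, and the default `power` are assumptions made for this example.

```python
import math


def idw_interpolate(known_points, query, power=2.0):
    """Inverse distance weighting estimate at query = (x, y).

    known_points is a list of (x, y, value) tuples, for example borehole
    coordinates with a measured layer depth at each location.
    """
    qx, qy = query
    weighted_sum = 0.0
    weight_total = 0.0
    for x, y, value in known_points:
        distance = math.hypot(x - qx, y - qy)
        if distance == 0.0:
            # The query coincides with a sample point: return its value directly.
            return value
        weight = 1.0 / distance ** power
        weighted_sum += weight * value
        weight_total += weight
    return weighted_sum / weight_total
```

Nearby boreholes dominate the estimate, and a larger `power` makes the influence of distant boreholes fall off faster. The estimate always lies between the smallest and largest sampled values.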

Convolutional Autoencoder based Stress Detection using Soft Voting (소프트 보팅을 이용한 합성곱 오토인코더 기반 스트레스 탐지)

  • Eun Bin Choi;Soo Hyung Kim
    • Smart Media Journal / v.12 no.11 / pp.1-9 / 2023
  • Stress is a significant issue in modern society, often triggered by external or internal factors that are difficult to manage. When high stress persists over the long term, it can develop into a chronic condition that harms health and overall well-being. However, individuals experiencing chronic stress find it difficult to recognize their condition, which makes early detection and management crucial. Using biosignals measured by wearable devices to detect stress could lead to more effective management. There are, however, two main problems with using biosignals: first, manually extracting features from these signals can introduce bias, and second, the performance of classification models can vary greatly depending on the experimental subject. This paper proposes a model that reduces bias by using convolutional autoencoders, which can represent the key features of the data, and that improves generalizability by employing soft voting, an ensemble learning method, to minimize performance variability. To verify the generalization performance of the model, we evaluate it using leave-one-subject-out (LOSO) cross-validation. The proposed model achieved higher accuracy on the WESAD dataset than previous studies.
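
Two ideas from this abstract, LOSO cross-validation and soft voting, can be sketched in a few lines. This is a minimal illustration with hypothetical function names and made-up inputs, not the authors' implementation; the autoencoder and classifiers are omitted.

```python
def loso_splits(subject_ids):
    """Yield (held_out_subject, train_idx, test_idx), one split per unique subject."""
    for subject in sorted(set(subject_ids)):
        test_idx = [i for i, s in enumerate(subject_ids) if s == subject]
        train_idx = [i for i, s in enumerate(subject_ids) if s != subject]
        yield subject, train_idx, test_idx


def soft_vote(probabilities):
    """Average per-class probabilities from several models and pick the argmax.

    probabilities holds one list of class probabilities per model.
    Returns (predicted_class_index, averaged_probabilities).
    """
    n_models = len(probabilities)
    n_classes = len(probabilities[0])
    averaged = [sum(p[c] for p in probabilities) / n_models for c in range(n_classes)]
    predicted = max(range(n_classes), key=lambda c: averaged[c])
    return predicted, averaged
```

Because every sample from the held-out subject is excluded from training, LOSO measures how well a model transfers to an unseen person. Soft voting uses the confidence of each model rather than only its top label: for the inputs `[[0.9, 0.1], [0.4, 0.6], [0.4, 0.6]]` the averaged probabilities favor class 0 even though two of the three models individually prefer class 1.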


Preliminary Inspection Prediction Model to select the on-Site Inspected Foreign Food Facility using Multiple Correspondence Analysis (차원축소를 활용한 해외제조업체 대상 사전점검 예측 모형에 관한 연구)

  • Hae Jin Park;Jae Suk Choi;Sang Goo Cho
    • Journal of Intelligence and Information Systems / v.29 no.1 / pp.121-142 / 2023
  • As the number and weight of food imports steadily increase, safety management of imported food to prevent food safety accidents is becoming more important. The Ministry of Food and Drug Safety conducts on-site inspections of foreign food facilities before customs clearance, as well as import inspections at the customs clearance stage. However, given constraints on time, cost, and resources, a data-based safety management plan for imported food is needed. In this study, we sought to increase the efficiency of on-site inspection by building a machine learning prediction model that pre-selects the companies expected to fail before the on-site inspection. We collected basic information on 303,272 foreign food facilities and processing businesses from the Integrated Food Safety Information Network, along with 1,689 cases of on-site inspection data gathered from 2019 to April 2022. After preprocessing the foreign food facility data, only the records subject to on-site inspection were extracted using the foreign food facility_code. The resulting dataset consisted of 1,689 records and 103 variables. Of the 103 variables, those scoring '0' on the Theil-U index were removed, and after dimensionality reduction with Multiple Correspondence Analysis, 49 characteristic variables were finally derived. We built eight different models, tuned their hyperparameters through 5-fold cross-validation, and then evaluated the performance of the resulting models. The research objective in selecting companies for on-site inspection is to maximize recall, the probability of judging nonconforming companies as nonconforming. Among the machine learning algorithms applied, the Random Forest model, which had the highest Recall_macro, AUROC, Average PR, F1-score, and Balanced Accuracy, was evaluated as the best model. Finally, we applied Kernel SHAP (SHapley Additive exPlanations) to present the reasons individual facilities were selected as nonconforming, and discuss applicability to the on-site inspection facility selection system. The results of this study are expected to contribute to the efficient use of limited resources such as manpower and budget by establishing an imported food management system built on a data-based scientific risk management model.
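
Recall, the metric this abstract prioritizes, can be computed per class and then macro-averaged. The sketch below is a plain-Python illustration with hypothetical function names and toy labels (1 = nonconforming, 0 = conforming); it is not taken from the paper.

```python
def per_class_recall(y_true, y_pred):
    """Recall per class: correctly predicted members divided by all true members."""
    recalls = {}
    for label in sorted(set(y_true)):
        true_positions = [i for i, t in enumerate(y_true) if t == label]
        hits = sum(1 for i in true_positions if y_pred[i] == label)
        recalls[label] = hits / len(true_positions)
    return recalls


def macro_recall(y_true, y_pred):
    """Unweighted mean of the per-class recalls."""
    recalls = per_class_recall(y_true, y_pred)
    return sum(recalls.values()) / len(recalls)


# Toy labels for illustration: 1 = nonconforming, 0 = conforming.
y_true = [1, 1, 0, 0, 0, 0]
y_pred = [1, 0, 0, 0, 0, 1]
print(per_class_recall(y_true, y_pred))
print(macro_recall(y_true, y_pred))
```

Macro averaging weights each class equally regardless of its size, which matters when nonconforming facilities are the minority: a model that labels everything as conforming scores well on plain accuracy but only 0.5 on macro recall in a two-class problem.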

Development and Validation of the 'Food Safety and Health' Workbook for High School (고등학교 「식품안전과 건강」 워크북 개발 및 타당도 검증)

  • Park, Mi Jeong;Jung, Lan-Hee;Yu, Nan Sook;Choi, Seong-Youn
    • Journal of Korean Home Economics Education Association / v.34 no.1 / pp.59-80 / 2022
  • The purpose of this study was to develop a workbook that can support teaching and evaluation in the subject 「Food safety and health」 and to verify its validity. The development direction of the workbook was set by analyzing the 「Food safety and health」 curriculum, dietary education materials, and previous studies related to workbooks, and the overall structure was designed by deriving activity ideas for each area. A draft was developed on this basis, and it went through several rounds of cross-review by the authors and examination and revision by the Ministry of Food and Drug Safety before the final edited version was produced. The workbook was finalized with corrections and enhancements based on the advice of 9 experts and 44 home economics teachers. The workbook consists of 4 areas: the 'food selection' area, with 10 learning topics and 36 lessons; the 'food poisoning and food management' area, with 10 learning topics and 36 lessons; the 'cooking' area, with 11 learning topics and 43 lessons; and the 'healthy eating' area, with 11 learning topics and 55 lessons, for a total of 42 learning topics and 170 lessons. The workbook was designed to cultivate practical problem-solving competency, self-reliance capacity, creative thinking capacity, and community capacity evenly. The content supports in-depth inquiry learning, and it is structured so that students can self-diagnose through evaluation. In the validity test, the workbook was evaluated as very appropriate for encouraging student-participatory classes and evaluations, and for creating a class atmosphere that promotes inquiry by strengthening experiments and practice. In the current situation, where the high school credit system is implemented and individual students' learning options are emphasized, the results of this study are expected to help expand the scope of home economics-based elective courses and contribute to realizing student-led classrooms with a focus on inquiry.