• Title/Summary/Keyword: leave-one-out

Search Result 120, Processing Time 0.023 seconds

Pridict of Liver cirrhosis susceptibility using Decision tree with SNP (Decision Tree와 SNP정보를 이용한 간경화 환자의 감수성 예측)

  • Kim, Dong-Hoi;Uhmn, Saang-Yong;Cho, Sung-Won;Ham, Ki-Baek;Kim, Jin
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2006.10a
    • /
    • pp.63-66
    • /
    • 2006
  • 본 논문에서는 SNP데이터를 이용하여 간경화에 대한 감수성을 예측하기 위해 의사결정 트리를 이용하였다. 데이터는 간경화 환자와 정상환자 총 116명의 데이터를 사용하였으며, Feature 값으로는 간질환과 밀접한 연관성을 갖는 28개의 SNP데이터를 사용하였다. 실험방법은 각각의 SNP에 대하여 의사결정트리로 분류율을 측정한 후 가장 높은 분류율을 가지는 SNP부터 조합해 나가는 방식으로 C4.5 의사결정트리를 이용 leave-one-out cross validation으로 간경화와 정상을 구분하는 정확도를 측정하였다. 실험결과 간 질환 관련 SNP중 IL1RN-S130S, IRNGR2-Q64R, IL-10(-592), IL1B_S35S 4개의 SNP조합에서 65.52%의 정확도를 얻을 수 있었다.

  • PDF

Multiple Optimal Classifiers based on Speciated Evolution for Classifying DNA Microarray Data (DNA 마이크로어레이 데이터의 분류를 위한 종분화 진화 기반의 최적 다중 분류기)

  • 박찬호;조성배
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2004.04b
    • /
    • pp.724-726
    • /
    • 2004
  • DNA 마이크로어레이 기술의 발전은 암의 조기 발견 및 예후 예측을 가능하게 해주었으며, 이와 관련된 많은 연구가 진행 중이다. 마이크로어레이 데이터의 분류에서 관련 유전자들의 선택은 필수적이며, 유전자 선택방법은 분류기와 짝을 이루어 특징-분류기를 형성한다. 이제까지 여러 가지 특징-분류기를 사용하여 마이크로어레이 데이터를 분류해 왔지만, 알고리즘의 한계와 데이터의 결함 등으로 인하여 최적의 특징-분류기를 찾기 어려웠다. 따라서 앙상블 분류기를 이용하여 높은 분류성능을 얻는 방법이 시도되어왔으며. 최적의 것을 찾기 위하여 유전자 알고리즘이 사용되기도 했다. 본 논문에서는 이를 발전시켜 다양한 최적의 앙상블을 생성하기 위해 종분화 방법을 사용한다. 림프종 암 데이터에 대하여 leave-one-out cross-validation을 적용한 결과, 제안한 방법으로 다양한 최적해를 탐색하는 것을 확인할 수 있었다.

  • PDF

Automatic Classification of Music Moods Based on the AV Model (AV 모델을 이용한 음악 분위기 자동 분류1))

  • Moon, Chang-Bae;Kim, Hyun-Soo;Song, Min-Kyun;Kim, Byeong-Man
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2011.04a
    • /
    • pp.365-368
    • /
    • 2011
  • 본 논문에서 구조 분석 기법을 이용하여 음악을 구간들로 나누고, 그 구간 중 대표 구간들을 자동으로 설정 후 다수의 사용자에게 그 구간들에 대한 분위기 값을 입력받은 후 이 값들을 바탕으로 구간의 대표 분위기를 결정하는 방법을 제안하였다. 또한, 이렇게 결정된 대표 분위기와 그 구간의 음악적 특징들을 이용하여 음악 분위기 판별 실험을 하였다. 이를 위해 음원의 분위기를 211명에게 수집하였고, 음원에서 특징들을 결정계수를 이용하여 특징의 수를 줄인 후 신경망을 이용하여 학습 및 판별을 하였다. 그리고 Leave-one-out 교차 검증을 통하여 성능 분석을 하였다. 실험결과, 3,000번 학습 시 은닉층 17개를 이용하였을 때 66%의 판별율을 보였다.

Post-Examination Analysis on the Student Dropout Prediction Index (학생 중도탈락 예측지수에 관한 사후검증 연구)

  • Lee, Ji-Eun
    • The Journal of Bigdata
    • /
    • v.4 no.2
    • /
    • pp.175-183
    • /
    • 2019
  • Drop-out issue is one of the challenges of cyber university. There are about 130,000 students enrolled in cyber universities, but the dropout rate is also very high. To lower the dropout rate, cyber universities invest heavily in learning analytics. Some cyber universities analyze the possibility of dropout and actively support students who are more likely to drop out. The purpose of this paper is to identify the learning data affecting the dropout prediction index. As a result of the analysis, it is confirmed that number of lessons(progress), credits, achievement and leave of absence have a significant effect on dropout rate. It is necessary to increase the accuracy of the prediction model through post-test on the student dropout prediction index.

  • PDF

Micro-Expression Recognition Base on Optical Flow Features and Improved MobileNetV2

  • Xu, Wei;Zheng, Hao;Yang, Zhongxue;Yang, Yingjie
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.15 no.6
    • /
    • pp.1981-1995
    • /
    • 2021
  • When a person tries to conceal emotions, real emotions will manifest themselves in the form of micro-expressions. Research on facial micro-expression recognition is still extremely challenging in the field of pattern recognition. This is because it is difficult to implement the best feature extraction method to cope with micro-expressions with small changes and short duration. Most methods are based on hand-crafted features to extract subtle facial movements. In this study, we introduce a method that incorporates optical flow and deep learning. First, we take out the onset frame and the apex frame from each video sequence. Then, the motion features between these two frames are extracted using the optical flow method. Finally, the features are inputted into an improved MobileNetV2 model, where SVM is applied to classify expressions. In order to evaluate the effectiveness of the method, we conduct experiments on the public spontaneous micro-expression database CASME II. Under the condition of applying the leave-one-subject-out cross-validation method, the recognition accuracy rate reaches 53.01%, and the F-score reaches 0.5231. The results show that the proposed method can significantly improve the micro-expression recognition performance.

The impact of functional brain change by transcranial direct current stimulation effects concerning circadian rhythm and chronotype (일주기 리듬과 일주기 유형이 경두개 직류전기자극에 의한 뇌기능 변화에 미치는 영향 탐색)

  • Jung, Dawoon;Yoo, Soomin;Lee, Hyunsoo;Han, Sanghoon
    • Korean Journal of Cognitive Science
    • /
    • v.33 no.1
    • /
    • pp.51-75
    • /
    • 2022
  • Transcranial direct current stimulation (tDCS) is a non-invasive brain stimulation that is able to alter neuronal activity in particular brain regions. Many studies have researched how tDCS modulates neuronal activity and reorganizes neural networks. However it is difficult to conclude the effect of brain stimulation because the studies are heterogeneous with respect to the stimulation parameter as well as individual difference. It is not fully in agreement with the effects of brain stimulation. In particular few studies have researched the reason of variability of brain stimulation in response to time so far. The study investigated individual variability of brain stimulation based on circadian rhythm and chronotype. Participants were divided into two groups which are morning type and evening type. The experiment was conducted by Zoom meeting which is video meeting programs. Participants were sent experiment tool which are Muse(EEG device), tdcs device, cell phone and cell phone holder after manuals for experimental equipment were explained. Participants were required to make a phone in frount of a camera so that experimenter can monitor online EEG data. Two participants who was difficult to use experimental devices experimented in a laboratory setting where experimenter set up devices. For all participants the accuracy of 98% was achieved by SVM using leave one out cross validation in classification in the the effects of morning stimulation and the evening stimulation. For morning type, the accuracy of 92% and 96% was achieved in classification in the morning stimulation and the evening stimulation. For evening type, it was 94% accuracy in classification for the effect of brain stimulation in the morning and the evening. Feature importance was different both in classification in the morning stimulation and the evening stimulation for morning type and evening type. Results indicated that the effect of brain stimulation can be explained with brain state and trait. Our study results noted that the tDCS protocol for target state is manipulated by individual differences as well as target state.

A comparative study of conceptual model and machine learning model for rainfall-runoff simulation (강우-유출 모의를 위한 개념적 모형과 기계학습 모형의 성능 비교)

  • Lee, Seung Cheol;Kim, Daeha
    • Journal of Korea Water Resources Association
    • /
    • v.56 no.9
    • /
    • pp.563-574
    • /
    • 2023
  • Recently, climate change has affected functional responses of river basins to meteorological variables, emphasizing the importance of rainfall-runoff simulation research. Simultaneously, the growing interest in machine learning has led to its increased application in hydrological studies. However, it is not yet clear whether machine learning models are more advantageous than the conventional conceptual models. In this study, we compared the performance of the conventional GR6J model with the machine learning-based Random Forest model across 38 basins in Korea using both gauged and ungauged basin prediction methods. For gauged basin predictions, each model was calibrated or trained using observed daily runoff data, and their performance was evaluted over a separate validation period. Subsequently, ungauged basin simulations were evaluated using proximity-based parameter regionalization with Leave-One-Out Cross-Validation (LOOCV). In gauged basins, the Random Forest consistently outperformed the GR6J, exhibiting superiority across basins regardless of whether they had strong or weak rainfall-runoff correlations. This suggest that the inherent data-driven training structures of machine learning models, in contrast to the conceptual models, offer distinct advantages in data-rich scenarios. However, the advantages of the machine-learning algorithm were not replicated in ungauged basin predictions, resulting in a lower performance than that of the GR6J. In conclusion, this study suggests that while the Random Forest model showed enhanced performance in trained locations, the existing GR6J model may be a better choice for prediction in ungagued basins.

Added Value of Contrast Leakage Information over the CBV Value of DSC Perfusion MRI to Differentiate between Pseudoprogression and True Progression after Concurrent Chemoradiotherapy in Glioblastoma Patients

  • Pak, Elena;Choi, Seung Hong;Park, Chul-Kee;Kim, Tae Min;Park, Sung-Hye;Won, Jae-Kyung;Lee, Joo Ho;Lee, Soon-Tae;Hwang, Inpyeong;Yoo, Roh-Eul;Kang, Koung Mi;Yun, Tae Jin
    • Investigative Magnetic Resonance Imaging
    • /
    • v.26 no.1
    • /
    • pp.10-19
    • /
    • 2022
  • Purpose: To evaluate whether the added value of contrast leakage information from dynamic susceptibility contrast magnetic resonance imaging (DSC MRI) is a better prognostic imaging biomarker than the cerebral blood volume (CBV) value in distinguishing true progression from pseudoprogression in glioblastoma patients. Materials and Methods: Forty-nine glioblastoma patients who had undergone MRI after concurrent chemoradiotherapy with temozolomide were enrolled in this retrospective study. Twenty features were extracted from the normalized relative CBV (nCBV) and extraction fraction (EF) map of the contrast-enhancing region in each patient. After univariable analysis, we used multivariable stepwise logistic regression analysis to identify significant predictors for differentiating between pseudoprogression and true progression. Receiver operating characteristic (ROC) analysis was employed to determine the best cutoff values for the nCBV and EF features. Finally, leave-one-out cross-validation was used to validate the best predictor in differentiating between true progression and pseudoprogression. Results: Multivariable stepwise logistic regression analysis showed that MGMT (O6-methylguanine-DNA methyltransferase) and EF max were independent differentiating variables (P = 0.004 and P = 0.02, respectively). ROC analysis yielded the best cutoff value of 95.75 for the EF max value for differentiating the two groups (sensitivity, 61%; specificity, 84.6%; AUC, 0.681 ± 0.08; 95% CI, 0.524-0.837; P = 0.03). In the leave-one-out cross-validation of the EF max value, the cross-validated values for predicting true progression and pseudoprogression accuracies were 69.4% and 71.4%, respectively. Conclusion: We demonstrated that contrast leakage information parameter from DSC MRI showed significance in differentiating true progression from pseudoprogression in glioblastoma patients.

Optimization of Soil Contamination Distribution Prediction Error using Geostatistical Technique and Interpretation of Contributory Factor Based on Machine Learning Algorithm (지구통계 기법을 이용한 토양오염 분포 예측 오차 최적화 및 머신러닝 알고리즘 기반의 영향인자 해석)

  • Hosang Han;Jangwon Suh;Yosoon Choi
    • Economic and Environmental Geology
    • /
    • v.56 no.3
    • /
    • pp.331-341
    • /
    • 2023
  • When creating a soil contamination map using geostatistical techniques, there are various sources that can affect prediction errors. In this study, a grid-based soil contamination map was created from the sampling data of heavy metal concentrations in soil in abandoned mine areas using Ordinary Kriging. Five factors that were judged to affect the prediction error of the soil contamination map were selected, and the variation of the root mean squared error (RMSE) between the predicted value and the actual value was analyzed based on the Leave-one-out technique. Then, using a machine learning algorithm, derived the top three factors affecting the RMSE. As a result, it was analyzed that Variogram Model, Minimum Neighbors, and Anisotropy factors have the largest impact on RMSE in the Standard interpolation. For the variogram models, the Spherical model showed the lowest RMSE, while the Minimum Neighbors had the lowest value at 3 and then increased as the value increased. In the case of Anisotropy, it was found to be more appropriate not to consider anisotropy. In this study, through the combined use of geostatistics and machine learning, it was possible to create a highly reliable soil contamination map at the local scale, and to identify which factors have a significant impact when interpolating a small amount of soil heavy metal data.

The Experience of Nursing Staff on the Dementia Patients′ Aggressive Behavior (간호제공자들의 치매노인 공격행동 경험에 대한 연구)

  • 오진주
    • Journal of Korean Academy of Nursing
    • /
    • v.30 no.2
    • /
    • pp.293-306
    • /
    • 2000
  • Providing care to the dementia elderly with behavioral problem is a major issue in nursing homes today. This study was aimed to explore the nursing staffs′ response to aggressive dementia patients, and the effect that the aggressive behavior had on Nsgstaff. The interviews used a semi-structured questionnaire are were carried out from May to July, 1999. The subjects were 23 nursing personnel working in the nursing homes for dementia elderly. The result are as follows; 1. The types of aggressive behavior cited by the subjects was "physical", "linguistic", and "sexual". 2. The factors that caused the behavior were "symptoms of disease", "under- conditioning", "context of nursing care", "unsatisfied need", "relationship to other patients", "change of outer environment", and "invasion of one′s own privacy". 3. The response of the subjects were "unhappiness", "stress", "anger", "exaggeration", "anxiety", and "fear". 4. The management strategies listed by nursing personnel used to alleviate aggressive behavior were "ignorance" "patience" "leaving the area" "soothing sounds" "verbal punishment" "restriction and isolation" and "various management skill appliance" "adaptation". 5. The effect that aggressive behavior and were "disturbance of relationship to elderly" "decline in the amount and quality of nursing care" "interruption of task performance" "job exhaustion" "desire to leave the job" "physical impact" "stress on the cognition of others" "anxiety about health and one own future" "interference to family life′. The findings of this study will be useful in understanding the difficulties of nursing personnel when confronting the aggressive behavior of dementia partients. It also is useful as basic data in preparing efficient intervention program for these difficulties.

  • PDF