• Title/Summary/Keyword: Statistical Index

Search Result 2,061, Processing Time 0.042 seconds

A Statistical Analysis of Phenotypic Diversity Based on Genetic Traits in Barley Germplasms (특성평가 정보를 활용한 보리 유전자원 형태적 형질 다양성의 통계적 분석)

  • Yu, Dong Su;Shin, Myoung-Jae;Park, Jin-Cheon;Kang, Manjung
    • Korean Journal of Plant Resources
    • /
    • v.35 no.5
    • /
    • pp.641-651
    • /
    • 2022
  • The biodiversity research of barley, a functional food, is proceeding to conserve germplasms and develop new cultivar of barley to improve its functional effects. In this study, with 25,104 barley germplasms in the National Agrobiodiversity Center, South Korea, the biodiversity index of species was much lower (1.17) than the origins (24.73) because of the presence of a biased species, Hordeum vulgare subsp. vulgare, but the species and origin of germplasms were significantly different with regard to genetic traits. In the clustering analysis based on genetic traits, we found that 97% barley germplasms could mostly be distributed between 1~7 clusters out of a total of 15 clusters; 'normal and uzu type', 'lodging', and 'loose smut' were commonly represented in the 1~7 clusters and some clusters showed specific differences in five genetic traits including 'growth habit'. In correlation of each genetic trait, the infection of 'barley yellow mosaic virus' was highly correlated to 'number of grains per spike'. '1000 grain weight' was weakly correlated with seven genetic traits including 'number of grains per spike'. Our analysis for barley's biodiversity can provide a useful guide to the species' phenotypes that need to be collected to conserve biodiversity and to breed new barley varieties.

Prediction of Maximal Oxygen Uptake Ages 18~34 Years (18~34 남성의 최대산소 섭취량 추정)

  • Jeon, Yoo-Joung;Im, Jae-Hyeng;Lee, Byung-Kun;Kim, Chang-Hwan;Kim, Byeong-Wan
    • 한국체육학회지인문사회과학편
    • /
    • v.51 no.3
    • /
    • pp.373-382
    • /
    • 2012
  • The purpose of this study is to predict VO2max with body index and submaximal metabolic responses. The subjects are consisted of 250 male aging from 18 to 34 and we separated them into two groups randomly; 179 for a sample, 71 for a cross-validation group. They went through maximal exercise testing with Bruce protocol, and we measured the metabolic responses in the end of the first(3 minute) and second stage(6 minute). To predict VO2max, we applied multiple regression analysis to the sample with stepwise method. Model 1's variables are weight, 6 minute HR and 6 minute VO2(R=0.64, SEE=4.74, CV=11.7%, p<.01), and the equation is VO2max(ml/kg/min)= 72.256-0.340(Weight)-0.220(6minHR)+0.013(6minVO2). Model 2's variables are weight, 6 minute HR, 6 minute VO2, and 6 minute VCO2(R=0.67, SEE=4.59, CV=11.3%, p<.01), and the equation is VO2max(ml/kg/min)= 68.699-0.277(Weight) -0.206(6minHR)+0.020(6minVO2)-0.009(6minVCO2). And the result did not show multicolinearity for both models. Model 2 demonstrated more correlation compared to Model 1. However, when we conducted cross-validation of those models with 71 men, measured VO2max and estimated VO2 Max had statistical significance with correlation (R=0.53, 0.56, P<.01). Although both models are functional with validity considering their simplicity and utility, Model 2 has more accuracy.

Effects of Impact of Climate Change on Livestock Productivity - For bullocks, dairy, pigs, laying hens, and broilers - (기후변화가 축산 생산성에 미치는 영향 -거세우, 낙농, 양돈, 산란계, 육계를 대상으로-)

  • Lee, H.K.;Park, H.M.;Shin, Y.K.
    • Journal of Practical Agriculture & Fisheries Research
    • /
    • v.20 no.1
    • /
    • pp.107-123
    • /
    • 2018
  • The global impact of climate change on agriculture is now increasing. The purpose of this study was to investigate the effect of climate change on livestock productivity. The variables that have the greatest influence on climate change factors were examined through previous studies and expert surveys. We also used the actual productivity data of livestock farmers to investigate the relationship with climate change. In order to evaluate the climate for changes in livestock productivity, national representative data (such as bullocks, dairy, pigs, laying hens, and broilers) were surveyed in Korea. Also, to select and classify evaluation indexes, we selected climate change factor variables as prior studies and studied the weighting factor of climate variable factors. In this study, the researchers of industry, academia, and farmers in the livestock sector conducted questionnaires on the indicators of vulnerability to climate change using experts, and then weighed the selected indicators using the hierarchical analysis process (AHP). In order to verify the validity of the evaluation index, was examined using domestic climate data (temperature, precipitation, humidity, etc.). Correlation and regression analysis were performed. The empirical relationship between climate change and livestock productivity was examined through this study. As a result, we used data with high reliability of statistical analysis and found that there are significant variables.

Development of Weight Estimation Equations and Weight Tables for Larix kaempferi and Pinus rigida Stand (일본잎갈나무와 리기다소나무의 중량추정식 및 중량표 개발)

  • Jintaek Kang;Chiung Ko;Jeongmuk Park;Jongsu Yim;Sun-Jeong Lee;Myoungsoo Won
    • Journal of Korean Society of Forest Science
    • /
    • v.112 no.4
    • /
    • pp.472-489
    • /
    • 2023
  • This study was conducted to derive the optimal estimation equations for deriving the green and dry weights of Larix kaempferi (Japanese larch) and Pinus rigida (Rigida pine), which are major coniferous tree species in South Korea. The equations were then used to develop weight tables. Table development began with the sampling of 150 L. kaempferi and 90 P. rigida trees distributed throughout the national scale, after which green weights were measured on-site. Samples from each stand were then collected, and their dry weights were measured in a laboratory. The equation used to calculate green and dry weights was divided into a one-variable formula that uses only the diameter at breast height (DBH) and a two-variable equation that employs DBH and height. The equations used to estimate the green and dry weights of logs were divided into one- and two-variable equations using DBH. Statistical data, such as the fitness index (FI), root mean square error, standard error of estimation, and residual diagram, were used to verify the suitability of the estimation equations. Applicability was examined by calculating weights using the derived optimal equations. The equation W = bD+cD2 was used in measurements involving only DBH, whereas the equation W = aDbHc was employed in cases involving both diameter and height at breast height. The FI of W = bD+cD2 was 0.91, while that of W = aDbHc was 0.95, both of which are high values. With these estimation formulas, weight tables for the green and dry weights of L. kaempferi and P. rigida were prepared and compared with weight tables created 20 years ago. The green and dry weight tables of both species were larger.

The detection of collapsible airways contributing to airflow limitation (기류 제한에 영향을 미치는 허탈성 기도의 분석)

  • Kim, Yun Seong;Park, Byung Gyu;Lee, Kyong In;Son, Seok Man;Lee, Hyo Jin;Lee, Min Ki;Son, Choon Hee;Park, Soon Kew
    • Tuberculosis and Respiratory Diseases
    • /
    • v.43 no.4
    • /
    • pp.558-570
    • /
    • 1996
  • Background : The detection of Collapsible airways has important therapeutic implications in chronic airway disease and bronchial asthma. The distinction of a purely collapsible airways disease from that of asthma is important because the treatment of the dormer may include the use of pursed lip breathing or nasal positive pressure ventilation whereas in the latter, pharmacologic approaches are used. One form of irreversible airflow limitation is collapsible airways, which has been shown to be a Component of asthma or to emphysema, it can be assessed by the volume difference between what exits the lung as determined by a spirometer and the volume compressed as measured by the plethysmography. Method : To investigate whether volume difference between slow and forced vital Capacity(SVC-FVC) by spirometry may be used as a surrogate index of airway collapse, we examined pulmonary function parameters before and after bronchodilator agent inhalation by spirometry and body plethysmography in 20 cases of patients with evidence of airflow limitation(chronic obstructive pulmonary disease 12 cases, stable bronchial asthma 7 cases, combined chronic obstructive pulmonary disease with asthma 1 case) and 20 cases of normal subjects without evidence of airflow limitation referred to the Pusan National University Hospital pulmonary function laboratory from January 1995 to July 1995 prospectively. Results : 1) Average and standard deviation of age, height, weight of patients with airflow limitation was $58.3{\pm}7.24$(yr), $166{\pm}8.0$(cm), $59.0{\pm}9.9$(kg) and those of normal subjects was $56.3{\pm}12.47$(yr), $165.9{\pm}6.9$(cm), $64.4{\pm}10.4$(kg), respectively. The differences of physical characteristics of both group were not significant statistically and male to female ratio was 14:6 in both groups. 2) The difference between slow vital capacity and forced vital capacity was $395{\pm}317ml$ in patients group and $154{\pm}176ml$ in normal group and there was statistically significance between two groups(p<0.05). Sensitivity and specificity were most higher when the cut-off value was 208ml. 3) After bronchodilator inhalation, reversible airway obstructions were shown in 16 cases of patients group, 7 cases of control group(p<0.05) by spirometry or body plethysmography d the differences of slow vital capacity and forced vital capacity in bronchodilator response group and nonresponse group were $300.4{\pm}306ml$, $144.7{\pm}180ml$ and this difference was statistically significant. 4) The difference between slow vital capacity and forced vital capacity before bronchodilator inhalation was correlated with airway resistance before bronchodilator(r=0.307 p=0.05), and the difference between slow vital capacity and forced vital capacity after bronchodilator was correlated with difference between slow vital capacity and forced vital capacity(r=0.559 p=0.0002), thoracic gas volume(r=0.488 p=0.002) before bronchodilator and airway resistance(r=0.583 p=0.0001), thoracic gas volume(r=0.375 p=0.0170) after bronchodilator, respectively. 5) The difference between slow vital capacity and forced vital capacity in smokers and nonsmokers was $257.5{\pm}303ml$, $277.5{\pm}276ml$, respectively and this difference did not reach statistical significance(p>0.05). Conclusion : The difference between slow vital capacity and forced vital capacity by spirometry may be useful for the detection of collapsible airway and may help decision making of therapeutic plans.

  • PDF

Environmental Changes after Timber Harvesting in (Mt.) Paekunsan (백운산(白雲山) 성숙활엽수림(成熟闊葉樹林) 개벌수확지(皆伐收穫地)에서 벌출직후(伐出直後)의 환경변화(環境變化))

  • Park, Jae-Hyeon
    • Journal of Korean Society of Forest Science
    • /
    • v.84 no.4
    • /
    • pp.465-478
    • /
    • 1995
  • The objective of this study was to investigate the impacts of large-scale timber harvesting on the environment of a mature hardwood forest. To achieve the objective, the effects of harvesting on forest environmental factors were analyzed quantitatively using the field data measured in the study sites of Seoul National University Research Forests [(Mt.) Paekunsan] for two years(1993-1994) following timber harvesting. The field data include information on vegetation, soil mesofauna, physicochemical characteristics of soil, surface water runoff, water quality in the stream, and hillslope erosion. For comparison, field data for each environmental factor were collected in forest areas disturbed by logging and undisturbed, separately. The results of this study were as follows : The diversity of vegetational species increased in the harvested sites. However, the similarity index value of species between harvested and non-harvested sites was close to each other. Soil bulk density and soil hardness were increased after timber harvesting, respectively. The level of organic matter, total-N, avail $P_2O_5$, CEC($K^+$, $Na^+$, $Ca^{{+}{+}}$, $Mg^{{+}{+}}$) in the harvested area were found decreased. While the population of Colembola spp., and Acari spp. among soil mesofauna in harvested sites increased by two to seven times compared to those of non-harvested sites during the first year, the rates of increment decreased in the second year. However, those members of soil mesofauna in harvested sites were still higher than those of non-harvested sites in the second year. The results of statistical analysis using the stepwise regression method indicated that the diversity of soil mesofauna were significantly affected by soil moisture, soil bulk density, $Mg^{{+}{+}}$, CEC, and soil temperature at soil depth of 5(0~10)cm in the order of importance. The amount of surface water runoff on harvested sites was larger than that of non-harvested sites by 28% in the first year and 24.5% in the second year after timber harvesting. The level of BOD, COD, and pH in the stream water on the harvested sites reached at the level of the domestic use for drinking in the first and second year after timber harvesting. Such heavy metals as Cd, Pb, Cu, and organic P were not found. Moreover, the level of eight factors of domestic use for drinking water designated by the Ministry of Health and Welfare of Korea were within the level of the first class in the quality of drinking water standard. The study also showed that the amount of hillslope erosion in harvested sites was 4.77 ton/ha/yr in the first year after timber harvesting. In the second year, the amount decreased rapidly to 1.0 ton/ha/yr. The impact of logging on hillslope erosion in the harvested sites was larger than that in non-harvested sites by seven times in the first year and two times in the second year. The above results indicate that the large-scale timber harvesting cause significant changes in the environmental factors. However, the results are based on only two-year field observation. We should take more field observation and analyses to increase understandings on the impacts of timber harvesting on environmental changes. With the understandings, we might be able to improve the technology of timber harvesting operations to reduce the environmental impacts of large-scale timber harvesting.

  • PDF

Assessment of Cerebral Hemodynamic Changes in Pediatric Patients with Moyamoya Disease Using Probabilistic Maps on Analysis of Basal/Acetazolamide Stress Brain Perfusion SPECT (소아 모야모야병에서 뇌확률지도를 이용한 수술전후 혈역학적 변화 분석)

  • Lee, Ho-Young;Lee, Jae-Sung;Kim, Seung-Ki;Wang, Kyu-Chang;Cho, Byung-Kyu;Chung, June-Key;Lee, Myung-Chul;Lee, Dong-Soo
    • Nuclear Medicine and Molecular Imaging
    • /
    • v.42 no.3
    • /
    • pp.192-200
    • /
    • 2008
  • To evaluate the hemodynamic changes and the predictive factors of the clinical outcome in pediatric patients with moyamoya disease, we analyzed pre/post basal/acetazolamide stress brain perfusion SPECT with automated volume of interest (VOIs) method. Methods: Total fifty six (M:F = 33:24, age $6.7{\pm}3.2$ years) pediatric patients with moyamoya disease, who underwent basal/acetazolamide stress brain perfusion SPECT within 6 before and after revascularization surgery (encephalo-duro-arterio-synangiosis (EDAS) with frontal encephalo-galeo-synangiosis (EGS) and EDAS only followed on contralateral hemisphere), and followed-up more than 6 months after post-operative SPECT, were included. A mean follow-up period after post-operative SPECT was $33{\pm}21$ months. Each patient's SPECT image was spatially normalized to Korean template with the SPM2. For the regional count normalization, the count of pons was used as a reference region. The basal/acetazolamide-stressed cerebral blood flow (CBF), the cerebral vascular reserve index (CVRI), and the extent of area with significantly decreased basal/acetazolamide- stressed rCBF than age-matched normal control were evaluated on both medial frontal, frontal, parietal, occipital lobes, and whole brain in each patient's images. The post-operative clinical outcome was assigned as good, poor according to the presence of transient ischemic attacks and/or fixed neurological deficits by pediatric neurosurgeon. Results: In a paired t-test, basal/acetazolamide-stressed rCBF and the CVRI were significantly improved after revascularization (p<0.05). The significant difference in the pre-operative basal/acetazolamide-stressed rCBF and the CVRI between the hemispheres where EDAS with frontal EGS was performed and their contralateral counterparts where EDAS only was done disappeared after operation (p<0.05). In an independent student t-test, the pre-operative basal rCBF in the medial frontal gyrus, the post-operative CVRI in the frontal lobe and the parietal lobe of the hemispheres with EDAS and frontal EGS, the post-operative CVRI, and ${\Delta}CVRI$ showed a significant difference between patients with a good and poor clinical outcome (p<0.05). In a multivariate logistic regression analysis, the ${\Delta}CVRI$ and the post-operative CVRI of medial frontal gyrus on the hemispheres where EDAS with frontal EGS was performed were the significant predictive factors for the clinical outcome (p =0.002, p =0.015), Conclusion: With probabilistic map, we could objectively evaluate pre/post-operative hemodynamic changes of pediatric patients with moyamoya disease. Specifically the post-operative CVRI and the post-operative CVRI of medial frontal gyrus where EDAS with frontal EGS was done were the significant predictive factors for further clinical outcomes.

A Time Series Graph based Convolutional Neural Network Model for Effective Input Variable Pattern Learning : Application to the Prediction of Stock Market (효과적인 입력변수 패턴 학습을 위한 시계열 그래프 기반 합성곱 신경망 모형: 주식시장 예측에의 응용)

  • Lee, Mo-Se;Ahn, Hyunchul
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.1
    • /
    • pp.167-181
    • /
    • 2018
  • Over the past decade, deep learning has been in spotlight among various machine learning algorithms. In particular, CNN(Convolutional Neural Network), which is known as the effective solution for recognizing and classifying images or voices, has been popularly applied to classification and prediction problems. In this study, we investigate the way to apply CNN in business problem solving. Specifically, this study propose to apply CNN to stock market prediction, one of the most challenging tasks in the machine learning research. As mentioned, CNN has strength in interpreting images. Thus, the model proposed in this study adopts CNN as the binary classifier that predicts stock market direction (upward or downward) by using time series graphs as its inputs. That is, our proposal is to build a machine learning algorithm that mimics an experts called 'technical analysts' who examine the graph of past price movement, and predict future financial price movements. Our proposed model named 'CNN-FG(Convolutional Neural Network using Fluctuation Graph)' consists of five steps. In the first step, it divides the dataset into the intervals of 5 days. And then, it creates time series graphs for the divided dataset in step 2. The size of the image in which the graph is drawn is $40(pixels){\times}40(pixels)$, and the graph of each independent variable was drawn using different colors. In step 3, the model converts the images into the matrices. Each image is converted into the combination of three matrices in order to express the value of the color using R(red), G(green), and B(blue) scale. In the next step, it splits the dataset of the graph images into training and validation datasets. We used 80% of the total dataset as the training dataset, and the remaining 20% as the validation dataset. And then, CNN classifiers are trained using the images of training dataset in the final step. Regarding the parameters of CNN-FG, we adopted two convolution filters ($5{\times}5{\times}6$ and $5{\times}5{\times}9$) in the convolution layer. In the pooling layer, $2{\times}2$ max pooling filter was used. The numbers of the nodes in two hidden layers were set to, respectively, 900 and 32, and the number of the nodes in the output layer was set to 2(one is for the prediction of upward trend, and the other one is for downward trend). Activation functions for the convolution layer and the hidden layer were set to ReLU(Rectified Linear Unit), and one for the output layer set to Softmax function. To validate our model - CNN-FG, we applied it to the prediction of KOSPI200 for 2,026 days in eight years (from 2009 to 2016). To match the proportions of the two groups in the independent variable (i.e. tomorrow's stock market movement), we selected 1,950 samples by applying random sampling. Finally, we built the training dataset using 80% of the total dataset (1,560 samples), and the validation dataset using 20% (390 samples). The dependent variables of the experimental dataset included twelve technical indicators popularly been used in the previous studies. They include Stochastic %K, Stochastic %D, Momentum, ROC(rate of change), LW %R(Larry William's %R), A/D oscillator(accumulation/distribution oscillator), OSCP(price oscillator), CCI(commodity channel index), and so on. To confirm the superiority of CNN-FG, we compared its prediction accuracy with the ones of other classification models. Experimental results showed that CNN-FG outperforms LOGIT(logistic regression), ANN(artificial neural network), and SVM(support vector machine) with the statistical significance. These empirical results imply that converting time series business data into graphs and building CNN-based classification models using these graphs can be effective from the perspective of prediction accuracy. Thus, this paper sheds a light on how to apply deep learning techniques to the domain of business problem solving.

Studies on Development of Prediction Model of Landslide Hazard and Its Utilization (산지사면(山地斜面)의 붕괴위험도(崩壞危險度) 예측(豫測)모델의 개발(開發) 및 실용화(實用化) 방안(方案))

  • Ma, Ho-Seop
    • Journal of Korean Society of Forest Science
    • /
    • v.83 no.2
    • /
    • pp.175-190
    • /
    • 1994
  • In order to get fundamental information for prediction of landslide hazard, both forest and site factors affecting slope stability were investigated in many areas of active landslides. Twelve descriptors were identified and quantified to develop the prediction model by multivariate statistical analysis. The main results obtained could be summarized as follows : The main factors influencing a large scale of landslide were shown in order of precipitation, age group of forest trees, altitude, soil texture, slope gradient, position of slope, vegetation, stream order, vertical slope, bed rock, soil depth and aspect. According to partial correlation coefficient, it was shown in order of age group of forest trees, precipitation, soil texture, bed rock, slope gradient, position of slope, altitude, vertical slope, stream order, vegetation, soil depth and aspect. The main factors influencing a landslide occurrence were shown in order of age group of forest trees, altitude, soil texture, slope gradient, precipitation, vertical slope, stream order, bed rock and soil depth. Two prediction models were developed by magnitude and frequency of landslide. Particularly, a prediction method by magnitude of landslide was changed the score for the convenience of use. If the total store of the various factors mark over 9.1636, it is evaluated as a very dangerous area. The mean score of landslide and non-landslide group was 0.1977 and -0.1977, and variance was 0.1100 and 0.1250, respectively. The boundary value between the two groups related to slope stability was -0.02, and its predicted rate of discrimination was 73%. In the score range of the degree of landslide hazard based on the boundary value of discrimination, class A was 0.3132 over, class B was 0.3132 to -0.1050, class C was -0.1050 to -0.4196, class D was -0.4195 below. The rank of landslide hazard could be divided into classes A, B, C and D by the boundary value. In the number of slope, class A was 68, class B was 115, class C was 65, and class D was 52. The rate of landslide occurrence in class A and class B was shown at the hige prediction of 83%. Therefore, dangerous areas selected by the prediction method of landslide could be mapped for land-use planning and criterion of disaster district. And also, it could be applied to an administration index for disaster prevention.

  • PDF

Machine learning-based corporate default risk prediction model verification and policy recommendation: Focusing on improvement through stacking ensemble model (머신러닝 기반 기업부도위험 예측모델 검증 및 정책적 제언: 스태킹 앙상블 모델을 통한 개선을 중심으로)

  • Eom, Haneul;Kim, Jaeseong;Choi, Sangok
    • Journal of Intelligence and Information Systems
    • /
    • v.26 no.2
    • /
    • pp.105-129
    • /
    • 2020
  • This study uses corporate data from 2012 to 2018 when K-IFRS was applied in earnest to predict default risks. The data used in the analysis totaled 10,545 rows, consisting of 160 columns including 38 in the statement of financial position, 26 in the statement of comprehensive income, 11 in the statement of cash flows, and 76 in the index of financial ratios. Unlike most previous prior studies used the default event as the basis for learning about default risk, this study calculated default risk using the market capitalization and stock price volatility of each company based on the Merton model. Through this, it was able to solve the problem of data imbalance due to the scarcity of default events, which had been pointed out as the limitation of the existing methodology, and the problem of reflecting the difference in default risk that exists within ordinary companies. Because learning was conducted only by using corporate information available to unlisted companies, default risks of unlisted companies without stock price information can be appropriately derived. Through this, it can provide stable default risk assessment services to unlisted companies that are difficult to determine proper default risk with traditional credit rating models such as small and medium-sized companies and startups. Although there has been an active study of predicting corporate default risks using machine learning recently, model bias issues exist because most studies are making predictions based on a single model. Stable and reliable valuation methodology is required for the calculation of default risk, given that the entity's default risk information is very widely utilized in the market and the sensitivity to the difference in default risk is high. Also, Strict standards are also required for methods of calculation. The credit rating method stipulated by the Financial Services Commission in the Financial Investment Regulations calls for the preparation of evaluation methods, including verification of the adequacy of evaluation methods, in consideration of past statistical data and experiences on credit ratings and changes in future market conditions. This study allowed the reduction of individual models' bias by utilizing stacking ensemble techniques that synthesize various machine learning models. This allows us to capture complex nonlinear relationships between default risk and various corporate information and maximize the advantages of machine learning-based default risk prediction models that take less time to calculate. To calculate forecasts by sub model to be used as input data for the Stacking Ensemble model, training data were divided into seven pieces, and sub-models were trained in a divided set to produce forecasts. To compare the predictive power of the Stacking Ensemble model, Random Forest, MLP, and CNN models were trained with full training data, then the predictive power of each model was verified on the test set. The analysis showed that the Stacking Ensemble model exceeded the predictive power of the Random Forest model, which had the best performance on a single model. Next, to check for statistically significant differences between the Stacking Ensemble model and the forecasts for each individual model, the Pair between the Stacking Ensemble model and each individual model was constructed. Because the results of the Shapiro-wilk normality test also showed that all Pair did not follow normality, Using the nonparametric method wilcoxon rank sum test, we checked whether the two model forecasts that make up the Pair showed statistically significant differences. The analysis showed that the forecasts of the Staging Ensemble model showed statistically significant differences from those of the MLP model and CNN model. In addition, this study can provide a methodology that allows existing credit rating agencies to apply machine learning-based bankruptcy risk prediction methodologies, given that traditional credit rating models can also be reflected as sub-models to calculate the final default probability. Also, the Stacking Ensemble techniques proposed in this study can help design to meet the requirements of the Financial Investment Business Regulations through the combination of various sub-models. We hope that this research will be used as a resource to increase practical use by overcoming and improving the limitations of existing machine learning-based models.