• Title/Summary/Keyword: regression trees

Search Result 248, Processing Time 0.024 seconds

Analysis of Survivability for Combatants during Offensive Operations at the Tactical Level (전술제대 공격작전간 전투원 생존성에 관한 연구)

  • Kim, Jaeoh;Cho, HyungJun;Kim, GakGyu
    • The Korean Journal of Applied Statistics
    • /
    • v.28 no.5
    • /
    • pp.921-932
    • /
    • 2015
  • This study analyzed military personnel survivability in regards to offensive operations according to the scientific military training data of a reinforced infantry battalion. Scientific battle training was conducted at the Korea Combat Training Center (KCTC) training facility and utilized scientific military training equipment that included MILES and the main exercise control system. The training audience freely engaged an OPFOR who is an expert at tactics and weapon systems. It provides a statistical analysis of data in regards to state-of-the-art military training because the scientific battle training system saves and utilizes all training zone data for analysis and after action review as well as offers training control during the training period. The methodologies used the Cox PH modeling (which does not require parametric distribution assumptions) and decision tree modeling for survival data such as CART, GUIDE, and CTREE for richer and easier interpretation. The variables that violate the PH assumption were stratified and analyzed. Since the Cox PH model result was not easy to interpret the period of service, additional interpretation was attempted through univariate local regression. CART, GUIDE, and CTREE formed different tree models which allow for various interpretations.

Artificial Intelligence Techniques for Predicting Online Peer-to-Peer(P2P) Loan Default (인공지능기법을 이용한 온라인 P2P 대출거래의 채무불이행 예측에 관한 실증연구)

  • Bae, Jae Kwon;Lee, Seung Yeon;Seo, Hee Jin
    • The Journal of Society for e-Business Studies
    • /
    • v.23 no.3
    • /
    • pp.207-224
    • /
    • 2018
  • In this article, an empirical study was conducted by using public dataset from Lending Club Corporation, the largest online peer-to-peer (P2P) lending in the world. We explore significant predictor variables related to P2P lending default that housing situation, length of employment, average current balance, debt-to-income ratio, loan amount, loan purpose, interest rate, public records, number of finance trades, total credit/credit limit, number of delinquent accounts, number of mortgage accounts, and number of bank card accounts are significant factors to loan funded successful on Lending Club platform. We developed online P2P lending default prediction models using discriminant analysis, logistic regression, neural networks, and decision trees (i.e., CART and C5.0) in order to predict P2P loan default. To verify the feasibility and effectiveness of P2P lending default prediction models, borrower loan data and credit data used in this study. Empirical results indicated that neural networks outperforms other classifiers such as discriminant analysis, logistic regression, CART, and C5.0. Neural networks always outperforms other classifiers in P2P loan default prediction.

The Influence of Pedestrian Environment Perception on Pedestrian Environment Satisfaction and Expected Health Promotion Effects - Focused on Park User for Health Promotion - (보행환경 인식이 보행환경 만족도 및 건강증진 기대효과에 미치는 영향 - 건강 목적의 공원 이용자를 대상으로 -)

  • Lee, Gyeong-Mi;Lee, Woo-Sung;Jung, Sung-Gwan;Jang, Cheol-Kyu
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.44 no.6
    • /
    • pp.137-147
    • /
    • 2016
  • The purpose of this study was to analyze the perception factors of a pedestrian environment that affect pedestrian environment satisfaction(PES) and determine the relationship between PES and the expected effects of health promotion. The targeted areas of study are neighborhood parks in Suseong-gu, Daegu city. First, regarding the results for the evaluation of pedestrian environment perception, 'Gentle slope' was rated the highest, while factors regarding pedestrian safety such as 'Lots of unpleasant elements', 'Risk from biking and motorcycling' and 'Many obstacles on sidewalks' were rated low. A stepwise regression analysis showed that factors such as 'Fresh air', 'Beautiful scenery', 'Continuity of the sidewalks', 'Various attractions', 'The shade of trees' and 'Lots of unpleasant elements' influenced the PES. Therefore, creating fresh air and shade trees by planting trees and removing unpleasant elements from pedestrian areas are important. Also, it is necessary to cultivate beautiful scenery and attractions through street improvement and improve the continuity of the sidewalks. Finally, in terms of path analysis, PES influenced the frequency of park use, the expected effects of physical and mental health promotion both directly and indirectly.

Biomass Regressions of Pinus densiflora Natural Forests of Four Local Forms in Korea (한국산(韓國産) 4개(個) 지역형(地域型) 소나무천연림(天然林)의 물질(物質) 현존량(現存量) 추정식(推定式)에 관(關)한 연구(硏究))

  • Park, In Hyeop;Kim, Joon Seon
    • Journal of Korean Society of Forest Science
    • /
    • v.78 no.3
    • /
    • pp.323-330
    • /
    • 1989
  • Pinass densiflora natural forests of four local forms in Korea were studies to investigate effective biomass estimation method. Dimension analysis was used and three allometric regression models, such as logWt=A+BlogD, logWt=$A+B1ogD^2H$ and 1ogWt=A+BlogD+ClogH were applied to estimate biomass, The most accurate estimation was made by the regression model of logWt=A+BlogD+ClogH where Wt is dry weight, D is diameter at breast height, and H is tree height. However, dry weights of cones and dead branches were remotely related to tree size factor, such as D and H. In the interest of practical use. generalized allometric regressions for all samples trees of four stands were computed and analysis of covariance was used to compare the allometric regressions among the four stands. Based on the test criteria applied in this study, significant differences were found in terms of error variance and regression intercept, not in terms of regression slope. These trends suggest a generalized biomass regression is not valid for accurate estimation over a range of four local form stands.

  • PDF

Variation of Stomatal Traits of Natural Population of Quercus spp. (참나무 천연집단(天然集團)의 기공형질변이(氣孔形質變異))

  • Kim, Chi Moon;Kwon, Ki Won;Moon, Heung Kyu
    • Journal of Korean Society of Forest Science
    • /
    • v.66 no.1
    • /
    • pp.82-94
    • /
    • 1984
  • The variation of stomatal density and stomatal length of four species of oaks was studied for the purpose of examining the differences among populations and among individual trees within population. Nine populations of Quercus mongolica, four populations of Q. serrata and Q. variabilis respectively, and three populations of Q. acutissima were selected in the natural stands of oaks distributed through the whole country. Twelve leaves were sampled from each of 20 trees from each population. The length of 20 stomata and ten replications of stomatal density were measured from collodion replicas of each leaf under a microscope. Average stomatal densities and lengths ranged through $600-1000/mm^2$ and $19-26{\mu}m$ respectively in all of the species studied. The stomatal densities and lengths presented significant differences statistically at the level of 1 or 5% among populations and among individual trees within population in all the species. Quercus mongolica, especially, showed large variation among populations, while Q. variabilis did very narrow variation compared to the other species. The coefficients of variation of stomatal densities and lengths among individual trees within population exhibited small values of 3.7-12.0% and 1.4-5.3% respectively in all the populations of the species. The average stomatal densities of Q. mongolica showed statistically significant correlation of multiple correlation coefficient of $R_{df{\cdot}2.6}=0.868^*$ and multiple regression equation of $Y=0.041X_1(G.M.T.S.)+0.489X_2(G.M.H.S.)+22.37$ with the sum of growing season mean daily temperature and the sum of growing season mean daily humidity of the stand studied. However the average stomatal lengths showed no relation with the same meteological variables. The figures of frequency distribution of the measurements of leaves or the mean values of individual trees did not show normal distribution curves in some populations. The curves, as well as the results of ANOVA, exhibited the differences among populations.

  • PDF

Traffic Safety Countermeasures According to the Accident Area Patterns and Impact Factor Analysis of the Large-scale Traffic Accident Locations (대형 교통사고 발생지점 유형화와 영향요인 분석에 따른 교통안전대책 방안에 관한 연구)

  • Kim, Bong-Gi;Jeong, Heon-Yeong;Go, Sang-Seon
    • Journal of Korean Society of Transportation
    • /
    • v.24 no.1 s.87
    • /
    • pp.39-52
    • /
    • 2006
  • This study divided the large-scale traffic accident locations into its own characteristics by using Cluster Analysis. Also, Quantification II and Classification and Regression Tree methods were used enabling evaluation for the amount of affecting influence by the crash type. After these analyses, we tested the fitness of the results and suggested the simplification of the quantification index. With the results from the discussed procedure, obvious differences were observed by groups according to the characteristics of crash type from the Discrimination and Classification analysis of divided four groups. Thus, measures and supplementary measures for the traffic accidents could be suggested in groups systematically. However, a lot of missing values in variables caused a huge loss of data and made this study difficult for more detailed analysis, With this difficulty. recording mandatory log files with a standardized format is also recommended to Prevent this Problem in advance.

Main SNP Identification of Hanwoo Carcass Weight with Multifactor Dimensionality Reduction(MDR) Method (MULTIFACTOR DIMENSIONALITY REDUCTION(MDR)을 이용한 한우 도체중에서의 주요 SNP 규명)

  • Lee, Jea-Young;Kim, Dong-Chul
    • The Korean Journal of Applied Statistics
    • /
    • v.21 no.1
    • /
    • pp.53-63
    • /
    • 2008
  • It is commonly believed that disease of human or economic traits of livestock are caused not by single gene acting alone, but by multiple genes interacting with one an-other. This issue is difficult due to the limitations of parametric statistical method like as logistic regression for detection of gene effects that are dependent solely on interactions with other genes and with environmental exposures. Multifactor dimensionality reduction (MDR) nonparametric statistical method, to improve the identification of single nucleotide polymorphism (SNP) associated with the Hanwoo(Korean cattle) carcass cold weight, is applied and compared with ANOVA results.

Binary Forecast of Asian Dust Days over South Korea in the Winter Season (남한지역 겨울철 황사출현일수에 대한 범주 예측모형 개발)

  • Sohn, Keon-Tae;Lee, Hyo-Jin;Kim, Seung-Bum
    • The Korean Journal of Applied Statistics
    • /
    • v.24 no.3
    • /
    • pp.535-546
    • /
    • 2011
  • This study develops statistical models for the binary forecast of Asian dust days over South Korea in the winter season. For this study, we used three kinds of data; the rst one is the observed Asian dust days for a period of 31 years (1980 to 2010) as target values, the second one is four meteorological factors(near surface temperature, precipitation, snowfall, ground wind speed) in the source regions of Asian dust based on the NCEP reanalysis data and the third one is the large-scale climate indices. Four kinds of statistical models(multiple regression models, logistic regression models, decision trees, and support vector machines) are applied and compared based on skill scores(hit rate, probability of detection and false alarm rate).

Stand Structure, Volume, and Biomass Production of 9-year-old Alnus hirsuta var. sibirica grown in Minirotation (물갬나무 9년생(年生)의 임분구조(林分構造)와 재적(材積) 및 Biomass 생산(生産)에 관(關)한 연구(硏究))

  • Oh, Jeong Soo;Kim, Jong Won;Jeong, Yong Ho;Oh, Min Yung;Park, Sung Kul;Kim, Suk Kwon
    • Journal of Korean Society of Forest Science
    • /
    • v.65 no.1
    • /
    • pp.54-59
    • /
    • 1984
  • Research was conducted in a minirotation plantation with four different planting densities at Tatae-ri, Chongwoon-myon, Yangpyong-gun, Kyonggi-do, to investigate the relation between volume and biomass production. Nine-year-old Alnus hirsuta var. sibirica analyzed to determine volume yield and weight equations for aboveground parts. The results suggest that the most suitable harvesting or thinning period at highly dense plots, more than 6,000 trees per hectare, is five years after planting, and the most fitted regression equation model for estimating aboveground biomass or total tree biomass is $logY=b_0+b_1logd^2h$.

  • PDF

Relation between the Shade Hours and the Landscape Tree Growth in the Apartment Housing Areas (공동주택단지내 조경수목의 생장과 피음시간과의 관계)

  • 윤근영;안건용
    • Korean Journal of Environment and Ecology
    • /
    • v.10 no.1
    • /
    • pp.49-57
    • /
    • 1996
  • To figure out the relation between the shade hours and the landscape tree growth in the apartment housing areas, the present sizes and planting positions of 4 tree species in Gwacheon-si apartment housing areas were surveyed. Then, shade hours were analyzed and the data were analyzed by simple linear regression method. As a whole, the R$^{2}$ was too low to generalize the regression equation. Therefore, it was presumed that the gravity of shade hours in landscape tree growth in this sample site was relatively lower than that of any other environmental factors. However, it was presumed that the characteristics of shade intolerant and tolerant tree were turned up, because Pinus strobus showed a low negative correlation with shade housm and Acer palmatum and Magnolia denudata showed a low positive correlation with shade hours generally. And, it was proved that the statistically significant cases were the tree diameter at root collar and tree sidth of Acer palmatum and tree width of Magnolia denudata with shade hours showing a low correlation coefficient less than 0.4.

  • PDF