• Title/Summary/Keyword: Logit Models

Search Result 210, Processing Time 0.024 seconds

Habitat Suitability Models of Endangered Wildlife Class II Mauremys reevesii in Gurye-gun, the Republic of Korea (전라남도 구례군에 서식하는 멸종위기 야생생물 II급 남생이의 서식지 적합성 모델 개발)

  • Chang-Deuk Park;Jeongwoo Yoo;Kwanik Kwon;Nakyung Yoo;Moon Seong Heo;Ju-Duk Yoon
    • Journal of Environmental Impact Assessment
    • /
    • v.32 no.2
    • /
    • pp.83-93
    • /
    • 2023
  • This study was conducted to clarify the environmental variables that affect the appearance of Mauremys reevesii and to understand the relationship between M. reevesii and the variables. Habitat environmental survey was implemented by selecting 17 environmental variables considering ecological characteristics of M. reevesii in the main reservoir in Gurye-gun, the Republic of Korea. And the habitat data on the presence and absence of M.reevesii were analyzed statistically. The habitat suitability model of M. reevesii was described in following equation : logit (p) = -3.68 + (0.17 × leaf litter depth) + (1.55 × vegetation coverage of overstory on land) + (0.71 × coverage of midstory on land) + (0.96 × vegetation coverage of understory on water). This information gained is valuable for better understanding the distribution and how to conserve and promote populations of M. reevesii occurring in the Republic of Korea.

A Study on Selecting Model for Small and Medium Management Innovative Manufacturers (경영혁신형 중.소 제조기업 선정 모형에 관한 연구)

  • You, Yen-Yoo;Roh, Jae-Whak
    • The Journal of Society for e-Business Studies
    • /
    • v.15 no.2
    • /
    • pp.55-75
    • /
    • 2010
  • The primary purposes of this study are to find a proper model for government's selections of Mainbiz and present what are the better weights of the current indexes. We prepared three sets of models:first one using original 10 variables; second one using 9principally composed variables; third one using 7 principally composed variables. Among 3 models, the last one had higher explanation power than the other two models. Therefore, if index weights are adjusted according to the third newly developed model, the credibility in evaluating and selecting Mainbiz will be improved. When transforming the index weights and running the analysis, 5 variables(organization process, marketing management, management process, production-facility states, the level of forecasting) have more direct influences than other 4 variables(innovation strategies, knowledge management, achieving level, operational level) on selecting Main-biz.

Unified Approach to Coefficient of Determination $R^2$ Using Likelihood Distancd (우도거리에 의한 결정계수 $R^2$에의한 통합적 접근)

  • 허명회;이종한;정진환
    • The Korean Journal of Applied Statistics
    • /
    • v.4 no.2
    • /
    • pp.117-127
    • /
    • 1991
  • Coefficient of determination $R^2$ is most frequently used descriptive measure in practical use of linear regression analysis. But there have been controversies on defining this measure in the cases of linear regression without the intercept, weighted linear regression and robust linear regression. Several authors such as Kvalseth(1985) and Willet and Singer(1988) proposed many variations of $R^2$ to meet the situations. However, theire measures are not satisfactory due to the lack of a universal principle. In this study, we propose a unfied approach to defining the coefficient of determination $R^2$ using the concept of likelihood distance. This new measure is in good accordance with typical $R^2$ in linear regression and, moreover, can be applied to nonlinear regression models and generalized linear models such as logit and log-linear models.

  • PDF

Design and Implementation of Travel Mode Choice Model Using the Bayesian Networks of Data Mining (데이터마이닝의 베이지안 망 기법을 이용한 교통수단선택 모형의 설계 및 구축)

  • Kim, Hyun-Gi;Kim, Kang-Soo;Lee, Sang-Min
    • Journal of Korean Society of Transportation
    • /
    • v.22 no.2 s.73
    • /
    • pp.77-86
    • /
    • 2004
  • In this study, we applied the Bayesian Network for the case of the mode choice models using the Seoul metropolitan area's house trip survey Data. Sex and age were used lot the independent variables for the explanation or the mode choice, and the relationships between the mode choice and the travellers' social characteristics were identified by the Bayesian Network. Furthermore, trip and mode's characteristics such as time and fare were also used for independent variables and the mode choice models were developed. It was found that the Bayesian Network were useful tool to overcome the problems which were in the traditional mode choice models. In particular, the various transport policies could be evaluated in the very short time by the established relation-ships. It is expected that the Bayesian Network will be utilized as the important tools for the transport analysis.

A Study for Comparison of Risk Estimates According to Extrapolating Methods of Benzo(a)Pyrene in the Ambient Air (대기중 Benzo(a) pyrene의 외삽방법에 따른 위해도 추계치의 비교 연구)

  • Kim, Jong-Man;Chung, Yong
    • Journal of Korean Society for Atmospheric Environment
    • /
    • v.8 no.1
    • /
    • pp.29-37
    • /
    • 1992
  • The risk of benzo(a)pyrene for cancer in the ambient air of Seoul was assessed by using the extrapolation methods. The average daily lifetime exposure of benzo(a)pyrene in the ambient air of Seoul was calculated at 6.97-24.30ng/$m^2$/day, which was based on the occurrence analysis of benzo(a)pyrene in the residential(Bull Kwang Dong) and traffic areas(Shin Chon) of Seoul. Using the dose scaling based on body surface area in comparisons of toxicity for extrapolation from animal to human and mathematical models from the high dose region, the low-dose risk was estimated. The response probabilities were estimated by the tolerance distribution models; Probit, Logit and Weibull model. They were consistent with the observed ones at experimental dose region. The unit risk estimates of these models were too low to be used. One-hit and multistage model to prove more conservative risk was selected. As a redult, the lifetime unit risk of benzo(a)pyrene for cancer and virtually safe dose were calculated; One-hit model provided the risk 2.8 $\times 10^{-7}$ and 3.4ng/$m^3$, respectively and multistage model provided 5.2 $\times 10^{-7}$ and 1.9ng/$m^3$ as the more conservatives. The lifetime excess risk estimates of benzo(a)pyrene for cancer were calculated at 0.37-1.30 persons/million persons by one-hit model and 0.69-2.41 persons/million persons by multistage model, which was considered in without virtual risk.

  • PDF

Optimization of Support Vector Machines for Financial Forecasting (재무예측을 위한 Support Vector Machine의 최적화)

  • Kim, Kyoung-Jae;Ahn, Hyun-Chul
    • Journal of Intelligence and Information Systems
    • /
    • v.17 no.4
    • /
    • pp.241-254
    • /
    • 2011
  • Financial time-series forecasting is one of the most important issues because it is essential for the risk management of financial institutions. Therefore, researchers have tried to forecast financial time-series using various data mining techniques such as regression, artificial neural networks, decision trees, k-nearest neighbor etc. Recently, support vector machines (SVMs) are popularly applied to this research area because they have advantages that they don't require huge training data and have low possibility of overfitting. However, a user must determine several design factors by heuristics in order to use SVM. For example, the selection of appropriate kernel function and its parameters and proper feature subset selection are major design factors of SVM. Other than these factors, the proper selection of instance subset may also improve the forecasting performance of SVM by eliminating irrelevant and distorting training instances. Nonetheless, there have been few studies that have applied instance selection to SVM, especially in the domain of stock market prediction. Instance selection tries to choose proper instance subsets from original training data. It may be considered as a method of knowledge refinement and it maintains the instance-base. This study proposes the novel instance selection algorithm for SVMs. The proposed technique in this study uses genetic algorithm (GA) to optimize instance selection process with parameter optimization simultaneously. We call the model as ISVM (SVM with Instance selection) in this study. Experiments on stock market data are implemented using ISVM. In this study, the GA searches for optimal or near-optimal values of kernel parameters and relevant instances for SVMs. This study needs two sets of parameters in chromosomes in GA setting : The codes for kernel parameters and for instance selection. For the controlling parameters of the GA search, the population size is set at 50 organisms and the value of the crossover rate is set at 0.7 while the mutation rate is 0.1. As the stopping condition, 50 generations are permitted. The application data used in this study consists of technical indicators and the direction of change in the daily Korea stock price index (KOSPI). The total number of samples is 2218 trading days. We separate the whole data into three subsets as training, test, hold-out data set. The number of data in each subset is 1056, 581, 581 respectively. This study compares ISVM to several comparative models including logistic regression (logit), backpropagation neural networks (ANN), nearest neighbor (1-NN), conventional SVM (SVM) and SVM with the optimized parameters (PSVM). In especial, PSVM uses optimized kernel parameters by the genetic algorithm. The experimental results show that ISVM outperforms 1-NN by 15.32%, ANN by 6.89%, Logit and SVM by 5.34%, and PSVM by 4.82% for the holdout data. For ISVM, only 556 data from 1056 original training data are used to produce the result. In addition, the two-sample test for proportions is used to examine whether ISVM significantly outperforms other comparative models. The results indicate that ISVM outperforms ANN and 1-NN at the 1% statistical significance level. In addition, ISVM performs better than Logit, SVM and PSVM at the 5% statistical significance level.

A GA-based Rule Extraction for Bankruptcy Prediction Modeling (유전자 알고리즘을 활용한 부실예측모형의 구축)

  • Shin, Kyung-shik
    • Journal of Intelligence and Information Systems
    • /
    • v.7 no.2
    • /
    • pp.83-93
    • /
    • 2001
  • Prediction of corporate failure using past financial data is well-documented topic. Early studies of bankruptcy prediction used statistical techniques such as multiple discriminant analysis, logit and probit. Recently, however, numerous studies have demonstrated that artificial intelligence such as neural networks (NNs) can be an alternative methodology for classification problems to which traditional statistical methods have long been applied. Although numerous theoretical and experimental studies reported the usefulness or neural networks in classification studies, there exists a major drawback in building and using the model. That is, the user can not readily comprehend the final rules that the neural network models acquire. We propose a genetic algorithms (GAs) approach in this study and illustrate how GAs can be applied to corporate failure prediction modeling. An advantage of GAs approach offers is that it is capable of extracting rules that are easy to understand for users like expert systems. The preliminary results show that rule extraction approach using GAs for bankruptcy prediction modeling is promising.

  • PDF

Hurdle Model for Longitudinal Zero-Inflated Count Data Analysis (영과잉 경시적 가산자료 분석을 위한 허들모형)

  • Jin, Iktae;Lee, Keunbaik
    • The Korean Journal of Applied Statistics
    • /
    • v.27 no.6
    • /
    • pp.923-932
    • /
    • 2014
  • The Hurdle model can to analyze zero-inflated count data. This model is a mixed model of the logit model for a binary component and a truncated Poisson model of a truncated count component. We propose a new hurdle model with a general heterogeneous random effects covariance matrix to analyze longitudinal zero-inflated count data using modified Cholesky decomposition. This decomposition factors the random effects covariance matrix into generalized autoregressive parameters and innovation variance. The parameters are modeled using (generalized) linear models and estimated with a Bayesian method. We use these methods to carefully analyze a real dataset.

The Importance of a Borrower's Track Record on Repayment Performance: Evidence in P2P Lending Market

  • KIM, Dongwoo
    • The Journal of Asian Finance, Economics and Business
    • /
    • v.7 no.7
    • /
    • pp.85-93
    • /
    • 2020
  • In peer-to-peer (P2P) loan markets, as most lenders are unskilled and inexperienced ordinary individuals, it is important to know the characteristics of borrowers that significantly impact their repayment performance. This study investigates the effects and importance of borrowers' past repayment performance track record within the platform to identify its predictive power. To this end, I analyze the detailed loan repayment data from two leading P2P lending platforms in Korea using a Cox proportional hazard, multiple linear regression, and logit models. Furthermore, the predictive power of the factors proxied by borrowers' track records are evaluated through the receiver operating characteristic (ROC) curves. As a result, it is found that the borrowers' past track record within the platform have the most important impact on the repayment performance of their current loans. In addition, this study also reveals that the borrowers' track record is much more predictive of their repayment performance than any other factor. The findings of this study emphasize that individual lenders must take into account the quality of borrowers' past transaction history when making a funding decision, and that platform operators should actively share the borrowers' past records within the markets with lenders.

Forecasting Market Shares of Environment-Friendly Vehicles under Different Market Scenarios

  • Bae, Jeong Hwan;Jung, Heayoung
    • Environmental and Resource Economics Review
    • /
    • v.22 no.1
    • /
    • pp.3-29
    • /
    • 2013
  • The purpose of this study is to estimate consumer preferences on hybrid cars and electric cars by employing a choice experiment reflecting the various market conditions, such as different projected market shares of green vehicles and $CO_2$ emission regulations. Depending on different market scenarios, we examine as to which attribute and individual characteristic affect the preferences of potential consumers on green vehicles and further, forecast the potential market shares of green cars. The primary results, estimated by a conditional logit and panel probit models, indicate that sales price, fuel cost, maximum speed, emission of air pollutants, fuel economy, and distance between fuel stations can significantly affect consumer's choice of environment-friendly cars. The second finding is that the unique features of electric cars might better appeal to consumers as the market conditions for electric cars are improved. Third, education, age, and gender can significantly affect individual preferences. Finally, as the market conditions become more favorable toward green cars, the forecasted market shares of hybrid and electric vehicles will increase up to 67% and 14%.