• Title/Summary/Keyword: categorical variable

Search Result 104, Processing Time 0.026 seconds

Multiple Regression Analysis to Estimate the Unit Price of Hanwoo (Bos taurus coreanae) Beef

  • Eum, Seung-Hoon;Park, Hu-Rak;Seo, Jakyeom;Cho, Seong-Keun;Hur, Sun-Jin;Kim, Byeong-Woo
    • Food Science of Animal Resources
    • /
    • v.37 no.5
    • /
    • pp.663-669
    • /
    • 2017
  • This study were estimated the contribution of carcass traits to unit price, to analyze the marbling score as a categorical variable rather than a numerical variable, and to develop an optimal model that also includes the holiday effect and the raising period. The data for this study were acquired from the Quality Evaluation of the Korea Institute for Animal Products, and consisted of the trading records of 1,613,699 heads at 12 wholesale markets from 2010 to 2014. The unit price of a cow was estimated from the following parameters: -52.50 Won/mm, $8.93Won/cm^2$, 7.20 Won/kg, and -1.04 Won/day for backfat thickness, eye muscle area, carcass weight, and raising period, respectively. Parameters for the dummy variables of marbling scores varied from 0 to 8328.74 Won/kg, which means that each marbling score grade had a different price value. The unit price of a steer was estimated from the following parameters: - 92.12 Won/mm, $20.22Won/cm^2$, 1.30 Won/kg, and -1.72 Won/day for backfat thickness, eye muscle area, carcass weights, and raising period, respectively. Parameters for dummy variables of marbling scores varied from 0 to 7338.80 Won/kg, which means that the grades of each marbling score had different price values. The unit price of sales during traditional holidays was significantly higher (827.71 Won/kg for cows, and 645.15 Won/kg for steers) than during non-holidays.We conclude that the use of categorical values for marbling scores would be needed to evaluate the price of Hanwoo beef using multiple regression analysis based on carcass traits and environmental factors.

Influence of Global Competitive Capability on Global Performance of Distribution Industry in South Korea

  • KIM, Boine;KIM, Byoung-Goo
    • Journal of Distribution Science
    • /
    • v.19 no.12
    • /
    • pp.83-89
    • /
    • 2021
  • Purpose: Purpose of this study is to empirically analyze influence of global competitive capability on global performance of distribution industry in South Korea. Also based on the empirical results, give managerial implication to distribution industry and contribute to academies of management. Research design, data and methodology: This study focuses on relationship analysis between global competitive capability and global performance. This study measured global competitive capability with three concepts; human capability, network capability and product/service capability. And measured global performance with export performance. To empirically analyze relationship between variables, this study used 2,316 data of GCL Test by KOTRA and Kdata. This study used SPSS26 and analyzed frequency, reliability, correlation and stepwise regression analysis. Results: Result shows that, in control variable, business period and business field give significant positive influence on export performance. Among antecedents, human capability and network capability give significant positive influence on export performance. However, product/goods/service was not significant. Due to significant influence of business field which is categorical variable. This study additionally analyze relationship by business field group to confirm whether relationship differ by group or similar. Conclusions: Based on the results, this study try to give implication to distribution industry management and contribute to academic.

Determinants of Tourist Expenditure on 2013 Gangneung Dano Festival (2013 강릉단오제 관광객의 소비지출 결정요인에 관한 연구)

  • Jeong, Ug-Yeong;Han, Jin-Young
    • Journal of Digital Convergence
    • /
    • v.11 no.12
    • /
    • pp.93-100
    • /
    • 2013
  • This paper analyzes determinants of tourist consumption in the case of 2013 Gangneung Dano Festival, based on the multiple regression model. We set 12 determinants of consumption such as income as explanatory variables and consumption expenditure as a dependent variable. Also Five kinds of categorical consumptions are estimated. Main results are the followings. First, income is the most important factor and shows positive effect in tourist consumption. Second, age and metropolitan area influence consumption positively. Third number of participating day and length of stay also influence consumption positively. Fourth, number of accompanying person shows negative effect on consumption. Fifth, male, married person, and lodge with own expense influence consumption positively. Finally, categorical consumption has its specific determinants distinct from common factors This paper can be applied to invent and implement efficient strategies for development in regional economies and tour industries.

Developing Medium-size Corporate Credit Rating Systems by the Integration of Financial Model and Non-financial Model (재무모형과 비재무모형을 통합한 중기업 신용평가시스템의 개발)

  • Park, Cheol-Soo
    • Journal of the Korea Safety Management & Science
    • /
    • v.10 no.2
    • /
    • pp.71-83
    • /
    • 2008
  • Most researches on the corporate credit rating are generally classified into the area of bankruptcy prediction and bond rating. The studies on bankruptcy prediction have focused on improving the performance in binary classification problem, since the criterion variable is categorical, bankrupt or non-bankrupt. The other studies on bond rating have predicted the credit ratings, which was already evaluated by bond rating experts. The financial institute, however, should perform effective loan evaluation and risk management by employing the corporate credit rating model, which is able to determine the credit of corporations. Therefore, in this study we present a medium sized corporate credit rating system by using Artificial Neural Network(ANN) and Analytical Hierarchy Process(AHP). Also, we developed AHP model for credit rating using non-financial information. For the purpose of completed credit rating model, we integrated the ANN and AHP model using both financial information and non-financial information. Finally, the credit ratings of each firm are assigned by the proposed method.

Bayesian Methods for Generalized Linear Models

  • Paul E. Green;Kim, Dae-Hak
    • Communications for Statistical Applications and Methods
    • /
    • v.6 no.2
    • /
    • pp.523-532
    • /
    • 1999
  • Generalized linear models have various applications for data arising from many kinds of statistical studies. Although the response variable is generally assumed to be generated from a wide class of probability distributions we focus on count data that are most often analyzed using binomial models for proportions or poisson models for rates. The methods and results presented here also apply to many other categorical data models in general due to the relationship between multinomial and poisson sampling. The novelty of the approach suggested here is that all conditional distribution s can be specified directly so that staraightforward Gibbs sampling is possible. The prior distribution consists of two stages. We rely on a normal nonconjugate prior at the first stage and a vague prior for hyperparameters at the second stage. The methods are demonstrated with an illustrative example using data collected by Rosenkranz and raftery(1994) concerning the number of hospital admissions due to back pain in Washington state.

  • PDF

FUZZY REGRESSION TOWARDS A GENERAL INSURANCE APPLICATION

  • Kim, Joseph H.T.;Kim, Joocheol
    • Journal of applied mathematics & informatics
    • /
    • v.32 no.3_4
    • /
    • pp.343-357
    • /
    • 2014
  • In many non-life insurance applications past data are given in a form known as the run-off triangle. Smoothing such data using parametric crisp regression models has long served as the basis of estimating future claim amounts and the reserves set aside to protect the insurer from future losses. In this article a fuzzy counterpart of the Hoerl curve, a well-known claim reserving regression model, is proposed to analyze the past claim data and to determine the reserves. The fuzzy Hoerl curve is more flexible and general than the one considered in the previous fuzzy literature in that it includes a categorical variable with multiple explanatory variables, which requires the development of the fuzzy analysis of covariance, or fuzzy ANCOVA. Using an actual insurance run-off claim data we show that the suggested fuzzy Hoerl curve based on the fuzzy ANCOVA gives reasonable claim reserves without stringent assumptions needed for the traditional regression approach in claim reserving.

An Introduction to Logistic Regression: From Basic Concepts to Interpretation with Particular Attention to Nursing Domain

  • Park, Hyeoun-Ae
    • Journal of Korean Academy of Nursing
    • /
    • v.43 no.2
    • /
    • pp.154-164
    • /
    • 2013
  • Purpose: The purpose of this article is twofold: 1) introducing logistic regression (LR), a multivariable method for modeling the relationship between multiple independent variables and a categorical dependent variable, and 2) examining use and reporting of LR in the nursing literature. Methods: Text books on LR and research articles employing LR as main statistical analysis were reviewed. Twenty-three articles published between 2010 and 2011 in the Journal of Korean Academy of Nursing were analyzed for proper use and reporting of LR models. Results: Logistic regression from basic concepts such as odds, odds ratio, logit transformation and logistic curve, assumption, fitting, reporting and interpreting to cautions were presented. Substantial shortcomings were found in both use of LR and reporting of results. For many studies, sample size was not sufficiently large to call into question the accuracy of the regression model. Additionally, only one study reported validation analysis. Conclusion: Nursing researchers need to pay greater attention to guidelines concerning the use and reporting of LR models.

A generalized logit model with mixed effects for categorical data (다가자료에 대한 혼합효과모형)

  • 최재성
    • The Korean Journal of Applied Statistics
    • /
    • v.15 no.1
    • /
    • pp.129-137
    • /
    • 2002
  • This paper suggests a generalized logit model with mixed effects for analysing frequency data in multi-contingency table. In this model nominal response variable is assumed to be polychotomous. When some factors are fixed but considered as ordinal and others are random, this paper shows how to use baseline-category logits to incoporate the mixed-effects of those factors into the model. A numerical algorithm was used to estimate model parameters by using marginal log-likelihood.

Tree size determination for classification ensemble

  • Choi, Sung Hoon;Kim, Hyunjoong
    • Journal of the Korean Data and Information Science Society
    • /
    • v.27 no.1
    • /
    • pp.255-264
    • /
    • 2016
  • Classification is a predictive modeling for a categorical target variable. Various classification ensemble methods, which predict with better accuracy by combining multiple classifiers, became a powerful machine learning and data mining paradigm. Well-known methodologies of classification ensemble are boosting, bagging and random forest. In this article, we assume that decision trees are used as classifiers in the ensemble. Further, we hypothesized that tree size affects classification accuracy. To study how the tree size in uences accuracy, we performed experiments using twenty-eight data sets. Then we compare the performances of ensemble algorithms; bagging, double-bagging, boosting and random forest, with different tree sizes in the experiment.

A generalized logit model with mixed effects for categorical data (다가자료에 대한 혼합효과모형)

  • Choi, Jae-Sung
    • 한국데이터정보과학회:학술대회논문집
    • /
    • 2001.10a
    • /
    • pp.25-33
    • /
    • 2001
  • This paper suggests a generalized logit model with mixed effects for analysing frequency data in multi-contingency table. In this model nominal response variable is assumed to be polychotomous. When some factors are fixed but condisered as ordinal and others are random, this paper shows how to use baseline-category logits to incoporate the mixed-effects of those factors into the model. A numerical algorithm was used to estimate model parameters by using marginal log-likelihood.

  • PDF