• Title/Summary/Keyword: 이항반응자료

Search Result 21, Processing Time 0.024 seconds

The Detection of Unreliable Data in Survey Database (조사자료 데이터베이스의 허위 잠재 가능성 분류군 탐지)

  • Byon, Lu-Na;Han, Jeong-Hye
    • The KIPS Transactions:PartD
    • /
    • v.12D no.4 s.100
    • /
    • pp.657-662
    • /
    • 2005
  • The Non-Sampling Error can happen any time by means of the intended or unintended error by the interviewer or respondent, but it is very difficult to find the error in survey database because it can hardly be computed mathematically and systematically. Until now, we have found it accidentally through the simple relation between the items or through the inspection from the random field. Therefore we introduced an heuristic methodology that can detect the interviewer's error by statistical decision-making or data mining techniques with a case study. It will be helpful so as to improve the statistical duality and provide efficient field management for the supervisor.

Various Graphical Methods for Assessing a Logistic Regression Model (로지스틱회귀모형의 평가를 위한 그래픽적 방법)

  • Kim, Kyung Jin;Kahng, Myung Wook
    • The Korean Journal of Applied Statistics
    • /
    • v.28 no.6
    • /
    • pp.1191-1208
    • /
    • 2015
  • Most statistical methods are dependent on the summary statistic. However, with graphical approaches, it is easier to identify the characteristics of the data and detect information that cannot be obtained by the summary statistic. We present various graphical methods to assess the adequacy of models in logistic regression that include checking log-density ratio, structural dimension, marginal model plot, chi-residual plot, and CERES plot. Through simulation data, we investigate and compare the results of graphical approaches under diverse conditions.

A Zero-Inated Model for Insurance Data (제로팽창 모형을 이용한 보험데이터 분석)

  • Choi, Jong-Hoo;Ko, In-Mi;Cheon, Soo-Young
    • The Korean Journal of Applied Statistics
    • /
    • v.24 no.3
    • /
    • pp.485-494
    • /
    • 2011
  • When the observations can take only the non-negative integer values, it is called the count data such as the numbers of car accidents, earthquakes, or insurance coverage. In general, the Poisson regression model has been used to model these count data; however, this model has a weakness in that it is restricted by the equality of the mean and the variance. On the other hand, the count data often tend to be too dispersed to allow the use of the Poisson model in practice because the variance of data is significantly larger than its mean due to heterogeneity within groups. When overdispersion is not taken into account, it is expected that the resulting parameter estimates or standard errors will be inefficient. Since coverage is the main issue for insurance, some accidents may not be covered by insurance, and the number covered by insurance may be zero. This paper considers the zero-inflated model for the count data including many zeros. The performance of this model has been investigated by using of real data with overdispersion and many zeros. The results indicate that the Zero-Inflated Negative Binomial Regression Model performs the best for model evaluation.

Categorical data analysis of sensory evaluation data with Hanwoo bull beef (한우 수소 고기 관능평가 데이터에 대한 범주형 자료 분석)

  • Lee, Hye-Jung;Cho, Soo-Hyun;Kim, Jae-Hee
    • Journal of the Korean Data and Information Science Society
    • /
    • v.20 no.5
    • /
    • pp.819-827
    • /
    • 2009
  • This study was conducted to investigate the relationship between the sociodemographic factors and the Korean consumers palatability evaluation grades with Hanwoo sensory evaluation data. The dichotomy logistic regression model and the multinomial logistic regression model are fitted with the independent variables such as the consumer living location, age, gender, occupation, monthly income, and beef cut and the the palatability grade as the dependent variable. Stepwise variable selection procedure is incorporated to find the final model and odds ratios are calculated to find the associations between categories.

  • PDF

A Re-evaluation of Housing Wealth Effect in Korea (한국의 주택 부 효과에 대한 재고찰)

  • Kim, Jangryoul;Lee, Hangyong
    • KDI Journal of Economic Policy
    • /
    • v.30 no.2
    • /
    • pp.1-26
    • /
    • 2008
  • This paper attempts to re-evaluate the size of housing wealth effect in Korea. Our focus is on the size of 'genuine' housing wealth effect, i.e., the response of consumption spending by home-owners to the changes in housing wealth. Two issues show up while we estimate the 'genuine' wealth effects using aggregate time series data: the issues around home ownership and proper measure of consumption. We first argue that it is more appropriate to use non-housing consumption, because housing consumption is in large part not of the choice of home owners but the imputed rents they do not actually choose to pay. We then proceed to address the issue of home ownership, by examining how much to revise the estimates of housing wealth effect obtained from aggregate non-housing consumption data. We construct two structural models and estimate the share of home-owners' consumption in those models' context. It is found that, if properly revised in light of the estimated consumption shares of home-owners, the magnitude of resulting housing wealth effects are larger than what simple time series regressions imply.

  • PDF

Characteristics of Geometric Conditions Affecting Freeway Traffic Safety at Nighttime, Sunrise, and Sunset (야간 및 일출몰 시간대 교통안전에 영향을 미치는 고속도로 기하구조 특성분석)

  • Hong, Sung-Min;Kim, Joon-Ki;Oh, Cheol
    • Journal of Korean Society of Transportation
    • /
    • v.30 no.4
    • /
    • pp.95-106
    • /
    • 2012
  • Driver's capability of identifying the change in freeway alignments and environments is one of important factors associated with traffic safety on freeways. In particular, driver's visibility and recognition capability are highly dependent on the altitude of the sun by sunset, sunrise, and nighttime. The purpose of this study is to identify the characteristics of geometric conditions affecting crash occurrences at sunset, sunrise, and nighttime. Poisson and negative binomial regressions were adopted to predict freeway crash frequency in this study. Freeway crash data during 2007~2010 were used for developing the crash frequency models. A set of variables representing the characteristics of geometric conditions were identified as significant ones affecting crash occurrences. The results of this study would be useful in deriving effective countermeasures for preventing traffic crashes that mainly occur at sunset, sunrise, and nighttime on freeways.

Effects of Salespersons' Appreciative Inquiry and Emotional Labor on Adaptive Selling Behavior and Customer Satisfaction (영업사원의 긍정 탐색 수용도와 감정노동이 적응적 판매행동 및 고객만족에 미치는 영향)

  • Lee, Hang;Kim, Joon-Hwan
    • Journal of Digital Convergence
    • /
    • v.16 no.8
    • /
    • pp.151-159
    • /
    • 2018
  • This study focused on appreciative inquiry(AI) of salespeople who have to respond to various types of emotions according to the desires of individual customers at service contact points and the effect of emotional labor on adaptive selling behavior and customer satisfaction. Dyadic questionnaires were administerd to 115 automobile salespeople and 2 customers who received service from each salesperson, and the collected data was analyzed by using structural equation modeling. The results showed that AI had positive influences on deep acting and surface acting. Only deep acting was found to have positive relationship with adaptive selling behavior, but not to surface acting. Adaptive selling behavior had a positive effect on customer satisfaction. This study will contribute to identifying the need for AI access for salespersons and for activating adaptive selling behavior through emotional labor related to AI practice.

Estimation of Asymmetric Bell Shaped Probability Curve using Logistic Regression (로지스틱 회귀모형을 이용한 비대칭 종형 확률곡선의 추정)

  • 박성현;김기호;이소형
    • The Korean Journal of Applied Statistics
    • /
    • v.14 no.1
    • /
    • pp.71-80
    • /
    • 2001
  • Logistic regression model is one of the most popular linear models for a binary response variable and used for the estimation of probability function. In many practical situations, the probability function can be expressed by a bell shaped curve and such a function can be estimated by a second order logistic regression model. However, when the probability curve is asymmetric, the estimation results using a second order logistic regression model may not be precise because a second order logistic regression model is a symmetric function. In addition, even if a second order logistic regression model is used, the interpretation for the effect of second order term may not be easy. In this paper, in order to alleviate such problems, an estimation method for asymmetric probabiity curve based on a first order logistic regression model and iterative bi-section method is proposed and its performance is compared with that of a second order logistic regression model by a simulation study.

  • PDF

Physiological Responses of Roadside Trees by Regional Groups and Species in Daejeon City (대전광역시 가로수의 지역별·수종별 생리적 반응)

  • Kim, Dong Il;Park, Gwan Soo;Kim, Gil Nam;Lee, Hang Goo;Park, Beom Hwan
    • Journal of Korean Society of Forest Science
    • /
    • v.100 no.1
    • /
    • pp.88-94
    • /
    • 2011
  • This study was conducted to provide on a basic information for maintenance of roadside trees and identify the links between environmental pollution and tree damages (Ginkgo biloba, Platanus occidentalis) in Daejeon City by analyzing photosynthetic characteristics, water use efficiency, chlorophyll fluorescence and chlorophyll contents by the regions and plants. The investigations were conducted at Chungnam national university(CNU) considered as the least air-polluted area compared to other study sites, Daedeok science town(DS) which a new road with little traffic recently was built around, Daehwa industry complex(DIC) and Daejeon station(DS) which an old road with heavy is located around. The photosynthetic capacity of the two species were highest in CNU and lowest in the DS. However the water use efficiency was highest on the contrary to the photosynthetic capacity in DS. Chlorophyll fluorescence and chlorophyll contents were highest in CNU and lowest in DS as the photosynthetic capacity. On all of the test, Ginkgo biloba shows the more favorable physiological responses than Platanus occidentalis.

Effects of Financial College Tuition Support by Korean Parents using a Hierarchical Bayes Model (계층적 베이즈 모형을 이용한 대학등록금에 대한 부모님의 경제적 지원 영향 분석)

  • Oh, Man-Suk;Oh, Hyun Sook;Oh, Min Jung
    • The Korean Journal of Applied Statistics
    • /
    • v.26 no.2
    • /
    • pp.267-280
    • /
    • 2013
  • College tuition is a significant economic, social, and political issue in Korea. We conduct a Bayesian analysis of a hierarchical model to address the factors related to college tuition based on a survey data collected by Statistics Korea. A binary response variable is selected depending on if more than 70% of tuition costs are supported by parents, and a hierarchical Probit model is constructed with areas as groups. A set of explanatory variables is selected from a factor analysis of available variables in the survey. A Markov chain Monte Carlo algorithm is used to estimate parameters. From the analysis results, income and stress are significantly related to college tuition support from parents. Parents with high income tend to support children's college tuition and students with parents' financial support tend to be mentally less stressed; subsequently, this shows that the economic status of parents significantly affects the mental health of college students. Gender, a healthy life style, and college satisfaction are not significant factors. Comparing areas in terms of the degrees of correlation between stress/income and tuition support from parents, students in Kangwon-do are the most mentally stressed when parents' support is limited; in addition, the positive correlation between parents support and income is stronger in big cities compared to provincial areas.