• Title/Summary/Keyword: 범주형 자료

Search Result 222, Processing Time 0.024 seconds

Effect of complex sample design on Pearson test statistic for homogeneity (복합표본자료에서 동질성검정을 위한 피어슨 검정통계량의 효과)

  • Heo, Sun-Yeong;Chung, Young-Ae
    • Journal of the Korean Data and Information Science Society
    • /
    • v.23 no.4
    • /
    • pp.757-764
    • /
    • 2012
  • This research is for comparison of test statistics for homogeneity when the data is collected based on complex sample design. The survey data based on complex sample design does not satisfy the condition of independency which is required for the standard Pearson multinomial-based chi-squared test. Today, lots of data sets ara collected by complex sample designs, but the tests for categorical data are conducted using the standard Pearson chi-squared test. In this study, we compared the performance of three test statistics for homogeneity between two populations using data from the 2009 customer satisfaction evaluation survey to the service from Gyeongsangnam-do regional offices of education: the standard Pearson test, the unbiasedWald test, and the Pearsontype test with survey-based point estimates. Through empirical analyses, we fist showed that the standard Pearson test inflates the values of test statistics very much and the results are not reliable. Second, in the comparison of Wald test and Pearson-type test, we find that the test results are affected by the number of categories, the mean and standard deviation of the eigenvalues of design matrix.

Bayesian ordinal probit semiparametric regression models: KNHANES 2016 data analysis of the relationship between smoking behavior and coffee intake (베이지안 순서형 프로빗 준모수 회귀 모형 : 국민건강영양조사 2016 자료를 통한 흡연양태와 커피섭취 간의 관계 분석)

  • Lee, Dasom;Lee, Eunji;Jo, Seogil;Choi, Taeryeon
    • The Korean Journal of Applied Statistics
    • /
    • v.33 no.1
    • /
    • pp.25-46
    • /
    • 2020
  • This paper presents ordinal probit semiparametric regression models using Bayesian Spectral Analysis Regression (BSAR) method. Ordinal probit regression is a way of modeling ordinal responses - usually more than two categories - by connecting the probability of falling into each category explained by a combination of available covariates using a probit (an inverse function of normal cumulative distribution function) link. The Bayesian probit model facilitates posterior sampling by bringing a latent variable following normal distribution, therefore, the responses are categorized by the cut-off points according to values of latent variables. In this paper, we extend the latent variable approach to a semiparametric model for the Bayesian ordinal probit regression with nonparametric functions using a spectral representation of Gaussian processes based BSAR method. The latent variable is decomposed into a parametric component and a nonparametric component with or without a shape constraint for modeling ordinal responses and predicting outcomes more flexibly. We illustrate the proposed methods with simulation studies in comparison with existing methods and real data analysis applied to a Korean National Health and Nutrition Examination Survey (KNHANES) 2016 for investigating nonparametric relationship between smoking behavior and coffee intake.

Processes of Voluntary Services Delivered by Korean Undergraduates: An Approach Based on the Grounded Theory (대학생의 자발적 봉사활동에 대한 질적 연구: 근거이론을 중심으로)

  • Hu, Sungho;Jung, Taeyun
    • Korean Journal of Culture and Social Issue
    • /
    • v.17 no.3
    • /
    • pp.287-304
    • /
    • 2011
  • The Purpose of this study is to understand phases and paradigms related to voluntary services offered by undergraduates and the processes in which voluntary services are implemented. For this, interviews for 23(men: 10, women: 13) undergraduates were conducted from Aug., 2008 to Apr., 2009 were conducted and the data collected from those interviews were analyzed on the basis of the Grounded Theory. Main analysis procedure is known as codings(open coding, axial coding, selective coding). This analyses produced 119 concepts, 41 subcategories, and 16 categories in open coding. Then, axial coding was conducted to organize the basic framework of generic relationships among psychological motivation, social context, personal perception, practical action, psychological response, and psychological consequence. Core essence is "Volunteer types are categorized simple practice type, self-serving type, and community type." Finally, undergraduate volunteers were explained in 3 types(simple practice, self-serving, and community) on the basis of paradigms. These results were discussed in terms of further research and limitation.

  • PDF

A Study for Recent Development of Generalized Linear Mixed Model (일반화된 선형 혼합 모형(GENERALIZED LINEAR MIXED MODEL: GLMM)에 관한 최근의 연구 동향)

  • 이준영
    • The Korean Journal of Applied Statistics
    • /
    • v.13 no.2
    • /
    • pp.541-562
    • /
    • 2000
  • The generalized linear mixed model framework is for handling count-type categorical data as well as for clustered or overdispersed non-Gaussian data, or for non-linear model data. In this study, we review its general formulation and estimation methods, based on quasi-likelihood and Monte-Carlo techniques. The current research areas and topics for further development are also mentioned.

  • PDF

An Empirical Study on the Measurement of Clustering and Trend Analysis among the Asian Container Ports Using the Variable Group Benchmarking and Categorical Variable Models (가변 그룹 벤치마킹 모형과 범주형 변수모형을 이용한 아시아 컨테이너항만의 클러스터링측정 및 추세분석에 관한 실증적 연구)

  • Park, Rokyung
    • Journal of Korea Port Economic Association
    • /
    • v.29 no.1
    • /
    • pp.143-175
    • /
    • 2013
  • The purpose of this paper is to show the clustering trend by using the variable group benchmarking(VGB) and categorical variable(CV) models for 38 Asian ports during 9 years(2001-2009) with 4 inputs(birth length, depth, total area, and number of crane) and 1 output(container TEU). The main empirical results of this paper are as follows. First, clustering results by using VGB show that Shanghai, Qingdao, and Ningbo ports took the core role for clustering. Second, CV analysis focusing on the container throughputs indicated that Singapore, Keelong, Dubai, and Kaosiung ports except Chinese ports are appeared as the center ports of clustering. Third, Aqaba, Dubai, Hongkong, Shanghai, Guangzhou, and Ningbo ports are recommended as the efficient ports for the target of clustering. Fourth, when the ports are classified by the regional location, Dubai, Khor Fakkan, Shanghai, Hongkong, Keelong, Ningbo, and Singapore ports are the core ports for clustering. On the whole, other ports located in Asia should be clustered to Dubai, Khor Fakkan, Shanghai, Hongkong, Ningbo, and Singapore ports. The policy implication of this paper is that Korean port policy planner should introduce the VGB model, and CV model for clustering among the international ports for enhancing the efficiency of inputs and outputs.

Assessment of predictability of categorical probabilistic long-term forecasts and its quantification for efficient water resources management (효율적인 수자원관리를 위한 범주형 확률장기예보의 예측력 평가 및 정량화)

  • Son, Chanyoung;Jeong, Yerim;Han, Soohee;Cho, Younghyun
    • Journal of Korea Water Resources Association
    • /
    • v.50 no.8
    • /
    • pp.563-577
    • /
    • 2017
  • As the uncertainty of precipitation increases due to climate change, seasonal forecasting and the use of weather forecasts become essential for efficient water resources management. In this study, the categorical probabilistic long-term forecasts implemented by KMA (Korea Meteorological Administration) since June 2014 was evaluated using assessment indicators of Hit Rate, Reliability Diagram, and Relative Operating Curve (ROC) and a technique for obtaining quantitative precipitation estimates based on probabilistic forecasts was proposed. The probabilistic long-term forecasts showed its maximum predictability of 48% and the quantified precipitation estimates were closely matched with actual observations; maximum correlation coefficient (R) in predictability evaluation for 100% accurate and actual weather forecasts were 0.98 and 0.71, respectively. A precipitation quantification approach utilizing probabilistic forecasts proposed in this study is expected to enable water management considering the uncertainty of precipitation. This method is also expected to be a useful tool for supporting decision-making in the long-term planning for water resources management and reservoir operations.

On Accumulation Analysis (누적법에 관한 연구)

  • 백운봉;이우선
    • The Korean Journal of Applied Statistics
    • /
    • v.12 no.1
    • /
    • pp.275-293
    • /
    • 1999
  • 다구찌의 누적법은 다구찌 품질공학에서 중요한 통계분석 방법이다. 그러나 이 방법이 복잡하고 비효율적일 뿐만 아니라 실험의 결과가 잘못 해석 될 수 있는 문제점을 가지고 이싿. 특히 순서 지어진 범주형에 관한 다요인(multificator) 실험에서는 이러한 가능성이 큰 것으로 지적되고 있다. 이에 대한 걱정과 비판이 Nair(1986) 그리고 Hamada and Wu(1990)에 의하여 심각하게 제기되어 왔다. 본 논문은 이러한 내용들을 정리하고 이들의 논란과 주장에 대한 평가와 이에대한 최선의 실천방안을 제안하고 있다. 아울러 실제 자료분석을 위하여 필요한 SAS/IML 프로그램을 제시하고 있다.

  • PDF

A Study on Causes of Industrial Accident Cases by a Categorical Analysis (범주형 분석에 의한 산업재해사례 요인의 고찰)

  • 지경택;송영호;정국삼
    • Proceedings of the Korean Institute of Industrial Safety Conference
    • /
    • 1998.11a
    • /
    • pp.199-204
    • /
    • 1998
  • 우리나라의 산업재해통계는 산업재해의 규모 및 원인 등의 분포상태와 근로자에 대한 특성 등을 파악하여 산업재해 예방정책 및 산업재해 보상 보험 운용 방침 수립의 기초 자료로 사용되고 있다. 그런데, 우리나라의 현행 산업재해 통계 산출 방법은 산업재해보험 가입 사업장의 재해자가 제출한 요양신청서 중 업무상 재해로 인정된 재해만을 대상으로 통계를 산출하는 것이다. (중략)

  • PDF

Categorical Data Analysis System in the Internet (인터넷상에서의 범주형 자료분석 시스템 개발)

  • Hong, Jong Seon;Kim, Dong Uk;O, Min Gwon
    • The Korean Journal of Applied Statistics
    • /
    • v.12 no.1
    • /
    • pp.81-81
    • /
    • 1999
  • A categorical data analysis system in the World Wide Web is proposed with an easy- to-use environment . This system is composed of four components. First, this system presents several graphical displays for Exploratory Data Analysis for categorical data. Second, it provides some measures of association Including dynamic graphics for mosaic plots of Hartigan and Kleiner (1981) and Friendly (1994). Dynamic graphics for mosaic plots give some useful informations. Third, this system can analyze categorical data with loglinear models. So we can select the best fitted loglinear model interactively.