• Title/Summary/Keyword: 불완전 자료

Search Result 139, Processing Time 0.031 seconds

Methods for Handling Incomplete Repeated Measures Data (불완전한 반복측정 자료의 보정방법)

  • Woo, Hae-Bong;Yoon, In-Jin
    • Survey Research
    • /
    • v.9 no.2
    • /
    • pp.1-27
    • /
    • 2008
  • Problems of incomplete data are pervasive in statistical analysis. In particular, incomplete data have been an important challenge in repeated measures studies. The objective of this study is to give a brief introduction to missing data mechanisms and conventional/recent missing data methods and to assess the performance of various missing data methods under ignorable and non-ignorable missingness mechanisms. Given the inadequate attention to longitudinal studies with missing data, this study applied recent advances in missing data methods to repeated measures models and investigated the performance of various missing data methods, such as FIML (Full Information Maximum Likelihood Estimation) and MICE(Multivariate Imputation by Chained Equations), under MCAR, MAR, and MNAR mechanisms. Overall, the results showed that listwise deletion and mean imputation performed poorly compared to other recommended missing data procedures. The better performance of EM, FIML, and MICE was more noticeable under MAR compared to MCAR. With the non-ignorable missing data, this study showed that missing data methods did not perform well. In particular, this problem was noticeable in slope-related estimates. Therefore, this study suggests that if missing data are suspected to be non-ignorable, developmental research may underestimate true rates of change over the life course. This study also suggests that bias from non-ignorable missing data can be substantially reduced by considering rich information from variables related to missingness.

  • PDF

Monte Carlo Random Permutation Tests for Incompletely Ranked Data (불완전 순위 자료를 위한 몬테칼로 임의순열 검정)

  • Huh, Myung-Hoe;Choi, Won
    • The Korean Journal of Applied Statistics
    • /
    • v.14 no.1
    • /
    • pp.191-199
    • /
    • 2001
  • 본 소고는 n명의 심사자가 k개의 객체를 평가하여 얻어진 불완전 순위자료에서 객체간 선호도에 있어 차이가 없다는 영가설을 검정하는 방법에 관한 연구이다. 주어진 자료에서 결측값들을 다중대체하는 방식을 제안하고 이들을 평균 p-값으로 묶는 몬테칼로방식의 임의순열 검정을 제안한다.

  • PDF

Nonparametric Approaches of Analyzing Randomly Incomplete Ranking Data (임의의 불완전 순위자료 분석을 위한 비모수적 방법)

  • 임동훈
    • The Korean Journal of Applied Statistics
    • /
    • v.13 no.1
    • /
    • pp.45-53
    • /
    • 2000
  • 본 논문에서는 모든 판정자(judge)들이 모든 객체(object)들에 대해 순위를 부여할 수 없는 경우에 얻어지는 불완전 순위자료에서 판정자들의 처리 효과에 대한 유의성을 검정하는데 관심이 있다. 이를 위해 불완전 순위자료를 완전자료로 바꾸는 알고리즘을 제안하고 알고리즘에 의해 얻어진 완전 순위자료에 Friedman 검정법을 적용하고자 한다. 제안된 검정법은 결측 객체에 순위를 부여하는데 있어서 완전순위를 갖는 판정자들의 정보를 이용함으로서 효율적이며 검정을 시행하는데 기존의 Friedman 통계량에 대한 분포표를 사용할 수 있어 간편하다. 그리고 몬테칼로 모의실험을 통하여 제안된 검정법과 기존의 평균 순위법, 최대/최소 Friedman 검정법과 검정력을 비교하였다.

  • PDF

Statistical analysis of recurrent gap time events with incomplete observation gaps (불완전한 관측틈을 가진 재발 사건 소요시간에 대한 자료 분석)

  • Shin, Seul Bi;Kim, Yang Jin
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.2
    • /
    • pp.327-336
    • /
    • 2014
  • Recurrent event data occurs when a subject experiences same type of event repeatedly and is found in various areas such as the social sciences, Economics, medicine and public health. To analyze recurrent event data either a total time or a gap time is adopted according to research interest. In this paper, we analyze recurrent event data with incomplete observation gap using a gap time scale. That is, some subjects leave temporarily from a study and return after a while. But it is not available when the observation gaps terminate. We adopt an interval censoring mechanism for estimating the termination time. Furthermore, to model the association among gap times of a subject, a frailty effect is incorporated into a model. Programs included in Survival package of R program are implemented to estimate the covariate effect as well as the variance of frailty effect. YTOP (Young Traffic Offenders Program) data is analyzed with both proportional hazard model and a weibull regression model.

Metropolis-Hastings Expectation Maximization Algorithm for Incomplete Data (불완전 자료에 대한 Metropolis-Hastings Expectation Maximization 알고리즘 연구)

  • Cheon, Soo-Young;Lee, Hee-Chan
    • The Korean Journal of Applied Statistics
    • /
    • v.25 no.1
    • /
    • pp.183-196
    • /
    • 2012
  • The inference for incomplete data such as missing data, truncated distribution and censored data is a phenomenon that occurs frequently in statistics. To solve this problem, Expectation Maximization(EM), Monte Carlo Expectation Maximization(MCEM) and Stochastic Expectation Maximization(SEM) algorithm have been used for a long time; however, they generally assume known distributions. In this paper, we propose the Metropolis-Hastings Expectation Maximization(MHEM) algorithm for unknown distributions. The performance of our proposed algorithm has been investigated on simulated and real dataset, KOSPI 200.

Detecting Influential Observations in Multivariate Statistical Analysis of Incomplete Data by PCA (주성분분석에 의한 결손 자료의 영향값 검출에 대한 연구)

  • 김현정;문승호;신재경
    • The Korean Journal of Applied Statistics
    • /
    • v.13 no.2
    • /
    • pp.383-392
    • /
    • 2000
  • Since late 1970, methods of influence or sensitivity analysis for detecting influential observations have been studied not only in regression and related methods but also in various multivariate methods. If results of multivariate analyses sometimes depend heavily on a small number of observations, we should be very careful to draw a conclusion. Similar phenomena may also occur in the case of incomplete data. In this research we try to study such influential observations in multivariate statistical analysis of incomplete data. Case of principal component analysis is studied with a numerical example.

  • PDF

An Efficient Test in a Randomly Incomplete Two-way (임의의 불완전 이원배치 순위계획법에서 효율적인 검정법)

  • 임동훈
    • The Korean Journal of Applied Statistics
    • /
    • v.12 no.1
    • /
    • pp.191-202
    • /
    • 1999
  • 본 논문에서는 n명의 judge들이 k개의 object들에 대해 순위를 부여하는 경우 얻어질수 있는 불완전 순위자료에서 object효과들이 같은지에 대한 검정법을 제안하고 컴퓨터 프로그램없이 쉽게 구현할수 있는 효과적인 알고리즘을 개발하였다. 그리고 기존의 평균순위법과 Monte Carlo 연구를 통하여 검정력을 비교하였다.

  • PDF

Estimation from Incomplete Data in Multivariate Distributions under Stochastic Ordering (확률적 순서를 갖는 다변량분포에서 불완전자료에 의한 추정)

  • Kwang Mo Jeoung
    • The Korean Journal of Applied Statistics
    • /
    • v.7 no.2
    • /
    • pp.145-157
    • /
    • 1994
  • For multivariate distributions satisfying stochastic ordering, we suggest maximum likelihood estimation with incomplete data via an EM algorithm. In this paper we restrict our attention to the contingency tables with partially cross-classified observations. We may use the existing isotonic regression program to implement EM algorithm, and we illustrate the estimation process through an example.

  • PDF

An Efficient Processing Method of Top-k(g) Skyline Group Queries for Incomplete Data (불완전 데이터를 위한 효율적 Top-k(g) 스카이라인 그룹 질의 처리 기법)

  • Park, Mi-Ra;Min, Jun-Ki
    • The KIPS Transactions:PartD
    • /
    • v.17D no.1
    • /
    • pp.17-24
    • /
    • 2010
  • Recently, there has been growing interest in skyline queries. Most of works for skyline queries assume that the data do not have null value. However, when we input data through the Web or with other different tools, there exist incomplete data with null values. As a result, several skyline processing techniques for incomplete data have been proposed. However, available skyline query techniques for incomplete data do not consider the environments that coexist complete data and incomplete data since these techniques deal with the incomplete data only. In this paper, we propose a novel skyline group processing technique which evaluates skyline queries for the environments that coexist complete data and incomplete data. To do this, we introduce the top-k(g) skyline group query which searches g skyline groups with respect to the user's dimensional preference. In our experimental study, we show efficiency of our proposed technique.

A Study of Generalized Maximum Entropy Estimator for the Panel Regression Model (패널회귀모형에서 최대엔트로피 추정량에 관한 연구)

  • Song, Seuck-Heun;Cheon, Soo-Young
    • The Korean Journal of Applied Statistics
    • /
    • v.19 no.3
    • /
    • pp.521-534
    • /
    • 2006
  • This paper considers a panel regression model with ill-posed data and proposes the generalized maximum entropy(GME) estimator of the unknown parameters. These are natural extensions from the biometries, statistics and econometrics literature. The performance of this estimator is investigated by using of Monte Carlo experiments. The results indicate that the GME method performs the best in estimating the unknown parameters.