• Title/Summary/Keyword: 대치법

Search Result 147, Processing Time 0.028 seconds

On the Use of Sequential Adaptive Nearest Neighbors for Missing Value Imputation (순차 적응 최근접 이웃을 활용한 결측값 대치법)

  • Park, So-Hyun;Bang, Sung-Wan;Jhun, Myoung-Shic
    • The Korean Journal of Applied Statistics
    • /
    • v.24 no.6
    • /
    • pp.1249-1257
    • /
    • 2011
  • In this paper, we propose a Sequential Adaptive Nearest Neighbor(SANN) imputation method that combines the Adaptive Nearest Neighbor(ANN) method and the Sequential k-Nearest Neighbor(SKNN) method. When choosing the nearest neighbors of missing observations, the proposed SANN method takes the local feature of the missing observations into account as well as reutilizes the imputed observations in a sequential manner. By using a Monte Carlo study and a real data example, we demonstrate the characteristics of the SANN method and its potential performance.

A Missing Data Imputation by Combining K Nearest Neighbor with Maximum Likelihood Estimation for Numerical Software Project Data (K-NN과 최대 우도 추정법을 결합한 소프트웨어 프로젝트 수치 데이터용 결측값 대치법)

  • Lee, Dong-Ho;Yoon, Kyung-A;Bae, Doo-Hwan
    • Journal of KIISE:Software and Applications
    • /
    • v.36 no.4
    • /
    • pp.273-282
    • /
    • 2009
  • Missing data is one of the common problems in building analysis or prediction models using software project data. Missing imputation methods are known to be more effective missing data handling method than deleting methods in small software project data. While K nearest neighbor imputation is a proper missing imputation method in the software project data, it cannot use non-missing information of incomplete project instances. In this paper, we propose an approach to missing data imputation for numerical software project data by combining K nearest neighbor and maximum likelihood estimation; we also extend the average absolute error measure by normalization for accurate evaluation. Our approach overcomes the limitation of K nearest neighbor imputation and outperforms on our real data sets.

On the use of weighted adaptive nearest neighbors for missing value imputation (가중 적응 최근접 이웃을 이용한 결측치 대치)

  • Yum, Yunjin;Kim, Dongjae
    • The Korean Journal of Applied Statistics
    • /
    • v.31 no.4
    • /
    • pp.507-516
    • /
    • 2018
  • Widely used among the various single imputation methods is k-nearest neighbors (KNN) imputation due to its robustness even when a parametric model such as multivariate normality is not satisfied. We propose a weighted adaptive nearest neighbors imputation method that combines the adaptive nearest neighbors imputation method that accounts for the local features of the data in the KNN imputation method and weighted k-nearest neighbors method that are less sensitive to extreme value or outlier among k-nearest neighbors. We conducted a Monte Carlo simulation study to compare the performance of the proposed imputation method with previous imputation methods.

A New Method for Imputation of Missing Genotype using Linkage Disequilibrium and Haplotype Information (결측치가 존재하는 유전형 자료에서의 연관불균형과 일배체형을 사용한 결측치 대치 방법)

  • Park Yun-Ju;Kim Young-Jin;Park Jung-Sun;Kim Kuchan;Koh Insong;Jung Ho-Youl
    • Journal of KIISE:Software and Applications
    • /
    • v.32 no.2
    • /
    • pp.99-107
    • /
    • 2005
  • In this paper, wc propose a now missing imputation method for minimizing loss of information linkage disequilibrium-based and haplotype-based imputation method, which estimate missing values of the data based on the specificity of Single Nucleotide Polymorphism(SNP) genotype data. Method for imputing data is needed to minimize the loss of information caused by experimental missing data. In general, missing imputation of biological data has used major allele imputation method. but this approach is not optima]. 1'his method has high error rates of missing values estimation since the characteristics of the genotype data are not considered not take into consideration the specific structure of the data. In this paper, we show the results of the comparative evaluation of our model methods and major imputation method for the estimation of missing values.

On the Use of Weighted k-Nearest Neighbors for Missing Value Imputation (Weighted k-Nearest Neighbors를 이용한 결측치 대치)

  • Lim, Chanhui;Kim, Dongjae
    • The Korean Journal of Applied Statistics
    • /
    • v.28 no.1
    • /
    • pp.23-31
    • /
    • 2015
  • A conventional missing value problem in the statistical analysis k-Nearest Neighbor(KNN) method are used for a simple imputation method. When one of the k-nearest neighbors is an extreme value or outlier, the KNN method can create a bias. In this paper, we propose a Weighted k-Nearest Neighbors(WKNN) imputation method that can supplement KNN's faults. A Monte-Carlo simulation study is also adapted to compare the WKNN method and KNN method using real data set.

A Comparison of Survival Distributions with Unequal Censoring Distributions (이질적인 중도절단분포 하에서 생존분포의 동일성 검정법 비교연구)

  • Song, Sujeong;Lee, Jae Won
    • The Korean Journal of Applied Statistics
    • /
    • v.27 no.1
    • /
    • pp.1-11
    • /
    • 2014
  • The Weighted Logrank test and its special case, Logrank test are widely used to compare survival distributions; however, these methods are inappropriate when the sample size is small or censoring distributions are not equal since they use test statistics from approximate distributions. A permutation test can be an alternative for small sample cases; however, this should be used only when censoring distributions are equal. To handle cases with small sample size and unequal censoring distributions, the permutation-imputation method was developed to compare two survival distributions. In this paper, approximate method, permutation method and permutation-imputation method were compared using a Logrank test and Prentice-Wilcoxon test for three or more survival distributions comparison.

Review of Parameter Estimation Procedure of Freund Bivariate Exponential Distribution (Freund 이변량 지수분포의 매개변수 추정과정 검토)

  • Park, Cheol-Soon;Yoo, Chul-Sang
    • Journal of Korea Water Resources Association
    • /
    • v.45 no.2
    • /
    • pp.191-201
    • /
    • 2012
  • This study reviewed the parameter estimation procedure of the Freund bivariate exponential distribution for the decision of the annual maximum rainfall event. The method of moments was reviewed first, whose results were compared with those from the method of maximum likelihood. Both methods were applied to the hourly rainfall data of the Seoul rain gauge station measured from 1961 to 2010 to select the annual maximum rainfall events, which were also compared each other. The results derived are as follows. First, when applying the method of moments for the parameter estimation, it was found necessary to consider the correlation coefficient between the two variables as well as the mean and variance. Second, the method of maximum likelihood was better to reproduce the mean, but the method of moments was better to reproduce the annual variation of the variance. Third, The annual maximum rainfall events derived were very similar in both cases. Among differently selected annual maximum rainfall events, those with the higher rainfall amount were selected by the method of maximum likelihood, but those with the higher rainfall intensity by the method of moments.

SOLAS를 이용한 결측자료의 다중대치법

  • Kim, Hyeon-Jeong;Mun, Seung-Ho;Sin, Jae-Gyeong
    • 한국데이터정보과학회:학술대회논문집
    • /
    • 2003.05a
    • /
    • pp.145-158
    • /
    • 2003
  • 불완전 데이터 즉, 결측값을 가지는 데이터를 분석할 경우 결측데이터에 대해서 어떠한 처리를 해야할 필요가 있다. 결측데이터에 대한 처리로서 주로 이용되어온 방법으로는 결측값을 포함한 관측값(case)을 제외하는 방법이었다. 이후 여러 방법들이 제안되어 EM알고리즘이나 회귀알고리즘에 의한 추정을 바탕으로 결측값에 대한 추정을 해서 그 추정값으로 결측값을 대치하는 방법을 사용할 수 있게되었다. 본 논문에서는 복수 개의 데이터세트를 생성해서 대치하는 다중대입 소프트인 SOLAS를 소개한다.

  • PDF

Seismic Response Estimation and System Identification of Test Steel Structure Using Approximate Nonlinear Filter (비선형 근사필터에 강구조시험체의 지진응답추정 및 동특성식별)

  • 배기환;김두영
    • Journal of the Earthquake Engineering Society of Korea
    • /
    • v.5 no.2
    • /
    • pp.67-72
    • /
    • 2001
  • 대상으로 하는 시스템의 입출력신호에 근거하여, 시스템의 수학적 모델을 결정하는 것을 총칭하여 시스템식별이라 한다. 본 논문에서는 지진응답 관측치를 입출력신호로 하여 조건부대치를 최적치로 판단하는 비선형근사필터법을 사용한 건축구조물의 지진응답추정 및 파라미터식별에 관하여 논한다. 비선형근사필터법에 의한 건축구조물식별의 유효성의 적용성을 판단하기 위해, 진동대를 사용하여 강구조시험체의 진동실험을 행하고 결과적으로 얻어진 시험체의 수학적 모델에 대한 지진응답 수치해석결과와 진동실험에서의 관측기록을 비교하여 본 식별법의 타당성을 보인다.

  • PDF