• Title/Summary/Keyword: outlier adjustment

Search Result 15, Processing Time 0.021 seconds

Multiple Imputation Reducing Outlier Effect using Weight Adjustment Methods (가중치 보정을 이용한 다중대체법)

  • Kim, Jin-Young;Shin, Key-Il
    • The Korean Journal of Applied Statistics
    • /
    • v.26 no.4
    • /
    • pp.635-647
    • /
    • 2013
  • Imputation is a commonly used method to handle missing survey data. The performance of the imputation method is influenced by various factors, especially an outlier. The removal of the outlier in a data set is a simple and effective approach to reduce the effect of an outlier. In this paper in order to improve the precision of multiple imputation, we study a imputation method which reduces the effect of outlier using various weight adjustment methods that include the removal of an outlier method. The regression method in PROC/MI in SAS is used for multiple imputation and the obtained final adjusted weight is used as a weight variable to obtain the imputed values. Simulation studies compared the performance of various weight adjustment methods and Monthly Labor Statistic data is used for real data analysis.

A Multiple Imputation for Reducing Outlier Effect (이상점 영향력 축소를 통한 무응답 대체법)

  • Kim, Man-Gyeom;Shin, Key-Il
    • The Korean Journal of Applied Statistics
    • /
    • v.27 no.7
    • /
    • pp.1229-1241
    • /
    • 2014
  • Most of sampling surveys have outliers and non-response missing values simultaneously. In that case, due to the effect of outliers, the result of imputation is not good enough to meet a given precision. To overcome this situation, outlier treatment should be conducted before imputation. In this paper in order for reducing the effect of outlier, we study outlier imputation methods and outlier weight adjustment methods. For the outlier detection, the method suggested by She and Owen (2011) is used. A small simulation study is conducted and for real data analysis, Monthly Labor Statistic and Briquette Consumption Survey Data are used.

Adjustment System for Outlier and Missing Value using Data Storage (데이터 저장소를 이용한 이상치 및 결측치 보정 시스템)

  • Gwangho Kim;Neunghoe Kim
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.23 no.5
    • /
    • pp.47-53
    • /
    • 2023
  • With the advent of the 4th Industrial Revolution, diverse and a large amount of data has been accumulated now. The agricultural community has also collected environmental data that affects the growth of crops in smart farms or open fields with sensors. Environmental data has different features depending on where and when they are measured. Studies have been conducted using collected agricultural data to predict growth and yield with statistics and artificial intelligence. The results of these studies vary greatly depending on the data on which they are based. So, studies to enhance data quality have also been continuously conducted for performance improvement. A lot of data is required for high performance, but if there are outlier or missing values in the data, it can greatly affect the results even if the amount is sufficient. So, adjustment of outlier and missing values is essential in the data preprocessing. Therefore, this paper integrates data collected from actual farms and proposes a adjustment system for outlier and missing values based on it.

An outlier weight adjustment using generalized ratio-cum-product method for two phase sampling (이중추출법에서 일반화 ratio-cum-product 방법을 이용한 이상점 가중치 보정법)

  • Oh, Jung-Taek;Shin, Key-Il
    • The Korean Journal of Applied Statistics
    • /
    • v.29 no.7
    • /
    • pp.1185-1199
    • /
    • 2016
  • Two phase sampling (double sampling) is often used when there is inadequate population information for proper stratification. Many recent papers have been devoted to the estimation method to improve the precision of the estimator using first phase information. In this study we suggested outlier weight adjustment methods to improve estimation precision based on the weight of the generalized ratio-cum-product estimator. Small simulation studies are conducted to compare the suggested methods and the usual method. Real data analysis is also performed.

Modified BLS Weight Adjustment (수정된 BLS 가중치보정법)

  • Park, Jung-Joon;Cho, Ki-Jong;Lee, Sang-Eun;Shin, Key-Il
    • Communications for Statistical Applications and Methods
    • /
    • v.18 no.3
    • /
    • pp.367-376
    • /
    • 2011
  • BLS weight adjustment is a widely used method for business surveys with non-responses and outliers. Recent surveys show that the non-response weight adjustment of the BLS method is the same as the ratio imputation method. In this paper, we suggested a modified BLS weight adjustment method by imputing missing values instead of using weight adjustment for non-response. Monthly labor survey data is used for a small Monte-Carlo simulation and we conclude that the suggested method is superior to the original BLS weight adjustment method.

A Study on the Sensitivity of the BLS Methods (BLS 보정 방법의 민감도에 관한 연구)

  • Lee, Seok-Jin;Shin, Key-Il
    • Communications for Statistical Applications and Methods
    • /
    • v.15 no.6
    • /
    • pp.843-858
    • /
    • 2008
  • BLS adjustment methods have been able to provide more accurate estimates of total and make samples represent population characteristics by post-adjustment of design weights of samples. However, BLS methods use additional data, for instance number of employee, without this information or using other information, give different weight adjustment factors. In this paper we studied the sensitivity of the variables used in BLS adjustment. The 2007 monthly labor survey data is used in analysis.

A Study on the Efficiency of the BLS Nonresponse Adjustment According to the Correlation and Sample Size (상관관계와 표본 크기에 따른 BLS 무응답 보정의 효율성 비교)

  • Kim, Seok;Shin, Key-Il
    • The Korean Journal of Applied Statistics
    • /
    • v.22 no.6
    • /
    • pp.1301-1313
    • /
    • 2009
  • Efficiency and sensitivity of BLS adjustment method have been studied and the method is known to provide more accurate estimate of total by using properly adjusted weights of samples. However, BLS methods provide different efficiencies according to the magnitudes of correlation coefficients and the sizes of samples in strata. In this paper we study the efficiency of the BLS adjustment according to the sample sizes and correlations in strata. For this study, 2007 monthly labor survey data is used.

Detection Power when outliers are present at or near the end of time series

  • Lee, Jong-Seon;An, Mi-Hye;Lee, Jae-Jun
    • Proceedings of the Korean Statistical Society Conference
    • /
    • 2003.10a
    • /
    • pp.281-283
    • /
    • 2003
  • 시계열 모형을 따르는 자료의 예측(Forecasting)이나 공정조정(Process Adjustment)의 경우, 자료의 마지막 부분에 발생한 이상치(Outlier)에 의해 크게 영향 받을 수 있다. 그러나 지금까지 제안된 이상치 탐지 방법은 주로 자료의 중간 부분에 발생한 이상치를 검출하는데 효율적이라고 알려져 왔다. 본 연구에서는 자료의 마지막 부분에 발생한 이상치에 대한 기존 탐지 방법의 검출력을 모의 실험을 통해 분석하였다 또한, 이를 개선할 수 있는 방안을 제시하고, 모의 실험을 통해 기존의 검출력과 비교하였다.

  • PDF

A Study on Outlier Adjustment for Multibeam Echosounder Data (다중빔 음향측심기 자료의 이상치 보정에 관한 연구)

  • Lee, Jung-Sook;Kim, Soo-Young;Lee, Yong-Kook;Shin, Dong-Wan;Jou, Hyeong-Tae;Kim, Han-Joon
    • The Sea:JOURNAL OF THE KOREAN SOCIETY OF OCEANOGRAPHY
    • /
    • v.6 no.1
    • /
    • pp.35-39
    • /
    • 2001
  • Multibeam echosounder data, collected to investigate seabed features and topography, are usually subject to outliers resulting from the ship's irregular movements and insufficient correction for pressure calibration to the positions of beams. We introduce a statistical method which adjusts the outliers using the ARMA (Autoregressive Moving Average) technique. Our method was applied to a set of real data acquired in the East Sea. In our approach, autocorrelation of the data is modeled by an AR (1) model. If an observation is substantially different from that obtained from the estimated AR (1) model, it is declared as an outlier and adjusted using the estimated AR (1) model. This procedure is repeated until no outlier is found. The result of processing shows that outliers that are far greater than signals in amplitude were successfully removed.

  • PDF

Travel Time Forecasting in an Interrupted Traffic Flow by adopting Historical Profile and Time-Space Data Fusion (히스토리컬 프로파일 구축과 시.공간 자료합성에 의한 단속류 통행시간 예측)

  • Yeo, Tae-Dong;Han, Gyeong-Su;Bae, Sang-Hun
    • Journal of Korean Society of Transportation
    • /
    • v.27 no.2
    • /
    • pp.133-144
    • /
    • 2009
  • In Korea, the ITS project has been progressed to improve traffic mobility and safety. Further, it is to relieve traffic jam by supply real time travel information for drivers and to promote traffic convenience and safety. It is important that the traffic information is provided accurately. This study was conducted outlier elimination and missing data adjustment to improve accuracy of raw data. A method for raise reliability of travel time prediction information was presented. We developed Historical Profile model and adjustment formula to reflect quality of interrupted flow. We predicted travel time by developed Historical Profile model and adjustment formula and verified by comparison between developed model and existing model such as Neural Network model and Kalman Filter model. The results of comparative analysis clarified that developed model and Karlman Filter model similarity predicted in general situation but developed model was more accurate than other models in incident situation.