• Title/Summary/Keyword: Bootstrap sampling

Search Result 50, Processing Time 0.02 seconds

A Note on Bootstrapping M-estimators in TAR Models

  • Kim, Sahmyeong
    • Communications for Statistical Applications and Methods
    • /
    • v.7 no.3
    • /
    • pp.837-843
    • /
    • 2000
  • Kreiss and Franke(192) and Allen and Datta(1999) proposed bootstrapping the M-estimators in ARMA models. In this paper, we introduce the robust estimating function and investigate the bootstrap approximations of the M-estimators which are solutions of the estimating equations in TAR models. A number of simulation results are presented to estimate the sampling distribution of the M-estimators, and asymptotic validity of the bootstrap for the M-estimators is established.

  • PDF

Streamflow Generation by Boostrap Method and Skewness (Bootstrap 방법에 의한 하천유출량 모의와 왜곡도)

  • Kim, Byung-Sik;Kim, Hung-Soo;Seoh, Byung-Ha
    • Journal of Korea Water Resources Association
    • /
    • v.35 no.3
    • /
    • pp.275-284
    • /
    • 2002
  • In this study, a method of random resampling of residuals from stochastic models such as the Monte-Carlo model, the lag-one autoregressive model(AR(1)) and the periodic lag-one autoregressive model(PAR(1)), has been adopted to generate a large number of long traces of annual and monthly steamflows. Main advantage of this resampling scheme called the Bootstrap method is that it does not rely on the assumption of population distribution. The Bootstrap is a method for estimating the statistical distribution by resampling the data. When the data are a random sample from a distribution, the Bootstrap method can be implemented (among other ways) by sampling the data randomly with replacement. This procedure has been applied to the Yongdam site to check the performance of Bootstrap method for the streamflow generation. and then the statistics between the historical and generated streamflows have been computed and compared. It has been shown that both the conventional and Bootstrap methods for the generation reproduce fairly well the mean, standard deviation, and serial correlation, but the Bootstrap technique reproduces the skewness better than the conventional ones. Thus, it has been noted that the Bootstrap method might be more appropriate for the preservation of skewness.

Frequency Analysis Using Bootstrap Method and SIR Algorithm for Prevention of Natural Disasters (풍수해 대응을 위한 Bootstrap방법과 SIR알고리즘 빈도해석 적용)

  • Kim, Yonsoo;Kim, Taegyun;Kim, Hung Soo;Noh, Huisung;Jang, Daewon
    • Journal of Wetlands Research
    • /
    • v.20 no.2
    • /
    • pp.105-115
    • /
    • 2018
  • The frequency analysis of hydrometeorological data is one of the most important factors in response to natural disaster damage, and design standards for a disaster prevention facilities. In case of frequency analysis of hydrometeorological data, it assumes that observation data have statistical stationarity, and a parametric method considering the parameter of probability distribution is applied. For a parametric method, it is necessary to sufficiently collect reliable data; however, snowfall observations are needed to compensate for insufficient data in Korea, because of reducing the number of days for snowfall observations and mean maximum daily snowfall depth due to climate change. In this study, we conducted the frequency analysis for snowfall using the Bootstrap method and SIR algorithm which are the resampling methods that can overcome the problems of insufficient data. For the 58 meteorological stations distributed evenly in Korea, the probability of snowfall depth was estimated by non-parametric frequency analysis using the maximum daily snowfall depth data. The results of frequency based snowfall depth show that most stations representing the rate of change were found to be consistent in both parametric and non-parametric frequency analysis. According to the results, observed data and Bootstrap method showed a difference of -19.2% to 3.9%, and the Bootstrap method and SIR(Sampling Importance Resampling) algorithm showed a difference of -7.7 to 137.8%. This study shows that the resampling methods can do the frequency analysis of the snowfall depth that has insufficient observed samples, which can be applied to interpretation of other natural disasters such as summer typhoons with seasonal characteristics.

Rainfall Frequency Analysis Using SIR Algorithm and Bootstrap Methods (극한강우를 고려한 SIR알고리즘과 Bootstrap을 활용한 강우빈도해석)

  • Moon, Ki Ho;Kyoung, Min Soo;Kim, Hung Soo
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.30 no.4B
    • /
    • pp.367-377
    • /
    • 2010
  • In this study, we considered annual maximum rainfall data from 56 weather stations for rainfall frequency analysis using SIR(Sampling Important Resampling) algorithm and Bootstrap method. SIR algorithm is resampling method considering weight in extreme rainfall sample and Bootstrap method is resampling method without considering weight in rainfall sample. Therefore we can consider the difference between SIR and Bootstrap method may be due to the climate change. After the frequency analysis, we compared the results. Then we derived the results which the frequency based rainfall obtained using the data from SIR algorithm has the values of -10%~60% of the rainfall obtained using the data from Bootstrap method.

A Computer Intensive Method for Modern Statistical Data Analysis I ; Bootststrap Method and Its Applications (통계적 데이터 분석방법을 위한 컴퓨터의 활용 I : 붓스트랩 이론과 응용+)

  • 전명식
    • The Korean Journal of Applied Statistics
    • /
    • v.3 no.1
    • /
    • pp.121-141
    • /
    • 1990
  • Computer intensive bootstrap methods are studied as a tool of statistics. Practical calculation and theoretical justification problem of the methods in estimating the sampling distribution and construction confidence region of parameters are discussed through several examples. Statistical meaning of the methods are also considered.

  • PDF

Application of Bootstrap Method for Change Point Test based on Kernel Density Estimator

  • Kim, Dae-Hak
    • Journal of the Korean Data and Information Science Society
    • /
    • v.15 no.1
    • /
    • pp.107-117
    • /
    • 2004
  • Change point testing problem is considered. Kernel density estimators are used for constructing proposed change point test statistics. The proposed method can be used to the hypothesis testing of not only parameter change but also distributional change. Bootstrap method is applied to get the sampling distribution of proposed test statistic. Small sample Monte Carlo Simulation were also conducted in order to show the performance of proposed method.

  • PDF

How Many SNPs Should Be Used for the Human Phylogeny of Highly Related Ethnicities? A Case of Pan Asian 63 Ethnicities

  • Ghang, Ho-Young;Han, Young-Joo;Jeong, Sang-Jin;Bhak, Jong;Lee, Sung-Hoon;Kim, Tae-Hyung;Kim, Chul-Hong;Kim, Sang-Soo;Al-Mulla, Fahd;Youn, Chan-Hyun;Yoo, Hyang-Sook;The HUGO Pan-Asian SNP Consortium, The HUGO Pan-Asian SNP Consortium
    • Genomics & Informatics
    • /
    • v.9 no.4
    • /
    • pp.181-188
    • /
    • 2011
  • In planning a model-based phylogenic study for highly related ethnic data, the SNP marker number is an important factor to determine for relationship inferences. Genotype frequency data, utilizing a sub sampling method, from 63 Pan Asian ethnic groups was used for determining the minimum SNP number required to establish such relationships. Bootstrap random sub-samplings were done from 5.6K PASNPi SNP data. DA distance was calculated and neighbour-joining trees were drawn with every re-sampling data set. Consensus trees were made with the same 100 sub-samples and bootstrap proportions were calculated. The tree consistency to the one obtained from the whole marker set, improved with increasing marker numbers. The bootstrap proportions became reliable when more than 7,000 SNPs were used at a time. Within highly related ethnic groups, the minimum SNPs number for a robust neighbor-joining tree inference was about 7,000 for a 95% bootstrap support.

Improving an Ensemble Model by Optimizing Bootstrap Sampling (부트스트랩 샘플링 최적화를 통한 앙상블 모형의 성능 개선)

  • Min, Sung-Hwan
    • Journal of Internet Computing and Services
    • /
    • v.17 no.2
    • /
    • pp.49-57
    • /
    • 2016
  • Ensemble classification involves combining multiple classifiers to obtain more accurate predictions than those obtained using individual models. Ensemble learning techniques are known to be very useful for improving prediction accuracy. Bagging is one of the most popular ensemble learning techniques. Bagging has been known to be successful in increasing the accuracy of prediction of the individual classifiers. Bagging draws bootstrap samples from the training sample, applies the classifier to each bootstrap sample, and then combines the predictions of these classifiers to get the final classification result. Bootstrap samples are simple random samples selected from the original training data, so not all bootstrap samples are equally informative, due to the randomness. In this study, we proposed a new method for improving the performance of the standard bagging ensemble by optimizing bootstrap samples. A genetic algorithm is used to optimize bootstrap samples of the ensemble for improving prediction accuracy of the ensemble model. The proposed model is applied to a bankruptcy prediction problem using a real dataset from Korean companies. The experimental results showed the effectiveness of the proposed model.

Bootstrap Variance Estimation for Calibration Estimators in Stratified Sampling (층화 추출에서 보정추정량에 대한 붓스트랩 분산 추정)

  • 염준근;정영미
    • Proceedings of the Korean Association for Survey Research Conference
    • /
    • 2001.11a
    • /
    • pp.77-85
    • /
    • 2001
  • In this paper we study the calibration estimator and its variance estimator for the population total using a bootstrap method according to the levels of an auxiliary information having strong correlation with an interested variable in nonresponse situation. At this point, we find tire calibration estimator in case of auxiliary information for population and sample, and then we drive the bootstrap variance estimator of it. By simulation study we compare the efficiencies with the Taylor and Jackknife variance estimators.

  • PDF