• Title/Summary/Keyword: 확률표본

Search Result 469, Processing Time 0.027 seconds

Study on Optimal Sample Size for Bivariate Frequency Anlaysis using POT (POT 방법을 이용한 이변량 빈도해석 적정 표본크기 연구)

  • Joo, Kyungwon;Joo, Kyungwon;Joo, Kyungwon;Heo, Jun-Haeng
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2015.05a
    • /
    • pp.38-38
    • /
    • 2015
  • 최근 다변량 확률모형을 이용한 빈도해석이 여러 수문분야에 걸쳐 연구되고 있다. 기존 일변량 빈도해석에 비해 변수활용에 대한 자유도와 물리적 현상을 정확하게 표현할 수 있다는 장점이 있으나, 표본자료의 부족, 매개변수 추정 및 적합도 검정 등의 어려움으로 실제 분야에 사용되기 어려운 점이 있다. 본 연구에서는 copula 모형에 대하여 Cramer-von Mises(CVM) 적합도 검정 시 표본자료의 적정 크기를 결정하기 위하여 Peaks-Over-Threshold(POT) 방법을 이용하였다. 서울지점의 기상청 시강우 자료를 이용하여 빈도해석을 수행하였으며, Gumbel copula 모형에 대하여 매개변수 추정은 maximum pseudolikelihood method(MPL) 방법을 이용하였다. 50년의 기록 자료에 대하여 표본크기를 50개부터 2500개까지 조절하여 CVM 통계값과 p-value를 기준으로 적정 표본크기를 산정하였다.

  • PDF

A Stratified Multi-proportions Randomized Response Model (층화 다지 확률화응답모형)

  • Lee, Gi-Sung;Park, Kyung-Soon
    • The Korean Journal of Applied Statistics
    • /
    • v.28 no.6
    • /
    • pp.1113-1120
    • /
    • 2015
  • We propose a multi-proportions randomized response model by stratified simple random sampling for surveys of sensitive issues of a polychotomous population composed of several stratum. We also systemize a theoretical validity to apply multi-proportions randomized response model (Abul-Ela et al.' model, Eriksson's model) to stratified simple random sampling and derive the estimate and its dispersion matrix of the proportion of sensitive characteristic of population using the suggested model. Two types of sample allocations (proportional allocation and optimum allocation) are considered under the fixed cost. In efficiency, the Eriksson's model by stratified sampling are compared to the Abul-Ela et al.' model.

Methodology for Internet Survey: Case Study (인터넷을 활용한 표본조사 방법에 관한 사례연구)

  • 윤은성;김영원
    • Survey Research
    • /
    • v.3 no.1
    • /
    • pp.25-51
    • /
    • 2002
  • We examine the response patterns to web survey with a series of experiments embedded in a survey of students at the Sookmyung Women's University. A sample of 960 students was sent e-mail invitation to participate in a internet survey, The response rate was 53.9% except partial and overall non-response. Methodological experiments included the use of a pre-notification, reminder notices as well as the type of questionnaire. These factors that manipulate the perceived burden of the task had an effect on the likelihood of accepting the survey invitation. This paper discusses the overall implementation and outcome of the survey, and describes the results of the imbedded design experiments. Also we compared the representative of self-selected and probability sample.

  • PDF

Bayesian Variable Selection in Linear Regression Models with Inequality Constraints on the Coefficients (제한조건이 있는 선형회귀 모형에서의 베이지안 변수선택)

  • 오만숙
    • The Korean Journal of Applied Statistics
    • /
    • v.15 no.1
    • /
    • pp.73-84
    • /
    • 2002
  • Linear regression models with inequality constraints on the coefficients are frequently used in economic models due to sign or order constraints on the coefficients. In this paper, we propose a Bayesian approach to selecting significant explanatory variables in linear regression models with inequality constraints on the coefficients. Bayesian variable selection requires computation of posterior probability of each candidate model. We propose a method which computes all the necessary posterior model probabilities simultaneously. In specific, we obtain posterior samples form the most general model via Gibbs sampling algorithm (Gelfand and Smith, 1990) and compute the posterior probabilities by using the samples. A real example is given to illustrate the method.

Detection Schemes Based on Local Optimality and Sequential Criterion: 2. Performance Analysis (국소 최적성과 순차 기준을 바탕으로 한 검파 기법: 2. 성능 분석)

  • Choi Sang Won;Kang Hyun Gu;Lee Jumi;Park So Ryoung;Kim Sun Yong;Song Iickho
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.30 no.10C
    • /
    • pp.1027-1035
    • /
    • 2005
  • In this paper, the performance of the sequential detection scheme proposed in Part 1 is compared with that of the fixed sample size (FSS) test, sequential probability ratio test (SPRT), and truncated sequential probability ratio test (TSPRT). The proposed sequential detection scheme requires less complexity and, in most cases, smaller sample size than the SPRT. It is also observed that the proposed sequential detection scheme has always lower complexity and smaller sample size than the FSS test and TSPRT.

A Study on the Adjustment of Posterior Probability for Oversampling when the Target is Rare (목표 범주가 희귀한 자료의 과대표본추출에 대한 연구)

  • Kim, U.N.;Lee, S.K.;Choi, J.H.
    • The Korean Journal of Applied Statistics
    • /
    • v.24 no.3
    • /
    • pp.477-484
    • /
    • 2011
  • When an event of target variable is rare, a widespread strategy is to build a model on the sample that disproportionally over-represents the events, that is over-sampled. Using the data over-sampled from the original data set, the predicted values would be biased; however, it can be easily corrected to represent the population. In this study, we investigate into the relationship between the proportion of rare event on a data-mart and the model performance using real world data of a Korean credit card company. Also, we use the methods for adjusting of posterior probability for over-sampled data of the offset method and the weighted method. Finally, we compare the performance of the methods using real data sets.

자본자산가격의 운동법칙을 표상하는 연속시간 확률매분방정식의 추정방법 - 비시뮬레이션 방법 -

  • Lee, Il-Gyun
    • The Korean Journal of Financial Studies
    • /
    • v.10 no.1
    • /
    • pp.1-44
    • /
    • 2004
  • 연속시간모형은 시간의 흐름에 대응되는 자본자산의 운동의 성질과 시간의 흐름에 따라 형성되는 자본자산의 가격을 동시적으로 파악할 수 있는 것이 큰 장점이다. 연속시간 확률미분방정식을 구성하는 표류함수와 확산함수가 폐형해나 해석적 형태로 존재하지 않는 경우가 대부분이다. 여기에서 모수추정의 어려움이 발생한다. 전이 확률밀도함수의 인지 또는 발견의 어려움과 표류함수와 확산함수의 적분 불가능성은 최대가능도법의 사용을 어렵게 만든다. 여기에서 모수방법 보다는 비모수방법을 통하여 연속 확률 미분방정식을 추정하려는 성향이 존재한다. 밀도를 모르면 표본적률을 사용하여 모수를 추정할 수 있으므로 일반화 적률법이 연속시간 확률미분방정식의 모수 추정과 검정에 사용되고 있다. 전이밀도의 값을 시뮬레이션을 통하여 얻는 마코브연쇄 몬테카를로 방법, 전이밀도를 무한소 생성작용소를 통하여 얻는 방법, 비 모수방법, 여러 종류의 전개에 의하여 얻은 표류함수와 확산함수의 전이밀도에 대한 최대가능도법 등 여러 종류의 연속시간 확률미분방정식의 실증분석에서 사용되고 있다. 이 논문에서는 연속시간 확률미분방정식의 실증분석 방법들을 정리하는데 목적이 있다. 이일균(2004)은 이 논문과의 자매논문으로 시뮬레이션에 의한 확률미분방정식의 추정을 다루고 있어 시뮬레이션방법은 그 논문에 미룬다.

  • PDF

Rainfall Frequency Analysis Using SIR Algorithm and Bootstrap Methods (극한강우를 고려한 SIR알고리즘과 Bootstrap을 활용한 강우빈도해석)

  • Moon, Ki Ho;Kyoung, Min Soo;Kim, Hung Soo
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.30 no.4B
    • /
    • pp.367-377
    • /
    • 2010
  • In this study, we considered annual maximum rainfall data from 56 weather stations for rainfall frequency analysis using SIR(Sampling Important Resampling) algorithm and Bootstrap method. SIR algorithm is resampling method considering weight in extreme rainfall sample and Bootstrap method is resampling method without considering weight in rainfall sample. Therefore we can consider the difference between SIR and Bootstrap method may be due to the climate change. After the frequency analysis, we compared the results. Then we derived the results which the frequency based rainfall obtained using the data from SIR algorithm has the values of -10%~60% of the rainfall obtained using the data from Bootstrap method.

Comparision of two samples and the role of randomization (두 표본의 비교와 확률화)

  • 허명회
    • The Korean Journal of Applied Statistics
    • /
    • v.1 no.2
    • /
    • pp.61-65
    • /
    • 1987
  • Randomization is one of the principles that should be adopted in comparative experiments. Randomization is well known as a useful tool for averaging out the effects of external factors. It also validates statistical inference based on mathematical model. This teaching meterial is designed for the purpose of illustrating the role of randomization.

Study on Teachers' Understanding on Generating Random Number in Monte Carlo Simulation (몬테카를로 시뮬레이션의 난수 생성에 관한 교사들의 이해에 관한 연구)

  • Heo, Nam Gu;Kang, Hyangim
    • School Mathematics
    • /
    • v.17 no.2
    • /
    • pp.241-255
    • /
    • 2015
  • The purpose of this study is to analyze teachers' understanding on generating random number in Monte Carlo simulation and to provide educational implications in school practice. The results showed that the 70% of the teachers selected wrong ideas from three types for random-number as strategies for problem solving a probability problem and also they make some errors to justify their opinion. The first kind of the errors was that the probability of a point or boundary was equal to the value of the probability density function in the continuous probability distribution. The second kind of the errors was that the teachers failed to recognize that the sample space has been changed by conditional probability. The third kind of the errors was that when two random variables X, Y are independence of each other, then only, joint probability distribution is satisfied $P(X=x,\;Y=y)=p(X=x){\times}P(Y=y{\mid}X=x)$.