• Title/Summary/Keyword: Generalized Pareto Distribution (GPD)

Search Result 19, Processing Time 0.024 seconds

A Bayesian Analysis of Return Level for Extreme Precipitation in Korea (한국지역 집중호우에 대한 반환주기의 베이지안 모형 분석)

  • Lee, Jeong Jin;Kim, Nam Hee;Kwon, Hye Ji;Kim, Yongku
    • The Korean Journal of Applied Statistics
    • /
    • v.27 no.6
    • /
    • pp.947-958
    • /
    • 2014
  • Understanding extreme precipitation events is very important for flood planning purposes. Especially, the r-year return level is a common measure of extreme events. In this paper, we present a spatial analysis of precipitation return level using hierarchical Bayesian modeling. For intensity, we model annual maximum daily precipitations and daily precipitation above a high threshold at 62 stations in Korea with generalized extreme value(GEV) and generalized Pareto distribution(GPD), respectively. The spatial dependence among return levels is incorporated to the model through a latent Gaussian process of the GEV and GPD model parameters. We apply the proposed model to precipitation data collected at 62 stations in Korea from 1973 to 2011.

Comparison of Methods of Selecting the Threshold of Partial Duration Series for GPD Model (GPD 모형 산정을 위한 부분시계열 자료의 임계값 산정방법 비교)

  • Um, Myoung-Jin;Cho, Won-Cheol;Heo, Jun-Haeng
    • Journal of Korea Water Resources Association
    • /
    • v.41 no.5
    • /
    • pp.527-544
    • /
    • 2008
  • Generalized Pareto distribution (GPD) is frequently applied in hydrologic extreme value analysis. The main objective of statistics of extremes is the prediction of rare events, and the primary problem has been the estimation of the threshold and the exceedances which were difficult without an accurate method of calculation. In this paper, to obtain the threshold or the exceedances, four methods were considered. For this comparison a GPD model was used to estimate parameters and quantiles for the seven durations (1, 2, 3, 6, 12, 18 and 24 hours) and the ten return periods (2, 3, 5, 10, 20, 30, 50, 70, 80 and 100 years). The parameters and quantiles of the three-parameter generalized Pareto distribution were estimated with three methods (MOM, ML and PWM). To estimate the degree of fit, three methods (K-S, CVM and A-D test) were performed and the relative root mean squared error (RRMSE) was calculated for a Monte Carlo generated sample. Then the performance of these methods were compared with the objective of identifying the best method from their number.

Time-varying modeling of the composite LN-GPD (시간에 따라 변화하는 로그-정규분포와 파레토 합성 분포의 모형 추정)

  • Park, Sojin;Baek, Changryong
    • The Korean Journal of Applied Statistics
    • /
    • v.31 no.1
    • /
    • pp.109-122
    • /
    • 2018
  • The composite lognormal-generalized Pareto distribution (LN-GPD) is a mixture of right-truncated lognormal and GPD for a given threshold value. Scollnik (Scandinavian Actuarial Journal, 2007, 20-33, 2007) shows that the composite LN-GPD is adequate to describe body distribution and heavy-tailedness. This paper considers time-varying modeling of the LN-GPD based on local polynomial maximum likelihood estimation. Time-varying model provides significant detailed information of time dependent data, hence it can be applied to disciplines such as service engineering for staffing and resources management. Our work also extends to Beirlant and Goegebeur (Journal of Multivariate Analysis, 89, 97-118, 2004) in the sense of losing no data by including truncated lognormal distribution. Our proposed method is shown to perform adequately in simulation. Real data application to the service time of the Israel bank call center shows interesting findings on the staffing policy.

An Alternative Study of the Determination of the Threshold for the Generalized Pareto Distribution (일반화 파레토 분포에서 임계치 결정에 대한 대안적 연구)

  • Yoon, Jeong-Yoen;Cho, Jae-Beom;Jun, Byoung-Cheol
    • The Korean Journal of Applied Statistics
    • /
    • v.24 no.5
    • /
    • pp.931-939
    • /
    • 2011
  • In practice, thresholds are determined by the two subjective assessment methods in a generalized pareto distribution of mean extreme function(MEF-graph) or Hill-graph. To remedy the problem of subjectiveness of these methods, we propose an alternative method to determine the threshold based on the robust statistics. We compared the MEF-graph, Hill-graph and our method through VaRs on the Korean stock market data from January 5, 1987 to August 3, 2009. As a result, the VaR based on the proposed method is not much different from the existing methods, and the standard deviation of VaR for our method was the smallest. The results show that our method can be a promising alternative to determine thresholds of the generalized pareto distributions.

Extreme Value Analysis of Statistically Independent Stochastic Variables

  • Choi, Yongho;Yeon, Seong Mo;Kim, Hyunjoe;Lee, Dongyeon
    • Journal of Ocean Engineering and Technology
    • /
    • v.33 no.3
    • /
    • pp.222-228
    • /
    • 2019
  • An extreme value analysis (EVA) is essential to obtain a design value for highly nonlinear variables such as long-term environmental data for wind and waves, and slamming or sloshing impact pressures. According to the extreme value theory (EVT), the extreme value distribution is derived by multiplying the initial cumulative distribution functions for independent and identically distributed (IID) random variables. However, in the position mooring of DNVGL, the sampled global maxima of the mooring line tension are assumed to be IID stochastic variables without checking their independence. The ITTC Recommended Procedures and Guidelines for Sloshing Model Tests never deal with the independence of the sampling data. Hence, a design value estimated without the IID check would be under- or over-estimated because of considering observations far away from a Weibull or generalized Pareto distribution (GPD) as outliers. In this study, the IID sampling data are first checked in an EVA. With no IID random variables, an automatic resampling scheme is recommended using the block maxima approach for a generalized extreme value (GEV) distribution and peaks-over-threshold (POT) approach for a GPD. A partial autocorrelation function (PACF) is used to check the IID variables. In this study, only one 5 h sample of sloshing test results was used for a feasibility study of the resampling IID variables approach. Based on this study, the resampling IID variables may reduce the number of outliers, and the statistically more appropriate design value could be achieved with independent samples.

Finding optimal portfolio based on genetic algorithm with generalized Pareto distribution (GPD 기반의 유전자 알고리즘을 이용한 포트폴리오 최적화)

  • Kim, Hyundon;Kim, Hyun Tae
    • Journal of the Korean Data and Information Science Society
    • /
    • v.26 no.6
    • /
    • pp.1479-1494
    • /
    • 2015
  • Since the Markowitz's mean-variance framework for portfolio analysis, the topic of portfolio optimization has been an important topic in finance. Traditional approaches focus on maximizing the expected return of the portfolio while minimizing its variance, assuming that risky asset returns are normally distributed. The normality assumption however has widely been criticized as actual stock price distributions exhibit much heavier tails as well as asymmetry. To this extent, in this paper we employ the genetic algorithm to find the optimal portfolio under the Value-at-Risk (VaR) constraint, where the tail of risky assets are modeled with the generalized Pareto distribution (GPD), the standard distribution for exceedances in extreme value theory. An empirical study using Korean stock prices shows that the performance of the proposed method is efficient and better than alternative methods.

Estimation of Car Insurance Loss Ratio Using the Peaks over Threshold Method (POT방법론을 이용한 자동차보험 손해율 추정)

  • Kim, S.Y.;Song, J.
    • The Korean Journal of Applied Statistics
    • /
    • v.25 no.1
    • /
    • pp.101-114
    • /
    • 2012
  • In car insurance, the loss ratio is the ratio of total losses paid out in claims divided by the total earned premiums. In order to minimize the loss to the insurance company, estimating extreme quantiles of loss ratio distribution is necessary because the loss ratio has essential prot and loss information. Like other types of insurance related datasets, the distribution of the loss ratio has heavy-tailed distribution. The Peaks over Threshold(POT) and the Hill estimator are commonly used to estimate extreme quantiles for heavy-tailed distribution. This article compares and analyzes the performances of various kinds of parameter estimating methods by using a simulation and the real loss ratio of car insurance data. In addition, we estimate extreme quantiles using the Hill estimator. As a result, the simulation and the loss ratio data applications demonstrate that the POT method estimates quantiles more accurately than the Hill estimation method in most cases. Moreover, MLE, Zhang, NLS-2 methods show the best performances among the methods of the GPD parameters estimation.

Threshold Estimation of Generalized Pareto Distribution Based on Akaike Information Criterion for Accurate Reliability Analysis (정확한 신뢰성 해석을 위한 아카이케 정보척도 기반 일반화파레토 분포의 임계점 추정)

  • Kang, Seunghoon;Lim, Woochul;Cho, Su-Gil;Park, Sanghyun;Lee, Minuk;Choi, Jong-Su;Hong, Sup;Lee, Tae Hee
    • Transactions of the Korean Society of Mechanical Engineers A
    • /
    • v.39 no.2
    • /
    • pp.163-168
    • /
    • 2015
  • In order to perform estimations with high reliability, it is necessary to deal with the tail part of the cumulative distribution function (CDF) in greater detail compared to an overall CDF. The use of a generalized Pareto distribution (GPD) to model the tail part of a CDF is receiving more research attention with the goal of performing estimations with high reliability. Current studies on GPDs focus on ways to determine the appropriate number of sample points and their parameters. However, even if a proper estimation is made, it can be inaccurate as a result of an incorrect threshold value. Therefore, in this paper, a GPD based on the Akaike information criterion (AIC) is proposed to improve the accuracy of the tail model. The proposed method determines an accurate threshold value using the AIC with the overall samples before estimating the GPD over the threshold. To validate the accuracy of the method, its reliability is compared with that obtained using a general GPD model with an empirical CDF.

Orographic Precipitation Analysis with GPD Model and Linear Regression (GPD 모형 및 선형회귀분석을 이용한 산악형 강수 해석)

  • Um, Myoung-Jin;Yun, Hye-Seon;Cho, Won-Cheol;Heo, Jun-Haeng
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2008.05a
    • /
    • pp.1053-1057
    • /
    • 2008
  • 본 연구에서는 산악형 강수 해석을 위해 제주도내 강우관측 자료를 이용하여 확률강우량 산정 및 고도와의 선형회귀분석을 수행하였다. 제주도내 강우관측 자료는 기상관서 4개소 및 AWS(Automatic Weather System, 자동기상관측소) 13개소의 자료를 활용하였다. 확률강우량 산정시 AWS 강우관측 자료는 AMS(Annual Maximum Series, 연 최대치 계열) 모형을 적용하기에는 자료기간이 충분하지 않으므로 짧은 자료기간에 적합한 PDS(Partial Duration Series, 부분 기간치 계열) 모형을 적용하였다. 따라서 본 연구에서는 PDS의 대표적인 분포형인 GPD(Generalized Pareto Distribution)를 적용하여 지속시간별 확률강우량을 산정하였다. 산정된 지속시간별 확률강우량과 고도와의 관계를 확인하기 위하여 선형회귀분석을 수행하였다. 회귀분석 결과 확률강우량은 고도가 증가함에 따라 선형적으로 증가하였다. 또한, 재현기간이 길어질수록 고도에 따른 확률강우량 증가율도 증가하였다. 다만, 재현기간과 관계없이 지속시간이 짧을 경우 확률강우량과 고도와의 선형 관계는 약해지는 것으로 나타났다.

  • PDF

Analysis of Extreme Values of Daily Percentage Increases and Decreases in Crude Oil Spot Prices (국제현물원유가의 일일 상승 및 하락율의 극단값 분석)

  • Yun, Seok-Hoon
    • The Korean Journal of Applied Statistics
    • /
    • v.23 no.5
    • /
    • pp.835-844
    • /
    • 2010
  • Tools for statistical analysis of extreme values include the classical annual maximum method, the modern threshold method and variants improving the second one. While the annual maximum method is to t th generalized extreme value distribution to the annual maxima of a time series, the threshold method is to the generalized Pareto distribution to the excesses over a high threshold from the series. In this paper we deal with the Poisson-GPD method, a variant of the threshold method with a further assumption that the total number of exceedances follows the Poisson distribution, and apply it to the daily percentage increases and decreases computed from the spot prices of West Texas Intermediate, which were collected from January 4th, 1988 until December 31st, 2009. According to this analysis, the distribution of daily percentage increases as well as decreases turns out to have a heavy tail, unlike the normal distribution, which coincides well with the general phenomenon appearing in the analysis of lots of nowaday nancial data.