• Title/Summary/Keyword: Quantile-on-quantile estimation

Search Result 86, Processing Time 0.023 seconds

A Study on the Accuracy of the Maximum Likelihood Estimator of the Generalized Logistic Distribution According to Information Matrix (Information Matrix에 따른 Generalized Logistic 분포의 최우도 추정량 정확도에 관한 연구)

  • Shin, Hong-Joon;Jung, Young-Hun;Heo, Jun-Haeng
    • Journal of Korea Water Resources Association
    • /
    • v.42 no.4
    • /
    • pp.331-341
    • /
    • 2009
  • In this study, we compared the observed information matrix with the Fisher information matrix to estimate the uncertainty of maximum likelihood estimators of the generalized logistic (GL) distribution. The previous literatures recommended the use of the observed information matrix because this is convenient since this matrix is determined as the part of the parameter estimation procedure and there is little difference in accuracy between the observed information matrix and the Fisher information matrix for large sample size. The observed information matrix has been applied for the generalized logistic distribution based on the previous study without verification. For this purpose, a simulation experiment was performed to verify which matrix gave the better accuracy for the GL model. The simulation results showed that the variance-covariance of the ML parameters for the GL distribution came up with similar results to those of previous literature, but it is preferable to use of the Fisher information matrix to estimate the uncertainty of quantile of ML estimators.

A Study on Estimation of Soil Moisture Multiple Quantile Regression Model Using Conditional Merging and MODIS Land Surface Temperature Data (조건부 합성기법과 MODIS LST를 활용한 토양수분 다중분위회귀모형 산정 연구)

  • Jung, Chung Gil;Lee, Ji Wan;Kim, Da Rae;Kim, Se Hun;Kim, Seong Joon
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2018.05a
    • /
    • pp.23-23
    • /
    • 2018
  • 본 연구에서는 다중분위회귀분석모형(Multiple Quantile Regression Model, MQRM)과 MODIS(MODerate resolution Imaging Spectroradiometer) LST (Land Surface Temperature) 자료를 이용하여 전국 공간토양수분을 산정하였다. 공간토양수분을 산정하기 위한 과정은 크게 두가지로 구분된다. 첫 번째로 기존의 MODIS LST 자료를 조건부 합성 보정기법을 적용하여 실측 LST 자료와 비교하여 위성 LST 자료가 갖고 있는 오차를 보정하였다. 그 결과, 조건부 합성 보정기법을 적용하기전 전국 71개 지상관측지점에서 관측한 실측 LST와 MODIS LST의 $R^2$는 전체 평균 0.70으로 어는정도 유의성 있는 상관관계를 나타냈으나 조건부 합성 보정기법을 적용한 후 실측 LST와 MODIS LST의 $R^2$는 전체 평균 0.92로 상당히 크게 향상됨을 알 수 있었다. 두 번째로 보정된 MODIS LST를 이용하여 다중분위회귀분석 모형을 개발하고 토양수분을 예측하는 단계로 입력자료로 위성영상 자료와 관측자료를 융합하여 사용하였다. 위성영상 자료로는 보정된 MODIS LST와 MODIS NDV를 구축하였고 일단위 강수량 및 일조시간의 기상자료는 기상청으로부터 전국 71개 지점에 대해 구축하여 IDW 공간보간기법을 이용한 공간자료로 구축하였다. 토양수분 결과를 비교하기 위한 관측 토양수분은 자동농업기상관측(Automated Agriculture Observing System, AAOS)지점에서 2013년 1월부터 2015년 12월까지의 실측 일단위 토양수분 자료를 구축하여 사용하였다. 다중분위회귀분석 모형은 LST 인자를 중심으로 각각의 분위(0.05, 0.25, 0.5, 0.75, 0.95)에 해당되는 값의 회귀식을 NDVI, 강수 입력자료를 독립인자로서 조합하여 계절 및 토성에 따른 총 80개의 회귀식을 산정하였다. 관측 토양수분과 모의 토양수분을 비교한 결과 $R^2$가 0.70 (철원), 0.90 (춘천), 0.85 (수원), 0.65 (서산), 0.78 (청주), 0.82 (전주), 0.62 (순천), 0.63 (진주), 0.78 (보성)로 높은 상관성을 보였다. 본 연구에서는 다중분위회귀 모형의 성능을 검증하기 위해 기존의 다중선형회귀모형의 결과와 비교하여 크게 개선됨을 나타냈다.

  • PDF

Planning Accelerated Degradation Tests: the Case of Gamma Degradation Process (열화가 감마과정을 따르는 경우 가속열화시험의 최적 계획)

  • Lim, Heonsang;Lim, Dae-Eun
    • Journal of Korean Society for Quality Management
    • /
    • v.43 no.2
    • /
    • pp.169-184
    • /
    • 2015
  • Purpose: This paper is concerned with optimally designing accelerated degradation test (ADT) plans based on a gamma process for the degradation model. Methods: By minimizing the asymptotic variance of the MLE of the q-th quantile of the lifetime distribution at the use condition, the test stress levels and the proportion of test units allocated to each stress level are optimally determined. Results: The optimal plans of ADT are developed for various combination of parameters. In addition, a method for determining the sample size is developed, and sensitivity analysis procedures are illustrated with an example. Conclusion: It is important to optimally design ADT based on a gamma process under the condition that a degradation process should be always nonnegative and strictly increasing over time.

Support vector expectile regression using IRWLS procedure

  • Choi, Kook-Lyeol;Shim, Jooyong;Seok, Kyungha
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.4
    • /
    • pp.931-939
    • /
    • 2014
  • In this paper we propose the iteratively reweighted least squares procedure to solve the quadratic programming problem of support vector expectile regression with an asymmetrically weighted squares loss function. The proposed procedure enables us to select the appropriate hyperparameters easily by using the generalized cross validation function. Through numerical studies on the artificial and the real data sets we show the effectiveness of the proposed method on the estimation performances.

Regional flood frequency analysis of extreme rainfall in Thailand, based on L-moments

  • Thanawan Prahadchai;Piyapatr Busababodhin;Jeong-Soo Park
    • Communications for Statistical Applications and Methods
    • /
    • v.31 no.1
    • /
    • pp.37-53
    • /
    • 2024
  • In this study, flood records from 79 sites across Thailand were analyzed to estimate flood indices using the regional frequency analysis based on the L-moments method. Observation sites were grouped into homogeneous regions using k-means and Ward's clustering techniques. Among various distributions evaluated, the generalized extreme value distribution emerged as the most appropriate for certain regions. Regional growth curves were subsequently established for each delineated region. Furthermore, 20- and 100-year return values were derived to illustrate the recurrence intervals of maximum rainfall across Thailand. The predicted return values tend to increase at each site, which is associated with growth curves that could describe an increasing long-term predictive pattern. The findings of this study hold significant implications for water management strategies and the design of flood mitigation structures in the country.

Confidence Interval for the Difference or Ratio of Two Median Failure Times from Clustered Survival Data

  • Lee, Seung-Yeoun;Jung, Sin-Ho
    • The Korean Journal of Applied Statistics
    • /
    • v.22 no.2
    • /
    • pp.355-364
    • /
    • 2009
  • A simple method is proposed for constructing nonparametric confidence intervals for the difference or ratio of two median failure times. The method applies when clustered survival data with censoring is randomized either (I) under cluster randomization or (II) subunit randomization. This method is simple to calculate and is based on non-parametric density estimation. The proposed method is illustrated with the otology study data and HL-A antigen study data. Moreover, the simulation results are reported for practical sample sizes.

A Kernel-function-based Approach to Sequential Estimation with $\beta$-protection of Quantiles

  • 김성래;김성균
    • Proceedings of the Korean Society of Computational and Applied Mathematics Conference
    • /
    • 2003.09a
    • /
    • pp.14-14
    • /
    • 2003
  • Given a sequence { $X_{n}$} of independent and identically distributed random variables with F, a sequential procedure for the p-th quantile ξ$_{P}$= $F^{-1}$ (P), 0$\beta$-protection. Some asymptotic properties for the proposed procedure and of an involved stopping time are proved: asymptotic consistency, asymptotic efficiency and asymptotic normality. From one of the results an effect of smoothing based on kernel functions is discussed. The results are also extended to the contaminated case.e.e.

  • PDF

Rainfall Quantile Estimation based on Scaling Invariance Property (강우자료의 지속기간별 스케일링 성질을 이용한 확률강우량 산정)

  • Jung, Young-Hun;Kim, Soo-Young;Kim, Tae-Soon;Heo, Jun-Haeng
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2007.05a
    • /
    • pp.1538-1542
    • /
    • 2007
  • 확률강우량은 일반적으로 년최대 강우량자료를 바탕으로 빈도해석을 실시하여 산정하며, 국내에서는 주로 매시각별로 관측된 자료를 이용하여 지속기간 1시간에서 24시간 사이에 대하여 산정하고 있다. 그러나 도달시간이 매우 단시간인 도시 유역의 확률강우량 산정을 위해서는 지속기간 15분 혹은 지속기간 30분과 같은 짧은 지속기간에 대한 확률강우량의 추정이 필요하며, 이와는 반대로 지속기간 24시간 이상의 장기간에 대한 확률강우량의 추정이 필요한 경우도 있다. 본 연구에서는 이와 같이 관측되지 않은 지속기간에 대한 확률강 우량을 산정하기 위한 방법으로써 강우자료의 지속기간별로 일정한 스케일이 유지된다는 스케일링 성질(Scaling Invariance Property)을 적용하여 확률강우량을 산정하였다. 이를 위해 대상지역의 지속기간별 년최대 강우량자료를 구축한 뒤 L-모멘트법으로 산정된 매개변수와 스케일링 성질을 이용하여 확률강우량을 산정한 후 이를 기존의 빈도해석 결과에 의한 확률강우량과 비교하여 적용성을 판단하였다.

  • PDF

A comparative study in Bayesian semiparametric approach to small area estimation

  • Heo, Simyoung;Kim, Dal Ho
    • Journal of the Korean Data and Information Science Society
    • /
    • v.27 no.5
    • /
    • pp.1433-1441
    • /
    • 2016
  • Small area model provides reliable and accurate estimations when the sample size is not sufficient. Our dataset has an inherent nonlinear pattern which signicantly affects our inference. In this case, we could consider semiparametric models such as truncated polynomial basis function and radial basis function. In this paper, we study four Bayesian semiparametric models for small areas to handle this point. Four small area models are based on two kinds of basis function and different knots positions. To evaluate the different estimates, four comparison measurements have been employed as criteria. In these comparison measurements, the truncated polynomial basis function with equal quantile knots has shown the best result. In Bayesian calculation, we use Gibbs sampler to solve the numerical problems.

A concise overview of principal support vector machines and its generalization

  • Jungmin Shin;Seung Jun Shin
    • Communications for Statistical Applications and Methods
    • /
    • v.31 no.2
    • /
    • pp.235-246
    • /
    • 2024
  • In high-dimensional data analysis, sufficient dimension reduction (SDR) has been considered as an attractive tool for reducing the dimensionality of predictors while preserving regression information. The principal support vector machine (PSVM) (Li et al., 2011) offers a unified approach for both linear and nonlinear SDR. This article comprehensively explores a variety of SDR methods based on the PSVM, which we call principal machines (PM) for SDR. The PM achieves SDR by solving a sequence of convex optimizations akin to popular supervised learning methods, such as the support vector machine, logistic regression, and quantile regression, to name a few. This makes the PM straightforward to handle and extend in both theoretical and computational aspects, as we will see throughout this article.