Browse > Article
http://dx.doi.org/10.5351/KJAS.2021.34.5.697

Predicting claim size in the auto insurance with relative error: a panel data approach  

Park, Heungsun (Department of Statistics, Hankuk University of Foreign Studies)
Publication Information
The Korean Journal of Applied Statistics / v.34, no.5, 2021 , pp. 697-710 More about this Journal
Abstract
Relative error prediction is preferred over ordinary prediction methods when relative/percentile errors are regarded as important, especially in econometrics, software engineering and government official statistics. The relative error prediction techniques have been developed in linear/nonlinear regression, nonparametric regression using kernel regression smoother, and stationary time series models. However, random effect models have not been used in relative error prediction. The purpose of this article is to extend relative error prediction to some of generalized linear mixed model (GLMM) with panel data, which is the random effect models based on gamma, lognormal, or inverse gaussian distribution. For better understanding, the real auto insurance data is used to predict the claim size, and the best predictor and the best relative error predictor are comparatively illustrated.
Keywords
best relative error predictor; best predictor; generalized linear mixed models; random effect; percentile error; panel data; claim size;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Breslow NE and Clayton DG (1993). Approximate inference in generalized linear mixed models, Journal of the American Statistical Association, 88, 9-25.   DOI
2 Chen K, Guo S, Lin Y, and Ying Z (2010). Least absolute relative error estimation, Journal of the American Statistical Association, 105, 1104-1112.   DOI
3 Chhikara RS and Folks JL (1989). The Inverse Gaussian Distribuition, Marcel Dekker, New York.
4 Davidian M and Giltinan DM (1995).Nonlinear Models for Repeated Measurement Data, Boca Raton: Chapman & Hall/CRC.
5 Hong L and Martin R (2019). Valid Model-Free Prediction of Future Insurance Claims, Retrieved October 12, 2019, from SSRN: https://ssrn.com/abstract=3468969 or http://dx.doi.org/10.2139/ssrn.3468969   DOI
6 Johnson NL and Kotz S (1970). Continuous Univariate Distributions: Distributions in Statistics, John Wiley & Sons, New York.
7 Jorgensen B and Souza MCP de (1994). Fitting Tweedie's compound poisson model to insurance claims data, Scandinavian Actuarial Journal, 1, 69-93.   DOI
8 Bickel PJ and Doksum KA (1977). Mathematical Statistics, Holden-Day Inc., Oakland, CA.
9 Boland PJ (2007). Statistical and Probabilistic Methods in Actuarial Science, Boca Raton: Chapman & Hall/CRC.
10 Boodhun N and Jayabalan M (2018). Risk prediction in life insurance industry using supervised learning algorithm, Complex & Intelligent Systems, 4, 145-154.   DOI
11 Wolfinger RD and O'Connell M (1993). Generalized linear mixed models: a pseudo-likelihood approach, Journal of Statistical Computation and Simulation, 40, 233-243.   DOI
12 Kim MJ and Kim YH (2009). Various modeling approaches in auto insurance pricing, Journal of the Korean Data & Information Science Society, 20, 515-526.
13 Makridakis SG (1984). The Forecasting Accuracy of Major Time Series Methods, Wiley, New York.
14 Huang T, Zhao R, and Tang W (2009). Risk model with fuzzy random individual claim amount, European Journal of Operational Research, 192, 879-890.   DOI
15 Jong Piet de and Heller GZ (2008). Generalized Linear Models for Insurance Data, international series on actuarial science, Cambridge University Press.
16 SAS/STAT User's Guide (2012). The GLIMMIX Procedure, SAS Institute Inc., Cary, NC, USA.
17 Tweedie MCK (1984). An index which distinguishes between some important exponential families in statistics applications and new directions. In Proceedings of the Indian Statistical Institute Golden Jubilee International Conference, 579-604.
18 Lee Y and Nelder JA (2001). Hierarchical generalized linear models: a synthesis of generalized linear models, random-effect models and structured dispersions, Biometrika, 88, 987-1006.   DOI
19 Kahneman D and Tversky A (1979). Prospect theory: an analysis of decision under risk, Econometrica, 47, 263-291.   DOI
20 Lee Y and Nelder JA (1996). Hierarchical generalized linear models, Journal of Royal Statistical Society: Series B (Methodological), 58, 619-656.   DOI
21 Park H and Shin KI (2005). A shrinked forecast in stationary processes favoring percentage error, Journal of Time Series Analysis, 27, 129-139.   DOI
22 Park H and Stefanski LA (1998). Relative-error prediction, Statistics & Probability Letters, 40, 227-236.   DOI
23 Stoyanov J (1999). Inverse Gaussian distribution and the moment problem, Journal of Applied Statistical Science, 9, 61-71.
24 Wang Z, Chen Z, and Chen Z (2018). H-relative error estimation approach for multiplicative regression model with random effect, Computational Statistics, 33, 623-638.   DOI
25 Smyth GK and Jorgensen B (2002). Fitting tweedie's compound poisson model to insurance claims data: dispersion modeling, Actuarial Studies in Non-life insurance (ASTIN) bulletin, 32, 143-157.
26 Werner G and Modlin C (2009). Basic Ratemaking Workshop, The Casualty Actuarial Society.
27 Wuthrich MV and Merz M (2008). Stochastic Claims Reserving Methods in Insurance, West Sussex: John Wiley & Sons, England.
28 Jones MC, Park H, Shin KI, Vines SK, and Jeong SO (2008). Relative error prediction via kernel regression smoothers, Journal of Statistical Planning and Inference, 138, 2887-2898.   DOI
29 Golub GH and Welsch JH (1969). Calculation of gaussian auadrature rules, Mathematical Computing, 23, 221-230.   DOI
30 Chen K, Lin Y, Wang Z, and Ying Z (2016). Least product relative error estimation, Journal of Multivariate Analysis, 144, 91-98.   DOI
31 Frees E (2018). Loss Data Analytics, an open text authored by the Actuarial Community.