DOI QR코드

DOI QR Code

Interval prediction on the sum of binary random variables indexed by a graph

  • Park, Seongoh (Department of Statistics, Seoul National University) ;
  • Hahn, Kyu S. (Department of Communication, Seoul National University) ;
  • Lim, Johan (Department of Statistics, Seoul National University) ;
  • Son, Won (The Bank of Korea)
  • Received : 2018.10.15
  • Accepted : 2019.03.28
  • Published : 2019.05.31

Abstract

In this paper, we propose a procedure to build a prediction interval of the sum of dependent binary random variables over a graph to account for the dependence among binary variables. Our main interest is to find a prediction interval of the weighted sum of dependent binary random variables indexed by a graph. This problem is motivated by the prediction problem of various elections including Korean National Assembly and US presidential election. Traditional and popular approaches to construct the prediction interval of the seats won by major parties are normal approximation by the CLT and Monte Carlo method by generating many independent Bernoulli random variables assuming that those binary random variables are independent and the success probabilities are known constants. However, in practice, the survey results (also the exit polls) on the election are random and hardly independent to each other. They are more often spatially correlated random variables. To take this into account, we suggest a spatial auto-regressive (AR) model for the surveyed success probabilities, and propose a residual based bootstrap procedure to construct the prediction interval of the sum of the binary outcomes. Finally, we apply the procedure to building the prediction intervals of the number of legislative seats won by each party from the exit poll data in the $19^{th}$ and $20^{th}$ Korea National Assembly elections.

Keywords

References

  1. Anselin L (1988). Spatial Econometrics, Dordrecht: Kluwer Academic Publishing.
  2. Arnold M and Wied D (2010). Improved GMM estimation of the spatial autoregressive error model, Economic Letters, 108, 65-68. https://doi.org/10.1016/j.econlet.2010.04.012
  3. Bafumi J, Erikson RS, and Wlezien C (2010). Ideological balancing, generic polls and midterm congressional elections, Journal of Politics, 72, 705-719. https://doi.org/10.1017/S0022381610000113
  4. Baltagi BH, Song SH, and Koh W (2003). Testing panel data regression models with spatial error correlation, Journal of Econometrics, 117, 123-150. https://doi.org/10.1016/S0304-4076(03)00120-9
  5. Brown P and Payne C (1975). Election night forecasting, Journal of the Royal Statistical Society, Series A, 138, 463-498. https://doi.org/10.2307/2345211
  6. Cochrane D and Orcutt GH (1949). Application of least squares regression to relationships containing auto-correlated Error Terms, 44(245), 32-61. https://doi.org/10.2307/2280349
  7. Curtice J and Firth D (2008). Exit polling in a cold climate: the BBC-ITV experience in Britain in 2005, Journal of the Royal Statistical Society, Series A, 171, 509-539. https://doi.org/10.1111/j.1467-985X.2007.00536.x
  8. Greiner DJ and Quinn KM (2010). Exit polling and racial bloc voting: Combining individual-level and $R{\times}C$ ecological data, The Annals of Applied Statistics, 4, 1774-1796. https://doi.org/10.1214/10-AOAS353
  9. Hordijk L (1974). Spatial correlation in the disturbances of a linear interregional model, Regional and Urban Economics, 4, 117-140. https://doi.org/10.1016/0034-3331(74)90025-6
  10. Huh MH (2008). Predicting major political parties' number of seats in general election: the case of 2004 general election of Korea, Korean Association for Survey Research, 9, 87-100.
  11. Kawk J and Choi B (2014). A comparison study for accuracy of exit poll based on nonresponse model, Journal of the Korean Data and Information Science Society, 25, 53-64. https://doi.org/10.7465/jkdi.2014.25.1.53
  12. Kwak ES, Kim JY, and Kim YW (2013) Analysis of forecasting error of the exit poll for the general election of 2012 in Korea, The Korean Association for Survey Research, 11, 33-55.
  13. Lee L (2002). Consistency and efficiency of least squares estimation for mixed regressive, spatial autoregressive models, Econometric Theory, 18, 252-277. https://doi.org/10.1017/S0266466602182028
  14. Mendenhall W, Scheaffer RL, and Ott L (1971). Elementary Survey Sampling, Wadsworth Publishing Company.
  15. Mitosfky WJ (1991). A Short History of Exit Polls, Sage, Newbury Park, CA.
  16. Mitosfky WJ (1995). A Review of the 1992 VRS Exit Polls, Westview Press, Boulder, Colorado.
  17. Ord K (1975). Estimation methods for models of spatial interaction, Journal of the American Statistical Association, 70, 120-126. https://doi.org/10.1080/01621459.1975.10480272
  18. Wang W, Rothschild D, Goel S, and Gelman A (2015). Forecasting elections with non-representative polls, International Journal of Forecasting, 31, 980-991. https://doi.org/10.1016/j.ijforecast.2014.06.001