Browse > Article
http://dx.doi.org/10.29220/CSAM.2019.26.3.261

Interval prediction on the sum of binary random variables indexed by a graph  

Park, Seongoh (Department of Statistics, Seoul National University)
Hahn, Kyu S. (Department of Communication, Seoul National University)
Lim, Johan (Department of Statistics, Seoul National University)
Son, Won (The Bank of Korea)
Publication Information
Communications for Statistical Applications and Methods / v.26, no.3, 2019 , pp. 261-272 More about this Journal
Abstract
In this paper, we propose a procedure to build a prediction interval of the sum of dependent binary random variables over a graph to account for the dependence among binary variables. Our main interest is to find a prediction interval of the weighted sum of dependent binary random variables indexed by a graph. This problem is motivated by the prediction problem of various elections including Korean National Assembly and US presidential election. Traditional and popular approaches to construct the prediction interval of the seats won by major parties are normal approximation by the CLT and Monte Carlo method by generating many independent Bernoulli random variables assuming that those binary random variables are independent and the success probabilities are known constants. However, in practice, the survey results (also the exit polls) on the election are random and hardly independent to each other. They are more often spatially correlated random variables. To take this into account, we suggest a spatial auto-regressive (AR) model for the surveyed success probabilities, and propose a residual based bootstrap procedure to construct the prediction interval of the sum of the binary outcomes. Finally, we apply the procedure to building the prediction intervals of the number of legislative seats won by each party from the exit poll data in the $19^{th}$ and $20^{th}$ Korea National Assembly elections.
Keywords
binary sum; exit poll; graph indexed variables; Korea National Assembly election; prediction interval; residual bootstrap; spatial auto-regressive model;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Anselin L (1988). Spatial Econometrics, Dordrecht: Kluwer Academic Publishing.
2 Arnold M and Wied D (2010). Improved GMM estimation of the spatial autoregressive error model, Economic Letters, 108, 65-68.   DOI
3 Bafumi J, Erikson RS, and Wlezien C (2010). Ideological balancing, generic polls and midterm congressional elections, Journal of Politics, 72, 705-719.   DOI
4 Baltagi BH, Song SH, and Koh W (2003). Testing panel data regression models with spatial error correlation, Journal of Econometrics, 117, 123-150.   DOI
5 Brown P and Payne C (1975). Election night forecasting, Journal of the Royal Statistical Society, Series A, 138, 463-498.   DOI
6 Cochrane D and Orcutt GH (1949). Application of least squares regression to relationships containing auto-correlated Error Terms, 44(245), 32-61.   DOI
7 Curtice J and Firth D (2008). Exit polling in a cold climate: the BBC-ITV experience in Britain in 2005, Journal of the Royal Statistical Society, Series A, 171, 509-539.   DOI
8 Greiner DJ and Quinn KM (2010). Exit polling and racial bloc voting: Combining individual-level and $R{\times}C$ ecological data, The Annals of Applied Statistics, 4, 1774-1796.   DOI
9 Hordijk L (1974). Spatial correlation in the disturbances of a linear interregional model, Regional and Urban Economics, 4, 117-140.   DOI
10 Huh MH (2008). Predicting major political parties' number of seats in general election: the case of 2004 general election of Korea, Korean Association for Survey Research, 9, 87-100.
11 Kawk J and Choi B (2014). A comparison study for accuracy of exit poll based on nonresponse model, Journal of the Korean Data and Information Science Society, 25, 53-64.   DOI
12 Kwak ES, Kim JY, and Kim YW (2013) Analysis of forecasting error of the exit poll for the general election of 2012 in Korea, The Korean Association for Survey Research, 11, 33-55.
13 Mitosfky WJ (1995). A Review of the 1992 VRS Exit Polls, Westview Press, Boulder, Colorado.
14 Lee L (2002). Consistency and efficiency of least squares estimation for mixed regressive, spatial autoregressive models, Econometric Theory, 18, 252-277.   DOI
15 Mendenhall W, Scheaffer RL, and Ott L (1971). Elementary Survey Sampling, Wadsworth Publishing Company.
16 Mitosfky WJ (1991). A Short History of Exit Polls, Sage, Newbury Park, CA.
17 Ord K (1975). Estimation methods for models of spatial interaction, Journal of the American Statistical Association, 70, 120-126.   DOI
18 Wang W, Rothschild D, Goel S, and Gelman A (2015). Forecasting elections with non-representative polls, International Journal of Forecasting, 31, 980-991.   DOI