This Week's Readings

What I Read This Week

Bias and Excess Variance in Election Polling:

Interested in Polling Errors
The poll errors are extremely time sensitive to the time between the poll and the actual election outcome
propose a hidden Markov model to capture time varying preferenes and treat the election results as a peak at the typically hidden process.
CLAIM: Their solution is much less sensitive to time window, avoids conflating errors, and are interpretable
Compare their model to an already established 2018 paper by Shirani-Mehr et all, which is a linear model as well as a simple non-intercept nor time dependent distribution model.
MAIN ISSUE: The methods are inconsistent across many inclusion windows, ie if your support is overstated by polls that factor changes over how many days of polling. support overstated = X days polled

Mislabel changes in preferences as polling errors with high precision because they dont account for model misspessification [1]
Certain implementations also require log & logit transformations which can cause directional error Model Advantages 1. Consistency across time windows 2. Avoid conflating changes in preferences with polling errors 3. Interpretability

Specification (a) i = poll (b) ri = election (c) yi = Proportion of Sample Intending to Vote Republican out of Republican/Democrat (d) ni = number of two party voters (e) vri = Republican portion of the two party vote (f) ASSUME: yi ∼ N (pi, σ2 i ) (g) THE MODELS DIFFER BASED ON HOW pi THE TRUE UNDERLYING PREFERENCE IN Poll i IS DECOMPOSED
M1: Static Model considering (a) pi := vri + αri (b) yi − vri ∼ N (αri, σ2 i ) (c) This is assuming the electorates preferences don’t change over time and the error, αri is time-invariant election specific error. This also means you just have to take a poll very close.
M2: A Linear Model (a) logit(pi) = logit(vri) + αri + βriti (b) This error is now defined as the sum of the time-invariant erorr plus the linear model error.
Random Walk Model
- Specification