Notebook
How well are we predicting?
The coefficients of the discrete choice model do not tell us much. What we're after is marginal effects.
The "correct" model here is likely the Tobit model. We have an work in progress branch "tobit-model" on github, if anyone is interested in censored regression models.
Compare the estimates of the Logit Fair model above to a Probit model. Does the prediction table look better? Much difference in marginal effects?
Toss a six-sided die 5 times, what's the probability of exactly 2 fours?
The number of trials
First differences: We hold all explanatory variables constant at their means and manipulate the percentage of low income households to assess its impact on the response variables:
The interquartile first difference for the percentage of low income households in a school district is: