Estimating grouped data models with a binary dependent variable and fixed effect via logit vs OLS: the impact of dropped units
Nathaniel Beck

TL;DR
This paper compares OLS and logit models for grouped binary data with fixed effects, highlighting how OLS averages over all groups including those with no variation, and recommends reporting results for both all groups and varying groups.
Contribution
It demonstrates that OLS and logit differ in handling groups with no variation in the dependent variable and advises on proper reporting practices.
Findings
OLS averages over all groups, including those with no variation.
Logit drops groups with no variation, focusing on groups with variation.
Researchers should report results for all groups and only varying groups.
Abstract
This letter deals with a very simple issue: if we have grouped data with a binary dependent variable and want to include fixed effects (group specific intercepts) in the specification, is Ordinary Least Squares (OLS) in any way superior to a logit form because the OLS method \emph{appears} to keep all observations whereas the logit drops all groups which have either all zeros or all ones on the dependent variable? It is shown that OLS averages the estimates for the all zero (and all one) groups, which by definition have all slope coefficients of zero, with the slope coefficients for the groups with a mix of zeros and ones. Thus the correct comparison of OLS to logit is to only look at groups with some variation in the dependent variable. Researchers using OLS are urged to report results both for all groups and for the subset of groups where the dependent variable varies. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpatial and Panel Data Analysis · Game Theory and Voting Systems · Efficiency Analysis Using DEA
