Report
Nonlinear Binscatter Methods
Abstract: Binned scatter plots are a powerful statistical tool for empirical work in the social, behavioral, and biomedical sciences. Available methods rely on a quantile-based partitioning estimator of the conditional mean regression function to primarily construct flexible yet interpretable visualization methods, but they can also be used to estimate treatment effects, assess uncertainty, and test substantive domain-specific hypotheses. This paper introduces novel binscatter methods based on nonlinear, possibly nonsmooth M-estimation methods, covering generalized linear, robust, and quantile regression models. We provide a host of theoretical results and practical tools for local constant estimation along with piecewise polynomial and spline approximations, including (i) optimal tuning parameter (number of bins) selection, (ii) confidence bands, and (iii) formal statistical tests regarding functional form or shape restrictions. Our main results rely on novel strong approximations for general partitioning-based estimators covering random, data-driven partitions, which may be of independent interest. We demonstrate our methods with an empirical application studying the relation between the percentage of individuals without health insurance and per capita income at the zip-code level. We provide general-purpose software packages implementing our methods in Python, R, and Stata.
Keywords: partition-based semi-linear estimators; Linear models; quantile regression; robust bias correction; uniform inference; binning selection; treatment effect estimation;
JEL Classification: C14; C18; C21;
https://doi.org/10.59576/sr.1110
Access Documents
File(s):
File format is application/pdf
https://www.newyorkfed.org/medialibrary/media/research/staff_reports/sr1110.pdf
Description: Full text
File(s):
File format is text/html
https://www.newyorkfed.org/research/staff_reports/sr1110.html
Description: Summary
Bibliographic Information
Provider: Federal Reserve Bank of New York
Part of Series: Staff Reports
Publication Date: 2024-08-01
Number: 1110