Report

Nonlinear Binscatter Methods


Abstract: Binned scatter plots are a powerful statistical tool for empirical work in the social, behavioral, and biomedical sciences. Available methods rely on a quantile-based partitioning estimator of the conditional mean regression function to primarily construct flexible yet interpretable visualization methods, but they can also be used to estimate treatment effects, assess uncertainty, and test substantive domain-specific hypotheses. This paper introduces novel binscatter methods based on nonlinear, possibly nonsmooth M-estimation methods, covering generalized linear, robust, and quantile regression models. We provide a host of theoretical results and practical tools for local constant estimation along with piecewise polynomial and spline approximations, including (i) optimal tuning parameter (number of bins) selection, (ii) confidence bands, and (iii) formal statistical tests regarding functional form or shape restrictions. Our main results rely on novel strong approximations for general partitioning-based estimators covering random, data-driven partitions, which may be of independent interest. We demonstrate our methods with an empirical application studying the relation between the percentage of individuals without health insurance and per capita income at the zip-code level. We provide general-purpose software packages implementing our methods in Python, R, and Stata.

Keywords: partition-based semi-linear estimators; Linear models; quantile regression; robust bias correction; uniform inference; binning selection; treatment effect estimation;

JEL Classification: C14; C18; C21;

https://doi.org/10.59576/sr.1110

Access Documents

File(s): File format is application/pdf https://www.newyorkfed.org/medialibrary/media/research/staff_reports/sr1110.pdf
Description: Full text

File(s): File format is text/html https://www.newyorkfed.org/research/staff_reports/sr1110.html
Description: Summary

Authors

Bibliographic Information

Provider: Federal Reserve Bank of New York

Part of Series: Staff Reports

Publication Date: 2024-08-01

Number: 1110