Working Paper
Big Data versus a Survey
Abstract: Economists are shifting attention and resources from work on survey data to work on "big data." This analysis is an empirical exploration of the trade-offs this transition requires. Parallel models are estimated using the Federal Reserve Bank of New York Consumer Credit Panel/Equifax and the Survey of Consumer Finances. After adjustments to account for different variable definitions and sampled populations, it is possible to arrive at similar models of total household debt. However, the estimates are sensitive to the adjustments. Little similarity is observed in parallel models of nonmortgage debt. While surveys intentionally collect theoretically related variables, it may be necessary to merge external data into commercial big data. In this example, some education and income measures are successfully integrated with the big data, but other external aggregates fail to adequately substitute for survey responses. Big data offers sample sizes, frequencies, and details that surveys cannot match. However, this example illustrates why caution is appropriate when attempting to substitute big data for a carefully executed survey.
Keywords: Big Data; Survey Data; Household Debt
JEL Classification: C55; C81; D12
https://doi.org/10.26509/frbc-wp-201440
Access Documents
File(s):
https://doi.org/10.26509/frbc-wp-201440
Description: Persistent link
File(s):
File format is application/pdf
https://www.clevelandfed.org/-/media/project/clevelandfedtenant/clevelandfedsite/publications/working-papers/2014/wp-1440-big-data-versus-a-survey-pdf.pdf
Description: Full text
Bibliographic Information
Provider: Federal Reserve Bank of Cleveland
Part of Series: Working Papers (Old Series)
Publication Date: 2015-01-07
Number: 1440