Working Paper Revision
Explaining Machine Learning by Bootstrapping Partial Marginal Effects and Shapley Values
Abstract: Machine learning and artificial intelligence are often described as “black boxes.” Traditional linear regression is interpreted through its marginal relationships as captured by regression coefficients. We show that the same marginal relationship can be described rigorously for any machine learning model by calculating the slope of the partial dependence functions, which we call the partial marginal effect (PME). We prove that the PME of OLS is analytically equivalent to the OLS regression coefficient. Boot- strapping provides standard errors and confidence intervals around the point estimates of the PMEs. We apply the PME to a hedonic house pricing example and demonstrate that the PMEs of neural networks, support vector machines, random forests, and gradient boosting models reveal the non-linear relationships discovered by the machine learning models and allow direct comparison between those models and a traditional linear regression. Finally we extend PME to a Shapley value decomposition and explore how it can be used to further explain model outputs.
JEL Classification: C14; C15; C18; C45; C52;
https://doi.org/10.18651/RWP2021-12
Access Documents
File(s):
File format is text/html
https://www.kansascityfed.org/research/research-working-papers/explaining-machine-learning-by-bootstrapping-partial-dependence-functions-and-shapley-values/
Description: Full text
Bibliographic Information
Provider: Federal Reserve Bank of Kansas City
Part of Series: Research Working Paper
Publication Date: 2024-08-06
Number: RWP 21-12
Related Works
- Working Paper Revision (2024-08-06) : You are here.
- Working Paper Original (2021-11-15) : Explaining Machine Learning by Bootstrapping Partial Dependence Functions and Shapley Values