Document [original]

1. Introduction

Average flood losses in the USA amount to US$32.1 billion per year and flood risk is estimated to grow by

26.4% in 2050 (Wing etal.,2022). Impact patterns are inequitable across different socio-economic groups (Wing

etal.,2022). Thus, there is an urgent need for improved and more equitable adaptation to flood risk in the US.

Proactive adaptation strategies, that is, informed actions based on risk assessments that are adopted before a

flood hits, can be driven by top-down measures (e.g., community-level policies or infrastructural interventions

for flood protection) or bottom-up actions (e.g., purchase of flood insurance by households). Household adap-

tation, for instance through insurance purchase (Cremades etal.,2018), has the potential to lower future flood

risk substantially by reducing the financial vulnerability to residual risk (Aerts etal.,2018; Jongman etal.,2015;

Kundzewicz etal.,2018).

In the US, top-down and bottom-up flood adaptation measures are interlinked within the mechanism that relates

individuals' flood insurance access and communities' flood risk management. Flood insurance adoption as a

bottom-up measure is regulated within the National Flood Insurance Program (NFIP). The NFIP was founded

in 1968 as the only source of flood insurance in the US. Communities maintain minimum floodplain manage-

ment standards to make insurance accessible for their inhabitants (Horn & Brown,2018). Within the NFIP, the

Community Rating System (CRS) (FEMA,2021c) is a top-down strategy to encourage insurance adoption. If a

community agrees to undertake flood risk mitigation and floodplain management measures, it is ranked with a

higher CRS class and the insurance premiums in the community will be lowered (Sadiq etal.,2019).

Abstract Floods cause average annual losses of more than US$30 billion in the US and are estimated

to significantly increase due to global change. Flood resilience, which currently differs strongly between

socio-economic groups, needs to be substantially improved by proactive adaptive measures, such as timely

purchase of flood insurance. Yet, knowledge about the state and uptake of private adaptation and its drivers is

so far scarce and fragmented. Based on interpretable machine learning and large insurance and socio-economic

open data sets covering the whole continental US we reveal that flood insurance purchase is characterized by

reactive behavior after severe flood events. However, we observe that the Community Rating System helps

overcome this behavior by effectively fostering proactive insurance purchase, irrespective of socio-economic

backgrounds in the communities. Thus, we recommend developing additional targeted measures to help

overcome existing inequalities, for example, by providing special incentives to the most vulnerable and exposed

communities.

Plain Language Summary Flood resilience of individuals and communities can be improved by

bottom-up strategies, such as insurance purchase, or top-down measures like the US National Flood Insurance

Program's Community Rating System (CRS). Our interpretable machine learning approach shows that flood

insurances are mostly purchased reactively, after the occurrence of a flood event. Yet, reactive behaviors are

ill-suited as more extreme events are expected under future climate, also in areas that were not previously

flooded. The CRS counteracts this behavior by fostering proactive adaptation across a widespread range

of socio-economic backgrounds. Future risk management including the CRS should support and motivate

individuals' proactive adaptation with a particular focus on highly vulnerable social groups to overcome existing

inequalities in flood risk.

VEIGEL ETAL.

This is an open access article under

the terms of the Creative Commons

Attribution-NonCommercial License,

which permits use, distribution and

reproduction in any medium, provided the

original work is properly cited and is not

used for commercial purposes.

Interpretable Machine Learning Reveals Potential to

Overcome Reactive Flood Adaptation in the Continental US

Nadja Veigel1,2,3 , Heidi Kreibich2 , and Andrea Cominola1,3

1Chair of Smart Water Networks, Technische Universität Berlin, Berlin, Germany, 2Section 4.4 Hydrology, GFZ German

Research Centre for Geosciences, Potsdam, Germany, 3Einstein Center Digital Future, Berlin, Germany

Key Points:

• Flood insurance purchase in the US is

dominated by reactive behavior after

severe floods

• The Community Rating System (CRS)

fosters proactive insurance adoption

irrespective of socio-economic

background

• The CRS should further balance

existing inequalities by targeting

specific population segments

Supporting Information:

Supporting Information may be found in

the online version of this article.

Correspondence to:

N. Veigel,

nadja.v[email protected]

Citation:

Veigel, N., Kreibich, H., & Cominola, A.

(2023). Interpretable machine learning

reveals potential to overcome reactive

flood adaptation in the continental US.

Earth's Future, 11, e2023EF003571.

https://doi.org/10.1029/2023EF003571

Received 5 FEB 2023

Accepted 6 SEP 2023

Author Contributions:

Conceptualization: Nadja Veigel, Heidi

Kreibich, Andrea Cominola

Data curation: Nadja Veigel

Funding acquisition: Heidi Kreibich,

Andrea Cominola

Methodology: Nadja Veigel, Andrea

Cominola

Resources: Heidi Kreibich, Andrea

Cominola

Software: Nadja Veigel, Andrea

Cominola

Supervision: Heidi Kreibich, Andrea

Cominola

Validation: Nadja Veigel

Visualization: Nadja Veigel

Writing – original draft: Nadja Veigel

10.1029/2023EF003571

RESEARCH ARTICLE

1 of 11

Earth’s Future

VEIGEL ETAL.

10.1029/2023EF003571

2 of 11

Previous studies have shown that individuals' adaptation actions are potentially impacted by a large number of

demographic, socio-economic, and psychological factors (Koerth etal.,2017; McPhillips etal.,2018), including

risk perception (Bubeck etal.,2012; Sanders etal.,2020), preparedness, and social networks (Hu,2020). For

instance, clear increases of flood insurance take-up rates were observed after the occurrence of catastrophic

floods by several studies (Gallagher,2014; Kuang & Liao,2020). Moreover, resilience differs quite strongly

between different socio-economic groups and there are problems of inequitable access to adaptation meas-

ures and unbalanced consideration of different socio-demographic groups in flood risk management policies

(Knighton etal.,2021; Wing etal.,2022). A US wide study with focus on selected metropolitan areas showed

a disproportional exposure of metropolitan areas with a higher percentage of Black, Indigenous, and People of

Color (BIPoC) residents (Knighton etal.,2021). Another study in Georgia showed that African-Americans as

well as educated and older people were more likely to purchase flood insurance (Atreya etal.,2015). Conversely,

Cannon etal.(2020) found that the dominant ethnicity in a neighborhood was not significantly associated with

insurance coverage, whereas Knighton etal.(2021) describe unequal exposure patterns.

Overall, the literature on the socio-economic drivers of bottom-up adaptation measures in the US is heteroge-

neous and in some cases contradictory. This could be a result of spatially heterogeneous individual reactions or

analysis run on small samples. This reveals a research gap: to generalize beyond fragmented results, studies are

needed that provide large scale evidence on the drivers of bottom-up flood adaptation strategies in combination

with community-level policies.

Here, we investigate the relationship between socio-economic backgrounds, household flood insurance purchase

(bottom-up), and the community-scale CRS (top-down) in the whole continental US. We aim to reveal the main

drivers of insurance uptake and explore the potential of the CRS to foster proactive adaptation and overcome

inequitable flood risk management across household and community scales, without any a priori assumption on

individuals' motivations, or utility maximization. We formulate the following research questions:

1. What are the main socio-economic and behavioral drivers of flood insurance purchase in the US? Is the CRS

effective in fostering individuals' flood insurance purchase?

2. Which patterns emerge in the socio-economic composition of communities that are active in the CRS? Are

there inequalities in the representation of different social groups?

2. Materials and Methods

To address the above questions, we develop a data-driven method based on interpretable machine learning (ML).

We investigate the role of socio-economic characteristics, flood history, and participation in the CRS with respect

to flood insurance purchase as for the whole continental US. Similarly, we model communities' participation in

the CRS as a function of flood history and socio-economic characteristics.

2.1. Data Sources and Data Preparation

We analyze open-access flood insurance data by the NFIP and the American Community Survey (ACS) by the

US Census Bureau with a census tract resolution to model the policy records since 2009 that were in-force at the

time of data retrieval (FEMA,2020). From the ACS we initially selected 400 different potential behavioral and

socio-economic predictors to model individuals' and communities' flood resilience and compare it across differ-

ent census tracts. Additionally we model CRS participation probability (FEMA,2021a).

The locations of the NFIP policies-in-force (FEMA,2020) are redacted to 0.1° precision upon retrieval. To coun-

teract the resulting high concentration of points (in a few census tracts), we apply a random spread to the points.

Finally, we spatially aggregate this data to match the census tract scale of the ACS data. Thus, we total the data

points within the census tracts and divide by the number of housing units. We then use the resulting processed

data as output variable to train a ML model to predict household flood insurance purchase as a function of the

potential behavioral and socio-economic predictors. In building the set of candidate model input features, or

predictors, we manually select 400 variables from more than 25,000 census variables contained in the ACS data

set. Feature values were retrieved as total counts and divided by the number of inhabitants in a census tract to

estimate the percentage of people showing a respective feature. We consider the claims filed within the NFIP as a

proxy to estimate past average annual number of floods (feature called flood history, regular) and the maximum

Writing – review & editing: Nadja

Veigel, Heidi Kreibich, Andrea Cominola

Earth’s Future

VEIGEL ETAL.

10.1029/2023EF003571

3 of 11

amount of people affected in a flood per census tract (FEMA,2019) to estimate flood severity (flood history,

severe). This strategy may result in an under-estimation of under-insured census tracts for the flood severity

variable. The area within the flood zone in a census tract is calculated based on the official Special Flood Hazard

Area (SFHA) by FEMA (FEMA,2021b) and distinguished between coastal (flood zone V) and fluvial flood

zones (flood zone A). Two features describing whether the insurance policy covered a building or its contents are

calculated based on the metadata of the policies in force indicating the percentage of policies covering contents

and those covering buildings. We find no significant pair-wise variable correlations in our input variables, which

we assume to be a result of spatial heterogeneity and locally differing effects. The resulting data set contains 400

features for 72,366 samples (census tracts) across all states in the continental US.

2.2. Regression of Flood Insurance Purchase per Household and Classification of CRS Participation

We first perform a regression task to estimate insurance coverage in a census tract based on the 400 features

described in the previous section using a Gradient Boosting Decision Trees (GBDT) framework. In GBDT

an ensemble model is assembled by step-wise addition of decision trees to minimize the residual error of a

tree ensemble. We use an efficient form of state-of-the-art GBDT called Light Gradient Boosting Machine

(LightGBM) (Ke etal.,2017). LightGBM builds on Gradient-based-One Side-Sampling and Exclusive Feature

Bundling to accelerate model training and achieve higher prediction accuracy (Ke etal.,2017). We train the

LightGBM using data from 80% of the census tracts (n=54,274) and test on the remaining 20% (n=18,092)

(see Text S1.3 in Supporting InformationS1). We train the LightGBM model both at the aggregate level for the

whole US, as well as a series of individual models, one for each state in the continental US. For the regression

problem of insurance purchase modeling, we tune the hyperparameters of the LightGBM via grid search and

k-fold cross-validation (k=10) on the training set. For all simulations we select a Tweedie objective for extremely

unbalanced zero-inflated distributions as in Zhou etal.(2020). Its gradient is calculated as the log likelihood of a

Tweedie distribution (dispersion parameter equivalent to 1.7). For details on the implementation in this specific

framework please refer to Veigel etal.(2022). We estimate the Tweedie distribution dispersion parameter by

fitting a Tweedie distribution to the target variable in the training data set before training the LightGBM. Finally,

we assess the model performance using multiple performance metrics, including the Root-Mean-Square Error

(RMSE), the Mean Absolute Error (MAE), and the coefficient of determination (R

2). LightGBM robustness to

possible correlation and redundancy in the input data is achieved by selection of one feature within a correlated

group.We apply a hierarchical grouping strategy (Text S1.2 in Supporting InformationS1) to counteract this

model limitation.

In the second part of our research, we use LightGBM also in classification mode to tackle the binary classifica-

tion problem of predicting whether a community takes part in the CRS. LightGBM hyperparameters are trained

similarly to the above regression task, but the area under the receiver operating characteristic curve was used as an

objective function (Bradley,1997). Estimated participation probabilities above 0.5 are classified as participating

communities. Class imbalance in the training data set is equalized prior to training by random under-sampling in

the class with more instances to contain the same amount of observations as in the underrepresented class (Guo

etal.,2008).

2.3. Interpretable Machine Learning

We use LightGBM in combination with SHapley Additive exPlanations (SHAP) (S. Lundberg & Lee,2017) with

the two-fold goal of, first, predicting insurance coverage and CRS participation and, second, understanding which

predictors (features) affect the model outcome and with which magnitude. SHAP values are an Interpretable ML

method that quantifies the shift of the predicted value caused by each individual input feature. They represent

the marginal change we observe in a conditional expectation function upon feature introduction at each step in a

unique and additive manner. Since the calculation of marginal impacts depends on the features previously eval-

uated, the contribution should be averaged over all previously implemented features. Several methods to achieve

this estimation were previously proposed, tough they are computationally demanding (S. Lundberg & Lee,2017).

Conversely, SHAP values can be estimated efficiently by exploiting the structure of tree based models such as

LightGBM. They are calculated by evaluating all possible combinations of branches that end in the respective

node with the Tree SHAP algorithm (S. M. Lundberg etal.,2020). In this work, we adopt SHAP values over other

measures of feature importance to quantify the effect of features on model output. SHAP enhances interpretation

Earth’s Future

VEIGEL ETAL.

10.1029/2023EF003571

4 of 11

by quantifying whether a specific predictor has a positive or negative influence on model output, along with

whether specific ranges of interest emerge, when a predictor has a stronger/weaker influence on the model output.

We calculate SHAP values both for the regression problem of predicting household flood insurance purchase in a

census tract, where the SHAP value represents the change in insurance policies per housing unit in a census tract

that can be attributed to a specific input variable, and for the classification problem of predicting communities'

participation in the CRS, where the SHAP values represent the changes in participation probability attributed to

a specific input variable. To mitigate the potential presence of weakly correlated features in our data set contain-

ing 400 variables, we apply a grouping strategy, in which we combine the SHAP values for thematically related

features. Feature importance within a group is quantified as the weighted sum of absolute SHAP values. The

combined feature importance within a group is represented even if the importance gets assigned to one variable

or split across multiple variables depending on the model structure. The results and discussion are based on those

variables that show the highest SHAP, either individually, or in groups.

2.4. Kruskal Wallis and Dunn's Test

As an output of the binary classification problem of CRS participation probability estimation we obtain the

SHAP values representing the changes observed for the communities' participation probability in the CRS. If

a community was less likely to participate because of a specific feature, such feature would be attributed with

negative SHAP values more frequently. Positive SHAP values represent a shift toward a higher participation

probability, and SHAP values close to zero (±0.01) indicate no change in participation probability. To understand

if the effects of a feature on communities' participation in the CRS as quantified by SHAP values are statistically

significant, that is, the distributions of that feature across different SHAP groups (positive, negative, null) are

statistically different, we performed a Kruskal Wallis (KW) test (Kruskal & Wallis,1952) and post-hoc Dunn's

test (Dunn,1964). The KW test is a non-parametric test to evaluate the similarity of variances and estimate if

the samples of a feature originate from the same distribution when taken from different SHAP groups. The

effect size describes how strongly the groups differ from each other by comparing the feature value variances.

Prior to running the KW test we apply a floodzone correction, dividing the percentage of inhabitants that are

characterized by a certain feature by the floodplain area in a census tract. We further apply a post-hoc Dunn's

test (Dunn,1964) on the variables with significant results in the KW test to identify which groups exactly show

significant differences with each other (Text S1.3 in Supporting InformationS1).

3. Results

3.1. Flood Insurance Coverage and CRS Participation Model Performance

We here present results both for an aggregate model trained over the entire data set at the continental US scale,

and models fitted individually for each state. The aggregate US LightGBM model we implement to predict

flood insurance coverage at the census tract level and identify its significant drivers attains a very good train-

ing performance (R

2=0.987, RMSE=0.004, MAE=0.001) and satisfactory test performance (R

2=0.677,

RMSE=0.019, MAE=0.003). For further model performance analysis broken down by individual US states

please refer to Figure S1 and Table S1 in Supporting InformationS1, which show that our model generalizes

well for the majority of the US states, even if overfitting occurs in a small number of cases (e.g., Pennsylvania).

This result demonstrates the ability of our ML approach to model overall flood insurance purchase patterns at the

continental scale. Conversely, flood insurance coverage models fitted to individual states achieve varied accu-

racy levels, suggesting that some local effects might not be captured by the set of predictors we included in our

analysis. Notably, R

2 higher than 0.6 (obtained for the US states included in Figure1) are achieved for high-risk

US states such as Georgia (test R

2=0.81) and Louisiana (test R

2=0.78). Model performance results confirm

that despite the existence of several climate zones and heterogeneous socio-demographic characteristics among

different states in the continental US, a relation between flood insurance coverage and local socio-economic and

behavioral drivers exists.

3.2. Flood Insurance Policies Are Purchased as a Reactive Flood Adaptation Behavior

Individual state-level and aggregate US model alike indicate that flood history is the most important predictor

of flood insurance coverage (Figure1). The top-ranked flood history feature summarizes both flood frequency

Earth’s Future

VEIGEL ETAL.

10.1029/2023EF003571

5 of 11

(regular) and severity (severe) (see Section2). Both the feature ranking in Figure1 and spatial patterns of flood

severity, flood frequency, and their SHAP values in Figures2a–2e reveal that flood insurance purchase is a reac-

tive behavior, triggered by the occurrence of severe or frequent flooding events.

Flood insurance policies are either covering damage to buildings or to their contents. This is the second most

important determinant of flood insurance coverage (insurance type in Figure1). If the majority of insurance poli-

cies in a census tract covers building damage, the SHAP value for this variable increases, that is, there is a strong

correlation with overall flood insurance purchase.

Spatial patterns of population density also correlate with flood insurance purchase, with lower insurance cover-

age associated with high population density, that is, in urban areas (population in Figure1). Typically metropol-

itan areas experience high turnover of residents leading to lesser knowledge of the flood history in those areas.

Therefore urban areas should be in the focus of resilience increasing strategies.

Conversely, other socio-economic backgrounds only marginally correlate with insurance coverage. Features

related to income, structural characteristics of buildings (structural in Figure1), education, and overall living

situation are ranked with lower importance, as shown by the lowest weighted sum of absolute SHAP values.

3.3. Community-Level Policies Foster Proactive Adaptation

The results in Figure1 show that the CRS stands out for positively influencing insurance coverage in a census

tract and helps overcome reactive behaviors, being ranked right after flood history and flood insurance type.

People are more likely to purchase a flood insurance to protect their property and belongings if the community

they live in participates in the CRS. Therefore we implement a second model to classify which communities

participate or do not participate in the CRS. The CRS follows a categorization system with 10 categories with

Figure 1. Feature importance for individual state models and an aggregated US model. Feature importance is quantified

as the weighted sum of absolute SHapley Additive exPlanations (SHAP) values, ranked from top to bottom in descending

order based on the values obtained for the aggregate US model. The graph shows feature groups including age, gender, and

population density (population), the area within floodzones (environment), work-related characteristics such as the average

daily working hours (work), and information about the building (structural). Only individual-state models that achieved a

training performance with R

2>0.6 are selected for representation. Maximum number of affected people or annual average

number of floods (combined in the feature group of flood history) is the most relevant determinant of flood insurance

purchase for all states.

Earth’s Future

VEIGEL ETAL.

10.1029/2023EF003571

6 of 11

incremental reduction in insurance premiums (Figure2c). In the results from the insurance coverage simulation

(Figure2e) we find that the magnitude of increase in insurance purchase is similar for all positive CRS classes.

Thus, the increase in insurance adoption does not depend on the specific CRS class achieved by a community,

but it rather correlates with whether the communities participate at all. However, participation in the CRS is not

selected as an important predictor of flood insurance in some states such as Rhode Island, Michigan, and Okla-

homa (Figure1). These states had low overall participation rates (FEMA,2021c), which results in low absolute

changes in insurance coverage attributed to the CRS. CRS premium reductions are higher within the SFHA.

When we look at the changes of predicted insurance coverage for the proportion of SFHA in a census tract by

CRS class we see an equal increase of flood insurance coverage for census tracts rated in all CRS classes (Figure

S1 in Supporting InformationS1). This indicates that the amount of subsidization within the CRS does not neces-

sarily affect the insurance coverage.

3.4. Characteristics of CRS Communities Are Heterogeneous

Using a second model to classify the communities in the CRS, we analyze statistically significant correlations

between communities' probability of participation in the CRS and their socio-economic characteristics via KW

test (see Section2). The features spatially represented in Figure3 with their corresponding average SHAP values

are a subset of grouped socio-economic variables that significantly correlate with different levels of probability

of participation in the CRS. The amount of meaningful associations represented in Figure3 shows that the CRS

altogether covers a wide range of socio-economic backgrounds and that a variety of socio-economic variables

Figure 2. Feature values and related SHapley Additive exPlanations (SHAP) values for the number of people affected per flood (a–d), the average annual number

of floods (b–e), and Community Rating System (CRS) average class (c–f). SHAP values are computed from modeling insurance coverage in each census tract. The

maximum number of people affected by a flood (a) is skewed toward the hurricane-affected coastal communities. This historical flood severity directly translates

to higher insurance coverage values (d). Flood Frequency (b) shows similar patterns with a reduced magnitude and high SHAP values centered around fluvial flood

zones, such as the Lower Mississippi and the Minnesota Red River (e). The features related to flood history (flood severity and flood frequency) are the most important

predictors of insurance coverage, followed by the CRS class (c) that leads to increased insurance coverage in participating communities (f).

Earth’s Future

VEIGEL ETAL.

10.1029/2023EF003571

7 of 11

correlates with communities' participation in the CRS. However, some patterns stand out. The overall high-

est SHAP values (darker color opacity in Figure3) originate from flood history as well as population density,

consistently with the feature importance ranking reported in Figure1. Flood history contributes to an increase

in CRS participation probability between 50% and 100%. Conversely, the observed SHAP values for popula-

tion density see not to suggest any influence on CRS participation (opaque black color in Figure3). However,

their high average SHAP (low opacity) originates from the combined effect of decreased participation rates in

rural areas and participation increase in semi-urban areas (Figures S3, S6, and Table S2 in Supporting Informa-

tionS1). According to the CRS classification model, census tracts with a history of severe flooding are more

likely to be participating in the CRS. The insurance coverage regression model shows that they also have a higher

insurance coverage. Evaluation of the CRS participation as a predictor in the insurance coverage model shows

the multi-dimensional relationship between CRS class, past events, and insurance adoption (see Figure S4 in

Supporting InformationS1). In areas were past floods were frequent, those communities that participate in the

CRS show a higher insurance coverage. Conversely, with increasing number of people affected by past floods,

communities that do not participate in the CRS show a slightly higher insurance coverage. Within our model

for CRS participation we can identify the socio-economic characteristics of communities that participate in the

CRS. Those communities show higher insurance coverage values, despite having experienced less flooding in the

past. Multiple socio-economic predictors (i.e., digital access, education, housing, income, inequality, living situ-

ation, ethnicity, and structural characteristics of buildings) are also statistically correlated with CRS participation

probability. However, their impact along this correlation is marginal compared to the shift in CRS participation

probability caused by the above external factors (flood history and population density).

Figure 3. Socio-economic and external features grouped by type and their impact on communities' probability of participation in the Community Rating System

(CRS). We train a Gradient Boosting Decision Trees (GBDT) classifier that successfully simulates and predicts CRS participation (training and testing accuracy

94%, see Figure S2 in Supporting InformationS1). We analyze the output of the GBDT classifier with the most detailed aggregation level separating the features

thematically. The impact of a feature on communities' CRS participation probability is categorized based on its SHAP value in lower participation (blue), higher

participation (red), and no change (black). The strength of this impact is represented by the absolute average SHAP value for that feature (color opacity). Severe is a

continuous variable that quantifies the number of people affected per flood, while regular refers to the annual number of floods. Both of the features are combined in

the flood history variable. High feature values of each variable indicate either more severe flood characteristics or more regular flooding. If the highest mean feature

values are associated with negative SHAP values (before transformation to absolute SHAP value), the feature group is appearing in blue and inversely correlates

with participation in CRS (lower participation). If the highest mean feature values are associated with positive SHAP values, the feature is marked in red and directly

correlates with higher CRS participation. Average absolute SHAP values close to zero indicate no change in CRS participation. Only variables that were found to

significantly correlate with different levels of probability of participation in the CRS are represented (see Section2). Feature values are divided by the area inside the

floodplain in the respective census tract.

Earth’s Future

VEIGEL ETAL.

10.1029/2023EF003571

8 of 11

3.5. Effects of Socio-Economic Structure on CRS Participation

Insights about the correlation of socio-economic variables with CRS participation (from the CRS classification

model) emerge from the statistically significant differences in feature distributions displayed in Figure3 and

grouped by SHAP value (see also Figure S7, Tables S3, and S4 in Supporting InformationS1). To further assess

the magnitude, or strength, of their effect on CRS participation beyond averaged SHAP values, we further analyze

their effect size (see Section2). The features we find to have very high differences between the groups (KW effect

size f>0.998, corresponding to the 0.8 quantile of all significant effect sizes) are external factors. Thus, consist-

ently with our previous analysis, the areas with past flood experience of severe or regular events are more likely

to be participating in the CRS. However, also features indicating vulnerable communities (structural deficits,

population density, and no digital access), describing the amount and type of rented places (high rental prices,

rented properties), and employment type displayed significant effect sizes, suggesting that CRS can help over-

come inequalities. Finally, both high and low property values are associated with higher participation probability,

reflecting the accumulation of wealth in floodplains, the overvaluation of housing prices inside floodplains (Hino

& Burke,2021), and the increased exposure of low-income population (Wing etal.,2022).

4. Discussion

The findings from our data-driven analysis advance knowledge on human behavior in relation to flood risk with

two key insights. First, previous studies have shown that vulnerable groups are systematically more exposed to

floods (Knighton etal.,2020; Wing etal.,2022). Thus, there is a necessity for more vulnerable inhabitants of

areas that are more exposed to floods in the US to increase their resilience to a higher degree (e.g., BIPoC resi-

dents (Knighton etal.,2021; Wing etal.,2022)).

We observe that community-level policies such as the CRS can overcome such inequality bias and foster proactive

adaptation across heterogeneous socio-economic backgrounds, especially in areas with a record of experienced

severe floods. We show evidence that the CRS constitutes a useful framework to encourage increased resilience

for different types of communities. The observed patterns of unequal exposure imply a necessity for targeted

risk communication. We find that the CRS is not targeting specific groups, but instead it covers a wide range

of socio-economic characteristics. However, the CRS should improve its focus toward low income and BIPoC

population segments to further balance existing inequalities. Future research could help understand more specific

varying exposure to evaluate where to prioritize efforts to increase resilience via top-down and bottom-up disaster

risk reduction (DRR) measures.

Second, previous studies demonstrated that individuals' ability of making informed and rational decisions to

maximize their utility is hampered, particularly in high risk scenarios that occur with a low probability, for

example, flooding (Conlisk,1996). Furthermore, subsidization has been proven ineffective in improving flood

insurance uptake (Kunreuther & Pauly,2004), leaving risk communication as an alternative option to encourage

informed insurance decisions (Robinson & Botzen,2019). Our results confirm and generalize this behavioral

pattern on a continental scale: insurance purchase is triggered by previous flood experience and by participating

in the CRS but does not rise consistently within the classes, indicating that further subsidization does not encour-

age risk-averse behavior alone. We do not see a spread in insurance adoption between census tracts that expect

higher reductions of insurance premiums (larger areas within the SFHA), our results suggest that the reduction

of premiums is not the defining factor for higher insurance coverage in CRS communities. Rather, improved risk

communication, which is part of the CRS point collection system, could make a difference in rising insurance

coverage.

The unexplained portion of behaviors we model, that is, the residual in our models, might be emotionally driven

by non-tangible anticipated or immediate emotions as suggested by Loewenstein and Lerner(2003). However,

future research is needed to investigate these emotional components, when existing, as they cannot be described

through the socio-economic variables and at the scale and level of aggregation considered in this study. The

complexity of behavioral drivers is a limiting factor for model performance. The large-scale nature of this study

also introduces some uncertainty with regards to collinearity in the data. In the case of localized collinearity

unaccounted for by our method, the SHAP values could be proportionately reduced. To mitigate this effect we

apply the predictor grouping strategy to distribute the importance between potentially correlated variables and

therefore we can draw conclusions for the whole US. Future research could focus on evaluating correlations that

Earth’s Future

VEIGEL ETAL.

10.1029/2023EF003571

9 of 11

might occur within smaller administrative units and how they relate to our findings. Further, inside the SFHA

households with governmentally backed mortgages are required to purchase an insurance with the NFIP.In some

cases the recipients of post-disaster aid may be required to hold an insurance for a limited amount of time. In

this study we still include this small fraction of policy holders in the analysis, although their decision to acquire

insurance is not self motivated. Furthermore, we do not aim to assume causality within this framework. However,

our findings that areas in which households have previously been exposed to flooding are more likely to purchase

insurance and that subsidization is not a major factor driving insurance purchase are in line with existing research

(Cannon etal.,2020; Kunreuther & Pauly,2004).

With the SHAP values and LightGBM we develop an interpretable ML-based analysis that minimizes biases

because of the data-driven model structure without conceptualizations/hypothesis. Our results regarding

the overall marginal impact of socio-economics could indicate a potential confirmation bias in conceptual or

socio-hydrologic models. Due to a selection of socio-economic variables performed before model building and

implementation based on prior expectation of the human-environment system, many existing models may show

significant relationship that are inherent in their structure which is based on conceptualization. However, the error

introduced through location redaction of NFIP insurance policies data, along with the potentially heterogeneous

composition of socio-economic variables in a census tract that might be unevenly associated with the local insur-

ance coverage after spatial aggregation at the census tract level are two limitations in this study deriving from the

data processing sequence we implemented.

Since our results show that the increase of insurance purchase is similar for all CRS classes, we can conclude

that even simple methods of DRR (e.g., making flood maps available to the community) can make an impact to

improve individuals' motivations to undertake resilience actions. Our results suggest that the strategies and struc-

ture of the CRS has been so far successful in promoting insurance purchase throughout the US and supported

people of all socio-economic backgrounds. The observation that with increasing CRS class there is no strong

increase in insurance purchase could also imply that there is a limit to which communities can be motivated to

take up an insurance. The defining factor of improving insurance adoption seems to be the participation in the

CRS and slightly lowered insurance rates. However, the extent to which the insurance rates are lowered is not

decisive for insurance purchase. These findings are further supported by the observation that income is not a

significant predictor for flood insurance coverage, indicating that financial constraints on a household level do

not have a predominant role in driving the decision to purchase insurance policies.

Our research shows that currently the purchase of flood insurance is mainly due to reactive behavior after severe

floods. Affected people learn from experiencing a flood and improve their resilience via insurance. However, the

CRS effectively motivates proactive insurance adoption and can increase community resilience in areas where

extreme events have not occurred recently. We find that the CRS covers a wide range of socio-economic back-

grounds in a non-discriminatory way and supports both, vulnerable communities (e.g., with low education, high

BIPoC population) and less vulnerable communities (e.g., with high housing value, high median rent). Since, for

instance, the BIPoC population faces the largest increase in future flood exposure (Knighton etal.,2021; Wing

etal.,2022), they would need most help.Thus, the CRS should address and balance such existing inequalities, in

focusing on high risk areas with an increasingly exposed and vulnerable population.

Data Availability Statement

All data used in this study is available from the U.S. Census Bureau and the Federal Emergency Management

Agency (U.S. Census Bureau,2018; FEMA,2019,2021a,2021b,2021c). The R code used to generate the results

reported in this article is available at Veigel etal.(2023).

References

Aerts, J. C., Botzen, W. J., Clarke, K. C., Cutter, S. L., Hall, J. W., Merz, B., etal. (2018). Integrating human behaviour dynamics into flood

disaster risk assessment. Nature Climate Change, 8(3), 193–199. https://doi.org/10.1038/s41558-018-0085-1

Atreya, A., Ferreira, S., & Michel-Kerjan, E. (2015). What drives households to buy flood insurance? New evidence from Georgia. Ecological

Economics, 117, 153–161. https://doi.org/10.1016/j.ecolecon.2015.06.024

Bradley, A. P. (1997). The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognition, 30(7),

1145–1159. https://doi.org/10.1016/S0031-3203(96)00142-2

Bubeck, P., Botzen, W. J., & Aerts, J. C. (2012). A review of risk perceptions and other factors that influence flood mitigation behavior. Risk

Analysis, 32(9), 1481–1495. https://doi.org/10.1111/j.1539-6924.2011.01783.x

Acknowledgments

The authors would like to thank the

Helmholtz Einstein International Berlin

Research School in Data Science

(HEIBRiDS) for supporting this project

and the research environment that made

it possible. We are grateful for the

computing resources provided by the HPI

FutureSOC Lab. Open Access funding

enabled and organized by Projekt DEAL.

Earth’s Future

VEIGEL ETAL.

10.1029/2023EF003571

10 of 11

Cannon, C., Gotham, K. F., Lauve-Moon, K., & Powers, B. (2020). The climate change double whammy: Flood damage and the determinants

of flood insurance coverage, the case of post-Katrina New Orleans. Climate Risk Management, 27(September 2019), 100210. https://doi.

org/10.1016/j.crm.2019.100210

Conlisk, J. (1996). Why bounded rationality? Journal of Economic Literature, 34(2), 669–700.

Cremades, R., Surminski, S., Máñez Costa, M., Hudson, P., Shrivastava, P., & Gascoigne, J. (2018). Using the adaptive cycle in climate-risk

insurance to design resilient futures. Nature Climate Change, 8(1), 4–7. https://doi.org/10.1038/s41558-017-0044-2

Dunn, O. J. (1964). Multiple comparisons using rank sums. Technometrics, 6(3), 241–252. https://doi.org/10.1080/00401706.1964.10490181

FEMA. (2019). FIMA NFIP redacted claims—FEMA.gov [Dataset]. FEMA. Retrieved from https://www.fema.gov/about/openfema/

data-sets#nfip

FEMA. (2020). FIMA NFIP redacted policies—V1—FEMA.gov [Dataset]. FEMA. Retrieved from https://www.fema.gov/about/openfema/

data-sets#nfip

FEMA. (2021a). Community rating system overview and participation. Retrieved from https://www.fema.gov/fact-sheet/community-rating-system-

overview-and-participation

FEMA. (2021b). FEMA flood map service center [Dataset]. FEMA. Retrieved from https://msc.fema.gov/portal/home

FEMA. (2021c). Information about the community rating system [Dataset]. FEMA. Retrieved from https://www.fema.gov/case-study/

information-about-community-rating-system

Gallagher, J. (2014). Learning about an infrequent event: Evidence from flood insurance take-up in the United States. American Economic Jour-

nal: Applied Economics, 6(3), 206–233. https://doi.org/10.1257/app.6.3.206

Guo, X., Yin, Y., Dong, C., Yang, G., & Zhou, G. (2008). On the class imbalance problem. In 2008 Fourth international conference on natural

computation (Vol. 4,pp.192–201).

Hino, M., & Burke, M. (2021). The effect of information about climate risk on property values. Proceedings of the National Academy of Sciences

of the United States of America, 118(17). https://doi.org/10.1073/pnas.2003374118

Horn, D. P., & Brown, J. T. (2018). Introduction to the National Flood Insurance Program (NFIP). Congressional Research Service, 3–4.

Retrieved from https://fas.org/sgp/crs/homesec/R44593.pdf

Hu, Z. (2020). Salience and households’ flood insurance decisions. SSRN Electronic Journal. https://doi.org/10.2139/SSRN.3759016

Jongman, B., Winsemius, H. C., Aerts, J. C., Coughlan De Perez, E., Van Aalst, M. K., Kron, W., & Ward, P.J. (2015). Declining vulnerability to

river floods and the global benefits of adaptation. Proceedings of the National Academy of Sciences of the United States of America, 112(18),

E2271–E2280. https://doi.org/10.1073/PNAS.1414439112

Ke, G., Meng, Q., Finley, T., Wang, T., Chen, W., Ma, W., etal. (2017). LightGBM: A highly efficient gradient boosting decision tree. Advances

in Neural Information Processing Systems, 30, 3146–3154.

Knighton, J., Buchanan, B., Guzman, C., Elliott, R., White, E., & Rahm, B. (2020). Predicting flood insurance claims with hydrologic and socio-

economic demographics via machine learning: Exploring the roles of topography, minority populations, and political dissimilarity. Journal of

Environmental Management, 272, 111051. https://doi.org/10.1016/j.jenvman.2020.111051

Knighton, J., Hondula, K., Sharkus, C., Guzman, C., & Elliott, R. (2021). Flood risk behaviors of United States riverine metropolitan areas are

driven by local hydrology and shaped by race. Proceedings of the National Academy of Sciences of the United States of America, 118(13),

2021. https://doi.org/10.1073/pnas.2016839118

Koerth, J., Vafeidis, A. T., & Hinkel, J. (2017). Household-level coastal adaptation and its drivers: A systematic case study review. Risk Analysis,

37(4), 629–646. https://doi.org/10.1111/risa.12663

Kruskal, W. H., & Wallis, W. A. (1952). Use of ranks in one-criterion variance analysis. Journal of the American Statistical Association, 47(260),

583–621. https://doi.org/10.1080/01621459.1952.10483441

Kuang, D., & Liao, K. H. (2020). Learning from floods: Linking flood experience and flood resilience. Journal of Environmental Management,

271(June), 111025. https://doi.org/10.1016/j.jenvman.2020.111025

Kundzewicz, Z. W., Hegger, D. L. T., Matczak, P., & Driessen, P.P.J. (2018). Flood-risk reduction: Structural measures and diverse strategies.

Proceedings of the National Academy of Sciences, 115(49), 12321–12325. https://doi.org/10.1073/pnas.1818227115

Kunreuther, H., & Pauly, M. (2004). Neglecting disaster: Why don’t people insure against large losses? Journal of Risk and Uncertainty, 28(1),

5–21. https://doi.org/10.1023/b:risk.0000009433.25126.87

Loewenstein, G., & Lerner, J. S. (2003). The role of affect in decision making. In Handbook of affective sciences (pp.619–642).

Lundberg, S. M., Erion, G., Chen, H., DeGrave, A., Prutkin, J. M., Nair, B., etal. (2020). From local explanations to global understanding with

explainable AI for trees. Nature Machine Intelligence, 2(1), 56–67. https://doi.org/10.1038/s42256-019-0138-9

Lundberg, S. M., & Lee, S.-I. (2017). A unified approach to interpreting model predictions. In Advances in neural information processing

systems, 2017-December (pp.4766–4775). Retrieved from https://arxiv.org/abs/1705.07874v2

McPhillips, L. E., Chang, H., Chester, M. V., Depietri, Y., Friedman, E., Grimm, N. B., etal. (2018). Defining extreme events: A cross-disciplinary

review. Earth's Future, 6(3), 441–455. https://doi.org/10.1002/2017EF000686

Robinson, P.J., & Botzen, W. W. (2019). Determinants of probability neglect and risk attitudes for disaster risk: An online experimental study of

flood insurance demand among homeowners. Risk Analysis, 39(11), 2514–2527.

Sadiq, A.-A., Tyler, J., Noonan, D. S., Norton, R. K., Cunniff, S. E., & Czajkowski, J. (2019). Review of the federal emergency management agen-

cy’s community rating system program. Natural Hazards Review, 21(1), 03119001. https://doi.org/10.1061/(ASCE)NH.1527-6996.0000320

Sanders, B. F., Schubert, J. E., Goodrich, K. A., Houston, D., Feldman, D. L., Basolo, V., etal. (2020). Collaborative modeling with fine-resolution

data enhances flood awareness, minimizes differences in flood perception, and produces actionable flood maps. Earth's Future, 8(1), 1–23.

https://doi.org/10.1029/2019EF001391

U.S. Census Bureau. (2018). 2018 American Community Survey 5-year public use microdata samples [Dataset]. Census Data API. Retrieved

from https://api.census.gov/data/2018/acs/

Veigel, N., Kreibich, H., & Cominola, A. (2022). A gradient boosting approach to identify behavioral and policy determinants of flood resilience

in the continental US. IFAC-PapersOnLine, 55(33), 85–91. (2nd IFAC Workshop on Control Methods for Water Resource Systems CMWRS

2022). https://doi.org/10.1016/j.ifacol.2022.11.014

Veigel, N., Kreibich, H., & Cominola, A. (2023). Code for interpretable machine learning reveals potential to overcome reactive flood adaptation

in the continental US [Code]. Zenodo. https://doi.org/10.5281/zenodo.8067448

Wing, O. E., Lehman, W., Bates, P.D., Sampson, C. C., Quinn, N., Smith, A. M., etal. (2022). Inequitable patterns of US flood risk in the Anthro-

pocene. Nature Climate Change, 12(2), 156–162. https://doi.org/10.1038/S41558-021-01265-6

Zhou, H., Qian, W., & Yang, Y. (2020). Tweedie gradient boosting for extremely unbalanced zero-inflated data. Communications in Statistics

Simulation and Computation, 51(9), 1–23. https://doi.org/10.1080/03610918.2020.1772302

Earth’s Future

VEIGEL ETAL.

10.1029/2023EF003571

11 of 11

References From the Supporting Information

Hijmans, R. J. (2021). raster: Geographic data analysis and modeling [Software]. (R package version 3.5-2). Retrieved from https://CRAN.R-

project.org/package=raster

Ke, G., Soukhavong, D., Lamb, J., Meng, Q., Finley, T., Wang, T., etal. (2021). lightgbm: Light gradient boosting machine [Software]. (R package

version 3.2.1). Retrieved from https://CRAN.R-project.org/package=lightgbm

Liu, Y., & Just, A. (2021). Shapforxgboost: Shap plots for ‘xgboost’. [Software]. (R package version 0.1.1). Retrieved from https://CRAN.R-

project.org/package=SHAPforxgboost

Pebesma, E. (2018). Simple features for R: Standardized support for spatial vector data [Software]. The R Journal, 10(1), 439–446. https://doi.

org/10.32614/RJ-2018-009

R Core Team. (2021). R: A language and environment for statistical computing [Software]. Vienna, Austria. Retrieved from https://www.R-

project.org/

Tomczak, M., & Tomczak, E. (2014). The need to report effect size estimates revisited. an overview of some recommended measures of effect

size. Trends in Sport Sciences, 1(21), 19–25.

Wickham, H. (2016). ggplot2: Elegant graphics for data analysis [Software]. Springer-Verlag. Retrieved from https://ggplot2.tidyverse.org

Wickham, H., Averick, M., Bryan, J., Chang, W., McGowan, L. D., François, R., etal. (2019). Welcome to the tidyverse. Journal of Open Source

Software, 4(43), 1686. https://doi.org/10.21105/joss.01686