Article

April 2021

Estimating state and local employment in recent disasters—from Hurricane Harvey to the COVID-19 pandemic

Natural disasters, including hurricanes, floods, wildfires, and the coronavirus disease 2019 (COVID-19) pandemic, have challenged the standard practices used to produce state and area employment estimates. In some cases, these challenges have led to modifications to the handling of reported business closures, assumptions regarding nonresponse, and the techniques used for modeling employment in domains with small samples for state and metropolitan areas. This article examines how a series of major hurricanes in 2017 and 2018 affected the estimation of state and metropolitan area payroll employment and how lessons learned from these disasters provided a playbook for producing estimates during the COVID-19 pandemic.

The U.S. Bureau of Labor Statistics (BLS) Current Employment Statistics (CES) program produces some of the timeliest economic indicators each month, both for the nation as a whole and for varying levels of geographic detail. The CES program conducts a monthly survey of about 144,000 businesses and government agencies representing about 697,000 individual worksites. Respondents to the CES survey provide information on the number of employees, total payroll, and hours paid for the pay period that includes the 12th day of the month. The CES program uses these data to produce estimates of industry employment, hours, and earnings. BLS typically publishes national data within a week of the end of the reference month; data for the 50 states; Washington, DC; Puerto Rico; the U.S. Virgin Islands; and about 450 metropolitan areas are released approximately 2 weeks later.

Natural disasters challenge assumptions built into the employment estimation process regarding nonresponse and the relationship between business openings and closures, and the CES program must address this issue in order to produce accurate data. This article provides an overview of three aspects of the CES methodology that natural disasters confront. It then provides case studies of how the CES program dealt with these questions in developing state and metropolitan area estimates following five recent hurricanes—Harvey, Irma, and Maria in 2017, and Florence and Michael in 2018—and how the lessons learned from the earlier experience were applied to producing estimates in the early months of the coronavirus disease 2019 (COVID-19) pandemic. (See table 1.)

Table 1. Selected natural disasters that affected state and local employment data
Disaster	Analyzed effect in Current Employment Statistics estimates
Hurricane Harvey	September 2017, Texas
Hurricane Irma	September 2017, Florida
Hurricane Maria	October 2017, Puerto Rico
Hurricane Florence	September 2018, North Carolina and South Carolina
Hurricane Michael	October 2018, Florida and Georgia
COVID-19 pandemic	March and April 2020, 50 states, plus Washington, DC, and Puerto Rico
Source: U.S. Bureau of Labor Statistics. Note: COVID-19 = coronavirus disease 2019.

CES methodology and natural disasters

The CES program surveys businesses each month and uses the data and a consistent, objective methodology to produce estimates of industry employment, hours, and earnings. The methodology relies on assumptions that are continually evaluated and generally hold in normal times. However, natural disasters are extreme events, and this section explores three ways that such disasters raise questions about the usual CES methodological underpinnings.

Are survey respondents different from nonrespondents?

The CES survey is designed as a probability-based survey.¹ The BLS Quarterly Census of Employment and Wages (QCEW), which contains information on all employers required to participate in the unemployment insurance (UI) system and covers about 97 percent of all nonfarm payroll employment, provides both a sampling frame and the main benchmark source for the CES program.² In developing the QCEW as a sample frame, BLS stratifies the data by state, industry, and establishment size. The CES survey randomly selects businesses within these strata, with the inverse of the probability of selection used as the sample weight in estimation. The survey estimates are benchmarked to the administrative QCEW data annually. The difference between the benchmark data and the survey-based estimates—the benchmark revision—is regularly used to assess the accuracy of the CES estimates. After setting benchmark employment levels each year, the CES program estimates monthly employment growth rates with a weighted link-relative estimator. The estimator is essentially the ratio of the current month’s weighted employment to that of the prior month in the set of matched respondents that reported nonzero employment for the pay period containing the 12th of both months. The generally small number of active survey units reporting zero employment in either the current or prior month is not used because of assumptions in the business birth–death model (discussed in the next subsection).

Equation 1 provides the weighted link-relative estimate () for time t, where is reported employment for a matched respondent i in n matched sample establishments, w is the sampling weight, and d is a nonresponse weight adjustment (calculated for industries identified to respond at substantially higher or lower rates)³:

(1) .

Within each estimating domain (an area or industry combination in which the estimate calculation is performed—other published cells are summaries of estimating cells), the estimator implicitly imputes data for nonrespondents as well as businesses reporting zero employment in the current or prior month by using the growth rates of the matched units. State-level estimates are produced for “estimation supersectors,” high-level industries (such as construction or leisure and hospitality) that are estimated directly and control the sum of estimates of more detailed industries. The sample size in these estimation supersectors is large enough to avoid using small-area modeling techniques in most cases, but domain estimates use respondents in businesses from quite different industries (for example, clothing stores, gas stations, and grocery stores mixed together in retail trade).

The implicit imputation follows from the form of the estimator, which does not require explicit imputation, and a missing-at-random assumption within each estimating cell (conditioned on the industries in d_i). The survey respondents stand in for the nonrespondents, and this assumption works well when they are similar, on average.⁴

Natural disasters challenge the missing-at-random assumption. Certain types of businesses—for example, those in federally designated disaster areas—might have more job losses than others in their state because of business disruptions, and they might respond to the CES survey at a lower rate, as other matters take a higher priority. Similarly, certain industries within an estimation supersector may fare differently from one another during a disaster in a way that correlates with response rates, and the same may be true for differences in employment trends and response rates between large and small establishments.

Are business births and deaths properly captured?

The ability to use the QCEW as a frame with a close match to the target population of all nonfarm payroll jobs is one of the CES program’s greatest strengths. Two of the program’s biggest challenges are accounting for businesses that have opened and closed since the frame was established. It is impossible to sample, enroll, and collect data on business births in real time because the QCEW lags the CES survey by several months. Establishment deaths—defined as worksites that no longer have any payroll employees, which may be temporary or permanent—present a challenge because businesses generally stop reporting when there are no employees. A small number of businesses report establishment deaths; however, they are mainly firms with multiple worksites. These two components of job growth generally offset one another, and the degree to which CES procedures do not capture the residual growth when applied to historical QCEW population data is known as the “net birth–death residual.” This value is generally small and stable, and therefore the CES program forecasts historical values with a time-series model for use in estimation. Equation 2 provides the formula for the employment estimate ():

(2)

The CES program multiplies the prior month’s employment level () by the weighted link relative () and then adds the current month’s forecast value ( ).⁵

Natural disasters strain the usual assumptions about business births and deaths. Disaster conditions may halt the formation of new businesses, while forcing many existing businesses to cease operations. But despite disruptions—and sometimes even severe damage to physical locations—many businesses continue to pay employees through the hardship, mitigating the number of business deaths by CES definitions. Still, the net birth–death residual calculated from the QCEW is often noticeably lower in domains affected by natural disasters, and the forecast error tends to be positive in these cases.

Are small-area models adequate?

The CES program produces estimates with a version of the Fay-Herriot model for estimation supersectors and for many metropolitan areas with small sample sizes.⁶ The model estimate is a weighted combination of the direct weighted link-relative estimate and a synthetic component consisting of a historical trend value corrected by a coefficient derived from the regression model in equation 3:

(3)

Y1_i,j represents the statewide weighted link-relative estimates for a given industrial supersector (i) and state (j), Y2 represents the corresponding historical trend values, is an estimated coefficient, and is a model residual. The regression modeling is performed at a statewide supersector level, and the correction factor ( from the state-level model is linked to detailed metropolitan areas. The Fay-Herriot estimate for metropolitan area k is displayed in equation 4:

(4) .

The weight (W_i_,k) assigned to the direct estimate (Y1_i_,k) is a function of the variance of the direct and trend (Y2_i_,k) components as well as the variance from the state-level model. When the relationship between the direct estimates and the historic trend is looser, the direct estimates receive a higher weight. The correction factor adjusts the historic trend values to the direct estimates roughly on average across the country, but it does not account for any clusters of heterogeneity among states or areas.

This model relies on assumptions about the similarity of industry behavior between areas in order to “borrow strength” across those areas; the model also relies on assumptions about the relationship between statewide and metropolitan area trends.⁷ Hurricanes bring these assumptions into stark relief, as some affected states may be dissimilar to the nation as a whole, and even within those states some areas face devastation while others remain unscathed. Similarly, the COVID-19 pandemic affected jobs in states and cities to widely different degrees, despite broad-based shutdowns and job losses.

Five major hurricanes

Although the CES survey began in 1915, and the program has faced many natural disasters since then, the survey has evolved substantially over the years. The CES program fully implemented a probability-based design for the 50 states and Washington, DC, in 2003, BLS assumed responsibility for producing the state and local estimates in 2011, and estimation procedures and models have continued to evolve since then.⁸ As a result, the 2017–18 hurricanes provided the best context in which to address estimation issues related to the COVID-19 pandemic.⁹

Several major hurricanes made landfall on the Gulf of Mexico and Atlantic coasts, as well as over U.S. territories in the Caribbean, during the 2017 and 2018 hurricane seasons.¹⁰ The timing of these storms matters because anyone who worked or received pay for any portion of the pay period including the 12th day of the month was counted as employed. Hurricane Harvey made landfall at Port Aransas, Texas, as a category-4 storm, on Friday, August 25, 2017, and produced heavy rainfall and severe flooding in coastal Texas and other parts of the Gulf Coast, most notably in Louisiana. The system that became Hurricane Irma also formed in August 2017 and made landfall over several Caribbean islands as a category-5 storm in early September; it struck the Florida Keys as a category-4 storm on Sunday, September 10, 2017, and reached the mainland later that day, continuing north along the Florida peninsula. Hurricane Maria formed in September 2017 and caused severe devastation as it made its way through the Caribbean that month. The center of the then-category-5 storm passed just south of St. Croix in the U.S. Virgin Islands on September 19, 2017, before moving diagonally across Puerto Rico the following day.¹¹ The next year, Hurricane Florence reached category-4 strength and weakened to a category-1 storm, before making landfall on Friday, September 14, 2018, near Wrightsville Beach, North Carolina; it then traveled into South Carolina, causing severe flooding and storm surges in both states. The following month, Hurricane Michael struck the Florida panhandle on Wednesday, October 10, 2018, as a category-5 hurricane.

Hurricanes Harvey and Irma both caused tragic loss of life and rank among the most severe U.S. natural disasters, in terms of property damage. Harvey caused 68 direct deaths in Texas and an estimated $125 billion in property damages in the region. Irma caused 10 direct deaths across three states—7 in Florida, 2 in Georgia, and 1 in South Carolina—and 3 direct deaths in the U.S. Virgin Islands, as well as $50 billion in property damages. The CES survey measured far fewer job losses associated with Harvey than with Irma, however. Texas employment grew in September 2017—adding 14,600 jobs, not far off the prior 12-month average growth of 16,900 jobs—with net job losses in coastal metropolitan areas such as Houston-The Woodlands-Sugar Land, TX (–8,800). The state added 39,300 jobs the following month as businesses bounced back. Florida, by contrast, lost 172,700 jobs in September 2017, before recovering 194,700 jobs the following month. The difference resulted from the timing of the storms. Irma struck Florida at the beginning of the week that included September 12, so workers on a weekly payroll who lost their shifts for 7 days showed up in the estimates as a job loss. By contrast, Harvey hit Texas more than 2 weeks earlier, so only prolonged layoffs resulted in a drop in measured employment in that state.¹² Hurricane Maria, despite hitting Puerto Rico and the Virgin Islands well before the October 2017 reference period, followed closely behind Irma’s strike on the territories and caused lasting devastation that was evident in employment data for a prolonged period.

The birth–death model

In normal months, a small number of businesses report zero employment in the CES survey. The rate of reported establishment “deaths” generally follows a stable, seasonal pattern, and some of the deaths are temporary. Following Hurricanes Irma and Maria, the number of establishments reporting zero employment increased sharply. Table 2 shows the proportion of respondents that reported zero employment and compares the forecasted and actual birth–death residuals (later derived from the QCEW) in disaster months. In order to provide frames of reference, table 2 also shows the average proportion of business deaths in the sample over the 2-year period before the disaster and the average absolute monthly forecast error, in terms of employment. The proportion of reported establishment deaths was double the usual rate in Florida following Hurricane Irma and 7 times the usual rate in Puerto Rico following Hurricane Maria. This indicated that the usual birth–death relationship did not hold, and actual birth–death values from the QCEW—calculated several months after the initial release of the CES estimates—were substantially more negative than the forecast values. The forecast error of employment in Florida following Irma was nearly 4 times the average absolute monthly forecast error, while the forecast error for Puerto Rico following Maria was more than 8 times larger than average. Following other major hurricanes, the proportion of reported establishment deaths was relatively normal, and the birth–death forecasts were closer to their mark and well within normal ranges. (See table 2.)

Table 2. Reported zeros and birth–death forecast errors
Event	Proportion of businesses reporting zero employment		Birth–death forecast error (mean absolute error, 2014–19)
Event	Month	Percent	Month	Value (in thousands)
Harvey (Texas)	September 2017	0.23	September 2017	–0.3 (7.9)
Harvey (Texas)	Average, September 2014, 2015, and 2016	0.15	September 2017	–0.3 (7.9)
Irma (Florida)	September 2017	0.40	September 2017	28.4 (7.5)
Irma (Florida)	Average, September 2014, 2015, and 2016	0.19	September 2017	28.4 (7.5)
Maria (Puerto Rico)	October 2017	2.20	October 2017	13.8 (1.7)
Maria (Puerto Rico)	Average, October 2014, 2015, and 2016	0.32	October 2017	13.8 (1.7)
Florence (North Carolina)	September 2018	0.14	September 2018	4.9 (3.6)
Florence (North Carolina)	Average, September 2015, 2016, and 2017	0.16	September 2018	4.9 (3.6)
Florence (South Carolina)	September 2018	0.21	September 2018	–0.5 (2.1)
Florence (South Carolina)	Average, September 2015, 2016, and 2017	0.20	September 2018	–0.5 (2.1)
Michael (Florida)	October 2018	0.22	October 2018	4.5 (7.5)
Michael (Florida)	Average, October 2015, 2016, and 2017	0.18	October 2018	4.5 (7.5)
Michael (Georgia)	October 2018	0.13	October 2018	2.3 (5.0)
Michael (Georgia)	Average, October 2015, 2016, and 2017	0.18	October 2018	2.3 (5.0)
COVID-19 (50 states, plus Washington, DC, and Puerto Rico)	March 2020	0.46	March 2020	Unavailable
	Average, March 2017, 2018, and 2019	0.18	March 2020	Unavailable
	April 2020	3.40	April 2020	Unavailable
	Average, April 2017, 2018, and 2019	0.22	April 2020	Unavailable
Source: U.S. Bureau of Labor Statistics.

Although reported establishment deaths generally are not used in the estimation process, the seven-fold increase in their rate in Puerto Rico following Hurricane Maria necessitated a different approach. BLS determined the risk of underestimating the number of business deaths to be substantial enough that reported zeros were treated the same as other matched reports and were used in the sample link relative. This change negatively affected the October 2017 employment estimates by 11,800 jobs, nearly offsetting the birth–death forecast error of 13,800. Because many of the closures were temporary, the returns from zero employment were also used in the estimates.

Following Hurricane Irma, the CES program did not initially use reported business deaths in the weighted link-relative estimator for Florida, but after benchmarking the September 2017 employment data, the program recognized that the initial sample-based estimates undercounted the extent of job loss.¹³ The QCEW data contained a large increase in the number of establishments reporting zero employment, and had the sample-based estimates used the reported zeros, the estimated employment level would have been 35,000 lower—more closely tracking the population data—which would have accounted for the birth–death forecast error. (The actual value was 28,000 lower than the forecast value.) The CES program included the returns for businesses that had reported declines to zero following Irma in the sample-based estimates that were produced from the new September 2017 benchmark level, increasing Florida employment estimates in October 2017 by 32,000. Including these returns from zero and making other adjustments to account for reporting differences between the CES survey and the QCEW resulted in October–December 2017 reestimates that would more closely track the eventual benchmark data released in March 2019.

Nonresponse analysis

Following several major hurricanes, concerns arose over the possibility of businesses in severely affected areas responding at lower rates than those in the rest of the state, which would result in a violation of the missing-at-random assumption. The CES program has investigated nonresponse weight adjustments following hurricanes and other disasters such as floods and wildfires dating back to Katrina, in 2005, in order to appropriately represent disaster areas in statewide estimates. Although these adjustments can be calculated for different levels of devastation—for instance using flood maps—the set of counties designated by the Federal Emergency Management Agency (FEMA) as eligible for individual and public assistance has typically defined disaster areas for purposes of nonresponse analysis. The program now regularly monitors response rates and the impact of differential nonresponse in states affected by natural disasters. However, response rates were not significantly lower in disaster areas following the 2017–18 hurricanes, in part because of dedicated data collection efforts. Applying nonresponse adjustments following the 2017–18 hurricanes would have changed estimates little, with no obvious benefit, and therefore the estimates were not adjusted for differential nonresponse.

Small-area estimation

The CES program widely uses variants of the Fay-Herriot model in producing employment estimates, particularly at the metropolitan-area level.¹⁴ In areas affected by recent major hurricanes, the direct employment estimates were lower than the synthetic components 65 percent of the time. This indicates a breakdown in the assumptions of similarity across states and among areas within each state. Chart 1 shows a comparison of the link relatives for the direct and synthetic parts of Fay-Herriot models in metropolitan areas containing counties designated by FEMA as eligible for individual and public assistance. (Three outlier values are not displayed.) The reference line has a slope of 1, and values above that line indicate a direct estimate lower than the synthetic one.

View Chart Data

Chart 1. Comparison of direct and synthetic components in Fay-Herriot model estimates
Hurricane	Direct link relative	Synthetic link relative
Florence
Florence, point 1	0.94811	0.99979
Florence, point 2	0.98895	1.00932
Florence, point 3	0.98387	0.99575
Florence, point 4	1.00000	0.99346
Florence, point 5	1.00021	0.98963
Florence, point 6	0.98272	0.99035
Florence, point 7	0.95403	1.01231
Florence, point 8	0.98479	1.00165
Florence, point 9	0.99221	0.99836
Florence, point 10	1.01326	1.02214
Florence, point 11	1.06331	0.98622
Florence, point 12	0.97549	1.00121
Florence, point 13	0.99428	0.99700
Florence, point 14	0.99827	1.00123
Florence, point 15	1.03701	1.00716
Florence, point 16	0.97410	0.98648
Florence, point 17	1.00914	1.00270
Florence, point 18	0.93720	0.98745
Florence, point 19	1.03020	0.99503
Florence, point 20	0.93417	0.97247
Florence, point 21	0.92330	0.99816
Harvey
Harvey, point 1	0.97018	1.00082
Harvey, point 2	1.00695	0.99916
Harvey, point 3	0.99874	0.99778
Harvey, point 4	1.00247	1.00515
Harvey, point 5	1.00047	1.00876
Harvey, point 6	0.91907	0.99307
Harvey, point 7	0.84136	0.97225
Harvey, point 8	0.99941	0.96977
Harvey, point 9	1.00000	0.99640
Harvey, point 10	1.00598	1.00200
Harvey, point 11	0.98309	0.99564
Harvey, point 12	0.99056	0.99497
Harvey, point 13	1.00000	1.00122
Harvey, point 14	1.00925	0.98644
Harvey, point 15	0.98949	0.99994
Harvey, point 16	0.90932	0.99829
Harvey, point 17	1.18857	1.01965
Harvey, point 18	0.98623	0.98945
Harvey, point 19	1.05961	0.99740
Harvey, point 20	0.99964	1.00013
Harvey, point 21	1.02870	0.99949
Harvey, point 22	0.87394	0.98019
Irma
Irma, point 1	0.80047	1.01281
Irma, point 2	0.99955	1.00233
Irma, point 3	0.98327	0.99634
Irma, point 4	1.05323	1.00735
Irma, point 5	1.00613	1.00517
Irma, point 6	1.00773	1.00776
Irma, point 7	0.93257	1.00511
Irma, point 8	0.96906	1.00296
Irma, point 9	1.04845	1.05325
Irma, point 10	1.03072	1.00342
Irma, point 11	1.03684	1.01387
Irma, point 12	1.00792	0.97620
Irma, point 13	1.16800	1.00364
Irma, point 14	0.96340	0.97241
Irma, point 15	1.00093	1.00624
Irma, point 16	0.96975	0.99829
Irma, point 17	0.94601	0.98997
Irma, point 18	1.01101	0.98995
Irma, point 19	0.98028	1.00199
Irma, point 20	1.04734	0.99590
Irma, point 21	0.97851	0.98745
Irma, point 22	1.00333	1.00789
Irma, point 23	1.00046	1.00134
Irma, point 24	1.00651	1.00054
Irma, point 25	0.88308	0.97349
Irma, point 26	0.98671	0.99418
Irma, point 27	0.99689	1.01167
Irma, point 28	1.00000	1.00761
Irma, point 29	1.00000	1.01786
Irma, point 30	0.99687	1.00200
Irma, point 31	0.98396	1.00544
Irma, point 32	0.90971	0.97839
Irma, point 33	0.99599	1.00333
Irma, point 34	1.00552	1.00839
Irma, point 35	0.98871	0.94909
Irma, point 36	0.99860	0.99852
Irma, point 37	1.00191	1.00147
Irma, point 38	0.97196	1.00499
Irma, point 39	0.90375	1.01399
Irma, point 40	0.99301	1.00775
Irma, point 41	0.99584	1.00719
Irma, point 42	1.00653	1.00340
Irma, point 43	1.00541	0.99722
Irma, point 44	1.01953	1.00642
Irma, point 45	1.00634	1.00102
Irma, point 46	0.94453	1.00065
Irma, point 47	1.00030	1.00163
Irma, point 48	1.00103	0.99729
Irma, point 49	0.97948	0.99352
Irma, point 50	0.99608	0.99998
Irma, point 51	0.99778	1.00270
Irma, point 52	0.98055	0.97492
Irma, point 53	0.98091	0.99702
Irma, point 54	1.00151	0.99156
Irma, point 55	0.99894	0.99951
Irma, point 56	0.99734	1.00288
Irma, point 57	0.93833	1.00029
Irma, point 58	1.00962	1.00201
Irma, point 59	0.95734	0.99333
Irma, point 60	1.00270	0.99871
Irma, point 61	1.00000	0.98920
Irma, point 62	1.04882	1.00878
Irma, point 63	0.99976	1.00284
Irma, point 64	0.90649	1.00597
Irma, point 65	0.98560	1.00050
Irma, point 66	0.90313	1.00564
Irma, point 67	0.98291	1.00643
Irma, point 68	0.99119	1.00706
Maria
Maria, point 1	0.97842	0.98354
Maria, point 2	0.47188	0.98264
Maria, point 3	1.01311	0.99787
Maria, point 4	0.91548	1.02403
Maria, point 5	0.95974	1.01254
Maria, point 6	0.95532	1.00458
Maria, point 7	1.05328	1.03390
Maria, point 8	0.96381	1.01525
Maria, point 9	0.93112	1.00359
Maria, point 10	0.98805	0.99940
Maria, point 11	0.94712	1.01050
Maria, point 12	1.05387	1.01656
Maria, point 13	0.91177	1.01505
Maria, point 14	1.00432	1.02584
Michael
Michael, point 1	1.00254	1.01474
Michael, point 2	1.01139	1.01115
Michael, point 3	1.01629	0.99781
Michael, point 4	1.01365	1.00822
Michael, point 5	0.94168	0.94786
Michael, point 6	1.00122	1.00059
Michael, point 7	0.99491	1.00058
Michael, point 8	1.01698	1.03629
Michael, point 9	1.00320	0.99074
Michael, point 10	1.00925	0.99158
Michael, point 11	0.96577	0.96592
Michael, point 12	1.00729	0.99873
Michael, point 13	1.00161	1.00285
Michael, point 14	1.00491	0.99882
Source: U.S. Bureau of Labor Statistics.

Although the direct estimates showed the job-losing effects of these hurricanes more, on average, than did the model as a whole, the sample in these domains is small and often quite volatile. Had the CES program used direct estimates instead of Fay-Herriot models, the root-mean-square benchmark revision to the link relatives would have been 37 percent higher.

Experiences following the 2017 and 2018 hurricanes led to the development of other modeling approaches that would generate better results and not violate important assumptions. Instead of modeling all statewide supersectors together and applying the results to metropolitan areas, the domains directly affected by hurricanes could be pooled and modeled together. Had it been used, this “hurricane variant” of the Fay-Herriot model would have produced benchmark revisions that were, on average, 12 percent lower, and it would have improved 64 percent of estimates. The variant model would have improved accuracy the most for the hurricanes with the largest effect on estimates, reducing benchmark revisions for Puerto Rico following Maria by 39 percent and for Florida following Irma by 11 percent. (The hurricane variant improved estimates 71 percent of the time for Puerto Rico and 74 percent of the time for Florida.) The hurricane variant model did not perform well for Texas following Harvey: although the correction factor better captured the drop in employment, worse model fit resulted in a much larger weight on the direct components, which would have resulted in higher benchmark revisions. (See table 3.)

Table 3. Root-mean-square benchmark revision of link relatives for Fay-Herriot modeled series in metropolitan areas during major hurricanes
Hurricane	Direct sample-based	Fay-Herriot	Fay-Herriot hurricane variant	Percentage improved by hurricane variant
All (n = 139)	0.056	0.041	0.036	64.7
Florence (n = 21)	0.031	0.027	0.026	61.9
Harvey (n = 22)	0.060	0.021	0.033	31.8
Irma (n = 68)	0.047	0.036	0.032	73.5
Maria (n = 14)	0.102	0.075	0.046	71.4
Michael (n = 14)	0.058	0.056	0.056	71.4
Source: U.S. Bureau of Labor Statistics.

Applying lessons learned to producing estimates during the COVID-19 pandemic

A historically unprecedented number of jobs were lost and regained in the months after the onset of the COVID-19 pandemic.¹⁵ Health officials identified the first case of COVID-19 in the United States in Washington State on January 21, 2020.¹⁶ However, it would be several weeks before community spread of the virus led to severe business disruptions and shutdowns. Many novel data sources show declining activity throughout the month of March 2020,¹⁷ even before the issuance of formal shelter-in-place and stay-at-home orders.¹⁸ The week of March 8–14 marked a turning point in business activity. Restaurant reservation data from OpenTable illustrate the evolving decline in business activity that week: as of Sunday, March 8, reservations were down only 2 percent over the year, but by the following Saturday (March 14), they showed a 42-percent decline.¹⁹ Ohio was the first state to close all public schools on March 12, and by the end of the following week (March 15–21), nearly all public schools had closed,²⁰ air passenger travel had fallen by 75 percent,²¹ and restaurant reservations had dropped by nearly 100 percent. The National Bureau of Economic Research (NBER) Business Cycle Dating Committee determined that the longest economic expansion in U.S history ended in February 2020 and a recession began in March.²² Nationwide job losses in March (–1.7 million) and April (–20.7 million) marked the steepest employment decline in history, and were followed by the two largest monthly job gains ever in May (+2.8 million) and June (+4.8 million). The steepness and suddenness of these job losses, followed by a rapid (if partial) recovery, were more reminiscent to the losses seen after major hurricanes than those seen during a typical recession.

The birth–death model

The nationwide increases in the number of establishments reporting zero employment in March 2020 (2.6 times normal) and April 2020 (15 times normal) could only be compared with the spikes following Irma in Florida (2.1 times normal) and Maria in Puerto Rico (6.9 times normal). Evidence from Irma and Maria indicates that using reported zeros could reduce error. Beginning with the March final and April preliminary estimates, the CES state and area program incorporated “excess” reported zeros in the link relative in calculating the employment estimates. In order to avoid statistical bias, the program applied a weight-reduction adjustment value ranging between 0 and 1 to establishments reporting zero employment. In cases in which the proportion of reported zeros at the state estimation supersector level was at or below normal, the factor was set to 0, and in those cases in which the proportion was many times the usual rate, the factor approached 1. The average weight adjustment factor for a reported zero was 0.603 in March and 0.954 in April. As with Hurricanes Irma and Maria, employment counts also needed to appropriately reflect returns from zero. Weight adjustment factors were also developed for returns, and they were first applied in the May preliminary estimates, as many business reopened. The use of excess reported drops-to and returns-from zero lowered the final sum of state employment estimates by 331,000 in March and 3.0 million in April, while increasing them by 1.0 million in May and 934,000 in June, as many businesses reopened.

In addition, the CES program augmented the time-series forecast component of the net birth–death model with information from the nonzero matched sample as an exogenous regressor in a regression model with autoregressive integrated moving average errors (regARIMA), beginning with the April values.²³ Earlier research indicated that this method could improve forecast accuracy during business cycle turning points, although it was not feasible to implement following hurricanes.²⁴ The CES program updated the forecast models at the national level and controlled the sum of the state subsector forecasts to the new national forecasts, incorporating sample information on the states and industries seeing spikes in reported zero employment to allocate the distribution of differences. This lowered the sum of state birth–death factors by 799,000 in April compared with what they would have been without the regressor. The models in turn processed more positive sample information in May and June 2020 into higher forecast values.

Nonresponse analysis

The COVID-19 recession led to nonresponse concerns about the state and area employment estimates that were different from but related to those explored after hurricanes. Subdividing states into affected and unaffected regions, as is done for a hurricane, was not a promising option because of the geographic pervasiveness of business disruption. The CES program explored making differential-response-rate adjustments based on business size (the hypothesis being that larger firms may be less disrupted and better able to continue paying employees), but the results were unclear and any adjustments may have exacerbated the effects of confounding factors. The program also explored conducting nonresponse analysis based on linked unemployment insurance filings, but doing so did not uncover notable differences between respondents and nonrespondents.

In producing the April 2020 state-level estimates, a major nonresponse problem appeared as certain detailed industries within the estimation supersectors were broadly underrepresented across states in the responding CES sample in a way that correlated with employment trends. In retail trade, clothing and clothing accessories stores closed almost everywhere in April 2020, and these establishments responded to the survey at relatively low rates, while general merchandise stores and building material and garden equipment stores had comparatively resilient employment and responded at higher rates. Within leisure and hospitality, full-service restaurants lost relatively more jobs and responded to the CES survey at lower rates than limited-service restaurants, while in the “other services” supersector, the laundry and personal care services industry showed steep losses and was underrepresented in the responding sample. This nonresponse issue did not arise in the national estimates because they are produced at a detailed industry level using samples from all states, in contrast to the approach used in the state and local estimates, where estimation supersectors pool samples from heterogeneous industries into a discrete geographic domain.²⁵

The CES program took two approaches to nonresponse adjustments for these industries. If it was possible to do so, especially in very large states, the program calculated sample-based estimates at a more detailed level and summed them to replace the estimation supersectors.²⁶ In states for which this approach was infeasible, the CES program calculated nonresponse adjustments similar to those calculated (but not applied) following hurricanes and those obtained by using preidentified nonresponse factors in the link-relative estimator (d_i). Many of the response-rate differences were longstanding, but they had not substantially affected the estimates. In total, nonresponse adjustments served to lower the sum of April 2020 state employment estimates by 461,000 in leisure and hospitality, 413,000 in retail trade, and 73,000 in other services. As jobs returned, the differential nonresponse problem presented in the opposite direction, and correcting for it increased employment estimates.

Small-area estimation

The CES program explored an approach similar to the hurricane variant of the Fay-Herriot model in March 2020. The program considered a scenario in which certain states and metropolitan areas in the Northeast and Pacific Northwest that had early cases of COVID-19 might have substantially larger employment drops in March, in which case it could have been beneficial to pool those areas together and model them separately. This was not the case, however, as job losses that month presented along a continuum, with no clearly identifiable division, and were only loosely related to early case counts. The ordinary Fay-Herriot models were also problematic. The wide range of job losses among states resulted in very high model variance and subsequently near-total reliance on the direct estimates. Although the direct estimates are approximately unbiased, they tend to have high variability when sample sizes are small, and the poor performance of direct estimates following hurricanes in modeled cells indicated the potential for low accuracy.

Instead, BLS generated and in many cases used a small-area model that generalizes the Fay-Herriot model and relaxes many of its assumptions by jointly modeling variance and point estimates and clustering the data.²⁷ This model outperformed the existing Fay-Herriot model in simulations, in terms of benchmark revisions, during both the steep downturn of the 2007–09 recession and the subsequent recovery and expansion. When applied during the COVID-19 recession, metropolitan area estimates were more consistent with statewide values and showed a similar story: compared with the Fay-Herriot estimates, the new model reduced the root-mean-square difference between metropolitan and statewide estimates in the same state or industry by 29 percent in the April 2020 preliminary estimates. The combination of lessened reliance on volatile direct estimates (because of better model fit) and state-specific effects being captured in the synthetic component of the clustered model resulted in the tighter relationship among metropolitan area estimates and corresponding state-level estimates.

Conclusion

Timely economic data is needed to capture rapid shifts in business conditions shortly after they occur, and payroll employment estimates from the CES program are among the timeliest economic indicators available each month for states and metropolitan areas. Natural disasters often cause sudden, steep job losses, but the measurement of employment change following disasters presents challenges. The CES survey must accurately capture the relationship between business openings and closings, account for differences between respondents and nonrespondents, and use robust models to accurately estimate employment in small domains. Five major hurricanes that occurred in 2017 and 2018 brought these concerns to light and resulted in the development of potential solutions. Using reported zeros proved beneficial to the estimation process after the survey showed sharp increases in their number. Modeled estimates could be improved by refining the pool of observations that are processed together.

When the COVID-19 pandemic caused the steepest job losses in U.S. history, the CES program applied the lessons learned from producing estimates following major hurricanes to better capture business deaths, properly represent industries experiencing the most severe declines, and incorporate models that capture important differences between, and commonalities among, states. Using reported zeros proved beneficial following Hurricanes Irma and Maria, and the CES program generalized their use with the wave of business closures that began in March 2020. Although nonresponse was not a major problem after the 2017–18 hurricanes, monitoring processes set up in the wake of those hurricanes helped BLS identify industry-based differential nonresponse adjustments when the steepest job losses in history occurred. Techniques pooling similar observations in Fay-Herriot models showed promise, and the CES program applied a model that more formally clustered the data to produce small-area estimates during the pandemic.

Although job losses during the COVID-19 recession have been far steeper and more sudden than those occurring in typical downturns, the associated measurement challenges highlight areas of potential improvement that could help the CES survey better capture future business-cycle turning points and sudden changes in state and area employment that result from disasters.

ACKNOWLEDGMENTS: Julie Gershunskaya, Greg Erkens, Chris Grieves, and Dan Zhao were essential in solving many of the methodological problems we encountered following recent hurricanes and, in particular, during the COVID-19 pandemic. Paige Schroeder, Brad Rhein, Greg Erkens, and Alba Báez all provided comments that noticeably improved this article. I want to thank them, as well as the CES State and Area program office staff more broadly, who showed a great deal of ingenuity, flexibility, and hard work in producing the estimates.

Suggested citation:

Steven M. Mance, "Estimating state and local employment in recent disasters—from Hurricane Harvey to the COVID-19 pandemic," Monthly Labor Review, U.S. Bureau of Labor Statistics, April 2021, https://doi.org/10.21916/mlr.2021.9

Notes

¹ For more information on the design of the U.S. Bureau of Labor Statistics (BLS) Current Employment Statistics (CES) survey and its evolution, see Laura A. Kelter, “One hundred years of Current Employment Statistics—the history of CES sample design,” Monthly Labor Review, August 2016, https://doi.org/10.21916/mlr.2016.35.

² For more information on the process for benchmarking state and local CES employment data, see Kirk Mueller, “Benchmarking the Current Employment Statistics state and area estimates,” Monthly Labor Review, November 2017, https://doi.org/10.21916/mlr.2017.26.

³ This formula is simplified to remove components that are not relevant to this article. A robust estimation technique is used to identify “atypical” reporters, which are removed from the link and only represent themselves, and reduce the weight on other influential reporters. Employees of religious organizations, which are in scope for CES but not on the sampling frame, are addressed by removing their benchmark level from the prior month’s employment before applying the weighted link relative and then added back to the current month’s level.

⁴ Decompositions of benchmark revisions have indicated nonresponse to not be a principal driver of total survey error in the CES survey. See, for example, Larry L. Huff and Julie G. Gershunskaya, “Components of error analysis in the Current Employment Statistics survey,” Proceedings of the Survey Research Methods Section, American Statistical Association, October 2009, http://www.asasrms.org/Proceedings/y2009f.html; the paper is also available on the BLS website at https://www.bls.gov/osmr/research-papers/2009/st090050.htm. Although the implicit imputation of the weighted link-relative estimator is generally used, there is some explicit imputation in state and area estimates for large nonrespondents that exhibit a stable, seasonal difference from the rest of the universe using historical QCEW data. These explicit imputations are often not made during disasters.

⁵ For more information about the CES net birth–death model, see Kirk Mueller, “Impact of business births and deaths in the payroll survey,” Monthly Labor Review, May 2006, https://www.bls.gov/opub/mlr/2006/05/art4full.pdf.

⁶ Fay-Herriot models are commonly used in producing small-area estimates and were introduced by Robert E. Fay III and Roger A. Herriot in “Estimates of income for small places: an application of James-Stein procedures to census data,” Journal of the American Statistical Association, vol. 74, no. 366, June 1979, pp. 269–277, https://www.jstor.org/stable/pdf/2286322.pdf.

⁷ More formally, this involves distributional assumptions linking the direct estimates, historical trend projections, and actual values from a high-level model to more detailed domains. For more detail, see Julie Gershunskaya, “Estimation for detailed publication levels in the Current Employment Statistics survey,” Proceedings of the Survey Research Methods Section, American Statistical Association, October 2012, https://www.bls.gov/osmr/research-papers/2012/st120190.htm.

⁸ Puerto Rico transitioned from a quota-based design to the current probability-based design with the January 2014 estimates. Although the U.S. Virgin Islands were severely affected by Hurricane Maria, employment estimates for the Virgin Islands are not examined in this article because they employ a quota-based sample with an estimation methodology that differs from that of the rest of the CES program.

⁹ The handling of recent disasters was built upon past experiences, primarily Hurricane Katrina, a thorough examination of which can be found in Molly Barth Garber, Linda Unger, James White, and John Wohlford, “Hurricane Katrina’s effects on industry employment and wages,” Monthly Labor Review, August 2006, https://www.bls.gov/opub/mlr/2006/08/art3full.pdf.

¹⁰ Information on the track and development of these storms and estimates of the damage they caused are from the National Hurricane Center’s Tropical Cyclone Reports; for more information, see https://www.nhc.noaa.gov/data/tcr/.

¹¹ Devastation from Hurricanes Irma and Maria was severe enough that officials in the U.S. Virgin Islands were unable to collect establishment data for several months, resulting in a delay in the release of preliminary September, October, and November 2017 estimates for the territory. Unlike the 50 states and the District of Columbia, the workforce agencies in the U.S. Virgin Islands and Puerto Rico collect the majority of the microdata used in their CES estimates. More information is available at https://www.bls.gov/sae/notices/2017/hurricanes-irma-maria-september-october-november-december-2017-payroll-data-for-puerto-rico-and-the-us-virgin-islands.htm.

¹² An analysis of credit card data by researchers at the Federal Reserve Board indicated that retail trade spending in the areas affected by Hurricanes Harvey and Irma dropped steeply but returned to normal within 2 weeks of the storms making landfall, which helps explain the comparatively smaller effect of Harvey on CES estimates. See Aditya Aladangady Shifrah, Aron-Dine, Wendy Dunn, Laura Feiveson, Paul Lengermann, and Claudia Sahm, “From transactions data to economic statistics: constructing real-time, high-frequency, geographic measures of consumer spending,” Finance and Economics Discussion Series 2019-057 (Board of Governors of the Federal Reserve System, 2019), https://doi.org/10.17016/FEDS.2019.057.

¹³ Some large business deaths were accounted for in Florida following Hurricane Irma, but they solely represented themselves.

¹⁴ Fay-Herriot models are also used in some statewide estimation supersectors, mostly in smaller industries such as mining and logging. Another type of small-domain model is also used by the CES program, primarily in detailed industries.

¹⁵ Coronavirus disease 2019 (COVID-19) is caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). For more information, see “COVID-19 science update” (Centers for Disease Control and Prevention, January 15, 2021), https://www.cdc.gov/library/covid19/01152021_covidupdate.html.

¹⁶ Lena H. Sun and Lenny Bernstein, “First U.S. case of potentially deadly Chinese coronavirus confirmed in Washington state,” The Washington Post, January 21, 2020, https://www.washingtonpost.com/health/2020/01/21/coronavirus-us-case/.

¹⁷ For example, researchers at the Federal Reserve Bank of Minneapolis used smartphone data to create indexes that measure changes in travel and social encounters, which showed broad declines through March and into April 2020. See Victor Couture, Jessie Handbury, Jonathan I. Dingel, Kevin R. Williams, and Allison Green, “Measuring movement and social contact with smartphone data: a real-time application to COVID-19,” Institute Working Paper 35 (Federal Reserve Bank of Minneapolis, Opportunity and Inclusive Growth Institute, July 2020), https://doi.org/10.21034/iwp.35.

¹⁸ There is evidence that the formal shutdown orders only explain a small proportion of the decline in business activity. See Austan Goolsbee and Chad Syverson, “Fear, lockdown, and diversion: comparing drivers of pandemic economic decline 2020,” Working Paper 27432 (Cambridge, MA: National Bureau of Economic Research, June 2020), https://doi.org/10.3386/w27432.

¹⁹ For more information on restaurant reservation data, see the “State of the Industry” page on the OpenTable website at https://www.opentable.com/state-of-industry.

²⁰ Ohio became the first state to close schools on March 12, and 27 states and territories had done so by March 16. By March 25, all U.S. public schools were closed. See “The coronavirus spring: the historic closing of U.S. schools,” Education Week, July 1, 2020, https://www.edweek.org/ew/section/multimedia/the-coronavirus-spring-the-historic-closing-of.html.

²¹ Data on passenger throughput during the COVID-19 pandemic from the Transportation Security Administration are available at https://www.tsa.gov/coronavirus/passenger-throughput.

²² The Business Cycle Dating Committee of the National Bureau of Economic Research determined that a peak in economic activity occurred in February 2020, marking the end of the economic expansion that began in June 2009 and the beginning of a recession. For more information, see “Determination of the February 2020 peak in U.S. economic activity” (Cambridge, MA: National Bureau of Economic Research, June 8, 2020), https://www.nber.org/cycles/june2020.html.

²³ For more information on regression models with autoregressive integrated moving average (regARIMA), see Kathleen M. McDonald-Johnson and Catherine C. Hood, “Outlier selection for RegARIMA models” (U.S. Census Bureau, 2001), https://www.census.gov/ts/papers/asa2001kmm.pdf.

²⁴ For more information on the regression model used to modify birth–death forecasts, see Victoria Battista, “Back to the future: using current regression variables to forecast forward from historical net birth/death employment,” Proceedings of the Government Statistics Section, American Statistical Association, October 2013, https://www.bls.gov/osmr/research-papers/2013/st130140.htm.

²⁵ See Huff and Gershunskaya, “Components of error analysis in the Current Employment Statistics survey.”

²⁶ In retail trade and other services, the post-stratified estimates were produced at the three-digit NAICS subsector level. In leisure and hospitality, the estimates became the sum of arts, entertainment, and recreation; accommodation; full-service restaurants; and the remainder of food services and drinking places. In some cases, QCEW ratios were applied to the March leisure and hospitality estimates to create a prior-month employment level from which to apply the weighted link relative.

²⁷ For more details on the generalized small-area model, see Julie Gershunskaya and Terrance D. Savitsky, “Bayesian nonparametric joint model for point estimates and variances under biased domain variances,” Proceedings of the Survey Research Methods Section, American Statistical Association, November 2019, https://www.bls.gov/osmr/research-papers/2019/st190020.htm.

About the Author

Steven M. Mance
mance.steven@bls.gov

Steven M. Mance is a supervisory economist in the Office of Employment and Unemployment Statistics, U.S. Bureau of Labor Statistics.

Article Citations

Crossref0

top Back to Top

close or Esc Key