Leaf and Bloom Dates Identification 1. Indicator Description Phenology is the study of plant and animal life cycle events in relation to environmental drivers such as weather and climate. Phenological data offer simple, yet sensitive indicators of climate change impacts on ecosystems (Schwartz et al2013). This indicator examines the timing of first leaf dates and flower bloom dates in lilacs and honeysuckle plants in the contiguous 48 states between 1900 and 2020. The first leaf date in these plants relates to the timing of events that occur in early spring, while the first bloom date is consistent with the timing of later spring events, such as the start of growth in forest vegetation. Lilacs and honeysuckles are especially useful plant types for an indicator of spring events because they are widely distributed across most of the contiguous 48 states and widely studied in the peer-reviewed literature. Scientists have very high confidence that recent warming trends in global climate have contributed to the earlier arrival of spring events (IPCC, 2014). Components of this indicator include: • Trends in first leaf dates and first bloom dates since 1900, aggregated across the contiguous 48 states (Figure 1). • A map showing changes in first leaf dates between 1951-1960 and 2011-2020 (Figure 2). • A map showing changes in first bloom dates between 1951-1960 and 2011-2020 (Figure 3). 2. Revision History April 2010: Indicator published. May 2014: Combined original Figures 1 and 2 (leaf and bloom date time series) into Figure 1 and updated with data through 2013. Added Figures 2 and 3. June 2015: Updated indicator with data through 2014. August 2016: Updated indicator with data through 2015. April 2021: Updated indicator with data through 2020. Data Sources 3. Data Sources This indicator is based on leaf and bloom observations that are archived by the USA National Phenology Network (USA-NPN) and climate data that are maintained by the National Oceanic and Atmospheric Administration's (NOAA's) National Centers for Environmental Information (NCEI) (formerly the National Climatic Data Center). Data for this indicator were analyzed using a method described by Schwartz et al. (2013). Technical Documentation: Leaf and Bloom Dates 1 ------- 4. Data Availability Phenological Observations This indicator is based in part on observations of lilac and honeysuckle leaf and bloom dates, to the extent that these observations contributed to the development of models. USA-NPN provides online access to historical phenological observations at: www.usanpn.org/data. Temperature Data This indicator is based in part on historical daily temperature records, which are publicly available online through NCEI. Individual station measurements and metadata are available through NCEI's website (www.ncdc.noaa.gov/data-access/land-based-station-data). Model Results The processed leaf and bloom date data set is not publicly available. EPA obtained the model outputs by contacting Dr. Mark Schwartz at the University of Wisconsin-Milwaukee, who developed the analysis and created the original time series and maps. Results from previous iterations of this analysis have been published in Schwartz et al. (2013) and other papers. Methodology 5. Data Collection This indicator was developed using models that relate phenological observations (leaf and bloom dates) to weather and climate variables. These models were developed by analyzing the relationships between two types of measurements: 1) observations of the first leaf emergence and the first flower bloom of the season in lilacs and honeysuckles and 2) temperature data. The models were developed using measurements collected throughout the portions of the Northern Hemisphere where lilacs and/or honeysuckles grow, then applied to temperature records from a larger set of stations throughout the contiguous 48 states. Phenological Observations First leaf date is defined as the date on which leaves first start to grow beyond their winter bud tips. First bloom date is defined as the date on which flowers start to open. Ground observations of leaf and bloom dates were gathered by government agencies, field stations, educational institutions, and trained citizen scientists; these observations were then compiled by organizations such as the USA-NPN. These types of phenological observations have a long history and have been used to support a wide range of peer-reviewed studies. See Schwartz et al. (2013) and references cited therein for more information about phenological data collection methods. Temperature Data Weather data used to construct, validate, and then apply the models—specifically daily maximum and minimum temperatures—were collected from officially recognized weather stations using standard Technical Documentation: Leaf and Bloom Dates 2 ------- meteorological instruments. These data have been compiled in NCEI databases such as the TD3200 Daily Summary of the Day data from cooperative weather stations. As described in the methods for an earlier version of this analysis (Schwartz et al., 2006), station data were used rather than gridded values, "primarily because of the undesirable homogenizing effect that widely available coarse-resolution grid point data can have on spatial differences, resulting in artificial uniformity of processed outputs..." (Schwartz and Reiter, 2000; Schwartz and Chen, 2002; Menzel et al., 2003). Ultimately, 2,805 weather stations were selected according to the following criteria: • Provide the best temporal and spatial coverage possible. At some stations, the period of record includes most of the 20th century. • Have at least 25 of 30 years during the 1981-2010 baseline period, with no 30-day periods missing more than 10 days of data. • Have sufficient spring-summer warmth to generate valid model output. For more information on the procedures used to obtain temperature data, see Schwartz et al. (2013) and references cited therein. 6. Indicator Derivation Daily temperature data and observations of first leaf and bloom dates were used to construct and validate a set of models that relate phenological observations to weather and climate variables (specifically daily maximum and minimum temperatures). These models were developed for the entire Northern Hemisphere and validated at 378 sites in Germany, Estonia, China, and the United States. Once the models were validated, they were applied to locations throughout the contiguous 48 states using temperature records from 1900 to 2020. Even if actual phenological observations were not collected at a particular station, the models essentially predict phenological behavior based on observed daily maximum and minimum temperatures, allowing the user to estimate the date of first leaf and first bloom for each year at that location. The value of these models is that they can estimate the onset of spring events in locations and time periods where actual lilac and honeysuckle observations are sparse. In the case of this indicator, the models have been applied to a time period that is much longer than most phenological observation records. The models have also been extended to areas of the contiguous 48 states where lilacs and honeysuckles do not actually grow—mainly parts of the South and the West coast where winter is too warm to provide the extended chilling that these plants need in order to bloom the following spring. This step was taken to provide more complete spatial coverage. This indicator was developed by applying phenological models to nearly 3,000 sites in the contiguous 48 states where sufficient weather data have been collected. The exact number of sites varies from year to year over the period 1900-2020 depending on data availability. This is a much larger set of sites than the earlier (2014) version of the indicator, which was based on a set of 779 sites. After running the models, analysts looked at each location and compared the first leaf date and first bloom date in each year with the average leaf date and bloom date for 1981 to 2010, which was established as a "climate normal" or baseline. This step resulted in a data set that lists each station along with the "departure from normal" for each year—measured in days—for each component of the indicator (leaf date and bloom date). Note that 1981 to 2010 represents an arbitrary baseline for comparison, and choosing a different baseline period would shift the observed long-term trends up or down but would not alter the shape, magnitude, or statistical significance of the trends. Technical Documentation: Leaf and Bloom Dates 3 ------- Figure 1. First Leaf and Bloom Dates in the Contiguous 48 States, 1900-2020 EPA obtained a data set listing annual departure from normal for each station, then performed some additional steps to create Figure 1. For each component of the indicator (leaf date and bloom date), EPA aggregated the data for each year to determine an average departure from normal across all stations. This step involved calculating an unweighted arithmetic mean of all stations with data in a given year. The aggregated annual trend line appears as a thin curve in each figure. To smooth out some of the year-to-year variability, EPA also calculated a nine-year weighted moving average for each component of the indicator. This curve appears as a thick line in each figure, with each value plotted at the center of the corresponding nine-year window. For example, the average from 2000 to 2008 is plotted at year 2004. This nine-year average was constructed using a normal curve weighting procedure that preferentially weights values closer to the center of the window. Weighting coefficients for values 1 through 9, respectively, were as follows: 0.0076, 0.036, 0.1094, 0.214, 0.266, 0.214, 0.1094, 0.036, 0.0076. This procedure was recommended by the authors of Schwartz et al. (2013) as an appropriate way to reduce some of the "noise" inherent in annual phenology data. EPA used endpoint padding to extend the nine-year smoothed lines all the way to the ends of the period of record. Per the data provider's recommendation, EPA calculated smoothed values centered at 2017, 2018, 2019, and 2020 by inserting the 2016-2020 average into the equation in place of the as-yet unreported annual data points for 2020 and beyond. EPA used an equivalent approach at the beginning of the time series. Figures 2 and 3. Change in First Leaf and Bloom Dates Between 1951-1960 and 2011-2020 To show spatial patterns in leaf and bloom changes, Figures 2 and 3 compare the most recent decade of data with the decade from 1951 to 1960 at individual stations. The 1950s were chosen as a baseline period to be consistent with the analysis published by Schwartz et al. (2013), who noted that broad changes in the timing of spring events appeared to start around the 1950s. To create the maps, EPA calculated the average departure from normal during each 10-year period and then calculated the difference between the two periods. The maps are restricted to stations that had at least eight years of valid data in both 10-year periods; 1,257 stations met these criteria. For more information on the procedures used to develop, test, and apply the models for this indicator, see Schwartz et al. (2013) and references cited therein. Indicator Development The 2010 edition of EPA's Climate Change Indicators in the United States report presented an earlier version of this indicator based on an analysis published in Schwartz et al. (2006). That analysis was referred to as the Spring Indices (SI). The team that developed the original SI subsequently developed an enhanced version of their algorithm, which is referred to as the Extended Spring Indices (Sl-x). EPA adopted the Sl-x approach for the 2012 edition of Climate Change Indicators in the United States. The Sl- x represents an extension of the original SI because it can now characterize the timing of spring events in areas where lilacs and honeysuckles do not grow. Additional details about the Sl-x are discussed in Schwartz et al. (2013). Technical Documentation: Leaf and Bloom Dates 4 ------- For the 2014 edition of this indicator, EPA added a set of maps (Figures 2 and 3) to provide a more robust depiction of regional variations. These maps were published in Schwartz et al. (2013) and have since been updated with more recent data. The 2015 update of this indicator included an expansion in the number of weather stations used to nearly 3,000 sites, compared with the previous 2014 version, which was based on 779 sites. This change was possible due to the availability of improved historical temperature data. 7. Quality Assurance and Quality Control Phenological Observations Quality assurance and quality control (QA/QC) procedures for phenological observations are not readily available. Temperature Data Most of the daily maximum and minimum temperature values were evaluated and cleaned to remove questionable values as part of their source development. For example, several papers have been written about the methods of processing and correcting historical climate data for NCEI's U.S. Historical Climatology Network (USHCN) and Global Historical Climatology Network (GHCN). NCEI's websites at: www.ncdc.noaa.gov/ushcn/introduction and www.ncdc.noaa.gov/data-access/land-based-station- data/land-based-datasets/global-historical-climatology-network-monthlv-version-3 describe the underlying methods and cite peer-reviewed publications for justification. Before applying the model, all temperature data were checked to ensure that no daily minimum temperature value was larger than the corresponding daily maximum temperature value (Schwartz et al., 2006). Model Results QA/QC procedures are not readily available regarding the use of the models and processing the results. These models and results have been published in numerous peer-reviewed studies, however, suggesting a high level of QA/QC and review. For more information about the development and application of these models, see Schwartz et al. (2013), McCabe et al. (2012), and the references cited therein. Analysis 8. Comparability Over Time and Space Phenological Observations For consistency, the phenological observations used to develop this indicator were restricted to certain cloned species of lilac and honeysuckle. Using cloned species minimizes the influence of genetic differences in plant response to temperature cues, and it helps to ensure consistency over time and space. Technical Documentation: Leaf and Bloom Dates 5 ------- Temperature Data NCEI's databases have undergone extensive testing to identify errors and biases in the data and either remove these stations from the time series or apply scientifically appropriate correction factors to improve the utility of the data. In particular, these corrections address changes in the time-of-day of observation, advances in instrumentation, and station location changes. Homogeneity testing and data correction methods are described in more than a dozen peer-reviewed scientific papers by NCEI. Data corrections were developed to specifically address potential problems in trend estimation of the rates of warming or cooling. Model Results The same model was applied consistently over time and space. Figure 1 generalizes results over space by averaging station-level departures from normal in order to determine the aggregate departure from normal for each year. This step uses a simple unweighted arithmetic average, which is appropriate given the national scale of this indicator and the large number of weather stations spread across the contiguous 48 states. 9. Data Limitations Factors that may impact the confidence, application, or conclusions drawn from this indicator are as follows: 1. Plant phenological events are studied using several data collection methods, including satellite images, models, and direct observations. The use of varying data collection methods in addition to the use of different phenological indicators (such as leaf or bloom dates for different types of plants) can lead to a range of estimates of the arrival of spring. 2. Climate is not the only factor that can affect phenology. Observed variations can also reflect plant genetics, changes in the surrounding ecosystem, and other factors. This indicator minimizes genetic influences, however, by relying on cloned plant species, which have no genetic differences. 10. Sources of Uncertainty Error estimates are not readily available for the underlying temperature data upon which this indicator is based. It is generally understood that uncertainties in the temperature data increase as one goes back in time because there are fewer stations early in the record. These uncertainties are not sufficient, however, to mislead the user about fundamental trends in the data. In aggregating station-level "departure from normal" data into an average departure for each year, EPA calculated the standard error of each component of Figure 1 (leaf date and bloom date) in each year. Standard errors range from 0.2 days to 0.4 days for leaf date, and from 0.1 days to 0.4 days for bloom date, depending on the year. Uncertainty has not been calculated for the individual station-level changes shown in Figures 2 and 3. Technical Documentation: Leaf and Bloom Dates 6 ------- Schwartz et al. (2013) provide error estimates for the models. The use of modeled data should not detract from the conclusions that can be inferred from the indicator. These models have been extensively tested and refined over time and space such that they offer good certainty. 11. Sources of Variability Temperatures naturally vary from year to year, which can strongly influence leaf and bloom dates. To smooth out some of the year-to-year variability, EPA calculated a nine-year weighted moving average for each component of this indicator in Figure 1, and EPA created the maps in Figures 2 and 3 based on 10-year averages for each station. 12. Statistical/Trend Analysis Statistical testing of individual station trends within the contiguous 48 states suggests that many of these trends are not significant. Other studies (e.g., Schwartz et al., 2006) have come to similar conclusions, finding that trends in the earlier onset of spring at individual stations are much stronger in Canada and parts of Eurasia than they are in the contiguous 48 states. In part, as a result of these findings, Figure 1 focuses on aggregate trends across the contiguous 48 states, which should be more statistically robust than individual station trends. However, the aggregate trends still are not statistically significant (p<0.05) over the entire period of record, based on a standard t-test statistic. References IPCC (Intergovernmental Panel on Climate Change). 2014. Climate change 2014: Impacts, adaptation, and vulnerability. Working Group II contribution to the IPCC Fifth Assessment Report. Cambridge, United Kingdom: Cambridge University Press, www.ipcc.ch/report/ar5/wg2. McCabe, G.J., T.R. Ault, B.I. Cook, J.L. Betancourt, and M.D. Schwartz. 2012. Influences of the El Nino Southern Oscillation and the Pacific Decadal Oscillation on the timing of the North American spring. Int. J. Climatol. 32:2301-2310. Menzel, A., F. Jakobi, R. Ahas, et al. 2003. Variations of the climatological growing season (1951-2000) in Germany compared to other countries. Int. J. Climatol. 23:793-812. Schwartz, M.D., and X. Chen. 2002. Examining the onset of spring in China. Clim. Res. 21:157-164. Schwartz, M.D., and B.E. Reiter. 2000. Changes in North American spring. Int. J. Climatol. 20:929-932. Schwartz, M.D., R. Ahas, and A. Aasa. 2006. Onset of spring starting earlier across the Northern Hemisphere. Glob. Change Biol. 12:343-351. Schwartz, M.D., T.R. Ault, and J.L. Betancourt. 2013. Spring onset variations and trends in the continental United States: Past and regional assessment using temperature-based indices. Int. J. Climatol. 33:2917-2922. Technical Documentation: Leaf and Bloom Dates 7 ------- |