EJSCREEN
Environmental Justice
Mapping and Screening Tool
EJSCREEN
Technical Documentation
May 2015

-------
EJSCREEN
Environmental Justice Mapping and Screening Tool
EJSCREEN Technical Documentation
May 2015
U.S. Environmental Protection Agency
Office of Policy
Washington, D.C. 20460
Suggested citation:
U.S. Environmental Protection Agency (EPA), 2015. EJSCREEN Technical Documentation.
For more information:
www.epa.gov/ejscreen

-------
ACKNOWLEDGEMENTS
This document was prepared by the United States Environmental Protection Agency's Office of Policy
(OP) with drafting, editing and technical support from Abt Associates. ESRI and SAIC, working with the
Office of Environmental Information (OEI), obtained and processed datasets obtained from the Census
Bureau, US Department of Transportation (DOT), and various EPA offices including the Office of Air
(OAR), Office of Research and Development (ORD), Office of Water (OW), and Office of Solid Waste and
Emergency Response (OSWER). Ongoing input, review, and outreach have been provided by all of these
offices as well as EPA's Regional offices, Office of Chemical Safety and Pollution Prevention (OCSPP),
Office of International and Tribal Affairs (OITA), Office of Enforcement and Compliance Assurance
(OECA), Office of Environmental Justice (OEJ), and Office of General Counsel (OGC). External expert peer
reviewers provided comments on the methods and documentation as well. Specific acknowledgement is
given to the contributions of Mark Corrales, Bridgid Curry, William Nickerson, and Fred Talcott in the
design of EJSCREEN and the preparation of this document.

-------
Contents
1	INTRODUCTION	6
Environmental Justice at EPA	6
Environmental Justice Mapping and Screening at EPA	7
Development of EJSCREEEN	8
Purposes and Uses of EJSCREEN	8
Caveats and Limitations of EJSCREEN	9
Advances in EJ Screening Provided by EJSCREEN	11
2	OVERVIEW OF DATA AND METHODS IN EJSCREEN	13
Environmental Indicators Selected for EJSCREEN	13
Demographics in EJSCREEN	16
Using Demographics as Proxies for Potential Susceptibility	16
Demographic Indicators Included in EJSCREEN	20
Demographic Indexes in EJSCREEN	21
Environmental Justice Indexes in EJSCREEN	22
How the EJ Index Works	22
Supplementary EJ Indexes	24
EJ Indexes, Population Density, and Rural Areas	25
Why the EJ Indexes are Not all Combined	26
PERCENTILES	27
What a Percentile Means	27
Color-coded High Percentile Bins	28
Buffer Reports	30
What the Buffer Report Calculates	30
Choosing a Buffer Size versus Rationale for Distance in Proximity Indicators	30
3	DETAILS ON THE ENVIRONMENTAL INDICATORS IN EJSCREEN	33
Environmental Factors Not Included	33
Environmental Factors in EJSCREEN	35
4	BIBLIOGRAPHY	67
Appendices
Appendix A. Development of EJSCREEN	78
Appendix B. Technical details on percentiles, rounding, buffering, and demographic data	80
Appendix C. Technical details on proximity indicators	100
Appendix D. Summary statistics for indicators	107
Appendix E. Formulas for demographics and EJ indexes	115
Appendix F. Quality control / quality assurance	118
Appendix G. Peer review	120
Appendix H. Initial filter approach for screening	122
iv | P a g e

-------
Tables
Table 1. Types of Environmental Indicators Included in EJSCREEN	15
Table 2. Summary Table of Environmental Indicators and Sources	35
Table 3. Likelihood of Lead-Based Paint Hazards by Housing Construction Date	49
Table 4. Tallies of 2008-12 ACS Block Groups Used in 2015 Version of EJSCREEN	85
Table 5. ACS Tables Underlying EJSCREEN Demographic Data and Lead Paint Indicator	87
Table 6. Summary Statistics for Environmental Indicators	108
Table 7. Summary Statistics for Demographics	109
Table 8. Spearman Correlation Coefficients for Environmental Indicators	Ill
Table 9. Spearman Correlation Coefficients for Demographic Indicators	112
Figures
Figure 1. Histograms of Block Group Environmental Indicators as ratio to mean value (log scale shows mean
value as zero)	113
v | Page

-------
Introduction
EJSCREEN Environmental Justice
Mapping and Screening Tool
Technical Documentation
1 INTRODUCTION
The United States Environmental Protection Agency (EPA) is charged with protecting human health and
the environment for all Americans. In order to better meet the Agency's responsibilities related to the
protection of public health and the environment, EPA has developed a new environmental justice (EJ)
screening tool, called EJSCREEN. In some ways, EJSCREEN is similar to prior screening or mapping tools.
As a new tool, however, it offers improvements such as easy web-based access to powerful mapping
and data reporting tools, a wide range of updated demographic information, environmental indicators
addressing more topics, and higher resolution maps covering the entire nation. EJSCREEN also provides
standard reports that bring together environmental and demographic data in the form of EJ indexes.
These are summarized as percentiles to put the information in perspective and facilitate comparisons
between locations.
EJ screening tools may be used to explore one location using a data report, or to look across a wide area
using maps. EJ tools have been used in a wide variety of circumstances, and EJSCREEN can support a
similarly broad range of applications. EJSCREEN provides useful data and indicators, and highlights
places that may be candidates for further review, including additional consideration, analysis or
outreach.
This document describes EJSCREEN within the context of EPA's EJ program, and provides details on the
data and methods used to create the indicators and indexes in EJSCREEN. The Appendices in this
document provide additional detail on data and methods for interested users.
Environmental Justice at EPA
Since EJ mapping and screening is just one aspect of EPA's ongoing commitment to environmental
justice, it is helpful to understand the broad, historical context of EPA's EJ work.
EPA has defined "environmental justice" as follows:
Environmental Justice is the fair treatment and meaningful involvement of all people regardless
of race, color, national origin, or income with respect to the development, implementation, and
enforcement of environmental laws, regulations, and policies.... Fair treatment means that no
group of people should bear a disproportionate share of the negative environmental
6 | P a g e

-------
Introduction
consequences resulting from industrial, governmental and commercial operations or policies.1
(italics added)
EPA's efforts to understand EJ concerns date back at least to EPA's 1992 report on Environmental Equity
(U.S. EPA, 1992). The 1992 report documented health and exposure disparities associated with
race/ethnicity and income. To address such disparities, in 1994, Executive Order 12898 (EO 12898)
mandated that each covered federal agency make achieving environmental justice part of its mission by
identifying and addressing, as appropriate, any disproportionately high and adverse human health or
environmental effects of its programs, policies and activities on minority, low-income, tribal and
indigenous populations.2
These early activities provided a foundation for EPA's continued commitment to environmental justice,
which was reaffirmed in January 2010 when EPA Administrator Lisa P. Jackson announced Expanding the
Conversation on Environmentalism and Working for Environmental Justice as one of the Agency's top
seven priorities. EPA has made great progress implementing this priority and has worked across the
Agency to make a difference in overburdened communities, a goal that has been reiterated by EPA
Administrator Gina McCarthy.
EPA has been engaged in a variety of research and analytic efforts related to environmental justice, to
support both regulatory analysis and screening efforts. Recent efforts have been guided by Plan EJ 2014,
which was released for public comment in July 2010 and finalized in September 2011.3 EPA's Office of
Environmental Justice (OEJ), in conjunction with the rest of the Agency, works to protect human health
and the environment in communities overburdened by environmental pollution by integrating
environmental justice into all EPA programs, policies and activities.
Environmental Justice Mapping and Screening at EPA
Mapping tools as well as screening-level applications have a substantial history at EPA, and EJ is one area
in which maps and screening can be useful. Several EPA Regional offices have used basic screening tools
that map demographic information and allow staff to overlay selected environmental data such as
facility locations. In connection with EJSEAT, a screening tool that was developed by EPA's Office of
Enforcement and Compliance Assurance (OECA), the National Environmental Justice Advisory Council
(NEJAC) provided recommendations to EPA about how to design an EJ screening tool in its May 2010
report, "Nationally Consistent Environmental Justice Screening Approaches" (NEJAC, 2010).
These various early screening tools have been used for internal EPA purposes only, and generally were
not available to the public. EPA's main publicly available EJ mapping tool until 2015 was EJVIEW,4 a web-
based tool that displayed selected demographic and environmental data, and allowed users to overlay
these data on maps of a community or wider area.
1	http://www.epa.gov/environmentaliustice/basics/index.html, accessed 10/16/2014.
2	http://www.archives.gov/federal-register/executive-orders/pdf/12898.pdf
3	http://www.epa.gov/environmentaliustice/plan-ei/index.html
4	http://www.epa.gov/compliance/ei/mapping.html
7 | P a g e

-------
Introduction
Development of EJSCREEEN
Plan EJ 2014 included a commitment to develop a nationally consistent environmental justice screening
and mapping tool in order to better meet the Agency's responsibilities related to the protection of
public health and the environment in a manner that is consistent with EO 12898 and the goals of Plan EJ
2014. This commitment was the impetus behind the development of EJSCREEN. This new effort
provided an opportunity to reassess and build upon prior efforts, while considering new data, new
scientific findings, new analytic methods and a variety of policy considerations.
The goal in developing EJSCREEN has been to take account of this prior progress and learning, and
provide a new, user-friendly screening tool that addresses policy questions and stakeholder concerns in
an informative manner. An important part of this effort has been to ensure the screening tool reflects an
appropriate balance between simple, feasible, screening-level information on the one hand, and high-
quality data and strong science on the other.
Development of EJSCREEN began in late 2010, and EPA staff began using an early version in 2012, as
prior tools such as EJSEAT were phased out. EJSCREEN was peer reviewed in early 2014 through a letter
review (see Appendix G), and updated with newer data and an improved interface in 2014/2015.
EJSCREEN was released to the public in 2015, replacing EJVIEW as EPA's public-facing EJ mapping tool.
Purposes and Uses of EJSCREEN
EJ mapping and screening tools combine environmental and demographic indicators in maps and
reports. This information can help to highlight geographic areas and the extent to which they may be
candidates for further review, including additional consideration, analysis or outreach. The tools also
allow users to explore locations at a detailed geographic level, across broad areas or across the entire
nation. Environmental indicators typically are direct or proxy estimates of risk, pollution levels or
potential exposure (e.g., due to nearby facilities). Demographic indicators are often used as proxies for a
community's health status and potential susceptibility to pollution. Environmental and demographic
data and indicators may be viewed separately or in combination.
This type of screening information may be of interest to communities as well as many other
stakeholders, and also can support a wide range of research and policy goals. In general, EPA's efforts
are more effective and efficient if they are informed by an understanding of where the impacts of
existing pollution may be greatest. Screening tools can also help ensure that such areas are not
overlooked, and receive appropriate consideration, analysis or outreach.
Screening tools can be appropriately put toward a wide variety of uses. The public has used EJVIEW in a
variety of ways, and is likely to use EJSCREEN in many ways as well. EPA has used existing internal EJ
screening tools in aspects of enforcement, compliance, the Superfund program, permitting, and
voluntary programs. Screening tools also have been used in developing retrospective reports, and to
enhance geographically based initiatives. EJSCREEN will be able to support a similarly wide variety of
uses.
8 | P a g e

-------
Introduction
Screening tools should be used for a "screening-level" look. Screening is a useful first step in
understanding or highlighting locations that may be candidates for further review. However, it is
essential to remember that screening-level results do not provide a complete assessment of risk, and
have significant limitations.
Caveats and Limitations of EJSCREEN
EJSCREEN is a pre-decisional screening tool, and was not designed to be the basis for agency decision-
making or determinations regarding the existence or absence of EJ concerns. It also should not be used
to identify or label an area as an "EJ Community." Instead, EJSCREEN is designed as a starting point, to
highlight the extent to which certain locations may be candidates for further review or outreach.
EJSCREEN's initial results should be supplemented with additional information and local knowledge
whenever appropriate, for a more complete picture of a location. Additional considerations and data,
such as national, regional, or local information and concerns, along with appropriate analysis, should
form the basis for any decisions.
EJSCREEN, as a screening tool, is more limited than a detailed analysis in two key ways. First, it has data
on only some of the relevant issues, and second, there is uncertainty in the data it does have. It is
important to understand each of these limitations.
The first limitation arises because a screening tool cannot capture all the relevant issues that should be
considered (e.g., other local environmental concerns). Any national screening tool must balance a desire
for data quality and national coverage against the goal of including as many important environmental
factors as feasible given resource constraints. Many environmental concerns are not yet included in
comprehensive, nationwide databases. For example, data on environmental factors such as local
drinking water quality and indoor air quality were not available with adequate quality, coverage and/or
resolution to be included in this national screening tool. EJSCREEN cannot provide data on every
environmental impact and demographic factor that may be important to any location.
The second important limitation is that EJSCREEN relies on demographic and environmental estimates
that involve substantial uncertainty. This is especially true when looking at a small geographic area, such
as a single Census block group. A single block group is often small and has uncertain estimates. A buffer
that is roughly the same size as a block group or smaller will introduce additional uncertainty because it
has to approximate the locations of residences. Therefore, it is typically very useful to summarize
EJSCREEN data for a larger area, covering several block groups, in what is called a "buffer" report, as
explained later in this document. There is a tradeoff between resolution and precision: Detailed maps at
high resolution can suggest the presence of a local "hotpot," but are uncertain. Estimates based on
larger areas will provide more confidence and precision, but may overlook local "hotpots" if not
supplemented with detailed maps.
The demographic uncertainty combined with uncertainty in environmental data means EJ index values
are often quite uncertain for a single block group. Therefore, modest differences in percentile scores
between block groups or small buffers should not be interpreted as meaningful because of the
uncertainties in demographic and environmental data at the block group level. We do not have a high
9 | P a g e

-------
Introduction
degree of confidence when comparing or ranking places with only modest differences in estimated
percentile. For this reason, it is critical that EJSCREEN results be interpreted carefully, particularly for
individual block groups, and that additional information be used to supplement or follow up on
screening, where appropriate.
The demographic estimates, such as percent low-income, come from the American Community Survey
(ACS) from the United States Census Bureau. The ACS is comprised of surveys, not a full census of all
households. This means the Census Bureau may estimate that a block group is 30% low-income, for
example, but it might actually be 20% or 40% in some cases (see Appendix B for a discussion of
uncertainty in demographics).
Uncertainties are also discussed in section 2 (with regard to buffer reports), and Appendix B (in
discussions of buffering details and demographic data).
Related to the issue of uncertainty is that fact that the environmental indicators are only screening-level
proxies for actual exposures or health risks. This is particularly true for the proximity indicators, for
example. Even for the indicators that directly estimate risks or hazards, as with the air toxics cancer risk
indicator, estimates have substantial uncertainty because emissions, ambient levels in the air, exposure
of individuals, and toxicity are uncertain. Section 3 provides technical details on each environmental
indicator.
The inclusion of a dataset in EJSCREEN does not imply it is the newest, best, or primary estimate of
actual conditions or risks. Estimates are based on historical data and may not reflect current or future
conditions. The vintage of environmental indicators varies and is not the same as the vintage of the
demographic data. The NATA air toxics indicators and the PM2.5 and ozone indicators in particular should
be viewed with this in mind, because emissions related to PM2.5, ozone, and air toxics generally have
decreased in recent years. This version of EJSCREEN incorporated the most recent data that were
available at the time of each indicator's development. Every attempt will be made to use the most
recent appropriate data available in future updates of EJSCREEN. There is always a delay between the
release of raw data and their eventual incorporation into any models, tools, or maps. It is also useful to
note that although the raw numbers for some indicators do not represent current conditions, the
percentiles are much more likely to be reasonably representative of today's conditions in most
locations. This is because even if emissions have been significantly reduced overall, for example, the
differences between various locations are unlikely to have changed as dramatically, especially when the
reductions have come from national regulations and other trends affecting entire industries or sectors in
many locations. For this reason, the percentiles may be more representative of current conditions than
the raw values of the indicators. Finally, some supplementary maps and local information can
complement the EJSCREEN indicators to provide more recent information. In particular, EJSCREEN also
provides updated maps of PM2.5 and ozone nonattainment areas (areas not meeting national ambient
air quality standards).
There are also some limitations in geographic coverage - EJSCREEN lacks data in some locations for
some indicators, such as in Alaska, Hawaii, and Puerto Rico.
10 | Page

-------
Introduction
In short, as with any screening tool, the indicators in EJSCREEN cannot address all the considerations
that may be relevant to a given situation, they are often only a screening-level proxy for a given issue,
and in any case, there is significant uncertainty involved, particularly for a single block group. For these
types of reasons, among others, it generally is not appropriate to rely on any screening tool as the basis
for a key decision. It is often very useful to obtain information on other issues not included in EJSCREEN,
updated information when available, as well as local knowledge, data, and concerns.
Advances in EJ Screening Provided by EJSCREEN
EJSCREEN offers a variety of enhancements relative to previous approaches to EJ mapping and screening
or analysis. For example, the tool includes updated demographic information (from the ACS) rather than
relying on the Census that is conducted every ten years. It also provides several new environmental
indicators, covering a wider range of issues such as traffic volume and proximity. EJSCREEN includes a
suite of EJ indexes that quantify the combination of environmental and demographic indicators. It
includes high resolution maps, and a new geospatial software and data system that improves access to
new tools with a simple, browser-based interface and centralized, consistent data. The buffer reports
are calculated using detailed Census block data for more accurate estimates of where residents are
located. EJSCREEN provides access to a great deal of data, and presents standardized reports. These
reports use summary metrics and percentiles to facilitate national, regional or state-level perspectives
and a better understanding of EJ issues.
EJSCREEN can help explore the environmental, demographic and EJ characteristics of a block group or
buffer area. It provides numerical estimates for each place, for both environmental and demographic
data, such as the traffic proximity indicator, or the percentage of local residents who are racial/ethnic
minorities.
EJSCREEN also presents multiple "EJ Indexes" for each place. An EJ Index is a way of combining, in a
quantitative way rather than only visually on a map, the environmental indicator and the demographic
information for a location. A separate EJ Index is provided for each environmental indicator in EJSCREEN,
for each block group in the US. The EJ Index goes beyond a simple visual overlay of maps of environment
and demographics, to actually quantify the extent to which these two factors co-occur.
In EJSCREEN, the basic level of geographic resolution is the Census block group. Each block group is
defined by the U.S. Census Bureau, with a logical and unambiguous numbering scheme, and associated
digital shape files that permit mapping with modern geographical information system (GIS) software.
Block group data are widely used by researchers and others. Block groups also provide a relatively stable
framework; for instance, block groups are not subject to frequent boundary definition changes that
political jurisdictions and postal ZIP codes may experience.
Estimates in EJSCREEN are compiled by block group, and that is the most detailed level at which results
can be viewed. However, demographic estimates for a single block group are often based on a small
sample of the local population, and are uncertain. Similarly, some environmental indicator estimates
are derived from lower-resolution data, and all involve uncertainty. Therefore, it is typically very useful
and advisable to summarize EJSCREEN data within a larger area that covers several block groups, in what
11 | Page

-------
Introduction
is called a "buffer" report. An EJSCREEN user can specify or draw buffers of custom sizes and shapes as
needed. For example, a buffer could include all residents within 1 mile of a certain location. When a
buffer covers several block groups, it provides an estimate that has less uncertainty than a single block
group or smaller buffer would. EJSCREEN summarizes data for all residents within some distance from a
selected point, using a circular buffer, or within a user-defined buffer of any shape, using Census blocks
(not just block groups) to refine estimates of how many residents are inside the buffer, as explained in
Appendix B.
12 | Page

-------
Overview of Data and Methods
2 OVERVIEW OF DATA AND METHODS IN EJSCREEN
This section describes the environmental and demographic data used in the tool, as well as the methods
used to combine them and produce EJ Indexes. As of 2015, EJSCREEN contains 12 environmental
indicators, which range from estimates of human health risk to proxies for potential exposure such as
proximity to hazardous waste sites. The tool also contains six demographic indicators, which are
combined into two separate demographic indexes (the demographic index uses the average of two
indicators, and a supplementary demographic index uses the average of all six). A demographic index is
combined with an environmental indicator to create an associated EJ Index. The environmental,
demographic, and EJ indicators and indexes are all calculated for each block group, and can be
summarized within a defined buffer area. The sections immediately below summarize the
environmental data, demographic data, and EJ indexes. Section 3 provides more detail on each
environmental indicator.
Environmental Indicators Selected for EJSCREEN
Some environmental indicators used in EJSCREEN quantify proximity to and the numbers of certain
types of potential sources of exposure to environmental pollutants, such as nearby hazardous waste
sites or traffic. The lead paint indicator indicates the presence of older housing, which often, but not
always, indicates the presence of lead paint, and therefore the possibility of exposure. In some cases,
the term "exposure" is used very broadly here to refer to the potential for exposure. Others indicators in
EJSCREEN are estimates of ambient levels of air pollutants, such as PM2.5, ozone, and diesel particulate
matter. Still others are actual estimates of air toxics-related cancer risk, or a hazard index, which
summarizes the ratios of ambient air toxics levels to health-based reference concentrations. In other
words, these environmental indicators vary widely in what they indicate, as discussed further below.
A variety of considerations has informed the selection of these environmental indicators; in general, the
selected indicators exhibit the following characteristics:
•	Resolution: Screening level data are available (or could be readily developed) at the block
group level (or at least close to this resolution).
•	Coverage: Screening level data are available (or could be readily developed) for the entire
United States (or with nearly complete coverage).
•	Relevance to EJ: Pollutants or impacts are relevant to EJ (e.g., differences between groups
have been indicated in exposures, susceptibility, or health endpoints associated with the
exposures)
•	Public health significance: Pollutants or impacts are potentially important in the United
States (e.g., notable impacts estimated or significant concerns have been expressed, at least
locally, or exposure has been linked to health endpoints with substantial impacts
nationwide).
EPA selected environmental indicators after a review of data availability (including the criteria and
review of data availability for an Environmental Quality Index, now provided by county in Messer, Jagai,
13 | P a g e

-------
Overview of Data and Methods
Rappazzo, & Lobdell (2014), and described in Lobdell, Jagai, Rappazzo, & Messer (2011)); health
disparity information (e.g., CDC's major 2011 report on health disparities (Centers for Disease Control
and Prevention, 2011a)); risk ranking studies (e.g., Unfinished Business (U.S. EPA, 1987), Reducing Risk
(U.S. EPA, 1992), and related reports); and risk estimates from major studies (e.g., related to PM2.5
ambient standards). EPA also reviewed data from the CDC (Centers for Disease Control and Prevention,
2011c) and other sources in the federal government (Fedstats, 2007), and consulted EPA Regions and
program offices that are responsible for data collection and analysis under EPA's key environmental
statutes. The State of California's work on CalEnviroScreen5 was also tracked throughout its
development. Other internal EPA data tools were also examined, including EJSEAT, EJVIEW, C-FERST, and
tools used in EPA Regional Offices.
After review, EPA selected the following environmental factors for use in the first version of EJSCREEN:
•	Air pollution:
o PM2.5 level in air.
o Ozone level in air.
o NATA air toxics:
¦	Diesel particulate matter level in air.
¦	Air toxics cancer risk.
¦	Air toxics respiratory hazard index.
¦	Air toxics neurological hazard index.
•	Traffic proximity and volume: Amount of vehicular traffic nearby, and distance from roads.
•	Lead paint indicator: Percentage of housing units built before 1960, as an indicator of
potential exposure to lead.
•	Proximity to waste and hazardous chemical facilities or sites: Number of significant
industrial facilities and/or hazardous waste sites nearby, and distance from those:
o National Priorities List (NPL) sites,
o Risk Management Plan (RMP) Facilities.
o Hazardous waste Treatment, Storage and Disposal Facilities (TSDFs).
o National Pollutant Discharge Elimination System (NPDES) permitted major direct
dischargers to water.
Each of these environmental indicators is explained in detail in section 3, and Appendix D provides
summary statistics for the indicators. Again, it is important to understand that these indicators vary in
how relevant they are to actual estimated risks to health or welfare, and how significant those impacts
may be. These indicators represent a spectrum in terms of the quality of information about potential
impacts, ranging from direct estimates of risk to rough indicators of proximity or exposure to pollution
or other environmental hazards. Table 1 provides more detail on how closely each environmental
indicator in EJSCREEN approximates actual estimated risk.
5 http://oehha.ca.gov/ei/ces2.html at http://oehha.ca.gov/ei
14 | Page

-------
Overview of Data and Methods
Table 1. Types of Environmental Indicators Included in EJSCREEN
Indicator
Place on Exposure-
Risk Continuum
Key
Medium
NATA Air Toxics Cancer Risk
Lifetime inhalation cancer risk


NATA Respiratory Hazard Index
Ratio of exposure concentration to RfC
Risk/Hazard

NATA Neurological Hazard Index
Ratio of exposure concentration to RfC


NATA Diesel PM (DPM)
(Hg/m3)

Air
Particulate Matter (PM2.5)
Annual average (ng/m3)


Ozone
Summer seasonal average of daily maximum 8-hour
concentration in air (ppb)
Potential Exposure

Lead Paint Indicator
Percentage of housing units built before 1960

Dust/ Lead
Paint
Traffic Proximity and Volume
Count of vehicles (average annual daily traffic) at major roads
within 500 meters, divided by distance in kilometers (km)

Air/ Other
Proximity to RMP Sites
Count of facilities within 5 km, divided by distance


Proximity to TSDFs
Count of major TSDFs within 5 km, divided by distance
Proximity/ Quantity
Waste/
Water/ Air
Proximity to NPL Sites
Count of proposed and listed NPL sites within 5 km, divided by
distance6

Proximity to Major Direct Water Dischargers
Count of NPDES major facilities within 5 km, divided by distance

Water
Abbreviations:


NATA
National Air Toxics Assessment
RfC
Reference concentration from EPA's
NPL
National Priorities List, Superfund program

Integrated Risk Information System
NPDES
National Pollutant Discharge Elimination System
PM2.5
Particulate matter (PM) composed of
RMP
Risk Management Plan

particles smaller than 2.5 microns
TSDFs
Hazardous waste Treatment, Storage, and
Mg/m3
micrograms of PM2.5 per cubic meter of air

Disposal Facilities
ppb
parts per billion, of ozone in air
6 Count of NPL sites excludes deleted sites and sites in U.S. territories.
15 | P a g e

-------
Overview of Data and Methods
It is also important to note that each proximity indicator focuses on one category of facility or site (e.g.,
NPL), but the category's facilities or sites vary in the degree to which they could actually pose risks. They
vary in the amount of emissions (if any), the possibility of exposure to any pollutants released, the size
of the facility or site, and toxicity of the pollutants or severity of the impacts that might occur. As a
screening tool, EJSCREEN generally does not distinguish based on these factors in proximity indicators
(although NATA indicators do account for such information). Any closer review of a particular location
would have to consider these important differences.
All of these indicators are focused on potential impact at residential locations (e.g., proximity of
residence to traffic), and therefore only address some of the exposures that individuals may face. Data
are generally insufficient to readily estimate exposures away from the home, particularly in a screening
tool. Exposures that occur away from the home, such as at work, at school or during a commute, are not
captured in EJSCREEN unless those exposures are near the home or in other locations that happen to
have the same level of exposure.7
Demographics in EJSCREEN
This section describes why demographic indicators are included in EJSCREEN, which specific
demographic indicators were selected, and what data are used to derive the demographic indicators.
Using Demographics as Proxies for Potential Susceptibility
EJSCREEN has been designed in the context of Executive Order 12898s which ordered the following:
To the greatest extent practicable and permitted by law, and consistent with the principles set
forth in the report on the National Performance Review, each Federal agency shall make
achieving environmental justice part of its mission by identifying and addressing, as appropriate,
disproportionately high and adverse human health or environmental effects of its programs,
policies, and activities on minority populations and low-income populations9
EJSCREEN was also designed in the context of EPA's EJ policies, including EPA's Interim Guidance on
Considering Environmental Justice During the Development of an Action (U.S. EPA, 2010). That guidance
document explained EPA's focus on demographics as an indicator of potential susceptibility or
vulnerability to environmental pollution:
To help achieve EPA's goal for EJ (i.e., the fair treatment and meaningful involvement of all
people), EPA places particular emphasis on the public health of and environmental conditions
affecting minority, low-income, and indigenous populations. In recognizing that these
populations frequently bear a disproportionate burden of environmental harms and risks ... EPA
works to protect them from adverse public health and environmental effects of its programs.
7	A partial exception is the data from NATA, which make some attempt to include some nonresidential exposures,
as explained in NATA's technical documentation (http://epa.gov/nata/).
8	http://www2.epa.gov/laws-regulations/summarv-executive-order-12898-federal-actions-address-environmental-
justice
9	http://www.archives.gov/federal-register/executive-orders/pdf/12898.pdf
16 | Page

-------
Overview of Data and Methods
EPA should pay particular attention to the vulnerabilities of these populations because they have
historically been exposed to a combination of physical, chemical, biological, social, and cultural
factors that have imposed greater environmental burdens on them than those imposed on the
general population. (U.S. EPA, 2010, p. 4)
EJSCREEN uses demographic indicators as very general indicators of a community's potential
susceptibility to the types of environmental exposures included in this screening tool. Impacts of
pollutants depend on a combination of exposure and susceptibility to those exposures. Demographic
factors may be related to both of these. Therefore, it is very useful to distinguish between 1) the fact
that some demographics are associated with higher exposure, and 2) the fact that demographics are
useful in predicting susceptibility to those exposures. To indicate potential exposures, EJSCREEN uses
environmental indicators, not demographics. EJSCREEN uses demographics to indicate potential
susceptibility. EJSCREEN then combines the exposure and susceptibility indicators in the form of an EJ
Index.
The demographic indicators in EJSCREEN are a way to indicate which communities may be more
susceptible to a given level of exposure to environmental pollutants. For example, individuals may be
more susceptible when they are already in poor health, have reduced access to care, lack resources or
language skills or education that would help them avoid exposures or obtain treatment, or are at
susceptible life stages. Nationwide direct measures of health status are not available for all block groups
or even tracts - such data are typically compiled by county in national databases. Demographics,
however, are available for every block group, and are correlated with health status and these other
susceptibility factors, making them useful screening-level indicators of potential susceptibility at the
local level.
Note that this report uses the term susceptibility in a qualitative, general sense, to refer to what various
authors have called susceptibility and/or vulnerability. Susceptibility in this report means greater
"impact" for a given environmental indicator value. The terms vulnerability and susceptibility sometimes
are used interchangeably, although various other reports and programs have made distinctions between
these terms.10
The relationships between demographics, exposure, and susceptibility are complex. For example,
demographics may be associated with susceptibility to pollutants in any of the following ways:
• Greater personal exposure despite the same ambient level of pollutant. For example,
children have higher breathing rates or ingest more lead dust than adults (U.S. EPA, 2011a),
10 For example, EPA's 2009 National Ambient Air Quality Standards (NAAQS) documents (U.S. EPA, 2009b) and also
EPA's Regional Vulnerability Assessment program treat susceptibility and vulnerability as essentially identical
(http://www.epa. gov/reva /glossa rv. htm I). Other EPA definitions have addressed particular contexts, such as in
EPA glossaries (http://www.epa.gov/OCEPAterms/vterms.html, http://www.epa.gov/OCEPAterms/sterms.html),
and a report on vulnerability to climate change (U.S. EPA, 2009a). One National Academies report (Science and
Decisions) distinguished between the two terms (National Research Council, 2009). In other contexts, the terms
have varied uses - see Villagran de Leon (2006) for a detailed comparison of various definitions of vulnerability in
the context of natural disasters, or work on vulnerability indexes for developing countries by Briguglio (1997).
17 | Page

-------
Overview of Data and Methods
and certain groups may tend to encounter or be less able to avoid certain exposures due to
limited resources, language barriers, education, cultural practices, or lack of information.
•	Susceptibility because of a greater percentage increase in health risk for a given exposure,
e.g., "effect modification" or "multiplicative interaction" may occur. An example would be
where cumulative previous exposure means a group is more likely to be closer to a
threshold for adverse effects, or where greater stress/allostatic load increases susceptibility
through inflammatory or other pathways. Several examples of effect modification relevant
to EJ and PM2.5 are referenced by Bell & Ebisu, 2012 and in a review of subgroups
susceptible to ozone (Bell, Zanobetti, & Dominici, 2014). A growing body of research has
documented interactions of psychosocial stress and environmental exposures.
•	Susceptibility because of higher baseline risk or rates of pre-existing diseases. The same
percent increase in mortality risk has a larger impact on absolute risk if baseline risk is
higher.
•	Susceptibility because of increased overall burden resulting from an initial health risk (e.g.,
because of less ability to recover due to lack of health care or resources). For example, low-
income or minority individuals, or those with less than a high school education, are far less
likely to have health insurance (Cohen & Martinez, 2011).
One reason for EJSCREEN to focus on potentially susceptible demographic groups is that a large body of
research has documented health disparities between demographic groups in the United States, such as
differences in mortality and morbidity associated with factors that include race/ethnicity, income and
educational attainment (e.g., Centers for Disease Control and Prevention, 2011a; Galea, Tracy, &
Hoggatt, 2011). For example:
•	About two thirds (65%) of non-Hispanic white adults reported excellent or very good health
in 2009. In contrast, less than half (49%) of non-Hispanic black adults and 52% of Hispanic
adults reported excellent or very good health.11
•	Residents with lower income report fewer average healthy days than others (Centers for
Disease Control and Prevention, 2011a), and report worse health overall (Centers for
Disease Control and Prevention, 2010).
•	Both lower income and minority race/ethnicity have independent associations with higher
asthma rates, particularly among children, and diabetes rates differ greatly by race/ethnicity
(Centers for Disease Control and Prevention, 2011a).
•	Mortality rates for cancer and heart disease vary somewhat by race/ethnicity (Centers for
Disease Control and Prevention, 2011b). Coronary heart disease and stroke are elevated
among black individuals but generally not in other minority subgroups (Centers for Disease
Control and Prevention, 2011a).
11 This survey, the National Health Interview Survey, represents the U.S. non-institutionalized civilian population,
and presents age-adjusted estimates based on household interviews (Centers for Disease Control and Prevention,
2011d).
18 | Page

-------
Overview of Data and Methods
•	Infant mortality is higher among non-Hispanic black, American Indian/Alaska Native, and
Puerto Rican (but not other Hispanic) populations (Centers for Disease Control and
Prevention, 2011a).
While some health disparities are due to differences in health care, diet, activities, psychosocial stress or
even genetics, it is possible that some portion of certain disparities may be related to differences in
environmental exposures. Some of these differences in exposure are associated with residential
location, and could be considered in EJSCREEN (while others cannot be considered in EJSCREEN, such as
those related to use of consumer products or diet, for which high-resolution geographic data are not
available). Various environmental exposures have been shown to vary by race/ethnicity, income and
other demographic factors (Liu, 2001; Maantay, Chakraborty, & Brender, 2010; U.S. EPA, 2006a), but
EJSCREEN is not predicated on an assumption of such a correlation.
In addition to Executive Order 12898's call, perhaps the most important reason to focus on key
demographic groups in EJSCREEN is that a growing body of research has shown that demographic
factors are associated with susceptibility - certain groups are more impacted by a given level of
exposure to certain pollutants. Various groups have shown increased susceptibility to certain pollutants,
but further evidence is still emerging in this area and data are limited. Evidence currently available
includes the following:
•	Certain demographic groups, such as those with lower educational attainment, children, the
elderly and those with low socio-economic status (SES), appear to be more susceptible to a
given exposure to particulate matter (U.S. EPA, 2009b).
•	Blood lead's association with cardiovascular outcomes appears to be stronger among
Mexican Americans and non-Hispanic blacks than non-Hispanic whites (U.S. EPA 2011c).
•	Some but not all studies suggest lead has a greater impact on IQ among low SES than high
SES individuals (U.S. EPA 2011c).
EJSCREEN is not designed to explore the root causes of differences in exposure. The demographic
factors included in EJSCREEN are not necessarily causes of a given community's increased exposure or
risk. This does not limit their usefulness for the limited purposes of the screening tool, however - these
demographic factors are still useful as indicators of potential susceptibility to the environmental factors
in EJSCREEN. They may be associated with susceptibility, whether or not they are causal, and can be
used as proxies for other harder-to-measure factors that would better describe or determine
susceptibility but for which nationally consistent data are not available. EJSCREEN screens geographic
areas for increased potential for exposure and increased potential for susceptibility to exposures.
Additional analysis is always needed to explore any underlying reasons for differences in susceptibility,
exposure or health.
Some studies have begun to quantify the degree of susceptibility to specific pollutants in particular
demographic groups, such as the work on educational attainment as an effect modifier for risks
associated with PM2.5 exposure, but such emerging knowledge is still limited to a handful of pollutants
19 | Page

-------
Overview of Data and Methods
and demographic factors (U.S. EPA, 2009b). EJSCREEN, as a screening tool, does not attempt to use this
type of emerging quantitative information.
Demographic Indicators Included in EJSCREEN
A wide range of demographic descriptors have been used by researchers and in EJ screening tools to
represent the "social vulnerability" characteristics of a disadvantaged population (for example, see
deFur et al., 2007, and Bell & Ebisu, 2008).
Executive Order (EO) 12898, addressing EJ issues, refers to low-income and minority populations. We
define these two core factors as:
•	Low-Income: The number or percent of a block group's population in households where the
household income is less than or equal to twice the federal "poverty level."12
•	Minority: The number or percent of individuals in a block group who list their racial status as
a race other than white alone and/or list their ethnicity as Hispanic or Latino. That is, all
people other than non-Hispanic white-alone individuals. The word "alone" in this case
indicates that the person is of a single race, since multiracial individuals are tabulated in
another category - a non-Hispanic individual who is half white and half American Indian
would be counted as a minority by this definition.13
Based on a review of other factors used in various EPA EJ screening tools, the four other factors most
commonly used by EPA Headquarters and Regions for EJ analyses are also included in EJSCREEN. The
other four factors are:
•	Less than high school education: The number or percent of people age 25 or older in a block
group whose education is short of a high school diploma.
•	Linguistic isolation: The number or percent of people in a block group living in linguistically
isolated households. A household in which all members age 14 years and over speak a non-
English language and also speak English less than "very well" (have difficulty with English) is
linguistically isolated.
•	Individuals under age 5: The number or percent of people in a block group under the age of
5.
12	More precisely, percent low-income is calculated as a percentage of those for whom the poverty ratio was
known, as reported by the Census Bureau, which may be less than the full population in some block groups. More
information on the federally-defined poverty threshold is available at
http://www.census.gov/hhes/www/povertv/methods/definitions.html. See Appendix B for details on using twice
the poverty threshold.
13	Census definitions of race/ethnicity are available at:
http://www.census.gov/population/www/socdemo/race/index.html and the questions asked about race are
available at: http://www.census.gov/acs/www/about the survey/questions and why we ask/
20 | P a g e

-------
Overview of Data and Methods
• Individuals over age 64: The number or percent of people in a block group over the age of
64.
The source of all demographic data used in EJSCREEN is the American Community Survey (ACS) five-year
summary file, which the U.S. Census Bureau compiles yearly.
Appendix D provides summary statistics for the demographic indicators.
The supplementary reports and maps provided by EJSCREEN also include an extensive list of additional
demographic variables, including statistics on race/ethnicity subgroups (e.g., percent Hispanic or Latino),
languages spoken (e.g., % speaking Vietnamese), income (% in poverty), and many other factors. This
supplementary information may be very useful. For example, subgroups within the broad category of
"minority" can differ greatly in their baseline health, exposures, geographic locations, and other factors.
Demographic Indexes in EJSCREEN
The Demographic Index in EJSCREEN is created using the two demographic indicators that were
explicitly named in EO 12898, low-income and minority. For each Census block group, these two
indicators are simply averaged together.
Demographic Index = (% minority + % low-income) / 2
A Supplementary Demographic Index is also available in EJSCREEN, and is the average of all six
demographic indicators.14
Supplementary Demographic Index =
(% minority + % low-income + % less than high school education + % linguistic isolation +
% individuals under age 5+ % individuals over age 64) / 6
14 Census evaluates each characteristic for a different population, so the denominators are not the same across
factors. For example, the denominator for "% less than high school education" is the population in the block group
25 years and older. The denominators for the factors "% minority" and "% low-income" are both close to the total
block group population, but some people in each block group are not evaluated for each demographic
characteristic.
21 | P a g e

-------
Overview of Data and Methods
Users can also view each demographic indicator separately in EJSCREEN, in reports or in maps.
The Demographic Indexes count each indicator as adding to overall potential susceptibility of the
population in a block group, and assumes the demographic indicator have equal and additive impacts.
The current lack of available data precludes any attempt to disentangle the different influences of the
individual demographic indicators, or quantify the degree of overlap or potential synergy between them.
The demographic groups in EJSCREEN overlap to some extent, because some individuals are both low-
income and minority, for example. In fact, these indicators are correlated at the block group level,
because minorities are more likely to be low-income than non-minorities. Appendix D has information
on the correlations between these variables. These correlations do not affect the indicator's ability to
account for susceptibility, if the assumption of additive effects on susceptibility is appropriate. As more
data becomes available in the future, some of these complexities can be reexamined.
Additional information about the demographic data used in EJSCREEN is available in Appendix B.
Environmental Justice Indexes in EJSCREEN
The EJ index is a combination of environmental and demographic information. The environmental
portion of the EJ index is drawn directly from the environmental indicators described above, and the
demographic information is also taken from the demographic indicators above.
How the EJ Index Works
To calculate a single EJ Index, EJSCREEN combines a single environmental indicator with demographic
information. It considers the extent to which the local demographics are above the national average. It
does this by looking at the difference between the demographic composition of the block group, as
measured by the Demographic Index, and the national average (which is approximately 35%). It also
considers the population of the block group.
Mathematically, the EJ Index is constructed as the product of three items, multiplied together as
follows:
EJ Index =
(Environmental Indicator)
X (Demographic Index for Block Group -Demographic Index for US)
X (Population count for Block Group)
The demographic portions of the EJ Index can be thought of as the additional number of susceptible
individuals in the block group, beyond what you would expect for a block group with this size total
population.
22 | P a g e

-------
Overview of Data and Methods
"Susceptible" or "potentially susceptible individuals" are used informally in these examples, as a way to
think of the Demographic Index times the population count in a block group, which is essentially the
average of the count of minorities and count of low-income individuals.15 It is easiest to think of the
average of these counts as "the susceptible individuals" in these examples.
The number of potentially susceptible individuals (Demographic Index times population count) of course
is typically less than the actual number who are minority, low-income, or both. The demographic
breakdown is not reported by block group -the ACS does not provide that level of resolution on the
overlaps.16 For example, suppose that in a certain block group of 1000 people, 350 (35%) are minority
and 350 (35%) low-income. There might be 200 (20%) who are low-income but not minority, and 200
(20%) who are minority but not low-income. In that case, there would be 150 (15%) who are both, and
450 (45%) who are neither. Therefore, there actually would be 550 (55%) who were either minority,
low-income, or both. The Demographic Index would use 35% in this case, which falls between the 15%
who were both minority and low-income, and the 55% who were in at least one of these groups. These
detailed numbers cannot be obtained from the ACS by block group. Therefore, to represent both groups
in a simple way, the average is used.
An extreme example shows another situation: Suppose a block group has 1000 people but is 0%
minority and 100% low-income. The demographic index would be 50%, or the equivalent of 500
"potentially susceptible individuals" in this case. The same would be true in a block group that was 100%
minority but 0% low-income - it would treated as having the equivalent of 50% (500) "potentially
susceptible" for the sake of these examples.
The EJ Index uses the concept of "excess risk" by looking at how far above the national average the
block group demographics are. For example, assume a block group with 1000 people in it. In that block
group, one would expect 350 potentially susceptible individuals (1000 people here x US average of 35%).
However, if the Demographic Index for that block group is 75%, well above the US average, then there
are the equivalent of 750 potentially susceptible people in that block group, or 400 more than expected
for a block group with a population of 1000. The EJ Index would be 400 times the environmental
indicator in this case.
This formula for the EJ Index is useful because for each environmental indicator it finds the block groups
that contribute the most toward the national disparity in that environmental indicator. By "disparity" in
this case we mean the difference between the environmental indicator's average value among certain
demographic groups and the average in the US population.
15	To be precise, the percent low-income times population is not always exactly the same as the count of low-
income residents. The percent low-income is calculated as a fraction of those for whom poverty status could be
determined, which is less than the full population in some block groups. For simplicity, these examples omit that
detail.
16	The closest available data would be table B17001 and related tables, which provide tract resolution cross-
tabulations of race/ethnic groups by poverty status, but this is not available for block groups and does not provide
the income to poverty ratio data needed to calculated "low-income" as defined in EJSCREEN.
23 | P a g e

-------
Overview of Data and Methods
Minority and low-income individuals live in older housing more often than the rest of the US population,
for example. The EJ Index for lead paint (pre-1960 housing) tells us how much each block group
contributes toward this "excess population risk" or "excess number" of people in older housing, for
potentially susceptible individuals. "Excess" in this context simply means the number of potentially
susceptible individuals in older housing nationwide is above what it would be if they were in older
housing at the same rate as the rest of the U.S. population. Locally, it also means the number is above
what it would be if the block group had the same demographic percentages as the U.S. overall.
Analysis of the EJSCREEN data for minority, or for low income, individuals (roughly one third of the US
population in either case), shows they have a higher environmental indicator value on average than the
rest of the U.S. population, for 11 of the 12 environmental indicators (ozone is the exception).
Note that the EJ Index raw value itself is not reported in EJSCREEN reports- it is reported in percentile
terms, to make the results easier to interpret. If one is calculating the actual raw values using the
formula, it is clear that the EJ Index value can be a positive or negative number. A positive number
occurs where the local Demographic Index is above the US average, and this means the location adds to
any excess in environmental indicator values among the specified populations (minority and low-
income) nationwide. A negative value occurs where the local Demographic Index is below the US
average, and it means the location offsets the other locations, reducing any excess in nationwide
average environmental indicator values among minority and low-income populations relative to others.
Most EJSCREEN users will not work directly with EJ Index raw values, however, and positive raw values
for an EJ Index will be presented as higher percentiles and negative raw values will appear as lower
percentiles.
Supplementary EJ Indexes
In addition to this EJ Index formula, two other types of EJ Indexes were tested during EJSCREEN's
development, and those supplementary EJ indexes are also available in maps and the database files. The
three approaches, the basic index and two supplementary approaches, are complementary to one
another and provide different perspectives on a given area. The difference between the three EJ indexes
for any single indicator lies in the different ways the demographic portion of the index is developed.
Analyses comparing the three approaches concluded that they differ only moderately, and that the EJ
Index selected by EPA as the focus of EJSCREEN falls squarely between the other two, in terms of which
locations it highlighted. In other words, the other two approaches were at either end of a continuum,
and did not overlap as much, but the EJ Index featured in EJSCREEN overlaps very substantially with
both of the other two. It also has the advantage of having a clear interpretation, analogous to excess
population risk or the local contribution to excess risk nationwide.
The Supplementary EJ Indexes also combine an environmental indicator with demographic information.
In the first Supplementary EJ Index, the demographic information is simply the Demographic Index for
the block group, times the population of the block group. This index simply omits the demographic index
24 | Page

-------
Overview of Data and Methods
for the nation, so it focuses less on the extent to which demographics exceed the national average. The
first Supplementary EJ Index can be expressed mathematically as follows:
Supplementary EJ Index 1 =
(Environmental Indicator)
X (Demographic Index for Block Group)
X (Population count for Block Group)
The second Supplementary EJ Index can be expressed mathematically as follows:
Supplementary EJ Index 2 =
(Environmental Indicator)
X (Demographic Index for Block Group)
The second Supplementary EJ Index combines an environmental indicator with the Demographic Index
for the block group, leaving out demographic index for the nation and also ignoring the block group
population. This index does not give more weight to block groups with very large numbers of residents.
An advantage is that it is simple and treats each block group equally regardless of how many people live
there. A disadvantage is that this effectively gives less weight to a person in a populous block group than
a person in a less populated block group.
The EJSCREEN mapping application also allows users to see each of these EJ indexes calculated using the
Supplementary Demographic Index, which uses all six demographic indicators instead of just two. In
most locations the results do not differ greatly from what is found using the standard EJ Index.
Details on all the available indexes are also included in Appendix E.
EJ Indexes, Population Density, and Rural Areas
It is very important to understand that the population count per block group is not the same as
population density, so the population weighting of the EJ Index has nothing to do with whether a place
is high density, or rural versus urban. In fact, there is almost no correlation between population count
per block group and population density (population count per square mile covered by the block group),
because in low density areas each block group covers a larger area, keeping the population per block
group fairly consistent. This means population weighting in the EJ Index does not emphasize urban or
high density locations - it is neutral with regard to population density or urbanization. Furthermore, the
vast majority of block groups in the US have similar population counts, so the population weighting in
the EJ Index has a strong influence only in a tiny fraction of locations. For example, about 90% of block
groups had a population between 500 and 2500 in 2008-2012, and only about 1% had a population over
4000.
25 | P a g e

-------
Overview of Data and Methods
It is true that many of the EJ Indexes have higher values in urban, high-density areas, but this is true for
the supplementary EJ Indexes as well, and is not the result of population weighting. Differences in
environmental indicator values (and to some extent percentage demographics) are generally the drivers
of higher EJ Indexes in urban or high-density block groups. NATA indicators and the lead paint indicator,
in particular, are strongly correlated with population density, as are the PM2.5 and traffic indicators to
some extent. The proximity indicators are also positively (but weakly) correlated with population
density. In other words, these environmental indicators appear to be lower in rural areas in general, and
combined with some demographic differences, this tends to make the EJ Indexes lower in those areas.
Relative to those factors, the population weighting (or choice of EJ Index formula) has a very small
influence on whether urban or rural areas are highlighted.
Why the EJ Indexes are Not all Combined
For each environmental indicator, one standard EJ index is available in EJSCREEN. At this time, there is
not a single composite EJ index that combines all the environmental factors. Although it would be useful
if a simple metric could summarize all of the information in EJSCREEN as a single number, there is no
widely-accepted, objective way to combine the differing environmental concerns into one number. This
is because of the value judgments and scientific challenges inherent in deciding how much weight or
importance should be given to each of the environmental factors. They are very difficult to compare, in
terms of public health importance, public concerns, and the many other important considerations that
could be weighed. This topic has been covered extensively elsewhere17, but a very brief explanation may
be useful here.
First, a so-called "equal weighting" does not exist, because it would just be an artifact of the units
(scaling) and aggregation method one chose, which would carry implicit value judgments about how to
weight and combine the factors, even if it seemed simple at first glance. Putting equal weight on each
percentile, for example, would implicitly equate very low risks (e.g., air toxics well below health based
reference levels, as with a neurological hazard index well below 1) and much higher risks (e.g., PM2.5
levels well above a health based standard).
Furthermore, while the use of percentiles provides useful perspective by putting the 12 EJ indexes in
common units, it would be a mistake to assume the 80th percentile, for example, has the same
"importance" for one index or indicator as for another. If two indexes are at the same percentile, it
simply means those two scores are equally common (or equally rare) in the United States. It does not
mean the risks are comparable. It is therefore critical when interpreting EJSCREEN percentiles to also
look at the actual raw numbers for the environmental and demographic indicators.
The challenge is compounded by the fact that rankings of block groups using a composite environmental
index would be quite sensitive to the method chosen to combine the environmental indicators, based
on EPA's analysis of the data. This is also the case, albeit to a lesser degree, for any composite EJ index.
This is a result of the environmental indicators not being highly correlated with each other. The locations
17 See, for example, OECD, 2008, or Finkel & Golding, 1994.
26 | P a g e

-------
Overview of Data and Methods
with the highest PM2.5 levels are not usually the same as the ones with the highest NPL proximities, for
example, as suggested in Appendix D. If the NPL indicator is treated as more important, different block
groups would be highlighted than if the PM2.5 indicator were given more weight. Again, it is important to
acknowledge that there is no objective version of "equal weighting."
For these reasons, the environmental indicators are not combined as a single number, and must be
understood individually for a complete picture. However, they can be viewed all at the same time in a
single tabular report, and this facilitates a broad perspective on all the factors at one time. Those using
EJSCREEN and considering aggregating the data as a single summary metric are strongly urged to
carefully consider these pitfalls associated with doing so. A thorough understanding of each indicator
and the ability to view all of them in a report provides a far better picture of the screening results than
any single number or map is capable of.
Percentiles
What a Percentile Means
EJSCREEN puts each indicator or index value in perspective by reporting the value as a percentile. For
example, an area may show 60% of housing was built prior to 1960. It may not be obvious whether this
is a relatively high or low value, compared to the rest of the nation or in the state. Therefore, EJSCREEN
also reports that 60% pre-1960 puts this area at the 80th percentile nationwide. For a place at the 80th
percentile nationwide, that means 20% of the US population has a higher value.
A percentile in EJSCREEN tells us roughly what percent of the US population lives in a block group that
has a lower value (or in some cases, a tied value). This means that 100 minus the percentile tells us
roughly what percent of the US population has a higher value. This is generally a reasonable
interpretation because for most indicators there are not many exact ties between places and not many
places with missing data.
More precisely, the exact percentile for a given raw indicator value is calculated as the number of US
residents of block groups with that value or lower, divided by the total population with known indicator
values. This is typically the same as or almost exactly the same as dividing by the total US population,
but for some indicators some locations do not have an indicator value. For example, the NATA indicators
are missing for only about one twentieth of 1% of the US population in the 2015 version of EJSCREEN.
The calculated percentile would change by much, much less than 1 percentile point if calculated as a
fraction of the total population instead of as a fraction of those with valid indicator values.
All percentiles in EJSCREEN are population percentiles, meaning they describe the distribution of block
group indicator scores across the population. Note that a population percentile may be slightly different
than the unweighted percentile (the percent of block groups, not people, with lower or tied values),
because not all block groups have the same population size. In practice they are very similar because
very few block groups diverge very much from the average in population size.
27 | P a g e

-------
Overview of Data and Methods
Color-coded High Percentile Bins
Locations at least at the 80th percentile but less than the 90th are shown in yellow on EJSCREEN maps,
while those at the 90th percentile but less than 95th percentile are orange on the maps, and those at the
95th percentile or above are shown in red on maps and reports. These colors call attention to certain
locations as a very simple way to communicate relative screening results. There is no official policy
significance assigned to each individual color on the maps, but the choice of these categories or "bins" is
noteworthy because it signifies that certain ranges of percentiles may merit closer attention.
Percentiles at or above the 95th percentile are shown in red on the EJSCREEN standard report. This is a
way to call particular attention to those cases where the value is in the top 5% of the nation (or region
or state). Indicator or index values in the top 5% tend to be much higher than those in the next 5-10%,
so they may merit close attention. This is especially true for the indicators with highly skewed
distributions, such as the traffic proximity indicator (see Appendix D, Table 6 and Figure 1). For example,
block groups in the top 5% (shown in red on maps and reports) have traffic, NPL, and TSDF proximity
indicators on average that are about three times as high as in the next 5% (shown in orange on the
maps). These differences are far less extreme in the cases of PM2.5 and lead paint indicators, which don't
vary as much across block groups. In general, though, indicator or index values above the 95th percentile
represent much higher demographic, environmental, or EJ Index values than those at lower percentiles.
The maps also identify areas in the 90th to 95th percentiles as orange, and those at the 80th to 90th as
yellow. These additional categories highlight larger groups of locations that have indicator or index
values well above the national mean or median for the given indicator or index. The actual values are
lower than those in the top 5%, typically much lower, but they are still in the top 10 to 20% of values for
the US population overall.
A relatively high percentile means the value is relatively uncommon. However, a high percentile is not
necessarily a real concern from a health or legal perspective. To understand the actual health or other
implications of any screening results requires looking at the actual data and the indicator represents,
and also looking at other relevant data if available. Besides the percentile, other important
considerations in interpreting any screening results include the following:
1.	whether and to what extent the environmental data shows values above any relevant health-
based or legal threshold,
2.	the significance of any such thresholds, or the magnitude and severity of the health or other
impacts of the given environmental concern, nationally or locally, and
3.	the degree of any disparity between various groups, in exposures to the relevant environmental
pollutants.
In maps, EJSCREEN focuses on the US percentiles, as a way to visualize all results in common units.
28 | Page

-------
Overview of Data and Methods
The US percentile uses the US population as the basis of comparison. The state or regional percentile
was calculated based on the population in a given state (or DC) or one of EPA's 10 regions.18 The
national or state or regional mean value was calculated as the population weighted average of the block
groups with data for that indicator, within the respective geographic scope.
Note that the US and state percentiles both will rank block groups in exactly the same rank order within
the given state. If the goal is just to rank or compare locations within a single state, it does not matter
whether the US or state percentile is used. The difference between state and US percentiles becomes
apparent mainly in two situations: when comparing places across states, or when comparing results to
some pre-determined, specific reference percentile (e.g., 80th percentile).
The advantage of US percentiles for an EJ Index, for example, is that a higher percentile in place A versus
place B clearly indicates that the combination of the environmental indicator and demographic index is
greater in place A than place B. In a sense, the US percentile indicates how uncommon it is to have such
a high level for an indicator or index.
State or regional percentiles cannot be compared across states or regions as easily. If two places A and
B, in two different states, happen to both be at the 80th percentile for the traffic EJ Index, for example, it
is not clear which actually has the higher index value. It just means that A's index is just as uncommon
within that state as B's is in B's state. However, this may be useful information because an EJSCREEN
user may want to know how high the indicator is relative to the rest of that state.
The state and US percentiles will be very similar if the state and US average indicator values are very
similar. However, if the state average is very low compared to the US, the state percentile shown will be
higher than US percentile shown, for a given raw value of an indicator. If the state average is much
higher than the US average, for an indicator like the traffic indicator, then a traffic score that would
normally be considered fairly high nationwide, such as the 90th percentile in the US, would not be
considered very unusual within that state, so the state percentile would be lower, and might be only
78th percentile, for example. The state percentile being lower than the US percentile does not mean the
indicator value is lower in the given place, it just means the state average is higher than the US average.
Appendix B provides details on how percentiles are calculated, and rounded when displayed.
18 Regions in the 2014 version of EJSCREEN were defined only by State. A small number of block groups on Tribal
Lands along the borders of Nevada and Arizona are actually part of EPA Region 9, even though they are within the
States OR/ID (Region 10), UT/CO (Region 8), or NM (Region 6). EJSCREEN's percentiles and reports processed those
block groups as if they were in the Region that corresponds to their State. Regional and Tribal Lands maps should
be consulted when viewing those locations and reported Regional percentiles should not be used in those
locations.
29 | P a g e

-------
Overview of Data and Methods
Buffer Reports
What the Buffer Report Calculates
EJSCREEN allows a user to define a buffer, such as the circle that includes everything within 1 mile of a
specific point. Non-circular, user-defined shapes also can be defined to represent buffers of any shape.
The summary within a buffer represents the average resident within the buffer. A report summarizes
the demographics of residents within this buffer, as well as the environmental indicators and EJ index
values within the buffer. It also provides an estimate of the total population residing in the buffer.
Note that this means one cannot compare two buffers of very different population counts without
understanding what each set of results represents. Each represents the average person in that buffer. It
does not represent the absolute total amount the buffer contributes to overall disparity in indicator
scores nationwide or statewide. Even if the two sets of scores are identical other than in population
counts, the buffer with a larger population will contribute more to any national or overall disparity in
indicator scores. In general, however, this situation does not tend to arise because most buffers that a
user creates in practice will be at least roughly similar in size and population. Even if they are not, a user
simply needs to acknowledge that some buffers have larger populations, in which case those percentile
results represent more people.
Appendix B provides the details of buffer calculations.
Choosing a Buffer Size versus Rationale for Distance in Proximity Indicators
An EJSCREEN user's choice of distance for a circular buffer is important, and the considerations need to
be understood.
EJSCREEN is not able to report on buffers that are too large (e.g., ten or more miles in radius) due to
computational limits. It also is not able to report on buffers that are too small (i.e., they do not intersect
the internal points of any Census blocks).
In addition to those limits, as a rule of thumb, it is important to know that a buffer that is as small as the
buffer it centers on will result in estimates with substantially higher uncertainty than one which covers
several block groups. A buffer covering five or more block groups, for example, will provide much more
confidence in demographic estimates (because of sampling uncertainty as well as the challenge of
estimating where residents are located within block groups intersected by the edges of the buffer).
Examining patterns the size of a block group or smaller requires using maps of block groups and Census
blocks, rather than attempting to draw buffers in EJSCREEN.
Uncertainties are also discussed in section 1 (as general caveats), and Appendix B (in discussions of
buffering details and demographic data).
30 | P a g e

-------
Overview of Data and Methods
It is important not to confuse two different distances:
1)	the distance a user selects for a circular buffer radius (e.g., by default 1 mile, which is 1.6 km),
and
2)	the distance EJSCREEN used in proximity score calculations (i.e., 5 km for facilities and sites, or
500 meters for traffic).
These two are very different, as explained here, because proximity scores use a large distance (e.g., 5
km) and inverse distance weighting, while for circular buffers a user may wish to specify a shorter
distance (e.g., 1 mile or 1.6 km, but at least as large as one or more local block groups) because a buffer
report does not use distance weighting.
The buffer analysis provides a summary of the average resident inside the buffer. It gives equal weight
to each resident, regardless of whether they are closer or further from the center of the buffer. There is
no distance weighting in a buffer report.
Proximity scores were created very differently than buffer reports are calculated. The proximity scores
for each block group were calculated for each residential location using distance weighting to give more
weight to closer facilities, sites, or traffic. Because the proximity score uses distance weighting to focus
less on the more distant points, it was designed to look at a large area using a large radius, or distance (5
km for facilities or NPL sites). By distance weighting, the proximity score can examine this large area and
still provide a useful summary of all the facilities or sites in that wide area.
The proximity score for traffic, for example, looks within a search radius of 500 meters (or further if
none is found in that radius). This distance, or scope, was selected to be large enough to capture the
great majority of road segments (with traffic data) that could have a significant impact on the local
residents, balanced against the need to limit the scope due to computational constraints. Within this
relatively wide zone, the closest traffic is given more weight, and the distant traffic given less weight,
through inverse distance weighting. The same approach was used for the facility or NPL site proximity
scores. A distance of 5 km was chosen to capture the great majority of facilities or sites that could have a
significant impact on local residents. The fact that impacts may be very small for distances of 4 or 5 km is
handled by the use of inverse distance weighting in the proximity score formula.
By contrast, a buffer report, again, averages together all residents in the buffer, treating them all equally
regardless of their distance from the buffer center. Therefore, many EJSCREEN users may wish to define
a modest buffer distance that focuses on those residents who may be "most affected" and "similarly
affected" by a single facility or site of interest at the center of the circular buffer.
A very large buffer (e.g., over three miles in radius) could provide misleading results if the goal is to
describe the "affected" population, because people in a large buffer could be extremely varied in the
extent of their exposure to some source at the center of such a large buffer. If impacts decline with
distance, a large buffer would mix many relatively unimpacted, distant residents, with fewer residents
who are closer and impacted more, giving a diluted result that fails to describe those most impacted.
31 | P a g e

-------
Overview of Data and Methods
At the other extreme, a very small buffer (e.g., the size of a local block group or smaller) is problematic
because it could fail to include some significantly affected residents, and also because estimates are
more uncertain for smaller geographic areas due to sampling error in the ACS and spatial error in
estimating which residents are inside the buffer. Some EJSCREEN users may wish to define a large buffer
distance when they know a local facility or site covers a wide area (for example some NPL sites can be
very large). Some users may wish to run and compare separate buffer reports for two or more choices of
distance, or define hand-drawn buffers to look at zones at various distances from some point.
32 | P a g e

-------
Details on Environmental Indicators
3 DETAILS ON THE ENVIRONMENTAL INDICATORS IN
EJSCREEN
Environmental Factors Not Included
As described above, EJSCREEN contains 12 environmental factors, which were selected after a review of
available data, other EJ tools and analyses, and the data selection criteria discussed above.
A number of possible factors were identified in the review, which were not ultimately included in
EJSCREEN due to various limitations. These limitations were almost always a lack of high resolution data
(e.g., only available at county level), and/or lack of geographic coverage (e.g., only available in selected
or sampled locations). In some cases a factor was not added because of a high degree of overlap and
double-counting with existing indicators, or resource constraints and practical considerations. One or
more of these factors may be included in future versions of EJSCREEN as more data become available.
Other EPA resources also have more information on many of these issues, such as
•	EPA's website (www.epa.gov)
•	Envirofacts (http://www.epa.gov/enviro/)
•	A county-level US Environmental Quality Index (EQI)19
•	C-FERST (http://www.epa.gov/heasd/c-ferst/)
•	EnviroAtlas (http://enviroatlas.epa.gov/enviroatlas/atlas.html)
EJSCREEN also provides the ability to view some of these issues directly within the EJSCREEN maps, such
as impaired water bodies, criteria air pollutant nonattainment areas, TRI facilities, and others.
Furthermore, users can import and view, within EJSCREEN, other maps available on the internet, for
more of these environmental issues.
Users of EJSCREEN are also encouraged to consider these issues, where appropriate, to the extent they
have relevant local information.
Factors not currently used in EJSCREEN include:
•	Health data (e.g., overall mortality rate) - Note this is not an environmental factor, but is
sometimes of interest in this context. Relevant data were found to be available only at
county level resolution.20
•	Drinking water (from private wells or public water supplies) and surface water quality (other
than through the potential relevance of the NPDES proximity indicator in EJSCREEN)21
19	Messer, Jagai, Rappazzo, & Lobdell (2014)
20	Useful resources include http://www.countvhealthrankings.org, http://www.americashealthrankings.org/,
http://www.healthindicators.gov/, and http://www.rwif.org/en/research-publications/research-features/rwif-
datahub/national.html#q/scope/national/ind/31/dist/29/char/119/time/14/viz/map/cmp/brkdwn
21	See http://www2.epa.gov/learn-issues/water-resources and
http://water.epa.gov/scitech/datait/tools/waters/tools/index.cfm
33 | P a g e

-------
Details on Environmental Indicators
•	Contaminated fish/ seafood (other than through the potential relevance of the NPDES
proximity indicator in EJSCREEN)22
•	Beach closures due to pathogens (other than through the potential relevance of the NPDES
proximity indicator in EJSCREEN)23
•	Impaired surface waters (assessed only in certain locations, so lacking complete coverage of
all locations in the US)
•	Sea-level rise or other impacts of global climate change24
•	Radon gas exposure25 or indoor air pollutants other than radon26
•	Criteria air pollutants other than PM2.5 and ozone (Pb, CO, SOx and NOx)27 (Not included due
to resource constraints, and more limited modeling/coverage, frequency of nonattainment,
or health impact than for PM2.5 and ozone).
•	Proximity to other point or area sources not already accounted for by EJSCREEN's proximity
indicators, NATA indicators, or other air quality indicators. This can include some TRI
reporting facilities that do not emit HAPs to air, for example. TRI facilities are a small but
important fraction of all regulated facilities. Those emitting HAPs to air are already
considered through the NATA indicators, and many others are included in the RMP
indicator.28
•	Exposures to short episodes of elevated releases of air pollutants during startup, shutdown,
malfunction, etc. (data gaps in coverage and resolution)
•	Exposures to undocumented emissions caused by leaks
•	Exposures related to oil and gas extraction, such as hydraulic fracking29
•	Mining (e.g., uranium mining, etc.)30
•	Coal ash ponds
•	Combined animal feeding operations (CAFOs)31
•	Leaking underground storage tanks or other contaminated sites other than National
Priorities List (NPL) sites, Risk Management Plan (RMP) facilities, and Treatment, Storage
and Disposal Facilities (TSDFs)32
•	Pesticide exposures from spray drift or other sources, or pesticide exposures from
residential and other non-agricultural uses33
See http
//www2.epa.gov/learn-issues/water-resources
See http
//www2.epa.gov/learn-issues/water-resources
See http
//www.epa.gov/climatechange/
See http
//www.epa.gov/iaa/ia-intro. html
See http
//www.epa.gov/iaa/ia-intro. html
See http
//www.epa.gov/air/urbanair/
28	For examples of other efforts to consider a wide range of facilities, see the various TRI mapping tools such as
(http://www2.epa.gov/toxics-release-inventorv-tri-program/tri-data-and-tools). and see EPA's RSEI tool:
http://www.epa.gov/opptintr/rsei/index.html
29	See http://www2.epa.gov/hvdraulicfracturing
30	See http://water.epa.gov/polwaste/npdes/Mining.cfm
31	See http://water.epa.gov/polwaste/npdes/afo/index.cfm
32	See http://www.epa.gov/swerustl/overview.htm
33	See http://www2.epa.gov/safepestcontrol and http://www2.epa.gov/science-and-technology/pesticides-science
34 | P a g e

-------
Details on Environmental Indicators
•	Noise pollution and odors not already accounted for in other data34
•	Occupational exposures
•	Exposures related to imported or domestic consumer products, foods and beverages, or
other sources of exposure where we lack detailed geographic data
•	Ecosystem services35
EJSCREEN is designed to be a nationally consistent screening tool, with results calculated and displayed
at the Census block group level. Data inputs must be from publicly available sources, available and
consistent across the entire country, and with sufficient spatial resolution. There must be some plausible
means of quantifying an adverse effect or a proxy for an adverse effect on residential populations. These
requirements set a high bar for including environmental data. Currently, none of the potential
environmental factors listed above meet those criteria.
Environmental Factors in EJSCREEN
Each of the 12 environmental indicators included in EJSCREEN can be viewed separately. Each
environmental indicator is also combined with demographic indexes to form the EJ indexes outlined
above. Table 2 summarizes the environmental indicators in EJSCREEN, and the following sections
describe each environmental indicator in more detail. The sections above address criteria for selecting
which environmental factors to include in this version of EJSCREEN. Appendix D provides summary
statistics for each indicator, including mean and percentile values.
Table 2. Summary Table of Environmental Indicators and Sources
Key
Indicator
Details
Source [
Medium



Air
Air
Air
NATA air
toxics
cancer risk
NATA
neurological
hazard
index
NATA
respiratory
hazard
index
Lifetime cancer risk from
inhalation of air toxics
Air toxics neurological hazard
index (ratio of exposure
concentration to health-based
reference concentration)
Air toxics respiratory hazard
index (ratio of exposure
concentration to health-based
reference concentration)
EPA NATA, retrieved 20XX
http://www.epa.gov/ttn/atw/nat
amain/index.html
EPA NATA, retrieved 20XX
httpV/www.epa.p""74"4""^4"'"7^
amain/index.html
9
20xx
EPA NATA, retrieved 20XX
http://www.epa.gov/ttn/at
amain/index.html
Air
NATA diesel
PM
Diesel particulate matter level
in air, ng/m3
EPA NATA, retrieved 20XX
http://www.epa.gov/ttn/atw/nat
amain/index.html
20xx
34	See http://www.epa.gov/air/noise.html
35	See http://enviroatlas.epa.gov/enviroatlas/
35 | P a g e

-------
Details on Environmental Indicators
Air
Particulate
matter
PM2.5 levels in air, ng/m3
annual avg. (2011)
EPA, OAR (fusion of model and
monitor data). For methods, see
http://www.epa.gov/esd/land-
sci/lcb/lcb faasd.html
2011
Air
Ozone
Ozone summer seasonal avg.
of daily maximum 8-hour
concentration in air in parts
per billion (2011)
EPA, OAR (fusion of model and
monitor data). For methods, see
http://www.epa.gov/esd/land-
sci/lcb/lcb faqsd.html
2011
Air/other
Traffic
proximity
and volume
Count of vehicles (AADT, avg.
annual daily traffic) at major
roads within 500 meters,
divided by distance in meters
(not km)
Calculated from 2011 U.S. DOT
traffic data, retrieved 4/2012
http://www.rita.dot.gov/bts/site
s/rita.dot.gov. bts/files/publicatio
ns/national transportation atlas
database/2011/index.html
2011
Dust/
lead
paint
Lead paint
indicator
Percent of housing units built
pre-1960, as indicator of
potential lead paint exposure
Calculated based on Census/ACS
data, retrieved 2014
http://www2.census.gov/acs201
2008-
2012
2 5vr/summarvfile/
Waste/
air /
water
Proximity to
RMP sites
Count of RMP (potential
chemical accident
management plan) facilities
within 5 km
(or nearest one beyond 5 km),
each divided by distance in
kilometers
Calculated from EPA RMP
database, retrieved 11/2013
2013
Waste/
air /
water
Proximity to
TSDFs
Count of TSDFs (hazardous
waste management facilities)
within 5 km (or nearest
beyond 5 km), each divided by
distance in kilometers
Calculated from EPA RCRAInfo
database, retrieved 11/2013
http://www.epa.gov/enviro/fact
s/rcrainfo/search.html
2013
Waste/
air /
water
Proximity to
NPL sites
Count of proposed and listed
NPL sites36 within 5 km (or
nearest one beyond 5 km),
each divided by distance in
kilometers
Calculated from EPA CERCLIS
database, retrieved 11/12/2013
http://cumulis.epa.gov/supercpa
d/cursites/srchsites.cfm
2013
Water
Proximity to
major direct
water
dischargers
Count of NPDES major direct
water discharger facilities
within 5 km (or nearest one
beyond 5 km), each divided by
distance in kilometers
Calculated from EPA PCS/ICIS
database, retrieved 12/2013
http://www.epa.gov/enviro/fact
s/pcs-icis/search.html
2013
Note: EJSCREEN's EJ Indexes also include demographic information that is obtained from the U.S.
Census Bureau's American Community Survey (ACS). The 2015 version of EJSCREEN includes 2008-
2012 ACS 5-year summary file data, which is based on 2010 Census boundaries.
36 Count of NPL sites excludes deleted sites, sites in U.S. territories, and other sites that could not be included.
36 | P a g e

-------
Details on Environmental Indicators
NATA Air Toxics and NATA Diesel PM
Air toxics, often referred to as hazardous air pollutants (HAPs), are pollutants that are known or
suspected to cause cancer or other serious health effects, such as reproductive effects or birth defects,
or adverse environmental effects. EPA regulates 187 chemicals under its HAP program (U.S. EPA, 2009d).
Most air toxics originate from transportation and industry, including motor vehicles, industrial facilities
and power plants.
Indicator	EPA's National Air Toxics Assessment (NATA) provides the following indicators
that are used in EJSCREEN:
•	Estimated lifetime inhalation cancer risk from the analyzed carcinogens in
ambient outdoor air.
•	Hazard index for respiratory effects.
•	Hazard index for neurological effects.
•	Diesel particulate matter concentration.
Rationale for	A chemical's listing as a HAP is based on evidence of cancer or other adverse
Inclusion	health effects or environmental effects associated with exposure to the chemical,
as determined by EPA and the initial list in the Clean Air Act Amendments of
1990. EPA's Integrated Risk Information System (IRIS) program documents the
health risks associated with these chemicals and serves as a basis for the analysis
of health implications (U.S. EPA, 2012c).
Air toxics cancer risk and noncancer impacts have been included in other EPA EJ
screening tools.
HAPs are emitted from a wide variety of sources and disperse around the sources,
especially downwind. In some cases, these substances react with other
constituents in the atmosphere or break down to other chemicals, and most are
eventually removed through precipitation or other atmospheric processes. People
are exposed in their daily activities in and around their homes, at school or work,
and while moving about the area. They inhale the substances, exhale or excrete
some portion of them, and have the potential for incurring adverse effects from
the portion that stays in the body.
More Information More information is available at the air toxics website
(http://www.epa.gov/air/toxicair), the NATA website (www.epa.gov/nata), and
the IRIS website (www.epa.gov/iris).
Relevant Studies A comprehensive list of EJ studies using the NATA database can be found in
Chakraborty et al., (2011). Some examples of EJ studies of chemicals listed as
HAPs include Morello-Frosch & Jesdale (2006), and other studies reviewed by Liu
37 | P a g e

-------
Details on Environmental Indicators
Data Source
Data Version
(2001) and Brender et al., (2011). Diesel particulate matter has also been the
subject of EJ analysis (Rosenbaum, Hartley, & Holder, 2011).
EJSCREEN uses the most recent data from EPA's National-Scale Air Toxics
Assessment (NATA). NATA estimates cancer risk or noncancer implications of
many of the 187 air pollutants classified as HAPs, as well as diesel particulate
matter. NATA uses emissions estimates from the National Emissions Inventory
(NEI), which is updated every three years. The NEI includes all of the Toxics
Release Inventory (TRI) reporting facilities that release hazardous air pollutants,
along with many other sources of air pollutants, such as motor vehicles.
Note that the publicly-available NATA, PM2.5, and ozone estimates are at tract
resolution, and tract level is the resolution used for EJSCREEN, unlike with
proximity indicators, for example. Each block group was assigned the NATA or PM
or ozone score of the tract containing it. All indicators or statistics then were
calculated using block group data, whether or not those block group scores had
been assigned based on tracts.
The 2015 version of EJSCREEN uses 20xx NATA data, which is based on NEI
emissions estimates for 20xx (U.S. EPA, 2015b). This version of NATA estimated
ambient concentrations of 177 HAPs plus DPM, and then estimated health
implications for 139 of these HAPs (cancer risk for 80 and noncancer results for
100).
NATA update pending
Data from recent years may no longer be as representative of current conditions
as they were at the time the data was collected. The NATA-based indicators in
particular should be viewed with this in mind, because emissions of air toxics
generally have decreased in the intervening years. This version of EJSCREEN
incorporated the most recent data that were available at the time of indicator
development. Every attempt will be made to use the most recent appropriate
data available in future updates of EJSCREEN. There is always a delay between the
release of raw data and their eventual incorporation into any models, tools, or
maps. It is also useful to note that although the raw numbers for some indicators
do not represent current conditions, the percentiles are much more likely to be
reasonably representative of today's conditions in most locations. This is because
even if emissions have been significantly reduced overall, for example, the
differences between various locations are unlikely to have changed as
dramatically, especially when the reductions have come from national regulations
and other trends affecting entire industries or sectors in many locations. For this
reason, the percentiles may be more representative of current conditions than
the raw values of the indicators.
38 | P a g e

-------
Details on Environmental Indicators
Discussion	EPA's NATA website has extensive documentation of all of the data and methods
used in developing the NATA indicators, as well as discussions of uncertainty,
caveats, and limitations in the NATA estimates. That information is not repeated
here, but it is important that anyone using NATA data understand these issues, so
anyone using EJSCREEN should consult the NATA documentation
(www.epa.gov/nata).
Very briefly, the air pollutants in NATA include likely or known carcinogens such
as formaldehyde, benzene, polycyclic aromatic hydrocarbons and naphthalene, as
well as important sources of noncancer impact such as acrolein, DPM and
manganese compounds. The cancer risk in NATA is aggregated as a cumulative
risk for the combination of all analyzed HAPs, and this total is used in EJSCREEN.
NATA calculates a hazard quotient, which is the ratio of ambient air concentration
to a chemical's health-based RfC. No adverse health effects are expected from
exposure if the hazard quotient is less than one. A hazard index is the sum of
hazard quotients for chemicals that cause adverse effects through the same toxic
mechanism. NATA currently includes hazard indexes for respiratory effects and
neurological effects, both of which are included as environmental indicators in
EJSCREEN. Each represents the cumulative impacts of all the relevant air toxics.
The NATA website provides more detailed data than EJSCREEN - Tables and maps
on individual HAPs or specific types of sources (e.g., mobile sources only) can be
generated by GIS practitioners using data from the NATA website, for those
requiring more detail than is provided by the data in EJSCREEN.
The reports in EJSCREEN present the environmental indicators from NATA using
ranges of percentiles such as 90-95 or 95-100 rather than as the numbers 1-100
(for Regional and US percentiles). This is done in recognition of the uncertainties
inherent in comparing NATA estimates across States that may have different
approaches in emissions inventories.
39 | P a g e

-------
Details on Environmental Indicators
Particulate Matter (PM2.5)
PM2.5 is particulate matter that is 2.5 microns or less in diameter. Common sources of PM2.5 emissions
include power plants and industrial facilities. Secondary PM2.5 can form from gases, such as oxides of
nitrogen (NOx) or sulfur dioxide (S02), reacting in the atmosphere. EPA set the first PM2.5 National
Ambient Air Quality Standards (NAAQS) in 1997, and revised the standards in 2006 and 2012.
Indicator	Annual average PM2.5 concentration in micrograms per cubic meter
(Hg/m3).
Rationale for	EPA's work associated with the PM NAAQS has documented the health
Inclusion	effects associated with exposure to PM2.5, including elevated risk of
premature mortality from cardiovascular diseases or lung cancer, and
increased health problems such as asthma attacks (U.S. EPA, 2009b).
PM2.5 concentrations at different levels are found in all parts of the United
States, so residents are exposed via inhalation to varying degrees. A 2012
EPA report found that the majority of the U.S. population lived in areas in
nonattainment of one or more of the NAAQS in effect at that time—a total
population of 159 million people (based on 2010 Census data for those
locations) (U.S. EPA, 2012g).
Several studies relevant to EJ and PM2.5, including those discussing
susceptible subgroups, are referenced by Bell & Ebisu (2012).
PM2.5 has been included in other EPA EJ screening tools.
More Information More information is available at the PM2.5 website
(http://www.epa.gov/pm).
Relevant Studies Some examples of studies focused on disparities in exposure to PM2.5 have
been reviewed in Liu (2001), and more recent studies include Bell & Ebisu
(2012); Fann et al. (2011); Post, Belova, & Huang (2011); Miranda,
Edwards, Keating, & Paul (2011); Brochu et al. (2011); and Levy, Wilson, &
Zwack (2007). A very recent study found disparities in exposure to NOx, a
precursor to PM2.5- (Clark, Millet, & Marshall, 2014).
Data Source	EJSCREEN's PM2.5 data are estimated from a combination of monitoring
data and air quality modeling. Ambient PM2.5 concentration is estimated by
EPA's Office of Research and Development using a Bayesian space-time
downscaling fusion model approach. This approach is described in a series
40 | P a g e

-------
Details on Environmental Indicators
of three published journal articles (Berrocal, Gelfand, & Holland, 2010a,
2010b, 2011).37
PM2.5 and ozone estimates were not available for Alaska or Hawaii for use
in the 2015 version of EJSCREEN, due to a lack of CMAQ modeling. EPA
may be able to include estimates in a future version of EJSCREEN.
Data Version	The 2015 version of EJSCREEN uses PM2.5 data that are based on 2011
monitoring and modeling estimates (U.S. EPA, 2015a).
Data from several years ago may no longer be as representative of current
conditions as they were at the time the data was collected. The PM2.5 and
ozone indicators in particular should be viewed with this in mind, because
emissions related to PM2.5 and ozone generally have decreased in the
intervening years. This version of EJSCREEN incorporated the most recent
data that were available at the time of indicator development. Every
attempt will be made to use the most recent appropriate data available in
future updates of EJSCREEN. There is always a delay between the release
of raw data and their eventual incorporation into any models, tools, or
maps. It is also useful to note that although the raw numbers for some
indicators do not represent current conditions, the percentiles are much
more likely to be reasonably representative of today's conditions in most
locations. This is because even if emissions have been significantly reduced
overall, for example, the differences between various locations are unlikely
to have changed as dramatically, especially when the reductions have
come from national regulations and other trends affecting entire industries
or sectors in many locations. For this reason, the percentiles may be more
representative of current conditions than the raw values of the indicators.
Finally, some supplementary maps and local information can complement
the EJSCREEN indicators to provide more recent information. In particular,
EJSCREEN also provides updated maps of PM2.5 and ozone nonattainment
areas (areas not meeting national ambient air quality standards).
Resolution	High-resolution estimates of PM2.5 are very difficult to develop for the
entire United States. Block groups vary widely in geographic area—some
are larger than 100 square kilometers, but a substantial fraction are
smaller than 1 sq. km in area. This makes it challenging to develop relevant
spatial data.
37 Detailed documentation and GIS metadata describing a tract-level application of the same approach are
provided here: http://www.epa.gov/esd/land-sci/lcb/lcb faqsd.html and http://www.epa.gov/esd/land-
sci/lcb/pdf/DSMetadataAir 0612.pdf.
41 | P a g e

-------
Details on Environmental Indicators
Some small areas have used the high-resolution AERMOD model to
estimate PM2.5 levels for EJ analysis (Maroko, 2012), but such modeling is
not feasible for the entire United States at this time.
In the past (prior to approximately 2007), CMAQ used a grid size of 36x36
km in the Western United States and 12x12 km in the Eastern United
States, or 36 km in general, as in a recent EJ analysis of the heavy-duty
diesel emissions rule finalized in 2001 (Post et al., 2011). After
approximately 2007, CMAQ modeling has divided the nation into a grid of
cells that are each roughly 12 km by 12 km, and has estimated the PM2.5
concentration in each cell.
In a 2010 study, satellite data have been used to estimate PM2.5 levels with
a spatial resolution of roughly 10 km by 10 km, showing reasonably good
agreement with monitoring data (van Donkelaar et al., 2010).
Land use regression (LUR) has also been proposed, and can provide better
resolution. Nationwide LUR-based estimates have been developed for NOx
but not PM2.5.
The downscaler method was selected for EJSCREEN partly because it is
particularly useful for this application, in that it estimates concentration at
a specified point, rather than for the average of a large grid cell. The
downscaler algorithms combine information from nearby monitors and
CMAQ grid cell estimates. This provides an estimate based on more
information than models alone or monitors alone could provide. It is
important to note that the downscaler and indicators here are not
attempting to describe all of the local variations in ambient air
concentrations. They are merely capturing some additional variation that is
not seen when relying on models or monitors alone.
Discussion	The downscaling fusion model uses both air quality monitoring data from
NAMS/SLAMS (data collected by EPA, state, local and tribal air pollution
control agencies at more than 600 hundred monitors nationwide) and
numerical output from the Models-3/CMAQ model. The CMAQ model is
used extensively by EPA and has been described in detail elsewhere (Byun
& Schere, 2006).
This downscaling approach is designed to provide daily, predictive PM2.5
(daily average) and 03 (daily 8-hour maximum) surfaces for a given year,
such as 2011, at specified points.
For EJSCREEN, the downscaling method was applied to a point within each
Census tract. EPA's Office of Air and Radiation generated an estimate for
42 | P a g e

-------
Details on Environmental Indicators
each tract, and then assigned the same tract value to every block group
within the given tract. Daily estimates from the downscaling method were
averaged for the whole year in the case of PM2.5 and for the ozone season
(May-September) in the case of ozone.
Again, it is important to note that the downscaler and indicators here are
not attempting to describe all of the local variations in ambient air
concentrations. They are merely capturing some additional variation that is
not seen when relying on models or monitors alone.
Several data sources have been used elsewhere and were considered for
inclusion in EJSCREEN. For instance, EPA's regulatory impact analyses
(RIAs) for recent rules and the PM2.5 NAAQS have used estimates of PM2.5
that combine modeling and monitoring in a different way (U.S. EPA,
2009b). These estimates also start with CMAQ air quality modeling results,
but then locally adjust those estimates up or down based on local
monitoring data using MATS (monitor attainment test software), which
provides an enhanced Voronoi neighbor averaging interpolation technique.
Published analyses of PM2.5 health impacts have used similarly fused
estimates (Fann et al., 2012). Before the downscaler method was
developed, a different Bayesian modeling approach was also used, as
described in McMillan, Holland, Morara, & Fang (2010). Other efforts have
used interpolation between monitors without air quality modeling, such as
through basic Voronoi neighbor averaging (Fann & Risley, 2011), or simply
the average of monitors within a county (Bravo, Fuentes, Zhang, Burr, &
Bell, 2012). Monitors provide reliable estimates where they are located,
but suitable PM2.5 data are available at fewer than 900 monitors in the
United States. While urban areas tend to have PM2.5 monitors, more than
two-thirds of U.S. counties lack any monitoring data, so modeling is an
important complement to monitoring. Methods based on CMAQ alone,
monitors alone, CMAQ-MATS and the downscaling approach all provide
somewhat different estimates.
Note that the EJSCREEN value does not directly indicate nonattainment of
the NAAQS standard because the indicator in EJSCREEN is based on
estimates from a combination of modeling and monitoring for a single
year, while nonattainment is determined for a large area (often a county)
based on three years of monitoring data.
43 | P a g e

-------
Details on Environmental Indicators
Ozone
Ozone (03) is not usually emitted directly into the air, but is created at ground level by a chemical
reaction between oxides of nitrogen (NOx) and volatile organic compounds (VOCs) in the presence of
sunlight. These ozone precursors are emitted by motor vehicles, industrial facilities and power plants as
well as natural sources. Ground-level ozone is the primary constituent of smog.
Indicator	The May-September (summer/ ozone season) average of daily-maximum
8-hour-average ozone concentrations, in parts per billion (ppb).
Rationale for	Toxicological and epidemiological studies have established an association
Inclusion	between exposure to ambient ozone and a variety of health outcomes,
including reduction in lung function, increased inflammation and increased
hospital admissions and mortality (U.S. EPA, 2006b). In the 2006 Air Quality
Criteria Document for Ozone, a comprehensive review of the clinical and
epidemiological evidence was inconclusive about a possible threshold for
ozone-induced health effects. EPA concluded that if a population threshold
level exists, it is near the lower limit of ambient ozone concentrations in
the United States (U.S. EPA, 2006b). Several subpopulations may
experience susceptibility to ozone-induced health effects. These
subpopulations include older adults, children, individuals with preexisting
pulmonary disease and those with higher exposure levels such as outdoor
workers (U.S. EPA, 2006b). A recent review of studies identifying
subgroups susceptible to ozone found the strongest evidence for greater
sensitivity among the elderly and also the unemployed (Bell, Zanobetti, &
Dominici, 2014).
A 2012 EPA report found that the majority of the U.S. population lived in
areas in nonattainment of one or more of the NAAQS—a total population
of 159 million people (based on 2010 Census data for those locations) (U.S.
EPA, 2012g). As standards are updated, nonattainment areas are redefined
along with the number of people living in redefined nonattainment areas.
Ozone concentrations at different levels are found in all parts of the United
States, so residents are exposed via inhalation to varying degrees.
Ozone has been included in other EPA EJ screening tools.
More Information More information is available at the ground-level ozone website
(http://www.epa.gov/air/ozonepollution/).
Relevant Studies Some examples of studies that have focused on disparities in exposure to
ozone include Fann et al. (2011), Grineski (2007), Grineski et al. (2007), and
those reviewed in Liu (2001).
44 | P a g e

-------
Details on Environmental Indicators
Data Source
Data Version
Resolution
Discussion
45 | P a g e
EJSCREEN's ozone data are estimated by EPA from a combination of
monitoring data and CMAQ air quality modeling. Ozone was estimated
with the same approach as PM2.5, and the methodology is described above.
Ozone faces similar limitations, in that a limited number of U.S. monitors
have suitable data, so modeling is an important complement to monitoring
data.
PM2.5 and ozone estimates were not available for Alaska or Hawaii for use
in the 2015 version of EJSCREEN, due to a lack of CMAQ modeling. EPA
may be able to include estimates in a future version of EJSCREEN.
The 2015 version of EJSCREEN uses ozone data that are based on 2011
monitoring and modeling estimates (U.S. EPA, 2015a). Tract estimates
were assigned to block groups as described for the PM2.5 indicator.
See "Resolution" section under the previous environmental indicator,
PM2.5.
See "Discussion" section under the previous environmental indicator,
PM2.5.

-------
Details on Environmental Indicators
Lead Paint Indicator
A key source of exposure to lead for many Americans is through lead paint and lead-containing dust that
accumulates indoors, in homes or in other buildings where lead paint was used. Exterior structures
painted with lead-based paint are also a source of ambient lead through chipping exterior paint.
Elevated short-term lead dust loadings have also been observed following demolition of old buildings
(U.S. EPA, 2011c). Lead-based paint was banned in the United States by the Consumer Product Safety
Commission in 1978, but lead-based paint used in housing before the ban remains a significant source of
exposure to lead for children and adults. Lead paint and contaminated dust and soil are considered the
leading cause of high lead levels in U.S. children (Levin et al., 2008).
Indicator	The percentage of occupied housing units built before 1960 was selected
as an indicator of the likelihood of having significant lead-based paint
hazards in the home.
Rationale for	Elevated blood lead levels are a well-documented public health concern
Inclusion	of particular interest to EJ stakeholders, and represent an important
environmental health issue (U.S. EPA, 2006a, 2011c).
Certain demographic groups may be more susceptible to lead exposure.
For example, blood lead's association with cardiovascular outcomes
appears to be stronger among Mexican Americans and non-Hispanic
blacks than non-Hispanic whites (U.S. EPA 2011c). Also, some but not all
studies suggest lead has a greater impact on IQ among those of low
socioeconomic status (U.S. EPA 2011c).
Despite significant reductions in ambient levels of lead from the phase-
out of leaded gasoline and the 1978 ban on lead-based paint, lead
remains a significant hazard for children. Recent research has
demonstrated that children can experience neurological damage even at
low levels of exposure to lead. In May 2012, the Centers for Disease
Control and Prevention (CDC) agreed to adopt the recommendations of
the CDC Advisory Committee for Childhood Lead Poisoning Prevention
(ACCLPP) for defining elevated blood-lead levels (BLLs) among children.
The ACCLPP recommended that CDC use a childhood BLL reference value
based on the 97.5th percentile of the population BLL in children under age
6 to identify children and environments associated with lead-exposure
hazards (Centers for Disease Control and Prevention, 2012). The 97.5th
percentile value is currently 5 ng/dL.
Surfaces originally covered with lead-based paint may chip, flake or
develop a chalky surface. The lead in these pieces or particles may be
moved about the interior or exterior of the painted structures, and be
46 | P a g e

-------
Details on Environmental Indicators
moved from inside to outside and vice versa. Through direct contact with
the painted surfaces or through contact with the released particles, lead
may adhere to hands and other parts of the residents' bodies, and people
may ingest some portion of the lead. If the painted surfaces are disturbed
through renovation or other actions, some lead-based paint particles may
be temporarily suspended in the air, and particles on surfaces within the
structures may be re-suspended during the residents' activities. The
suspended particles may be inhaled or may fall on food and be ingested.
Children playing inside or outside and exposed to particles of lead-based
paint may ingest some of the lead through hand-mouth actions.
An analysis of data collected during the 1999-2004 National Health and
Nutrition Examination Survey (NHANES) showed that children living in
older housing stock (built before 1950) were significantly more likely to
have blood lead levels greater than 5 ng/dL than children living in housing
built after 1978 (Jones et al., 2009). Jones et al. estimated that 7.4% of
children under age 6 had blood lead levels greater than 5 ng/dL during
NHANES 1999-2004. For children under age 6 living in the highest risk
housing (built before 1950), Jones et al. observed that 15.1% had blood
lead levels above 5 ng/dL. For children under age 6 living in the lowest risk
housing (built in 1978 or later), 2.1% had BLLs above 5 ng/dL.
EPA EJ screening tools in the past generally have not included proxies for
lead exposure.
More Information More information is available at EPA's lead website
(http://www2.epa.gov/lead).
Relevant Studies Several examples of EJ studies of exposure from lead paint exist, including
Gaitens et al., 2009 and others.
Data Source	The data were collected at the block group level from the ACS estimates
from the Census Bureau. The indicator was calculated by dividing the
count of occupied housing units built prior to 1960 by the total number of
built housing units in the block group (ACS table B25034, see Appendix B
for details).
Data Version	The block-group level data for the 2015 version of EJSCREEN were
collected from the 2008-2012 ACS (U.S. Census Bureau, 2011).
Approximately 40% of occupied, non-institutional housing units in the
United States were built prior to 1960, as of 1999. The ACS 2008-2012
47 | P a g e

-------
Details on Environmental Indicators
data in EJSCREEN indicate the average person in the US lives in a block
group where about 30% of the housing was built before 1960.
Discussion	Lead paint was used extensively in the United States prior to the 1978 ban
on lead in new residential paint, and a home built prior to 1960 is far
more likely to have lead hazards than one built more recently (Gaitens et
al., 2009; Jacobs et al., 2002). In 2002, Jacobs et al. reported that
approximately 40 million homes in the United States still had lead-based
paint hazards, based on a nationally representative survey conducted in
1998-2000. The likelihood of such hazards was found to have changed
dramatically for housing built in 1960-1977 compared to pre-1960
housing (Table 3).
Based on Jacobs et al. (2002), EPA calculated the following likelihoods of
significant lead-based paint hazards:
•	Pre-1960 vs. all others: 54% vs. 6% (9 times as likely).
•	Pre-1960 vs. all others, among those with children under age 6:
68% vs. 4% (16 times as likely).
Data and analysis published in 2009 confirmed prior conclusions that
potential exposure to lead is associated with housing age, providing more
information on lead concentrations in household dust as a function of
housing age (Gaitens et al., 2009). Some of the models presented by
Gaitens et al. (2009) suggest that the largest decreases in lead dust levels
are seen between housing built prior to 1940 and after 1940, with more
modest contrasts seen for housing built after 1960 and after 1977. A
cutoff of 1960, however, is consistent with the data from Jacobs et al.
(2002), and the window sill lead dust models from Gaitens et al. (2009).
It is important to note that older housing alone may not represent any
actual risk or even exposure.
48 | P a g e

-------
Details on Environmental Indicators
Table 3. Likelihood of Lead-Based Paint Hazards by Housing Construction Date
Share of Housing with Significant Lead-Based
Year Built	Paint Hazards (and 95% Confidence Interval)
Post-1960:

1978-1998
3% (1-6%)
1960-1977
8% (6-12%)
Pre-1960:

1940-1959
43% (32-51%)
Before 1940
68% (56-75%)
Source: Jacobs et al. (2002). A "significant lead-based paint hazard" is defined as "a lead-based paint hazard above de
minimis levels as defined in U.S. EPA and U.S. Department of Housing and Urban Development (HUD) regulations."38
38 The de minimis levels for paint deterioration are < 20 ft2 (exterior) or < 2 ft2 (interior) of lead-based paint on
large surface area components (walls, doors), or damage to < 10% of the total surface area of interior small surface
area component types (windowsills, baseboards, trim) (40 C.F.R. § 745.65).
49 | P a g e

-------
Details on Environmental Indicators
Traffic Proximity
A substantial fraction of the U.S. population lives in close proximity to traffic, and the number of vehicle-
miles traveled has grown 40% from 1990 to 2010 (U.S. EPA, 2012d). Proximity to motor vehicle traffic is
associated with increased exposures to ambient noise, toxic gases and particulate matter including
diesel particulates. Technical details about the methodology used to determine traffic proximity are
provided in Appendix C.
Indicator	The count of vehicles per day within 500 meters of a block centroid, divided by
distance in meters, presented as the population-weighted average of blocks in each
block group. Adjustments are made so that the minimum distance used is
reasonable when very small.
Rationale for A 2011 literature review identified several studies that "found that living near
Inclusion	hazardous waste sites, industrial sites, cropland with pesticide applications, highly
trafficked roads, nuclear plants, and gas stations or repair shops is related to an
increased risk of adverse health outcomes" (Brender et al., 2011, p. S37). This
indicator is the first of five that relate to this category of concern.
Is should first be noted that there are both positive and negative aspects to living
near major roads. Proximity to roads can provide access to jobs, health care, food,
recreational opportunities, and other benefits. The indicator of traffic proximity and
volume is designed to screen for the negative aspects, so it uses distance weighting
and volume weighting to focus on the extremes of very close proximity to very high
volumes of traffic, such as living closer than 50-100 meters from a multi-lane
highway, as explained below.
Residential proximity to traffic has been associated with various health impacts,
particularly asthma exacerbation and possibly onset of asthma, as well as mortality
rates (Baumann et al., 2011; Health Effects Institute, 2010). Proximity to traffic has
also been associated with subclinical atherosclerosis (a key pathology underlying
cardiovascular disease (CVD)), prevalence of CVD and coronary heart disease (CHD),
incidence of myocardial infarction, and CVD mortality (Hoffman et al., 2009).
Vehicle-related emissions of various pollutants—ultrafine and other components of
PM2.5, lead and other metals, and mobile source air toxics such as benzene, nitrogen
oxides (NOx), hydrocarbons and carbon monoxide (CO)—are believed to contribute
to these health effects. Vehicles also emit precursors that add to ambient ozone
and PM2.5. Additionally, EPA's 2005 NATA estimated that mobile emissions
accounted for about 30% of average cancer risk from the pollutants in NATA, mainly
from benzene (U.S. EPA, 2009c). However, the spatial accuracy of NATA's mobile
source impacts is limited, because local estimates are based on countywide total
mobile source emissions roughly allocated to each part of the county based on
50 | P a g e

-------
Details on Environmental Indicators
presence of major roads. The traffic indicator in EJSCREEN provides a more detailed
analysis of the volume and location of traffic than was used in NATA. Also, NATA
captures only some of the impacts associated with traffic, so the traffic indicator is a
useful complement.
Traffic proximity is also associated with noise, which is a risk factor for various
health problems. Workplace and transportation-related noise have been associated
with release of stress hormones; sleep disturbance; hypertension; altered heart
rate; ischemic heart disease; myocardial infarction; and, among the elderly, risk of
stroke (S0rensen et al., 2011). In one study, for example, among those older than
64.5 years of age, the stroke incidence rate ratio was 1.27 per 10 dB more road
traffic noise (S0rensen et al., 2011).
Whether noise or other factors account for it, local traffic volume is a predictor of
stress (which itself is associated with significant health risks). In 2010, Yang &
Matthews concluded that, "[a]t the neighborhood level, the presence of hazardous
waste sites and traffic volume were determinants of self-rated stress even after
controlling for other individual characteristics" (2010, p. 803).
Any indicator of residential proximity addresses exposures relevant to the
residences within a block group, and would not capture most exposures that occur
away from the home, such as at work, at school or during a commute.
In the past, EJ screening tools at EPA have not included traffic proximity.
More	More information is available at the near-roadway website
Information	(http://www.epa.gov/otaq/nearroadway.htm).
Relevant	Some examples of studies of disparities in proximity to traffic include Tian et al.,
Studies	2013; Rowangould, 2013; Brender et al., 2011; Chakraborty (2006); and Liu, 2001.
Data Source Measures of traffic proximity in EJSCREEN are based on average annual daily traffic
(AADT) estimates in the Highway Performance Monitoring System (HPMS) dataset
in the Department of Transportation (DOT) National Transportation Atlas Database
(NTAD). The HPMS highway data is maintained by states and compiled by DOT.
The HPMS data are collected at the state level, and the traffic counting program is
designed to cover all interstate, principal arterial, other National Highway System
and HPMS sample sections on a 3-year maximum cycle where at least one-third of
roads are counted each year. More details on the HPMS are available at
http://www.fhwa.dot.gov/policvinformation/hpms.cfm.
51 | P a g e

-------
Details on Environmental Indicators
Data Version The 2015 version of EJSCREEN uses 2011 HPMS data (U.S. Department of
Transportation, 2011)39. This dataset contains 490,781 road segments, of which
86.7% contain AADT information, and those have a length of approximately
548,000 km (340,511 miles). While this is only 8% of total road miles of public road
in the US, the roads not included (e.g., local streets) tend to have much lower levels
of traffic, so the roads included appear to account for almost two thirds of all US
traffic (vehicle-miles travelled).40 For the 2015 version of EJSCREEN, a total of
11,078,297 Census 2010 blocks were analyzed to find all road segments within 500
meters of each block's internal point, or the nearest single segment if none were
found within 500 meters.
Discussion	The traffic proximity indicator is based on average annual daily traffic (AADT)
divided by distance in meters. For example, a single highway with 16,000 AADT at
400 meters distance would result in a score of 16,000/400=40, which is close to the
median person's block group traffic proximity indicator value in the US. About 5% of
the population has traffic proximity indicator values more than ten times as high as
the median, because traffic volumes vary widely across roads and communities. The
most traveled highway section in the United States, the 1-405 in the Los Angeles-
Long Beach-Santa Ana area, had 374,000 vehicles of AADT in 2008.41 About forty
other highway sections in the US exceeded 250,000 AADT, but about half were in
just one state - California - and the were rest spread over just a dozen states.
The proximity score is based on the traffic within a search radius of 500 meters (or
further if none is found in that radius). It is important to understand that this
distance was selected to be large enough to capture the great majority of road
segments (with traffic data) that could have a significant impact on the local
residents, balanced against the need to limit the scope due to computational
constraints. The closest traffic is given more weight, and the distant traffic given
less weight, through inverse distance weighting. For example, traffic 500 meter
away is given only one tenth as much weight as traffic 50 meters away.
Based on the available evidence, a distance of roughly 100-300 meters, or perhaps
up to 500 meters, from traffic should be considered as most important. This
distance focuses on the types of exposures typically studied and shown to be
39
http://www.rita.dot.gov/bts/sites/rita.dot.gov.bts/files/publications/national transportation atlas database/201
1/index.html and also see http://www.fhwa.dot.gov/policvinformation/hpms.cfm
40	The included 340,000 miles is comparable to the road miles covered by all interstate, freeway/expressway, and
principal arterials, plus 50% of the minor arterial miles in the US, which together carry 64% of VMT. Collector and
local roads are the balance of public roads. See
http://www.fhwa.dot.gov/policvinformation/statistics/2011/hm220.cfm and
http://www.fhwa.dot.gov/policvinformation/statistics/2011/vm202.cfm
41	Office of Highway Policy Information, US DOT, 2008 Highway performance monitoring system (HPMS) July 27,
2010.
52 | P a g e

-------
Details on Environmental Indicators
associated with health impacts in near-roadway epidemiology. Epidemiologic
studies of the impacts of proximity to traffic often utilize distances of 50-1,500
meters to define a cutoff between less and more exposed locations (Health Effects
Institute, 2010). For example, a major study of coronary heart disease prevalence
used distances greater than 200 meters as the reference group and found adjusted
odds ratios of 1.08 for residences within 100-200 meters, 1.71 for 50-100 meters
and 1.55 within 50 meters of a major road. Only 15% of participants lived within
200 meters of a major road, and only 3% within 50 meters in this study of heart
disease (Hoffman et al., 2009).
Additionally, a distance cutoff of 500 meters captures exposures of concern for
most definitions of mobile source impact. In a review of numerous prior studies of
proximity to roads, in combination with a modeling case study, Zhou & Levy (2007)
suggested that a distance of 500 meters should capture exposures of concern,
although impacts may be largely limited to just 100 meters from roads for ultrafine
particles and PM2.5 mass from mobile sources alone.
A critical review of literature on traffic-related air pollution in 2010 "identified an
exposure zone within a range of up to 300 to 500 meters from a highway or a major
road as the area most highly affected by traffic emissions... and estimated that 30%
to 45% of people living in large North American cities live within such zones"
(Health Effects Institute, 2010, p. 7-5). A 2009 analysis of PM2.5 levels in Southern
California found that traffic within 300 meters of a monitor was the most
informative predictor of monitored PM2.5 levels, out of a wide range of factors
considered such as various distances from roads, population density and the
presence of industry (Krewski et al., 2009).
On the other hand, some studies have shown a dramatic drop in at least ultrafine
levels within the first 100 meters downwind from a freeway, and an even sharper
(essentially immediate) drop in the upwind direction (Zhu, Hinds, Kim, & Sioutas,
2002). This pattern has been seen in more recent measurements—levels on
California highways (measured using monitors on vehicles) were compared to levels
near those roads (roughly 50-300 meters away), and black carbon levels in
particular were as much as 10 times higher on the road than near the road, for 1-
hour averages (Fujita, Campbell, Zielinska, Arnott, & Chow, 2011). The same study
found much higher levels (generally 2-5 times higher) on the road than near the
road, for PM2.5 mass, CO, NO, NOx, VOCs, benzene, toluene, ethylene, xylene,
formaldehyde and acetaldehyde. This reinforces the idea that exposures very close
to a busy highway are most important, and that levels drop rapidly within tens of
meters, falling to much lower levels within the first 50-300 meters (Spengler et al.,
2011).
53 | P a g e

-------
Details on Environmental Indicators
Many studies have analyzed roadway proximity categorically, including only major
roads, but roads vary in the amount of traffic they carry, so AADT provides a better
starting point for considering impacts than simply whether a road is a major road.
Several land use regression studies and other research (Hoek et al., 2011; Hystad et
al., 2011) have suggested that inverse distance-weighted traffic volume is a
reasonably good proxy for ambient air concentrations of NOx, PM2.5 mass (ng/m3),
black carbon or ultrafine PM (as particle number concentration) nearby (50-500
meters). Levels are clearly higher downwind of the road, and higher where wind
speeds are lower, but in the absence of detailed location-specific data, traffic
volume and distance are useful indicators (Hoek et al., 2011).
54 | P a g e

-------
Details on Environmental Indicators
Proximity to Major Direct Water Dischargers
The Clean Water Act regulates facilities that discharge pollutants from point sources to waters of the
United States. Discharging facilities are generally prohibited unless authorized by a specific provision of
the act. Direct discharges may be authorized by National Pollutant Discharge Elimination System
(NPDES) permits issued by EPA or states authorized to administer the NPDES program. There are tens of
thousands of dischargers with such permits, many releasing only limited quantities of pollutants. Several
thousand facilities, however, are "major direct dischargers" as defined by law, including industrial direct
dischargers (facilities that discharge pollutants directly into water bodies) and Publicly Owned
Treatment Works (POTWs) (which receive and treat domestic and municipal waste and industrial
wastewater and discharge treated water into water bodies). Major direct dischargers are found in a
wide variety of industry sectors ranging from cement manufacturing to metal products and machinery
to petroleum refining. Major direct dischargers may be subject to industry-specific Effluent Limitation
Guidelines (ELGs),42 which are national technology-based standards for wastewater discharges to
surface waters (U.S. EPA, 2012b).
Technical details about the methodology used to determine proximity to major direct dischargers to
water are provided in Appendix C.
Indicator	The count of major direct discharger facilities within 5 km, divided by
distance, presented as population-weighted averages of blocks in each
block group. Adjustments are made if there are none within 5 km, and so
that the minimum distance used is reasonable when very small.
Water pollutants can have human health or adverse ecological effects,
depending on concentration in the water, exposure to the water, toxicity
of the particular chemical and other factors. There are approximately
6,700 major direct dischargers in the United States. These facilities
discharge around 50 billion pounds of pollutants each year directly into the
nation's streams and rivers (including conventional pollutants such as
nitrogen and phosphorus) (U.S. EPA, 2012a).
People may be exposed to the discharged pollutants either directly or
through indirect pathways. People swimming in the downstream waters or
engaging in water-based recreation may be directly exposed dermally,
orally or through inhalation of volatized substances. If the released
substances reach a downstream drinking water intake, consumers of the
finished waters may consume whatever portion of the substances is not
removed by the drinking water utility. Some portion of the discharged
materials may enter the groundwater of neighboring areas and reach
42 A list of Effluent Guidelines by industry category can be found at
http://water.epa.gov/scitech/wastetech/guide/industrv.cfm.
Rationale for
Inclusion
55 | P a g e

-------
Details on Environmental Indicators
people through drinking water derived from wells that draw upon that
aquifer.
Some EPA EJ screening tools have included measures of proximity to all
facilities regulated by EPA, which includes major direct water dischargers.
More Information More information is at the water website (http://www.epa.gov/water) for
drinking water, rivers, and other categories (http://water.epa.gov/type/). a
local water quality webpage (http://water.epa.gov/drink/local/index.cfm).
and the NPDES webpage (http://water.epa.gov/polwaste/npdes/).
Relevant Studies Some examples of studies of disparities in proximity to water dischargers
include Fitos and Chakraborty (2010); Brender et al., 2011; VanDerslice,
2011; and a recent study (Deganian & Thompson, 2012) that tallied the
number of facilities of different types within 10 km2 squares and compared
these counts to the demographics of the squares.
Data Source	Latitudes/Longitudes for major direct dischargers were taken from EPA's
PCS/ICIS database.
Data Version	The 2015 version of EJSCREEN uses locational information retrieved from
the PCS/ICS database in May 2012 (U.S. EPA, 2012a). A total of 6,981 of the
major direct dischargers had suitable location information and were
included. Note that the point data used to show facility locations in the
"Map Supplementary Layers" menu is updated more often than the
database with calculated EJSCREEN indicators and indexes, so in some
small number of cases a facility may be in one data source but not the
other.
According to these data, almost 50% of the U.S. population lives in block
groups where at least one block's centroid is within 5 km of the nearest
NPDES facility. The population's mean score was 0.25, which could indicate
one facility at a distance of 4 km.
Discussion	Monitored or modeled data on drinking or surface water quality could not
be identified with adequate national coverage and resolution. As more
data become available in the future, such data may be considered for
inclusion in EJSCREEN. As with all proximity-based indicators, proximity
alone may not represent any actual risk or even exposure.
Each block group in the United States was assigned a proximity score that
was the population-weighted sum of block-level proximity scores.
Appendix C provides more details on calculation of proximity scores. First,
each block was given a proximity score that was the sum of inverse
56 | P a g e

-------
Details on Environmental Indicators
distance-weighted count of facilities anywhere within 5 km of the block's
centroid. This score can be thought of as the number of facilities per
kilometer of distance from the average person. It is also equal to the
number of facilities divided by the harmonic mean of their distances. This
means one facility exactly 2 km away gives a score of 1/2, while three
facilities exactly 4 km away give a score of 3/4, and five facilities all at 1 km
away give a score of 5.43 If there is no facility within 5 km of a block
centroid, 1/d is used, where d is the distance to the single nearest at any
distance.
Proximity to major NPDES dischargers is only an indirect indicator of
potential exposure to water pollutants, because residents may not spend
time in or near the affected water, and the discharges may not impact local
drinking water supplies, or result in exposure from swimming or fishing.
43 An adjustment was made so that any distance smaller than 90% of the block's "radius" was set equal to 90% of
that radius, with radius defined as the square root of (area/pi). This adjustment accounted for the fact that the
average location (residence) within a circle of radius R is 0.9R away from the average point (facility) that is within
the circle. For more detail, see Appendix C.
57 | P a g e

-------
Details on Environmental Indicators
Proximity to NPL Sites
Congress enacted the Comprehensive Environmental Response, Compensation and Liability Act
(CERCLA), commonly known as Superfund, in 1980. This law was established to provide broad federal
authority to respond to uncontrolled abandoned hazardous waste sites. Under CERCLA, EPA's response
can involve remedial (long-term) cleanup actions or short-term removal actions.
EPA places sites on the National Priorities List (NPL) (a key subset of all "Superfund" sites) based on a
defined set of criteria and a public comment process. Inclusion of a site on the NPL does not impose a
financial obligation on EPA, nor does it assign liability to any party. The NPL serves primarily
informational purposes, identifying sites that appear to warrant remedial actions, thereby conveying to
policymakers and the public the size and nature of the nation's cleanup challenges.
Sites can be placed on the NPL in one of three ways44:
1.	The site receives a score of 28.5 or higher in EPA's Hazard Ranking System (HRS);
2.	States or territories designate a top-priority site; or
3.	A site meets these requirements:
a.	The Agency for Toxic Substances and Disease Registry (ATSDR) of the U.S. Public Health
Service has issued a health advisory that recommends removing people from the site;
b.	EPA determines the site poses a significant threat to public health; and
c.	EPA anticipates it will be more cost-effective to use its remedial authority (available only
at NPL sites) than to use its emergency removal authority to respond to the site.
Technical details about the methodology used to determine proximity to NPL sites are provided in
Appendix C.
Indicator	The count of sites proposed and listed on the National Priorities List (NPL), each
represented by a point on the map (latitude/ longitude coordinate), within 5 km of
the average resident in a block group, divided by distance, calculated as the
population-weighted average of blocks in each block group. Adjustments are made
if there are no NPL sites within 5 km, and so that the minimum distance used is
reasonable when very small.
Soon after the passage of CERCLA and the Superfund Amendments and
Reauthorization Act, questions started to be raised about the locations, listing
decisions and pace of cleanup at NPL sites in low-income and minority communities
(Hird, 1993; Probst, 1990; United Church of Christ, 1987), and such concerns have
continued to this day (Anderton, Oakes, & Egan, 1997; Baden, Noonan, & Turaga,
2007; O'Neil, 2007). The study by Deganian & Thompson (2012) included NPL sites
in the tally of pollution points in each of the 10 km2 squares in the study area, for
comparison with demographic variables. Earlier studies related the presence of NPL
sites to population characteristics for different definitions of the host areas-
Rationale for
Inclusion
44 http://www.epa.gov/superfund/programs/npl hrs/nplon.htm
58 | P a g e

-------
Details on Environmental Indicators
counties (Hird, 1993), Census places or minor civil divisions where places are not
defined (Zimmerman, 1993), and Census tracts (Anderton et al., 1997).
The contaminants in NPL sites may reach humans in a number of ways. Volatile
contaminants may enter the atmosphere and reach individuals via the inhalation
route. Particularly in dry climates or seasons, contaminants on the surface of some
sites can become airborne and reach people directly through inhalation or
indirectly after being deposited on surfaces that people may contact. Contaminants
can also enter the food chain if the wind disperses them onto land used for
agriculture. Some contaminants may migrate into groundwater. People may be
exposed via drinking water derived from the aquifer, through vapor intrusion into
their residences or through other routes.
Some EPA EJ screening tools have included measures of proximity to all facilities or
other sites regulated by EPA, which include NPL sites.
More	More information is available at the Superfund website
Information (http://www.epa.gov/superfund).
Relevant
Studies
Some examples of studies focused on disparities in proximity to NPL sites include
Brender et al. (2011) and those reviewed in Liu (2001).
59 | P a g e

-------
Details on Environmental Indicators
Data Source A single point location (latitude/ longitude coordinates) for each proposed and
listed NPL site was obtained from EPA's CERCLIS database. The database does not
provide details on the boundaries of each site, so this point data had to serve as a
way to represent site locations. For residents close to very large sites, the available
data may not provide an accurate representation of proximity to relevant portions
of the site. These points are approximations of the locations of sites, and are not
necessarily at the "center" of a given site. In a few cases a site's coordinates were
located in a major body of water according to the database, so EPA manually
specified new, plausible, nearby coordinates for use in EJSCREEN.
The count excludes deleted sites and sites in U.S. territories. Sites located in Guam
and Puerto Rico are not included in the 2015 version of EJSCREEN.
Data Version The 2015 version of EJSCREEN uses locational information retrieved from the
CERCLIS database in November 2013. A total of 1,350 proposed and listed NPL sites
were included in the EJSCREEN indicator. Note that the point data used to show site
locations in the "Map Supplementary Layers" menu is a different database than the
database used to calculate the EJSCREEN NPL proximity indicators and indexes. The
Superfund "Map Supplementary Layer" database includes deleted NPL sites, and
NPL sites in U.S. territories, excluded from the EJSCREEN NPL proximity indicator
database.
Discussion Each Census block group in the United States was assigned a proximity score that
was the population-weighted sum of block-level proximity scores. Appendix C
provides more details on how proximity scores were calculated. First, each Census
block was given a proximity score that was the sum of inverse distance-weighted
count of sites anywhere within 5 km of the block's internal point. This score can be
thought of as the number of NPL sites per kilometer of distance from the average
person. It is also equal to the number of sites divided by the harmonic mean of
their distances. This means one site 2 km away gives a score of 1/2, while three
sites each 4 km away give a score of 3/4, and five sites all at 1 km away give a score
of 5.45 If there is no site within 5 km of a block centroid, 1/d is used, where d is the
distance to the single nearest at any distance.
As with all proximity-based indicators, proximity alone may not represent any
actual risk or even exposure.
45 An adjustment was made so that any distance smaller than 90% of the block's "radius" was set equal to 90% of
that radius, with radius defined as the square root of (area/pi). This adjustment accounted for the fact that the
average location (residence) within a circle of radius R is 0.9R away from the average point (site) that is within the
circle. For more detail, see Appendix C.
60 | P a g e

-------
Details on Environmental Indicators
Proximity to TSDFs
The Resource Conservation and Recovery Act (RCRA), an amendment to the Solid Waste Disposal Act,
was enacted in 1976 to address the growing volumes of municipal and industrial solid waste generated
nationwide. RCRA was further amended in 1984 with the addition of the Hazardous and Solid Waste
Amendments. RCRA Subtitle C establishes a federal program to manage hazardous wastes from "cradle
to grave/' or from generation to disposal, to ensure that hazardous waste is managed in a manner that
protects human health and the environment. EPA has developed Subtitle C regulations governing
hazardous waste generation, transportation, and the several hundred active treatment, storage or
disposal facilities (TSDFs).46
Technical details about the methodology used to determine proximity to TSDF facilities are provided in
Appendix C.
Indicator	The count of all commercial TSDF facilities within 5 km, divided by distance,
presented as population-weighted averages of blocks in each block group.
Adjustments are made if there are none within 5 km, and so that the
minimum distance used is reasonable when very small.
The substances at TSDF facilities may reach humans in a number of ways.
Volatile substances may enter the atmosphere and reach residents via the
inhalation route. Particularly in dry climates or seasons, substances on the
surface of some sites may be entrained in the atmosphere and reach people
directly through inhalation or indirectly after being deposited on surfaces
that people may contact or on arable land. Some substances may migrate
from the site into groundwater. People may be exposed via drinking water
derived from the aquifer, through vapor intrusion into their residences or
through other routes.
Some EPA EJ screening tools have included measures of proximity to all
facilities regulated by EPA, which includes TSDFs.
More Information More information is available at the hazardous waste webpage
(http://www.epa.gov/epawaste/hazard), the TSD webpage
(http://www.epa.gov/epawaste/hazard/tsd/). the waste information page
(http://www.epa.gov/epawaste/inforesources/) and a RCRAInfo page
(http://www.epa.gov/epawaste/inforesources/data).
46 Basic information and TSDF counts are available here:
http://www.epa.gov/compliance/data/results/performance/rcra/index.html
More information is available from EPA's RCRA Orientation Manual, available at
http://www.epa.gov/osw/inforesources/pubs/orientat/ (U.S. EPA, 2012e).
Rationale for
Inclusion
61 | P a g e

-------
Details on Environmental Indicators
Relevant Studies Some examples of studies or reviews that have focused on disparities in
proximity to TSDFs include Liu (2001) and Brender et al., (2011). Issues
around environmental justice and TSDFs influenced the early origins of EJ
work (General Accounting Office, 1983; United Church of Christ, 1987) and
have been the topic of ongoing research (Been & Gupta, 1997; Boer, Pastor
Jr., Sadd, & Synder, 1997; Mohai & Saha, 2007; Oakes, Anderton, &
Anderson, 1996; Pastor Jr., Sadd, & Hipp, 2001; Saha & Mohai, 2005; United
Church of Christ, 2007).
The study by Deganian & Thompson (2012) included Hazardous Waste
Inventory sites, RCRA hazardous waste storage sites and active solid waste
landfills sites in the tally of pollution points in each of the 10 km2 squares in
the study area, for comparison with demographic variables.
Earlier studies related the presence of TSDFs to population characteristics for
different definitions of the host areas—Census-designated areas (General
Accounting Office, 1983), postal ZIP codes (United Church of Christ, 1987),
and Census tracts (Anderton, Anderson, Oakes, & Fraser, 1994).
Data Source	Latitudes/Longitudes for all active commercial TSDF sites were obtained
from the RCRAInfo database.
Data Version	The 2015 version of EJSCREEN uses locational information retrieved from the
RCRAInfo database in May 2012 (U.S. EPA, 2011e). A total of 586 TSDF
facilities were included in this version of EJSCREEN. Note that the point data
used to show facility locations in the "Map Supplementary Layers" menu is
updated more often than the database with calculated EJSCREEN indicators
and indexes, so in some small number of cases a facility may be in one data
source but not the other.
Discussion	Each block group in the United States was assigned a proximity score that
was the population-weighted sum of block-level proximity scores. Appendix
C provides more details on the calculation of proximity scores. First, each
block was given a proximity score that was the sum of inverse distance-
weighted count of TSDFs anywhere within 5 km of the block's centroid. This
score can be thought of as the number of facilities per kilometer of distance
from the average person. It is also equal to the number of facilities divided
by the harmonic mean of their distances. This means one facility exactly 2
km away gives a score of 1/2, while three facilities exactly 4 km away give a
score of 3/4, and five facilities all at 1 km away give a score of 5.47 If there is
47 An adjustment was made so that any distance smaller than 90% of the block's "radius" was set equal to 90% of
that radius, with radius defined as the square root of (area/pi). This adjustment accounted for the fact that the
62 | P a g e

-------
Details on Environmental Indicators
no facility within 5 km of a block centroid, 1/d is used, where d is the
distance to the single nearest at any distance.
As with all proximity-based indicators, proximity alone may not represent
any actual risk or even exposure.
average location (residence) within a circle of radius R is 0.9R away from the average point (facility) that is within
the circle. For more detail, see Appendix C.
63 | P a g e

-------
Details on Environmental Indicators
Proximity to RMP Sites
Accidental releases of toxic substances and incidents involving fires and explosions can result from the
production, use, or transport of industrial materials. Evacuations, injuries and deaths have resulted in
some cases. Concern about the risks of chemical accidents led Congress to pass the Emergency Planning
and Community Right-to-Know Act of 1986 (EPCRA), and amendments to the Clean Air Act (CAA)
(section 112(r)), which together created reporting and planning obligations for a variety of facility types,
the EPA, and state and local planning and response organizations.
The facilities discussed here as "RMP facilities" are those facilities required by the CAA to file risk
management plans. The regulations under CAA section 112(r) establishes a List of Regulated
Substances—72 substances listed because of their high acute toxicity and 60 because of their flammable
or explosive potential—along with threshold quantities (TQs) for each. The listed substances are those
that pose the greatest risk of harm from accidental releases. If a facility maintains a quantity of any such
chemical above those TQs, it must file an RMP with EPA.
It should be noted that some concerns related to proximity to facilities are already accounted for in
NATA indicators for ambient air pollutants (e.g., cancer risk and hazard indexes), but NATA is based on
one year of reported annual releases, which would not account for accidental releases unless they
occurred that year.
Technical details about the methodology used to determine proximity to RMP facilities are provided in
Appendix C.
Indicator	The count of RMP facilities within 5 km, divided by distance, presented as
population-weighted averages of blocks in each block group. Adjustments
are made if there are none within 5 km, and so that the minimum distance
used is reasonable when very small.
Rationale for	RMP facilities are diverse in their size, structure, activities and the makeup
Inclusion	of the regulated substances. As with many types of industrial facilities,
there may be routine releases to the air and water of the residuals after
pollution control devices remove what is generally a large fraction of the
waste stream. Thus, people may be exposed to some substances directly
through inhalation or indirectly through water routes or via ingestion of
food. But the primary concerns with RMP facilities are the accidental
release of substances and fires or explosions. The sudden release of
relatively large quantities of acutely toxic substances can cause serious
health effects including death after inhalation or dermal exposure. These
effects may be prompt or may occur or persist for some time after
exposure. Fires may affect neighboring areas and the associated smoke
may expose people to toxic combustion products. Explosions may cause
material damage and injuries to people in neighboring areas. Local
64 | P a g e

-------
Details on Environmental Indicators
residents, as well as workers and emergency responders, may suffer severe
adverse effects.
Some EPA EJ screening tools have included measures of proximity to all
facilities regulated by EPA, which include RMP facilities.
More Information More information is available at the RMP program webpage
(http://www2.epa.gov/rmp) and the RMP Info database stored in
Envirofacts (http://www.epa.gov/enviro/facts/rcrainfo/search.html).
Relevant Studies The EJ literature contains numerous studies that have examined proximity
to various types of sites, including some relevant to the possibility or
frequency of chemical accidents. Since the 1980s, many studies have
examined the frequency and consequences of accidental releases of
acutely toxic chemicals or events resulting in fires or explosions (Binder,
1989, and many other studies). After the RMP program was established,
researchers examined the characteristics of the RMP reporting facilities
and their reported accident histories for insights into causes,
consequences, prevention and emergency response (Kleindorfer et al.,
2003).
Fewer studies have focused specifically on the relationship of RMP facilities
to the demographics of the surrounding populations. Disparity in acute
exposures to hazardous substances was addressed by Chakraborty (2001).
The study by Deganian & Thompson (2012) included two categories of
facilities—facilities having air pollution permits and facilities reporting to
the Toxics Release Inventory program—which include some overlap with
RMP facilities, but are not limited to RMP facilities. M. R. Elliott, Wang,
Lowe, & Kleindorfer (2004) examined the characteristics of RMP-reporting
facilities and their reported 5-year accident history versus the demographic
characteristics of the counties in which they are located. The demographic
characteristics examined included total population, race, education and
income. The study found an association between the presence of larger
and more chemical-intensive facilities and counties with larger African-
American populations, and in counties with high levels of income
inequality but higher median incomes. Further, the study found a greater
risk of accidents for facilities in heavily African-American counties.
Data Source	Latitudes/Longitudes for RMP facilities were obtained from EPA's RMP
database.
Data Version	The 2015 version of EJSCREEN uses locational information retrieved from
the RMP database in May 2012. A total of 12,759 RMP facilities were
65 | P a g e

-------
Details on Environmental Indicators
included in the proximity indicators and related EJ indexes in this version of
EJSCREEN. Note that the point data used to show facility locations in the
"Map Supplementary Layers" menu is updated more often than the
database with calculated EJSCREEN indicators and indexes, so in some
small number of cases a facility may be in one data source but not the
other.
Discussion	Each block group in the United States was assigned a proximity score that
was the population-weighted sum of block-level proximity scores.
Appendix C provides more details on the calculation of proximity scores.
First, each block was given a proximity score that was the sum of inverse
distance-weighted count of facilities anywhere within 5 km of the block's
internal point. This score can be thought of as the number of RMP facilities
per kilometer of distance from the average person. It is also equal to the
number of facilities divided by the harmonic mean of their distances. This
means one facility exactly 2 km away gives a score of 1/2, while three
facilities exactly 4 km away give a score of 3/4, and five facilities all at 1 km
away give a score of 5.48 If there is no facility within 5 km of a block
centroid, 1/d is used, where d is the distance to the single nearest at any
distance.
As with all proximity-based indicators, proximity alone may not represent
any actual risk or even exposure.
48 An adjustment was made so that any distance smaller than 90% of the block's "radius" was set equal to 90% of
that radius, with radius defined as the square root of (area/pi). This adjustment accounted for the fact that the
average location (residence) within a circle of radius R is 0.9R away from the average point (facility) that is within
the circle. For more detail, see Appendix C.
66 | P a g e

-------
4 BIBLIOGRAPHY
Anderton, D. L., Anderson, A. B., Oakes, J. M., & Fraser, M. R. (1994). Environmental equity: The demographics
of dumping. Demography, 31(2), 229-248.
Anderton, D. L., Oakes, J. M., & Egan, K. L. (1997). Environmental equity in Superfund. Demographics of the
discovery and prioritization of abandoned toxic sites. Eval Rev, 21(1), 3-26.
Baden, B. M., Noonan, D. S., & Turaga, R. M. R. (2007). Scales of justice: Is there a geographic bias in
environmental equity analysis? J Environ Planning Manage, 50(2), 163-185.
Baumann, L. M., Robinson, C. L., Combe, J. M., Gomez, A., Romero, K., Gilman, R. H., . . . Checkley, W. (2011).
Effects of distance from a heavily transited avenue on asthma and atopy in a periurban shantytown in
Lima, Peru .J Allergy Clin Immunol, 127(4), 875-882.
Been, V., & Gupta, F. (1997). Coming to the Nuisance or Going to the Barrios-A Longitudinal Analysis of
Environmental Justice Claims. Ecol Law Q, 24, 1-56.
Bell, M. L., & Ebisu, K. (2012). Environmental Inequality in Exposures to Airborne Particulate Matter
Components in the United States. Environ Health Perspect, 120(12), 1699-1704.
Bell, M. L., Zanobetti, A., & Dominici, F. (2014). Who is more affected by ozone pollution? A systematic review
and meta-analysis. Am J Epidemiol, 180(1), 15-28. http://dx.doi.org/10.1093/aje/kwull5
Berrocal, V. J., Gelfand, A. E., & Holland, D. M. (2010a). A bivariate space-time downscaler under space and
time misalignment. Ann Appl Stat, 4(4), 1942-1975.
Berrocal, V. J., Gelfand, A. E., & Holland, D. M. (2010b). A Spatio-Temporal Downscaler for Output From
Numerical Models. J Agric Biol Environ Stat, 15(2), 176-197.
Berrocal, V. J., Gelfand, A. E., & Holland, D. M. (2011). Space-Time Data fusion Under Error in Computer Model
Output: An Application to Modeling Air Quality. Biometrics, First published online: 29 Dec 2011.
Binder, S. (1989). Deaths, injuries, and evacuations from acute hazardous materials releases. Am J Public
Health, 79(8), 1042-1044.
67 | P a g e

-------
Boer, J. T., Pastor Jr., M., Sadd, L. L., & Synder, L. D. (1997). Is There Environmental Racism? The Demographics
of Hazardous Waste in Los Angeles County. Soc Sci Quart, 78(4), 793-810.
Bravo, M. A., Fuentes, M., Zhang, Y., Burr, M. J., & Bell, M. L. (2012). Comparison of exposure estimation
methods for air pollutants: Ambient monitoring data and regional air quality simulation. Environ Res,
116(1-10).
Brender, J. D., Maantay, J. A., & Chakraborty, J. (2011). Residential Proximity to Environmental Hazards and
Adverse Health Outcomes. Am J Public Health, 101(SI), S19-S26.
Briguglio, L. (1997). Alternative Economic Vulnerability Indices for Developing Countries. Report prepared for
the Expert Group on Vulnerability Index. UN (DESA), 17-19, December.
Brochu, P. J., Yanosky, J. D., Paciorek, C. J., Schwartz, J., Chen, J. T., Herrick, R. F., & Suh, H. H. (2011). Particulate
air pollution and socioeconomic position in rural and urban areas of the Northeastern United States.
Am J Public Health, 101(S1), S224-S230.
Byun, D., & Schere, K. (2006). Review of the governing equations, computational algorithms, and other
components of the models-3 Community Multiscale Air Quality (CMAQ) modeling system. Appl Mech
Rev, 59, 51-77.
Cauthen, N. K., & Fass, S. (2008). Measuring Poverty in the United States: National Center for Children in
Poverty. Mailman School of Public Health.
Centers for Disease Control and Prevention. (2010). Behavioral Risk Factor Surveillance System Survey Data.
Atlanta: U.S. Department of Health and Human Services, http://www.cdc.gov/brfss/
Centers for Disease Control and Prevention. (2011a). CDC Health Disparities and Inequalities Report — United
States, 2011. MMWR, January 14, 2011; Vol. 60 (Suppl). Retrieved from
http://www.cdc.gov/mmwr/pdf/other/su6001.pdf.
Centers for Disease Control and Prevention. (2011b). Deaths: Leading Causes for 2007. National Vital Statistics
Reports. Vol. 59, No. 8. Retrieved from http://www.cdc.gov/nchs/data/nvsr/nvsr59/nvsr59 08.pdf.
Centers for Disease Control and Prevention. (2011c). Epi Info™. Retrieved April 12, 2012, from
http://wwwn.cdc.gov/epiinfo/.
68 | P a g e

-------
Centers for Disease Control and Prevention. (2011d). MMWR, September 9, 2011; Vol. 60, No. 35. Retrieved
from http://www.cdc.gov/mmwr/pdf/wk/mm6035.pdf.
Centers for Disease Control and Prevention. (2012). CDC Response to Advisory Committee on Childhood Lead
Poisoning Prevention Recommendations in "Low Level Lead Exposure Harms Children: A Renewed Call of
Primary Prevention". Retrieved from
http://www.cdc.gov/nceh/lead/acclpp/cdc response lead exposure recs.pdf.
Chakraborty, J. (2001). Acute Exposure to Extremely Hazardous Substances: An Analysis of Environmental
Equity. Risk Analysis, 21(5), 883-894.
Chakraborty, J. (2006). Evaluating the Environmental Justice Impacts of Transportation Improvement Projects in
the US. Transp Res D Transp Environ, 11(5), 315-323.
Chakraborty, J., Maantay, J. A., and Brender, J. D. (2011). Disproportionate Proximity to Environmental Health
Hazards: Methods, Models, and Measurement. Am J Public Health, 101 (SI), S27-S36.
Clark, L. P., Millet, D. B., & Marshall, J.D. (2014). National Patterns in Environmental Injustice and Inequality:
Outdoor N02 Air Pollution in the United States. PLoS ONE 9(4): e94431.
http://dx.doi.10.1371/iournal.pone.0094431
Cohen, R. A., & Martinez, M. E. (2011). Health Insurance Coverage: Early Release of Estimates From the
National Health Interview Survey, January-March 2011: National Center for Health Statistics.
deFur, P. L., Evans, G. W., Cohen Hubal, E. A., Kyle, A. D., Morello-Frosch, R. A., & Williams, D. R. (2007).
Vulnerability as a Function of Individual and Group Resources in Cumulative Risk Assessment. Environ
Health Perspect, 115(5), 817-824.
Deganian, D., & Thompson, J. (2012). The Patterns of Pollution: A Report on Demographics and Pollution in
Metro Atlanta: GreenLaw.
Elliott, M. R., Wang, Y., Lowe, R. A., & Kleindorfer, P. R. (2004). Environmental justice: Frequency and severity of
US chemical industry accidents and the socioeconomic status of surrounding communities. J Epidemiol
Community Health, 58, 24-30.
Environmental Health Sciences. Environmental Health News. Online periodical. Retrieved August 24, 2012,
from www.environmentalhealthnews.org.
69 | P a g e

-------
Fann, N., Lamson, A. D., Anenberg, S. C., Wesson, K., Risley, D., & Hubbell, B. J. (2012). Estimating the national
public health burden associated with exposure to ambient PM2.5 and ozone. Risk Analysis, 32(1), 81-
95.
Fann, N., & Risley, D. (2011). The public health context for PM2.5 and ozone air quality trends. Air Qual Atmos
Health, [Epub ahead of print January 5, 2011],
Fann, N., Roman, H. A., Fulcher, C. M., Gentile, M. A., Hubbell, B. J., Wesson, K., & Levy, J. I. (2011). Maximizing
health benefits and minimizing inequality: Incorporating local-scale data in the design and evaluation of
air quality policies. Risk Analysis, 31(6), 908-922.
Fedstats. (2007). FedStats. Retrieved April 12, 2012, from http://www.fedstats.gov/.
Finkel, A. M., & Golding, D. (Eds.). (1994). Worst Things First? The Debate Over Risk-Based National
Environmental Priorities. Washington DC: Resources for the Future.
Fitos, E., and Chakraborty, J. (2010). Race, Class, and Wastewater Pollution. In J. Chakraborty and M. M.
Bosman (Eds.) Spatial and Environmental Injustice in an American Metropolis: A Study of Tampa Bay,
Florida. Cambria Press: Amherst, NY; 59-82.
Fujita, E. M., Campbell, D. E., Zielinska, B., Arnott, W. P., & Chow, J. C. (2011). Concentrations of air toxics in
motor vehicle-dominated environments, Research Report 156. Boston: Health Effects Institute.
Gaitens, J. M., Dixon, S. L., Jacobs, D. E., Nagaraja, J., Wilson, J. W., & Ashley, P. J. (2009). Exposure of U.S.
children to residential dust lead, 1999-2004: I. Housing and demographic factors. Environ Health
Perspect, 117(3), 461-467.
Galea, S., Tracy, M., & Hoggatt, K. J. (2011). Estimated Deaths Attributable to Social Factors in the United
States. Am J Public Health, 101(8), 1456-1465.
General Accounting Office. (1983). Siting of Hazardous Waste Landfills and their Correlation with Racial and
Economic Status of Surrounding Communities. Washington, DC: Retrieved from
http://www.gao.gov/products/121648.
Grineski, S. (2007). Incorporating Health Outcomes into Environmental Justice Research: The Case of Children's
Asthma and Air Pollution in Phoenix, Arizona. Environmental Hazards, 7, 360-371.
70 | P a g e

-------
Grineski, S., Bolin, B., and Boone, C. (2007). Criteria Pollution and Marginal Populations: Environmental Inequity
in metropolitan Phoenix, Arizona, USA. SocSci Q, 88, 535-554.
Health Effects Institute. (2010). Traffic-Related Air Pollution: A Critical Review of the Literature on Emissions,
Exposure, and Health Effects (HEI Special Report 17).
Hird, J. A. (1993). Environmental policy and equity: The case of superfund. J Policy Anal Manag, 12(2), 323-343.
Hoek, G., Beelen, R., Kos, G., Dijkema, M., Zee, S. C. v. d., Fischer, P. H., & Brunekreef, B. (2011). Land Use
Regression Model for Ultrafine Particles in Amsterdam. Environ Sci Technol, 45(2), 622-628.
Hoffman, B., Moebus, S., Dragano, N., Mohlenkamp, S., Memmesheimer, M., Erbel, R., & Jockel, K. H. (2009).
Residential traffic exposure and coronary heart disease: results from the Heinz Nixdorf Recall Study.
Biomarkers, 14(SI), 74-78.
Hystad, P., Setton, E., Cervantes, A., Poplawski, K., Deschenes, S., Brauer, M., . .. Demers, P. (2011). Creating
national air pollution models for population exposure assessment in Canada. Environ Health Perspect,
119(8), 1123-1129.
Jacobs, D. E., Clickner, R. P., Zhou, J. Y., Viet, S. M., Marker, D. A., Rogers, J. W., .. . Friedman, W. (2002). The
Prevalence of Lead-Based Paint Hazards in U.S. Housing. Environ Health Perspect, 110(10), a599-a606.
Jones, R. L., Homa, D. M., Meyer, P. A., Brody, D. J., Caldwell, K. L., Pirkle, J. L., & Brown, M. J. (2009). Trends in
blood lead levels and blood lead testing among US children aged 1 to 5 years, 1988-2004. Pediatrics,
123(3), e376-e385.
Kleindorfer, P. R., Belke, J. C., Elliott, M. R., Lee, K., Lowe, R. A., & Feldman, H. I. (2003). Accident epidemiology
and the U.S. chemical industry: Accident history and worst-case data from RMP*lnfo. Risk Analysis,
23(5), 865-881.
Krewski, D., Jerrett, M., Burnett, R. T., Ma, R., Hughes, E., Shi, Y., . . . Thun, M. J. (2009). Extended Follow-Up
and Spatial Analysis of the American Cancer Society Study Linking Particulate Air Pollution and
Mortality, Research Report 140. Boston: Health Effects Institute.
Levin, R., Brown, M. J., Kashtock, M. E., Jacobs, D. E., Whelan, E. A., Rodman, J., . . . Sinks, T. (2008). Lead
Exposures in U.S. Children, 2008: Implications for Prevention. Environ Health Perspect, 116(10), 1285-
1293.
71 | P a g e

-------
Levy, J. I., Wilson, A. M., & Zwack, L. M. (2007). Quantifying the Efficiency and Equity Implications of Power
Plant Air Pollution Control Strategies in the United States. Environ Health Perspect, 115(5), 743-750.
Liu, F. (2001). Environmental Justice Analysis: Theories, Methods, and Practice. Boca Raton: CRC Press.
https://www.crcpress.com/product/isbn/9781566704038
Lobdell, D. T., Jagai, J. S., Rappazzo, K., & Messer, L. C. (2011). Data sources for an environmental quality index:
Availability, quality, and utility. Am J Public Health, 101(S1), S277-S285.
Maantay, J., Chakraborty, J., & Brender, J. (2010). Proximity to Environmental Hazards: Environmental Justice
and Adverse Health Outcomes, Final Revised Draft. Prepared for the U.S. Environmental Protection
Agency "Strengthening Environmental Justice Research and Decision Making: A Symposium on the
Science of Disproportionate Environmental Health Impacts". Final publication available at
http://dx.doi.org/10.2105/AJPH.2011.30Q183
Maroko, A. R. (2012). Using air dispersion modeling and proximity analysis to assess chronic exposure to fine
particulate matter and environmental justice in New York City. Appl Geogr, 34, 533-547.
McMillan, N. J., Holland, D. M., Morara, M., & Feng, J. (2010). Combining numerical model output and
particulate data using Bayesian space-time modeling. Environmetrics, 21(1), 48-65.
Messer, L. C., Jagai, J. S., Rappazzo, K. M., Lobdell, D. T. (2014). Construction of an environmental quality index
for public health research. Environ Health. May 22;13(1):39. http://dx.doi.org/10.1186%2F1476-069X-
13-39
Miranda, M. L., Edwards, S. E., Keating, M. H., & Paul, C. J. (2011). Making the Environmental Justice Grade: The
Relative Burden of Air Pollution Exposure in the United States. IntJ Environ Res Public Health, 8(6).
Mohai, P., & Saha, R. (2007). Racial Inequality in the Distribution of Hazardous Waste: A National-Level
Reassessment. Social Problems, 54(3), 343-270.
Morello-Frosch, R., & Jesdale, B. M. (2006). Separate and Unequal: Residential Segregation and Estimated
Cancer Risks Associated with Ambient Air Toxics in U.S. Metropolitan Areas. Environ Health Perspect,
114(3), 386-393.
National Research Council of the National Academies Committee on Improving Risk Analysis Approaches Used
by the U.S. EPA. (2009). Science and Decisions: Advancing Risk Assessment. Washington, DC: The
National Academies Press, http://www.nap.edu/catalog/12209/science-and-decisions-advancing-risk-
assessment
72 | P a g e

-------
National Environmental Justice Advisory Council (NEJAC). (2010). Recommendations for Nationally Consistent
Environmental Justice Screening Approaches.
http://www.epa.gov/environmentaliustice/resources/publications/neiac/ei-screening-approaches-rpt-
2010.pdf via http://www.epa.gov/environmentaliustice/neiac/recommendations.html
O'Neil, S. G. (2007). Superfund: Evaluating the Impact of Executive Order 12898. Environ Health Perspect,
115(7), 1087-1093.
Oakes, J. M., Anderton, D. L., & Anderson, A. B. (1996). A Longitudinal Analysis of Environmental Equity in
Communities with Hazardous Waste Facilities. SocSciRes, 25, 125-148.
The Organisation for Economic Co-operation and Development (OECD). (2008). Handbook on Constructing
Composite Indicators.
http://www.oecd.org/els/soc/handbookonconstructingcompositeindicatorsmethodologyanduserguide.
htm
Pastor Jr., M., Morello-Frosch, R., & Sadd, J. (2010). Final Report: Contract Number 04-308. Air Pollution and
Environmental Justice: Integrating Indicators of Cumulative Impact and Socio-Economic Vulnerability
into Regulatory Decision-Making.
Pastor Jr., M., Sadd, J., & Hipp, J. (2001). Which Came First? Toxic Facilities, Minority Move-In, and
Environmental Justice. J Urban Affairs, 23(1), 1-21.
Post, E. S., Belova, A., & Huang, J. (2011). Distributional Benefit Analysis of a National Air Quality Rule. IntJ
Environ Res Public Health, 8(6), 1872-1892.
Probst, K. N. (1990). Hazardous Waste and the Rural Poor: A Preliminary Assessment: Clean Sites, Inc.
Rosenbaum, A., Hartley, S., & Holder, C. (2011). Analysis of diesel particulate matter health risk disparities in
selected US harbor areas. Am J Public Health, 101(SI), S217-S223.
Rowangould, G.M. (2013). A census of US near-roadway population: Public health and environmental justice
considerations. Transp Res D Transp Environ, 25, 59-67.
Sadd, J. L., Pastor, M., Morello-Frosch, R., Scoggins, J., & Jesdale, B. (2011). Playing it safe: Assessing cumulative
impact and social vulnerability through an environmental justice screening method in the South Coast
Air Basin, California. IntJ Environ Res Public Health, 8(5), 1441-1459.
73 | P a g e

-------
Saha, R., & Mohai, P. (2005). Historical Context and Hazardous Waste Facility Siting: Understanding Temporal
Patterns in Michigan. Social Problems, 52(4), 618-648.
S0rensen, M., Hvidberg, M., Andersen, Z. J., Nordsborg, R. B., Lillelund, K. G., Jakobsen, J., . . . Raaschou-Nielsen,
O. (2011). Road traffic noise and stroke: A prospective cohort study. Eur Heart J, 32(6), 737-744.
Spengler, J., Lwebuga-Mukasa, J., Vallarino, J., Melly, S., Chillrud, S., Baker, J., & Minegishi, T. (2011). Air Toxics
Exposure from Vehicle Emissions at a U.S. Border Crossing: Buffalo Peace Bridge Study, Research
Report 158. Boston: Health Effects Institute.
Su, J. G., Morello-Frosch, R., Jesdale, B. M., Kyle, A. D., Shamasunder, B., & Jerrett, M. (2009). An index for
assessing demographic inequalities in cumulative environmental hazards with application to Los
Angeles, California. Environ Sci Technol, 43(20), 7626-7634.
Tian, N., Xue, J., & Barzyk, T.M. (2013). Evaluating socioeconomic and racial differences in traffic-related
metrics in the United States using a GIS approach. J Exposure Science Envt Epidem, 23, 215-222.
U.S. Census Bureau. (2013). American Community Survey, 5-year Estimates 2008-2012. Data retrieved in 2013,
from http://www2.census.gov/acs2012 5yr/summaryfile/ and
http://www.census.gov/acs/www/data documentation/summary file/
U.S. Department of Transportation. (2011). National Transportation Atlas Database. Highway Performance
Monitoring System. Data retrieved April 12, 2012, from
http://www.rita.dot.gov/bts/sites/rita.dot.gov.bts/files/publications/national transportation atlas dat
abase/2011/index.html.
U.S. EPA. (1987). Unfinished Business: A Comparative Assessment of Environmental Problems. Washington, DC:
Retrieved from http://nepis.epa.gov/Exe/ZvPURL.cgi?Dockev=2000BZQP.txt.
U.S. EPA. (1992). Environmental Equity: Reducing the Risk for All Communities. Washington, DC: Retrieved from
http://www.epa.gov/compliance/ei/resources/reports/annual-proiect-
reports/reducing risk com voll.pdf.
U.S. EPA. (2006a). Air Quality Criteria for Lead (2006) Final Report. EPA/600/R-05/144aF-bF. Washington, DC.
http://cfpub.epa.gov/ncea/cfm/recordisplav.cfm?deid=158823
U.S. EPA. (2006b). Air Quality Criteria for Ozone and Related Photochemical Oxidants. Washington, DC:
Retrieved from http://cfpub.epa.gov/ncea/cfm/recordisplav.cfm?deid=149923.
74 | P a g e

-------
U.S. EPA. (2009a). A Framework for Categorizing the Relative Vulnerability of Threatened and Endangered
Species to Climate Change (External Review Draft). EPA/600/R-09/011. Washington, DC.
http://cfpub.epa.gov/ncea/cfm/recordisplay.cfm7deich203743
U.S. EPA. (2009b). Integrated Science Assessment for Particulate Matter (Final Report). Washington, DC:
Retrieved from http://cfpub.epa.gov/ncea/cfm/recordisplav.cfm?deid=216546.
U.S. EPA. (2009c). National-Scale Air Toxics Assessment for 2002 - Fact Sheet. Retrieved April 12, 2012, from
http://www.epa.gov/ttn/atw/nata2002/factsheet.html.
U.S. EPA. (2009d). Original list of hazardous air pollutants. Retrieved April 12, 2012, from
http://www.epa.gov/ttnatw01/187polls.html.
U.S. EPA. (2010). Interim Guidance on Considering Environmental Justice During the Development of an Action.
Retrieved from http://www.epa.gov/compliance/ei/resources/policv/considering-ei-in-rulemaking-
guide-07-2010.pdf.
U.S. EPA. (2011a). Exposure Factors Handbook: 2011 Edition. Washington, DC: Retrieved from
http://www.epa.gov/ncea/efh/pdfs/efh-complete.pdf.
U.S. EPA. (2011b). [reference no longer used],
U.S. EPA. (2011c). Integrated Science Assessment for Lead (1st External Review Draft 2011 & Final Report 2013).
Washington, DC. EPA/600/R-1 0/075F. http://cfpub.epa.gov/ncea/cfm/recordisplav.cfm?deid=255721
U.S. EPA. (2011d). [reference no longer used]
U.S. EPA. (2011e). RCRAInfo database search in Envirofacts. Data retrieved April 12, 2012, from
http://www.epa.gov/enviro/facts/rcrainfo/search.html.
U.S. EPA. (2012a). Facilities and Enforcement Activities related to the Clean Water Act National Pollutant
Discharge Elimination System (NPDES) Program. Data retrieved April 12, 2012, from
http://www.epa.gov/oecaerth/data/results/performance/cwa/index.html.
U.S. EPA. (2012b). Industrial Regulations. Retrieved April 12, 2012, from
http://water.epa.gov/scitech/wastetech/guide/industry.cfm.
75 | 3 a g e

-------
U.S. EPA. (2012c). Integrated Risk Information System (IRIS). Retrieved April 12, 2012, from
http://www.epa.gov/IRIS/.
U.S. EPA. (2012d). Our Nation's Air: Status and Trends Through 2010. Research Triangle Park: Retrieved from
http://www.epa.gov/airtrends/2011/report/fullreport.pdf.
U.S. EPA. (2012e). RCRA Orientation Manual 2011: Resource Conservation and Recovery Act. Retrieved August
24, 2012, from http://www.epa.gov/osw/inforesources/pubs/orientat/.
U.S. EPA. (2012f). Search Superfund Site Information. Data retrieved May, 2012, from
http://cumulis.epa.gov/supercpad/cursites/srchsites.cfm.
U.S. EPA. (2012g). Summary Nonattainment Area Population Exposure Report. Retrieved August 24, 2012, from
http://www.epa.gov/oar/oaqps/greenbk/popexp.html.
U.S. EPA. (2015a). Fused Air Quality Surfaces Using Downscaling. Data retrieved from
http://www.epa.gov/esd/land-sci/lcb/lcb faqsd.html.
U.S. EPA. (2015b). National-Scale Air Toxics Assessments. Data retrieved from
http://www.epa.gov/ttn/atw/natamain/index.html.
United Church of Christ. (1987). Toxic Waste and Race in the United States: A National Report on the Racial and
Socio-Economic Characteristics of Communities with Hazardous Waste Sites: Commission for Racial
Justice, http://www.ucc.org/environmental-ministries toxic-waste-20
United Church of Christ. (2007). Toxic Wastes and Race at Twenty: 1987-2007. Cleveland: Justice and Witness
Ministries, http://www.ucc.org/environmental-ministries toxic-waste-20
van Donkelaar, A., Martin, R. V., Brauer, M., Kahn, R., Levy, R., Verduzco, C., & Villeneuve, P. J. (2010). Global
Estimates of Ambient Fine Particulate Matter Concentrations from Satellite-Based Aerosol Optical
Depth: Development and Application. Environ Health Perspect, 118(6), 847-855.
http://dx.doi.org/10.1289/ehp.09Q1623
VanDerslice, J. (2011). Drinking water infrastructure and environmental disparities: evidence and
methodological considerations. Am J Public Health, 101(S1), S109-S114.
76 | P a g e

-------
Villagran de Leon, J. C. (2006). Vulnerability: A Conceptual and Methodological Review (Vol. 4/2006): SOURCE
(Studies Of the University: Research, Counsel, Education), United Nations University - Institute for
Environmental and Human Security.
Yang, T.-C., & Matthews, S. A. (2010). The Role of Social and Built Environments in Predicting Self-rated Stress:
A Multilevel Analysis in Philadelphia. Health & Place, 16(5), 803-810.
Zhou, Y., & Levy, J. I. (2007). Factors influencing the spatial extent of mobile source air pollution impacts: A
meta-analysis. BMC Public Health, 7(89).
Zhu, Y., Hinds, W. C., Kim, S., & Sioutas, C. (2002). Concentration and size distribution of ultrafine particles near
a major highway. J Air Waste Manag Assoc, 52(9), 1032-1042.
Zimmerman, R. (1993). Social Equity and Environmental Risk. Risk Analysis, 13(6), 649-666.
77 | P a g e

-------
Appendix A
APPENDIX A. DEVELOPMENT OF EJSCREEN
Review of Available Data and Other Tools
Preliminary planning for EJSCREEN began in late 2010. The first steps involved a review of existing or planned EJ
screening methods from EPA program and regional offices. EJSCREEN draws upon a great deal of prior research,
analysis and public involvement in the development of very closely related screening efforts. Early steps also
included a review of current EJ research on methods and data for EJ analysis. Information was gathered from
the following sources, among others:
•	Stakeholder and expert presentations at EPA's March 2010 conference on environmental justice.
•	EPA's 2010 expert workshop on economics and environmental justice.
•	EPA's ORD's C-FERST research program, including a review of data sources for EJ analysis.
•	EPA's ORD's Environmental Quality Index (EQI) compilation and review of data sources for
environmental indicators.
•	A review of several national reviews of analytic methods including the use of inequality metrics
(e.g. as presented in the expert workshop).
•	Review of EPA guidance documents and related documents on environmental justice49
•	Review of the NEJAC report of May 2010 on EJ screening (NEJAC, 2010).
•	Review of prior tools including EJSEAT, EJVIEW, various EPA Regional tools, and some state EJ
screening tools such as CalEnviroScreen.
Selecting an Approach to EJ Screening
A number of important considerations must be balanced when selecting an approach to an EJ screening tool:
•	Useful to end-users and other stakeholders.
•	Reflects EPA policies and EJ policy goals.
•	Reflects sound science.
•	Is feasible to develop and maintain, update and upgrade.
Data coverage and quality considerations are also discussed in the chapters describing the environmental
indicators.
EJSCREEN was developed through an EPA workgroup with participation from a very wide range of program
offices and Regional offices, and in consultation with management and scientists representing the various
offices, building upon the public input and scientific information developed in the course of prior screening
efforts such as EJSEAT and Regional experience with EJ screening. Quality control and peer review of EJSCREEN
are described in Appendices F and G.
[49 http://www.epa.gov/environmentaliustice/resources/policv/index.html
78 | P a g e

-------
Appendix B
[intentionally blank]
79 | P a g e

-------
Appendix B
APPENDIX B. TECHNICAL DETAILS ON PERCENTILES, ROUNDING,
BUFFERING, AND DEMOGRAPHIC DATA
How Percentiles are Calculated and Displayed
Percentiles, such as "80th percentile," are displayed in EJSCREEN as rounded down to the closest percentile
that is lower than the exact value. This is called showing the "floor" of the exact percentile (rather than
rounding off the percentile to the nearest 1 percentile). For example, if the exact percentile is equal to or
greater than 79 but less than 80, it is displayed as "79th percentile." If the exact percentile is equal to or greater
than 80 but less than 81, it is displayed as "80th percentile." The reason for this is to ensure that EJSCREEN only
displays "80th percentile" if the exact percentile truly is as high as 80.50
Ties in indicator values are fairly common, especially for the lead paint indicator and percent linguistic isolation,
where large shares are tied with values of zero. Ties are assigned a percentile that can be thought of as the
upper edge of the range of tied values. For example, if 3 percent of the US population were tied at the
maximum indicator value, they would all be shown as being at the 100th percentile, and the next lower value
would be assigned the 97th percentile. If 4 percent were all tied for the lowest value, they would all be shown
as being at the 4th percentile, and nobody would be shown as being at the 0-3 percentiles. The percent
linguistic isolation was zero in block groups comprising about 45% of the US population in the 2008-2012 ACS
data, so all those places were shown as tied for the 45th percentile, and none were reported as being at any
lower percentiles. However, it is worth noting that a group of tied values is usually not shared by more than 1%
of the population, so once the percentiles are converted to integers 0-100, the tied raw values do not cause
jumps in percentiles except in a few cases. A jump would be a case where there are no block groups assigned a
percentile between zero and 45, because around 45% of the population is tied with a value of zero.
Percentiles are assigned to calculated values (such as in buffer reports) by use of national, region-specific, and
state-specific lookup tables that show the raw value cutoff value that corresponds to each integer percentile 0-
100. To ensure that exact matches are found when looking for the 100th percentile, for example, the cutoff
values in the lookup tables are all stored with exactly 6 decimal places, and a raw value is rounded to exactly 6
decimal places before it is looked up in those tables. If a value matches the cutoff, it is assigned that percentile.
If it falls between two cutoffs, it is assigned the lower of the two percentiles, to provide displayed results that
are consistent with the way percentiles are displayed using the "floor" function described above. The lookup
tables are stored in a geodatabase used for EJCSREEN.
50 This also ensures that map colors correspond to displayed percentiles. For example, if the exact percentile is 79.99, the
map will show the place as gray, meaning it is still below the 80th percentile, and the percentile will be shown as "79th
percentile." If the map is yellow, it indicates the exact percentile is at least 80, and the displayed percentile will also be at
least 80. Without using the "floor" of the exact percentile, the map colors and displayed percentile would sometimes
disagree, and a user would not be sure if "80th percentile" actually meant the exact percentile was actually at least 80.
Using the "floor" instead of rounding ensures clarity about whether a place actually reaches a given percentile.
80 | Page

-------
Appendix B
In output tables, percentages are rounded to the nearest percent but percentiles are displayed using a "floor"
function as described above. Occasionally, this can lead to some potentially confusing situations in some
tables. For example, a place may be shown as 100% minority but only at the 98th percentile. This is because the
place may actually be 99.6% minority, which is displayed as 100% minority. But if 1.8% of the US population
lives where there is an even higher percent minority (e.g., 100%), this place is only at the 98.2 percentile, which
is displayed as 98th percentile.
The percentiles and lookup tables were calculated using the statistical software called R, using code written by
EPA, based on wtd.quantile() and wtd.Ecdf() functions in the Hmisc package (http://cran.r-
proiect.org/web/packages/Hmisc/index.html). The scripting language R is documented here: http://cran.r-
project.org
How Percentages and Raw Values are Rounded and Displayed
ESJSCREEN displays raw indicator numbers and percentiles in a standard report (on-screen or PDF format), the
"Explore reports" window, a tabular view (on-screen or downloaded text file), bar graphs, popup windows on
maps, and in the downloadable raw data files. Several standard rules have been applied to keep these formats
consistent, clear, and at an appropriate level of detail.
The raw data stored in the database used in EJSCREEN are stored as the "exact" values calculated from
estimated counts obtained from the Bureau of Census, or from the development of environmental indicators,
at the highest degree of precision used by the software calculating the indicators and by the GIS database. This
ensures that all internal calculations use the best estimate of a given number rather than relying on a rounded
off approximation, and is standard best practice in working with such data. All calculations use the "exact"
(unrounded) numbers from Census or stored in the GIS database. This includes converting raw Census data into
a demographic indicator, calculating an EJ index, or estimating the values for a buffer report.
When displaying data, such as in reports, popup windows on maps, or the tabular view, EJSCREEN presents
formatted numbers that follow certain conventions:
- Raw environmental indicator values are displayed using specified numbers of significant figures (also known
as significant digits). This is a standard way of communicating precision appropriately. Precision of these
estimates depends largely on sample size in Census survey data, and the ability of measurements and models
to estimate environmental conditions. Two significant figures are shown for all environmental indicators other
than PM2.5, ozone, and diesel PM, which are shown with three significant figures. For example, a cancer risk
calculated to be 144.44 per million would be displayed as 140 (i.e., using 2 significant figures). A PM2.5
concentration of 14.44 would be shown as 14.4 (i.e., using 3 significant figures). Proximity scores of at least
0.185 but less than 0.195 would be displayed as 0.19 (which shows 2 significant figures). This means a proximity
score displayed as 0.010 came from an exact value of at least 0.0095 but less than 0.015, for example. Note
that these significant figure rules have been applied in all cases, and if in some case the number is missing a
trailing zero that should appear, it is simply a limitation of print formatting. For example, if a proximity score is
81 | P a g e

-------
Appendix B
shown as 0.1, best practice would be to display it as 0.10 to make explicit the use of 2 significant figures, a
printout may only display it as 0.1 instead of 0.10, but it was still rounded using the 2 significant figures rule.
- Demographic percentages, such as "34% low-income," are displayed as rounded to the nearest 1%. For
example, any values equal to or greater than 79.5% but less than 80.5% are displayed as "80%."
It is also important to keep in mind that all of the numbers are estimates, so small differences in raw values or
percentiles should not be regarded as certain and meaningful, given the uncertainty in the environmental and
demographic estimates.
Calculations for Buffer Reports
EJSCREEN allows a user to define a buffer, such as the circle that includes everything within 1 mile of a specific
point. Non-circular, user-defined shapes also can be defined to represent buffers of any shape. A report
summarizes the demographics of residents within this buffer, as well as the environmental indicators and EJ
index values within the buffer.
The summary within a buffer is designed to represent the average resident within the buffer, and also provides
an estimate of the total population residing in the buffer. For example, the traffic proximity indicator for a
buffer is the population-weighted average of all the traffic indicator values in the buffer. Similarly, the percent
minority would be a weighted average, which is the same as the overall percent minority for all residents in the
buffer.
Some block groups will be partly inside and partly outside a buffer, and any buffer analysis must estimate how
much of each block group's population is inside the buffer. Areal apportionment of block groups is one
standard method, but it assumes that population is evenly spread throughout a block group, which may be far
from the actual distribution of residents. Areal apportionment of blocks would be even more accurate but
extremely computationally intensive.
To provide the most accurate counts that are currently feasible for a screening tool, EJSCREEN uses an
approach based on Census block internal points. EJSCREEN estimates the fraction of the Census block group
population that is inside the buffer by using block-level population counts from Census 2010. These blocks
provide data about where residents are at a higher resolution than block groups. Each block has an internal
point defined by the Census Bureau, and the entire block population is counted as inside or outside the buffer
depending on whether the block internal point is inside or outside. This assumption typically introduces
relatively little error because blocks are so small relative to a typical buffer, so a small fraction of the total
buffer population is in blocks that span an edge of the buffer. Also, any blocks along the edge of a buffer whose
populations are close to 0 or 100% inside the buffer will be well represented by this assumption.
As long as users draw buffers much larger than a local block group, this method should represent the average
person inside the buffer reasonably well.
82 | P a g e

-------
Appendix B
The calculation of a value for the buffer is essentially the population-weighted average of the indicator values
in the blocks included in the buffer, where each block uses the indicator values of the block group containing it,
A block group is weighted based on the fraction of the ACS block group population that is considered in the
buffer. That fraction is estimated as the Census 2010 block population divided by the Census 2010 block group
population. The formula below is used to estimate the population average of a raw indicator value in a buffer.
This formula is simply a population-weighted average - it sums the population-weighted raw values, and then
divides that sum by the total population in the buffer.
"BlockPoplO" refers to the Census 2010 block level population total (used here because the ACS does not
provide block resolution), and "BG" indicates block group. "BGACSPop" is the block group estimated population
count from the ACS, which is often different than the Census 2010 total for all blocks in the block group,
because the ACS data used here is a composite estimate based on survey samples spanning five years, while
the Census is a full count at one point in time.
Demographic Data and Geographic Coverage
In the first decade of this century, the Census Bureau made a fundamental shift in how detailed demographic
data are collected. Rather than collecting basic data from everyone, plus more detailed data from a one-in-six
sample of households once a decade in the decennial census, a mixed approach has been adopted. The basic
data, required for Congressional redistricting under the U.S. Constitution, are still collected every ten years in
what is intended to be a 100% census. But that basic information, plus virtually all the more detailed
demographic data, are also collected throughout each year in a stratified random sample of more than 200,000
households each month. This is the American Community Survey (ACS). Some of this information is then
aggregated and displayed in yearly summaries, others in 3-year summaries, and others in 5-year summaries.
Only the five-year summary files provide block group resolution. The result is a timelier, evolving picture of U.S.
demographics. For instance, the ACS 2005 to 2009 average data were released in December 2010, while most
demographics data users were still working with the April 2000 decennial census snapshot.
Extensive documentation of the ACS is available. For a general overview, see
http://www.census.gov/acs/www/ and for complete documentation see
http://www.census.gov/acs/www/data documentation/documentation main/. For information on the 5-year
summary file, which is what EJSCREEN uses, see
http://www.census.gov/acs/www/data documentation/summary file/. For information on using the data see
http://www.census.gov/acs/www/guidance for data users/handbooks/).
Race and ethnicity, the two items that determine minority status in our approach, are available from the 100%
enumeration from the decennial census or the ACS, while all the other measures are only contained in ACS
estimates. For the purposes of EJSCREEN, EPA did not believe that the increased precision of the minority
Fai„e(A)= £ _BCPovW
BlockPoplO
* BGACSPop * BG_RawValue
VBlktflknA 2-iVBlk£lkr\A
V
BlkPoplQ
BGPoplG *BGACSP°P
83 | P a g e

-------
Appendix B
measures that might be gained from combining those once per decade data with the other data items from
ACS were worth the problems and ambiguities entailed in such a hybrid approach.
All of EJSCREEN's demographic data come from the latest annual update of the five-year average ACS
estimates, with some lag time from publication by Census to inclusion in EJSCREEN. The Census Bureau does
not recommend making direct comparisons between data from a five-year summary and a prior, overlapping
five-year period, such as comparing 2007-2011 to 2008-2012.51 This means attempting to look for trends in
terms of year-to-year changes is not recommended - changes in ACS estimates at block group resolution can
be reviewed every five years, but not more often. Yearly changes can be examined at county resolution using
the 1-year ACS data.
Each of the nation's counties (or county equivalents, such as Municipios in Puerto Rico) is completely divided
into Census tracts. Each tract is in turn divided into Census block groups. Census block groups generally have
between 600 and 3,000 people, with an optimal population of 1,500; however, a few are much more populous
and a small number have zero residents. A block group consists of one or more Census blocks. In urban areas, a
block is typically a city block defined by streets. The Census Bureau collects data by household, but block
groups are the smallest area for which the ACS presents estimates.
Tracts and block groups are defined in ACS exactly as in the Census 2010 other than a few exceptions due to
updates or corrections.52 For the 2008-2012 ACS data in EJSCREEN, Census made changes in a few dozen block
groups and their tracts and FIPS codes in NY, AZ, and CA53. Also see details in the discussion of how NATA data
was converted to 2010 boundaries, in Section 3.
There are 11,078,297 blocks in the Census 2010 data, not including Puerto Rico and the Island Areas,54 but
since then some changes have been made in the FIPS codes that relate blocks to their parent block groups.
Block to block group relationships are used by EJSCREEN in part of the buffer analysis (as described in Appendix
C). This required making manual adjustments to reassign some Census 2010 blocks to their updated parent ACS
block groups for purposes of calculating buffer estimates.
The EJSCREEN dataset based on the ACS 2008-2012 summary file has data for 51 States/equivalents (includes
DC), 3,143 counties/county equivalents, 73,056 census tracts, and 217,739 block groups. Puerto Rico (2,594
block groups) is part of the ACS, but could not be included in the 2015 version of the EJSCREEN dataset due to
environmental data and other constraints, although it may be included in future versions. The Census Bureau
does not collect ACS data for the Virgin Islands or the other Island Territories.
Compared to the ACS 2008-2012, the Census 2010 tallies counted one block group more than are included in
the ACS (or EJSCREEN) - an extra block group in New York State - but they were otherwise identical in block
51	http://www.census.gov/acs/www/guidance for data users/comparing data/
52	http://www.census.gov/acs/www/data documentation/geography/
53	See https://www.census.gov/acs/www/data documentation/2011 geography release notes/ on NY changes and
https://www.census.gov/acs/www/data documentation/geography notes/index.php on AZ and CA changes.
54	http://www.census.gov/geo/maps-data/data/tallies/tractblock.html
84 | Page

-------
Appendix B
group counts per state. The count of geographies (e.g., number of block groups) covered by the ACS is
summarized by the Census Bureau for each update:
•	ACS tallies, as counted in data downloaded from ACS FTP site, and as used in EJSCREEN (217,739 block
groups, without Puerto Rico)
•	ACS tallies, with Puerto Rico55: 220,333 with PR, 217,739 without Puerto Rico.
•	Census 2010 tallies, by State/PR56: 220,334 with Puerto Rico, and 217,740 without Puerto Rico.
Table 4. Tallies of 2008-12 ACS Block Groups Used in 2015 Version of EJSCREEN
Geography
Number of Block
Groups
In EJSCREEN?
Continental U.S., 48 States plus DC
216,330
Yes
Alaska & Hawaii
534 & 875
Yes
SUBTOTAL: Included in EJSCREEN
217,739
Yes
Puerto Rico
2,594
No
Virgin Islands and other Island Territories
(American Samoa, Commonwealth of the
Northern Mariana Islands, Guam)
408
No
Source: U.S. Census Bureau
http://www.census.gov/acs/www/data documentation/areas published/ and
http://www.census.gov/geo/maps-data/data/tallies/tractblock.html
Demographic Variables and Formulas
This section provides details on the Census variables and formulas used to calculate demographic indicators for
each block group. Short variable names used in EJSCREEN internal calculations are shown below, preceded by
tabular data showing the ACS summary file table number, sequence number, variable number, and name of the
table or variable. For example, total population, referred to in EJSCREEN calculations as "pop" and called
"ACSTOTPOP" in the geodatabase, is taken from the Census variable B01001.001, in ACS 5-year summary file
Table B01001. Field names as used in the EJSCREEN geodatabase differ from the names shown below, and a
table was used to map between alternative fieldnames.
These details are based on the ACS 2008-2012 summary files.
55	http://www.census.gov/acs/www/data documentation/areas published/
56	http://www.census.gov/geo/maps-data/data/tallies/tractblock.html
85 | P a g e

-------
ACS Summary File documentation is here:
http://www.census.gov/acs/www/data documentation/summary

-------
Appendix B
ACS Summary File Variables
Tables used from ACS 2008-2012 are shown below. Block group data were obtained from the Census FTP site57,
and are not available from American Fact Finder at block group resolution.
Note that only selected variables from these tables are in the EJSCREEN geodatabase, for performance reasons.
Many of the intermediate variables or detailed breakdowns are not in the geodatabase. Documentation
supplied with the geodatabase download explains the variable names used in the geodatabase.
The URLs that can be used to view and download one ACS 2008-2012 table at a time, for the US total only, are
as follows:
Table 5. ACS Tables Underlying EJSCREEN Demographic Data and Lead Paint Indicator
ACS
Table
ID
URL for US summary table
via American Fact Finder
Table Title
B01001
http://factfinder.census.Rov/bkmk/table
SEX BY AGE
/1.0/en/ACS/12 5YR/B01001/0100000US

B03002
http: / /factfinder, census.eov/bkmk/table
HISPANIC OR LATINO ORIGIN BY RACE
/1.0/en/ACS/12 5YR/B03002/0100000US
B15002
http: / /factfinder, census. eov/bkmk/table
SEX BY EDUCATIONAL ATTAINMENT FOR THE
POPULATION 25 YEARS AND OVER
/1.0/en/ACS/12 5YR/B15002/0100000US
B16002
http: //factfinder, census .eov/bkmk/table
HOUSEHOLD LANGUAGE BY HOUSEHOLDS IN WHICH NO
ONE 14 AND OVER SPEAKS ENGLISH ONLY OR SPEAKS
A LANGUAGE OTHER THAN ENGLISH AT HOME AND
SPEAKS ENGLISH "VERY WELL"
/1.0/en/ACS/12 5YR/B16002/0100000US

C17002
http: //factfinder, census .eov/bkmk/table
RATIO OF INCOME TO POVERTY LEVEL IN THE PAST
12 MONTHS
/I.0/en/ACS/12 5YR/C17002/0100000US

B25034
http: //factfinder, census .eov/bkmk/table
YEAR STRUCTURE BUILT
/1.0/en/ACS/12 5YR/B25034/0100000US

URLs to view and download one ACS 2008-2012 table at a time, for the totals for the US,
every state plus DC, and Puerto Rico are shown below.
57 http://www.census.gov/acs/www/data documentation/data via ftp/
87 | P a g e

-------
Appendix B
B01001
http://factfinder2.census.gov/bkmk/table/1.0/en/ACS/12 5YR/B01001/0100000US10400000US0110400000US0210400000US0410400000US051040000
0US0610400000US08 10400000US09 10400000US1010400000US1110400000US1210400000US13 10400000US15 10400000US1610400000US1710400000U
S18I0400000US19I0400000US20I0400000US21I0400000US22I0400000US23I0400000US24I0400000US25I0400000US26I0400000US27I0400000US2
8 10400000US2910400000US3010400000US3110400000US3210400000US3310400000US3410400000US3510400000US3610400000US3710400000US38I
0400000US3910400000US4010400000US4110400000US42 10400000US4410400000US4510400000US4610400000US4710400000US48 10400000US49 104
00000US5010400000US5110400000US5310400000US5410400000US5510400000US5610400000US72
B03002
http://factfinder2.census.gOv/bkmk/table/l.0/en/ACS/12 5YR/B03002/0100000US10400000US0110400000US0210400000US0410400000US051040000
0US0610400000US08 10400000US09 10400000US1010400000US1110400000US1210400000US13 10400000US15 10400000US1610400000US1710400000U
S18I0400000US19I0400000US20I0400000US21I0400000US22I0400000US23I0400000US24I0400000US25I0400000US26I0400000US27I0400000US2
8 10400000US2910400000US3010400000US3110400000US3210400000US3310400000US3410400000US3510400000US3610400000US3710400000US38I
0400000US3910400000US4010400000US4110400000US42 10400000US4410400000US4510400000US4610400000US4710400000US48 10400000US49 104
00000US5010400000US5110400000US5310400000US5410400000US5510400000US5610400000US72
B15002
http://factfinder2.census.gOv/bkmk/table/l.0/en/ACS/12 5YR/B15002/0100000US10400000US0110400000US0210400000US0410400000US051040000
0US0610400000US08 10400000US09 10400000US1010400000US1110400000US1210400000US13 10400000US15 10400000US1610400000US1710400000U
S18I0400000US19I0400000US20I0400000US21I0400000US22I0400000US23I0400000US24I0400000US25I0400000US26I0400000US27I0400000US2
8 10400000US29 10400000US3010400000US3110400000US3210400000US33 10400000US3410400000US35 10400000US3610400000US3710400000US38 I
0400000US3910400000US4010400000US4110400000US42 10400000US4410400000US45 10400000US4610400000US4710400000US48 10400000US49 104
00000US5010400000US5110400000US5310400000US5410400000US5510400000US5610400000US72
B16002
http://factfinder2.census.gov/bkmk/table/1.0/en/ACS/12 5YR/B16002/0100000US10400000US0110400000US0210400000US0410400000US051040000
0US0610400000US08 10400000US09 10400000US1010400000US1110400000US1210400000US13 10400000US15 10400000US1610400000US1710400000U
S18 10400000US19 10400000US2010400000US2110400000US22 10400000US23 10400000US2410400000US2510400000US2610400000US2710400000US2
8 10400000US2910400000US3010400000US3110400000US3210400000US3310400000US3410400000US3510400000US3610400000US3710400000US38I
0400000US3910400000US4010400000US4110400000US42 10400000US4410400000US4510400000US4610400000US4710400000US48 10400000US49 104
00000US5010400000US5110400000US5310400000US5410400000US5510400000US5610400000US72
C17002
http://factfinder2.census.gov/bkmk/table/1.0/en/ACS/12 5YR/C17002/0100000US10400000US0110400000US0210400000US0410400000US051040000
0US0610400000US08 10400000US09 10400000US1010400000US1110400000US1210400000US13 10400000US15 10400000US1610400000US1710400000U
S18I0400000US19I0400000US20I0400000US21I0400000US22I0400000US23I0400000US24I0400000US25I0400000US26I0400000US27I0400000US2
8 10400000US29 10400000US3010400000US3110400000US32 10400000US33 10400000US3410400000US35 10400000US3610400000US3710400000US38 I
0400000US3910400000US4010400000US4110400000US42 10400000US4410400000US45 10400000US4610400000US4710400000US48 10400000US49 104
00000US5010400000US5110400000US5310400000US5410400000US5510400000US5610400000US72
B25034
http://factfinder2.census.gOv/bkmk/table/l.0/en/ACS/12 5YR/B25034/0100000US10400000US0110400000US0210400000US0410400000US051040000
0US0610400000US08 10400000US09 10400000US1010400000US1110400000US1210400000US13 10400000US15 10400000US1610400000US1710400000U
S18I0400000US19I0400000US20I0400000US21I0400000US22I0400000US23I0400000US24I0400000US25I0400000US26I0400000US27I0400000US2
8 10400000US2910400000US3010400000US3110400000US3210400000US3310400000US3410400000US3510400000US3610400000US3710400000US38I
0400000US39 10400000US4010400000US4110400000US42 10400000US4410400000US45 10400000US4610400000US4710400000US48 10400000US49 104
00000US5010400000US5110400000US5310400000US5410400000US5510400000US5610400000US72
88 | P a g e

-------
Appendix B
TOTAL POPULATION COUNTS AND AGES
SEX BY AGE
Table.ID Sequence.Number Line.Number	Field
B01001
0002
NA

B01001
0002
NA Universe:
Total population
B01001
0002
1
Total:
B01001
0002
2
Male:
B01001
0002
3
Under 5 years
B01001
0002
4
5 to 9 years
B01001
0002
5
10 to 14 years
B01001
0002
6
15 to 17 years
B01001
0002
7
18 and 19 years
B01001
0002
8
20 years
B01001
0002
9
21 years
B01001
0002
10
22 to 24 years
B01001
0002
11
25 to 29 years
B01001
0002
12
30 to 34 years
B01001
0002
13
35 to 39 years
B01001
0002
14
40 to 44 years
B01001
0002
15
45 to 49 years
B01001
0002
16
50 to 54 years
B01001
0002
17
55 to 59 years
B01001
0002
18
60 and 61 years
B01001
0002
19
62 to 64 years
B01001
0002
20
65 and 66 years
B01001
0002
21
67 to 69 years
B01001
0002
22
70 to 74 years
B01001
0002
23
75 to 79 years
B01001
0002
24
80 to 84 years
B01001
0002
25
85 years and over
B01001
0002
26
Female:
B01001
0002
27
Under 5 years
B01001
0002
28
5 to 9 years
B01001
0002
29
10 to 14 years
B01001
0002
30
15 to 17 years
B01001
0002
31
18 and 19 years
B01001
0002
32
20 years
B01001
0002
33
21 years
B01001
0002
34
22 to 24 years
B01001
0002
35
25 to 29 years
B01001
0002
36
30 to 34 years
B01001
0002
37
35 to 39 years
B01001
0002
38
40 to 44 years
89 | P a g e

-------
Appendix B
B01001
0002
39
45 to 49 years
B01001
0002
40
50 to 54 years
B01001
0002
41
55 to 59 years
B01001
0002
42
60 and 61 years
B01001
0002
43
62 to 64 years
B01001
0002
44
65 and 66 years
B01001
0002
45
67 to 69 years
B01001
0002
46
70 to 74 years
B01001
0002
47
75 to 79 years
B01001
0002
48
80 to 84 years
B01001
0002
49
85 years and over
pop= B01001.001
ageunder5m = B01001.003
age5to9m = B01001.004
agel0tol4m = B01001.005
agel5tol7m = B01001.006
age65to66m = B01001.020
age6769m = B01001.021
age7074m = B01001.022
age7579m = B01001.023
age8084m = B01001.024
age85upm = B01001.025
ageunder5f = B01001.027
age5to9f = B01001.028
agel0tol4f = B01001.029
agel5tol7f= B01001.030
age65to66f = B01001.044
age6769f = B01001.045
age7074f = B01001.046
age7579f = B01001.047
age8084f = B01001.048
age85upf = B01001.049
90 | P a g e

-------
Appendix B
RACE/ETHNICITY
HISPANIC OR LATINO ORIGIN BY RACE
Table.ID Sequence.Number Line.Number	Field
B03002
0005
NA

B03002
0005
NA
Universe: Total population
B03002
0005
1
Total:
B03002
0005
2
Not Hispanic or Latino:
B03002
0005
3
White alone
B03002
0005
4
Black or African American alone
B03002
0005
5
American Indian and Alaska Native alone
B03002
0005
6
Asian alone
B03002
0005
7
Native Hawaiian and Other Pacific Islander alone
B03002
0005
8
Some other race alone
B03002
0005
9
Two or more races:
B03002
0005
10
Two races including Some other race
B03002
0005
11
Two races excluding Some other race, and three or
races



B03002
0005
12
Hispanic or Latino:
B03002
0005
13
White alone
B03002
0005
14
Black or African American alone
B03002
0005
15
American Indian and Alaska Native alone
B03002
0005
16
Asian alone
B03002
0005
17
Native Hawaiian and Other Pacific Islander alone
B03002
0005
18
Some other race alone
B03002
0005
19
Two or more races:
B03002
0005
20
Two races including Some other race
B03002
0005
21
Two races excluding Some other race, and three or
races



pop3002 = B03002.001
nhwa = B03002.003
EDUCATIONAL ATTAINMENT FOR THOSE AGE 25+
SEX BY EDUCATIONAL ATTAINMENT FOR THE POPULATION 25 YEARS AND OVER
Table.ID Sequence.Number Line.Number Field
B15002	0043 NA
B15002	0043 NA Universe: Population 25 years and over
B15002	0043 1	Total:
91 | Page

-------
Appendix B
B15002
0043
2
Male:
B15002
0043
3
No schooling completed
B15002
0043
4
Nursery to 4th grade
B15002
0043
5
5th and 6th grade
B15002
0043
6
7th and 8th grade
B15002
0043
7
9th grade
B15002
0043
8
10th grade
B15002
0043
9
11th grade
B15002
0043
10
12th grade, no diploma
B15002
0043
11
High school graduate, GED, or alternative
B15002
0043
12
Some college, less than 1 year
B15002
0043
13
Some college, 1 or more years, no degree
B15002
0043
14
Associate's degree
B15002
0043
15
Bachelor's degree
B15002
0043
16
Master's degree
B15002
0043
17
Professional school degree
B15002
0043
18
Doctorate degree
B15002
0043
19
Female:
B15002
0043
20
No schooling completed
B15002
0043
21
Nursery to 4th grade
B15002
0043
22
5th and 6th grade
B15002
0043
23
7th and 8th grade
B15002
0043
24
9th grade
B15002
0043
25
10th grade
B15002
0043
26
11th grade
B15002
0043
27
12th grade, no diploma
B15002
0043
28
High school graduate, GED, or alternative
B15002
0043
29
Some college, less than 1 year
B15002
0043
30
Some college, 1 or more years, no degree
B15002
0043
31
Associate's degree
B15002
0043
32
Bachelor's degree
B15002
0043
33
Master's degree
B15002
0043
34
Professional school degree
B15002
0043
35
Doctorate degree
age25up = B15002.001
mO = B15002.003 (males age 25+ with zero education)
m4 = B15002.004 (males age 25+ with >0 up to 4th grade)
m6 = B15002.005
m8 = B15002.006
m9 = B15002.007
mlO = B15002.008
mil = B15002.009
ml2 = B15002.010 (males age 25+ with high school diploma)
92 | P a g e

-------
Appendix B
fO = B15002.020
f4 = B15002.021
f6 = B15002.022
f8 = B15002.023
f9 = B15002.024
flO= B15002.025
fll= B15002.026
fl2 = B15002.027
HOUSEHOLDS THAT ARE LINGUISTICALLY ISOLATED
"HOUSEHOLD LANGUAGE BY HOUSEHOLDS IN WHICH NO ONE 14 AND OVER SPEAKS ENGLISH ONLY OR
SPEAKS A LANGUAGE OTHER THAN ENGLISH AT HOME AND SPEAKS ENGLISH "VERY WELL"
Table.ID Seq.Number Line.Number Field
B16002	0044 NA
B16002	0044 NA "Universe: Households"
B16002	0044 1	"Total:"
B16002
0044
2
"English only"
B16002
0044
3
"Spanish:"
B16002
0044
4

No one 14 and over speaks English only or speaks English "very well"
B16002	0044 5
At least one person 14 and over speaks English only or speaks English "very well"
B16002	0044 6 Other Indo-European languages:
B16002	0044 7
No one 14 and over speaks English only or speaks English "very well"
B16002	0044 8
At least one person 14 and over speaks English only or speaks English "very well"
B16002	0044 9 Asian and Pacific Island languages:
B16002	0044 10
No one 14 and over speaks English only or speaks English "very well"
B16002	0044 11
At least one person 14 and over speaks English only or speaks English "very well"
B16002	0044 12 Other languages:
B16002	0044 13
No one 14 and over speaks English only or speaks English "very well"
B16002	0044 14
At least one person 14 and over speaks English only or speaks English "very well"
93 | P a g e

-------
Appendix B
hhlds = B16002.001
lingisospanish = B16002.004
lingisoeuro = B16002.007
lingisoasian = B16002.010
lingisoother = B16002.013
INDIVIDUALS BY RATIO OF INCOME TO POVERTY THRESHOLD
RATIO OF INCOME TO POVERTY LEVEL IN THE PAST 12 MONTHS
Table.ID
Sequence.Number
Line
.Number
Field

C17002
0049
NA



C17002
0049
NA
Universe:
Population for whom poverty
status is determined
C17002
0049
1


Total:
C17002
0049
2


Under .50
C17002
0049
3


.50 to .99
C17002
0049
4


1.00 to 1.24
C17002
0049
5


1.25 to 1.49
C17002
0049
6


1.50 to 1.84
C17002
0049
7


1.85 to 1.99
C17002
0049
8


2.00 and over
povknownratio = C17002.001
pov50 = C17002.002 (below 0.50 times poverty threshold)
pov99 = C17002.003 (0.5 to 0.99 times poverty threshold)
povl24 = C17002.004
povl49 = C17002.005
povl84 = C17002.006
povl99 = C17002.007
pov2plus = C17002.008
AGE OF OCCUPIED HOUSING UNITS (CORRELATED WITH LEAD PAINT)
YEAR STRUCTURE BUILT
Table.ID Sequence.Number Line.Number	Field
B25034	0104	NA
94 | P a g e

-------
Appendix B
B25034
B25034
B25034
B25034
B25034
B25034
B25034
B25034
B25034
B25034
B25034
0104
0104
0104
0104
0104
0104
0104
0104
0104
0104
0104
NA
1
2
3
4
5
6
7
8
9
10
Universe: Housing units
Total:
Built 2010 or later
Built	2000 to 2009
Built	1990 to 1999
Built	1980 to 1989
Built	1970 to 1979
Built	1960 to 1969
Built	1950 to 1959
Built	1940 to 1949
Built 1939 or earlier
builtunits = B25034.001
builtl950tol959 = B25034.008
builtl940tol949 = B25034.009
builtprel940 = B25034.010
Calculated Demographic Data Fields
Based on the raw counts from the ACS described above, various demographic variables were calculated for use
in EJSCREEN. Conditional formulas below are in R syntax, and generally indicate that a value of zero was used in
cases where the denominator was zero, to avoid division by zero. For example, the formula "pctmin =
ifelse(pop==0,0, as.numeric(mins ) / pop)" indicates that percent minority was calculated as the ratio of
number of minorities over total population of a block group, but was set to zero if the population was zero.
#	RACE/ETHNICITY COMBINED, CALCULATED VARIABLES
mins = pop - nhwa
pctmin = ifelse(pop==0,0, as.numeric(mins ) / pop)
#	POVERTY, LOW-INCOME CALCULATED VARIABLES
#	poverty ratios
num2pov = numlpov + povl24 + povl49 + povl84 + povl99
lowinc = num2pov
pct2pov = ifelse( povknownratio==0,0, num2pov/povknownratio)
pctlowinc = pct2pov
num2pov.alt = povknownratio - pov2plus
pct2pov.alt = ifelse( povknownratio==0,0, num2pov.alt/povknownratio)
95 | P a g e

-------
Appendix B
#	EDUCATIONAL ATTAINMENT CALCULATED VARIABLES
Iths = mO + m4 + m6 + m8 + m9 + mlO + mil + ml2 +
fO + f4 + f6 + f8 + f9 + flO + fll + fl2
pctlths = ifelse(age25up==0,0, as.numeric(lths ) / age25up)
#	LINGUISTIC ISOLATION CALCULATED VARIABLES
lingiso = lingisospanish + lingisoeuro + lingisoasian + lingisoother
pctlingiso = ifelse( hhlds==0,0, lingiso / hhlds)
#	AGE GROUPS CALCULATED VARIABLES
under5 = ageunder5m + ageunder5f
pctunder5 = ifelse( pop==0,0, under5/pop)
over64 = age65to66m + age6769m + age7074m + age7579m + age8084m + age85upm +
age65to66f + age6769f + age7074f + age7579f + age8084f + age85upf
pctover64 = ifelse( pop==0,0, over64/pop)
#	HOUSING CALCULATED VARIABLES (LEAD PAINT INDICATOR)
prel960 = builtprel940 + builtl940tol949 + builtl950tol959
pctprel960 = ifelse( builtunits==0,0, prel960/builtunits)
96 | P a g e

-------
Appendix B
Uncertainty and Limitations in the Demographic Data
Uncertainty in demographic data
As with every sample survey, sampling results in unavoidable approximations in every estimate that comes
from the survey. The Census Bureau clearly labels every data item as an "estimate," and accompanies each
with an estimate of its margin of error. Anyone using a screening tool should be aware of those demographic
uncertainties, together with uncertainties in the environmental measures, in tables, graphical displays and
descriptive materials.
Uncertainties are also discussed in section 1 (as general caveats), section 2 (with regard to buffer reports), and
Appendix B (in discussions of buffering details and demographic data).
Users of EJSCREEN must keep in mind the substantial uncertainty in estimated demographic and environmental
indicators used in screening tools such as EJSCREEN. Uncertainty is a critical consideration when using
EJSCREEN because the tool relies on demographic and environmental estimates at block group resolution. As
the Census Bureau makes clear in documentation of the American Community Survey (ACS), the margin of
error for an estimate in a given block group is often very large relative to the estimate, so an estimate of
percent low-income, for example, is often very uncertain for a single block group.
Combined with uncertainty in environmental data, this means EJ index values are often very uncertain at block
group resolution. Therefore, modest differences in percentile scores between block groups or small buffers
should not be interpreted as meaningful because of the uncertainties in demographic and environmental data
at the block group level. We do not have a high degree of confidence when comparing or ranking places with
only modest differences in estimated percentile. For this reason, it is critical that EJSCREEN results be
interpreted carefully and that additional information be used to supplement or follow up on screening, where
appropriate. Section 1 of this document discusses caveats and limitations further.
EPA cannot provide precise confidence intervals on EJ indexes or percentiles due to technical limitations in the
data made public by the Census Bureau and the challenges of quantifying uncertainty for the environmental
indicators. Technical documentation on methods and challenges in estimating uncertainty for calculated
demographic indicators using the ACS is available from the Census Bureau58 (with challenges described in
related technical documents59). ESRI also provides useful discussions of margin of error.60
It is likely that block group errors in the various data fields reported by Census (e.g., count with income-poverty
ratio below 0.5, count with ratio 0.5 to 1, etc.) are correlated. Relevant covariances, however, are not provided
by the Census Bureau. This means simple methods of approximating margin of error for a calculated variable
(e.g., percent low-income) may not be entirely adequate. In this case, it appears that a custom tabulation by
58http://www.census.gov/acs/www/Downloads/data documentation/Statistical Testing/2011StatisticalTesting3and5vear
.pdf
59	http://www.census.gov/acs/www/Downloads/data documentation/Accuracv/MultivearACSAccuracvofData2011.pdf
60	http://www.esri.com/software/american-communitv-survev/understanding-margin-error
97 | P a g e

-------
Appendix B
the Census Bureau would be the most accurate way to generate reliable estimates of the margin of error for
variables such as the percent low-income or the demographic indicator, for use in creating confidence intervals
around an EJ index or the percentile of that index. Future research may be able to produce reasonable
approximations of confidence intervals around block group or buffer estimates. EJSCREEN users should keep in
mind that using a buffer larger than the local block groups will produce more reliable estimates than a single
block group can provide.
Using 2x poverty rate
The rationale for using twice the poverty threshold rather than just the poverty threshold includes the
following considerations:
•	The effects of income on baseline health and probably on other aspects of susceptibility are not
limited to those below the poverty thresholds — those from lx to 2x poverty also have worse
health overall than those with higher incomes (Centers for Disease Control and Prevention, 2010),
and asthma rates, for example, begin to increase as income falls below twice the poverty threshold
(Centers for Disease Control and Prevention, 2011a).
•	Many studies in various fields use 2x poverty, and many others use lx poverty (e.g., see Su et al.,
2009); the same is true for prior EPA screening tools. There is precedent for both. However, a
rationale often mentioned is that today's poverty thresholds are too low to adequately capture the
populations adversely affected by low income levels, especially in high-cost areas. Some analysts
have concluded that the amount of income actually required for basic living costs without
government support is far higher than the current Federal poverty thresholds (Cauthen & Fass,
2008).
•	When using twice the poverty threshold, the number or percent low income happens to roughly
equal number or percent minority in the United States. This makes it convenient and simple to use
the average of the two without applying any other weights to them, and in this way each low-
income person affects the susceptibility indicator about as much as each minority person.
•	The Census Bureau has been developing experimental poverty measures that account for local
costs of living, but these are not yet in widespread use.61
Interpretation of Demographic Indexes
The demographic indexes are meant to reflect some of the combined impacts of multiple demographic factors.
The Census Bureau does not provide a tabulation of low-income residents by race/ethnicity at the block group
level62, so it is impossible to know what percentage of a block group is low-income minorities vs. low-income
non-minorities, for example. EJSCREEN simply defines the demographic index as the average of the percentage
of people who are low income and the percentage of people who are minorities. Therefore, this demographic
61	https://www.census.gov/hhes/povmeas/
62	Table B17001 and related tables provide tract-level cross-tabulations of race-ethnicity and poverty, but not percent low-
income as defined in EJSCREEN.
98 | Page

-------
Appendix B
index will be (equal to or) smaller than the percentage of people who are in at least one of these groups. In
other words, it is typically smaller than the share of people who are in one or more of these groups - just low
income, just minority, or both. The average will also be (equal to or) larger than the percentage of people who
are simultaneously in all of these groups. It is larger than (or equal to) the share of people who are
simultaneously low income and minority. The value of the demographic index is almost always larger than the
number of people who are simultaneously minority and low-income, because usually some people are in only
one of these demographic groups. Note that one person cannot be under five and over 64, so any one person
can be in up to five of the six demographic groups used in EJSCREEN.
The demographic index is also bounded by these two percentages (percent low income and percent minority).
For example, the actual percentage minority is larger than the value of the demographic index, if the
percentage low-income is lower than the percentage minority.
99 | P a g e

-------
Appendix C
APPENDIX C. TECHNICAL DETAILS ON PROXIMITY INDICATORS
Several of EJSCREEN's environmental factors are direct or indirect estimates of potential exposure or health
risks, such as the NATA cancer risk estimates and the ozone and PM2.5 concentration estimates. There are other
aspects of an individual's or a community's environmental concerns that are less readily quantified in terms of
emissions, concentrations, or risk estimates.
People may be concerned about living near facilities that handle hazardous substances, and other potential
sources of pollution, such as highways or abandoned waste sites. Concern over "locally undesirable land uses",
or LULUs, is in some cases founded on the potential for routine or episodic releases of pollutants to the air, land
or water, and the potential for such releases to cause human health or environmental adverse effects or other
societal disamenities.
The purpose of the proximity measures in EJSCREEN is to systematically and consistently quantify different
degrees of potential for these effects. We have developed a method to calculate a score that represents the
relative magnitude of the proximity of the population within a block group to facilities, waste sites, or traffic
surrounding it. A block group with more facilities closer to the block group's residential population will have a
higher score than a block group where facilities are further away. We have applied this method to these facility
or site types:
•	National Priorities List (NPL) sites (a key subset of "Superfund," sites).
•	Hazardous waste Treatment, Storage or Disposal Facilities (TSDFs), subject to regulations under the
Resource Conservation and Recovery Act (RCRA).
•	Risk Management Plan (RMP) facilities, which are facilities that maintain greater than certain
quantities of extremely hazardous substances, and are required to take certain actions, including
filing risk management plans, under section 112 (r) of the Clean Air Act.
•	Major direct dischargers to water permitted under the National Pollutant Discharge Elimination
System (NPDES).
We have developed a similar approach to represent proximity to and traffic volume on nearby highways.
In the sections below, we will describe the general approach, in terms of facility proximity. We will then
describe how it differs for traffic proximity. Then we will discuss certain adjustments we have made, mostly to
make the approach computationally efficient, and summarize the data sources and computational routine that
we applied to implement this approach. We conclude with caveats and other observations.
Calculating Proximity to Facilities or NPL Sites
Each of the more than 217,000 block groups for the U.S. states and the District of Columbia is made up of
between one block and several hundred blocks. Most block groups nationwide are smaller than approximately
0.5 square miles, an area that if circular would have a radius of about 640 meters. In block groups of this
median size, the average residence generally would be about 430 to 720 meters (or less than half a mile) away
100 | P a g e

-------
Appendix C
from a given point within the block group, such as a facility, as explained at the end of Appendix C. About 20-
25% of block groups covered an area smaller than a circle of radius 300-350 meters (almost one quarter of a
mile), as of the 2005-2009 geographies. Also, a very small number of block groups are extremely large in area,
in very rural locations.
All of a block group's blocks may have residential population estimated by the 5-year ACS, or only some, and
some block groups have no residents at all. Blocks and block groups vary greatly in geographic area, and in
population. The approach used here works first at the block level, based on measures of proximity to the
facilities in or near the blocks. The block-level measures are then aggregated among all the blocks within a
block group, weighted by the number of people in the different blocks.
Thus, while population is considered in aggregating the block scores, the measure does not increase or
decrease for block groups with higher or lower populations. The measure is, rather, a characteristic of the
residents of the block group, in the same way that cancer risk from NATA or ozone concentration are estimated
measures of the conditions of those places.
Let
i represent a particular facility
j represent a block within a block group
k represent a block group
dij is the distance, in kilometers, from block j's centroid to the given location of facility i
popjk is the estimated population of block j within block group k
popk is the total estimated population of block group k
f(dij) is a function representing the proximity of facilty i to block j, a declining function of the distance,
dij
BlockScorejk is the aggregation of the proximity influences of all facilities affecting block jk
BlockGroupScorek is the population-weighted aggregation of the block group's component blocks
We have chosen to define the proximity function as
f(dij) = 1 / dij
That is, a facility 1 kilometer from a block's population contributes twice the score as a facility 2 kilometers
from the same block. We note that we have made a choice in using inverse distance for this function. Air
dispersion modeling for pollutants following Gaussian plume assumptions would show a generally greater
drop-off in concentration, roughly with the second power to 2.5 power of one over distance. But actual
concentrations around individual plants follow often-complex patterns that depend on the particular mix of
stack vs. fugitive emissions, characteristics of stack height, exit velocity and temperature, the presence of
buildings or other land surface characteristics and meteorology. Some substances react readily with other
substances in the atmosphere, or precipitate out readily. It is not uncommon for concentrations to rise for
101 | P a g e

-------
Appendix C
some distance from the emitting source, and then to fall from that peak concentration. The Gaussian plume
model applies to gases, and emissions of particulates can drop off more quickly than gases.
Releases to land may follow extremely complex patterns of dispersion. Added to that are the very site-specific
characteristics of potential human exposure via drinking water, vapor intrusion or contact with contaminated
soils, etc. For water pollution, similar complexities exist, most notably that an effluent is carried away
downstream of a running body of water, dilution can be complicated by the presence of other water entering
stream segments, by volatilization, by biological and chemical interactions, and by deposition to sediments, and
finally by the treatment and removal of a water pollutant sent to a publicly-owned treatment works.
We also note that researchers and others have taken varied approaches to representing the proximity of
facilities to populations. The EJSM model of environmental justice concerns, developed for the state of
California, scored facility proximity in concentric rings around a population centroid (Pastor Jr., Morello-Frosch,
& Sadd, 2010; Sadd, Pastor, Morello-Frosch, Scoggins, & Jesdale, 2011). All facilities within 1 mile received a
score of 3. All within the 1 to 3 mile band received a score of 2, and those between 3 and 5 miles received a
score of 1. Anything beyond 5 miles received a score of zero. This step-wise scoring represents the judgment of
the model developers, influenced by interactions with various stakeholders.
Finally, we note that EJSCREEN's measure of proximity is intended to represent more than simply real or
potential human health adverse effects coming from exposure. Some parts of the environmental justice
literature reflect semi-quantitative factors, such as increased psychological stress, fear and other reactions to
the presence of LULUs. This is not the forum for sorting through those factors.
However, we have made a judgment call: For the purposes of this EJSCREEN tool, we represent a facility's
measure of proximity by the inverse of its distance from the estimated location of the average person. A block's
proximity score is the sum of the inverse distances of all the facilities of a particular type.
Note that for the minority of block groups in the United States with no residential population, we take a
straight average of the block scores.
The units for these measures are facilities per kilometer. A block group could have a score of 1.0 if all residents
were an average of one kilometer from a single facility, and all other facilities were so distant (> 5 km) as to
make no contribution to the score. Another block group could have a score of 1.0 if there were five facilities
that were all exactly five kilometers from the residents.
Calculating Proximity to Traffic
We have adopted essentially the same approach described above for representing proximity to highway
segments - an inverse distance-weighted sum of highway segments surrounding each block, and a population-
weighted sum of the individual blocks' contributions to the block group.
The highway segment database that we have used is described in section 3. These segments differ from a
facility database in that they are lines on a geographic area, rather than points that represent the facilities. In
102 | P a g e

-------
Appendix C
our approach, we find the distance from the block centroids to the nearest part of each surrounding highway
segment. The nearest point, dij, could be an end of the highway segment or some point in between the ends.
We also multiplied each dij by the annual average daily traffic estimate that is associated with each highway
segment. This is meant to reflect the traffic intensity, and this differs from the facility approach, where we have
taken each facility within each group as having equal importance. Also, for traffic proximity, the search radius is
500 meters and the score uses distance in meters, not kilometers.
Calculating Proximity - Additional Details
We address two modifications to the general method described above. The first deals with instances where a
facility or highway segment location is very close to the centroid of the block. The second is an accommodation
to the computational intensity of the general method.
Extremely Small dij Values
Our intention is to represent the proximity of facilities or highway segments to the population within each
block. All facilities and each part of all highway segments fall within one block. By chance, some portion of
those points fall very close to the block centroids.
We do not know how the population is geographically distributed within any block, but we assume that people
are more likely to be distributed across the blocks' expanses than to be concentrated at one point, such as the
centroid. In fact for rural, suburban and many non-high rise urban areas, people's residences are more likely to
be closer to the blocks' peripheries (bounded by roads) than clustered at the centroids. Thus, when a facility
location happens to be very close to the block centroid, it would result in an artificially high contribution to the
block's score. This is not a hypothetical problem: We have observed dij values well below 100 meters, and some
below 10 meters.
In looking for solutions to the problem, we conducted analyses and arrived at the approach we have adopted.
Blocks vary widely in their total area and in their shapes. Both can be found in the Census Bureau's Tiger shape
files. Dealing explicitly with the individual block shapes would be computationally very intensive because there
are over 11 million blocks. Since we cannot easily find out how the residents are actually distributed in those
areas, we made two simplifying assumptions:
•	residents are evenly distributed across the surface area of each block, and
1/2
•	each block can be represented by a circle whose radius is [Block area / Pi] .
We call this latter value the Block Area Equivalent Radius.
Our investigations indicate that for any dij less than 90% of the Block Area Equivalent Radius, 0.9 times that
value is a reasonable representation of the average distance from the facility for all residents in the block. We
call this the dij corrected-
103 | P a g e

-------
Appendix C
Our computational scheme determines the dij values as described above, tests for the comparison with 0.9 *
Block Area Equivalent Radius, and substitutes dij corrected values. We found that we needed to make that
correction for less than 1% of all facility / block combinations in an early testing data set that used 2005-2009
ACS data.
Accommodating to Computational Intensity - Combine a Distance Limit with a Nearest Facility
Approach
Our task is to compute a proximity score for each of the facility or site types and highway segments for each of
the more than 217,000 block groups, comprised of over 11 million blocks. The number of facilities nation-wide
varies from hundreds of TSDFs to many thousands of RMP facilities. Computing all the combinations would
require more computational time and resources than were available.
In addition, doing so would be wasteful and perhaps irrelevant. The one over distance function we have chosen
to represent concerns about facilities and highways drops off greatly for most facilities beyond the nearest
ones. The miniscule contribution of a facility 100 kilometers or more from a block is not only small, compared
with those that may be within 5 to 10 kilometers, but has little common-sense meaning, in our view.
Consequently, we have followed the general approach described above only for facilities or sites within 5
kilometers of a block's centroid, and within 500 meters for highway segments. Depending on the facility or site
type, we find that 30-40% of block groups have at least one facility (RMP, TSDF, or NPDES) within the 5
kilometers limit, and almost 10% have one or more NPL sites within 5 km, in the 2015 version of EJSCREEN.
Of course, every block and block group has one nearest facility, even though it may be beyond the 5 kilometers
horizon, and some of those may be fairly close to that limit. We have also calculated the distance to the facility
nearest to each of the blocks. For those blocks lacking anything within the 5 kilometers, we represented the
facility proximity by one over the distance to that single nearest facility.
This added computational complexity to the approach, but at far less cost than computing the full matrix of
millions of blocks times thousands of facilities and sites.
This hybrid approach results in every block (and thus every block group) having a nonzero, positive proximity
score. All of the resulting block proximity scores are necessarily less than the score had we computed the full
matrix, but we judge that this is a reasonable and practical compromise. Figure 1 shows histograms for
proximity scores. Counting only the single nearest beyond 5 km has the effect of shifting scores under 0.2 to
the left, to lower scores than if all were counted, but the graphs show no major discontinuities, suggesting this
limitation (counting only the nearest one) has little impact overall.
Data and Computational Scheme
Using the Census 2010 block boundaries, the distance to all facilities within 5 kilometers of all blocks (not just
block groups) was determined, and distance to the nearest facility at any distance was determined if none were
found within 5 km.
104 | P a g e

-------
Appendix C
The dij values were compared to the 0.9 * Block Area Equivalent Radius and corrected values were used when
necessary, before computing 1 / dij. The 1 / dij values were summed for each block to compute the
BlockScorejk- These were then rolled up to the block group level, applying the population weighting described
above, for the final BlockGroupScorek .
Caveats and Observations
Several aspects of the proximity analysis approach have been mentioned above, but deserve summary here.
•	We recognize that our selection of the inverse of distance is a design choice that represents our
judgment of a balance among competing factors.
•	We recognize that one could potentially attempt to distinguish among facilities within each facility
category by quantitative or qualitative measures of importance. These could include total pounds
released or toxicity-weighted releases for NPDES facilities; the number of accidental releases
and/or their apparent severity for RMP facilities; some classification of the likelihood of releases for
NPL sites or TSDFs; and general indications of scale for all of them. We note that CalEnviroscreen
has addressed this issue to some extent, and that the RSEI tool based on TRI data may be relevant
to future work on this issue. At this point, we have chosen not to develop any such potential scaling
adjustments.
•	We recognize that all location data are subject to potential error. While we have high confidence in
the block centroid locations, we know that the facility or site or roadway location data may contain
larger or smaller errors, and that for large facilities or sites, one point may not be an entirely
adequate representation of the location of its releases or of neighbors' perceptions.
•	We recognize that the computational accommodation we describe above results in a hybrid of
measures: For some block groups, all blocks have one or more facilities within 5 kilometers and the
score is the summation of all those potentially multiple facility/block combinations; for other block
groups, none of the blocks have a facility within 5 kilometers and the score is the contribution of
the single facility closest to each block; while for some block groups, we have a mix of those
situations. We believe that this is a reasonable compromise
105 | P a g e

-------
Appendix C
Note: The method described in the above section would be the case if homes and facilities were on average uniformly
distributed within block groups that were roughly circular on average, because the average distance between two random
points in a circle of radius R is 90% of R (Weisstein, Eric W. "Disk Line Picking." From Math World—A Wolfram Web
Resource. http://mathworld.wolfram.com/DiskLinePicking.html). This means that if a population is randomly spread over a
roughly circular block, a facility in the block typically would be 0.9R from the average person. Also, the average point in the
circle is 0.67R from the center, and 1.13R from the edge of the circle. We can describe this relationship using an equation
that is a portion of the formula for the distance between two random points in a circle of radius=l. The formula is
C f ~ \[ a + k2 - (2 k ^) cos(t) dt da
where b= the facility's distance from the center as a fraction of the radius, and the integral over a represents distances of
residences from the center. We can solve this equation using http://WolframAlpha.com, for b=0, 0.5, or 1, representing
points at the center, halfway to the edge, and at the edge of the circle. For example, we can use this equation for b=0.5 to
find that the average person, if randomly located in a circle of radius R, is a distance of about 0.8 R from a facility that is
halfway between the center and edge of the circle. For the distance between the average person and a randomly placed
facility in the circle, we use b=sqrt(0.5) instead, and the following would be used as the input to WolframAlpha:
lntegrate[(l/Pi) Sqrt[a + (Sqrt(0.5))A2 - 2 * (Sqrt(0.5)) * Sqrt[a] cos(t)], {a, 0, 1}, {t, 0, pi}] or
http://www.wolframalpha.com/input/?i=lntegrate%5B%281%2FPi%29+Sqrt%5Ba+%2B+%28Sqrt%280.5%29%29%5E2+-
+2+*+%28Sqrt%280.5%29%29+*+Sqrt%5Ba%5D+cos%28t%29%5D%2C+%7Ba%2C++0%2C++l%7D%2C++%7Bt%2C+0%2C+
pi%7D%5D+
106 | P a g e

-------
Appendix D
APPENDIX D. SUMMARY STATISTICS FOR INDICATORS
This appendix provides basic information and statistics on the environmental and demographic data used in
EJSCREEN.
Table 6 shows summary statistics for the 12 environmental indicators, including the population mean and
selected population percentiles. The mean score and high percentiles for each indicator provide useful
perspective on the magnitude of these environmental factors for the average or highly exposed individuals.
The mean neurological Hazard Index (HI) for example, is only xx, meaning that the estimated mean exposure is
only x% as high as the health-based Reference Concentration, and even the 99th percentile value is only
approximately xx. The respiratory HI, however, has a mean above xx, meaning the RfC is typically exceeded by a
factor of xx.63
ATA update pending
In the 2015 version of EJSCREEN, the PM2.5 level in the average person's block group (based on tract estimates)
was 9.7 ng/m3. Roughly 7-8% of U.S. residents had block group (based on tract) estimates above 12 ng/m3 in
these 2011 estimates. Note that 15 ng/m3 was the health-based annual ambient standard as of mid-2012, with
a revised standard of 12 ng/m3 finalized in December 2012. However, it is important to understand that the
EJSCREEN value does not indicate nonattainment of a standard because it is based on 2011 estimates of each
block group, from a combination of modeling and monitoring, while nonattainment is determined by individual
monitors (each intended to represent a relatively large area, often a county), based on three recent years of
monitoring data. Likewise, the ozone indicator cannot be compared to the ozone ambient standard.
pen
The NATA cancer risk mean	people, or a lifetime individual
cancer risk of xxlO"5, which is orders of magnitude lower than typical premature mortality risk estimates
associated with recent ambient levels of PM2.5.
The facility proximity indicators generally have mean scores of 0.05 to 0.31. The NPL's mean score of 0.10 is
comparable to the average person having one NPL site 10 kilometers away. The mean RMP score of 0.31 could
result from one RMP facility at 3.2 kilometers distance, for example, and the median is roughly 0.14 (or about
1/7), meaning that most of the U.S. population has at least one RMP within about 7 km of their home. About a
third of the population lives within 5 km of an RMP facility, but less than 7% of the population has any within 1
km.
Table 7 shows a similar set of statistics for the demographic data used in EJSCREEN. The overall US percents low
income and minority were 34% and 36% respectively, and the medians were somewhat lower. The 80th
percentiles were 53% and 69%. The top 5% lived in block groups with a Demographic Index above 80%. Note it
is more common to see block groups close to 100% minority than close to 100% low income.
63 High values for the neurological HI tend to be driven by exposure to acrolein above the Reference Concentration, which
is designed to protect against the risk of nasal lesions as the health endpoint.
107 | P a g e

-------
Appendix D
Table 6. Summary Statistics for Environmental Indicators
Environmental Indicator
Missing
Minimum
25%ile
50%ile
(median)
Pop.
Mean
75%ile
80%ile
90%ile
95%ile
99%ile
PM 2.5
1664
4.3
8.8
9.9
9.8
11
11
12
12
13
Ozone
1664
25
42
46
46
51
51
54
57
66
NATA DPM










NATA cancer risk



fA update pending|




NATA respiratory HI










NATA neurological HI










% pre-1960 (lead paint)
0
0.00
0.05
0.21
0.30
0.50
0.59
0.77
0.87
0.96
Proximity Traffic
253
0.00
11
35
110
100
130
250
430
1,200
Proximity NPL
0
0.00
0.02
0.05
0.10
0.10
0.12
0.19
0.31
0.88
Proximity RMP
0
0.00
0.08
0.14
0.31
0.30
0.39
0.76
1.2
2.4
Proximity TSDF
0
0.00
0.01
0.02
0.05
0.05
0.07
0.11
0.18
0.53
Proximity NPDES
0
0.00
0.08
0.13
0.25
0.24
0.30
0.55
0.89
1.9
Source: 2015 version of EJSCREEN. See body of report for sources and definitions of environmental indicators.
Notes: Population percentiles (and means) are shown, not block group percentiles (or means), so 80%ile means 80% of the population has a lower (or
exactly tied) block group score. Values in table have been rounded to two significant digits, except for PM, ozone, and DPM, which use three significant
digits. Numbers may differ slightly from those in EJSCREEN reports. Summary statistics for a given environmental factor exclude block groups where that
environmental indicator was not available (missing).
108 | P a g e

-------
Appendix D
Table 7. Summary Statistics for Demographics
Demographic variable
Missing
Minimum
25%ile
50%ile
(median)
Population
mean
75%ile
80%ile
90%ile
95%ile
99%ile
Demographic index
0
0
17%
29%
35%
50%
57%
71%
80%
90%
% low-income
0
0
17%
30%
34%
48%
53%
65%
74%
87%
% minority
0
0
9%
26%
36%
60%
69%
89%
96%
100%
% less than high school
0
0
5%
11%
14%
21%
24%
33%
42%
60%
% linguistic isolation
0
0
0%
1%
5%
6%
8%
16%
25%
44%
% under 5
0
0
4%
6%
7%
9%
10%
12%
14%
18%
% over 64
0
0
7%
12%
13%
17%
19%
23%
28%
44%
Supplementary demog.
index
0
0
11%
16%
18%
24%
27%
34%
39%
47%
Source: 2015 version of EJSCREEN. Calculated based on 2008-2012 5-year summary file, American Community Survey (ACS), from the US Census Bureau.
Note: Population percentiles (and means) are shown, not block group percentiles (or means), so 80%ile means 80% of the population has a lower (or
exactly tied) block group value. Values in table have been rounded to an integer percentile 0-100. Numbers may differ slightly from those in EJSCREEN
reports.
109 | P a g e

-------
Appendix D
Table 8 shows Spearman correlations in block group level scores for all pairs of the 12 environmental indicators
in the 2015 version of EJSCREEN. All correlations are positive except that ozone is weakly, sometimes
negatively, correlated with the other indicators. The strongest positive correlations were among the NATA-
derived factors: cancer, neurological HI, respiratory HI, and diesel particulate matter indicators. The TSDF
proximity indicator was correlated with these four. Similarly, the traffic indicator was correlated with the NATA
factors. All other coefficients were less than 0.50.
110 | P a g e

-------
Appendix D
Table 8. Spearman Correlation Coefficients for Environmental Indicators
cancer resp. neuro. Traffic % pre-	NPL	RMP TSDF NPDES

PM2.5
ozone
DPM
risk
HI
HI
proximity
1960
proximity
proximity
proximity
proximity
PM2.5
1
0.23




0.22
0.22
0.21
0.30
0.34
0.18
Ozone
0.23
1




-0.03
-0.16
-0.09
0.03
-0.02
-0.06
DPM


1









cancer risk



1


|NATA update pending|

resp. HI




1







neuro. HI





1






Traffic proximity
0.22
-0.03




i
0.17
0.33
0.3
0.35
0.3
% pre-1960
0.22
-0.16




0.17
1
0.18
0.17
0.08
0.17
NPL proximity
0.21
-0.09




0.33
0.18
1
0.2
0.37
0.25
RMP proximity
0.30
0.03




0.3
0.17
0.2
1
0.25
0.35
TSDF proximity
0.34
-0.02




0.35
0.08
0.37
0.25
1
0.17
NPDES proximity
0.18
-0.06




0.3
0.17
0.25
0.35
0.17
1
Source: 2015 version of EJSCREEN. See body of report for sources of environmental indicators.
Ill | P a g e

-------
Appendix D
Table 9. Spearman Correlation Coefficients for Demographic Indicators
% less	%
% % low- than high linguistic
minority income school isolation % under 5 % over 64
% minority
1
0.41
0.43
0.51
0.23
-0.34
% low-income
0.41
1
0.67
0.23
0.24
-0.15
% less than
high school
0.43
0.67
1
0.32
0.19
-0.05
% linguistic
isolation
0.51
0.23
0.32
1
0.17
-0.2
% under 5
0.23
0.24
0.19
0.17
1
-0.31
% over 64
-0.34
-0.15
-0.05
-0.2
-0.31
1
Source: 2015 version of EJSCREEN. Calculated based on 2008-2012 5-year summary file, American
Community Survey (ACS), from the US Census Bureau.
Table 9 shows the Spearman correlations in block group level scores for all pairs of the 6 demographic
factors. All correlations are positive except those between % over 64 and all other factors.
Figure 1 shows histograms or density plots of the environmental indicator data, showing the simple
distribution across block groups for each of the 12 indicators (i.e., these figures show the distribution of
block groups, not a population distribution, but the population distribution is very similar).
112 | P a g e

-------
Appendix D
Figure 1. Histograms of Block Group Environmental Indicators as ratio to mean value (log scale shows mean value as zero)
Q_
Z3
O
i_
O)
o
_o
_Q
A
	1	
0
PM2.5
Q_
Z5
O
i—
CD
-2
~~r
-1
Ozone
Q_
Z5
O
s	
O)
-2
-1
1
Air toxics cancer risk
ATA update pending
Q_
ZJ
O
s	
O)
-2
~~r
-1
1
Air toxics neuro. HI
CO
Q_
=5
O
o
o
Air toxics resp. HI
V)
Q_
=5
O
J*?
o
o
Diesel PM
(/)
Q_
Z5
O
O)
O
o
CO
Q_
=5
O
O)
o
o
Lead paint (% pre-1960)
^TmnnTTTirTFIITIITrmmT^	
Traffic proximity
O
O)
o
o
-2
NPL proximity
Q.
=3
O
O)
o
_o
_Q
	rtltflllllllUltTtTTlTTmT^	
-2 -1
RMP proximity
o
O)
o
o
-2
TSDF proximity
o
O)
o
o
r~
-2

~r
-1
NPDES proximity
Source: 2015 version of EJSCREEN. Note: Some extreme values are not shown on the x axis (for proximity indicators) and y axis (for ozone).
113 | P a g e

-------
Appendix D
[INTENTIONALLY BLANK]
114 | P a g e

-------
Appendix E
APPENDIX E. FORMULAS FOR DEMOGRAPHICS AND EJ INDEXES
The EJ indexes rely on demographic indexes combined with environmental indicators. The demographic
and EJ indexes are calculated as follows:
Demographic Index
This is the average of percent minority and percent low income in the block group. Percent low income
is defined in Appendix B, and is essentially all residents where household income is below twice the
federally defined poverty threshold, as a percentage of all those for whom this poverty ratio could be
determined (typically known for the vast majority of the block group's population).
Demographic Index = (% minority + % low-income) / 2
Supplementary Demographic Index
This is the average of percent minority, percent low income, percent less than high school education,
percent linguistically isolated, percent individuals under age 5, and percent individuals over age 64 in the
block group.
Supplementary Demographic Index =
(% minority + % low-income + % less than high school education + % linguistic isolation +
% individuals under age 5+ % individuals over age 64) / 6
115 | P a g e

-------
Appendix E
EJ Index
The EJ Index measures how much a particular place contributes to overall nationwide differences in
environmental indicator values between demographic groups. This EJ index is a combination of a block
group environmental factor, the population of the block group, and the demographic composition of the
block group. In this index, the demographic composition of the block group is the difference between
the block group's composition and the national average, as measured by the demographic index.
EJ Index =
(Environmental Indicator)
X (Demographic Index for Block Group - Demographic Index for US)
X (Block Group Population)
EJ Index with Supplementary Demographic Index
This EJ index is a combination of a block group environmental factor, the population of the block group,
and the demographic composition of the block group. In this index, the demographic composition of the
block group is the difference between the block group's composition and the national average, as
measured by the supplementary demographic index.
EJ Index with supplementary demographics =
(Environmental Indicator)
X (Supplementary Demog. Index for Block Group - Supplementary Demographic Index for US)
X (Block Group Population)
Supplementary EJ Index 1 Based on Demographic Index
This EJ index is a combination of a block group environmental factor, the population of the block group,
and the demographic index. This EJ index measures how much a particular place contributes to the total
burden faced by the subpopulations highlighted by the demographic index.
Supplementary EJ Index 1 =
(Environmental Indicator)
X (Demographic Index for Block Group)
X (Block Group Population)
116 | P a g e

-------
Appendix E
Supplementary EJ Index 1 Based on Supplementary Demographic Index
This EJ index is a combination of a block group environmental factor, the population of the block group,
and the supplementary demographic index. This EJ index measures how much a particular place
contributes to the total burden faced by the subpopulations highlighted by the supplementary
demographic index.
Supplementary EJ Index 1
with supplementary demographics =
(Environmental Indicator)
X (Supplementary Demographic Index for Block Group)
X (Block Group Population)
Supplementary EJ Index 2 Based on Demographic Index
This EJ index is a combination of a block group environmental factor and the demographic index.
Supplementary EJ Index 2 =
(Environmental Indicator)
X (Demographic Index for Block Group)
Supplementary EJ Index 2 Based on Supplementary Demographic Index
This EJ index is a combination of a block group environmental factor and the supplementary
demographic index.
Supplementary EJ Index 2
with supplementary demographics =
(Environmental Indicator)
X (Supplementary Demographic Index for Block Group)
117 | P a g e

-------
Appendix F
APPENDIX F. QUALITY CONTROL / QUALITY ASSURANCE
EPA's quality control guidelines emphasize transparency and reproducibility as useful in ensuring the
quality of data. EPA is providing a very high level of transparency in EJSCREEN by taking several steps
described here.
The EJSCREEN Technical Documentation (this document) has extensive details on the precise sources
and exact methods used, to ensure transparency. The transparency of the data inputs is also ensured
through references to further technical documentation from the providers of those data inputs, such as
the PM2.5 and ozone estimates, NATA, the Census demographic data, and the DOT traffic database.
Metadata is linked from the web-based tool, providing quick access to further technical details.
Furthermore, the full raw database of EJSCREEN indicators and indexes, and supplementary material,
will be available to expert users who wish to go beyond the web-based interface and conduct further
analysis or research. EPA also hopes to make available the Python and R code used to develop all the
indicators in EJSCREEN, including proximity scores, percentiles, and so on. Access will be provided
through the data download section of the EJSCREEN website (http://www2.epa.gov/eiscreen).
Extensive quality control/ quality assurance efforts were made in the development of EJSCREEN and a
very brief summary is provided here.
The starting point for most of the environmental indicators was information provided by EPA Offices
(i.e., latitude/ longitude data used to create proximity indicators, and the NATA results). Those sources
of information had already been subject to QA procedures in the respective offices, and the information
had already been released to the public. EPA's Office of Air and Radiation (OAR) provided PM2.5 and
ozone estimates based on public monitoring data, CMAQ results, and a fusion model to combine them.
The CMAQ and fusion model have previously been extensively documented in peer-reviewed journal
articles (Byun & Schere, 2006; Berrocal, Gelfand, & Holland, 2010a, 2010b, 2011). The lead paint
indicator was calculated from Bureau of Census data by an EPA contractor, and then independently
replicated by EPA, through separate ACS downloads and calculations. The traffic indicator was calculated
from publicly available DOT data, as explained in this report.
The calculations of environmental indicators from those inputs was conducted by an EPA contractor for
the proximity scores and the lead paint score, using their established QA/QC procedures, so EPA did not
attempt to replicate the proximity calculations. These calculations involved time-consuming proximity
calculations, and simple calculation of the lead paint indicator. The NATA and PM and ozone indicators
were simply taken directly from EPA and used in EJSCREEN.
The demographic indicators were calculated by an EPA contractor based on ACS data they obtained
from the Census Bureau. EPA was able to independently replicate 100% of the resulting indicators by
separately obtaining the raw ACS data.
118 | P a g e

-------
Appendix F
The same is true for the EJ Indexes and all of the percentiles, map bins (for color-coded maps), and
popup text fields used in EJSCREEN - the entire geodatabase was independently replicated by EPA using
only the environmental indicators as a starting point, and applying alternative algorithms and code for
development of percentiles and bins, as well as the rounding procedures defined for popup text.
EPA was also able to conduct some limited manual replication of buffer calculations, although truly
independently replicating those GIS algorithms is challenging given the need to use data on millions of
blocks and the challenge of identifying relevant blocks in a fashion that is independent of the
geoprocessing tool used for buffer analysis in EJSCREEN. Spot checks were conducted on buffer reports
to ensure raw data and percentile calculations, use of lookup tables, rounding, significant digits, and
floored percentiles were all handled correctly.
The extensive QA/QC process did uncover numerous complex data challenges early in the process, and
ultimately lead to a final database that could be fully independently replicated from environmental
indicators calculated from public information, providing strong assurance of the integrity of the data
processing and calculations.
119 | P a g e

-------
Appendix G
APPENDIX G. PEER REVIEW
EJSCREEN was submitted for peer review in early 2014, through a letter review process conducted by a
contractor with extensive experience in organizing peer review. Based on pre-defined criteria regarding
level of expertise in relevant subject areas, four experts were identified.
The reviewers were provided with a draft of this technical documentation, describing EJSCREEN's
development, purpose, and use of selected environmental indicators, demographics, and EJ indexes for
screening and mapping.
Reviewers were also provided a live webinar presentation and demonstration, along with time for
questions for EPA. One of the four was unable to attend the webinar but contacted EPA with questions
that EPA responded to in a phone conference call.
The reviews were completed in March of 2014. Each of the four reviewers provided a detailed
discussion of their technical comments, concerns, and recommendations.
All four agreed that the new environmental justice screening tool will be helpful to its users and is
generally very well done. Each did point to some weaknesses in the tool, suggesting that correcting
these shortcomings in the next version could strengthen the tool and help its users.
Of the more than 100 distinct comments from the expert peer reviewers, more than one third were
positive statements about the quality of EJSCREEN and the documentation. A sampling of direct quotes
includes the following:
•	"I would like to commend the EPA ... it does represent a major step forward and the EPA should
be recognized for this achievement"
•	"This documentation fairly represents the tremendously difficult task of creating this tool"
•	"very impressed by the quality of the work"
All of the reviewers also agreed that the EJSCREEN documentation is generally well-written, clear, and
easy to follow. Reviewers did ask for editorial changes, clarification, or further rationale in the
documentation, and such comments represented about one fourth of all the comments received. They
asked for clarification in some specific sections, such as more discussion of which indicators were chosen
and why, and which were left out and why. Many of these comments have already been taken into
consideration in this version of the Technical Document.
About one third of the comments were suggestions or requests for new data (in reports, maps, and data
files), typically recommending new or improved environmental indicators (e.g., air quality, water quality,
more facility types, etc.). Some comments made suggestions that would involve adding a new feature to
the EJSCREEN tool, rather than improving the data layers or the documentation. These will be taken into
consideration in discussions of possible future updates or upgrades to the tool.
120 | P a g e

-------
Appendix G
A handful of comments raised policy considerations and inherently challenging issues in screening and
mapping. These involved basic policy questions such as what is the best spatial resolution for these
maps, and whether to combine all 12 EJ indexes (Reviewers were divided on this topic). The 2015
version of EJSCREEN continues to use block groups, but recommends an emphasis on buffers as less
uncertain than a single block group estimate. It continues to use 12 separate indexes, but these issues
can be a topic of continuing discussions and exploration in the future as the public and others work with
the new tool.
On the whole, the reviewers' suggestions have already served to strengthen the tool and its
documentation, and will continue to inform discussions. By elaborating and clarifying the options and
choices made, EPA can help the users of EJSCREEN better understand its potential and its limitations.
Improved data and methods should be considered as well in future versions of EJSCREEN. EPA looks
forward to working across various offices, with stakeholders across the nation, and academics as well as
the public, on implementation and future enhancement of EJSCREEN.
121 | P a g e

-------
Appendix H
APPENDIX H. INITIAL FILTER APPROACH FOR SCREENING
What is the 80th percentile filter?
In past screening experience, EPA has found it helpful to establish a suggested Agency starting point for
the purpose of identifying geographic areas that may warrant further consideration, analysis, or
outreach. The use of an initial filter promotes consistency and provides a pragmatic first step for EPA
programs and regions when interpreting screening results. For early applications of EJSCREEN, EPA
identified the 80th percentile filter as that initial starting point. In other words, an area with any of the
12 EJ indexes at or above the 80th percentile nationally should be considered as a potential candidate
for further review. Further review may include considering other factors and other sources of
information such as health based information, local knowledge, proximity and exposure to
environmental hazards, susceptible populations, unique exposure pathways, and other federal, regional,
state, and local data. This filter is simply a starting point, and program offices and regions should
perform additional analysis before making any decisions about potential environmental justice issues. As
EPA gains further experience and insight into the performance of the tool and its applicability for
different uses, program offices and regions may opt to designate starting points that are more inclusive
or specifically tailored to meet programmatic needs more effectively.
The 80th percentile filter in EJSCREEN is not intended to designate an area as an "EJ community."
EJSCREEN provides screening level indicators, not a determination of the existence or absence of EJ
concerns. Nor does the use of the 80th percentile filter suggest that all of the 12 environmental
indicators are equal in terms of their impact on human health and the environment. Instead, the 80th
percentile filter encourages programs to consider environmental indicators outside of their areas of
concentration. The Agency may revise this approach in the future based on experience. This 80th
percentile filter is for internal EPA use and is not intended to apply to States or other organizations.
122 | P a g e

-------
Appendix H
[page intentionally blank]
123 | P a g e

-------