RSEI Data Dictionary, Version 2.3.6, RY2016

RSEI Data Dictionary

Contents

Facility-level Data Tables	2

Facility	2

EPA Program Identifiers for Reporting Facilities	5

Off-site	5

EPA Program Identifiers for Off-Site Facilities	7

Standard Industrial Classification (SIC)	7

North American Industry Classification System (NAICS)	8

Chemical	8

Maximum Contaminant Level (MCL)	12

Media	13

Submission	13

Release	14

Elements	14

Category	15

RSEI Geographic Microdata	17

Disaggregated Microdata	17

Aggregated Microdata	18

Averaged Block Group Microdata	19

Water Microdata	20

Other Available Data	20

Census Crosswalks	20

Population Data (US Decennial Census)	21

Shapefiles- Current Version (Grid geography)	22

Shapefiles- Older Version (Grid geography)	22

1


-------
RSEI Data Dictionary, Version 2.3.6, RY2016

RSEI Data Dictionary

This document describes all of the data tables and fields used in the RSEI model and results data sets.
Additional information can be found in the RSEI methodology document.

Facility-level Data Tables

This dataset presents RSEI results at the facility release level, and is the basic dataset for results that can
be found in Envirofacts and EasyRSEI. These tables are distributed in the RSEI Queries database as well
as in a set of flat csv files. These tables link to the Geographic Microdata using keys like FacilityNumber,
ChemicalNumber, and ReleaseNumber. Note that the key values change with each version of RSEI, so
you must use the same version of these tables as the Microdata.

Facility

The facility table contains data for reporting facilities, including location, stack parameters and discharge
reach, and is also available in EasyRSEI. Note that, with Version 2.3.6, EPA program IDs for RCRA and
ICIS-NPDES/PCS are provided in a separate table.

RSEI Facility Table, spreadsheet

RSEI Facility Table, text format

Facility Data

Variable Name

Description

Facility 1D

Unique TRI identifier for facility (TRI Facility ID).

FacilityNumber

Internal identifier unique to each facility, (key for table)

Latitude

Final latitude of the facility in decimal degrees used for modeling.

Longitude

Final longitude of the

GridCode

Number that identifies the model grid within which the cell is located.

X

Assigned grid value based on latitude.

Y

Assigned grid value based on longitude.

RadialDistance

Distance from approximate center point of grid.

StackHeight

Height of facility stack that is emitting the pollutant (m).

StackVelocity

Rate at which the pollutant exits the stack (m/s).

StackDiameter

Diameter of facility stack that is emitting the pollutant (m).

StackHeightSource

Source of information on stack height.

StackVelocitySource

Source of information on stack velocity.

StackDiameterSource

Source of information on stack diameter.

NEIYear

National Emissions Inventory (NEI) version year, if NEI data were used for
stack parameters.

FacilityName

TRI facility name.

Street

Street address of facility.

City

City where the TRI facility is located.

County

County where the TRI facility is located.

State

State in which the facility is located.

2


-------
RSEI Data Dictionary, Version 2.3.6, RY2016

Facility Data

Variable Name

Description

ZIPCODE

Five-digit facility ZIP code.

ZIP9

Nine digit facility ZIP code, if reported.

FIPS

FIPS (Federal Information Processing Standard) code which identifies the
county associated with the facility.

STFIPS

FIPS (Federal Information Processing Standard) code which identifies the
state associated with the facility.

DUNS

The 9-digit number assigned by Dun & Bradstreet for the facility or
establishment within the facility.

REGION

EPA region where facility is located.

FederalFacilityFlag

Code describing federal status for purposes of Executive Order 12856.

FederalAgencyName

Name of Federal Agency of which the federal facility is a part.

ParentName

Name of the corporation or other business entity located in the U.S. that
directly owns at least 50 percent of the voting stock of the facility, as
submitted by the TRI facility.

ParentDUNS

The 9-digit number assigned by Dun & Bradstreet for the US parent
company.

StandardizedParentComp
any

Name of parent company, checked for consistency so that records can be
aggregated by parent company.

PublicContactName

Name submitted by TRI facility as public contact.

PublicContactPhone

Phone number submitted by TRI facility for public contact.

Extension

Extension number, if any, associated with contact phone number.

PCT_CH6

Percent of chromium released that is assumed to be hexavalent (the
remainder is assumed to be trivalent with negligible toxicity and not
modeled.

ChromHexPercent

Percent of chromium released that is assumed to be hexavalent (the
remainder is assumed to be trivalent with negligible toxicity and not
modeled (same as PCT_CH6).

Chrom Source

Source for PCT_CH6/ChromHexPercent.

ModChrom Releases

True if facility has released or transferred chromium or chromium
compounds to modeled media (fugitive/stack air releases, direct water,
POTWs or off-site incineration).

NewlndustryFlag

True if the facility's primary NAICS was added to TRI in the TRI industry
expansion beginning in reporting year 1998.

NAICS1

Facility-level primary North American Industry Classification System (NAICS)
code assigned by RSEI for modeling purposes (note that in TRI a facility can
have a different NAICS code for each Form R). If more than one primary
NAICS is reported by the facility, the most frequently reported primary
NAICS for the most recent year is selected. Information on NAICS can be
found at the Census website at https://www.census.gov/eos/www/naics/

NAICS2

Facility's most frequently reported non-primary 6-digit NAICS code.

NAICS3

second most frequently reported non

3


-------
RSEI Data Dictionary, Version 2.3.6, RY2016

Facility Data

Variable Name

Description

NAICS4

Facility's third most frequently reported non-primary 6-digit NAICS code.

NAICS5

Facility's fourth most frequently reported non

NAICS6

Facility's fifth most frequently reported non-primary 6-digit NAICS code.

NAICSCode3Digit

First 3 digits of facility's primary NAICS code.

NAICSCode4Digit

First 4 digits of facility's primary NAICS code.

NAICSCode5Digit

5

SIC1

Facility-level SIC code that corresponds to the assigned facility-level NAICS
code.

FRSID

EPA's Facility Registry System ID.

AssignedReach

NHDPIus reach identifier for final facility discharge reach.

AssignedCOMID

segment identifier for final facility discharge reach.

ReachSource

Source for final discharge assignment.

OutfallLatitude

Latitude for outfall.

OutfallLongitude

Longitude for outfall.

OutfallSource

Source for outfall coordinates.

NearReach

NHDPIus reach identifier for nearest discharge reach.

NearCOMID

segment identifier for nearest discharge reach.

NPDESReach

NHDPIus reach identifier for discharge reach as reported to ICIS-NPDES.

NPDESCOMID

segment identifier for discharge reach reported to ICIS

NPDESYear

Year of ICIS-NPDES data used.

DistanceToReach

The distance between an off site facility discharging to water and the reach
of the receiving water body (m).

HEM3ID

The ID assigned to the nearest National Weather Service (NWS) observation
station.

DistanceToHEM3

LatLongSource

Source of final lat/long found in 'Latitude' and 'Longitude' fields.

LLYear

Year of lat/long data.

LLNotes

Notes for facility location.

Confirmed

True if facility location has been confirmed via satellite image.

WaterReleases

True if facility reports direct releases to water for any year since 1988.

DistanceToTribalLand

Distance to nearest Tribal Land within ten miles (miles)

TribalLandName

Name of nearest Tribal Land within ten miles, if any.

Full NameTribal Land

Full or official name of Tribal land.

ChromReleases

True if facility reports chromium modeled releases or transfers for any year
since 1988.

ModeledReleases

True if facility reported modeled releases or transfers since 1988 (fugitive or
stack air releases, direct water releases, or transfers to off-site incineration
or POTWs).

4


-------
RSEI Data Dictionary, Version 2.3.6, RY2016

EPA Program Identifiers for Reporting Facilities

This table contains program identifiers for each TRI reporting facility, from EPA's Facility Registry Service
(FRS). Resource Conservation and Recovery Act (RCRA) program, and National Pollutant Discharge
Elimination System (NPDES) . The FRS identifier can be used to link to any other EPA program
identifiers.

Program Identifiers for TRI Reporting Facilities, spreadsheet

Program Identifiers for TRI Reporting Facilities, text format

Program Identifiers for Reporting Facilities

Variable

Description

FRSID

FRS program identifier

tril, tri2...

TRI facility identifier associated with FRS record (FRS identifiers are assigned to
unique facilities in EPA's FRS system, and in a few cases there are multiple TRI
identifiers for one FRS identifier).

rcral, rcra2...

RCRA identifier associated with FRS record.

npdesl, npdes2...

NPDES identifier associated with FRS record.

Off-site



The Off-site table contains the condensed list of quasi-unique off-site facilities to which TRI reporters
transfer waste. Only incinerators and POTWs are modeled by RSEI, so verification of addresses and
locations are focused on those off-site facilities.

RSEI Off-Site Table,

spreadsheet

RSEI Off-Site Table, text format

Off-Site Data

Variable

Description

OffsitelD

Unique internal identifier for each off-site facility.

FacilityNumber

Unique internal identifier for each off-site facility. [Note this is
different from the FacilityNumber field in the Facility table]

POTWJncin

Identifies off-site facilities for which releases are modeled: 1= POTW;
2=lncinerator; 3=POTW and Incinerator.

Droplncinerator

True if off-site has been identified as a TRI reporter or a RCRA
hazardous waste incinerator.

Name

Best submitted name for off-site facility.

Street

Best submitted street address for off-site facility.

City

Best submitted city for off

State

Best submitted state for off-site facility.

ZIPCode

Best submitted ZIP code for offsite facility.

ZIP9

This variable is not yet implemented.

Latitude

Geocoded latitude in decimal degrees for off-site facility.

Longitude

Geocoded longitude in decimal degrees for off-site facility.

GridCode

Number that identifies the model grid within which the cell is located.

5


-------
RSEI Data Dictionary, Version 2.3.6, RY2016

Off-Site Data

Variable

Description

Country

Null if off-site facility is located in the U.S.; otherwise country in which
off-site facility is located.

X

Assigned grid value based on latitude.

Y

Assigned grid value based on longitude.

Radial Distance

Distance from approximate center point of grid.

StackHeight

Stack height used for modeling.

StackVelocity

velocity used for

StackDiameter

Stack diameter used for modeling.

StackParameterSource

Null if default stack parameters were used; otherwise source for stack
parameters.

HEM3ID

The ID assigned to the nearest National Weather Service (NWS)
observation station.

DistanceToHEM3

WBANID

The ID assigned to the Weather Bureau/Army/Navy Weatherstation
nearest to the facility.

DistanceToWBAN

The distance between a facility and the nearest WBAN weather station
(m).

WaterReleases

True if off-site facility receives transfers to POTW.

OutfallLongitude

Latitude associated with end of the pipe used for off-site facility's
discharge to water.

OutfallLatitude

Longitude associated with end of the pipe used for off-site facility's
discharge to water.

NearReach

14-digit NHDPIus reach identifier associated with the reach that is
nearest to off site facility.

NearComID

ComID from NHDPIus dataset that uniquely identifies reach segment
nearest facility.

DistanceToReach

The distance between an off-site facility discharging to water and the
reach of the receiving water body (m).

AssignedReach

14-digit NHDPIus reach identifier associated with reach assigned by
EPA or determined through QA.

AssignedComID

ComID from NHDPIus dataset that uniquely identifies reach segment
for assigned reach.

ReachSource

Data source linking stream reach to facility.

ReachNotes

Notes pertaining to stream reach assignment.

LocationType

Type of geocoded match.

LatLongSource

Source used to determine lat/longs. NA identifies records with
insufficient information to determine location.

LatLongYear

Year lat/long was last updated.

LockLL

True if location was confirmed as correct using satellite data.

CentroidAdjustment

True if facility's FRS coordinates were modified from front door or
street to approximate center of facility.

Notes on Coordinates

Notes on how lat/long was derived.

AdditionalSourcesForLocation

Web site, if any, used to determine location.

6


-------
RSEI Data Dictionary, Version 2.3.6, RY2016

Off-Site Data

Variable

Description

LocationConfidence

Code describing confidence in location assigned to off-site:

1	= confirmed in satellite view on map.

2	= Substantial information supporting location, including physical
features such as settling ponds (2a), or a match to a business entry in
google maps (2b).

3	= Geocoded address looks plausible given type of facility.

4	= No information to support geocoded address.

Foreign

1 if off-site is located outside the U.S.

GeoMatchType

If LatLongSource=ESRI, shows the basis upon which the coordinates
were assigned, such as street address, postal code, district, etc.

EPA Program Identifiers for Off-Site Facilities

This table contains program identifiers for each off-site facility that receives reported transfers from TRI
facilities. RSEI condenses the off-site transfer reports into approximately unique facilities (some
duplication may remain),and matches the off-sites to records in EPA's Facility Registry Service (FRS),
Resource Conservation and Recovery Act (RCRA) program, and National Pollutant Discharge Elimination
System (NPDES). Matches are based on name and address using approximate text matching; program
identifiers should be verified before any analysis is finalized.

Program Identifiers for TRI Off-Site Facilities, spreadsheet

Program Identifiers for TRI Off-Site Facilities, text format

Program Identifiers for Off-Site Facilities

Variable

Description

OffsitelD

RSEI internal identifier for each unique off-site facility.

FRSID

FRS program identifier

tril, tri2...

TRI facility identifier associated with FRS record (FRS identifiers are assigned to
unique facilities in EPA's FRS system, and in a few cases there are multiple TRI
identifiers for one FRS identifier).

rcral, rcra2...

RCRA identifier associated with FRS record.

npdesl, npdes2...

NPDES identifier associated with FRS record.

Standard Industrial Classification (SIC)

This table is no longer maintained in RSEI. NAICS codes are now used to determine industry-level stack
heights and chromium speciation rates.

7


-------
RSEI Data Dictionary, Version 2.3.6, RY2016

North American Industry Classification System (NAICS)

NAICS codes are collected by TRI.

NAICS Table, spreadsheet
NAICS Table, text format

NAICS Data

Variable

Description

NAICSCode

Six-digit NAICS code.

LongName

Text description of code.

Chemical

The chemical table contains data for chemicals reported to TRI, including toxicity, physico-chemical
properties, and flag fields to facilitate user selections. The chemical table is also available in EasyRSEI.

RSEI Chemical Table, spreadsheet
RSEI Chemical Table, text format

Chemical Data

Field Name

Field Description

CASNumber

Chemical Abstracts Service Registry Number, which identifies a unique chemical. For
chemical categories, CAS Numbers begin with "N", followed by three digits.

CASStandard

The Chemical Abstracts Service Registry Number identifies a unique chemical. The
standard format contains three sets of numbers divided by hyphens (00-00-0).

ChemicalNumber

Unique internal identifier.

SortCAS

Chemical Abstracts Service Registry Number, which identifies a unique chemical,
formatted for sorting (no hyphens). For chemical categories, CAS Numbers begin with
"N", followed by three digits.

SortName

Common name of chemical, with initial modifiers moved to end of name. Used for
internal sorting purposes.

FullChemicalName

Full scientific name(s) of the chemical.

Common name(s) of the chemical.

Added

The year the chemical was added to the Toxics Release Inventory. This field is blank
when Chemical is invalid, mixture or trade secret.

Toxicity Source

All sources used for toxicity data, and date of addition to database.

RfClnhale

The inhalation reference concentration (RfC) is defined as "an estimate (with
uncertainty spanning perhaps an order of magnitude) of a continuous inhalation
exposure to the human population (including sensitive subgroups) that is likely to be
without appreciable risk of deleterious noncancer health effects during a lifetime".
Units are mg/m3.

8


-------
RSEI Data Dictionary, Version 2.3.6, RY2016

Chemical Data

Field Name

Field Description

RfCUF

The uncertainty factor (UF) is applied to the no-observed-adverse-effect level (NOAEL)
upon which the RfC is based, thereby reducing the dose. The UF accounts for
uncertainties in extrapolation from experimental data to an estimate appropriate to
humans.

RfCMF

The modifying factor (MF) is a value applied to the NOAEL when scientific uncertainties
in the study chosen for estimating the RfC are not explicitly addressed by the standard
UFs.

RfCConf

Confidence levels are assigned to the study used to derive the RfC, the overall
database, and to the RfC itself.

RfCSource

Source used for the RfC value.

RfCListingDate

Date that RfC was listed, if available.

RfCToxWeight

Toxicity weight based on the RfC (RfCToxWeight = 3.5/RfC). Noncancer/inhalation.

RfDOral

The oral reference dose (RfD) is "an estimate (with uncertainty spanning perhaps an
order of magnitude) of a daily exposure [by ingestion] to the human population
(including sensitive subgroups) that is likely to be without an appreciable risk of
deleterious effects during a lifetime", (mg/kg-day)

RfDUF

The uncertainty factor (UF) is applied to the no-observed-adverse-effect level (NOAEL)
upon which the RfD is based, thereby reducing the dose. The UF accounts for
uncertainties in extrapolation from experimental data to an estimate appropriate to
humans.

RfDMF

The modifying factor (MF) is a value applied to the NOAEL when scientific uncertainties
in the study chosen for estimating the RfD are not explicitly addressed by the standard
UFs.

RfDConf

Confidence levels are assigned to the study used to derive the RfD, the overall
database, and to the RfD itself.

RfDListingDate

Date that RfD was listed, if available.

RfDSource

Source used for the RfD value.

RfDToxWeight

Toxicity weight based on the (RfDToxWeight = 1/RfD). Noncancer/oral.

UnitRisklnhale

The unit inhalation risk is the excess lifetime risk due to a "continuous constant lifetime
exposure of one unit of carcinogen concentration"(51 FR 33998). (l/mg/m3)

QSTAROral

The oral cancer slope factor (ql*) or oral slope factor (OSF): a measure of the
incremental lifetime risk of cancer by oral intake of a chemical, expressed as risk per
mg/kg-day. (1/mg/kg-day)

9


-------
RSEI Data Dictionary, Version 2.3.6, RY2016

Chemical Data

Field Name

Field Description

WOE

Weight of evidence (WOE) categories indicate how likely a chemical is to be a human
carcinogen, based on considerations of the quality and adequacy of data and the type
of responses induced by the suspected carcinogen. EPA WOE classifications include the
following categories and associated definitions (51 FR 33996):
A Carcinogenic to humans
B Probable carcinogen based on:

•	B1 Limited human evidence

•	B2 Sufficient evidence in animals and

inadequate or no evidence in humans:

C Possible carcinogen
D Not classifiable
E Evidence of non-carcinogenicity

UnitRiskListingDate

Date that Unit Risk was listed, if available.

UnitRiskSource

Source used for the Unit Risk value.

lURToxWeight

Toxicity weight based on the IUR (lURToxWeight = IUR/2.8e-7). Cancer/inhalation.

QStarListingDate

Date that QStar was listed, if available.

QStarSource

Source used for the QStar value.

OSFToxWeight

Toxicity weight based on the QStar or OSF (OSFToxWeight = QSTAROral/le-6).
Cancer/oral.

WOEListingDate

Date that WOE was listed, if available.

WOESource

Source used for the WOE classification.

ITW

Inhalation Toxicity Weight: the RSEI toxicity weight for a chemical for the inhalation
pathway.

OTW

Oral Toxicity Weight: the RSEI toxicity weight for a chemical for the oral pathway.

ToxicityClassOral

This indicates whether the toxicity weight for the oral pathway is based on cancer or
noncancer health effects.

ToxicityClasslnhale

This indicates whether the toxicity weight for the inhalation pathway is based on cancer
or noncancer health effects.

ToxicityCategory

This indicates whether the oral and inhalation toxicity weights are based on cancer
health effects, non-cancer health effects, or both.

AirDecay

The rate at which a chemical degrades in air, due primarily to photooxidation by
radicals (hr-1).

Koc

The organic carbon-water partition coefficient, used in estimates of chemical sorption
to soil (mL/g).

H20Decay

The rate at which a chemical degrades in water, due to abiotic hydrolysis,
biodegradation, or photolysis (hr-1).

LOGKow

The logarithm of the octanol water partition coefficient. Kow is the ratio of a chemical's
concentration in the octanol phase to its concentration in the aqueous phase at
equilibrium in a two-phase octanol/water system.

10


-------
RSEI Data Dictionary, Version 2.3.6, RY2016

Chemical Data

Field Name

Field Description

Kd

The soil-water partition, or distribution, coefficient. For organics, the value is often
estimated as the product of Koc and foe (the fraction of organic carbon in the soil)
(L/kg).

WaterSolubility

The amount of chemical that dissolves in water at a particular temperature (mg/L).

POTWPartitionRemoval

Percent of chemical removed from the wastewater by the POTW (Publicly Owned
Treatment Works).

POTWPartitionSludge

Percent of total POTW removal efficiency attributable to sorption of the chemical to
sewage sludge.

POTWPartitionVolat

Percent of total POTW removal efficiency attributable to volatilization of the chemical.

POTWPartitionBiod

Percent of total POTW removal efficiency attributable to biodegradation of the
chemical.

IncineratorDRE

Destruction/removal efficiencies, expressed as the percent of chemical fed to the
incinerator that is not released to the air.

BCF

Bioconcentration factor: the ratio of a chemical's concentration in fish to its
concentration in water at equilibrium (L/kg).

Henrys

Henry's law constant: the ratio of a chemical's concentration in the air to its
concentration in the water at equilibrium (atm-m3/mol).

MCL

Maximum Contaminant Level, which is EPA's national primary drinking water standard
for the chemical. This is the current value; historical data are contained in the table,
'MCL.'

Molecular Weight

The mass in grams of one mole of molecules of the chemical.

HAPFlag

This flag marks the chemicals that are hazardous air pollutants, as defined by the Clean
Air Act.

CAAFlag

This flag marks the chemicals that are Clean Air Act pollutants.

PriorityPollutantFlag

priority pollutants, as defined by the Clean Water

Act.

SDWAFlag

This flag marks the chemicals that have national primary or secondary drinking water
standards under the Safe Drinking Water Act.

CERCLAFlag

This flag marks the chemicals that are regulated under Superfund (CERCLA—the
Comprehensive Environmental Response, Compensation, and Liability Act).

OSHACarcinogens

This flag indicates whether the chemical is a known or suspect human carcinogen based
on OSHA criteria. Known human carcinogens are defined as those that have been
shown to cause cancer in humans. Suspect human carcinogens have been shown to
cause cancer in animals. The list of chemicals flagged as OSHA carcinogens is based on
the list of carcinogens provided in the 1997 TRI Public Data Release.*

ExpansionFlag

This flag marks the chemicals that were added to the Section 313 toxic chemical list for
the 1995 Reporting Year.

Core88ChemicalFlag

This flag marks the chemicals that are common to all reporting years of TRI and that
have had no modifications of reporting requirements, as determined by the 1988 Core
Chemical List found on the TRI Explorer website.

11


-------
RSEI Data Dictionary, Version 2.3.6, RY2016

Chemical Data

Field Name

Field Description

Core95ChemicalFlag

This flag marks the chemicals that are common to TRI reporting years 1995 through the
current year and that have had no modifications of reporting requirements in that time
period, as determined by the 1995 Core Chemical List found on the TRI Explorer
website.

Core98ChemicalFlag

This flag marks the chemicals that are common to TRI reporting years 1998 through the
current year and that have had no modifications of reporting requirements in that time
period, as determined by the 1998 Core Chemical List found on the TRI Explorer
website.

CoreOOChemicalFlag

This flag marks the chemicals that are common to TRI reporting years 2000 through the
current year and that have had no modifications of reporting requirements in that time
period.

CoreOlChemicalFlag

This flag marks the chemicals that are common to TRI reporting years 2001 through the
current year and that have had no modifications of reporting requirements in that time
period. The only difference between this flag and the CoreOOChemicalFlag is the
inclusion of lead and lead compounds.

HPVFlag

Indicates whether the chemical is designated as a High Production Chemical.

HPVChallengeValue

Describes the value or combination of values assigned to the chemical by EPA's HPV
Challenge program to describe the chemical's status under the program.

PBTFlag

Indicates whether EPA has designated this chemical as a priority chemical under the
Persistent Bioaccumulative and Toxic (PBT) Chemical Program.

Metal

This flag indicates whether the chemicals are metals and also whether they are core
chemicals. (Core chemicals are those that are common to all reporting years of TRI and
which have had no modifications of reporting requirements.)

HasTox

Indicates that the chemical has a toxicity weight (either oral or inhalation) in the data
set.

MaxTW

Shows the greater of the two possible toxicity weights (oral or inhalation).

Notes

Additional information regarding assignment of toxicity or physicochemical data.

Maximum Contaminant Level (MCL)

MCLs are used to cap maximum concentrations in drinking water systems.
RSEI MCL Table, spreadsheet
RSEI MCL Table, text format

MCL Data

Variable

Description

CASNumber

Chemical Abstracts Service Registry Number, which identifies a unique
chemical. For chemical categories, CAS Numbers begin with "N", followed by
three digits.

12


-------
RSEI Data Dictionary, Version 2.3.6, RY2016

MCL Data

CASStandard

The Chemical Abstracts Service Registry Number identifies a unique chemical.
The standard format contains three sets of numbers divided by hyphens (00-00-
0).

ChemicalNumber

Unique internal identifier (links to Chemical table).

Common name of the chemical.

| MCL1988...MCL2016

MCL for each year an MCL was in effect.

Media

The media table provides descriptions for the media codes used in the Release table.
RSEI Media Table, spreadsheet
RSEI Media Table, text format

Media Data

Variable

Description



Media

(RSEI Media Code)

Code associated with the media and/or method of release, as
in TRI Reporting Form R, except that "M" is replaced with "1"
site transfers.

reported by facility
in the code for off-

MediaText (Short
Description)

Descriptions of receiving media associated with Media Code.



TRICode

Code associated with the media and/or method of release, as
in TRI Reporting Form R.

reported by facility

TRICategory

-assigned waste treatment category.



LongDescription

Longer version of media text field.



Submission

The submission table contains Form R information submitted to TRI.

The Submission, Elements and Release tables are too large for spreadsheet format.
RSEI Submission Table, text format

Submission Data

Variable

Description

DCN

Unique identifier assigned by TRI to each facility submission (document control
number).

SubmissionNumber

Internal identifier assigned to each submission.

FacilityNumber

Internal identifier unique to each facility (links to Facility table).

ChemicalNumber

Internal identifier unique to each chemical (links to Chemical table).

SubmissionYear

Year of facility release.

13


-------
RSEI Data Dictionary, Version 2.3.6, RY2016

Submission Data

Variable

Description

Use

Code describing how chemical is used in reporting facility, as reported on TRI
Reporting Form R. See On-site Chemical Information for an explanation of the
codes.

MaxOnsite

Code describing the maximum amount of the chemical on-site at reporting
facility, as reported in TRI Reporting Form R. See On site Chemical
Information for an explanation of the codes.

Release



This table contains data for each chemical release. There can be multiple release records per submission
record.

The Submission,

Elements and Release tables are too large for spreadsheet format.

RSEI Release Table, text format

Release Data

Variable

Description

1 ReleaseNumber

Unique internal identifier.

SubmissionNumber Unique internal identifier (links to Submission table).

Media

Code associated with the media and/or method of release, as reported by
facility in TRI Reporting Form R. See Media table for explanation of codes.

PoundsReleased

Total pounds released, without accounting for treatment.

OffsiteNumber

Unique identifier for off-site facility receiving this release, if any. Links to Facility
Number in the Off site table.

TEF

Toxicity Equivalency Factor used to adjust toxicity for dioxins.

Elements



The Elements table contains the calculated results for each release. There can be multiple elements
records for each release. Note that all values in the elements table are rounded to six significant figures.

The Submission,

Elements and Release tables are too large for spreadsheet format.

RSEI Elements Table, text format

Elements Data

Variable

Description

ElementNumber

Unique internal identifier.

ReleaseNumber

Unique internal identifier (links to Release table).

PoundsPT
(TRI Pounds)

Total pounds after any treatment by POTWs or other offsite facilities.

14


-------
RSEI Data Dictionary, Version 2.3.6, RY2016

Elements Data

Variable

Description

ScoreCategory

Codes corresponding to the medium into which the chemical is released.
Examples of the information include: direct air releases from the stack using a
"rural" air dispersion model, fugitive air releases, releases to an onsite landfill.
[See Score Category Information for descriptions]

Score

Total Indicator Element score- modeled surrogate dose multiplied by toxicity
weight and by population, using the higher cancer/noncancer toxicity weight for
each air/water pathway.

Population

Total population exposed.

ScoreA

Score for children 0 through 9 years of age (inclusive).

PopA

Number of exposed children

ScoreB

Score for children 10 through 17 years of age (inclusive).

PopB

Number of exposed children 10

ScoreC

Score for adults 18 through 44 years of age (inclusive).

PopC

Number of exposed

ScoreD

Score for adults 45 through 64 years of age (inclusive).

PopD

Number of exposed

ScoreE

Score for adults 65 years old and greater.

PopE

Number of exposed

NCScore

(NonCancer Score)

Indicator Element score, limited to chemicals with non-cancer endpoints.

CScore

(Cancer Score)

Indicator Element score, limited to chemicals with cancer endpoints.

Hazard

Toxicity weight times TRI pounds, using the higher cancer/noncancer toxicity
weight for each air/water pathway.

HazardC
(Cancer Hazard)

Toxicity weight times TRI pounds, limited to chemicals with cancer endpoints.

HazardNC

(Non-Cancer Hazard)

Toxicity weight times TRI pounds, limited to chemicals with non-cancer
endpoints.

Category

The Category table describes the codes used in the Elements table to indicate the release pathway.

RSEI Category Table, spreadsheet
RSEI Category Table, text format

Category Data

Variable

Description

ScoreCategory

Codes corresponding to the medium into which the chemical is released. Examples of
the information include: volatilization from a transfer to a POTW, fugitive air releases,
releases to an onsite landfill.

Category

Descriptions of release media and other descriptors corresponding with the score
category codes.

15


-------
RSEI Data Dictionary, Version 2.3.6, RY2016

Category Data

Variable

Description

Model

A variable that is '1' when that category can be modeled and '0' when it cannot.

InhaleTox

A variable that is '1' when the model requires an inhalation toxicity score to model this
kind of release and '0' when it does not.

16


-------
RSEI Data Dictionary, Version 2.3.6, RY2016

RSEI Geographic Microdata

A separate guidance is available for use with the Microdata.

Disaggregated Microdata

These are the raw Microdata files that contain the most disaggregated data possible. For each 810m grid
cell, the file contains scores, concentrations, and tox-weighted concentrations for each chemical release.
There may be multiple records for any one grid cell. Note that if two releases for the same chemical
(either from different facilities or one from a stack release and one from a fugitive release from the
same facility) affect the same grid cell, there will be separate records for each grid release.

Naming: These annual files have historically been named MicroXXXX_YYYY, where XXXX is the reporting
year for the data freeze, and YYYY is the year of the data contained in the file. So Micro 2014_2010 is
from the RY2014 RSEI update, and contains data for chemicals released in 2010. The new naming
convention substitutes the version number for the version year, as in vXXX_micro_YYYY, where XXX is
the version number and YYYY is the year of the data contained in the file; for example
v234_micro_2014.csv.There is one annual file for the entire country, which is over 100 GB in size.



Disaggregated Microdata Table

Field Number

Name

Description

1

GridCode

Identifies grid. .
14=Conterminous US
24=Alaska
34=Hawaii

44=Puerto Rico/Virgin Islands
54=Guam/Marianas
64=American Samoa

2

X

X-coordinate of grid cell

3

Y

Y Coordinate of grid cell

4

ReleaseNumber

Internal unique identifier for release
(lookup in table "Release")*

5

ChemicalNumber

Internal unique identifier of released chemical
(lookup in table "Chemical")*

6

FacilityNumber

Internal unique identifier of releasing facility (lookup
in table "Facility" if media = 1 or 2; if media = 6 or
750 or 754, then lookup in table "Offsite")*

7

Media

Code describing media into which chemical is
released.

(lookup in table "Media")*

8

Cone

Concentration of chemical for release/media at grid
cell.

9

ToxConc

Concentration multiplied by inhalation toxicity
weight

10

Score

Risk-related score (surrogate dose * toxicity weight *
population)

17


-------
RSEI Data Dictionary, Version 2.3.6, RY2016



Disaggregated Microdata Table

Field Number

Name

Description

11

ScoreCancer

Risk-related score (surrogate dose * toxicity weight *
population) using only toxicity values for cancer
effects

12

ScoreNonCancer

Risk-related score (surrogate dose * toxicity weight *
population) using only toxicity values for noncancer
effects

13

Pop

Number of people in grid cell (may be interpolated)

Aggregated Microdata

Aggregated Microdata files use the same data as the disaggregated files, but sum the chemical releases
over each grid cell. Because the values are summed, unweighted concentrations are not available (the
sum of the concentrations of different chemicals would be meaningless).

Naming: These annual files have historically been named MicroXXXX_YYYY, where XXXX is the reporting
year for the data freeze, and YYYY is the year of the data contained in the file. So Micro 2014_2010 is
from the RY2014 RSEI update, and contains data for chemicals released in 2010. The new naming
convention substitutes the version number for the version year, as in vXXX_micro_YYYY, where XXX is
the version number and YYYY is the data year; for example v234_micro_2014.csv.These files have
historically been named in the format AggMicroXXXX_YYYY_GCZZ, where XXXX is the reporting year for
the data freeze, YYYY is the year of the data contained in the file, and ZZ is the 2-digit grid code (see
Field 1 in the Table 1 below for grid codes). The new naming convention substitutes the version
number for the version year, as in vXXX_aggregated_micro_gcZZ_YYYY; for example,
v234_aggregated_m icro_gcl4_2014.csv.



Aggregated Microdata Table

Field Number

Name

Description

1

X

X-coordinate of grid cell

2

Y

Y Coordinate of grid cell

3

NumberOf Facilities

Number of facilities with releases affecting grid cell.

4

NumberOf Releases

Number of individual releases affecting grid cell.

5

NumberOfChemicals

Number of chemicals with nonzero concentrations
for grid cell.

6

ToxConc

Concentration multiplied by inhalation toxicity
weight, summed over all chemicals impacting cell

7

Score

Risk-related score (surrogate dose * toxicity weight *
population), summed over all chemicals impacting
cell

8

Pop



9

ScoreCancer

Risk-related score (surrogate dose * toxicity weight *
population) using only toxicity values for cancer
effects

18


-------
RSEI Data Dictionary, Version 2.3.6, RY2016

Aggregated Microdata Table

Field Number Name	Description	

10	ScoreNonCancer	Risk-related score (surrogate dose * toxicity weight *

population) using only toxicity values for cancer
effects

Averaged Block Group Microdata

These files are the same as the aggregated Microdata files, but instead of being presented at the grid
cell level, the values are averaged over Census block groups. The file BG_RSEI_XXXX_3yr is a csv file with
the block group-level data averaged over 2012 through 2014. There are also shape files
(tl_2010_bg_US_RSEI) with the same data; that is, the .dbf file and the .csv have the same fields.

Averaged Block Group Microdata

Field Number

Name

Description

1

GEOIDIO

US Census Block Group ID

2

ALAND10

Land area of the block group (m2)

3

AW ATE RIO

Water area of the block

4

TOXCONC

Average toxicity-weighted concentration of the cells
in the block group, averaged over three years.

5

PTOXCONC

Percentile associated with field TOXCONC.

6

SCORE

Average risk-related score (surrogate dose * toxicity
weight * population) of the cells in the block group,
averaged over three years.

7

PSCORE

Percentile associated with field SCORE.

8

NCSCORE

Average risk-related score (surrogate dose * toxicity
weight * population) of the cells in the block group,
averaged over three years. Score is calculated using
only noncancer toxicity weights.

9

PNCSCORE

Percentile associated with field NCSCORE.

10



Average risk-related score (surrogate dose * toxicity
weight * population) of the cells in the block group,
averaged over three years. Score is calculated using
only cancer toxicity weights.

11

PCSCORE

Percentile associated with field CSCORE.

12

POP

Average population of the cells in the block group,
averaged over three years.

13

PPOP

Percentile associated with field POP.

14

COVERED

Internal field.

15

FOUND



16

GC

Grid code.

19


-------
RSEI Data Dictionary, Version 2.3.6, RY2016

Water Microdata

This file contains the toxicity-weighted concentrations downstream of TRI discharges by stream
segment. All years of data are contained in the file, which is named NHDMicroResults_conc_agg_XXXX,
where XXXX is the reporting year of the data freeze.

Water Microdata

Field Number

Name

Description

1

ReleaseNumber

Internal unique identifier for release
(links to Release table).

2

Counter

Auto-increment count of COMIDs

3

Com ID

"Common Identifier" of a flowline (sub-segment of a
reach)- atomic unit of reach data that matches one-to-
one to NHDPIus.

4

ReachCode

Code for reach

5

Cone

Concentration of chemical in flowline (mg/L)

6

Sequence

Number defining pathway of release (used to indicate
branching).

7

TravelTime

Time(s) for release to go from top of flowline to bottom.

8

TravelLength

Distance (m) for release to go from top of flowline to
bottom

9

Paths

Number of branches in stream path

10

FCode

Descriptor from NHDPIus for type of flowline (e.g.,
pipeline, stream)

11

ResCode

Internal code

Other Available Data

Census Crosswalks

Each set of crosswalk files links the RSEI grid cell geography to a different US decennial census year.
There is one crosswalk for each area and decennial Census year (1990, 2000, 2010). Crosswalk files are
named by area (Alaska, Con(terminous) US, etc.). The last three fields in each file contain percent values
that can be used to adjust the block or cell contents when performing the crosswalk. PCT_B_C and
PCT_C_B are area-weighted and can be used for metrics that do not involve population, such as
concentration and toxicity-weighted concentration. PCT_PC_B is population weighted, and can be used
to crosswalk fields that involve population, like score and pop. Note that the "PCT_CP_B" field is not
available for the territories (VI, PR, GU, AS, MP). The Northern Mariana Islands are in the Guam file and
the Virgin Islands are in the Puerto Rico file. There are no crosswalks for Puerto Rico, the Virgin Islands,
Mariana Islands, Guam, or American Samoa for 1990. For these areas, RSEI uses 2000 block boundaries
and scales each cell's population by the overall ratio of 1990/2000 population for each area.

20


-------
RSEI Data Dictionary, Version 2.3.6, RY2016





Census Crosswalk Table

Field Number

Name

Description

1

GridID

Identifies grid.
14=Conterminous US
24=Alaska
34=Hawaii

44=Puerto Rico/Virgin Islands
54=Guam/Marianas
64=American Samoa

2

X

X coordinate of the cell address

3

Y

Y coordinate of the cell address

4

BlockJDOO

US Census Block ID

5

UR

Internal

6

PCT_B_C

Percent of the Census block that is within the cell
(Block to Cell)

7

PCT_C_B

Percent of the cell that is within the Census block
(Cell to Block)

8

PCT_PC_B

Percent of the cell's population that is within the
Census block (Population-Cell to Block)

Population Data (US Decennial Census)

RSEI Census data are contained in three tables, Census 90 (data from the 1990 Census), Census 00 (data
from the 2000 Census) and Census 10 (data from the 2010 Census). These three tables contain the
Census data that has been transposed onto the RSEI model grid. Each Census table is over 600 MB in
size. 1990 Census data have been provided by Geolytics, Inc.

Census data were last updated in 2012.

Census 90 Data

Variable

Description

Grid Code

Number that identifies the model grid within which the cell is located.

X

Assigned grid cell value based on latitude.

Y

Assigned grid cell value based on longitude.

Male0to9 through
Female65andUp

The number of people in the grid cell in each Census subpopulation
group in the year 1990.

PrimaryFIPS

The FIPS code for the county within which most or all of the grid cell is
contained.

21


-------
RSEI Data Dictionary, Version 2.3.6, RY2016

Census 00 Data

Variable

Description

Grid Code

Number that identifies the model grid within which the cell is located.

X

Assigned grid cell value based on latitude.

Y

Assigned grid cell value based on longitude.

Male0to9 through
Female65andUp

The number of people in the grid cell in each Census subpopulation
group in the year 2000.

PrimaryFIPS

The FIPS code for the county within which most or all of the grid cell is
contained.



Census 10 Data

Variable

Description

Grid Code

Number that identifies the model grid within which the cell is located.

X

Assigned grid cell value based on latitude.

Y

Assigned grid cell value based on longitude.

Male0to9 through
Female65andUp

The number of people in the grid cell in each Census subpopulation
group in the year 2010.

PrimaryFIPS

The FIPS code for the county within which most or all of the grid cell is
contained.

Shapefiles- Current Version (Grid geography)

RSEI shapefiles define the grid and can be used for mapping. They do not contain any RSEI results. New
shapefiles were posted on the RSEI ftp site in early 2017. The shapes are the same; however, the fields
and format are different, and now additional files for grid cell sizes other than 810m are available. More
information on the RSEI grid can be found in the RSEI methodology document.

Attribute Table for Grid Shapefiles

Variable

Description

CELLX

Assigned grid cell value based on latitude.

Y

Assigned grid cell value based on longitude.

CLAT

Latitude for center point of grid cell.

CLONG

Longitude for center point of grid cell.

CX

Vertical distance from the grid center point to grid cell (m). Equivalent to
CELLX*grid size (m) (for standard RSEI grid, CELLX*810).

CY

Horizontal distance from the grid center point to grid cell (m). Equivalent to
CELLY*grid size (m) (for standard RSEI grid, CELLY*810).

Shapefiles- Older Version (Grid geography)

RSEI shapefiles define the grid and can be used for mapping. They do not contain any RSEI results. There
are two sets: polygon (con_us_810m_poly) and center point (con_us_810m). The grid is split into 4 files

22


-------
RSEI Data Dictionary, Version 2.3.6, RY2016

for each type, numbered 1-4. The attribute table is the same for all shapefiles. More information on the
RSEI grid can be found in the RSEI methodology document.

Attribute Table for Grid Shapefiles

Variable

Description

X

Assigned grid cell value based on latitude.

Y

Assigned grid cell value based on longitude.

LONGX

Easting coordinate for Albers projection.

LATY

Northing

LONGITUDE

Longitude for center point of grid cell.

LATITUDE

Latitude for center point of grid cell.

RADIALDIST

Radial distance from center point of grid (m).

AREA

Area of grid (m) (note that grid cells vary slightly in size).

NORTHADJ

Internal.

23


-------
[revised 12/18/2017]

RSEI Data Dictionary, Version 2.3.6, RY2016

24


-------