RSEI Data Dictionary, Version 2.3.6, RY2016 RSEI Data Dictionary Contents Facility-level Data Tables 2 Facility 2 EPA Program Identifiers for Reporting Facilities 5 Off-site 5 EPA Program Identifiers for Off-Site Facilities 7 Standard Industrial Classification (SIC) 7 North American Industry Classification System (NAICS) 8 Chemical 8 Maximum Contaminant Level (MCL) 12 Media 13 Submission 13 Release 14 Elements 14 Category 15 RSEI Geographic Microdata 17 Disaggregated Microdata 17 Aggregated Microdata 18 Averaged Block Group Microdata 19 Water Microdata 20 Other Available Data 20 Census Crosswalks 20 Population Data (US Decennial Census) 21 Shapefiles- Current Version (Grid geography) 22 Shapefiles- Older Version (Grid geography) 22 1 ------- RSEI Data Dictionary, Version 2.3.6, RY2016 RSEI Data Dictionary This document describes all of the data tables and fields used in the RSEI model and results data sets. Additional information can be found in the RSEI methodology document. Facility-level Data Tables This dataset presents RSEI results at the facility release level, and is the basic dataset for results that can be found in Envirofacts and EasyRSEI. These tables are distributed in the RSEI Queries database as well as in a set of flat csv files. These tables link to the Geographic Microdata using keys like FacilityNumber, ChemicalNumber, and ReleaseNumber. Note that the key values change with each version of RSEI, so you must use the same version of these tables as the Microdata. Facility The facility table contains data for reporting facilities, including location, stack parameters and discharge reach, and is also available in EasyRSEI. Note that, with Version 2.3.6, EPA program IDs for RCRA and ICIS-NPDES/PCS are provided in a separate table. RSEI Facility Table, spreadsheet RSEI Facility Table, text format Facility Data Variable Name Description Facility 1D Unique TRI identifier for facility (TRI Facility ID). FacilityNumber Internal identifier unique to each facility, (key for table) Latitude Final latitude of the facility in decimal degrees used for modeling. Longitude Final longitude of the GridCode Number that identifies the model grid within which the cell is located. X Assigned grid value based on latitude. Y Assigned grid value based on longitude. RadialDistance Distance from approximate center point of grid. StackHeight Height of facility stack that is emitting the pollutant (m). StackVelocity Rate at which the pollutant exits the stack (m/s). StackDiameter Diameter of facility stack that is emitting the pollutant (m). StackHeightSource Source of information on stack height. StackVelocitySource Source of information on stack velocity. StackDiameterSource Source of information on stack diameter. NEIYear National Emissions Inventory (NEI) version year, if NEI data were used for stack parameters. FacilityName TRI facility name. Street Street address of facility. City City where the TRI facility is located. County County where the TRI facility is located. State State in which the facility is located. 2 ------- RSEI Data Dictionary, Version 2.3.6, RY2016 Facility Data Variable Name Description ZIPCODE Five-digit facility ZIP code. ZIP9 Nine digit facility ZIP code, if reported. FIPS FIPS (Federal Information Processing Standard) code which identifies the county associated with the facility. STFIPS FIPS (Federal Information Processing Standard) code which identifies the state associated with the facility. DUNS The 9-digit number assigned by Dun & Bradstreet for the facility or establishment within the facility. REGION EPA region where facility is located. FederalFacilityFlag Code describing federal status for purposes of Executive Order 12856. FederalAgencyName Name of Federal Agency of which the federal facility is a part. ParentName Name of the corporation or other business entity located in the U.S. that directly owns at least 50 percent of the voting stock of the facility, as submitted by the TRI facility. ParentDUNS The 9-digit number assigned by Dun & Bradstreet for the US parent company. StandardizedParentComp any Name of parent company, checked for consistency so that records can be aggregated by parent company. PublicContactName Name submitted by TRI facility as public contact. PublicContactPhone Phone number submitted by TRI facility for public contact. Extension Extension number, if any, associated with contact phone number. PCT_CH6 Percent of chromium released that is assumed to be hexavalent (the remainder is assumed to be trivalent with negligible toxicity and not modeled. ChromHexPercent Percent of chromium released that is assumed to be hexavalent (the remainder is assumed to be trivalent with negligible toxicity and not modeled (same as PCT_CH6). Chrom Source Source for PCT_CH6/ChromHexPercent. ModChrom Releases True if facility has released or transferred chromium or chromium compounds to modeled media (fugitive/stack air releases, direct water, POTWs or off-site incineration). NewlndustryFlag True if the facility's primary NAICS was added to TRI in the TRI industry expansion beginning in reporting year 1998. NAICS1 Facility-level primary North American Industry Classification System (NAICS) code assigned by RSEI for modeling purposes (note that in TRI a facility can have a different NAICS code for each Form R). If more than one primary NAICS is reported by the facility, the most frequently reported primary NAICS for the most recent year is selected. Information on NAICS can be found at the Census website at https://www.census.gov/eos/www/naics/ NAICS2 Facility's most frequently reported non-primary 6-digit NAICS code. NAICS3 second most frequently reported non 3 ------- RSEI Data Dictionary, Version 2.3.6, RY2016 Facility Data Variable Name Description NAICS4 Facility's third most frequently reported non-primary 6-digit NAICS code. NAICS5 Facility's fourth most frequently reported non NAICS6 Facility's fifth most frequently reported non-primary 6-digit NAICS code. NAICSCode3Digit First 3 digits of facility's primary NAICS code. NAICSCode4Digit First 4 digits of facility's primary NAICS code. NAICSCode5Digit 5 SIC1 Facility-level SIC code that corresponds to the assigned facility-level NAICS code. FRSID EPA's Facility Registry System ID. AssignedReach NHDPIus reach identifier for final facility discharge reach. AssignedCOMID segment identifier for final facility discharge reach. ReachSource Source for final discharge assignment. OutfallLatitude Latitude for outfall. OutfallLongitude Longitude for outfall. OutfallSource Source for outfall coordinates. NearReach NHDPIus reach identifier for nearest discharge reach. NearCOMID segment identifier for nearest discharge reach. NPDESReach NHDPIus reach identifier for discharge reach as reported to ICIS-NPDES. NPDESCOMID segment identifier for discharge reach reported to ICIS NPDESYear Year of ICIS-NPDES data used. DistanceToReach The distance between an off site facility discharging to water and the reach of the receiving water body (m). HEM3ID The ID assigned to the nearest National Weather Service (NWS) observation station. DistanceToHEM3 LatLongSource Source of final lat/long found in 'Latitude' and 'Longitude' fields. LLYear Year of lat/long data. LLNotes Notes for facility location. Confirmed True if facility location has been confirmed via satellite image. WaterReleases True if facility reports direct releases to water for any year since 1988. DistanceToTribalLand Distance to nearest Tribal Land within ten miles (miles) TribalLandName Name of nearest Tribal Land within ten miles, if any. Full NameTribal Land Full or official name of Tribal land. ChromReleases True if facility reports chromium modeled releases or transfers for any year since 1988. ModeledReleases True if facility reported modeled releases or transfers since 1988 (fugitive or stack air releases, direct water releases, or transfers to off-site incineration or POTWs). 4 ------- RSEI Data Dictionary, Version 2.3.6, RY2016 EPA Program Identifiers for Reporting Facilities This table contains program identifiers for each TRI reporting facility, from EPA's Facility Registry Service (FRS). Resource Conservation and Recovery Act (RCRA) program, and National Pollutant Discharge Elimination System (NPDES) . The FRS identifier can be used to link to any other EPA program identifiers. Program Identifiers for TRI Reporting Facilities, spreadsheet Program Identifiers for TRI Reporting Facilities, text format Program Identifiers for Reporting Facilities Variable Description FRSID FRS program identifier tril, tri2... TRI facility identifier associated with FRS record (FRS identifiers are assigned to unique facilities in EPA's FRS system, and in a few cases there are multiple TRI identifiers for one FRS identifier). rcral, rcra2... RCRA identifier associated with FRS record. npdesl, npdes2... NPDES identifier associated with FRS record. Off-site The Off-site table contains the condensed list of quasi-unique off-site facilities to which TRI reporters transfer waste. Only incinerators and POTWs are modeled by RSEI, so verification of addresses and locations are focused on those off-site facilities. RSEI Off-Site Table, spreadsheet RSEI Off-Site Table, text format Off-Site Data Variable Description OffsitelD Unique internal identifier for each off-site facility. FacilityNumber Unique internal identifier for each off-site facility. [Note this is different from the FacilityNumber field in the Facility table] POTWJncin Identifies off-site facilities for which releases are modeled: 1= POTW; 2=lncinerator; 3=POTW and Incinerator. Droplncinerator True if off-site has been identified as a TRI reporter or a RCRA hazardous waste incinerator. Name Best submitted name for off-site facility. Street Best submitted street address for off-site facility. City Best submitted city for off State Best submitted state for off-site facility. ZIPCode Best submitted ZIP code for offsite facility. ZIP9 This variable is not yet implemented. Latitude Geocoded latitude in decimal degrees for off-site facility. Longitude Geocoded longitude in decimal degrees for off-site facility. GridCode Number that identifies the model grid within which the cell is located. 5 ------- RSEI Data Dictionary, Version 2.3.6, RY2016 Off-Site Data Variable Description Country Null if off-site facility is located in the U.S.; otherwise country in which off-site facility is located. X Assigned grid value based on latitude. Y Assigned grid value based on longitude. Radial Distance Distance from approximate center point of grid. StackHeight Stack height used for modeling. StackVelocity velocity used for StackDiameter Stack diameter used for modeling. StackParameterSource Null if default stack parameters were used; otherwise source for stack parameters. HEM3ID The ID assigned to the nearest National Weather Service (NWS) observation station. DistanceToHEM3 WBANID The ID assigned to the Weather Bureau/Army/Navy Weatherstation nearest to the facility. DistanceToWBAN The distance between a facility and the nearest WBAN weather station (m). WaterReleases True if off-site facility receives transfers to POTW. OutfallLongitude Latitude associated with end of the pipe used for off-site facility's discharge to water. OutfallLatitude Longitude associated with end of the pipe used for off-site facility's discharge to water. NearReach 14-digit NHDPIus reach identifier associated with the reach that is nearest to off site facility. NearComID ComID from NHDPIus dataset that uniquely identifies reach segment nearest facility. DistanceToReach The distance between an off-site facility discharging to water and the reach of the receiving water body (m). AssignedReach 14-digit NHDPIus reach identifier associated with reach assigned by EPA or determined through QA. AssignedComID ComID from NHDPIus dataset that uniquely identifies reach segment for assigned reach. ReachSource Data source linking stream reach to facility. ReachNotes Notes pertaining to stream reach assignment. LocationType Type of geocoded match. LatLongSource Source used to determine lat/longs. NA identifies records with insufficient information to determine location. LatLongYear Year lat/long was last updated. LockLL True if location was confirmed as correct using satellite data. CentroidAdjustment True if facility's FRS coordinates were modified from front door or street to approximate center of facility. Notes on Coordinates Notes on how lat/long was derived. AdditionalSourcesForLocation Web site, if any, used to determine location. 6 ------- RSEI Data Dictionary, Version 2.3.6, RY2016 Off-Site Data Variable Description LocationConfidence Code describing confidence in location assigned to off-site: 1 = confirmed in satellite view on map. 2 = Substantial information supporting location, including physical features such as settling ponds (2a), or a match to a business entry in google maps (2b). 3 = Geocoded address looks plausible given type of facility. 4 = No information to support geocoded address. Foreign 1 if off-site is located outside the U.S. GeoMatchType If LatLongSource=ESRI, shows the basis upon which the coordinates were assigned, such as street address, postal code, district, etc. EPA Program Identifiers for Off-Site Facilities This table contains program identifiers for each off-site facility that receives reported transfers from TRI facilities. RSEI condenses the off-site transfer reports into approximately unique facilities (some duplication may remain),and matches the off-sites to records in EPA's Facility Registry Service (FRS), Resource Conservation and Recovery Act (RCRA) program, and National Pollutant Discharge Elimination System (NPDES). Matches are based on name and address using approximate text matching; program identifiers should be verified before any analysis is finalized. Program Identifiers for TRI Off-Site Facilities, spreadsheet Program Identifiers for TRI Off-Site Facilities, text format Program Identifiers for Off-Site Facilities Variable Description OffsitelD RSEI internal identifier for each unique off-site facility. FRSID FRS program identifier tril, tri2... TRI facility identifier associated with FRS record (FRS identifiers are assigned to unique facilities in EPA's FRS system, and in a few cases there are multiple TRI identifiers for one FRS identifier). rcral, rcra2... RCRA identifier associated with FRS record. npdesl, npdes2... NPDES identifier associated with FRS record. Standard Industrial Classification (SIC) This table is no longer maintained in RSEI. NAICS codes are now used to determine industry-level stack heights and chromium speciation rates. 7 ------- RSEI Data Dictionary, Version 2.3.6, RY2016 North American Industry Classification System (NAICS) NAICS codes are collected by TRI. NAICS Table, spreadsheet NAICS Table, text format NAICS Data Variable Description NAICSCode Six-digit NAICS code. LongName Text description of code. Chemical The chemical table contains data for chemicals reported to TRI, including toxicity, physico-chemical properties, and flag fields to facilitate user selections. The chemical table is also available in EasyRSEI. RSEI Chemical Table, spreadsheet RSEI Chemical Table, text format Chemical Data Field Name Field Description CASNumber Chemical Abstracts Service Registry Number, which identifies a unique chemical. For chemical categories, CAS Numbers begin with "N", followed by three digits. CASStandard The Chemical Abstracts Service Registry Number identifies a unique chemical. The standard format contains three sets of numbers divided by hyphens (00-00-0). ChemicalNumber Unique internal identifier. SortCAS Chemical Abstracts Service Registry Number, which identifies a unique chemical, formatted for sorting (no hyphens). For chemical categories, CAS Numbers begin with "N", followed by three digits. SortName Common name of chemical, with initial modifiers moved to end of name. Used for internal sorting purposes. FullChemicalName Full scientific name(s) of the chemical. Common name(s) of the chemical. Added The year the chemical was added to the Toxics Release Inventory. This field is blank when Chemical is invalid, mixture or trade secret. Toxicity Source All sources used for toxicity data, and date of addition to database. RfClnhale The inhalation reference concentration (RfC) is defined as "an estimate (with uncertainty spanning perhaps an order of magnitude) of a continuous inhalation exposure to the human population (including sensitive subgroups) that is likely to be without appreciable risk of deleterious noncancer health effects during a lifetime". Units are mg/m3. 8 ------- RSEI Data Dictionary, Version 2.3.6, RY2016 Chemical Data Field Name Field Description RfCUF The uncertainty factor (UF) is applied to the no-observed-adverse-effect level (NOAEL) upon which the RfC is based, thereby reducing the dose. The UF accounts for uncertainties in extrapolation from experimental data to an estimate appropriate to humans. RfCMF The modifying factor (MF) is a value applied to the NOAEL when scientific uncertainties in the study chosen for estimating the RfC are not explicitly addressed by the standard UFs. RfCConf Confidence levels are assigned to the study used to derive the RfC, the overall database, and to the RfC itself. RfCSource Source used for the RfC value. RfCListingDate Date that RfC was listed, if available. RfCToxWeight Toxicity weight based on the RfC (RfCToxWeight = 3.5/RfC). Noncancer/inhalation. RfDOral The oral reference dose (RfD) is "an estimate (with uncertainty spanning perhaps an order of magnitude) of a daily exposure [by ingestion] to the human population (including sensitive subgroups) that is likely to be without an appreciable risk of deleterious effects during a lifetime", (mg/kg-day) RfDUF The uncertainty factor (UF) is applied to the no-observed-adverse-effect level (NOAEL) upon which the RfD is based, thereby reducing the dose. The UF accounts for uncertainties in extrapolation from experimental data to an estimate appropriate to humans. RfDMF The modifying factor (MF) is a value applied to the NOAEL when scientific uncertainties in the study chosen for estimating the RfD are not explicitly addressed by the standard UFs. RfDConf Confidence levels are assigned to the study used to derive the RfD, the overall database, and to the RfD itself. RfDListingDate Date that RfD was listed, if available. RfDSource Source used for the RfD value. RfDToxWeight Toxicity weight based on the (RfDToxWeight = 1/RfD). Noncancer/oral. UnitRisklnhale The unit inhalation risk is the excess lifetime risk due to a "continuous constant lifetime exposure of one unit of carcinogen concentration"(51 FR 33998). (l/mg/m3) QSTAROral The oral cancer slope factor (ql*) or oral slope factor (OSF): a measure of the incremental lifetime risk of cancer by oral intake of a chemical, expressed as risk per mg/kg-day. (1/mg/kg-day) 9 ------- RSEI Data Dictionary, Version 2.3.6, RY2016 Chemical Data Field Name Field Description WOE Weight of evidence (WOE) categories indicate how likely a chemical is to be a human carcinogen, based on considerations of the quality and adequacy of data and the type of responses induced by the suspected carcinogen. EPA WOE classifications include the following categories and associated definitions (51 FR 33996): A Carcinogenic to humans B Probable carcinogen based on: • B1 Limited human evidence • B2 Sufficient evidence in animals and inadequate or no evidence in humans: C Possible carcinogen D Not classifiable E Evidence of non-carcinogenicity UnitRiskListingDate Date that Unit Risk was listed, if available. UnitRiskSource Source used for the Unit Risk value. lURToxWeight Toxicity weight based on the IUR (lURToxWeight = IUR/2.8e-7). Cancer/inhalation. QStarListingDate Date that QStar was listed, if available. QStarSource Source used for the QStar value. OSFToxWeight Toxicity weight based on the QStar or OSF (OSFToxWeight = QSTAROral/le-6). Cancer/oral. WOEListingDate Date that WOE was listed, if available. WOESource Source used for the WOE classification. ITW Inhalation Toxicity Weight: the RSEI toxicity weight for a chemical for the inhalation pathway. OTW Oral Toxicity Weight: the RSEI toxicity weight for a chemical for the oral pathway. ToxicityClassOral This indicates whether the toxicity weight for the oral pathway is based on cancer or noncancer health effects. ToxicityClasslnhale This indicates whether the toxicity weight for the inhalation pathway is based on cancer or noncancer health effects. ToxicityCategory This indicates whether the oral and inhalation toxicity weights are based on cancer health effects, non-cancer health effects, or both. AirDecay The rate at which a chemical degrades in air, due primarily to photooxidation by radicals (hr-1). Koc The organic carbon-water partition coefficient, used in estimates of chemical sorption to soil (mL/g). H20Decay The rate at which a chemical degrades in water, due to abiotic hydrolysis, biodegradation, or photolysis (hr-1). LOGKow The logarithm of the octanol water partition coefficient. Kow is the ratio of a chemical's concentration in the octanol phase to its concentration in the aqueous phase at equilibrium in a two-phase octanol/water system. 10 ------- RSEI Data Dictionary, Version 2.3.6, RY2016 Chemical Data Field Name Field Description Kd The soil-water partition, or distribution, coefficient. For organics, the value is often estimated as the product of Koc and foe (the fraction of organic carbon in the soil) (L/kg). WaterSolubility The amount of chemical that dissolves in water at a particular temperature (mg/L). POTWPartitionRemoval Percent of chemical removed from the wastewater by the POTW (Publicly Owned Treatment Works). POTWPartitionSludge Percent of total POTW removal efficiency attributable to sorption of the chemical to sewage sludge. POTWPartitionVolat Percent of total POTW removal efficiency attributable to volatilization of the chemical. POTWPartitionBiod Percent of total POTW removal efficiency attributable to biodegradation of the chemical. IncineratorDRE Destruction/removal efficiencies, expressed as the percent of chemical fed to the incinerator that is not released to the air. BCF Bioconcentration factor: the ratio of a chemical's concentration in fish to its concentration in water at equilibrium (L/kg). Henrys Henry's law constant: the ratio of a chemical's concentration in the air to its concentration in the water at equilibrium (atm-m3/mol). MCL Maximum Contaminant Level, which is EPA's national primary drinking water standard for the chemical. This is the current value; historical data are contained in the table, 'MCL.' Molecular Weight The mass in grams of one mole of molecules of the chemical. HAPFlag This flag marks the chemicals that are hazardous air pollutants, as defined by the Clean Air Act. CAAFlag This flag marks the chemicals that are Clean Air Act pollutants. PriorityPollutantFlag priority pollutants, as defined by the Clean Water Act. SDWAFlag This flag marks the chemicals that have national primary or secondary drinking water standards under the Safe Drinking Water Act. CERCLAFlag This flag marks the chemicals that are regulated under Superfund (CERCLA—the Comprehensive Environmental Response, Compensation, and Liability Act). OSHACarcinogens This flag indicates whether the chemical is a known or suspect human carcinogen based on OSHA criteria. Known human carcinogens are defined as those that have been shown to cause cancer in humans. Suspect human carcinogens have been shown to cause cancer in animals. The list of chemicals flagged as OSHA carcinogens is based on the list of carcinogens provided in the 1997 TRI Public Data Release.* ExpansionFlag This flag marks the chemicals that were added to the Section 313 toxic chemical list for the 1995 Reporting Year. Core88ChemicalFlag This flag marks the chemicals that are common to all reporting years of TRI and that have had no modifications of reporting requirements, as determined by the 1988 Core Chemical List found on the TRI Explorer website. 11 ------- RSEI Data Dictionary, Version 2.3.6, RY2016 Chemical Data Field Name Field Description Core95ChemicalFlag This flag marks the chemicals that are common to TRI reporting years 1995 through the current year and that have had no modifications of reporting requirements in that time period, as determined by the 1995 Core Chemical List found on the TRI Explorer website. Core98ChemicalFlag This flag marks the chemicals that are common to TRI reporting years 1998 through the current year and that have had no modifications of reporting requirements in that time period, as determined by the 1998 Core Chemical List found on the TRI Explorer website. CoreOOChemicalFlag This flag marks the chemicals that are common to TRI reporting years 2000 through the current year and that have had no modifications of reporting requirements in that time period. CoreOlChemicalFlag This flag marks the chemicals that are common to TRI reporting years 2001 through the current year and that have had no modifications of reporting requirements in that time period. The only difference between this flag and the CoreOOChemicalFlag is the inclusion of lead and lead compounds. HPVFlag Indicates whether the chemical is designated as a High Production Chemical. HPVChallengeValue Describes the value or combination of values assigned to the chemical by EPA's HPV Challenge program to describe the chemical's status under the program. PBTFlag Indicates whether EPA has designated this chemical as a priority chemical under the Persistent Bioaccumulative and Toxic (PBT) Chemical Program. Metal This flag indicates whether the chemicals are metals and also whether they are core chemicals. (Core chemicals are those that are common to all reporting years of TRI and which have had no modifications of reporting requirements.) HasTox Indicates that the chemical has a toxicity weight (either oral or inhalation) in the data set. MaxTW Shows the greater of the two possible toxicity weights (oral or inhalation). Notes Additional information regarding assignment of toxicity or physicochemical data. Maximum Contaminant Level (MCL) MCLs are used to cap maximum concentrations in drinking water systems. RSEI MCL Table, spreadsheet RSEI MCL Table, text format MCL Data Variable Description CASNumber Chemical Abstracts Service Registry Number, which identifies a unique chemical. For chemical categories, CAS Numbers begin with "N", followed by three digits. 12 ------- RSEI Data Dictionary, Version 2.3.6, RY2016 MCL Data CASStandard The Chemical Abstracts Service Registry Number identifies a unique chemical. The standard format contains three sets of numbers divided by hyphens (00-00- 0). ChemicalNumber Unique internal identifier (links to Chemical table). Common name of the chemical. | MCL1988...MCL2016 MCL for each year an MCL was in effect. Media The media table provides descriptions for the media codes used in the Release table. RSEI Media Table, spreadsheet RSEI Media Table, text format Media Data Variable Description Media (RSEI Media Code) Code associated with the media and/or method of release, as in TRI Reporting Form R, except that "M" is replaced with "1" site transfers. reported by facility in the code for off- MediaText (Short Description) Descriptions of receiving media associated with Media Code. TRICode Code associated with the media and/or method of release, as in TRI Reporting Form R. reported by facility TRICategory -assigned waste treatment category. LongDescription Longer version of media text field. Submission The submission table contains Form R information submitted to TRI. The Submission, Elements and Release tables are too large for spreadsheet format. RSEI Submission Table, text format Submission Data Variable Description DCN Unique identifier assigned by TRI to each facility submission (document control number). SubmissionNumber Internal identifier assigned to each submission. FacilityNumber Internal identifier unique to each facility (links to Facility table). ChemicalNumber Internal identifier unique to each chemical (links to Chemical table). SubmissionYear Year of facility release. 13 ------- RSEI Data Dictionary, Version 2.3.6, RY2016 Submission Data Variable Description Use Code describing how chemical is used in reporting facility, as reported on TRI Reporting Form R. See On-site Chemical Information for an explanation of the codes. MaxOnsite Code describing the maximum amount of the chemical on-site at reporting facility, as reported in TRI Reporting Form R. See On site Chemical Information for an explanation of the codes. Release This table contains data for each chemical release. There can be multiple release records per submission record. The Submission, Elements and Release tables are too large for spreadsheet format. RSEI Release Table, text format Release Data Variable Description 1 ReleaseNumber Unique internal identifier. SubmissionNumber Unique internal identifier (links to Submission table). Media Code associated with the media and/or method of release, as reported by facility in TRI Reporting Form R. See Media table for explanation of codes. PoundsReleased Total pounds released, without accounting for treatment. OffsiteNumber Unique identifier for off-site facility receiving this release, if any. Links to Facility Number in the Off site table. TEF Toxicity Equivalency Factor used to adjust toxicity for dioxins. Elements The Elements table contains the calculated results for each release. There can be multiple elements records for each release. Note that all values in the elements table are rounded to six significant figures. The Submission, Elements and Release tables are too large for spreadsheet format. RSEI Elements Table, text format Elements Data Variable Description ElementNumber Unique internal identifier. ReleaseNumber Unique internal identifier (links to Release table). PoundsPT (TRI Pounds) Total pounds after any treatment by POTWs or other offsite facilities. 14 ------- RSEI Data Dictionary, Version 2.3.6, RY2016 Elements Data Variable Description ScoreCategory Codes corresponding to the medium into which the chemical is released. Examples of the information include: direct air releases from the stack using a "rural" air dispersion model, fugitive air releases, releases to an onsite landfill. [See Score Category Information for descriptions] Score Total Indicator Element score- modeled surrogate dose multiplied by toxicity weight and by population, using the higher cancer/noncancer toxicity weight for each air/water pathway. Population Total population exposed. ScoreA Score for children 0 through 9 years of age (inclusive). PopA Number of exposed children ScoreB Score for children 10 through 17 years of age (inclusive). PopB Number of exposed children 10 ScoreC Score for adults 18 through 44 years of age (inclusive). PopC Number of exposed ScoreD Score for adults 45 through 64 years of age (inclusive). PopD Number of exposed ScoreE Score for adults 65 years old and greater. PopE Number of exposed NCScore (NonCancer Score) Indicator Element score, limited to chemicals with non-cancer endpoints. CScore (Cancer Score) Indicator Element score, limited to chemicals with cancer endpoints. Hazard Toxicity weight times TRI pounds, using the higher cancer/noncancer toxicity weight for each air/water pathway. HazardC (Cancer Hazard) Toxicity weight times TRI pounds, limited to chemicals with cancer endpoints. HazardNC (Non-Cancer Hazard) Toxicity weight times TRI pounds, limited to chemicals with non-cancer endpoints. Category The Category table describes the codes used in the Elements table to indicate the release pathway. RSEI Category Table, spreadsheet RSEI Category Table, text format Category Data Variable Description ScoreCategory Codes corresponding to the medium into which the chemical is released. Examples of the information include: volatilization from a transfer to a POTW, fugitive air releases, releases to an onsite landfill. Category Descriptions of release media and other descriptors corresponding with the score category codes. 15 ------- RSEI Data Dictionary, Version 2.3.6, RY2016 Category Data Variable Description Model A variable that is '1' when that category can be modeled and '0' when it cannot. InhaleTox A variable that is '1' when the model requires an inhalation toxicity score to model this kind of release and '0' when it does not. 16 ------- RSEI Data Dictionary, Version 2.3.6, RY2016 RSEI Geographic Microdata A separate guidance is available for use with the Microdata. Disaggregated Microdata These are the raw Microdata files that contain the most disaggregated data possible. For each 810m grid cell, the file contains scores, concentrations, and tox-weighted concentrations for each chemical release. There may be multiple records for any one grid cell. Note that if two releases for the same chemical (either from different facilities or one from a stack release and one from a fugitive release from the same facility) affect the same grid cell, there will be separate records for each grid release. Naming: These annual files have historically been named MicroXXXX_YYYY, where XXXX is the reporting year for the data freeze, and YYYY is the year of the data contained in the file. So Micro 2014_2010 is from the RY2014 RSEI update, and contains data for chemicals released in 2010. The new naming convention substitutes the version number for the version year, as in vXXX_micro_YYYY, where XXX is the version number and YYYY is the year of the data contained in the file; for example v234_micro_2014.csv.There is one annual file for the entire country, which is over 100 GB in size. Disaggregated Microdata Table Field Number Name Description 1 GridCode Identifies grid. . 14=Conterminous US 24=Alaska 34=Hawaii 44=Puerto Rico/Virgin Islands 54=Guam/Marianas 64=American Samoa 2 X X-coordinate of grid cell 3 Y Y Coordinate of grid cell 4 ReleaseNumber Internal unique identifier for release (lookup in table "Release")* 5 ChemicalNumber Internal unique identifier of released chemical (lookup in table "Chemical")* 6 FacilityNumber Internal unique identifier of releasing facility (lookup in table "Facility" if media = 1 or 2; if media = 6 or 750 or 754, then lookup in table "Offsite")* 7 Media Code describing media into which chemical is released. (lookup in table "Media")* 8 Cone Concentration of chemical for release/media at grid cell. 9 ToxConc Concentration multiplied by inhalation toxicity weight 10 Score Risk-related score (surrogate dose * toxicity weight * population) 17 ------- RSEI Data Dictionary, Version 2.3.6, RY2016 Disaggregated Microdata Table Field Number Name Description 11 ScoreCancer Risk-related score (surrogate dose * toxicity weight * population) using only toxicity values for cancer effects 12 ScoreNonCancer Risk-related score (surrogate dose * toxicity weight * population) using only toxicity values for noncancer effects 13 Pop Number of people in grid cell (may be interpolated) Aggregated Microdata Aggregated Microdata files use the same data as the disaggregated files, but sum the chemical releases over each grid cell. Because the values are summed, unweighted concentrations are not available (the sum of the concentrations of different chemicals would be meaningless). Naming: These annual files have historically been named MicroXXXX_YYYY, where XXXX is the reporting year for the data freeze, and YYYY is the year of the data contained in the file. So Micro 2014_2010 is from the RY2014 RSEI update, and contains data for chemicals released in 2010. The new naming convention substitutes the version number for the version year, as in vXXX_micro_YYYY, where XXX is the version number and YYYY is the data year; for example v234_micro_2014.csv.These files have historically been named in the format AggMicroXXXX_YYYY_GCZZ, where XXXX is the reporting year for the data freeze, YYYY is the year of the data contained in the file, and ZZ is the 2-digit grid code (see Field 1 in the Table 1 below for grid codes). The new naming convention substitutes the version number for the version year, as in vXXX_aggregated_micro_gcZZ_YYYY; for example, v234_aggregated_m icro_gcl4_2014.csv. Aggregated Microdata Table Field Number Name Description 1 X X-coordinate of grid cell 2 Y Y Coordinate of grid cell 3 NumberOf Facilities Number of facilities with releases affecting grid cell. 4 NumberOf Releases Number of individual releases affecting grid cell. 5 NumberOfChemicals Number of chemicals with nonzero concentrations for grid cell. 6 ToxConc Concentration multiplied by inhalation toxicity weight, summed over all chemicals impacting cell 7 Score Risk-related score (surrogate dose * toxicity weight * population), summed over all chemicals impacting cell 8 Pop 9 ScoreCancer Risk-related score (surrogate dose * toxicity weight * population) using only toxicity values for cancer effects 18 ------- RSEI Data Dictionary, Version 2.3.6, RY2016 Aggregated Microdata Table Field Number Name Description 10 ScoreNonCancer Risk-related score (surrogate dose * toxicity weight * population) using only toxicity values for cancer effects Averaged Block Group Microdata These files are the same as the aggregated Microdata files, but instead of being presented at the grid cell level, the values are averaged over Census block groups. The file BG_RSEI_XXXX_3yr is a csv file with the block group-level data averaged over 2012 through 2014. There are also shape files (tl_2010_bg_US_RSEI) with the same data; that is, the .dbf file and the .csv have the same fields. Averaged Block Group Microdata Field Number Name Description 1 GEOIDIO US Census Block Group ID 2 ALAND10 Land area of the block group (m2) 3 AW ATE RIO Water area of the block 4 TOXCONC Average toxicity-weighted concentration of the cells in the block group, averaged over three years. 5 PTOXCONC Percentile associated with field TOXCONC. 6 SCORE Average risk-related score (surrogate dose * toxicity weight * population) of the cells in the block group, averaged over three years. 7 PSCORE Percentile associated with field SCORE. 8 NCSCORE Average risk-related score (surrogate dose * toxicity weight * population) of the cells in the block group, averaged over three years. Score is calculated using only noncancer toxicity weights. 9 PNCSCORE Percentile associated with field NCSCORE. 10 Average risk-related score (surrogate dose * toxicity weight * population) of the cells in the block group, averaged over three years. Score is calculated using only cancer toxicity weights. 11 PCSCORE Percentile associated with field CSCORE. 12 POP Average population of the cells in the block group, averaged over three years. 13 PPOP Percentile associated with field POP. 14 COVERED Internal field. 15 FOUND 16 GC Grid code. 19 ------- RSEI Data Dictionary, Version 2.3.6, RY2016 Water Microdata This file contains the toxicity-weighted concentrations downstream of TRI discharges by stream segment. All years of data are contained in the file, which is named NHDMicroResults_conc_agg_XXXX, where XXXX is the reporting year of the data freeze. Water Microdata Field Number Name Description 1 ReleaseNumber Internal unique identifier for release (links to Release table). 2 Counter Auto-increment count of COMIDs 3 Com ID "Common Identifier" of a flowline (sub-segment of a reach)- atomic unit of reach data that matches one-to- one to NHDPIus. 4 ReachCode Code for reach 5 Cone Concentration of chemical in flowline (mg/L) 6 Sequence Number defining pathway of release (used to indicate branching). 7 TravelTime Time(s) for release to go from top of flowline to bottom. 8 TravelLength Distance (m) for release to go from top of flowline to bottom 9 Paths Number of branches in stream path 10 FCode Descriptor from NHDPIus for type of flowline (e.g., pipeline, stream) 11 ResCode Internal code Other Available Data Census Crosswalks Each set of crosswalk files links the RSEI grid cell geography to a different US decennial census year. There is one crosswalk for each area and decennial Census year (1990, 2000, 2010). Crosswalk files are named by area (Alaska, Con(terminous) US, etc.). The last three fields in each file contain percent values that can be used to adjust the block or cell contents when performing the crosswalk. PCT_B_C and PCT_C_B are area-weighted and can be used for metrics that do not involve population, such as concentration and toxicity-weighted concentration. PCT_PC_B is population weighted, and can be used to crosswalk fields that involve population, like score and pop. Note that the "PCT_CP_B" field is not available for the territories (VI, PR, GU, AS, MP). The Northern Mariana Islands are in the Guam file and the Virgin Islands are in the Puerto Rico file. There are no crosswalks for Puerto Rico, the Virgin Islands, Mariana Islands, Guam, or American Samoa for 1990. For these areas, RSEI uses 2000 block boundaries and scales each cell's population by the overall ratio of 1990/2000 population for each area. 20 ------- RSEI Data Dictionary, Version 2.3.6, RY2016 Census Crosswalk Table Field Number Name Description 1 GridID Identifies grid. 14=Conterminous US 24=Alaska 34=Hawaii 44=Puerto Rico/Virgin Islands 54=Guam/Marianas 64=American Samoa 2 X X coordinate of the cell address 3 Y Y coordinate of the cell address 4 BlockJDOO US Census Block ID 5 UR Internal 6 PCT_B_C Percent of the Census block that is within the cell (Block to Cell) 7 PCT_C_B Percent of the cell that is within the Census block (Cell to Block) 8 PCT_PC_B Percent of the cell's population that is within the Census block (Population-Cell to Block) Population Data (US Decennial Census) RSEI Census data are contained in three tables, Census 90 (data from the 1990 Census), Census 00 (data from the 2000 Census) and Census 10 (data from the 2010 Census). These three tables contain the Census data that has been transposed onto the RSEI model grid. Each Census table is over 600 MB in size. 1990 Census data have been provided by Geolytics, Inc. Census data were last updated in 2012. Census 90 Data Variable Description Grid Code Number that identifies the model grid within which the cell is located. X Assigned grid cell value based on latitude. Y Assigned grid cell value based on longitude. Male0to9 through Female65andUp The number of people in the grid cell in each Census subpopulation group in the year 1990. PrimaryFIPS The FIPS code for the county within which most or all of the grid cell is contained. 21 ------- RSEI Data Dictionary, Version 2.3.6, RY2016 Census 00 Data Variable Description Grid Code Number that identifies the model grid within which the cell is located. X Assigned grid cell value based on latitude. Y Assigned grid cell value based on longitude. Male0to9 through Female65andUp The number of people in the grid cell in each Census subpopulation group in the year 2000. PrimaryFIPS The FIPS code for the county within which most or all of the grid cell is contained. Census 10 Data Variable Description Grid Code Number that identifies the model grid within which the cell is located. X Assigned grid cell value based on latitude. Y Assigned grid cell value based on longitude. Male0to9 through Female65andUp The number of people in the grid cell in each Census subpopulation group in the year 2010. PrimaryFIPS The FIPS code for the county within which most or all of the grid cell is contained. Shapefiles- Current Version (Grid geography) RSEI shapefiles define the grid and can be used for mapping. They do not contain any RSEI results. New shapefiles were posted on the RSEI ftp site in early 2017. The shapes are the same; however, the fields and format are different, and now additional files for grid cell sizes other than 810m are available. More information on the RSEI grid can be found in the RSEI methodology document. Attribute Table for Grid Shapefiles Variable Description CELLX Assigned grid cell value based on latitude. Y Assigned grid cell value based on longitude. CLAT Latitude for center point of grid cell. CLONG Longitude for center point of grid cell. CX Vertical distance from the grid center point to grid cell (m). Equivalent to CELLX*grid size (m) (for standard RSEI grid, CELLX*810). CY Horizontal distance from the grid center point to grid cell (m). Equivalent to CELLY*grid size (m) (for standard RSEI grid, CELLY*810). Shapefiles- Older Version (Grid geography) RSEI shapefiles define the grid and can be used for mapping. They do not contain any RSEI results. There are two sets: polygon (con_us_810m_poly) and center point (con_us_810m). The grid is split into 4 files 22 ------- RSEI Data Dictionary, Version 2.3.6, RY2016 for each type, numbered 1-4. The attribute table is the same for all shapefiles. More information on the RSEI grid can be found in the RSEI methodology document. Attribute Table for Grid Shapefiles Variable Description X Assigned grid cell value based on latitude. Y Assigned grid cell value based on longitude. LONGX Easting coordinate for Albers projection. LATY Northing LONGITUDE Longitude for center point of grid cell. LATITUDE Latitude for center point of grid cell. RADIALDIST Radial distance from center point of grid (m). AREA Area of grid (m) (note that grid cells vary slightly in size). NORTHADJ Internal. 23 ------- [revised 12/18/2017] RSEI Data Dictionary, Version 2.3.6, RY2016 24 ------- |