* O \ lafc/ \ ' ef PRO"^ Recommended Procedures for Development of Emissions Factors and Use of the WebFIRE Database ------- EPA-453/B-21-001 November 2021 Recommended Procedures for Development of Emissions Factors and Use of the WebFIRE Database U.S. Environmental Protection Agency Office of Air Quality Planning and Standards Sector Policies and Programs Division Research Triangle Park, NC ------- Disclaimer This document has been approved for publication by the Office of Air Quality Planning and Standards (OAQPS), U.S. Environmental Protection Agency. Mention of trade names or commercial products in this document does not constitute endorsement by the agency. This document provides guidance on how the EPA intends to develop emissions factors and how regulated parties and the general public can use the WebFIRE database. This document is not intended, nor can it be relied upon, to affect legal rights or obligations. The EPA may decide to follow the guidance provided in this document, or to act at variance with the guidance based on its analysis of the specific facts presented. i ------- TABLE OF CONTENTS Section Page No. Section 1.0 What is the Purpose of This Document? 1-1 Section 2.0 What is an Emissions Factor? 2-1 2.1 Emissions Data 2-2 2.2 Activity Data 2-3 Section 3.0 How Have We Historically Developed Emissions Factors? 3-1 Section 4.0 How are Emissions Factors Used? 4-1 Section 5.0 What are EPA's Revised Procedures for Developing Emissions Factors? 5-1 5.1 Data Collection 5-1 5.2 Test Data Evaluation 5-3 5.3 Detection Limit Procedures for Calculating Test Run Averages 5-4 5.4 Grouping of Candidate Data and Identification of Outliers 5-5 5.5 Emissions Factor Derivation and Quality Assessment 5-5 Section 6.0 EPA's Interactive Database for the Emissions Factors Program - What is WebFIRE? 6-1 6.1 What is WebFIRE? 6-1 6.2 How is WebFIRE Used? 6-1 6.3 Who Uses WebFIRE? 6-3 6.4 How Does WebFIRE Improve Emissions Factor Identification and Development? 6-4 Section 7.0 How Do I Find an Emissions Factor? 7-1 7.1 How Do I Identify and Retrieve an Emissions Factor from WebFIRE? 7-1 7.2 How Do I Obtain Background Information for My Selected Emissions Factor? 7-4 Section 8.0 What Parameters Should I Consider When Using or Deriving an Emissions Factor? 8-1 8.1 Source Category and Process Considerations 8-1 8.2 Control Device Considerations 8-3 8.3 Pollutant Test Method Considerations 8-3 Section 9.0 How Do I Develop a User-Defined Emissions Factor? 9-1 9.1 How Do I Use WebFIRE to Create a User-Defined Emissions Factor? 9-1 9.2 What are the Potential Impacts Associated with Applying a User-defined Emissions Factor? 9-3 Section 10.0 How Do I Submit Data to WebFIRE? 10-1 ii ------- 10.1 How Are Emissions Tests Documented? 10-3 10.2 What are the CDX and CEDRI and What are their Roles in Submitting Data to WebFIRE? 10-6 Section 11.0 What is the Data Review and Public Participation Process for Emissions Factor Development? 11-1 Appendix A - Procedures for Determining Individual Test Report Quality Ratings Appendix B - Procedures for Handling Test Data That are Below the Method Detection Limits Appendix C - Procedures For Determining Statistical Outliers Appendix D - Emissions Factor Development and Data Quality Characterization Procedures Appendix E - Statistical Procedures for Determining Valid Data Combinations Source classification codes for source categoires containing 15 or fewer sources iii ------- LIST OF FIGURES Figures Page No. Figure 5-1. EPA's Revised Procedures for Developing Emissions Factors 5-2 Figure 6-1. WebFIRE Overview 6-2 Figure 7-1. Procedures for Retrieving Emissions Factors from WebFIRE 7-3 Figure 9-1. Emissions Factor Derivation in WebFIRE 9-2 Figure 10-1. Typical Work Flow When Using the ERT 10-5 Figure 11-1. Overview of the WebFIRE Public Participation and Emissions Factor Development Process 11-2 Figure C-l. Procedures to Identify Data Outliers in a Candidate Data Set C-2 Figure D-l. Emissions Factor Representativeness Areas for Source Categories Containing More Than 15 Sources D-3 Figure D-2. Emissions Factor Representativeness Areas for Source Categories Containing 15 or Fewer Sources D-4 Figure D-3. Plot of CTR and N Data from Table D-3 D-8 Figure D-4. Plot of Selected Data from Table D-6 D-10 iv ------- LIST OF TABLES Tables Page No. Table 7-1. Data Fields Reported by WebFIRE Emissions Factor Search 7-4 Table A-l. Test Report Quality Rating Tool A-5 Table B-l. Summary of WebFIRE Procedures for Handling ADL, BDL, and DLL Test Data in Calculating a Test Average from Run-Level Data B-2 Table D-l. FQI and Boundary Line Equations D-2 Table D-2. Individual Test Data and Various Characteristics D-6 Table D-3. Individual Test Data Values Selected for Developing an Emissions Factor for a Source Category Containing 15 or Fewer Sources D-8 Table D-4. Individual Test Data and Various Characteristics for a Source Category with 15 or Fewer Sources D-9 Table E-l. Emissions Factor Characteristics for Group A and B E-2 Table E-2. Emissions Factor Characteristics for Group C and D E-3 ------- LIST OF ACRONYMS Acronym Term ADL Above Detection Limit AFS Air Facility System AFSEF AIRS Facility Subsystem Emission Factor AIRS Aerometric Information Repository System AP-42 Compilation of Air Pollutant Emission Factors, Volume I: Stationary Point and Area Sources ASCII American Standard Code for Information Interchange ASTM American Society for Testing and Materials BDL Below Minimum Detection Limit CAA Clean Air Act CARB California Air Resources Board CAS Chemical Abstracts Service CBI Confidential Business Information CDX Central Data Exchange CEDRI Compliance and Emissions Data Reporting Interface CEMS Continuous Emissions Monitoring System CO Carbon Monoxide C02 Carbon Dioxide CROMERR Cross-Media Electronic Reporting Regulation CSV Comma Separated Values CTR Composite Test Rating DGM Dry Gas Meter DLL Detection Level Limited EIS Emission Inventory System EMC Emission Measurement Center EPA Environmental Protection Agency ERT Electronic Reporting Tool ESA Electronic Signature Agreement ESP Electrostatic Precipitator EST Eastern Standard Time FAQs Frequently Asked Questions FIRE Factor Information Retrieval FQI Factor Quality Index HAP Hazardous Air Pollutants HTML Hypertext Markup Language ITR Individual Test Rating L&E Locating and Estimating MACT Maximum Achievable Control Technology MDL Minimum Detection Limit MLE Maximum Likelihood Estimator NEI National Emissions Inventory NELAP National Environmental Laboratory Accreditation Program NESHAP National Emission Standard for Hazardous Air Pollutants vi ------- Acronym NOx NSPS OAQPS PDF PDS PM PMio QA QA/QC QI RATA SCC SES S02 STAC THC TRI XATEF XML LIST OF ACRONYMS (Continued) Term Oxides of Nitrogen New Source Performance Standard Office of Air Quality Planning and Standards Portable Document Format Project Data Set Particulate Matter Particulate Matter with an aerodynamic diameter of 10 microns or less Quality Assurance Quality Assurance/Quality Control Qualified Individual Relative Accuracy Test Audit Source Classification Code Source Evaluation Society Sulfur Dioxide Stack Testing Accreditation Council Total Hydrocarbons Toxics Release Inventory Crosswalk/Air Toxics Emission Factor System Extensible Markup Language vii ------- Section 1.0 What is the Purpose of This Document? This guidance document describes the procedures, data evaluation criteria, and associated tools and data management systems that the U.S. Environmental Protection Agency (EPA) recommends for developing air pollutant emissions factors for stationary emissions units or processes. This document supersedes the previous EPA guidance document for emissions factor development {Procedures for Preparing Emission Factor Documents (EPA-454 R-95-015, November 1997)). This document presents an introduction to emissions factors and provides the historical background for how and why the EPA has developed emissions factors for stationary emissions units or processes. This document also describes the approach and procedures recommended by the EPA for developing new or revising existing emissions factors. This document provides an overview of the EPA's WebFIRE - an online data storage and emissions factor retrieval and development tool. Also discussed are the EPA's Electronic Reporting Tool (ERT) and WebFIRE template spreadsheet that facilitate the development and documentation of emissions test reports. In addition, this document presents procedures that may be followed by individuals and entities submitting emissions data and related process data to WebFIRE. Finally, this document provides an overview of the data review and public participation process that the EPA follows when developing new or revised emissions factors. This document is organized as follows: Section Contents 2.0 An overview of the characteristics that define an emissions factor. 3.0 A brief summary of the EPA's historical procedures used to develop emissions factors and the various support programs prepared by the agency. 4.0 A discussion of the various uses and limitations of emissions factors. 5.0 An overview of the agency's revised approach for developing EPA emissions factors. 6.0 An overview of WebFIRE, the EPA's online application for storage, retrieval and development of emissions factors. l-i ------- Section 1.0 What is the Purpose of This Document? Section Contents 7.0 The steps users can follow to identify and retrieve emissions factors from WebFIRE. 8.0 Considerations that should be evaluated when using or deriving emissions factors. 9.0 The procedures users can follow to develop a user-defined emissions factor from a collection of related data contained in WebFIRE. 10.0 The steps to follow to submit emissions and related process data to WebFIRE. 11.0 The process by which the public can participate in the periodic development of EPA's emissions factors. This document also contains the following appendices: Appendix Contents A Procedures for determining individual test report quality ratings B Procedures for handling test data that are below the method detection limits C Procedures for determining statistical outliers D Emissions factor development and data quality characterization procedures E Statistical procedures for determining valid data combinations F Source classification codes for source categories containing 15 or fewer sources 1-2 ------- Section 2.0 What is an Emissions Factor? An emissions factor is used to estimate air pollutant emissions from a normally-operating process or activity (e.g., fuel combustion, chemical production). An emissions factor relates the quantity of pollutants released to the atmosphere from a process to a specific activity associated with generating those emissions. For most application purposes, users typically assume that an emissions factor represents the average emissions for all emitting processes of similar design and characteristics (i.e., the emissions factor represents a population average). The simplest form of an emissions factor is a ratio of the mass of pollutant emitted per unit of activity generating the emissions (e.g., pounds of particulate matter (PM) emitted per ton of coal burned). Typically, emissions factors are used to estimate process emissions as follows: E = AxEFx [1 - (ER/100)] Where: E = emissions estimate, A = activity rate, EF = emissions factor, and ER = overall emissions reduction achieved by controls (%). Emissions factors for more complex processes or activities (e.g., paved and unpaved roads, organic liquid storage tanks) are typically expressed using empirical equations. The empirical equation relates independent variables to the source emissions and typically provides for improved predictive accuracy when compared to a simple emissions factor. For example, the following emissions factor for vehicles traveling on unpaved surfaces at industrial sites was taken from the EPA's Compilation of Air Pollutant Emission Factors, Volume I: Stationary Point and Area Sources (AP-42) (Fifth Edition, Section 13.2.2): E = k (s/12)a (W/3)b 2-1 ------- Section 2.0 What is an Emissions Factor? Where: E = particle size-specific emissions factor (pound/vehicle miles traveled), k = particle size multiplier (pound/vehicle miles traveled), s = surface material silt content (%), a, b = particle size-specific empirical constants, and W = mean vehicle weight (tons). 2.1 Emissions Data Typically, emissions data are obtained through direct measurement of releases from a process or activity (i.e., a sample of the process emissions is collected and analyzed). The emissions rate for the source, expressed in terms of mass of pollutant emitted per time unit (e.g., pounds of PM per hour), is calculated as the arithmetic average of the available, quality-assured test data. Depending on the sampling location and configuration of the process and associated control devices (if any), emissions data can reflect controlled or uncontrolled emissions. Direct measurements of facility or process emissions are conducted for a variety of reasons such as: • Characterize process emissions and/or control device performance, • Assess changes in process or control device operation on emissions, and • Demonstrate compliance with federal, state, local, or tribal air regulations. Emissions testing may also be conducted for purposes such as conducting relative accuracy test audits (RATAs), linearity checks (i.e., measures an instrument's ability to provide consistent sensitivity throughout the weighing range) and routine calibrations of continuous emissions monitoring system (CEMS) equipment. The emissions rate for a specific process can also be determined by using a mass balance approach. In general, mass balances are appropriate for use in situations where the mass of all the materials entering and exiting a process can be quantified. Using this mass balance approach, pollutant emissions are calculated as the difference in process inputs and outputs. For certain processes, a mass balance provides an easier and less expensive estimate of emissions than would be obtained by direct measurement. For example, carbon dioxide (CO2) emitted from a fuel combustion process can be estimated from the stoichiometric relationship of the chemical reactants (i.e., carbon contained in the fuel and oxygen in the combustion air), the amount of 2-2 ------- Section 2.0 What is an Emissions Factor? each reactant that is consumed in the combustion process and the amount of carbon remaining in any residual material (e.g., ash). Although a mass balance approach may be suitable for certain processes, this approach may not be appropriate to estimate emissions from a process or activity in which the accuracy or uncertainty of the quantities of input and output materials is a concern. 2.2 Activity Data The composition and magnitude of emissions generated by a process unit are affected by a variety of process parameters such as raw materials and fuels used; process operating conditions; equipment configuration and age; and the skill and experience of process operators. Activity data for use in developing emissions factors are the parameter(s) that directly influence the quality and quantity of emissions from a process unit. Generally, activity data are collected during an emissions test to verify that the process is operating at the desired production level (e.g., to satisfy an operating permit emissions limit). Activity data are typically expressed either in terms of a process input or output per time unit (e.g., gallons of oil burned per hour, tons of cement produced per day). For example, the activity data for a PM emissions factor for plywood manufacturing processes could be expressed in terms of the square feet of plywood produced per day. For an emissions rate determined using a material balance approach, the activity data would typically include one or more process parameters used in the material balance. 2-3 ------- Section 3.0 How Have We Historically Developed Emissions Factors? To assist the EPA in carrying out its responsibilities under the Clean Air Act, the Agency has developed methods with which to characterize and quantify air pollutant emissions from processes and activities on a nationwide basis. Because there were a large number of diverse emissions sources, developing national estimates based upon site-by-site emissions testing was not feasible. Consequently, we developed criteria and non-criteria pollutant emissions factors for certain industrial processes or source categories for use in preparing emissions inventories. These emissions factors were based upon emissions test data, material balance calculations, modeling and engineering judgment. In 1972, the EPA's Office of Air Quality Planning and Standards (OAQPS) published the first document containing the EPA's emissions factors and supporting documentation {Compilation of Air Pollutant Emission Factors, Volume I: Stationary Point and Area Sources (AP-42)). As an aid to end users, OAQPS developed relative quality ratings for the AP-42 emissions factors, based upon the EPA's analysis of the quality of the underlying test data values and how representative the emissions factor was for the particular source category for which it was developed. The letter-grade ratings (e.g., A for excellent, E for poor) were based primarily on engineering judgment and did not incorporate statistical error bounds or confidence intervals. Since its initial publication, we have periodically revised and updated AP-42 to incorporate new data and emissions-estimating methodologies. The last hard-copy version of AP-42 (fifth edition) was published in 1995; although, we have released six supplements (Supplement A through Supplement F) through 2000. After 2000, updates to AP-42 were provided only electronically. Currently, the fifth edition of AP-42, the supplements and related information are available at: www.epa.gov/air-emissions-factors-and-quantification/ap-42- compilation-air-emissions-factors. In addition to AP-42, we developed several other compilations of available emissions factors. To provide the user community with additional emissions factor information for air toxic pollutants beyond what was available in AP-42 at the time, we initiated the Locating & Estimating (L&E) document series in 1984. Unlike AP-42, which is organized by source 3-1 ------- Section 3.0 How Have We Historically Developed Emissions Factors? category, the majority of the L&E documents focused on a specific pollutant (e.g., arsenic, benzene) or related group of pollutants (e.g., polycyclic organic matter). The L&E documents made use of AP-42 emissions factors, where available; however, in some cases, the AP-42 emissions factors were revised or supplemented to present the most complete assessment of the emissions for the specific air pollutant. A total of 36 individual L&E documents were produced through 1998. We also compiled the Aerometric Information Retrieval System (AIRS) Facility Subsystem Emission Factors (AFSEF) and the Crosswalk/Air Toxics Emission Factors (XATEF) databases in 1990. The AFSEF database documented all emissions factors for criteria pollutants that existed in the AIRS mainframe look-up tables as of March 1990. The XATEF database contained emissions factors for toxic air pollutants that were developed based upon data available to the EPA through October 1990. Ultimately, the EPA retired the AFSEF and XATEF databases and created the Factor Information Retrieval (FIRE) Data System. The FIRE database contains emissions factors from all AP-42 sections posted by September 1, 2004, the L&E document series and the retired AFSEF and XATEF databases. Other specialized studies have produced documents containing average emissions rates for various processes which have been posted on the CHIEF web page and which may still represent the most currently-available estimation tools for those processes. In 1997, we provided guidance materials (Procedures for Preparing Emission Factor Documents, EPA-454/R-95-015, November 1997) that described the procedures, technical criteria and standards and specifications for developing and reporting air pollutant emissions factors for publication in either AP-42 or the L&E document series. This 1997 guidance document covered the compilation, review and analyses of new data and information and preparation of supporting documentation for emissions factor development. Although OAQPS has focused significant effort and resources on developing emissions factors, the procedures and guidance we have historically followed (documented in the EPA's Procedures for Preparing Emission Factor Documents, November 1997) have not kept pace with the increased volume of available emissions data or advances in information technology. For example, although AP-42 is available online, the format is analogous to a hard-copy document 3-2 ------- Section 3.0 How Have We Historically Developed Emissions Factors? which is not conducive to incorporating new data, making corrections to data, or conducting data analyses. Also, because of their complex and somewhat subjective nature, the past emissions factor development procedures were slow to incorporate new emissions test data and did not encourage active public participation. To address these shortcomings, we have revised our approach for developing emissions factors to be more responsive and transparent. Section 5.0 discusses our revised approach to developing and documenting emissions factors. 3-3 ------- Section 4.0 How are Emissions Factors Used? Emissions factors are used to develop emissions estimates for processes and activities in cases where direct measurements are unavailable. Emissions factors are typically developed to represent long-term (e.g., annual) average emissions and, accordingly, data used for developing the emissions factors is usually based on emissions testing collected during normal process operating conditions. Short-term emissions from a particular process will vary significantly over time (i.e., within-process variability) because of fluctuations in normal process operating conditions, control device operating conditions, raw materials, ambient conditions and other factors. Because of the relatively short duration of emissions tests and the limited range of conditions they represent, the available emissions and process data used to develop an emissions factor are not sufficient to account for these short-term emissions fluctuations. Historically, emissions factors developed by the EPA were intended to be available for use in preparing regional and national emissions inventories when valid site-specific information (including material balances or other engineering calculations) were not available. These inventories are typically the first part of the development of a regional or national control strategy to reduce area-wide emissions. These inventories are important tools in air quality management because they are used to estimate ambient pollutant concentrations; to model pollutant dispersion and transport in the atmosphere; and to develop and assess control strategies. Despite their original purpose, we are aware that emissions factors have been applied by other entities (e.g., federal, state, tribal and local agencies; consultants; industries) for purposes beyond the intended use of supporting national and regional emissions inventory programs. We remain concerned that emissions factors have been applied to these non-emissions inventory uses without consideration of the limitations inherent in the use of emissions factors (e.g., factors are not particularly suitable to developing short-term or site-specific emissions estimates). Users of emissions factors should consider the impact of the reliability of emissions factors on their non-inventory programs (e.g., apply statistical procedures to account for variability). Such creators and users of emissions factors may wish to conduct periodic retesting to confirm or revise as necessary, the emissions factor. 4-1 ------- Section 5.0 What are EPA's Revised Procedures for Developing Emissions Factors? Beginning in 2003, OAQPS, the National Academy of Sciences and the EPA's Office of Inspector General conducted a review of the agency's emissions factors program. Based upon the feedback received from stakeholders (e.g., industry, state/local/tribal entities, the EPA's program offices, environmental organizations ), we revised our historical approach to developing emissions factors to reduce the level of subjectivity involved in the emissions factor development process. Our revised approach also improves the transparency and responsiveness of the process and encourages meaningful public participation. Figure 5-1 provides an overview of our revised approach for developing new emissions factors and for revising existing factors using test data submitted electronically to WebFIRE. The following sections describe the key revisions that we implemented in our approach regarding the collection of emissions data and supporting documentation, the evaluation of data, and the development and assessment of emissions factors. 5.1 Data Collection Based upon the review of our emissions factor program, we found that most emissions testing information and associated data are currently generated electronically. To take advantage of advances in information technology and the more widespread availability of electronic data production, our revised approach focuses on collecting new emissions data available in an electronic format. To aid facilities in planning and reporting the results of emissions tests, we developed the ERT and the WebFIRE template spreadsheet (see Section 10.1). The ERT replaces time- intensive manual methods for test planning, test data compilation and reporting and data quality assurance evaluations. The WebFIRE template spreadsheet is an alternative way that allows data that were collected using test methods not supported by the ERT and data collected prior to January 1, 2012, (the inception date of the ERT) to be submitted to EPA. Because of the prevalence of electronic data, we believe that our transition from the use of predominantly hard- copy resources (e.g., test reports, technical publications) for emissions factor development to the use of data in an electronic format will be relatively easy. The use of an electronic format will facilitate the ongoing collection, incorporation and analysis of new test data and supporting 5-1 ------- Section 5.0 What are EPA's Revised Procedures for Developing Emissions Factors? documentation. Also, use of the ERT and the WebFIRE template spreadsheet will enable us to streamline the emissions factor development process through more rapid data handling and quality assurance checks. Figure 5-1. EPA's Revised Procedures for Developing Emissions Factors 5-2 ------- Section 5.0 What are EPA's Revised Procedures for Developing Emissions Factors? 5.2 Test Data Evaluation Historically, the EPA's quality ratings of emissions test data and test reports were largely subjective because each test program presented different issues (i.e., no two facilities, their operation or the tests conducted at those facilities are exactly alike). Typically, the EPA developed letter-grade quality ratings (A through E) for test reports based upon the agency's review of the following criteria areas: • Process operation, • Test method and sampling procedures, • Process information, and • Analysis and calculations. To reduce the subjectivity of our qualitative assessment of the emissions, process and control device data collected during an emissions test, we have developed a more objective rating system for test reports (see Appendix A). The rating system is intended to produce unbiased and consistent assessments of the information included in test reports which, in turn, will help us to better characterize the process and the quality of emissions values. The rating system consists of a set of objective review questions developed for the EPA's manual and instrumental test methods that assess the quality of the process, control device and measurement data collected during an emissions test in the following criteria areas: • General information, • Process and control device information, • Sampling locations, • Test methods and reporting requirements, • Sampling equipment calibrations, • Sample recovery; • Laboratory analysis, and • Documentation. The Individual Test Rating (ITR) is a numeric score determined for each test report as the prorated sum of the individual scores assigned to each review question based upon the answers provided (see Appendix A). 5-3 ------- Section 5.0 What are EPA's Revised Procedures for Developing Emissions Factors? Our rating system is designed to allow for potential increases in the ITR value through independent review by a regulatory agency. In cases where a regulatory reviewer affirms the original responses provided to the review questions, additional points are assigned to the ITR value originally assigned by the ERT or the WebFIRE template spreadsheet when the measurement data were initially recorded by the testing contractor. If the regulatory reviewer determines that the initial review points were incorrectly assigned, the points originally assigned to a particular review question are deducted from the ITR. 5.3 Detection Limit Procedures for Calculating Test Run Averages The EPA defines the minimum detection limit (MDL) as "the minimum concentration of a substance that can be measured and reported with 99 percent confidence that the analyte concentration is greater than zero and is determined from an analysis of a sample in a given matrix containing the analyte." Essentially, the MDL is the smallest amount of a substance that an analytical method can reliably be distinguished from zero, at a specified confidence level, from the instrument signal produced by a blank sample. The ERT and the WebFIRE template spreadsheet provide emissions data at the test run level and provide a flag for each test run that designates whether the emissions value is above the MDL of the test method (ADL), below the MDL (BDL), or detection level limited (DLL) for cases where some of the measurement data used to derive the test run value are BDL. For each ERT or WebFIRE template spreadsheet submission, we calculate the average of the test run values and assign a detection limit flag to the average as follows: 1. If all test runs are ADL, we calculate the average of the test run values and assign an ADL flag. 2. If the test runs are a mix of ADL and DLL (or all DLL) data, we calculate the average of the test run values and assign a DLL flag. 3. If all test runs are BDL, we calculate the average using the BDL/2 test run values and assign a BDL flag. 4. If the test runs are a mix of ADL (or DLL) and BDL data, we calculate the average value excluding the BDL/2 test run value(s) that are greater than the highest ADL (or DLL) value and assign a DLL flag. 5-4 ------- Section 5.0 What are EPA's Revised Procedures for Developing Emissions Factors? Appendix B contains a more detailed discussion of the procedures that we follow for handling detection limit flags. 5.4 Grouping of Candidate Data and Identification of Outliers To assemble data sets for calculating candidate emissions factors, we group the calculated average emissions values (and the underlying data for existing AP-42 emissions factors, if available) by unique combinations of Source Classification Code (SCC), pollutant, control device configuration and units (i.e., mass of emissions per activity). We then subject the candidate data set to statistical outlier tests to determine if we should eliminate any average emissions values from the emissions factor calculations. A statistical outlier refers to one or more values that do not conform to the statistical pattern established by other values under consideration for the same process. These outlier values can be caused by an unusual process condition or circumstance that produced an unexpected and unrepresentative variation in the process emissions. For the purposes of identifying outliers, our revised approach for developing emissions factors uses the Dixon Q test or the Rosner test, depending on the number of average emission values in the candidate data set. If there are fewer than three values in the subject data set, WebFIRE does not conduct the outlier analysis or calculate the candidate emissions factor. Appendix C contains a detailed discussion of the procedures we use to determine outliers. Our procedure omits outliers when calculating the value of the candidate emissions factor. Although not included when calculating candidate emissions factors, we do not remove values identified as outliers by the Dixon Q or Rosner tests from the WebFIRE database. Facilities routinely submit emissions data to WebFIRE and, as the data population in the unique groupings increases over time, the statistical characteristics (e.g., average, minimum, and maximum values) of the data grouping that determine whether a value is an outlier also change (i.e., a value identified as an outlier today may not be identified as an outlier tomorrow when the data population contains more values). 5.5 Emissions Factor Derivation and Quality Assessment After evaluating the candidate data set for outlier values, we follow a step-wise procedure to: (1) calculate an emissions factor value using the average emissions values that result in the 5-5 ------- Section 5.0 What are EPA's Revised Procedures for Developing Emissions Factors? highest quality rating and the most representative factor for the source category of interest, and (2) assign the quality rating of the resulting emissions factor. The procedures for calculating the emissions factor value and assessing factor quality are based upon an evaluation of the number of individual sources in the source category for which we are developing the emissions factor, the individual test report quality rating (ITR) and the number of individual test data values used to calculate the emissions factor. WebFIRE does not calculate a candidate emissions factor value in cases where all the data in a unique grouping are BDL. Appendix D contains a detailed description of the emissions factor development, and data quality characterization procedures. 5-6 ------- Section 6.0 EPA's Interactive Database for the Emissions Factors Program - What is WebFIRE? 6.1 What is WebFIRE? WebFIRE is the EPA's online emissions factors repository, retrieval, and development tool. The WebFIRE database contains the EPA's emissions factors for criteria and hazardous air pollutants (HAP) for industrial and non-industrial processes. In addition, WebFIRE contains the individual test data values, where available, and supporting documentation used to develop the factors and other data submitted to the EPA by federal, state, tribal, and local agencies; consultants; and industries. For each emissions factor and individual test data value, WebFIRE contains descriptive information such as industry and source category type, control device information, the pollutants emitted and supporting documentation. The home page for WebFIRE and links to Frequently Asked Questions (FAQs) and background information on data contained in the WebFIRE system can be found at: www.cfpub.epa. gov/webfire/. 6.2 How is WebFIRE Used? Currently, the primary function of WebFIRE for the public is to provide storage and retrieval of emissions factors and emissions test data. The EPA also expanded WebFIRE to provide public users tools for calculating and assessing the representativeness of emissions factors derived from a set of individual test data values selected by the public user. Figure 6-1 provides an overview of WebFIRE and its basic functionality for the public user. 6-1 ------- Section 6.0 EPA's Interactive Database for the Emissions Factors Program-What is WebFIRE? Figure 6-1. WebFIRE Overview 6-2 ------- Section 6.0 EPA's Interactive Database for the Emissions Factors Program-What is WebFIRE? To retrieve an EPA emissions factor, WebFIRE allows users to specify one or more of the following search criteria: SCC, pollutant, control device, and AP-42 section. WebFIRE allows users to include revoked emissions factors in the search results. The emissions factor search also provides a link that presents the data values used to derive the selected emissions factor and the data values excluded from the emissions factor calculation. Section 7.0 provides a more detailed discussion of the WebFIRE emissions factors search and retrieval tools. The EPA also added to WebFIRE a tool that allows a user to calculate an emissions factor from a set of individual test data values stored in WebFIRE. These WebFIRE tools incorporate our revised approach for developing emissions factors (see Section 5.0). In general, the user selects the individual test data values to be used in developing an emissions factor and WebFIRE evaluates the data set to identify outlier values and confirms that the data set does not consist of only BDL values. Following the outlier value analyses and BDL assessment, WebFIRE calculates an emissions factor value from the data set that best represents the process of interest. WebFIRE also assigns a relative quality rating to the user-defined emissions factor. Section 9.0 discusses the WebFIRE emissions factor development tools in more detail. Appendices B through D contain the BDL and outlier analyses and the calculations and procedures for deriving an emissions factor. 6.3 Who Uses WebFIRE? The data storage, retrieval, and emissions factor development capabilities of WebFIRE are available online to all public and private entities. Examples of WebFIRE users include, but are not limited to: • Federal, state, local, or tribal air pollution control and regulatory agency personnel (example uses include: emissions inventory development, preparation of emissions estimates for dispersion modeling, comparison of a site-specific emissions factor to an EPA emissions factor for a given process). • Environmental staff at industrial facilities (example uses include: emissions and process data submittal; comparison of process emissions to an EPA emissions factor or other related data). • Environmental organizations (example uses include: air emissions oversight). • Engineering consultants, university researchers, and international air agencies. 6-3 ------- Section 6.0 EPA's Interactive Database for the Emissions Factors Program-What is WebFIRE? Periodically, the EPA will use the test data and development tools contained in WebFIRE to revise existing and derive new emissions factors as discussed in Section 11.0. The EPA also anticipates using the test data submitted to WebFIRE to inform our air rule development efforts under the Clean Air Act. 6.4 How Does WebFIRE Improve Emissions Factor Identification and Development? The emissions factor repository, retrieval, and development tools in WebFIRE facilitates the EPA's progress towards our goal of developing an interactive emissions factors program that will incorporate new data as they become available and produce high-quality emissions factors in a timely manner. We also believe that the benefits of online data access and electronic data submittal provided by WebFIRE will provide for easier, more effective involvement by the public interested in developing and improving emissions factors. WebFIRE also allows the EPA to shift the role of OAQPS from that of sole developer of emissions factors to that of a facilitator. Because of this shift, we can focus more resources on overseeing the emissions factor program and ensuring that the program develops high-quality emissions factors using a consistent and transparent approach. 6-4 ------- Section 7.0 How Do I Find an Emissions Factor? 7.1 How Do I Identify and Retrieve an Emissions Factor from WebFIRE? The WebFIRE emissions factor search allows you to focus the factor retrieval process by entering multiple search criteria including: • see,1 • Control device type, • Pollutant or pollutant group type, and • Specific AP-42 section. WebFIRE also allows you to expand your search to include emissions factors that have been revoked by EPA. Figure 7-1 provides an overview of the factor retrieval process. For each of the search criteria, WebFIRE provides a dropdown menu presenting the available selections in the database. For the SCC, control device, and pollutant criteria, WebFIRE also provides a filter box that allows you to limit the dropdown selections to only those that contain the filter text (e.g., entering "refine" in the SCC text box reduces the dropdown menu selections to only those SCCs that contain "refine" in the code or text description, such as "refinery"). Table 7-1 lists the data fields WebFIRE provides for each emissions factor record. Depending on the search criteria you enter, WebFIRE will return one or more emissions factors. At this stage of the search, you have the option of: (1) creating a summary report of the information shown on the results page (Figure 7-1, Option A), or (2) obtaining additional background information for the emissions factor that you selected (see Section 7.2). To accommodate various end uses of the retrieved data (e.g., emissions calculations, incorporation into a text file), WebFIRE provides you with the following reporting formats: 1 The EPA uses the SCCs to organize data for anthropogenic air pollutant sources that have similar production and emissions characteristics (e.g., gasoline storage tanks, polymer manufacturing facilities) into related groups or source categories. Section 8.1 provides an overview of the SCC system. 7-1 ------- Section 7.0 How Do I Find An Emissions Factor? • Comma Separated Values (CSV) format (for importation into a spreadsheet or database), • Extensible Markup Language (XML) format (for importation into XML parsing applications), • American Standard Code for Information Interchange (ASCII) format (for importation into other applications), and • Hypertext Markup Language (HTML) format (for printing). 7-2 ------- Section 7.0 How Do I Find An Emissions Factor? Figure 7-1. Procedures for Retrieving Emissions Factors from WebFIRE 7-3 ------- Section 7.0 How Do I Find An Emissions Factor? Table 7-1. Data Fields Reported by WebFIRE Emissions Factor Search Emissions Factor Record Data Elements Description Emissions factor Numerical value and units of the emissions factor see Source Classification Code Pollutant name Chemical name of pollutant factor NEI pollutant code National Emissions Inventory (NEI) code assigned to the pollutant CAS number Chemical Abstract Service (CAS) number assigned to the pollutant Pollutant code Identification number assigned to the pollutant in the National Emissions Inventory (NEI) Quality score Composite Test Rating (CTR) for EPA factors Emissions factor representativeness Qualitative characterization of how well an emissions factor statistically represents the population of similar facilities in a source category Primary control device The first control device applied to the process Second control device The second control device applied to the process Third control device The third control device applied to the process Fourth control device The fourth control device applied to the process Fifth control device The fifth control device applied to the process Status Identifies emissions factors as individual test data value, EPA factor, or draft emissions factor undergoing review Data source type Refers to the original document(s) from which factors were obtained Restriction type Refers to caveats or special considerations prior to use of the emissions factor References Test report or citation where the factor was derived AP-42 section Identifies the specific AP-42 section where the process data can be found Formula Empirical equation used to express an emissions factor Date Represents the date the emissions factor was developed/revised Notes Additional information to assist the user in understanding and applying an emissions factor 7.2 How Do I Obtain Background Information for My Selected Emissions Factor? At the search results page, WebFIRE provides you the option of retrieving additional detailed information for the emissions factor that you selected (Figure 7-1, Option B) by clicking on the "Details" link located at the right-hand side of the search results page. This detailed information is intended to give you a better understanding of your specific factor so you can make better decisions regarding its applicability. 7-4 ------- For the AP-42 factors we developed using our historical approach discussed in Section 3.0, the Details link provides you with information such as the citation for the data; the applicable AP-42 section; formulas and equations that are applicable to the factor; and information on process configurations, operating conditions, control device configurations, and test conditions relevant to the emissions factor that you selected. For the factors we develop using our revised approach discussed in Section 5.0, the Details link provides you with three tables. Table 1 summarizes the emissions factor and Tables 2 and 3 present the data EPA included and excluded from the calculation, respectively. Tables 2 and 3 also provide links to the electronic submissions of the individual emissions test reports. 7-5 ------- Section 8.0 What Parameters Should I Consider When Using or Deriving an Emissions Factor? When you are selecting or deriving an emissions factor for use in developing an emissions estimate for a particular process or activity, the primary considerations are generally: • How well the emissions factor represents the process for which the emissions estimate is being developed, • The effect on emissions due to the presence (or absence) of a control device or technique, and • The underlying test method used to measure the pollutant(s) represented by the emissions factor. 8.1 Source Category and Process Considerations The EPA uses SCCs to classify different types of anthropogenic emissions activities. Each SCC represents a unique, source category-specific process or function that emits an air pollutant. The SCCs are used as a primary identifying data element in EPA's WebFIRE, the NEI and other EPA databases. The SCCs are also used by many regional, state, local, and tribal agency emissions data systems. There are two types of SCCs: 8-digit and 10-digit. The 8-digit SCCs follow the pattern 1-22-333-44 and the 10-digit SCCs follow the pattern 11-22-333-444. The codes use a hierarchical system in which the definition of the emissions process becomes increasingly more specific as you move from left to right. The first level of description provides the most general information on the category of emissions. The fourth category is the most detailed and describes the specific emitting process. Point source SCCs have historically had only 8 digits; however, there are numerous 10-digit SCCs that characterize point source processes such as aircraft emissions and ground support equipment emissions at airport facilities. Ten-digit SCCs primarily represent nonpoint and mobile source emissions. You can download the current list of SCCs and their descriptions from the EPA's Emission Inventory System (EIS) website: https://sor-scc-api.epa.gov/sccwebservices/sccsearch/. At this website, you can search for SCCs by entering keywords, sectors, or partial numeric codes 8-1 ------- Section 8.0 What Parameters Should I Consider When Deriving a User-Defined Emissions Factor? in the filter box or explore the SCC list using a keyword filter box or selecting one or more preset filter options (e.g., include retired SCCs). The EPA periodically updates and improves the SCCs. As technologies have changed over the years, the EPA has recognized the need to remove outdated SCCs and add SCCs for new emissions processes. A review of existing SCCs has shown several instances of duplicate SCCs for the same process. The EPA is retiring duplicate SCCs to help ensure that each emissions process has a unique SCC. In addition, the EPA is working to assign SCCs to emissions sources that are currently regulated, but do not have SCCs. We are also making other changes to ensure that the assignment of an SCC is consistent with the descriptions associated with the hierarchy of digits that comprise each SCC. The SCC revisions improve the overall organization of the SCC list by reducing the likelihood of a user choosing an incorrect SCC for their particular process. The EPA designed the SCCs to categorize processes that create emissions; therefore, one objective of revising the SCCs is to remove control device descriptions from the current SCC list. Another objective of the SCC revision process is to reduce the use of miscellaneous SCCs, such as those including "99" codes. Often these are labeled in the SCC list as "other not classified," "specify in comments field," or "miscellaneous." These types of labels are not sufficient to classify emissions processes. Therefore, the EPA is removing these SCCs from WebFIRE. The EPA's new approach will allow entities submitting test data to WebFIRE to propose new SCC(s) for their emissions processes in an effective and logical way. Upon receipt of a request to establish a new SCC, the EPA will perform an analysis to determine if the proposed SCC is unique or if an existing SCC should be used. We will base our analysis upon the uniqueness of the emissions profile of the process and other relevant considerations. It is important to note that the revisions that we are currently making to the SCC process do not change the fundamental role that SCCs play in the emissions factor program or the way that users will be able to search for specific emissions factors. These revisions should improve the overall data quality of the emissions factors by ensuring that the data upon which the emissions factors are based are grouped in the appropriate SCC. In addition, the WebFIRE emissions factor search function automatically applies EPA's updated SCCs to the search results. 8-2 ------- Section 8.0 What Parameters Should I Consider When Deriving a User-Defined Emissions Factor? For example, if a user searches for an emissions factor using an SCC that EPA has retired or mapped to a new SCC, the WebFIRE search results alert the user to the change in SCC status and provide emissions factors for the new SCC, if available, and for the SCC the user originally selected when specifying the search criteria. The SCC bulk download available from the EPA EIS website provides a cross-walk so that you can identify revised SCCs by their old SCC number. 8.2 Control Device Considerations In addition to assessing the production process or activity for which you are selecting or developing an emissions factor, you should have a clear understanding of the operation and performance characteristics of any control techniques or technologies that are used to reduce emissions from the process. When you are selecting or developing a controlled emissions factor, you should determine if the control device reflected in the emissions factor record is comparable to the type and configuration of any control device that is applied to the process for which you are developing the emissions estimate. You may also want to assess whether the pollutant of interest is reduced or eliminated by a particular type of control device, or determine whether a piece of equipment functions as an integral part of the process (e.g., a cyclone that separates product from a pneumatic conveying system, cooling coils in a vapor degreaser that reduce solvent loss) or whether it is a control device (e.g., a cyclone that reduces PM emissions from a wood sawmill, a thermal oxidizer that reduces organic emissions from a process vent). You may also find that a clear understanding of control device operation is useful when assessing the performance of control devices that are operated in series (WebFIRE accommodates up to five control devices for a single emissions factor record). 8.3 Pollutant Test Method Considerations The selection of a test method and how the method is applied to measure emissions from the process can affect the representativeness of the emissions data and the resulting emissions factor developed from the data. The majority of the emissions factors contained in WebFIRE are based upon direct emissions measurements. In most cases, these measurements were obtained using the EPA's reference test methods that were created to support development, implementation and compliance with federal standards (e.g., New Source Performance Standards 8-3 ------- Section 8.0 What Parameters Should I Consider When Deriving a User-Defined Emissions Factor? (NSPS), National Emission Standards for Hazardous Air Pollutants (NESHAP)). In addition, some emissions factors are based upon data collected using non-EPA test methods (e.g., methods developed by the California Air Resources Board (CARB)). The EPA's reference test methods provide direct measurement of specific chemical species (e.g., carbon monoxide (CO), sulfur dioxide (SO2)), emissions from a process or control device. The EPA's reference test methods for measuring PM or total hydrocarbons (THC) measure the emissions of a group or class of pollutants rather than an individual compound or chemical species. In these cases, for example, the term "filterable PM" is considered to apply to the material that is captured upstream and on the sampling train filter maintained at a specific temperature. Consequently, the temperature at which the sampling train is operated affects the amount of "filterable" material collected (e.g., operating the sampling train at a lower temperature would tend to capture more compounds that have high vapor pressures). When you are considering an emissions factor developed from PM or THC data, you should be aware of the underlying test method and conditions under which the test was conducted to determine if the emissions factor is appropriate for the pollutant for which you are using or preparing the emissions estimate. Often, an understanding of how the method is conducted can overcome confusion related to applying the data and to comparing emissions from different facilities. 8-4 ------- Section 9.0 How Do I Develop a User-Defined Emissions Factor? 9.1 How Do I Use WebFIRE to Create a User-Defined Emissions Factor? WebFIRE allows the public to develop a user-defined emissions factor using the same procedures that the EPA follows to develop new or to revise existing emissions factors (see Section 5.0). Figure 9-1 shows the steps for developing a user-defined emissions factor. First, a user should obtain all the individual test data values contained in WebFIRE that are related to the emissions process of interest to you by specifying the appropriate search criteria at the emissions factor development page in WebFIRE. After you have obtained the list of individual test data values that match your search criteria, you can select the candidate data set containing the values that you want to use to develop the user defined emissions factor by highlighting the check box next to each data record of interest. WebFIRE calculates the emissions factor value from this candidate data set using the outlier, BDL, factor derivation and quality assessment tools discussed in Section 5.0. This development tool is not applicable to the emissions factors that are expressed as empirical equations because they contain more than one variable. After WebFIRE calculates the user-defined emissions factor, you can generate a report to provide documentation of the emissions factor development. The report provides a summary of the user-defined emissions factor, the test data values used to derive the factor, the corresponding SCC for the emissions factor, applicable control devices, the Composite Test Rating (CTR) for the factor (see Appendix D), and how well the emissions factor represents air emissions from the process associated with the SCC. The report also shows the values and supporting information for the individual test data values that were used to derive the emissions factor. Because the WebFIRE database will not retain user-defined emissions factors after they are created, we recommend preparing a report for any user-defined emissions factor that you develop. 9-1 ------- Section 9.0 How Do I Develop a User-Defined Emissions Factor? Figure 9-1. Emissions Factor Derivation in WebFIRE 9-2 ------- Section 9.0 How Do I Develop a User-Defined Emissions Factor? 9.2 What are the Potential Impacts Associated with Applying a User-defined Emissions Factor? Applying a user-defined emissions factor may affect whether or not you conclude your source is subject to certain regulations. For example, applying a user-defined emissions factor to a site-specific emissions estimate could show that a facility is not subject to a particular emissions standard where the previous use of an emissions factor indicated that the emissions standard was applicable. For this reason, we encourage you to be judicious and responsible in your application of a user-defined emissions factor. We also encourage you to create and maintain the WebFIRE report (see Section 9.1) that documents the development of the user- defined emissions factor. 9-3 ------- Section 10.0 How Do I Submit Data to WebFIRE? To ensure consistency of data submittals from many different facilities and entities, we have two ways to submit data. For EPA to develop or revise existing emissions factors, it is important for a user to submit the data results to WebFIRE in the format of the EPA's ERT or an ERT compatible XML schema. If you have data you would like submitted to WebFIRE that is collected by methods not supported by the ERT, you can voluntarily use the WebFIRE template spreadsheet, available at: www.epa.gov/electronic-reporting-air-emissions/electronic-reporting- tool-ert and send the file to EPA via mail or email. The ERT (see Section 10.1) is an electronic alternative to submitting paper test reports and supporting documentation. The WebFIRE template spreadsheet allows for electronic submission of data collected using test methods not supported by the ERT, or test data collected prior to January 1, 2012. The Compliance and Emissions Data Reporting Interface (CEDRI) at the EPA's Central Data Exchange (CDX) is the data upload application for submitting ERT files. The CDX (see Section 10.2) is part of the Environmental Information Exchange Network and provides industry an easy and secure reporting service. If you have an existing CDX account (e.g., you submit reports for the EPA's Toxics Release Inventory (TRI) Program), you can use your current user ID and password to log in to CDX by navigating to the https://cdx.epa.gov/ link, entering your user ID and password then selecting the "Log In" button in the header of the page. After you log in, you will be able to select the "Add Program Service" button on the MyCDX Services page to add CEDRI to your list of CDX applications. You will then be able to follow the instructions provided on the subsequent pages to complete the identity verification process to obtain approval from EPA to access CEDRI. If you do not have an existing account with the CDX, you can complete the online registration process by navigating to the CDX home page (https://cdx.epa.gov/) and clicking the "Register with CDX" button in the header of the page. After completion of the user registration component, you will be able to follow the instructions provided on the subsequent pages to complete the identity verification process in order to obtain approval to access the CEDRI data 10-1 ------- Section 10.0 How Do I Submit Data to WebFIRE? upload program. During the registration process, you have the option of registering as a "preparer" or as a "certifier." If you are preparing reports for signature and subsequent submission by an authorized representative of a facility, you should register as a preparer. The certifier is the duly authorized representative of the source or more commonly referred to as the "owner or operator" of the facility. The certifier is authorized to modify the package a preparer has assembled, sign and submit the package to the CDX. Contractors are prohibited from registering as a certifier. Contractors are permitted to register as a preparer and may assemble submission packages, such as the ERT, for the certifier's approval and signature. If you are the signature authority for the facility (i.e., certifier), you may either use the LexisNexis electronic identity validation service or the paper-based Electronic Signature Agreement (ESA) validation process to register as a certifier. We strongly encourage certifiers to use the electronic identity validation process as the paper-based approval of the ESA typically takes 5 to 10 business days. If you choose to use the paper-based validation process, you will be required to mail your signed ESA to the CDX Reporting Center. The CDX Reporting Center will request the phone number of the signature authority's employ er/authorizing official to verify employment. For any questions regarding the CDX, the CDX Help Desk (https://cdx.epa.gov/Help) is available for data submission technical support between the hours of 8:00 am and 6:00 pm (Eastern Standard Time (EST)) at 1-888-890-1995 or helpdesk@epacdx.net. The CDX Help Desk can also be reached at 970-494-5500. The WebFIRE template spreadsheet should not be submitted through the CEDRI upload application. The template should be emailed to the EPA's Info CHIEF mailbox at: Chief Info@epa.gov or mailed on electronic media to EPA at the following address: U.S. EPA Group Leader Measurement Policy Group, OAQPS Mail Code D243-05 RTP, NC 27711 10-2 ------- Section 10.0 How Do I Submit Data to WebFIRE? 10.1 How Are Emissions Tests Documented? There are two approaches to documenting emissions tests electronically: the ERT, and the WebFIRE template spreadsheet. The ERT is a database application developed by EPA to aid facilities in planning and reporting the results of emissions tests. The ERT replaces time- intensive manual test planning, test data compilation and reporting, and data quality assurance evaluations. When utilized to its fullest potential, the ERT can also facilitate coordination among the facility, the testing contractor and the regulatory agency (e.g., for compliance demonstrations) in planning and preparing for the emissions test. The current version of the ERT, a list of the EPA test methods that are currently supported by the ERT and guidance on the use of the ERT can be found at: www.epa.gov/electronic-reporting-air-emissions/electronic-reporting- tool-ert. The EPA's Emission Measurement Center (EMC) provides information regarding the EPA's test methods and can be found at: www.epa.gov/emc/. The ERT documents the following key information, some of which is required by the EPA reference test methods for stationary sources: • SCC specification, • Process data from existing air permits (e.g., process throughput rates), • Process rate levels during actual testing, • Descriptions of the source, unit process and control devices associated with the test, • Process upsets or malfunctions during testing, • Process flow diagram, • Sampling locations, • Test methods used, • Deviations made to any test method, and • Output flow rates and pollutant concentrations. Figure 10-1 shows the typical steps followed when using the ERT. The ERT consists of: (1) a database application, (2) the project data set (PDS), and (3) a data upload spreadsheet. The database application contains all of the data input screens, reports, calculations and other items necessary to create and distribute a test report. The application also incorporates our evaluation system (see Section 5.2 and Appendix A) so that each test is assigned a numeric score (the ITR) that assesses the quality of the measurement data and associated information collected during an emissions test. The PDS database contains the measurement data for a single test report. This file 10-3 ------- Section 10.0 How Do I Submit Data to WebFIRE? is exchanged between the source test contractor, the client and the regulatory agency, if necessary (e.g., for a compliance test). To provide flexibility to ERT users, the Microsoft Excel® spreadsheet can be used to upload the sampling hardware and field measurement data recorded during a test into the PDS rather than entering the data directly into the PDS through the application. 10-4 ------- Section 10.0 How Do I Submit Data to WebFIRE? Figure 10-1. Typical Work Flow When Using the ERT Upon completion, the ERT contains all of the emissions data and supporting information (e.g., equipment calibration documentation) prepared and collected for the test. In addition, testers can attach supporting information to the ERT or an electronic copy (PDF) of the entire report (optional) and create a submission package. When creating the submission package, the ERT automatically creates an XML export file for the WebFIRE emissions factor database. The format of this ERT output file is specifically designed to provide inputs for the data fields contained in WebFIRE (e.g., emissions value and units, SCC, ITR). To facilitate incorporation of the data into WebFIRE, the output file is configured to accept emissions values expressed in terms of mass of pollutant emitted per unit 10-5 ------- Section 10.0 How Do I Submit Data to WebFIRE? of activity. The output file also accepts emissions test results that are expressed as a concentration or an emissions rate (i.e., mass emitted per time unit) which may be able to be expressed in units that are suitable for use in emissions factor development. The EPA developed the WebFIRE template spreadsheet for companies, associations, and agencies to provide emissions data collected using test methods not supported by the ERT. The WebFIRE template spreadsheet is also applicable to emissions data collected prior to January 1, 2012 (test methods used to collected emissions data after this date include the requisite data to enter EPA's emissions factor development process). The structured format of the WebFIRE template spreadsheet organizes the basic source information used for emissions factor development, including supplementary information for more detailed characterization of the source and the emissions measurements, and the spreadsheet contains the same rating system used in the ERT for assessing the quality of the emissions test and assigning the ITR. Additionally, submitters should include a PDF copy of the entire report documenting the source test in the completed WebFIRE template spreadsheet file. Submission of the WebFIRE template spreadsheet to the EPA is not a requirement but a voluntary option to provide stack test data for EPA to use in potentially deriving an emissions factor. Use of the ERT and the WebFIRE template spreadsheet will provide for consistent criteria to quantitatively assess the quality of the data collected during the emissions test and to standardize the test report contents. The use of the ERT and the WebFIRE template spreadsheet also improves the availability of the supporting documentation necessary to conduct such an evaluation. Additionally, the ERT and the WebFIRE template spreadsheet lay the groundwork for future capabilities to electronically exchange information contained in the test reports with facility, state, local, or federal data systems. 10.2 What are the CDX and CEDRI and What are their Roles in Submitting Data to WebFIRE? Electronic environmental data submissions to EPA, including submission of emissions data for use in WebFIRE, can be made through the CDX using the CEDRI data upload application. 10-6 ------- Section 10.0 How Do I Submit Data to WebFIRE? The CDX is part of the Environmental Information Exchange Network that was developed by the EPA and the states to facilitate online sharing of electronic environmental information among EPA, states, tribes, localities and other entities. The CDX is a broad-based tool that offers industry, states, tribes and other stakeholders a fast, easy and secure reporting service. As part of EPA's e-government initiative, the CDX helps to ensure that both the public and regulatory agencies can access the information used to document environmental performance, understand environmental conditions and make sound decisions to protect the environment. The benefits of the CDX to the EPA and related program offices include: • Elimination of redundant infrastructure and its associated costs, • Facilitation of faster, lower-cost implementation of new or modified data flows, • Integration of data to agency data repositories, • Establishment of consistent procedures for electronic signatures, • Reduction in the time needed to make information publicly accessible, • Reduction in the record management costs by elimination of redundant recordkeeping, and • Compliance with the Cross-Media Electronic Reporting Regulation (CROMERR). The benefits to industry, states, local agencies and tribes associated with the CDX include: • Reduction of overall reporting burden, • Improvement in data accessibility, • Electronic confirmation that information was received and that the electronic form was filled out correctly, • Reduction in the time and costs associated with environmental data submission requirements, • Simplification of reporting to a single point in the EPA instead of many separate programs, • Faster securing of submission through built-in edit and data quality checks, • Improvement of security and transmission of confidential business information (CBI) through registration and authentication, • Reduction of burden of complying with new or changing requirements, and • Streamlining of reporting through the Exchange Network and Web Services. 10-7 ------- Section 10.0 How Do I Submit Data to WebFIRE? The EPA expects facilities to produce and submit an increased amount of new emissions test data in response to regulations that require the electronic submission of emissions tests to demonstrate compliance with federal air regulations. In the future, we anticipate that the EPA will use the capabilities of the CDX to provide for electronic exchange of information in test reports with facility, state and federal data systems. For example, the ERT and WebFIRE template spreadsheet allow sources to document facility- specific information that may also be required under other regulatory data systems, such as the Air Facility System (AFS). Such systems contain compliance, enforcement and permit data for stationary sources of air pollution regulated by the EPA and state/local/tribal agencies. Transfers to other data systems such as the NEI, TRI and Title V reporting may also be desirable. CEDRI job aides can be found at: https://www.epa.gov/electronic-reporting-air- emissions/cedri. Files submitted through the CDX/CEDRI are stored in the CDX CROMERR archive and a copy of the file is retained in the WebFIRE database. To submit files through the CEDRI application, you should accept the certification conditions that the documents and attachments were prepared under your direction or supervision and that, to the best of your knowledge, the information is true, accurate and complete. After accepting the certification conditions, you will be prompted to re-validate your username and password, answer the validation question and officially sign the submission. Shortly after submission, you will receive email notification stating whether the files were successfully or unsuccessfully submitted. Submissions can fail for a variety of reasons, including presence of an invalid file (e.g., improper file extension), an incomplete file, or system errors. If any system errors occur after you upload and sign the submission file, you will be prompted to re-submit the files or contact the CDX Help Desk. 10-8 ------- Section 11.0 What is the Data Review and Public Participation Process for Emissions Factor Development? An overview of the public participation and data review process used by the EPA when implementing Clean Air Act section 130 for source test and/or emissions factor data is shown in Figure 11-1.The Clean Air Act states "The Administrator shall permit any person to demonstrate improved emissions estimating techniques, and following approval of such techniques, the Administrator shall authorize the use of such techniques. Any such technique may be approved only after appropriate public participation." Periodically, the EPA will review, compile, and analyze the data contained in WebFIRE for the purposes of revising existing and developing new emissions factors, as appropriate. We generally consider the following criteria to determine if emissions factor development is warranted: • The amount of new source test/emissions factor data that have been received; • The degree of variability with existing emissions factors in WebFIRE; and • EPA's programmatic needs related to new rules, policies, and other EPA tools. If we receive a substantial amount of new information for a given process type and that process is a significant emitter of one or more pollutants, the agency may consider review and development of new emissions factors. If we receive only a few new data values for a process type, it is less likely that the new data alone would initiate the extensive factor review and development process. Another point that we consider is the difference and variability between the existing emissions factors in WebFIRE and the newer data. If the newer data do not significantly change the existing factor(s), the need to revise the factor would be less urgent. Lastly, decisions to initiate factor review and development may be tied to programmatic issues and schedules occurring within the EPA. For example, new data or the need for improved emissions factors may be driven by new regulations that are under development or that were recently promulgated. Also, emissions inventory requirements may be in place that call for new emissions factors. ll-l ------- Section 11.0 What is the Data Review and Public Participation Process for Emissions Factor Development? Figure 11-1. Overview of the WebFIRE Public Participation and Emissions Factor Development Process 11-2 ------- Section 11.0 What is the Data Review and Public Participation Process for Emissions Factor Development? When one or more of these considerations call for it, the EPA may be prompted to initiate the emissions factor review and development activities. As a result of this process, the EPA will generally draft new and/or revised emissions factors for specific processes (i.e., SCCs). The EPA will publicly announce the availability of these draft emissions factors and invite public review and comment via the Air and Emissions and Quantification Website (https://www.epa.gov/air- emissions-factors-and-quantification/documentation-supporting-draft-and-final-emissions- factors). The public announcement would be in the form of an EPA Listserv email notification via the InfoCHIEF Listserv (www.epa.gov/chief/chief-listserv). The public can join the CHIEF Listserv by sending an email to ioin-chief@lists.epa.gov. These notifications will describe the nature of the new emissions factors that EPA developed and their associated source categories. Typically, the public would have a 60-day review and comment period for the draft factors. Examples of some topics to consider when preparing comments include, but are not limited to: • The validity and accuracy of the test methods applied to obtain sample measurements, • The validity and accuracy of the analytical procedures used to quantify measurements, • The completeness, thoroughness and transparency of the source test documentation, • The correlations made between process parameters and test data conditions, • The accuracy of the assigned SCC and control device codes, and • The adequacy and accuracy of the process description for the source category and the associated documentation. The process for submitting comments (e.g., format and method of submittal, due dates, submittal address) would be described in the data availability announcements. Commenters should review all information pertinent to the correct calculation of emissions factors from the underlying test data. The review should address how well the mass or concentration measurement data were combined with process operating data (e.g., fuel use, material throughput, item production, power output) to yield an emissions factor. If controls are in place, control device operating conditions should be correctly associated with process conditions and factored into the emissions factor development. It is particularly important that reviewers confirm the process and source category associations made for the data. New or revised process flow diagrams and/or schematics should be submitted if an industry has undergone significant changes since the last revision. These process associations should be made using SCCs, recognizing that, in some cases, new SCCs may be required. 11-3 ------- Section 11.0 What is the Data Review and Public Participation Process for Emissions Factor Development? At the conclusion of the public comment period, the EPA evaluates the comments received and makes any appropriate modifications to the data in WebFIRE. If commenters provide new emissions test data for use in emissions factor development, we would consider combining the newer data with the existing data for a given source type or category. When determining valid combinations of existing and new data, we use statistical analyses that are based upon the Student's t-test (see Appendix E). If the comments identify issues or raise questions that the EPA cannot address, the original submitter may be contacted for reconciliation. After all comments are addressed and the EPA is satisfied with the quality of the emissions factor data, we will make the final emissions factor available to the public in WebFIRE (https://cfpub.epa.gov/webfire/). The previous emissions factor, if any, would be flagged as "revoked." 11-4 ------- Appendix A Procedures for Determining Individual Test Report Quality Ratings ------- Appendix A Procedures for Determining Individual Test Report Quality Ratings 1.0 Introduction Historically, the EPA's quality ratings of emissions test data and test reports were largely subjective because each test program presented different issues (i.e., no two facilities, their operation or the tests conducted at those facilities are exactly alike). Typically, the EPA developed letter-grade quality ratings (A through E) for test reports based upon the agency's review of the following criteria areas: • Process operation, • Test method and sampling procedures, • Process information, and • Analysis and calculations. To reduce the subjectivity of quality reviews, the individual test rating (ITR) assigned by the Electronic Reporting Tool (ERT) is based upon process, control device, and emissions testing documentation provided by the source and responses to questions that assess the quality of the process, control device, and emissions data collected during a source test. The methodology used by the ERT for assessing the quality of emissions test data follows the same basic principles as the EPA's historic methodology. However, the ERT procedure provides a consistent objective framework for test contractors to follow when compiling test reports, and for regulatory agency reviewers to follow when assessing data quality. The test report quality rating methodology consists of three components: (1) the assignment of points by the ERT based upon the source's entry of information into specific data areas and attachments, (2) an adjustment of the points assigned by the ERT based upon a regulatory agency review, and (3) the normalization of the points for a maximum ITR of 100 such that the ERT assigned score is 80 percent of the total and the remainder is based upon the regulatory agency review. Table A-l shows the types of information and documentation used by the ERT to assign points and the questions that are used to evaluate the quality of data submitted to the ERT. The information requested in the table is indicative of a complete and well-documented test report. The ERT assigns points based on the assumption that the information and documentation provided by the source is true, accurate, and complete. The adjustment to the points assigned by the ERT may result in a modest increase in the points when the regulatory agency review verifies that the information contained in the documentation provides an acceptable level of quality. The adjustment to the points assigned by the ERT may result in a decrease when the regulatory agency review reveals incorrect measurement procedures, unrepresentative process operation, or other inaccurate information. Supplementary points are assigned by the ERT when documentation is provided showing certification or accreditation of those individuals or organizations involved with the testing program. It is important to note that well-performed and documented test reports will receive a sufficiently high rating to justify their use in developing emissions factors without any supplementary points. Neither a state review nor participation by accredited organizations or certified individuals is required. However, these added components can improve the ITR of the test report. A-l ------- Appendix A Procedures for Determining Individual Test Report Quality Ratings It is also important to note that while a significant level of subjectivity has been removed from the quality assessment of source tests for emissions factors development, the points awarded are not a direct indicator of the precision, accuracy, and usability of the data for other purposes. For simplicity, the point assignment employs a "Yes/No" criteria rather than a graded assessment. Some of the components may not directly affect the precision, accuracy, or usefulness of the final result, but would bolster the confidence in the result. For example, reagent blanks and calibrations conducted prior to a test verify that the reagents and equipment comply with method requirements for the first test and increase the probability that the blanks and calibrations conducted after the test will comply with the method requirements. Also, some components do not result in completely unusable results at a given value. For example, a test with results below the method detection level may be adequate for demonstrating compliance when emissions calculated at the detection limit are significantly below the applicable limit. The judgment of an experienced and knowledgeable individual can estimate the range of potential change that a minor variation in an established test methodology has on the final result. While the use of a specific emissions test may not be used for emissions factor development, this data may be usable for other purposes when the bounds for that use are defined and assessed. 2.0 ERT Assessment The ITR of the source test report is based upon the information and attachments provided by the source. The ERT calculates the score based upon the completeness of the report in the areas of process data, control device information, test method performance and quality assurance. The information listed under "Supporting Documentation Provided" in Table A-l identifies information the source or source test contractor provides and the criteria the ERT uses in assigning points to calculate the quality indicator. The EPA assigned different relative weightings to supporting documentation components due to the importance associated with their potential to affect the overall precision, accuracy, representativeness, and reliability of the final results. Only those items related to the information collected during the test are used in calculating the initial score. Using the completeness of the data and supplemental attachments, we normalize the score so that the ITR score is limited to 80 points when only the ERT assessment is performed. Table A-l also identifies criteria that, if satisfied, can provide supplementary points above the maximum of 76 awarded by the ERT. The ERT awards supplementary points whenever: 1. The source test company meets the competency requirements as an Air Emission Testing Body (AETB) as defined by the American Society for Testing and Materials (ASTM) standard D7036-12 or the field test leader is a current Qualified Individual (QI) as defined by ASTM standard D7036-12. 2. The analysis laboratory is certified or accredited to perform the analysis. A-2 ------- Appendix A Procedures for Determining Individual Test Report Quality Ratings The ERT assigns an extra two points for each of the above accreditations or certifications that are demonstrated in the test report. As a result, the ERT could assign a maximum of 80 points if a QI was crew leader or the test company was an AETB and the laboratory was accredited by a national independent or state accreditation program. Some of the information requested in Table A-l is specific to certain test methods. For example, the isokinetic sampling requirements (listed under "Raw sampling data and test sheets") is only applicable when the test method collects pollutants that are in a particulate form. In cases like this, the ERT would not include the points associated with these items in either the points assigned or the maximum potential points used to normalize the ITR score. As a result, the ERT will not give the test report a lower rating if the test method used does not require isokinetic sampling. Instead, quality ratings depend upon the testing requirements. For example, if an instrumental test method is used, the ERT will use only those questions that pertain to the method to evaluate the quality of the test. Because the ERT normalizes the overall score based upon the maximum score that can be assigned for any given method, the fact that some questions that do not apply to the particular test method are not scored does not reduce the overall maximum score possible for one test method relative to another method. 3.0 Regulatory Agency Review The quality of an emissions factor is only as good as the source data upon which it is based. In the majority of cases, the test report, which is typically prepared by the testing contractor, is the only documentation available for assessing the potential reliability (e.g., precision, accuracy, representativeness) of the emissions data for emissions factor development as described in Appendix D. In all cases, the quality of the underlying source data can be more thoroughly assessed when the test report is independently reviewed by a regulatory agency. The maximum quality rating for a test report that is not reviewed by a regulatory agency is 80 points (76 points assigned for the base ERT review and 4 additional points assigned if testing or analyses were conducted by certified or accredited individuals and organizations). The regulatory agency review can raise the initial ITR score to a maximum of 100 points. However, a negative evaluation by a regulatory reviewer can result in reducing the value of the initial scoring significantly. Under the ERT quality rating procedure, the regulatory agency reviewer evaluates the responses to certain questions (shown in Table A-l) contained in the Quality Assessment (QA) Review section of the ERT. If the reviewer makes the assessment requested by the question and concludes that the documentation is complete, correct, and provides support of the proper performance of this item, additional points are added to the score given by the ERT. The points that are added with a positive response are shown in the fifth column of Table A-l. If the reviewer determines that points were incorrectly assigned (i.e., the information contained in the ERT file is incomplete, erroneous, or not consistent with the test method), points are deducted from the value determined by the initial scoring. The points deducted from the initial score for each component are shown in the sixth column of Table A-l. In addition, the possibility exists that the ERT did not assign points for an item, or part of an item, because that item was not documented in the correct location of the test report. If a positive validation of a misplaced item is provided by the regulatory reviewer, the ERT adds the prorated points (shown in the fourth A-3 ------- Appendix A Procedures for Determining Individual Test Report Quality Ratings column of Table A-l) that would have been assigned for the appropriate placement of the item in the test report. Regulatory agency reviewers may submit their review to EPA at any time, but we anticipate the majority of the reviews will be associated with agency assessments of test reports prepared by facilities to demonstrate compliance with applicable regulations. We recognize that the public comment and review process that is associated with revising or establishing an emissions factor (see Section 11) may result in additional reviews. These reviews will be evaluated by EPA staff and any corrections may be incorporated into the existing quality assessment of the test data, as appropriate. The results of the regulatory agency review and accepted public reviews may be used in calculating a new or revised emissions factor. A-4 ------- Appendix A Procedures for Determining Individual Test Report Quality Ratings Table A-l. Test Report Quality Rating Tool Supporting Documentation Provided Points Assigned if Documentation is Present Regulatory Agency Review Prorated Documentation Points Points Added with Affirmative Response Points Subtracted with Negative Response3 As described in ASTM D7036-12 Standard Practice for Competence of Air Emission Testing Bodies, does the testing firm meet the criteria as an AETB or is the person in charge of the field team a QI for the type of testing conducted? A certificate from an independent organization (e.g.. Stack Testing Accreditation Council (STAC), California Air Resources Board (CARB), National Enviromnental Laboratory Accreditation Program (NELAP)) or self- declaration provides documentation of competence as an AETB. 2 As described in ASTM D7036-12 Standard Practice for Competence of Air Emission Testing Bodies, does the testing firm meet the criteria as an AETB or is the person in charge of the field team a QI for the type of testing conducted? A certificate from an independent organization (e.g., STAC, CARB, NELAP) or self-declaration provides documentation of competence as an AETB. 2 0 2 1 Was a representative of the regulatory agency on site during the test? 0 1 0 Is a description and drawing of test location provided? 3 Is a description and drawing of test location provided? 3 1 3 Has a description of deviations from published test methods been provided, or is there a statement that deviations were not required to obtain data representative of typical facility operation? 6 Is there documentation that the source or the test company sought and obtained approval for deviations from the published test method prior to conducting the test or that the tester's assertion that deviations were not required to obtain data representative of operations that are typical for the facility? 6 2 6 1 Were all test method deviations acceptable? 6 0b 6b a This column shows the points subtracted with a negative response if points were assigned for this item by the initial scoring. b These points are added for an affirmative response or subtracted for a negative response if this item is applicable to the test method used. If this item is not applicable, points are neither added nor subtracted. A-5 ------- Appendix A Procedures for Determining Individual Test Report Quality Ratings Table A-l. Test Report Quality Rating Tool (Cont.) Supporting Documentation Provided Points Assigned if Documentation is Present Regulatory Agency Review Prorated Documentation Points Points Added with Affirmative Response Points Subtracted with Negative Response" Is a full description of the process and the unit being tested (including installed controls) provided? 3 Is a full description of the process and the unit being tested (including installed controls) provided? 3 1 3 Has a detailed discussion of source operating conditions, air pollution control device operations and the representativeness of measurements made during the test been provided? 6 Has a detailed discussion of source operating conditions, air pollution control device operations and the representativeness of measurements made during the test been provided? 6 2 6 Were the operating parameters for the tested process unit and associated controls described and reported? 60 Is there documentation that the process monitors have been calibrated and that the calibration is acceptable? 12 4 12 Was the process capacity documented? 12 4 12 Was the process operating within an appropriate range for the test program objectives? 12 4 12 Were process data collected concurrent with testing? 12 4 12 Were data included in the report for all parameters for which limits will be set? 12 4 12 a This column shows the points subtracted with a negative response if points were assigned for this item by the initial scoring. b These points are added for an affirmative response or subtracted for a negative response if this item is applicable to the test method used. If this item is not applicable, points are neither added nor subtracted. A-6 ------- Appendix A Procedures for Determining Individual Test Report Quality Ratings Table A-l. Test Report Quality Rating Tool (Cont.) Supporting Documentation Provided Points Assigned if Documentation is Present Regulatory Agency Review Prorated Documentation Points Points Added with Affirmative Response Points Subtracted with Negative Response" Is there an assessment of the validity, representativeness, achievement of data quality objectives (DQO) and usability of the data? 9 Did the report include descriptions of the representativeness of the facility operations, control device operation and the measurements of the target pollutants, and were any changes from published test methods or process and control device monitoring protocols identified? 9 3 9 Have field notes addressing issues that may influence data quality been provided? 0 Were all sampling issues handled such that data quality was not adversely affected? 0 0 111 Manual Test Method Questions Have the following been included in the report: dry gas meter (DGM) calibrations, pitot tube and nozzle inspections? 54 Was the DGM pre-test calibration within the criteria specified by the test method? 9 3 9 Was the DGM post-test calibration within the criteria specified by the test method? 9 3 9 Were thermocouple calibrations within method criteria? 9 3 9 Was the pitot tube inspection acceptable? 9 3 9 Were nozzle inspections acceptable? 9 3 9 Were flow meter calibrations acceptable? 9 3 9 a This column shows the points subtracted with a negative response if points were assigned for this item by the initial scoring. b These points are added for an affirmative response or subtracted for a negative response if this item is applicable to the test method used. If this item is not applicable, points are neither added nor subtracted. A-l ------- Appendix A Procedures for Determining Individual Test Report Quality Ratings Table A-l. Test Report Quality Rating Tool (Cont.) Supporting Documentation Provided Points Assigned if Documentation is Present Regulatory Agency Review Prorated Documentation Points Points Added with Affirmative Response Points Subtracted with Negative Response" Was the Method 1 sample point evaluation included in the report? 12 Were the appropriate number and location of sampling points used? 12 4 12 Were the cyclonic flow checks included in the report? 12 Did the cyclonic flow evaluation show the presence of an acceptable average gas flow angle? 12 4 12 Were the raw sampling data and test sheets included in the report? 126 Were all data required by the method recorded? 12 4 12 Were the required leak checks performed and did they meet method requirements? 30 10 180 Was the required minimum sample volume collected? 18 6 18 Did probe, filter and impinger exit temperatures meet method criteria (as applicable)? 24 8 24 Did isokinetic sampling rates meet method criteria? 24 8b 120b Was the sampling time at each point greater than 2 minutes and the same for each point? 18 6 18 Did the report include a description and flow diagram of the recovery procedures? 30 Was the recovery process consistent with the method? 6 2 6 Were all blanks collected in the field? 6 2b 6b a This column shows the points subtracted with a negative response if points were assigned for this item by the initial scoring. b These points are added for an affirmative response or subtracted for a negative response if this item is applicable to the test method used. If this item is not applicable, points are neither added nor subtracted. A-8 ------- Appendix A Procedures for Determining Individual Test Report Quality Ratings Table A-l. Test Report Quality Rating Tool (Cont.) Supporting Documentation Provided Points Assigned if Documentation is Present Regulatory Agency Review Prorated Documentation Points Points Added with Affirmative Response Points Subtracted with Negative Response" Where performed, were blank corrections handled per method requirements? 9 3b 9b Were sample volumes clearly marked on the jar or measured and recorded? 9 3 9 Was the laboratory certified/accredited to perform these analyses? 2 Was the laboratory certified/accredited to perform these analyses? 2 0 2 (only if points were assigned in the initial ERT scoring) Did the report include a complete laboratory report and flow diagram of sample analysis? 132 Did the laboratory note the sample volume upon receipt? 9 3 9 If sample loss occurred, was the compensation method used documented and approved for the method? 9 0 120 Were the physical characteristics of the samples (e.g., color, volume, integrity, pH, temperature) recorded and consistent with the method? 9 3 9 Were sample hold times within method requirements? 9 3b 9b Does the laboratory report document the analytical procedures and techniques? 6 2 6 Were all laboratory QA requirements documented? 15 5 15 a This column shows the points subtracted with a negative response if points were assigned for this item by the initial scoring. b These points are added for an affirmative response or subtracted for a negative response if this item is applicable to the test method used. If this item is not applicable, points are neither added nor subtracted. A-9 ------- Appendix A Procedures for Determining Individual Test Report Quality Ratings Table A-l. Test Report Quality Rating Tool (Cont.) Supporting Documentation Provided Points Assigned if Documentation is Present Regulatory Agency Review Prorated Documentation Points Points Added with Affirmative Response Points Subtracted with Negative Response" Were analytical standards required by the method documented? 12 4 12 Were required laboratory duplicates within acceptable limits? 12 4 12 Were required spike recoveries within method requirements? 12 4 12 Were method-specified analytical blanks analyzed? 12 4 12 If problems occurred during analysis, is there sufficient documentation to conclude that the problems did not adversely affect the sample results? 15 0 15 Was the analytical detection limit specified in the test report? 6 2 6 Is the reported detection limit adequate for the purposes of the test program? 6 2b 6b Were the chain-of-custody forms included in the report? 12 Do the chain-of-custody forms indicate acceptable management of collected samples between collection and analysis? 12 4 12 Instrumental Methods Questions Did the report include a complete description of the instrumental method sampling system? 3 Was a complete description of the sampling system provided? 3 1 3 a This column shows the points subtracted with a negative response if points were assigned for this item by the initial scoring. b These points are added for an affirmative response or subtracted for a negative response if this item is applicable to the test method used. If this item is not applicable, points are neither added nor subtracted. A-10 ------- Appendix A Procedures for Determining Individual Test Report Quality Ratings Table A-l. Test Report Quality Rating Tool (Cont.) Supporting Documentation Provided Points Assigned if Documentation is Present Regulatory Agency Review Prorated Documentation Points Points Added with Affirmative Response Points Subtracted with Negative Response" Did the report include calibration gas certifications? 27 Were calibration standards used prior to the end of the expiration date? 12 4 12 1 Did calibration standards meet 1 method criteria? 15 5 15 Did the report include interference tests? 9 Did interference checks meet method requirements? 9 3b 9b Were the response time tests included in the report? 12 Was a response time test performed? 12 4 12 Were the calibration error tests included in the report? 12 Did calibration error tests meet method requirements? 12 4 12 Did the report include drift tests? 9 Were drift tests performed after each run and did they meet method requirements? 9 3 9 Did the report include system bias tests? 24 Did system bias check results meet method requirements? 24 8 120 Were the converter efficiency tests included in the report? 12 Was the NOx converter test acceptable? 12 4b 12b Did the report include stratification checks? 15 Was a stratification assessment performed? 15 5 15 Did the report include the raw data for the instrumental method? 54 Was the duration of each sample run within method criteria? 9 3 9 1 Was an appropriate traverse performed during sample collection, or was the probe placed at an appropriate center point (if allowed by the method)? 12 4 12 a This column shows the points subtracted with a negative response if points were assigned for this item by the initial scoring. b These points are added for an affirmative response or subtracted for a negative response if this item is applicable to the test method used. If this item is not applicable, points are neither added nor subtracted. A-ll ------- Appendix A Procedures for Determining Individual Test Report Quality Ratings Table A-l. Test Report Quality Rating Tool (Cont.) Supporting Documentation Provided Points Assigned if Documentation is Present Regulatory Agency Review Prorated Documentation Points Points Added with Affirmative Response Points Subtracted with Negative Response" Were sample times at each point uniform and did they meet the method requirements? 9 3 9 Were sample lines heated sufficiently to prevent potential adverse data quality issues? 12 4 12 Were all data required by the method recorded? 12 4 12 a This column shows the points subtracted with a negative response if points were assigned for this item by the initial scoring. b These points are added for an affirmative response or subtracted for a negative response if this item is applicable to the test method used. If this item is not applicable, points are neither added nor subtracted. A-12 ------- Appendix A Procedures for Determining Individual Test Report Quality Ratings 4.0 Rationale for Evaluation Criteria The rationale for including the specific information considered in calculating the ITR are provided below. 1. Completeness Review - The documentation specified in the "Supporting Documentation Provided" are used to assess certain aspects of the test program impacting the quality (e.g., accuracy, precision, reliability, representativeness, consistency with published methods, etc.) of the test data. A complete test report should include: information on the location and contacts for the facility, information on the contacts for the test team, information describing the tested process including process and control device operations relevant for characterizing emissions, information describing the characteristics of the test location(s), a schematic or drawing of the test location(s), description of the published test method(s) used, descriptions of the changes that were necessary to conduct the test, identification of any relevant applicable requirements for which the test will be used, and the identification of any audit and data quality indicators used for verifying the reliability of the test method(s) performed. Documentation of the conduct of the test methods, deviations from required test methods and laboratory reports describing the analysis of the test samples are valuable as indicators of the precision and accuracy of emissions data. The conditions during the time of sampling and the operating parameters for the process and any air pollution controls are indicative of the reliability and representativeness of the emissions measured during the test period. If the various pieces of information listed here are not provided, conformance to the test method cannot be determined and the precision and accuracy of the data cannot be verified. 2. Calibration Reports - Calibration reports provide documentation that equipment has been inspected, properly maintained and is operating correctly during testing. If calibration data are not present, or if the calibration data have expired, the results of testing cannot be considered accurate. Calibration errors will lead to inaccurate measurements and therefore inaccurate emissions rates. • Manual Test Methods - Equipment used to measure flow rate and temperature should be properly inspected and calibrated to ensure accurate results. Flow rate and temperature are important factors in source testing and have a direct impact on the calculation of emissions rates. Faulty or mis-calibrated equipment can lead to inaccurate readings and inaccurate results. • Instrumental Test Methods - Similar to the manual methods, this information is used to determine if analyzers are operating correctly for each test. This data includes pre-test calibration checks, bias determinations for each test run, and equipment operational checks. If the information in this section is missing, the data contained in the test report cannot be considered accurate. A-13 ------- Appendix A Procedures for Determining Individual Test Report Quality Ratings 3. Raw Data Reports • Manual Test Methods - The documentation in this section of the raw data report verifies the information reported in the test program and confirms that field QA activities have been performed. This section provides documentation of stack characteristics, exhaust gas conditions and sample point evaluation, all of which are important for properly characterizing emissions. A complete laboratory report, including recovery procedures and chain-of-custody forms, provides a good indication of how well the samples were recovered, handled, and analyzed. • Instrumental Test Methods - With the exception of raw data, this information is required by the reference methods and is used to verify that operating limits for instrumentation are within acceptable ranges. Stratification checks are now required by the EPA reference methods in some instances and this documentation verifies that sampling procedures were appropriate for the exhaust conditions at the time of the test. • Process and Facility Operation - Process and operating data are key components in demonstrating that the facility is operating within normal conditions and that the data collected are representative of normal operation. This information also allows for the calculation of production-based emissions factors. Documentation of control devices and their monitoring parameters verifies that devices are working properly, provides information that can later be used as indicators of continued performance and assures that testing was conducted under typical control conditions. 4. QA Review - The evaluation criteria listed below are based upon the QA requirements of the EPA's reference methods, New Source Performance Standards (NSPS) and National Emission Standards for Hazardous Air Pollutants (NESHAP). • Manual Test Method QA - Calibration criteria evaluated in this review are specified in the reference methods and address field measurement equipment calibrations and inspections. These criteria establish the minimum operating limits for measurement equipment that provide confidence in the accuracy and precision of the test results. This information addresses the critical elements of the test equipment that have a direct impact on measurement and subsequent calculation of sample volumes, effluent flow rates and pollutant concentrations. • Laboratory QA - Laboratory information evaluated in this review is directly related to the accuracy of the laboratory analysis of pollutant samples collected in the field. Listed items have a direct impact on the analysis of the samples and the reliability of the test data. For example, sample integrity during transport is assessed by comparing sample volumes to the values recorded prior to shipping, which may indicate potential loss of sample media. Another example is analytical detection limits, which should be sensitive enough to measure the pollutant of interest at concentrations appropriate for the test plan. • Instrumental Test Method QA - The QA checks for instrumental test methods are specified in the reference methods. These checks are designed to demonstrate that the sampling system and analyzers are: A-14 ------- Appendix A Procedures for Determining Individual Test Report Quality Ratings i. Capable of meeting minimum acceptance criteria for acquiring a representative effluent sample, and ii. Operating in a stable environment. This information verifies that the analytical accuracy and precision of the measurement results are acceptable for regulatory programs. • Process Data OA - The evaluation criteria listed in this review are based upon the instrumental test method evaluations for data accuracy and representativeness. Process disruptions may have a negative impact on the accuracy of the data. Calibration information establishes the reliability and accuracy of the values used to calculate emissions rates. • Other OA Indicators - Among other factors that should increase the assurance of high-quality data from a source emissions test is the participation of qualified individuals during the field testing. A QI (e.g., someone recognized by the Source Evaluation Society (SES) or meeting the criteria outlined in ASTM standard D7036-12) is someone who has demonstrated a high level of knowledge and ability consistent with an experienced field test team leader responsible for emissions test planning, preparation, conduct, and reporting. Another factor is the presence of a qualified observer during the field emissions testing. Such an observer may be an independent technical expert or a representative of the state, local or federal agency familiar with source emissions testing and who was on site to monitor progress during the test. A-15 ------- Appendix B Procedures for Handling Test Data Tha i are Below the Method Detection Limits ------- Appendix B Procedures for Handling Test Data That are Below the Method Detection Limits 1.0 Introduction In some cases, the result of a process emissions test is not an emissions rate, but a determination that the target pollutant was not present at or above the minimum detection limit of the test method (MDL). The EPA defines MDL as the minimum concentration of a substance that can be measured and reported with a given level of confidence that the analyte concentration is greater than zero. The MDL is determined from an analysis of a sample in a given matrix containing the analyte. For purposes of emissions factor development, that level of confidence is 99 percent. Stated another way, the MDL is the smallest amount of a substance that an analytical method can reliably distinguish from zero, at a specified confidence level, from the signal produced by a blank sample. It is important to understand that the MDL is a statistical parameter and not a chemical one. For EPA test methods (e.g., Method 5 - Particulate Matter) where a single analytical technique is specified, the MDL will be the same for all source tests. However, the MDL can vary from substance to substance and from measurement process to measurement process in cases where the test method (e.g., Method 29 - Metals Emissions from Stationary Sources) allows for alternative analytical techniques. In these cases, variability is introduced into MDLs by the analysts conducting the measurements, the equipment and chemicals used in the measurements and the Quality Assurance/Quality Control (QA/QC) procedures used. A separate MDL should be generated for each test program. After MDLs have been developed, the results of the testing can be compared. Results that are less than the MDL are referred to as below the MDL (BDL). Test run values where some measurement data from the test method are BDL are referred to as detection level limited (DLL). 2.0 Description of Procedures We have developed specific procedures for handling ADL, BDL, and DLL data at the test run level in ERT submissions when calculating test averages and for addressing cases where some or all of the data included in the candidate data set selected for use in developing emissions factors are ADL, BDL, and/or DLL. Note that we apply the procedures for determining the test average in this appendix prior to conducting the data outlier tests described in Appendix C so that appropriate values are used in outlier analyses. It is not unusual for environmental data to contain some BDL values. Because such values are expected, data users have developed calculation techniques to account for these BDL values that exist, but are difficult to quantify with the accuracy typically associated with ADL values. Generally, these calculation techniques recognize that small and large sample sizes do not warrant rigorous mathematical approaches to provide a numerical value that replaces a value found to be BDL. On the other hand, medium sample sizes warrant mathematical approaches that provide numerical values associated with a maximum likelihood estimator (MLE), a value found via calculation to be between V2 the MDL and the MDL. These approaches generally work well for programs managed by other agency offices tasked with establishing regulatory emissions limits and determining compliance for specific individual facilities in narrowly-defined source categories. However, such rigor is overly complicated for the WebFIRE emissions factor development program because emissions factors B-l ------- Appendix B Procedures for Handling Test Data That are Below the Method Detection Limits are, by design, representative of generic facilities in broadly-defined source categories. As a result, the procedures adopted for handling ADL, BDL, and DLL data in the derivation of emissions factors are more straightforward and are based upon two general principles. First, as emissions test values generally represent the average of three test runs, a data set containing more than 10 test values is based upon more than 30 individual test runs. According to the central limit theorem, such a data set is important because as one obtains 30 or more individual samples (i.e., test runs), the distribution of those samples approaches that of a normal distribution whose statistical characteristics are obtained readily. Second, the use of ADL and DLL data is preferred over the use of BDL data in cases where adequate amounts of ADL data are available. This generally reduces the uncertainty associated with emissions factors derived, in part, from data that are BDL. In understanding the recommended procedures for handling ADL, BDL, and DLL data, note that a test run refers to the net period of time during which an emissions sample is collected, as well as to the amount of pollutant emitted during that time period. Likewise, a test refers to the net period of time over which separate runs, typically three, are conducted, as well as to the average amount of pollutant emitted over the test period. In most cases, the emissions test data contained in the ERT are used by sources to demonstrate compliance with regulatory limits. Although we acknowledge that analytical laboratories and state regulatory agencies use varying approaches in addressing ADL, BDL, and DLL data for compliance assessments, the EPA's preferred approach is to report the BDL data as "real" values and to flag the data appropriately when calculating test averages. Table B-l summarizes our procedures for calculating an average emissions value and assigning a detection limit flag from ERT test run-level data. In calculating an emissions factor value from a candidate data set where some of the test averages are flagged as BDL, we do not include BDL values that are greater than the highest ADL or DLL value in the candidate data set. We also do not calculate emissions factors in cases where all of the candidate data are BDL. Table B-l. Summary of WebFIRE Procedures for Handling ADL, BDL, and DLL Test Data in Calculating a Test Average from Run-Level Data Types of Test Run Data Basis for Calculating Average Value All test runs are ADL WebFIRE calculates the average of the test run values and assigns a flag of ADL to the calculated average. All test runs are DLL or a mix of ADL and DLL WebFIRE calculates the average of the test run values and assigns a flag of DLL to the calculated average. All test runs are BDL WebFIRE calculates the average using the BDL/2 values and assigns a flag of BDL to the calculated average. B-2 ------- Appendix B Procedures for Handling Test Data That are Below the Method Detection Limits Table B-2. Summary of WebFIRE Procedures for Handling ADL, BDL and DLL Test Data in Calculating a Test Average Types ol- l ost Run linsis lor (':ilciil;ilin<> A\cr;i«>c Ysilue All test runs are a mix of ADL, DLL, and/or BDL values WebFIRE calculates the average using the ADL and DLL values and 'A the BDL values, provided that 'A the BDL is equal to or less than the highest ADL or DLL value, and assigns a flag of DLL to the calculated average. When the BDL is greater than the highest ADL or DLL value, that BDL value is excluded from the average calculation. B-3 ------- Appendix C Procedures For Determining Statistical Outliers ------- Appendix C Procedures for Determining Statistical Outliers 1.0 Introduction After a candidate data set containing more than three test values has been selected for emissions factor development and the BDL analysis has been performed (see Appendix B), WebFIRE will conduct a set of tests (i.e., the Dixon Q Test or the Rosner Test) to identify values in the candidate data set that are statistical outliers (i.e., a value that does not conform to the statistical pattern established by other values under consideration). These tests are incorporated into the EPA's WebFIRE (see Section 6.2) and are based on algorithms in ProUCL, an EPA- developed statistical package available to the public free of charge.2 We neither endorse ProUCL or any other statistical package, nor limit our ability to use ProUCL or any other statistical package, as other statistical packages are capable of performing the requisite outlier analysis. Emissions data are usually log-normally distributed; therefore, for the purposes of evaluating outliers for emissions factor development, we assume that all emissions test data values in the candidate data set follow log normal distributions. Thus, we log-transform every test value in the candidate data set prior to conducting outlier tests. 2.0 Description of Procedures In WebFIRE, the outlier test is applied to the log-transformed values in the candidate data set in an iterative process. Each run of the outlier test identifies whether a low or high value is an outlier, and the test is applied until all outliers have been identified and removed from the candidate data set. However, the data values removed from the candidate data set are not removed from the WebFIRE database because the outlier designation is relative to the population of values selected for the candidate data set (i.e., an outlier in one data set may be an acceptable value in a different data set, especially when differing data sets are being compared using a t-test). The general approach to use for determining outliers is shown in Figure C-l. If the candidate data set contains less than three test values, a statistical outlier test is not performed by WebFIRE because statistical analyses cannot determine outliers from such a small sample size. Moreover, with just two values it is impossible to tell which one might be the outlier. If there are three to 24 test values in the candidate data set, WebFIRE applies the Dixon test to determine outliers. If there are 25 or more test values for analysis, the Rosner test is used to identify outliers. Consistent with ProUCL, all outlier tests in WebFIRE are performed using the 95-percent confidence level using a 1-tailed statistical test, meaning that we are willing to accept a 5 percent risk of rejecting a valid observation. 2 ProUCL is described and can be downloaded from the following Internet address: www.epa.gov/land- research/proucl-software. C-l ------- Appendix C Procedures for Determining Statistical Outliers Figure C-l. Procedures to Identify Data Outliers in a Candidate Data Set C-2 ------- Appendix C Procedures for Determining Statistical Outliers If an outlier is detected by WebFIRE, it is flagged in the data set and the number of valid test data values remaining in the candidate data set is determined. The Rosner test or the Dixon test, as determined by the number of test data values, is performed again. Outliers are removed from the candidate data set and the appropriate outlier test is performed again until the candidate data set does not contain outliers. When the data set does not contain outliers, WebFIRE calculates the average of the remaining test values (not the log-transformed values) and uses that average as the emissions factor value. C-3 ------- Appendix D Emissions Factor Development and Data Quality Characterization Procedures ------- Appendix D Emissions Factor Development and Data Quality Characterization Procedures 1.0 Introduction The procedures used in WebFIRE to determine which individual test data values (i.e., average values derived from multiple test runs) to use in deriving an emissions factor are based upon two premises: (1) higher-quality data are preferred over lower-quality data, and (2) more test data values are preferred over fewer test data values. These concepts are combined with simple statistical procedures to derive the approach used by WebFIRE in assigning a quality rating to the derived emissions factor. This quality rating indicates how well the derived factor represents the average of the emissions from a particular source category. These procedures are described in detail in the following sections. 2.0 Terms and Definitions As a prelude to presenting these procedures, it is important to explain and define the parameters used for the emissions factor calculations and data quality characterizations: 1. Individual Test Rating (ITR) - The ITR value is the quality indicator assigned to individual source test reports by the ERT. This value is based upon the level of documentation available in the test report, the use and conformance with established the EPA reference test method (or other test methods with comparable precision and accuracy), and the operation of the source and associated emissions controls at known and representative conditions. The ITR ranges from a high of 100 to a low of 0. The ERT procedures for calculating the ITR are presented in Appendix A. 2. Composite Test Rating (CTR) - The CTR is a weighted-average quality indicator for groups of test reports. An inverse square weighting of the ITR values for the test reports is used in calculating the CTR. As with the ITR, the CTR ranges from a high of 100 to a low of 0. 3. Factor Quality Index (FQI) - The FQI is a numerical indicator representing the derived emissions factors ability to estimate emissions for the entire national population. The FQI is dependent upon both the CTR and the number of test values used to develop the emissions factor. The FQI is analogous to the standard error of the mean (oM) in statistical calculations. In statistical calculations, oM provides an indication of the confidence associated with an estimate of the mean of a population when a given number of samples are obtained from the population. The oM is calculated from the standard deviation of the samples (or other estimate of the populations variability) divided by the square root of the number of samples. In the FQI, the parameter 100/CTR simulates the function of the standard deviation in that measurements with great variability (due to variations between sources in the population, variations with individual sources, precision and accuracy of the methods used for measurement, and other factors affecting variations in the measured values) are larger in value than measurements with less variability. In the FQI, the minimum value is associated with emissions tests that are judged to have the greatest precision and accuracy of sources operating at representative conditions. This is the appropriate data set selection for use in emissions factor derivation as increases in the oM and increases in the number of samples used to estimate the mean of the population serve to reduce the value of the FQI in proportion to the estimated reliability of the D-l ------- Appendix D Emissions Factor Development and Data Quality Characterization Procedures estimate of the mean. In addition, like oM, equal values of FQI provide comparable reliability in the estimate of the population mean irrespective of differences in the CTR and the number of samples used (i.e., test values) for estimating the population mean. 4. Emissions factor quality indicator - There are three quality indicators used to characterize the calculated emissions factor: • Highly representative is assigned to emissions factors having the lowest FQI rating. • Moderately representative is assigned to emissions factors having an intermediate FQI rating. • Poorly representative is assigned to emissions factors having the highest FQI rating. 5. Boundary criteria - Boundary criteria refers to the specific conditions that determine which quality rating (i.e., poorly representative, moderately representative or highly representative) is assigned to an emissions factor. Based upon our experience with developing emissions factors, we determined that, for source categories containing more than 15 sources, an emissions factor derived from three tests with a CTR of 100 (FQI = 0.5774) qualifies for a moderately-representative rating. Likewise, an emissions factor derived from more than 11 tests with a CTR of 100 (FQI = 0.3015) qualifies for a highly- representative rating. These criteria are designed to allow for the development of highly- representative emissions factors without the burden of conducting an inordinate amount of emissions tests. For source categories containing 15 or fewer sources, it is appropriate to allow fewer tests to attain a specific quality rating. An emissions factor developed from more than one test with a CTR of 100 (FQI = 1.000) qualifies for a moderately- representative rating and more than three tests with a CTR of 100 (FQI = 0.5774) qualifies the emissions factor for a highly-representative rating. For both source category population sizes, degradation of the CTR warrants an increase in the number of tests to compensate for the decrease in the average test quality to achieve the same FQI. Table D-l provides the boundary line equations for the two population sizes and Figures D-l and D-2 provide the graphical relationship between the CTRs and the number of tests appropriate for the boundary conditions, respectively. Ta )le D-l. FQI and Boundary Line Equations If the source category contains ... Then use these bounc ary line equations ... Poorly to moderately representative Moderately to highly representative More than 15 sources FQI = 0.5774 N = 30,000 * CTR"2 FQI = 0.3015 N = 110,000 * CTR"2 15 or fewer sources FQI = 1 N= 10,000 * CTR"2 FQI = 0.5774 N = 30,000 * CTR"2 D-2 ------- Appendix D Emissions Factor Development and Data Quality Characterization Procedures Figure D-l. Emissions Factor Representativeness Areas for Source Categories Containing More Than 15 Sources 90 80 70 60 v> 30 20 Highly Representative Moderately Representative Poorly Representative Composite Test Rating, % D-3 ------- Appendix D Emissions Factor Development and Data Quality Characterization Procedures Figure D-2. Emissions Factor Representativeness Areas for Source Categories Containing 15 or Fewer Sources 60 65 70 75 Composite Test Rating, % 3.0 Procedures The following steps summarize the specific calculation and data quality characterization procedures used in WebFIRE to calculate a new or revise an existing emissions factor from a candidate data set that has been subjected to the WebFIRE BDL and outlier analyses (See Appendices B and C, respectively). The steps described in this section are performed when deriving a user-defined emissions factor. • Step 1 - WebFIRE arranges the individual test data values being considered in descending order by: (1) the ITR and (2) the test data value. • Step 2 - Beginning with the second individual test data value and continuing sequentially in order, WebFIRE calculates the CTR using the following equation: D-4 ------- Appendix D Emissions Factor Development and Data Quality Characterization Procedures CTR = n n f i 2 /=1 1-0.5 JTRU N Where: CTR = Composite Test Rating, ITR = Individual Test Rating (assigned by ERT), and N = Number of tests with ITRs equal to or greater in value as those included in the candidate data set. It should be noted that a CTR is calculated for each combination of individual test values in the data set potentially used to derive an emissions factor. For example, using a data set consisting of 10 test values, WebFIRE would calculate 9 CTRs, beginning with the first two data points, then the first three data points, and so forth until a CTR is calculated for all 10 data values. • Step 3 - For each calculated CTR, WebFIRE calculates the FQI using the following equation: 100 FQI ~ 0CTR X iV)0-5 Where: FQI = Factor Quality Index, CTR = Composite Test Rating associated with the data set selected for deriving the emissions factor, and N = Number of tests with ITRs equal to or greater in value as those included in the candidate data set. • Step 4 - WebFIRE compares the calculated FQI with the FQI for the previous ITR grouping. If the FQI associated with the larger grouping (i.e., more data values) is less than the FQI with fewer data values, then WebFIRE proceeds back to Step 2 to perform the next sequence in the calculations. If the FQI associated with the larger grouping is greater than the preceding FQI, then WebFIRE does not include the test data value responsible for the increase in the FQI in calculating the emissions factor and excludes the remaining data (with lower ITRs) from consideration. • Step 5 - WebFIRE calculates the emissions factor using all test data values that were included in calculating the lowest FQI. This includes all test data values with higher ITRs than the ITR value that resulted in an increased FQI value. D-5 ------- Appendix D Emissions Factor Development and Data Quality Characterization Procedures • Step 6 - WebFIRE determines if the SCC corresponding to the candidate data set selected by the user contains 15 or fewer sources. Appendix F lists the SCCs that we expect to contain 15 or fewer sources. • Step 7 - WebFIRE compares the FQI for the test values used to calculate the emissions factor with the corresponding boundary criteria for assigning one of the three emissions factor quality ratings. Different boundary criteria are used for source categories containing 15 or fewer sources and for source categories containing greater than 15 sources. Example 1 Table D-2 below contains an example set of 35 individual test data values selected to develop an emissions factor for SCC 303010. The table shows the test data values, their corresponding ITR and N values, and the calculated CTR and FQI values. The table also indicates whether or not the test data value should be used to calculate an emissions factor and the representativeness of the resulting emissions factor (not shown in the table). Table D-2. Individual Test Data and Various Characteristics Individual Use for Test EF EF Value ITR CTR N FQI Average? Representativeness 0.0108 98 98.00 1 1.0204 Yes Poorly 0.1100 98 98.00 2 0.7215 Yes Poorly 0.0917 92 95.87 3 0.6022 Yes Poorly 0.0212 92 94.86 4 0.5271 Yes Moderately 0.0339 91 94.05 5 0.4755 Yes Moderately 0.0027 91 93.52 6 0.4365 Yes Moderately 0.0563 89 92.83 7 0.4072 Yes Moderately 0.0165 89 92.32 8 0.3829 Yes Moderately 0.0158 88 91.81 9 0.3631 Yes Moderately 0.0044 88 91.41 10 0.3460 Yes Moderately 0.0675 88 91.08 11 0.3310 Yes Moderately 0.0043 88 90.81 12 0.3179 Yes Moderately 0.0449 74 89.10 13 0.3113 Yes Moderately 0.0203 73 87.58 14 0.3052 Yes Moderately 0.0603 70 85.97 15 0.3003 Yes Highly 0.0425 70 84.64 16 0.2954 Yes Highly D-6 ------- Appendix D Emissions Factor Development and Data Quality Characterization Procedures Table D-2. Individual Test Data and Various Characteristics (Cont.) Individual Use for Test EF EF Value ITR CTR N FQI Average? Representativeness 0.0130 70 83.51 17 0.2904 Yes Highly 0.1440 69 82.45 18 0.2859 Yes Highly 0.0177 68 81.45 19 0.2817 Yes Highly 0.0317 68 80.58 20 0.2775 Yes Highly 0.0052 68 79.82 21 0.2734 Yes Highly 0.1350 68 79.14 22 0.2694 Yes Highly 0.0006 60 77.90 23 0.2677 Yes Highly 0.0023 45 74.85 24 0.2727 No Not applicable 0.0724 45 72.33 25 0.2765 No Not applicable 0.0960 44 70.08 26 0.2799 No Not applicable 0.0538 40 67.54 27 0.2850 No Not applicable 0.0170 38 65.07 28 0.2904 No Not applicable 0.0132 35 62.48 29 0.2972 No Not applicable 0.0124 34 60.14 30 0.3036 No Not applicable 0.0029 30 57.41 31 0.3128 No Not applicable 0.0018 30 55.16 32 0.3205 No Not applicable 0.0083 30 53.28 33 0.3268 No Not applicable 0.0009 30 51.66 34 0.3319 No Not applicable 0.0034 30 50.27 35 0.3362 No Not applicable Figure D-3 shows a plot of the CTR and N data in Table D-2 and the boundaries created by the line equations. In developing the emissions factor for the example data set, the first 23 values in Table D-2 are included in the emissions factor calculation because the FQI increases for the first time between the 23rd and 24th pair. Using the first 23 values yields an emissions factor of 0.0413 with a quality rating of "highly representative." D-7 ------- Appendix D Emissions Factor Development and Data Quality Characterization Procedures Figure D-3. Plot of CTR and N Data from Table D-3 45 50 55 60 65 70 75 80 85 90 95 100 Composite Test Rating, % Example 2 Table D-3 contains another example set of individual test data values selected for use in developing an emissions factor for SCC 303011, which is expected to contain 15 or fewer sources per Table D-l. Table D-3. Individual Test Data Values Selected for Developing an Emissions Factor for a Source Category Containing 15 or Fewer Sources Individual Test Data Value ITR 0.0015 45 0.0004 60 0.0055 30 0.0019 30 0.0012 30 0.0640 30 0.0113 30 D-8 ------- Appendix D Emissions Factor Development and Data Quality Characterization Procedures Table D-3. Individual Test Data Values Selected for Developing an Emissions Factor for a Source Category Containing 15 or Fewer Sources (Cont.) Individual Test Data Value ITR 0.0088 30 0.0029 88 0.0611 92 0.0402 70 0.0299 74 0.0375 89 0.0118 68 0.0072 99 Table D-4 shows the same data after the data have been sorted and the N, CTR and FQI values have been calculated. The table also indicates whether or not the test data value should be used to calculate an emissions factor and the representativeness of the resulting emissions factor. Table D-4. Individual Test Data and Various Characteristics for a Source Category with 15 or Fewer Sources Individual Use for Test EF EF Value ITR CTR N FQI Average? Representativeness 0.0072 99 99.00 1 1.0101 Yes Poorly 0.0611 92 95.31 2 0.7419 Yes Moderately 0.0375 89 93.06 3 0.6204 Yes Moderately 0.0029 88 91.71 4 0.5452 Yes Highly 0.0299 74 87.16 5 0.5131 Yes Highly 0.0402 70 83.42 6 0.4894 Yes Highly 0.0118 68 80.56 7 0.4692 Yes Highly 0.0004 60 76.80 8 0.4603 Yes Highly 0.0015 45 69.75 9 0.4779 No Not applicable 0.0012 30 58.11 10 0.5442 No Not applicable 0.0019 30 51.97 11 0.5801 No Not applicable 0.0088 30 48.12 12 0.6000 No Not applicable 0.0113 30 45.45 13 0.6103 No Not applicable 0.0640 30 43.48 14 0.6147 No Not applicable 0.0055 30 41.97 15 0.6152 No Not applicable D-9 ------- Appendix D Emissions Factor Development and Data Quality Characterization Procedures Figure D-4 shows a plot of the CTR and N values shown in Table D-4 and the boundaries created by the line equations. In developing the emissions factor for the example data set, the first 8 values in Table D-4 are included in the emissions factor calculation because the FQI increases for the first time between the 8th and 9th pair. Using the first 8 values yields an emissions factor of 0.0239 with a quality rating of "highly representative " Figure D-4. Plot of Selected Data from Table D-6 27 24 21 18 CO 9 6 3 '¦yy.'-yy.'- Highly Representative 1111111111 8th value Moderately Representative 60 65 70 75 Composite Test Rating, % For test data submitted to WebFIRE using ERT, a numerical ITR value will be assigned to the data by ERT prior to incorporation in WebFIRE. For data that were incorporated into WebFIRE prior to the development of ERT (e.g., the underlying data used to develop AP-42 emissions factors), the current subjective, letter-grade quality ratings have been converted to numerical values as follows: Test Data Letter Grade Equivalent ITR Score A 80 B 60 C 45 D 30 For example, a previous test rated as a "B" that is part of the candidate data set for emissions factor development would have an ITR value of 60 for use in calculating the CTR. We D-10 ------- Appendix D Emissions Factor Development and Data Quality Characterization Procedures used this approach because it would be time intensive and prohibitively costly to reevaluate every previous test report and assign it an ITR based on the rating system contained in the ERT. D-ll ------- Appendix E Statistical Procedures for Determining Valid Data Combinations ------- Appendix E Statistical Procedures for Determining Valid Data Combinations 1.0 Introduction As new emissions data are incorporated into WebFIRE, we expect that, periodically, we will want to determine whether a new data set should be combined with an existing data set for a given source type or category. When determining whether data sets should be combined, we will follow the procedures specified in this appendix. We anticipate applying these procedures on a case-by-case basis, most likely on data that are expected to be from the same type of emissions units, with similar types of emissions controls and under the same type of operational process. For example, a statistical analysis would be performed on source test data for the following processes at a Portland cement plant: a dry- process kiln, a wet-process kiln, a preheater kiln and a preheater/precalciner kiln. Each of the processes employs either an electrostatic precipitator (ESP) or a fabric filter. Emissions from the processes and control type combinations (e.g., a dry-process kiln controlled by an ESP and a wet-process kiln controlled by a fabric filter) would be compared to determine if the data sets should be combined. These procedures would not be applied to source test data from processes or controls that are clearly separate and distinct (e.g., coke oven emissions and electric arc furnace emissions) nor would they be applied to source test data that are clearly representative of the same source type, same fuel or same controls. In cases where it is acceptable to combine the new and existing data, the BDL and outlier calculation procedures found in Appendix B and Appendix C, respectively, are used in the emissions factor development process. Simple statistical characteristics such as the number of values, the mean and the variance can be used to represent a data set for computational purposes. Comparison of similar characteristics between data sets can determine whether the data sets are from the same population of values. If the data sets are determined to be from the same population of values, the data sets can be combined into a single, combined data set, often referred to as a pool. Pooled values are preferred over individual values because pooled values provide the best estimate of a population's variance. 2.0 Description of Procedures The data combination assessment procedures that we would use to determine whether a new data set should be combined with an existing data set are based upon use of the Student's t-test. For this analysis, we use a two-tailed test rather than a one-tailed test. The EPA uses the following steps to determine if it is appropriate to combine new data with existing data: 1. Obtain all emissions test data (i.e., the number of values and the numerical values of the data set) for the new data set and the data used to calculate the existing emissions factor. Include those data values that were previously identified in the emissions factor development for the source type or category as potential outliers. The new and existing data should represent average emissions test values, not test run values. Calculate the natural log of each value to create the data sets for use in the Student's t-test. E-l ------- Appendix E Statistical Procedures for Determining Valid Data Combinations 2. Prepare a null hypothesis that the data sets are from the same distribution (the means of the two sets are equal) and an alternative hypotheses that the data sets are not from the same distribution (the means of the two sets are unequal). 3. Conduct a Student's t-test on the data sets assuming unequal variances. By assuming an unequal variance, the variance of the data set and the characteristics of equivalency do not need to be determined. Calculate the absolute value of the Student's t-test statistic. 4. Find tcriticai values at the 0.05 significance level for the appropriate number of degrees of freedom. If the absolute value of the Student's t-test statistic is greater than the tcriticai value, the means are assumed to be unequal (i.e., the data sets should not be combined). If the absolute value of the Student's t-test yields a value that is less than or equal to the tcriticai value, the means are assumed to be equal (i.e., the data sets can be combined). The two examples below illustrate the use of the data combination assessment procedures based on the Student's t-test. In both examples, the determination as to whether to combine the two data sets is based on an assessment of the difference in the means of the data sets. Accepting the null hypothesis means that we are 95 percent confident that the differences between the means of the Group A Source Test Data and Group B Source Test Data are not statistically significant; therefore, we can combine both data sets. If we accept the alternate hypothesis (reject the null hypothesis), we are 95 percent confident that the differences between the means of the two source test data sets are statistically significant and we cannot combine Group A Source Test Data and Group B Source Test Data to derive an emissions factor. Example 1 Table E-l presents two data sets: Group A, which is used to calculate the current emissions factor of 0.0118 pounds of pollutant per ton of fuel combusted, and Group B, which is from a similar source category with similar controls and operated under a similar process. Table E-l. Emissions Factor Characteristics for Group A and B Group A Source Test Data Group B Source Test Data 0.0015 0.0029 0.0004 0.0611 0.0055 0.0402 E-2 ------- Appendix E Statistical Procedures for Determining Valid Data Combinations Table E-l. Emissions Factor Characteristics for Group A and B (Cont.) Group A Source Test Data Group B Source Test Data 0.0019 0.0299 0.0012 0.0375 0.064 0.0118 0.0113 0.0072 0.0088 Using an alpha of 0.05, these values yield a t-test statistic whose absolute value is 1.401 and a tcriticai value of 2.160. Since the absolute value of the t-test statistic is less than the tcriticai value, the analysis shows that the means of Group A and Group B are equal. Therefore, the null hypothesis is accepted, meaning that the data sets are from the same distribution; thus their means are the same. Given that the means of Groups A and B are equal, the individual test data sets can be combined and a revised emissions factor could be calculated using the procedures specified in Appendices B through D. If the means had been unequal, the Group A and B individual test data sets would not be combined. Example 2 Table E-2 presents two data sets: Group C, which is used to calculate the current emissions factor of 0.0015 pounds of pollutant per ton of fuel combusted, and Group D, which is from a similar source category with similar controls and operated under a similar process. Table E-2. Emissions Factor Characteristics for Group C and D Group C Source Test Data Group D Source Test Data 0.0005 0.0029 0.0015 0.0029 0.0025 0.0029 Using an alpha of 0.05, these values yield a t-test statistic whose absolute value is 2.425 and a Wicai value of 4.303. Since the absolute value of the t-test statistic is less than the tcriticai value, the analysis shows that the means of Group A and Group B are equal. Therefore, the null hypothesis is accepted, meaning that the data sets are from the same distribution; thus their means are the same. Given that the means of Groups A and B are equal, the individual test data sets can be combined and a revised emissions factor could be calculated using the procedures specified in Appendices B through D. If the means had been unequal, the Group A and B individual test data sets would not be combined. E-3 ------- Appendix F Source classification codes for source CATEGOIRES CONTAINING 15 OR FEWER SOURCES Appendix F is now located at: https://www.epa.gov/svstem/files/documents/2021-08/appendix-f- scc-codes-for-less-than-15-sources O.pdf l ------- United States Office of Air Quality Planning and Standards Publication No. EPA-45 3/B-21 -001 Environmental Protection Sector Policies and Programs Division November 2021 Agency Research Triangle Park, NC ------- |