&EPA United States Office of EPA 9355.0-91 Environmental Protection Emergency and April 2005 Agency Remedial Response PEER REVIEW DRAFT GUIDANCE ON SURFACE SOIL CLEANUP AT HAZARDOUS WASTE SITES: IMPLEMENTING CLEANUP LEVELS PEER REVIEW DRAFT ------- GUIDANCE ON SURFACE SOIL CLEANUP AT HAZARDOUS WASTE SITES: IMPLEMENTING CLEANUP LEVELS PEER REVIEW DRAFT Prepared for: Office of Emergency and Remedial Response U.S. Environmental Protection Agency Prepared by: Industrial Economics, Incorporated 2067 Massachusetts Avenue Cambridge, MA 02140 Dr. Shahrokh Rouhani New Fields, Incorporated 1349 W. Peachtree Street Atlanta, GA 30309 April 2005 ------- DISCLAIMER This document presents current OSWER technical options regarding implementation of cleanup levels for surface soil at hazardous waste sites. EPA and state personnel may use and accept other technically sound approaches, either on their own initiative, or at the suggestion of potentially responsible parties or other interested parties. Therefore, interested parties are free to raise questions and objections about the substance of this document and the appropriateness of the application of the options to particular situations. EPA will, and states should, consider whether or not the approaches in this document are appropriate in each situation. This document does not impose any requirements or obligations on EPA, states, or other federal agencies, or the regulated community. The sources of authority and requirements in this matter are the relevant statutes and regulations (e.g., the Comprehensive Environmental Response, Compensation and Liability Act, and the Resource Conservation and Recovery Act). EPA welcomes public comments on this document at any time and may consider such comments in future revisions of this document. ------- TABLE OF CONTENTS 1.0 INTRODUCTION 1 2.0 GENERAL CONCEPTS 3 3 .0 RECOMMENDED OPTIONS FOR IMPLEMENTING CLEANUP LEVELS 9 4.0 METHODS FOR IMPLEMENTING CLEANUP LEVELS AS AREA AVERAGES .15 Iterative Truncation Method 17 Confidence Response Goal Method 20 Geostatistical Methods 22 5.0 APPENDIX A: GEOSTATISTICAL REFERENCE SOURCES AM) SOFTWARE 33 6.0 APPENDIX B: SAMPLE GEOSTATISTICAL PROCEDURES FOR COMPUTING EPC 34 Approach 34 Example Application 36 Key Potential Method Advantages 37 7.0 BIBLIOGRAPHY 39 l ------- 1.0 INTRODUCTION Achieving EPA's goal of protecting human health and the environment typically requires that successful soil cleanup at hazardous waste sites eliminate unacceptably high risks associated with potential exposure to contaminated soils. It is important to achieve this goal in a cost-effective manner. This document provides guidance on approaches to surface soil cleanup that address these considerations and can help risk managers find the most cost-effective way to reduce human health risk.1 Risk assessors and statisticians should also find this guidance useful in providing information needed to support risk management decisions. Such decisions are not purely statistical; while they require statistical inputs, they are ultimately management decisions. This guidance does not address development of remediation goals, but rather focuses on how cleanup goals should be implemented at a site in delineating what soil areas to remediate and the concentration above which soil should be removed to achieve a protective cleanup. This guidance does not address the related determination of exposure point concentrations used for risk assessment. This topic is covered in: Supplemental Guidance to RAGS: Calculating the Concentration Term; and Calculating Upper Confidence Limits for Exposure Point Concentrations at Hazardous Waste Sites.2 The guidance also does not address site assessment sampling design. Nor does it address post-cleanup confirmatory sampling to be done regardless of how the cleanup goals are implemented. This topic is covered in: Methods for Evaluating the Attainment of Cleanup Standards, Volume I.3 The Multi-Agency Radiation Site Survey and Site Investigation Manual (MARSSIM) provides guidance on survey design for confirmatory sampling for radionuclides,4 and is based on an area average approach and could be adapted for other contaminants. A vital concept in this document is the difference between the implementation of a cleanup level as a not-to-exceed level or as an area average. The not-to-exceed option typically entails treating or removing all soil with contaminant concentrations exceeding the cleanup level. The area average option typically involves treating or removing soils with the highest contaminant concentrations such that the average (usually the upper confidence limit of the average) concentration remaining onsite after remediation is at or below the cleanup level. A key factor driving the choice between these options is the basis for the cleanup level. The method used in 1 The guidance is not applicable to sites where ecological risks are driving cleanup decisions, or where contaminants have the potential to leach to groundwater and where such leaching is the basis for cleanup. 2 U. S. EPA Office of Solid Waste and Emergency Response, A Supplemental Guidance to RAGS: Calculating the Concentration Term, Publication 9285.7-081, 1992; and U.S. EPA Office of Emergency and Remedial Response, Calculating Upper Confidence Limits for Exposure Point Concentrations at Hazardous Waste Sites, OSWER 9285.6-10, December 2002. 3U.S. EPA Office of Solid Waste and Emergency Respons q,Methods for Evaluating the Attainment of Cleanup Standards, Volume 1, Publication 230/02-89-042, February 1989. 4 U. S. EPA, U. S. DOE, U. S. NRC, Multi-Agency Radiation Survey and Site Investigation Manual (MARSSIM), EPA 402-R-97-016, Rev. 1, August 2000. 1 ------- implementing the cleanup level should be compatible with the method used in establishing the cleanup level. Another important factor is the nature and extent of site assessment data. Therefore, consideration of the approach to implementing cleanup levels should be interwoven with decisions about sampling and risk assessment and should be addressed early in remedial action scoping and delineation sampling for excavation. This guidance first discusses general concepts important to understand when making decisions about how to implement surface soil cleanup levels at hazardous waste sites. Then, the document presents in detail two recommended options for implementing cleanup levels: not-to-exceed and area average. To help risk managers decide whether to implement cleanup levels as not-to-exceed levels or as area averages, this part of the guidance discusses these options with respect to their advantages, disadvantages, and appropriate use. The remainder of the guidance focuses on potential methods for implementing cleanup levels as area averages and presents three different statistical methods for calculating remediation action levels that ensure that post-remediation area average contaminant concentrations achieve cleanup levels. This emphasis on the area average approach is not intended to suggest that the area average is EPA's preferred approach, but rather to fully explain this new approach as an option for consideration. The purpose of this document is to present the options and discuss factors to consider in the decision to use a not- to-exceed approach or an area average approach. The document does not establish EPA policy on how to implement cleanup levels; such decisions will depend on many site-specific factors, some of which are discussed in this document. 2 ------- 2.0 GENERAL CONCEPTS Exposure unit. The application of cleanup levels at a site usually requires understanding the behavior of the receptors and how these receptors are exposed to contaminants across the geographic area of the site. A key concept is the exposure unit (EU). The exposure unit generally is the geographic area within which a receptor comes in contact with a contaminated medium during the exposure duration.5 The exposure unit should be defined based on the receptor, exposure medium, and nature of the receptor's contact with the medium. If the receptor is a resident exposed to soils in his/her yard, the area within which exposure occurs will likely be relatively small; exposure units for residential receptors are typically less than one acre in size. EPA's Soil Screening Guidance assumes a 0.5 acre source area for residential exposure.6 Other receptors, such as industrial workers and recreational adults may be exposed to contaminants across much larger areas at a site, and a much larger EU may be appropriate. Definition of the exposure unit is critical to the success of a cleanup strategy that involves applying cleanup levels as area averages, since the averages should be calculated for the exposure unit and should accurately reflect the average exposure of receptors in that area. Exposure point concentration. The exposure point concentration (EPC) normally is an essential element in evaluating exposure and risk at hazardous waste sites, both pre- and post-remediation. The EPC is defined in EPA's Risk Assessment Guidance for Superfund: Volume III - Part A as "the average chemical concentration to which receptors are exposed within an exposure unit."7 The spatial and temporal variability of contamination and the receptor's behavior and activity within the exposure unit can all influence the EPC. For "reasonable maximum exposure" (RME), the Risk Assessment Guidance for Superfund (RAGS) recommends using the average value with a specified level of confidence to represent "a reasonable estimate of the concentration likely to be contacted over time."8 This average value generally is based on the assumption that contact is spatially random. For risk assessments based on point estimates of the EPC, the EPC is usually calculated as the 95 percent upper confidence limit (UCL) for the mean. For lead, risk generally is estimated using the IEUBK Model which uses an arithmetic mean EPC. 5 U. S. EPA Office of Solid W aste and Emergency Response, Risk Assessment Guidance for Superfund: Volume III - Part A, Process for Conducting Probabilistic Risk Assessment, EPA 540-R-02-002, December 2001. 6 U. S. EPA Office of Emergency and Remedial Response. Soil Screening Guidance: Technical Background Document. EPA 540/F-95/128, July 1996. 7 U. S. EPA Office of Solid Waste and Emergency Response, Risk Assessment Guidance for Superfund: Volume III - Part A, Process for Conducting Probabilistic Risk Assessment, EPA 540-R-02-002, December 2001. 8 U. S. EPA Office of Emergency and Remedial Response, Risk Assessment Guidance for Superfund, Volume I - Human Health Evaluation Manual (Part A). Interim Final. EPA 1540/1-89/002, 1989. 3 ------- Guidance on the calculation of UCLs for EPCs at Superfund sites is provided in: Supplemental Guidance to RAGS: Calculating the Concentration Term, and Calculating Upper Confidence Limits for Exposure Point Concentrations at Hazardous Waste Sites.9 Preliminary remediation goal. Chemical-specific preliminary remediation goals (PRGs) are concentration goals for individual chemicals for specific medium and land use combinations. There are two general sources of chemical-specific PRGs: concentrations based on ARARs, and concentrations based on risk assessment.10 Cleanup level. Cleanup levels generally are based on PRGs and are refined by considering the cost and implementability of remedial alternatives, including the technical feasibility of achieving the risk-based PRG, and other criteria outlined in the National Contingency Plan (NCP). The cleanup level generally is a chemical-specific concentration chosen by the risk manager as appropriate for likely future land use based on the PRG and other practical considerations.11 In some cases, the cleanup level is the same as the PRG. Cleanup levels are documented in the Record of Decision (ROD). Decisions about whether to implement the cleanup level as a not-to-exceed level or as an area average depend to some extent on the degree of uncertainty in the protectiveness of the cleanup. This degree of uncertainty can be determined by many factors, including but not limited to the effectiveness and adequacy of site sampling, the exposure assumptions in the risk assessment, and the toxicity of the chemicals of concern. Remediation action level. The remediation action level (RAL) is a concept that goes hand-in-hand with the application of the cleanup level as an area average. The RAL in most cases is the maximum concentration that may be left in place within an exposure unit such that the average concentration (or 95% UCL of the average) within the EU is at or below the cleanup level.12 The RAL is generally determined statistically. Post-remediation sampling typically is needed to ensure that the appropriate cleanup level has been met as an area average. Nature and extent of contamination. Information about contaminant concentrations, contaminant toxicity, and the spatial and temporal variability of contamination in most cases is critical in determining how to implement cleanup levels at hazardous waste sites. Therefore adequate characterization of contamination at the site is essential. Adequate characterization depends on the spatial structure of contamination and on the sampling design. The number, location, volume, and depth of samples relative to the contamination affect the adequacy of sampling. As illustrated in 9 U. S. EPA Office of Solid Waste and Emergency Response, A Supplemental Guidance to RAGS: Calculating the Concentration Term, Publication 9285.7-081, 1992; and U.S. EPA Office of Emergency and Remedial Response, Calculating Upper Confidence Limits for Exposure Point Concentrations at Hazardous Waste Sites, OSWER 9285.6-10, December 2002. 10 U. S. EPA Office of Solid Waste and Emergency Respons q,Risk Assessment Guidance for Superfund: Volume III - Part A, Process for Conducting Probabilistic Risk Assessment, EPA 540-R-02-002, December 2001. "U.S. EPA Office of Solid Waste and Emergency Response, Risk Assessment Guidance for Superfund: Volume III -Part A, Process for Conducting Probabilistic Risk Assessment, EPA 540-R-02-002, December 2001. 12 U. S. EPA Office of Solid Waste and Emergency Response,Risk Assessment Guidance for Superfund: Volume III - Part A, Process for Conducting Probabilistic Risk Assessment, EPA 540-R-02-002, December 2001. 4 ------- Exhibit 1, extrapolating the results of a small number of samples to a large area can be misleading unless the contaminant distribution across the large area is uniform. Clearly, for areas with heterogeneous distribution of contamination (e.g., scattered or dumped), the more extensive the sampling data the more representative they will be of the exposure concentration. There are also situations where the depth of the sample relative to the vertical distribution of contamination at the site can be important in accurate representation of the exposure concentration. This is illustrated in Exhibit 2. For sampling data to accurately represent the exposure concentration, they should generally be representative of the contaminant populations at the same scales as the remediation decisions and the exposures on which those decisions are based. Uncertainty associated with sampling error can be very large, particularly at sites where there is significant spatial heterogeneity in contaminant concentrations.13 Where there is spatial heterogeneity, sampling variability can lead to erroneous cleanup decisions. Clearly, the more extensive the sampling data, the more accurately they will represent the exposure concentration. Thorough characterization of site contamination typically includes understanding of the following: Spatial variability. Spatial variability can be determined based on historical and physical information on contaminant release, fate, and transport, and on spatial plots of site data. Spatially correlated contamination typically exists when sampling data or other information reveal a structured pattern of highly contaminated areas surrounded by gradually decreasing concentrations. Within an exposure unit, if the data show that high contaminant concentrations occur randomly among samples with low concentrations, then the contamination is not normally considered to be spatially correlated. • Temporal variability. Temporal variability refers to contaminant conditions that may vary over time due to factors such as migration or natural degredation of contaminants in the environment. While the time frame for sampling should provide data that are representative of the time frame of potential exposure, site characterization should note any uncertainties due to mismatches between short timeframes for sampling and longer time frames for exposures. 13 Technical and Regulatory Guidance for the Triad Approach: A New Paradigm for Environmental Project Management, prepared by The Interstate Technology & Regulatory Council Sampling, Characterization and Monitoring Team, December 2003, provides more in-depth discussion of this issue. 5 ------- Exhibit 1 SAMPLING UNCERTAINTY - LATERAL HETEROGENEITY AND SAMPLE SIZE Uniform Concentration Distribution (area average = 20 ppm) Small Sample (Sample = 20 ppm) Large Sample (Sample = 20 ppm) Scattered High Concentrations (area average = 20 ppm) Small Sample (Sample = 10 ppm) Large Sample (Sample = 20 ppm) Dumped High Concentrations (area average = 20 ppm) Small Sample (Sample = 10 ppm) Large Sample (Sample = 20 ppm) Contaminant concentration O Sample 6 ------- Exhibit 2 SAMPLING UNCERTAINTY - VERTICAL HETEROGENEITY AND SAMPLE DEPTH Surface Contamination: Sample Concentration: 20 ppm 10 ppm • Acute and chronic toxicity. Contaminants present at hazardous waste sites may pose human health risks from short-term exposures, as well as from long-term exposures. Therefore contaminants need to be evaluated for their acute and chronic toxicity, and the toxicity generally should be matched to the exposure duration and frequency. Nature of exposure. Information on the spatial and temporal variability of contamination and on the behavior and activities of the receptor guides assumptions about exposure and is important in determining how to implement cleanup levels at hazardous waste sites.14 Specific characteristics of exposure may include: • Random and non-random exposure. Understanding how a receptor moves across an exposure unit and contacts contaminated soils is a key consideration in determining whether it is appropriate to apply a cleanup level as an area average. At most sites, it is reasonable to assume that random exposure occurs over the long-term. Short-term exposures, however, may be non-random. For example, a resident may move randomly across his/her property spending equal amounts of time in all areas over the long-term period of residence, but intense short-term exposure may occur as a result of a construction project, such as building a shed. Another example of intense short-term exposure would be a child playing in a sand box. Both types of exposure, together with chronic and acute toxicity of contaminants, should generally be addressed in developing a strategy for surface soil cleanup. 14EP A's Supplemental Guidance for Developing Soil Screening Levels for Superfund Sites, OSWER 9355.4-24, December 2002, describes various mechanisms of human exposure to surface soil. 7 ------- Size of exposure unit. Averaging site data for evaluating exposure and risk normally requires that the size of the exposure unit be appropriate for the receptor being considered. If the exposure unit is larger than the area throughout which the receptor is exposed, then the average for the area will not accurately reflect actual exposure. For example, an adult resident may have a larger area of exposure than a child. If the adult-sized exposure unit is used for the child, then a cleanup strategy based on an area average for the adult-sized exposure unit may not be protective for the child. Similarly, a worker at an outdoor commercial facility (e.g., garden center) may be exposed to a larger area than a customer. • Exposure duration. Similar to the size of the exposure unit, the exposure duration should be consistent with the time scale of receptor activities. In addition, the exposure duration determines which toxicity criteria are used. Appropriate acute or subchronic toxicity values should be used for short-term exposures, and chronic toxicity values should be used for long-term exposures. Conceptual site model (CSM). The application of cleanup levels at a site typically requires knowledge of the distribution of contaminants in soil and how that distribution is related to the sample data and to risks from potential exposure. The CSM is a comprehensive representation of the site that documents these current site conditions and relationships. It generally characterizes the distribution and variability of contaminant concentrations across the site based on knowledge of contaminant, release, fate, and transport mechanisms, and on sampling data. As shown in Exhibit 1, this information is critical to developing a sampling plan that generates data that accurately represent exposure concentrations. At sites with spatially correlated contamination (e.g., dumped high concentrations), statistical sampling is more powerful when data from different contaminant populations are separated. The CSM also identifies all potential exposure pathways, migration routes, and potential receptors.15 The CSM is initially developed from existing site data. It is a key component of the Remedial Investigation/Feasibility Study (RI/FS) and Data Quality Objectives (DQO) process, and it should be continually revised as new site investigations produce updated or more accurate information. 15 The process for developing a conceptual site model is described in: U.S. EPA Office of Solid Waste and Emergency Response, Supplemental Guidance for Developing Soil Screening Levels for Superfund Sites, OSWER 9355.4-24, December 2002, p. 4-7. 8 ------- 3.0 RECOMMENDED OPTIONS FOR IMPLEMENTING CLEANUP LEVELS This section presents two recommended options for risk managers to implement cleanup levels: (1) as not-to-exceed levels, or (2) as area averages. Implementing cleanup levels as area averages instead of not-to-exceed levels represents a less stringent and less costly option. Implementing cleanup levels as not-to-exceed levels may not be necessary in situations where receptor exposure activity is random but may still be the preferred option based on consideration of the nine criteria for remedy selection, and more specifically compliance with ARARs, the degree of uncertainty in the risk estimates, the adequacy of site characterization, the level of community acceptance, and other criteria discussed in this section.16 In determining what soils to remediate and to what levels, risk managers at hazardous waste sites may need to decide between remediating all soils to concentrations at or below the cleanup levels specified in the ROD, or leaving in place some soils with concentrations above the cleanup levels while ensuring that the estimated post-remediation EPC for a given exposure unit is below the cleanup level. These options can be characterized as implementing the cleanup level as a "not-to-exceed" value or as an "area average." Implementing the cleanup level as a not-to-exceed value normally means that soil removal or treatment will continue until the analysis of soil samples indicates that all soil with contaminant concentrations exceeding the cleanup level has been removed or treated. As shown in Exhibit 3, after remediation is complete, the highest concentration within the EU should be at the cleanup level, and the average or UCL concentration (i.e., post-remediation EPC) within the EU should be lower than the cleanup level. If the risk-based PRG is chosen by the risk manager as the cleanup level, then applying it as a not-to-exceed level should result in a post-remediation EPC that is below this protective level. The area average approach involves removing or treating the areas of the EU with the highest contaminant concentrations until the average or UCL concentration (i.e. post-remediation EPC) is at or below the cleanup level.17 (See Exhibit 3.) This approach normally requires establishing a cleanup level that is the desired post-remediation EPC, and making a statistical determination of a "remediation action level (RAL)," the level to which all contaminant concentrations in soil within an EU are reduced to ensure that the estimated post-remediation EPC for the EU is at or below the cleanup level. The RAL is itself a maximum concentration, or not-to-exceed level, for the purposes of site remediation. Remediating or removing all soil with contaminant concentrations above the 16 See Guide to Selecting Superfund Remedial Actions, OSWER Publication 9355.0-27FS, April 1990, for description of nine criteria. Compliance with ARARs is one of the nine criteria. Generally, it is not appropriate to use the area average approach if an ARAR is either designated as a not-to-exceed level or was developed based on factors other than average risk. 17 Because the area average approach assumes random exposure within the exposure unit, all areas (including clean areas) within the exposure unit should be included in the calculation. As discussed in Section 4.0, there are different methods for spatially correlated and non-spatially correlated data to address hot spots (i.e., areas with particularly high contaminant concentrations). 9 ------- RAL should enable risk managers to ensure that the estimated post-remediation EPC achieves the cleanup level. Exhibit 3 RECOMMENDED OPTIONS FOR IMPLEMENTING CLEANUP LEVELS Not-to-Exceed Approach Clean Fill Post- Remediation EPC Cleanup Level RAL Pre- Remediation EPC Area Average Approach Clean Fill Low ~ Soils not remediated ~ Soils remediated Pre- Remediation EPC Cleanup Level Post- Remediation EPC Soil Contaminant Concentration • RAL -> High Key issues in applying the area average approach include defining the EU over which to average concentrations, and determining what locations to remediate in order to reduce the EU EPC to a level at or below the cleanup level. In addition, it is important to consider the possibility of acute or subchronic health effects from short-term exposure to contaminants at concentrations above the cleanup level. Because soils with contaminant concentrations exceeding the cleanup level will be left onsite, it is important to ensure that those concentrations are not so high that they pose acute or subchronic health risks if exposure to them occurs. Therefore, if this approach is used, the RPM should conduct a separate assessment of potential acute effects to determine the contaminant concentration at which acute effects are likely to occur. The RAL should be below that concentration to ensure protection against acute effects. If acute toxicity data are insufficient to either determine whether the RAL is protective for acute effects or to establish an alternative protective level, then the area average approach should not be used. In general, the method used to determine the RAL should be compatible with the method used to determine the EPC. To the extent that the RAL is designed to ensure that the average post-remediation concentration (i.e., post-remediation EPC) is at or below the cleanup level, calculation of the post-remediation EPC should be consistent with calculation of the pre-remediation 10 ------- EPC, in terms of the averaging basis, EU size, and contaminant distribution. However, in some instances the methods may be different. For example, the distribution of the contaminant concentration may be different pre- and post-remediation, or the EU size may be different if the land use is changing. When deciding to apply the cleanup level as a not-to-exceed level or as an area average, a variety of factors come into play, including: Exposure. Using the area average approach is based on the assumption of random exposure. If exposure is not random across the EU, but rather receptors spend more time in areas of high concentration, then remediating soils such that the average post-remediation concentration achieves the cleanup level may not be protective of the receptor with non-random exposure. The exposure assumptions made in determining which approach to use should be consistent with the exposure assumptions used in the risk assessment and the determination of the EPC. • Size of exposure unit and adequacy of sampling. If the size of the exposure unit is different pre- and post-remediation due to change in the land use, the sampling may not be appropriate to support implementation of either approach. For example, if the sampling was designed for large, non- residential, pre-remediation exposure units, and future residential land use requires smaller exposure units, the sampling may not be extensive enough to adequately characterize exposures within the smaller unit. In this case, it may be necessary to do more extensive sampling so that the sample accurately represents the post-remediation exposure. Toxicity. If the cleanup level is based on acute exposure, it should be implemented as a not-to-exceed level, because any short-term exposure exceeding the cleanup level could cause adverse health effects. Even if the cleanup level is based on chronic exposure, care should be taken in implementing it as an area average based on an RAL, because it is important to ensure that the RAL (which may be significantly higher than the cleanup level) is protective of acute effects. For example, a single instance of a child ingesting a handful of soil containing malathion could cause acutely toxic effects in the child. Therefore the RAL must be low enough to be protective against acute effects of single and short-term exposures, as well as chronic effects of long-term exposure. At present EPA does not have acute toxicity criteria, therefore consultation with a toxicologist may be necessary to determine if the RAL is sufficiently protective for acute effects. Risk managers can seek assistance in identifying appropriate toxicity values from EPA's regional toxicologists and risk assessors. In addition, the Agency for Toxic Substances and Disease Registry (ATSDR) publishes Minimal Risk Levels (MRLs) that may, in some cases, be suitable for use as acute toxicity 11 ------- values.18 In the absence of sufficient acute toxicity data, the not-to-exceed approach should be used if acute exposures are likely. Confidence in theprotectiveness of the cleanup level Uncertainty about the protectiveness of the cleanup level may indicate that it is most appropriately implemented as a not-to-exceed level. If the cleanup level is the risk-based PRG calculated as the 95 percent UCL of the average post-remediation contaminant concentration, there is less than a five percent chance that average exposure at that level will pose significant risk. Therefore in instances where there is adequate data coverage and exposure units are well defined an area average approach may be appropriate. However, if the cleanup level is less "conservative" because it is not based on an average post-remediation EPC, or is significantly higher than the PRG because of practical or technical considerations (e.g., cost, implementability), then it may not be protective if implemented as an area average. ARARs. Federal and state requirements for cleanup must also be considered to ensure compliance. If the cleanup level is an ARAR that the state or tribe designates as a not-to-exceed level, or an ARAR that was developed based on factors other than average risk, then it may not be appropriate to implement the cleanup level as an area average. Quality and quantity of site characterization data. Confidence in the degree to which contaminant concentration data accurately represent soils that the receptor contacts over time may influence the decision to use the area average approach. The discussion on page 5 and a report prepared by The Interstate Technology & Regulatory Council Sampling, Characterization and Monitoring Team19 emphasize the importance of adequate sampling. Site managers should also refer to Data Quality Objectives Process for Hazardous Waste Site Investigations for guidance on what constitutes high quality site characterization data.20 In addition the Guidance for Data Usability in Risk Assessment (Part A and Part B) may be helpful. Site characterization data should meet the Data Quality Objectives described in 18 ATSDR MRLs were developed in response to a CERCLA mandate and represent the highest exposure levels that would not lead to the development of non-cancer health effects in humans based on acute (1-14 days), subchronic (15-364 days), and chronic (365 days and longer) exposures via oral and inhalation pathways. MRLs are based on non- cancer health effects only. MRLs are available from ATSDR's website, http://atsdrl ,atsdr.cdc.gov:8080/mrls.html. 19 Technical and Regulatory Guidance for the Triad Approach: A New Paradigm for Environmental Project Management, prepared by The Interstate Technology & Regulatory Council Sampling, Characterization and Monitoring Team, December 2003. 20 U.S. EPA, Data Quality Objectives Process for Hazardous Waste Site Investigations, EPA QA/G-4HW, EPA/600/R-00/007, January 2000; U.S. EPA, Guidance for Data Usability in Risk Assessment (Part A), EPA 9285.7- 09A, April 1992; and U.S. EPA, Guidance for Data Usability inRisk Assessment (PartB), EPA 9285.7-09B, May 1992. 12 ------- the guidance. However, if site characterization or sampling data are insufficient to provide confidence in the use of the area average method, then the cleanup level should be implemented as a not-to-exceed level because it generally provides more certainty about the protectiveness of the cleanup. The area average approach is specifically intended for situations where adequate site characterization data are available. Applications of area average methods to sites with limited or incomplete data are inappropriate. However, if the quality of site characterization data is the only factor limiting the use of the area average approach, it may be more cost-effective to spend more on sampling to improve the quality of the data before deciding to implement the cleanup level as a not-to-exceed level where the area average approach could save on remediation costs.21 Cost-effectiveness. In considering implementation of the cleanup level as an area average, it is important to identify the associated costs and cost savings. The area average approach is likely to be more cost-effective for removal remedies than for in situ treatment remedies. The marginal cost of additional excavation and disposal of soils with concentrations between the cleanup level and RAL usually will be relatively high compared to the marginal cost of additional treatment to reduce soil concentrations to the cleanup level. In addition, the area average approach may require more extensive site characterization (as described above) and statistical analysis. These additional costs may be offset by remediation cost savings due to the less extensive cleanup required. Risk managers should consider these trade-offs to determine the most cost-effective approach. Community acceptance. Community acceptance is an important consideration in using the area average approach. Because an area average cleanup will leave in place some soils with contaminant concentrations that are above the cleanup levels set forth in the ROD, the community may not be confident in the remedy's protectiveness. The community will likely be more comfortable knowing that all soils achieve the ROD-specified cleanup level. Statistical expertise. The area average approach requires statistical expertise. As discussed in Section 4.0, various methods requiring varying degrees of expertise are available. The iterative truncation method requires a basic understanding of area averaging, and most risk managers should be able to implement it without specialized expertise. The confidence response goal method is somewhat more sophisticated, requiring some statistical training to understand, but it is formulaic and can be automated. The geostatistical method requires highly specialized statistical expertise and usually will have 21 Determining the point at which increased sampling offsets remediation cost savings will depend largely on the sampling costs and marginal remediation costs which are very site-specific. 13 ------- significant resource implications. The availability of resources or statistical support should be considered in deciding on an appropriate approach and in project scoping. Taking these and other appropriate factors into consideration, the risk manager should decide whether it is appropriate to implement the cleanup level as a not-to-exceed level or as an area average. In instances where there is adequate data coverage and the exposure units are well defined, an area average approach may be appropriate. Specific situations in which this approach would not be appropriate are when: Exposure within the EU is non-random; The cleanup level is based on acute exposure; The cleanup level is not derived from a UCL of average post-remediation concentrations (e.g., it considers factors other than risk); The quality of site characterization data is not optimal, and it is not worth investing in additional sampling; Given the site conditions (complexity, size, characterization, contaminant distribution), it is not cost-effective to do the necessary sampling and statistical analysis; The community most likely will not accept leaving soil onsite with contaminant concentrations that exceed the cleanup level. 14 ------- 4.0 METHODS FOR IMPLEMENTING CLEANUP LEVELS AS AREA AVERAGES Site remediation typically involves making cleanup decisions for large volumes of contaminated soil based on data from many, very small soil samples. Such decisions are made with the help of statistical methods. Simple, classical statistical approaches assume that there is no spatial correlation between contaminant concentrations within an EU, and they rely on the mean and standard deviation of the sample data to accurately represent the exposure concentration in the EU. More sophisticated, geostatistical approaches are designed to estimate the spatial distribution of contaminant concentrations in the EU based on sample data, and in this way explicitly address differences in scale between sample data and cleanup decisions. Both simple and geostatistical methods generally are available for calculating RALs when implementing cleanup levels as area averages. There is a growing recognition that geostatistical methods are generally more appropriate because contamination at most sites has spatially correlated distribution patterns and more than one contaminant population. Geostatistical methods are generally more rigorous and account for the relationship between contaminant concentrations and the size, location, and geography of the EU. Simpler, classical statistical approaches may be applicable in some situations where contaminant concentrations are not spatially structured (i.e., have a uniform distribution) or where areas of an exposure unit can be delineated into subsections within which contamination is likely to be homogeneous. The appropriate method depends in large part on the distribution of contaminants at the site. Non-spatial techniques can be used when there is no spatial correlation between contaminant concentrations at the exposure unit level. A good example is a landfill site where waste was disposed across a large area, a process resulting in small randomly-located spots of high contaminant concentrations interspersed within areas of lower concentrations. On the other hand, spatial techniques, or geostatistics, normally are used when there is a spatial correlation between contaminant concentrations. For example, a site where the major source of contamination is a unlined liquid waste storage lagoon should have very high contaminant concentrations in and around the lagoon and lower concentrations at increasing distances from the lagoon. The contaminant distribution pattern at a site can be determined based on historical and physical information and spatial plots of site data. If the data show that contaminant concentrations are homogeneous across the exposure unit, or high contaminant concentrations occur randomly among samples with low concentrations, then statistical determination of RALs can be performed using simple, non-spatial techniques. If, on the other hand, sampling data reveal a structured pattern with highly contaminated areas surrounded by gradually decreasing concentrations, then statistical determination of RALs using spatial techniques may be more appropriate. If the exposure pattern is a mixture of isolated hot spots and areas with lower random concentrations, then spatial techniques or a combination of spatial and non-spatial techniques should be used. Non-spatial techniques typically allow rapid calculation of RALs by assuming that contaminant concentrations are devoid of any spatial structure or correlation, and that the sampling is unbiased and accurately represents exposure concentrations. Non-spatial techniques are based on the mean and standard deviation of the sample contaminant concentration data and how those metrics change as soils with high contaminant concentrations are replaced with clean fill during 15 ------- remediation. Calculated RALs will differ for each EU depending on the mean, standard deviation, and distribution of contaminant concentration data. Even if two EUs have the same average concentration, the RALs will differ if the variances and distributions are different. Non-spatial statistical computations yield exposure concentrations that are independent of the size, location or geometry of the EU. If the sample soil concentrations display spatial structures or correlations, or if the samples do not accurately represent exposure or are collected in a biased way (e.g., over- sampling of areas thought to have high concentrations), then application of non-spatial statistical techniques should result in unreliable RALs. RPMs should avoid this pitfall and ensure successful implementation of these approaches by using thorough site characterization based on sufficient sample sizes and unbiased sampling. Spatial techniques, on the other hand, use available contaminant concentration data to model the spatial correlation underlying the measured values, thus directly accounting for differences in the different scales of sample data and EUs in the cleanup decision process. EU grids are placed over a map of the area of interest and, using an averaging geostatistical technique called block kriging, average EU concentrations are determined for pre- and post-remediation conditions. Currently, block kriging is the only commonly used procedure that is capable of estimating area averages based on correlated point measurements. An alternative geographic approach is currently being developed to accommodate spatially correlated contamination and match the sample representativeness to the scale of the remediation decision. The approach involves dividing EUs into discrete remedial subsections that reflect different contaminant populations and within which contamination is homogeneous.22 This general approach has been used at at least one site.23 It requires extensive sampling and Monte Carlo simulation to adequately represent each remediation subsection, and its advantages over geostatistics are not well established. As is the case for non-spatial techniques, when using geostatistical techniques, RPMs should use good sample data and site characterization in order to apply the techniques successfully.24 An added consideration, however, can be the relatively high analytic costs of applying geostatistical techniques. They usually require highly specialized statistical expertise, and are time- and resource-intensive. If this approach is a likely option, time and resource issues should be addressed in the project scoping. 22 Evan Englund, U.S. EPA, National Exposure Research Laboratory, March 2005, personal communication. 23 Ryti explored the relationship of the statistical properties of exposure concentrations and the EU geometry as part of the remedial design of the Piazza Road dioxin site (an EPA Superfund site in Missouri). Ryti divided EUs into remedial subunits, referred to as "cells." Using a Monte Carlo simulation approach, soil contaminant concentrations for each cell were estimated. During each simulation round, the cell with highest simulated concentration was selected. If the EU containing the selected cell had an average concentration in excess of the cleanup level, the cell was targeted for cleanup. This process was repeated until the cell with the next highest simulated concentration had a value less than the cleanup level. The spatial statistical characteristics of correlated/structured data demonstrated by Ryti are formalized in geostatistical block kriging techniques. Ryti, Randall T., Superfund Soil Cleanup: Developing the Piazza Road Remedial Design, Journal of Air and Waste Management, 43:197-202, February 1993. 24 Composite samples cannot be used for these methods because they do not represent contaminant concentrations at specific locations. 16 ------- In this section, we present three statistical methods that have been suggested for use in implementing cleanup levels as area averages — iterative truncation, confidence response goal (CRG), and geostatistics. They reflect the spectrum of simple to complex. The two at the simple end of the spectrum, the iterative truncation and CRG methods, are non-spatial and have limited applicability. The most complex, geostatistical method, is spatial and more widely applicable because most sites exhibit spatial structure to contamination. All three methods focus on calculating RALs that should ensure that the average soil contaminant concentration within an EU after remediation is at or below the cleanup level. Exhibit 4 summarizes the pros and cons of the three approaches which are discussed in detail below. Iterative Truncation Method As discussed for purposes of this guidance, the iterative truncation method is a simplistic approach to calculating the RAL for surface soil cleanup. It is based on the identification and removal of soils with high contaminant concentrations to lower estimated post-remediation EPCs to levels at or below the cleanup levels. Iterative truncation is used for non-spatial data, it assumes that each sample is an uncorrected, unbiased representation of a remediation area within the site or EU. As indicated, iterative truncation involves removing (truncating) high values in the sample concentration measurements and calculating a hypothetical post-remediation EPC. For this reason, it is inappropriate to use composite samples. Each iteration entails replacing the next highest value with the concentration of clean fill.25 With each iteration, a new EPC is calculated and compared to the cleanup level. At the end of the process when the estimated post-remediation EPC is at or below the cleanup level, an RAL that will achieve that EPC is specified. In practical terms, the steps involved in iterative truncation are as follows: Order the sampling data from lowest to highest concentration. Starting with the highest concentration, remove a sample and replace it with the post-remediation concentration (e.g., concentration found in clean fill). Recalculate the post-remediation EPC for the new data set and compare the resulting EPC to the cleanup level. If the EPC is higher than the cleanup level, repeat the process iteratively until the EPC is less than or equal to the cleanup level. 25 To ensure that the concentration in clean fill is achieved during remediation, the concentration for clean fill should either be based on data from likely fill material or it should be tested during construction to ensure that it does not exceed the estimated value. 17 ------- Exhibit 4 SUGGESTED STATISTICAL METHODS FOR IMPLEMENTING CLEANUP LEVELS AS AREA AVERAGES NON-SPATIAL Iterative Truncation Method Pros: Simple; no statistical expertise needed. Cons: Very sensitive to highest contaminant concentrations in the sample; if the highest sample concentrations are not representative of the highest concentrations at the site, the resulting RAL may not be protective. Cautions: Inappropriate for use with composite data. Inappropriate for use with spatially correlated data. If sampling data are biased such that higher concentration areas are over-sampled, the resulting RAL will be unnecessarily low. Confidence Response Goal Method Pros: Less sensitive than iterative truncation to the representativeness of the highest sample concentrations. Accounts for different statistical distributions of contaminant concentration data. Cons: May entail some statistical expertise. Difficult to communicate results to public, due to mathematical complexity. Cautions: Inappropriate for use with composite data. Inappropriate for use with spatially correlated data. If sampling data are biased such that high concentration areas are over-sampled, the resulting RAL will be unnecessarily low. SPATIAL Geostatistical Method Pros: Can be used with spatially correlated data. Can be used with biased sample data (e.g., over-sampling of hot spots). Cons: May entail geostatistical expertise and specialized software. More costly and time consuming than non-spatial methods. Cautions: Consider the value of the information gained from geostatistical approach to ensure that the anticipated benefits justify the costs. 18 ------- When the calculated post-remediation EPC in the data set is less than or equal to the cleanup level, the highest sample concentration remaining in the data set is designated the RAL. During each iteration, removal and replacement of selected concentrations impacts the distribution of the remaining data. Such changes may require reevaluation of the distribution of the resulting data set at each iteration. These iterative reevaluations and EPC calculations can be performed efficiently by utilizing ProUCL.26 Schulz and Griffin applied the iterative truncation method to actual site data for a residential exposure unit at western mining site contaminated with arsenic. Applying iterative truncation to the data set of 36 soil samples resulted in an RAL which indicated remediation of three areas. If the cleanup goal were applied as a not-to-exceed level, 16 of the 36 areas would require remediation.27 Generally, the iterative truncation method will fail to produce an adequate RAL for cleanup if site characterization is incomplete. Because this method is solely reliant on the sampling data, it is sensitive to the highest contaminant concentrations in the sample. If the highest sample concentrations are not representative of the highest concentrations in the EU and there are actually areas with higher concentrations, then the resulting RAL may not be protective. To examine the sensitivity of the method to representative sampling, Schulz and Griffin applied the iterative truncation method to sampling data from two sites. They determined the RAL using the sampling data and then increased the two highest sample concentrations by 50 percent and recalculated the RAL. The recalculated RAL was sufficiently lower than that based on the actual sampling data, as to require remediation at an additional location.28 Obviously, the more complete and representative the sampling data, particularly at the high concentrations, the more reliable this method. Some sampling plans may be biased towards areas of known or suspected contamination. Applying the iterative truncation approach to such data sets will result in unnecessarily low RALs. The iterative truncation method uses the sample data to calculate the post-remediation EPC, as high concentration samples are removed. Because the low concentration areas are not adequately represented in biased data, applying the method would result in unrealistically high EPCs, as samples are iteratively removed. Therefore to get the average at or below the cleanup level would require an unnecessarily low RAL, and the advantages of area averaging would not be realized. 26 Singh, A., A.K. Singh, R.W. Maichle, ProUCL Version 3.0, User Guide, EPA, April 2004. (http://www. epa.gov/nerlesd 1 /tsc/images/proucl3 apr04.pdf) 27 Schulz, T.W. and S. Griffin, Practical Methods for Meeting Remediation Goals at Hazardous Waste Sites, Risk Analysis, Vol. 21, No. 1, 2001. 28 IBID. 19 ------- The iterative truncation method is also sensitive to the variability of sample concentrations. If the variability is high and the number of samples small, the RAL resulting from this approach may be over-protective. Since the RAL is a sample concentration (i.e, the highest sample concentration next to the one that gives an EPC less than or equal to the cleanup level when replaced with the concentration of clean fill), gaps in the data may force the RAL lower than necessary to achieve the cleanup level. To use this method with confidence, it is important to have good site characterization based on extensive, unbiased, and representative sampling, and the resulting data should adequately represent random, long-term exposure to receptors. Due to these limitations, it is important to consider the following criteria in determining whether iterative truncation is an appropriate method to use. Sample size is sufficient. Small sample sizes translate to large uncertainty in estimates of post-remediation EPCs and resulting RALs. • Sampling design yields a representative distribution of measurements within the EU. Simple random sampling may fail to represent a patchy distribution of contaminants. Similarly, over-sampling high concentration areas may fail to represent random movement of receptors. • Assumptions about post-remediation distribution of concentrations are reasonable. If these assumptions are shown to be incorrect by post-remediation sampling, the process for developing RALs may need to be repeated and additional remediation may be required. Confidence Response Goal Method Bowers, Shifrin and Murphy developed a method for calculating what they term the "confidence response goal" (CRG) which under this guidance is the same for all practical purposes as an RAL, that is, a not-to-exceed level for EU remediation that will ensure that the area average for the EU is at or below the cleanup level.29 The basic premise of this method is that the CRG can be expressed as a function of the mean and standard deviation of contaminant concentration, and the cleanup level. The average post-remediation concentration is determined from the pre-remediation distribution that is truncated at the CRG, and a second superimposed distribution that represents the concentration of contaminant in clean fill. The average concentration of the post-remediation distribution is a weighted average of the portion of the pre-remediation distribution with concentrations below the CRG and the concentration of the clean fill which replaces all pre-remediation concentrations that exceed the CRG. The original method was based on the assumption that the sample data are derived from a spatially uncorrected lognormal distribution and accurately represent the distribution of contamination in the EU. Subsequent investigations by 29 Bowers, T.S., N.S. Shifrin, and B.L. Murphy, Statistical Approaches to Meeting Soil Cleanup Goals, Environmental Science and Technology, 30, 1437-1444, 1996. 20 ------- Singh and Singh indicated that although CRG method is a reasonable procedure, its application based on the assumption of lognormality of contaminant concentrations can result in RALs that are not adequately protective. Specifically, Singh and Singh simulated sampling from a lognormal distribution of contamination and found that the process yields very high RALs such that no cleanup is recommended, even when the true average concentration exceeds the cleanup level.23 Schultz etal. demonstrated that the CRG method yields reliable results if certain other types of distributions are used. For this purpose, they developed equations for computing the CRG (c*) for normal, exponential, and uniform continuous distributions. The equations for normal and exponential distributions are presented below.24 Normal distribution: CUG =co+(M-co)® {^—^) <7 <7 where, CUG = cleanup goal (same as the cleanup level) ca = contaminant concentration in clean fill c* = CRG (same as RAL) H = mean contaminant concentration a = standard deviation of contaminant concentrations 0(.) = standard normal cumulative density function = standard normal density function Exponential distribution: CUG =cne~e -c-6eT +6 where, CUG = cleanup goal (same as the cleanup level) ca = contaminant concentration in clean fill c* = CRG (same as RAL) 6 = mean contaminant concentration used in the exponential model 23 Singh, A.K., and A. Singh, Review of the Confidence Removal Goal Approach for Site Cleanup, Technology Support Center, U.S. EPA, Las Vegas, Nevada, 1996. 24 Schultz, B., A.K. Singh, A. Singh, An Evaluation of the Confidence Response Goal Approach for Making Remediation Decisions at Superfund Sites,Environmetries, 13 725-732, 2002. 21 ------- These equations can be used for determining the CRG (c*). Site assessors can readily solve these equations in a spreadsheet calculation or compiler language. Schultz etal. also developed a CRG equation for the uniform continuous distribution. A similar CRG equation can be developed for the gamma distribution which has proved to be a useful model for skewed data sets.25 While the potential remediation cost savings associated with the use of the CRG method are compelling, risk managers should be aware of the need for statistical expertise in applying the method correctly and the difficulty in communicating the results to the public and gaining community acceptance. Communities typically are concerned that the RAL be protective, and may find the explanation of the CRG method difficult to understand. The mathematical complexity of the CRG method may be a detriment. The difficulty of community acceptance is illustrated by a site that was planned to be redeveloped as an urban park. In this case, there were zoning restrictions against industrial and commercial use, and the property was zoned for residential use. However, the future use of the proposed park was not specified (e.g., passive green space, a botanical garden, athletic fields). The RAL determined using the CRG method suggested considerable savings in cleanup costs compared to implementing the cleanup level as a not-to-exceed value. Citizens favoring the park's use as athletic fields turned out in force at the proposed plan public meeting and voiced their objections to the remedial action level calculated using the CRG method. These citizens were opposed to any implementation of the cleanup level as an area average because receptors could potentially have intense direct contact with the soil over a small exposure unit (e.g., football field). The citizens were very distrustful of the scientists' explanation of the highly statistical CRG method. The lack of community acceptance of the CRG approach forced the remedial proj ect manager to implement the PRG as a not-to-exceed level. This example illustrates the importance of making sure that the implementation of the cleanup level is appropriate to the potential exposures, and that the community is educated from the outset regarding plans for developing cleanup levels and implementing them as area averages. EPA's guide to Superfund community involvement provides suggestions for presenting technical material to communities.26 Geostatistical Methods While non-spatial methods assume that the contaminant concentrations within an EU are uncorrected, at many sites contaminant concentrations reveal clear spatial patterns, where highly impacted zones are surrounded by marginally impacted areas with gradually decreasing contaminant concentrations. Geostatistical techniques are statistical procedures designed to process spatially 25 Singh, A., A.K. Singh, and R.J. Iaci, Estimation of the Exposure Point Concentration Term Using a Gamma Distribution, Technology Support Center Issue, EPA/600/R-02/084, EPA Technology Support Center for Monitoring and Site Characterization, Las Vegas, Nevada, October 2002. 26U.S. EPA, Superfund Community Involvement Handbook, EPA 540-K-01-003, April 2002. http://www.epa. gov/Superfund/action/community involvement.htm. 22 ------- correlated data. The presence of spatially correlated data is quite common at hazardous waste sites because of structured patterns in the distribution of contamination. For example, sites impacted by migration of contaminants from a concentrated localized source, such as an unlined lagoon for liquid waste storage, might exhibit spatial patterns of contamination. Contaminant concentrations in and around the lagoon may be higher than those at greater distances from the lagoon. Spatial correlation also arises because sampling data are sometimes collected in a biased fashion and may be clustered around hot spots of contamination. In such areas, soil samples can be influenced by the same phenomena and are therefore not independent. This type of spatially correlated data is suitable for geostatistical analyses. While geostatistics can accommodate biased data, excessive bias can provide misleading results. For example, geostatistical interpolation would over-estimate the extent of the impacted area in a situation where a highly sampled hot spot is surrounded by unsampled non- impacted areas. Exhibit 5 displays an example of a structured pattern of contamination. At this site, soil concentrations display a well defined spatial distribution consistent with the release pattern of the original site contaminants. Under such conditions, cleanup computations can be performed using spatial geostatistical techniques that take into account the size and location of the targeted EU. The computational steps involved in determining RALs for soil cleanup using geostatistical techniques are outlined below. These steps, as depicted in Exhibit 6, are computationally intensive and require the use of specialized software. (Appendix A provides an overview of available software.) • Step 1: Iso-concentration Mapping. This step involves an initial analysis of available contaminant concentration data to determine whether a transformation of data is warranted, whether segregation of site data into separate populations should be pursued, or whether anomalous values should be eliminated.27 After any necessary manipulation of the data, the resulting data set is subject to variogram analysis.28 This is a process for computing and modeling the spatial correlation underlying the measured values. A function is fit to the sample variograms to determine the variogram model. This involves examining all pairs of measured values. For each pair, the difference between the measured values, the angular direction between them, and their separation distance are computed. Pairs along a selected direction with similar separation distances are grouped, and for each group, the sample variogram is computed as one-half the mean of the pairs' squared differences. In order to ensure the reliability of the selected variogram model, the 27 Use of log transformed data in block kriging normally would yield results that do not correspond to the block averages. Therefore, log transformation of data is not recommended unless the geostatistical software used is capable of producing the correct back-transformed block averages. 28 Variogram analysis is an assessment of spatial correlation. Englund and Sparks (1988) define the variogram as a plot of the variance (one-half the mean squared difference) of paired sample measurements as a function of the distances (and optionally of the direction) between samples. 23 ------- computation process should be conducted consistent with the applicable ASTM standard guides listed in Appendix A. Iso-concentrations maps based on the variogram model are then generated to depict the extent of • • 9Q contamination. Step 2: EPC Computation. At this stage, a grid of EUs is superimposed on the site map and average contaminant concentrations (EPCs) are estimated for each EU, as depicted in Exhibit 6. The geostatistical EPC computational process is called block kriging and is described in detail in Appendix B. For sites with well defined hot spots, care must be taken to position the EU grids such that individual EUs cover hot spots without undue mixing with surrounding, less-impacted areas. Those EUs where EPCs are higher than cleanup levels are referred to as the exceeding EUs. • Step 3: RAL Determination. In this step, the iso-concentration map is used to identify zones that must be remediated in order to reduce the EPCs of the exceeding EUs to levels less than or equal to cleanup levels. This is an iterative process. During each iteration, points with estimated values in excess of a given cutoff concentration are replaced with concentrations equal to post-remedial conditions. Upon replacement, EPCs of exceeding EUs are re-computed. If there are still exceeding EUs, the cutoff concentration is decreased iteratively until full compliance is attained. The final cutoff concentration is defined as the RAL. • Step 4: Cleanup Extent. Once the RAL is determined, the original iso-concentration map is used to define zones with concentrations in excess of the RAL. The contoured zone is the area that requires remediation. In many instances, existing data can also be used as part of the post-remediation confirmatory analyses. It is important to note that geostatistical techniques are not a substitute for collecting sample data; the reliability of the results depends on adequate sampling data. The adequacy of the data can be evaluated by examining the kriging standard deviations for each estimate. Insufficient sampling is particularly evident when the kriged values are low and their standards deviations are high. In such cases, additional sampling generally will be necessary. 29 In case of a "not-to-exceed" cleanup criterion, the iso-concentration map can be used to delineate the extent of the area requiring remediation. 24 ------- Exhibit 5 EXAMPLE OF STRUCTURED SOIL CONTAMINATION 99 Contaminant Concentration in top 11 (in mg/kg) 25 ------- Exhibit 6 SAMPLE COMPUTATIONAL STEPS FOR CLEANUP OF STRUCTURED SOIL CONTAMINATION Step 1. ISO-concentration Mapping • Exploratory Data Analysis • Variogram Analysis • Point Kriging/Contouring Step 2. EPC Computation • Define Exposure Unit Grid • Block Kriging to Compute EPC for each EU block • Identify Exceeding EU blocks Step 3. Determining RAL • RAL = concentration that, if removed, would bring all EU blocks into compliance Step 4. EPC Confirmation • Confirmatory sampling and/or EPC Recomputation • All EU blocks in compliance ralL^/\ 9 a 26 ------- Despite the broad applicability of geostatistical techniques, their computationally-intensive, time-consuming nature may limit their practical use. In general, geostatistical techniques are not recommended for cases where the following conditions exist: Site historical and physical information, or pre-remediation plots of contaminant concentrations indicate the presence of a random, non-spatial contamination pattern; Use of non-spatial statistical methods that ignore the spatial correlation and biased nature of data would not lead to overly-conservative, cost-prohibitive cleanups; Adequate technical/computational resources are not available; or The anticipated benefits from geostatistical analysis do not justify the costs. Example Application To illustrate the potential application of geostatistical methods to determine and apply RALs, we present a case study of a site with a structured pattern of soil contamination (previously shown in Exhibit 5). • Step 1: Iso-concentration Mapping. Exhibit 7 displays the cumulative probability distribution of soil concentrations of the site contaminant of concern. This distribution exhibits multiple population characteristics due to the presence of a cluster of elevated samples. The variograms for the complete data set, as well as for the segregated data set without clustered measurements above 5,000 ppm, are shown in Exhibit 8. As shown in Exhibit 8A, the inclusion of the elevated measurements masks the true underlying correlation of the site data. Therefore, the elevated cluster is segregated in the variogram analysis in order to clearly decipher and model the underlying spatial correlation (Exhibit 8B). The variogram model is used to estimate contaminant concentrations at locations throughout the EU. These estimated concentrations are computed as weighted averages of nearby measured values. This estimation process is directly dependent on the variogram model, which is used to attain minimum variance and eliminate bias. The resulting values, together with data on elevated cluster, are used to generate the iso-concentration map, as displayed in Exhibit 9. 27 ------- Exhibit 7 SAMPLE PROBABILITY PLOT OF SOIL CONTAMINANT OF CONCERN 41400 27600 *5 in 0) 04 13800 0 Cumulative Percent Normal Probability Plot Data Theme: Lead in Top 1' (in mg/kg) s s sS s s yys s # 10 30 50 70 90 99 Statistics Mean: 1015.66 Median: 144.5 Variance: 8.576E+06 Skewness: 8.86674 28 ------- Exhibit 8 SAMPLE VARIOGRAMS OF SOIL CONTAMINANT OF CONCERN Step 2: EPC Computation. Using the available soil contaminant concentration data, the selected model variogram, and the cleanup level, exceeding EU cells are identified and highlighted in Exhibit 10. This example assumes a residential scenario, where the EPC is computed as average contaminant concentrations over half acre EUs. For purposes of this example, the cleanup level is 400 ppm. Exhibit 8 displays the EU grid, showing exceeding EUs with bold (red) outlines. Step 3: RAL Determination. The RAL is determined iteratively. In these iterations, the post-remedial concentrations are defined as having a 5 ppm concentration with a 1 ppm standard deviation, which is assumed to be the background concentration of local soil for the investigated contaminant of concern. The RAL is calculated to be 750 ppm. Step 4: Cleanup Extent. Upon determination of the RAL, the extent of zones that require remediation for each scenario is depicted using the iso-concentration map of the contaminant of concern (Exhibit 11). 29 ------- Exhibit 9 SAMPLE ISO-CONCENTRATION MAP OF SOIL CONTAMINANT OF CONCERN I 1 1392 -4000 ~ 4000 - 100000 I No Data 30 ------- Exhibit 10 SAMPLE EXPOSURE UNIT GRID Kriged Contaminant r~i < 100 I | 100-400 I I 400 - 1392 I 1 1392 -4000 ^4000- 1 00000 I No Data 31 ------- Exhibit 11 SAMPLE RAL DELINEATION FOR SOIL CONTAMINANT OF CONCERN 32 ------- 5.0 APPENDIX A: GEOSTATISTICAL REFERENCE SOURCES AND SOFTWARE For detailed explanation of geostatistical techniques and applications, readers are referred toMatheron(1971); Journel andHuijbregts (1978), Isaaksand Srivastava(1989); and ASCE (1990a and 1990b). The American Society of Testing and Material (ASTM) provides a series of standard guides to perform geostatistical analyses. These guides are: ASTM, Standard Guide for Content of Geostatistical Site Investigations, D5549-94, 1994. ASTM, Standard Guide for Analysis of Spatial Variations in Geostatistical Site Investigations, D5522-96, 1996. ASTM, Standard Guide for Selection of Kriging Methods in Geostatistical Site Investigations, D5523-96, 1996. ASTM, Standard Guide for Selection for Simulation Approaches in Geostatistical Site Investigations, D5524-96, 1994. As noted, the use of geostatistics requires specially designed software. EPA has taken the lead in promotion of geostatistics by producing the first public-domain software package, known as GEO-EAS (Geostatistical Environment Assessment Software), developed by Englund and Sparks (Englund and Sparks 1988). This package was followed by another EPA package, known as GEOPACK, developed by Yates and Yates (EPA 1990). The successful results of application of GEO-EAS prompted the EPA to recommend its use in spatial environmental data analysis (EPA 1989 and 1990). EPA Region V FIELDS (Field Environmental. Decision Support) Team has also developed software for processing and analysis of spatial data that can be used in conjunction with geostatistical software.30 Many public domain and commercial software packages are also available. Among them is GSLIB (Deutsch and Journel 1992) which is public domain software containing a comprehensive library of geostatistical Fortran codes. Commercial software, such ISATIS (Geovariances 2000) and ArcGIS Geostatistical Analyst (ESRI2002), provides comprehensive tools to perform geostatistical analysis. 30 Http://www.epa.gov/region5fields/htm/software.htm. 33 ------- 6.0 APPENDIX B: SAMPLE GEOSTATISTICAL PROCEDURES FOR COMPUTING EPC [Consider adding this to the UCL Guidance or perhaps more appropriately to RAGS Appendix C, Using Geostatistics to Represent the Concentration Term for Probabilistic Risk Assessment (EPC Guidance).] Exposure point concentration (EPC) is one of the key variables in estimating exposure in risk calculations. For purposes of this guidance, the EPC is not a point value but rather an average value for an exposure unit (EU). In cases where data are collected randomly and are not spatially correlated, the EPC is usually estimated as the upper confidence level (UCL) of the arithmetic mean concentration. However, due to the typical bias of field data toward impacted zones, the collected data can be biased, clustered, and correlated. Under such conditions, the use of non-spatial statistical techniques could lead to overly-conservative (i.e., elevated) EPCs. In such cases, geostatistics can be used to compute the EPC in order to avoid overly-conservative results.31 Approach Computation of the EPC for a specified EU can be conducted by using the geostatistical estimation process known as block kriging.32 One of the available approaches for this process is demonstrated below. Step 1. Determine the spatial correlation between any two points (i.e., the variogram y i, j) based on the available data. Step 2. Define the extent of the Exposure Unit (EU) and identify its nearby measured values (zb z2, , zj o o o o o o o o° o o o o o o o o o o o o o ° o o ° o o o o o o o o o o Available Site Data 31 Excessive bias resulting from extensive hot spot sampling and no sampling in surrounding unimpacted areas, however, can provide misleading results using geostatics. In such cases, geostatistical interpolation would over-estimate the impacted areas. 32 Englund and Sparks (1988) defines kriging as a weighted-moving-average interpolation method where the set of weights assigned to samples minimizes the estimation variance, which is computed as a function of the variogram model, the locations of the samples relative to each other, and the point or block being estimated. Each estimated value is accompanied by its estimation (kriging) standard deviation. 34 ------- Nearby Measured Values Step 4. To minimize the estimation variance of Zeu (i.e., the most accurate linear estimate of ZEU), subject to the unbiasedness criterion (i.e., no systematic error), solve the kriging system below to determine the unknown estimation weights (w,, Xi,i n,2 ¦ ¦ yu i ~w~ 1 1 Y 2,1 Y 2,2 ¦ ¦ Y 2, n 1 Y2,EU y„,i r„,2 ¦ ¦ y„,„ 1 Y n,EU 1 1 . . 1 0 1 where: Yu variogram between measured values zt and zj Yieu ~ variogram between the measured value r, and the block representing the exposure unit EU w, estimation weight of the measured value r, Lagrange multiplier 35 ------- Step 5. Compute the block-average concentration over the EU (Z eli) and its measure of accuracy (pEU): ZEU =TJW,Z, i=1 n n n °EU = 2X W-yi,EU "EE wiwpi,J + ^EU,EU i=1 i=\ j=1 where: Z*EU = estimated average concentration over EU oEU = kriging standard deviation of Z*EU Step 6. Compute EPC as the Upper Confidence Limit (UCLeu) of average concentration over EU (95% UCL shown below)33: EPC=UCLeu=Z;u+\.64cteu The results of block kriging (i.e., the block-average concentration and its standard deviation) can be used to calculate the EPC as the upper confidence limit of the block average concentration based on a pre-defined tolerance level (e.g., 95%). This is a computationally intensive process that requires the use of specialized software, such as EPA's GEO-EAS program (Englund and Sparks 1988). Detailed descriptions of block kriging are provided in textbooks, such as Journel and Huijbregts (1978) and Isaaks and Srivastava (1989). Standard Guide for Selection of Kriging Methods in Geostatistical Site Investigations, produced by the American Society for Testing and Materials (ASTM), provides further guidance on selecting and conducting the appropriate kriging procedure. Example Application Exhibit B-l illustrates a case study in which the EPC over a block, representing a 2.4-acre EU is computed. This EU represents the exposure domain of an ecological receptor to soil concentrations of a contaminant of concern. This data set displays a high degree of spatial correlation due to the nature of past releases at the site. In the block kriging process, measured concentrations of the investigated contaminant within and outside of the EU are used. This is accomplished by computing estimation weights that incorporate the location of sampling points relative to each other, as well as relative to the EU. In general, data points that are isolated and close 33 In block kriging, the area mean is calculated as the average of all point values that constitute the block (e.g., EU). Based on the Central Limit Theory, such an average has a strong tendency toward normal distribution, regardless of the distribution of the underlying point values. Therefore, the UCL is calculated in accordance with the normality assumption of the area mean value, using the block average and its corresponding estimation standard deviation. 36 ------- to the EU receive higher estimation weights, while clustered data far from the EU are assigned lower weights. Exhibit B-l EXAMPLE OF COMPUTING EPC USING BLOCK KRIGING Key Potential Method Advantages Computation of the EPC through block kriging may offer the following advantages. The computational process can accommodate biased, clustered and correlated data sets. In fact, block kriging is the only commonly used process that can estimate area averages based on correlated point measurements. The correlation of data automatically governs the estimation process (i.e., estimation is performed based on the modeled variogram). The location, shape and size of the EU are directly incorporated in the estimation process. The resulting EPCs are specifically computed for each EU. 37 ------- Generally, the advantages of block kriging are mainly achieved through computationally-intensive and time-consuming processes that require specialized software and training. However, if results of simpler methods are deemed acceptable (i.e., they do not produce unreasonably high estimates of EPCs for individual EUs at a site), then more complicated geostatistical analyses can be forgone in favor of non-spatial statistical computations. 38 ------- 7.0 BIBLIOGRAPHY ASCE Task Committee on Geostatistical Techniques in Geohydrology, S. Rouhani, Chairman and Principal Author, Review of Geostatistics in Geohydrology, I. Basic Concepts, ASCE Journal of Hydraulic Engineering, 116(5), 612-632, May 1990a. ASCE Task Committee on Geostatistical Techniques in Geohydrology, S. Rouhani, Chairman and Principal Author, Review of Geostatistics in Geohydrology, II. Applications, ASCE Journal of Hydraulic Engineering, 116(5), 633-658, May 1990b. ASTM, Standard Guide for Analysis of Spatial Variations in Geostatistical Site Investigations, D5522-96, 1996. ASTM, Standard Guide for Content of Geostatistical Site Investigations, D5549-94, 1994. ASTM, Standard Guide for Selection for Simulation Approaches in Geostatistical Site Investigations, D5524-96, 1994. ASTM, Standard Guide for Selection of Kriging Methods in Geostatistical Site Investigations, D5523-96, 1996. Bowers, T.S., Shifrin, N.S., and Murphy, B.L., Statistical Approaches to Meeting Soil Cleanup Goals, Environmental Science & Technology, 30, 1437-1444, 1996. Cohen, J.T., Bowers, T.S., Lampson, D.W., Lampson, M.A., and Merrill, D.E., Quantification of Exposure Area Cleanup Thresholds When Contaminant Levels are Uncertain, American Statistical Association 1997 Proceedings of the Section on Statistics and the Environment, Alexandria, VA, 1997. Deutsch, C. V., and A. G. Journel, GSLIB: Geostatistical Software Library and User's Guide, Oxford University Press, 1992. Englund, E., and A. Sparks, GEO-EAS (Geostatistical Environmental Assessment Software) User's Guide, EPA 600/4-88/033, ENMSL, Environmental Protection Agency, Las Vegas, 1988. ESRI, ArcGIS Geostatistical Analyst, http://www.esri.com/software/arcgis/arcgisxtensions/ geostatistical/index.html, 2002. Geovariances, ISATISReference Guide, www.geovariances.fr, 2000. Isaaks, E.H., and R.M. Srivastava, Applied Geostatistics, Oxford University Press, 561 p., 1989. Journel, A.G., and C.J. Huijbregts, Mining Geostatistics, Academic Press, London, 600 p., 1978. 39 ------- Matheron, G., The Theory of Regionalized Variables and Its Applications, Les Cahiers du Centre de Morphologie Mathmatique de Fontainebleau, No. 5, 211 p., 1971. Ryti, Randall T., Superfund Soil Cleanup: Developing the Piazza Road Remedial Design, Journal of Air and Waste Management, 43: 197-202, February 1993. Singh, A., A.K. Singh, and R.J. Iaci, Estimation of the Exposure Point Concentration Term Using a Gamma Distribution, Technology Support Center Issue, EPA/600/R-02/084, EPA Technology Support Center for Monitoring and Site Characterization, Las Vegas, Nevada, October 2002. Singh, A., A.K. Singh, R.W. Maichle, ProUCL Version 3.0, User Guide, EPA, April 2004. Singh, A.K., A. Singh, Review of the Confidence Removal Goal Approach for Site Cleanup, Technology Support Center, U.S. EPA, Las Vegas, Nevada, 1996. Schultz, B., A.K. Singh, A. Singh, An Evaluation of the Confidence Response Goal Approach for Making Remediation Decisions at Superfund Sites, Environmetrics, 13 725-732, 2002. Schulz, Terry W., and Susan Griffin, Practical Methods for Meeting Remediation Goals at Hazardous Waste Sites, Risk Analysis, Vol. 21, No. 1, 2001. U.S. Environmental Protection Agency, Calculating Upper Confidence Limits for Exposure Point Concentrations at Hazardous Waste Sites, OSWER 9285.6-10, December 2002. U.S. Environmental Protection Agency, Supplemental Guidance for Developing Soil Screening Levels for Superfund Sites, OSWER 9355.4-24, December 2002. U.S. Environmental Protection Agency, Superfund Community Involvement Handbook, EPA 540-K- 01-003, April 2002. U. S. Environmental Protection Agency, Risk Assessment Guidance for Superfund: Volume III-Part A, Process for Conducting Probabilistic Risk Assessment, EPA 540-R-02-002, December 2001. U.S. Environmental Protection Agency, Multi-Agency Radiation Site Survey and Investigation Manual (MARSSIM), EPA 402-R-97-016, Rev. 1, August 2000. U. S. Environmental Protection Agency, Data Quality Objectives Processfor Hazardous Waste Site Investigations, EPA/600/R-00/007, January 2000. U.S. Environmental Protection Agency, Soil Screening Guidance: Technical Background Document, EPA 540/F-95/128, July 1996. U.S. Environmental Protection Agency, Supplemental Guidance to RAGS: Calculating the Concentration Term, EPA 9285.7-08, May 1992. 40 ------- U.S. Environmental Protection Agency, Guidance for Data Usability in Risk Assessment (Part A), EPA/9285.7-09A, April 1992. U.S. Environmental Protection Agency, Guidance for Data Usability in Risk Assessment (PartB), EPA/9285.7-09B, May 1992. U.S. Environmental Protection Agency, Risk Assessment Guidance for Superfund: Volume 1: Human Health Evaluation Manual Part B, Development of Risk-Based Preliminary Remediation Goals, Washington, D.C., December 1, 1991. U.S. Environmental Protection Agency, Guide to Selecting Superfund Remedial Actions, OSWER 9355.0-27FS, April 1990. U.S. Environmental Protection Agency, Risk Assessment Guidance for Superfund, Volume 1 - Human Health Evaluation Manual (Part A). Interim Final. EPA 1540/1-89/002, 1989. U.S. Environmental Protection Agency, Methods for Evaluating the Attainment of Cleanup Standards, Volume 1: Soils and Soil Media, EPA230/02-89-042, 1989. 41 ------- |