United States Office of Wastewater EPA 833-R-10-003 Environmental Management (June/2010) Agency (4203M) http://www.epa.aov/npdes/permitbasics &EPA National Pollutant Discharge Elimination System Test of Significant Toxicity Implementation Document June 2010 ------- NATIONAL POLLUTANT DISCHARGE ELIMINATION SYSTEM TEST OF SIGNIFICANT TOXICITY IMPLEMENTATION DOCUMENT An Additional Whole Effluent Toxicity Statistical Approach for Analyzing Acute and Chronic Test Data U.S. Environmental Protection Agency Office of Wastewater Management Water Permits Division 1200 Pennsylvania Avenue, NW Mail Code 4203M EPA East - Room 7135 Washington, DC 20460 June, 2010 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 NOTICE AND DISCLAIMER This document provides the basis for implementing the Test of Significant Toxicity (TST) approach under the National Pollutant Discharge Elimination System (NPDES) for permitting authorities (states and Regions) and persons interested in analyzing whole effluent toxicity (WET) test data using the traditional hypothesis testing approach as part of the NPDES Program under the Clean Water Act (CWA). This document describes what the U.S. Environmental Protection Agency (EPA) believes is another statistical option to analyze valid WET test data for NPDES WET reasonable potential and permit compliance determinations. The document does not, however, substitute for the CWA, an NPDES permit, or EPA or state regulations applicable to permits or WET testing; nor is this document a permit or a regulation itself. The TST approach does not result in changes to EPA's WET test methods promulgated at Title 40 of the Code of Federal Regulations Part 136. The document does not and cannot impose any legally binding requirements on EPA, states, NPDES permittees, or laboratories conducting or using WET testing for permittees (or for states in evaluating ambient water quality). EPA could revise this document without public notice to reflect changes in EPA policy and guidance. Finally, mention of any trade names, products, or services is not and should not be interpreted as conveying official EPA approval, endorsement, or recommendation. ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 CONTENTS EXECUTIVE SUMMARY v ACRONYMS AND ABBREVIATIONS xi GLOSSARY xiii 1.0 INTRODUCTION 1 1.1 Terminology and Concepts 1 1.2 Background on the TST Approach 3 2.0 TST METHODOLOGY 5 2.1 Regulatory Management Decisions for the TST Approach 5 2.2 Setting the Test Method-Specific Alpha Level 6 3.0 USING THE TST APPROACH IN WET DATA ANALYSES 9 3.1 Summary of Test Method-Specific Alpha Values 9 3.2 Calculating Statistics for Valid WET Data Using the TST Approach 10 4.0 IMPLEMENTING THE TST APPROACH IN WET NPDES PERMITS 13 4.1 Reasonable Potential (RP) WET Analysis 13 4.2 NPDES WET Permit Limits 14 5.0 RECOMMENDATIONS FOR NPDES IMPLEMENTATION OF THE TST APPROACH 15 5.1 EPA Regions and NPDES States (Permitting Authorities) 15 5.2 NPDES Permittees 15 6.0 SUMMARY OI THE TST APPROACH 17 7.0 LITERATURE CITED 19 APPENDICES A Step-by-Step Procedures for Analyzing Valid WET Data Using the TST Approach B Critical t Values for the TST Approach C Application of the TST Approach to Ambient Toxicity Programs D Example NPDES Permit Language Using the TST Approach E WET Reasonable Potential Analysis Using the TST Approach ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 TABLES Table Page 1 Expression of null and alternative hypotheses used in traditional hypothesis testing and resulting decisions based on this approach 3 2 Expression of null and alternative hypotheses using the TST approach and relationships between error rates and resulting decisions 4 3 Summary of alpha (a) levels or false negative rates recommended for different WET test methods using the TST approach 9 iv ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 EXECUTIVE SUMMARY The U.S. Environmental Protection Agency (EPA or the Agency) has developed a new statistical approach that assesses the whole effluent toxicity (WET) measurement of wastewater effects on specific test organisms' ability to survive, grow, and reproduce. The new approach is called the Test of Significant Toxicity (TST) and is a statistical method that uses hypothesis testing techniques based on research and peer-reviewed publications. The TST approach examines whether an effluent, at the critical concentration (e.g., in-stream waste concentration or IWC, as recommended in EPA's Technical Support Document (TSD) (USEPA 1991) and implemented under EPA's WET National Pollutant Discharge Elimination System (NPDES) permits program and the control within a WET test differ by an unacceptable amount (the amount that would have a measured detrimental effect on the ability of aquatic organisms to thrive and survive). EPA Regions and their NPDES states can still use EPA's TSD approaches. The TST approach is another statistical option to analyze valid WET test data. Since the inception of EPA's NPDES WET Program in the mid 1980s, the Agency has striven to advance and improve its application and implementation under the NPDES Program. The TST approach explicitly incorporates test power (the ability to correctly classify the effluent as non- toxic, also see reference in the glossary under power) and provides a positive incentive to generate valid, high quality WET data to make informed decisions regarding NPDES WET reasonable potential (RP) and permit compliance determinations. Once the WET test has been conducted (using multiple effluent concentrations and other requirements as specified in the EPA WET test methods), the TST approach can be used to analyze the WET test results to assess whether the effluent discharge is toxic at the critical concentration. Performing the EPA WET test where the minimum five required test concentrations (pursuant to the EPA WET test methods) can establish a concentration-response curve. The TST approach is designed to be used for a two concentration data analysis of the IWC or a receiving water concentration (RWC) compared to a control concentration. Using the TST approach, permitting authorities will have more confidence when making NPDES determinations as to whether a permittee's effluent discharge is toxic or non-toxic. Use of the TST approach does not result in any changes to EPA's WET test methods; however, a facility might desire to modify its future WET tests by increasing the number of replicates over the minimum required (USEPA 1995, 2002a, 2002b, 2002c) by the approved EPA WET test method to increase test power, which is the probability of declaring an effluent non-toxic if the organism response at the IWC is truly acceptable. If WET tests have already been performed, the WET data generated cannot be modified to increase the number of test replicates because the TST analysis is done on valid WET data generated within a WET test. The TST approach was developed on the basis of extensive analyses and detailed research. EPA used valid WET data from more than 2,000 WET tests to develop and evaluate the TST approach. The TST approach was tested using nine different WET test methods comprising twelve biological endpoints (e.g., reproduction, growth, survival) and representing most of the different types of WET test designs currently in use. More than one million computer simulations were also used to select error rates achieving EPA's regulatory management decisions for the TST approach. ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 Background In the NPDES Program, an effluent sample is declared toxic relative to a permitted WET limit if the no observed effect concentration (NOEC) is less than the permitted IWC using a hypothesis statistical approach. In that traditional hypothesis approach, the question being answered is, "Is the mean response of the organisms the same in the control and at the IWC?" The hypothesis testing approach has four possible outcomes: (1) the IWC is truly toxic and is declared toxic, (2) the IWC is truly non-toxic and is declared non-toxic, (3) the IWC is truly toxic but is declared non-toxic, and (4) the IWC is truly non-toxic but is declared toxic. The latter two possible outcomes represent decision errors that can occur with any hypothesis testing approach. In the NPDES WET Program, those two types of errors can occur when test control replication is very good (i.e., test is very precise) so that a very small difference between IWC and control is declared toxic (outcome [4] above), and when test control replication is poor (i.e., the test is very imprecise) so that even large differences in organism response between the IWC and control cannot be distinguished as statistically different, and the effluent is incorrectly classified as non- toxic (outcome [3] above). Organism responses to the IWC and control are unlikely to be exactly the same. The difference might be so small that even if statistically significant, it would be considered biologically negligible. Another approach for assessing an effluent's toxicity on the basis of collected WET data might be to rephrase the question, "Does the mean WET test response in the control and the IWC differ by a defined biological amount?" That approach is known as the test of bioequivalence, which the Food and Drug Administration has successfully used to evaluate drugs, as have many researchers in other biological fields. Using the TST approach, the question is, "Is the organism response at the IWC less than or equal to a fixed fraction of the control response (e.g., 75 percent of the control mean response)?" That fixed fraction, expressed as a decimal between 0.00 and 1.00, is termed "b " in the TST approach. Thus, the hypothesis being tested is written as follows: mean response [IWC] < b x mean response [control]. The TST approach requires defining what is considered toxic. For chronic testing (i.e., for both lethal and sublethal toxicity test endpoints) in EPA's NPDES WET Program, the b value in the TST analysis is set at 0.75, which means that a 25 percent effect (or more) is considered evidence of unacceptable chronic toxicity. IWC responses substantially less than a 25 percent effect would be interpreted to have a lower risk potential. The regulatory management decision (RMD) for acute WET methods is set at 0.80, which means that a 20 percent effect (or more) is considered evidence of unacceptable acute toxicity. The acute RMD toxicity threshold is higher than that for chronic WET methods because of the severe environmental implications of acute toxicity (lethality or organism death). For more discussion on the b values of 0.75 (chronic toxicity) and 0.80 (acute toxicity), see Section 2.1 of this document. EPA's RMDs using the TST approach identify true toxicity in WET tests most of the time when it occurs, while also minimizing the probability that the IWC is declared toxic when in fact it is not. That objective requires additional RMDs regarding acceptable maximum false positive (ftor beta using a TST approach) and false negative rates (a or alpha using a TST approach). In the TST approach, the RMDs are defined as (1) declare a sample toxic at least 75 percent of the time (alpha, a < 0.25) when there is unacceptable toxicity (20 percent effect for acute and 25 percent effect for chronic test methods), and (2) declare an effluent non-toxic no more than 5 percent vi ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 (beta, |3 < 0.05) of the time when the mean effect at the critical effluent concentration is < 10 percent for both acute and chronic WET tests (including for sublethal endpoints). For more discussion on the RMDs, see Section 2.1 of this document. On the basis of EPA's analyses, the alpha levels shown in Table ES-1 are recommended for the nine WET test methods examined using the TST approach. An important feature of the TST approach is that the false negative error rate (rate of declaring a toxic effluent to be non-toxic) is established, which, under the traditional hypothesis testing approach, had not been established by EPA previously. For more discussion on the inclusion of the beta error rate in the TST approach, see Section 1.2 of this document and Section 1.1 on the current approach in EPA's 1991 TSD. A demonstrated benefit of the TST approach is that increasing within-test replication (the test power) results in a lower rate of WET tests being declared toxic using the TST approach when the IWC is truly non-toxic. Results obtained from the TST analyses using the nine EPA test methods should be applicable to other EPA WET methods not examined. For example, results generated under this project for the fish Pimephalespromelas survival and growth test is extrapolated to other EPA fish survival and growth tests (e.g., Menidia sp., Cyprinus variegatus, Atherinops affinis) because those test methods use a similar test design (e.g., number of replicates, number of organisms tested) and measure the same endpoints. Summary More than 2,000 WET test results and more than one million simulations were conducted to develop the technical basis for the TST approach. The approach builds on the strengths of the traditional hypothesis testing approach, including use of robust statistical analyses and published EPA documents regarding WET data analysis and interpretation. The TST approach yields a rigorous statistical interpretation of valid WET data by incorporating transparent RMDs and established alpha and beta error rates, which can provide incentives to generate test results having greater test power. Because the approach considers statistical test power, its use will result in greater confidence in WET regulatory decisions. In addition, the TST approach provides a positive incentive for the permittee to generate valid, high quality WET data by either increasing the number of test replicates for the IWC and the control within a test and/or achieving better precision within a test through improved WET test method performance (e.g., a high level of quality assurance and quality control). Permitting authorities should consider the practical programmatic shift from the traditional hypothesis testing approach to the TST approach by opening a dialogue with their regulated community. In addition, they might want to begin to identify what changes might be needed to assimilate the TST approach into any regulations, policy, guidance, and training in their respective NPDES WET Programs. Again, the traditional hypothesis testing approach under EPA's TSD is still considered valid as applied; however, that approach can now be advanced through the TST approach by providing new incentives to permittees to provide valid, high quality WET data. ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 Table ES-1. Summary of alpha (a) levels or false negative rates recommended for different WET test methods using the TST approach EPA WET test method b value Probability of declaring a toxic effluent non-toxic False negative (a) error3 Chronic Freshwater and East Coast Methods Ceriodaphnia dubia (water flea) survival and reproduction 0.75 0.20 Pimephales promelas (fathead minnow) survival and growth 0.75 0.25 Selenastrum capricornutum (green algae) growth 0.75 0.25 Americamysis bahia (mysid shrimp) survival and growth 0.75 0.15 Arbacia punctulata (Echinoderm) fertilization 0.75 0.05 Cyprinodon variegatus (Sheepshead minnow) and Menidia beryllina (inland silverside) survival and growth 0.75 0.25 Chronic West Coast Marine Methods Dendraster excentricus and Strongylocentrotus purpuratus (Echinoderm) fertilization 0.75 0.05 Atherinops affinis (topsmelt) survival and growth 0.75 0.25 Haliotis rufescens (red abalone), Crassostrea gigas (oyster), Dendraster excentricus, Strongylocentrotus purpuratus (Echinoderm) and Mytilus sp (mussel) larval development methods 0.75 0.05 Macrocystis pyrifera (giant kelp) germination and germ-tube length 0.75 0.05 Acute Methods Pimephales promelas (fathead minnow), Cyprinodon variegatus (Sheepshead minnow), Atherinops affinis (topsmelt), Menidia beryllina (inland silverside) acute survival13 0.80 0.10 Ceriodaphnia dubia, Daphnia magna, Daphnia pulex, Americamysis bahia acute survival13 0.80 0.10 Notes: a. (1) declare a sample toxic at least 75 percent of the time (alpha < 0.25) when there is unacceptable toxicity (20 percent effect for acute and 25 percent effect for chronic test methods) and (2) declare an effluent non-toxic no more than 5 percent of the time (beta < 0.05) when the mean effect at the critical effluent concentration is 10 percent for both acute and chronic WET tests (including sublethal endpoints). For more discussion on the RMDs, see Section 2.1 of this document. b. Based on four replicate test design In addition, EPA recommends the following: • Permitting authorities should decide up front which approach (the EPA's 1991 TSD approach, the TST approach, or another scientifically defensible approach that is sufficient to meet the statutory and regulatory requirements) they will follow (including for their RP procedures) and use the selected approach consistently in all their state NPDES permits. Permitting authorities should ensure that the most environmentally protective approach is consistently used across all permits when assessing valid WET data (e.g., WET RP) for viii ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 NPDES permit requirements (e.g., WET limits, monitoring frequencies, toxicity identification evaluation/toxicity reduction evaluation) and avoid selecting the approach that underestimates the true toxicity of the permitted effluent discharge. • Where a small data set exists (fewer than four valid WET tests performed and reported in the previous 5 years), permitting authorities should use the TSD approach for determining RP. With small WET data sets, the TSD's RP multiplying factor is more conservative for environmental water quality protection purposes than the TST. The TST approach is intended for larger data sets (four or more) because it does not use an RP multiplying factor. • If WET tests have already been performed, the WET data generated cannot be modified to increase the number of test replicates within a test. The decision to increase the number of within test replicates is a decision that needs to be made before conducting the WET tests. • Where a permittee has concerns about WET data quality, EPA recommends increasing the number of replicates in tests, even if the permitting authority has not yet adopted the TST approach. ix ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 x ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 ACRONYMS AND ABBREVIATIONS CFR Code of Federal Regulations CV coefficient of variation CWA Clean Water Act DMR discharge monitoring report EC effect concentration EPA U.S. Environmental Protection Agency IC25 25 percent inhibition concentration IWC in-stream waste concentration LC50 50 percent lethal concentration LOEC lowest observed effect concentration MDL maximum daily limit NOEC no observed effect concentration NPDES National Pollutant Discharge Elimination System QA/QC quality assurance/quality control RMD regulatory management decision RP reasonable potential RPMF reasonable potential multiplying factor RWC receiving water concentration SWAMP Surface Water Ambient Monitoring Program (California) TAC test acceptability criteria TIE toxicity identification evaluation TRE toxicity reduction evaluation TSD Technical Support Document for Water Quality-Based Toxics Control TST Test of Significant Toxicity TU toxicity unit WET whole effluent toxicity xi ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 GLOSSARY Acute Toxicity Test is a test to determine the concentration of effluent or ambient waters that causes an adverse effect (usually mortality) on a group of test organisms during a short-term exposure (e.g., 24, 48, or 96 hours). Acute toxicity is determined using statistical procedures (e.g., point estimate techniques or a t-test). Ambient Toxicity is measured using a toxicity test on a sample collected from a receiving waterbody. Chronic Toxicity Test is a short-term test in which sublethal effects (e.g., reduced growth or reproduction) are usually measured in addition to lethality. Coefficient of Variation (CV) is a standard statistical measure of the relative variation of a distribution or set of data, defined as the standard deviation divided by the mean. The CV can be used as a measure of precision within and between laboratories, or among replicates for each treatment concentration. Confidence Interval is the numerical interval constructed around a point estimate of a population parameter. Effect Concentration (EC) is a point estimate of the toxicant concentration that would cause an observable adverse effect (e.g., mortality, fertilization). EC25 is a point estimate of the toxicant concentration that would cause an observable adverse effect in 25 percent of the test organisms. False Negative is when the in-stream waste concentration is declared non-toxic but in fact is truly toxic. In the traditional hypothesis approach, false negative error rate is denoted by Beta (|3). In the TST approach, false negative error rate is denoted as Alpha (a), which applies when the percent effect in the critical effluent concentration is > 25% for a given test. False Positive is when the in-stream waste concentration is declared toxic but in fact is truly non-toxic. In the traditional hypothesis approach, false positive error rate is denoted by Alpha (a). In the TST approach, false positive error rate is denoted as Beta ((3), which applies when the percent effect in the critical effluent concentration is < 10% for a given test. Hypothesis Testing is a statistical approach (e.g., Dunnett's procedure) for determining whether a test concentration is statistically different from the control. Endpoints determined from hypothesis testing are no observed effect concentration and lowest observed effect concentration (LOEC). The two hypotheses commonly tested in WET are: Null hypothesis (Ho): The effluent is non-toxic. Alternative hypothesis (Ha): The effluent is toxic. Inhibition Concentration (IC) is a point estimate of the toxicant concentration that would cause a given percent reduction in a nonlethal biological measurement (e.g., reproduction or growth), calculated from a continuous model (i.e., Interpolation Method). IC25 is a point estimate of the toxicant concentration that would cause a 25 percent reduction in a nonlethal biological measurement. xiii ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 In-stream Waste Concentration (IWC) is the concentration of a toxicant or effluent in the receiving water after mixing. The IWC is the inverse of the dilution factor. It is sometimes referred to as the receiving water concentration (RWC). Lethal Concentration, 50 percent (LC50) is the toxicant or effluent concentration that would cause death to 50 percent of the test organisms. Lowest Observed Effect Concentration (LOEC) is the lowest concentration of an effluent or toxicant that results in statistically significant adverse effects on the test organisms (i.e., where the values for the observed endpoints are statistically different from the control). No Observed Effect Concentration (NOEC) is the highest tested concentration of an effluent or toxicant that causes no observable adverse effect on the test organisms (i.e., the highest concentration of toxicant at which the values for the observed responses are not statistically different from the control). National Pollutant Discharge Elimination System (NPDES) is the national program for issuing, modifying, revoking and reissuing, terminating, monitoring and enforcing permits, and imposing and enforcing pretreatment requirements, under the Clean Water Act sections 307, 318, 402, and 405. Power (or test power) in the context of the Test of Significant Toxicity approach, is the probability of correctly declaring an effluent non-toxic when, in fact, it has an acceptably low level of toxicity. Precision is a measure of reproducibility (which is a statistical term about the ability to reproduce similar results across test replicates with in a test treatment) within a data set. Precision can be measured both within a laboratory (within-laboratory) and between laboratories (between-laboratory) using the same test method and toxicant. Quality Assurance (QA) is a practice in toxicity testing that addresses all activities affecting the quality of the final effluent toxicity data. QA includes practices such as effluent sampling and handling, source and condition of test organisms, equipment condition, test conditions, instrument calibration, and replication, use of reference toxicants, recordkeeping, and data evaluation. Quality Control (QC) is the set of more focused, routine, day-to-day activities carried out as part of the overall QA program. Reasonable Potential (RP) is where an effluent is projected or calculated to cause an excursion above a water quality standard based on a number of factors including the four factors listed in Title 40 of the Code of Federal Regulations Part 122.44(d)(l)(ii). Reference Toxicant Test is a check of the sensitivity of the test organisms and suitability of the test methodology using the reference toxicant required by the EPA WET test methods. Reference toxicant data are part of a routine QA/QC program to evaluate the performance of laboratory personnel and the robustness and sensitivity of the test organisms. xiv ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 Regulatory Management Decision (RMD) is the decision that represents the maximum allowable error rates and thresholds for toxicity and non-toxicity that would result in an acceptable risk to aquatic life. Replicate is two or more independent organism exposures of the same treatment (i.e., effluent concentration) within a WET test. Replicates are typically separate test chambers with organisms, each having the same effluent concentration. Sample is defined as a representative portion of a specific environmental matrix that is used in toxicity testing. For this document, environmental matrices could include effluents, surface waters, groundwater, stormwater, and sediment. Significant Difference is defined as a statistically significant difference (e.g., 95 percent confidence level) in the means of two distributions of sampling results. Statistic is a computed or estimated quantity such as the mean, standard deviation, or coefficient of variation. Test Acceptability Criteria (TAC) are test method-specific criteria for determining whether toxicity test results are acceptable. The effluent and reference toxicant must meet specific criteria as defined in the test method (e.g., for the Ceriodaphnia dubia survival and reproduction test, the criteria are as follows: the test must achieve at least 80 percent survival and an average of 15 young per surviving female in the control and at least 60% of surviving organisms must have three broods). t-test (formally Student's t-Test) is a statistical analysis comparing two sets of replicate observations—in the case of WET, only two test concentrations (e.g., a control and IWC). The purpose of this test is to determine if the means of the two sets of observations are different (e.g., if the IWC or ambient concentration differs from the control [i.e., the test result is pass or fail]). Type I Error (alpha a) is the error of rejecting the null hypothesis (Ho) that should have been accepted. Type II Error (beta P) is the error of accepting the null hypothesis (H0) that should have been rejected. Toxicity Test is a procedure to determine the toxicity of a chemical or an effluent using living organisms. A toxicity test measures the degree of effect on exposed test organisms of a specific chemical or effluent. Welch's t-test is an adaptation of Student's t-test intended for use with two samples having unequal variances. Whole Effluent Toxicity (WET) is the total toxic effect of an effluent measured directly with a toxicity test. xv ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 xvi ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 1.0 INTRODUCTION Whole effluent toxicity (WET) test methods are laboratory procedures that measure biological effects (e.g., survival, growth, reproduction) on aquatic organisms exposed to effluents or storm water discharged to receiving waters in implementing the National Pollutant Discharge Elimination System (NPDES) Program under the Clean Water Act (CWA) section 402. Since the publication of EPA's Technical Support Document for Water Quality-based Toxics Control (TSD) (USEPA 1991), permitting authorities have requested alternative approaches for analyzing WET test data that would provide increased confidence in the data assessment and simplify the NPDES permit decision-making process with respect to WET. In response to those requests, EPA developed the TST approach as another statistical option to analyze valid WET test data. This document presents the NPDES programmatic features of the TST statistical approach for analyzing valid WET data and how it can be used to support permitting authorities and permittees when analyzing and interpreting WET test data. Use of the TST approach does not result in any changes to EPA's WET test methods, nor does it preclude the use of EPA's TSD approaches for analyzing valid WET data, or another scientifically defensible approach that is sufficient to meet the statutory and regulatory requirements. 1.1 Terminology and Concepts This section briefly summarizes the major statistical concepts and terminology involved in WET analysis so as to give the reader a context with which to understand the TST approach and how it differs from current statistical approaches used to analyze valid WET data. This TST implementation document is not intended to provide a detailed discussion of WET test methods, data interpretation, or statistics, and it is assumed that the reader will consult EPA's TSD, WET test method documents, and other WET-related documents (e.g., Understanding and Accounting for Method Variability in Whole Effluent Toxicity Applications, USEPA 2000). In the NPDES Program, WET tests examine organism responses to effluent, typically along a dilution series (USEPA 1995, 2002a, 2002b, 2002c). Acute WET methods measure the lethal response of test organisms exposed to effluent (USEPA 2002c). The principal response endpoints for those methods are the effluent concentration that is lethal to 50 percent of the test organisms (LC50) or the effluent concentration at which survival is significantly lower than the control. Chronic WET methods often measure both lethal and sublethal responses of test organisms. The statistical endpoints used in chronic WET testing are the no observed effect concentration (NOEC) and the 25 percent inhibition concentration (IC25). The NOEC endpoint is determined using a hypothesis testing approach that identifies the maximum effluent concentration at which the response of test organisms is not significantly different from the control. From a regulatory perspective, an effluent sample is declared toxic if the NOEC is less than the in-stream waste concentration (IWC) specified through the WET limitations in the permit. The IC25, by contrast, is a point estimation approach. It identifies the concentration at which the response of test organisms is 25 percent below that observed in the control concentration, and it interpolates the effluent concentration at which this magnitude of response is expected to occur. From a regulatory perspective, an effluent sample is declared toxic if the IC25 is less than the IWC specified through the WET limitations in the permit. This document focuses only on the hypothesis testing approach and not on point estimation approaches for analyzing and interpreting WET data. 1 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 In any hypothesis testing approach, two hypotheses are stated: the null hypothesis and the alternative hypothesis. The statistical concepts associated with the traditional hypothesis testing approach currently used in WET analysis are summarized in Table 1. Using that approach, the null hypothesis is that the IWC is non-toxic (i.e., the organism response at the IWC is equal to or better than the response in the test control). The alternative hypothesis is that the IWC is toxic (i.e., the organism response is worse in the IWC than in the control). With any hypothesis testing approach, two types of decision errors occur: (1) conclude that the null hypothesis is correct when in fact it is not or (2) conclude that the null hypothesis is incorrect (i.e., reject the null hypothesis) and thereby declare that the alternative hypothesis is correct, when in fact the null hypothesis is correct. In WET testing, the first type of error above is referred to as a false negative, meaning that the IWC is declared non-toxic when in fact it is toxic. The second type of error above is referred to as a false positive in WET testing, meaning that the IWC is declared toxic when in fact it is not. In the traditional hypothesis testing approach summarized in Table 1, statisticians have assigned Greek letters to the two types of errors identified above. Alpha (or a) refers to the false positive error rate. Beta (or (3) refers to the rate of false negatives. In the EPA WET test methods supporting the NPDES WET Program (USEPA 1995, 2002a, 2002b), a was established but (3 was not. Therefore, the application of a from the EPA test methods and implemented under EPA's TSD, recommended that the maximum rate of false positives that should be observed should be low (no more than 5 percent or a = 0.05), but the rate of false negatives was not similarly controlled and is not currently evaluated in WET testing. As a result, the rate of false negatives in the NPDES WET Program has not been controlled. Put another way, the statistical power of these tests, the ability to correctly classify the IWC as toxic (where power is defined as 1-P, Table 1) has not been controlled. As noted previously in this section, a hypothesis testing approach determines whether the organism response at the IWC is significantly worse than that in the control. In practice, this statistical approach relies on two properties of the data: the average values in the control and the IWC (e.g., average fish weight in each test concentration), and the variability observed among replicates (i.e., organisms' responses from multiple replicates) within the IWC and the control. Whether the IWC is considered toxic depends on both of those data properties, which in many cases results in a well-established, statistically rigorous way to evaluate WET data. However, there are two types of situations in which the traditional hypothesis testing approach can yield equivocal results in WET testing: (1) in tests where within-test variability is high and (2) in tests where within-test variability is exceptionally low. In the first case, because within-test variability is high, it will be difficult to determine statistically whether the organism response to the IWC is worse than the control. That could result in more false negatives than would otherwise be the case. In the second case above, because within-test variability is very low, it will be relatively easy to show statistically significant differences in organism response between the IWC and the control. That could result in more false positives (as defined in the TST approach) than would otherwise be the case. 2 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 Table 1. Expression of null and alternative hypotheses used in traditional hypothesis testing and relationships between error rates and resulting decisions based on this approach. Entries correspond to the probability decision given in parentheses. The probability of a false positive (i.e., rejecting a null hypothesis that should not have been rejected) is represented by a and the probability a false negative (i.e., failing to reject the null hypothesis when it should have been rejected) is represented by (3. Decision True condition Null hypothesis Treatment mean > Control mean Sample is non-toxic Alternative hypothesis Treatment mean < Control mean Sample is toxic Treatment mean > Control mean Sample is non-toxic Correct decision (1-a) False negative (P) Treatment mean < Control mean Sample is toxic False positive (a) Correct decision (1 - p) (power) 1.2 Background on the TST Approach The TST is an alternative statistical approach for analyzing and interpreting valid WET data that also uses a hypothesis testing approach but in a different way, building on previous work conducted by EPA in the NPDES WET Program (USEPA 2000) and other researchers (Erickson and McDonald 1995; Shukla et al. 2000; Berger and Hsu 1996). The TST approach is based on a type of hypothesis testing referred to as bioequivalence testing. Bioequivalence is a statistical approach that has long been used in evaluating clinical trials of pharmaceutical products (Anderson and Hauck 1983) and by the Food and Drug Administration (Hatch 1996; Aras 2001; Streiner 2003). The approach has also been used to evaluate the attainment of soil cleanup standards for contaminated sites (USEPA 1989) and to evaluate effects of pesticides in experimental ponds (Stunkard 1990). In the context of the NPDES WET Program, the TST approach assesses whether the response of test organisms at the IWC (e.g., fish weight or number of neonates per female) is less than a predetermined proportion of the control response that is considered unacceptably toxic. Once the WET test has been conducted (using multiple effluent concentrations and other requirements have been met as specified in the EPA methods), the TST approach is designed to be used for a two concentration data analysis of the in-stream waste concentration (IWC) or a receiving water concentration (RWC) compared to a control concentration. The null hypothesis using the TST approach is that the IWC is significantly more toxic (i.e., results in a worse organism response) compared to the control (see Table 2). The alternative hypothesis using the TST approach is that the IWC is non-toxic. Thus, the null and alternative hypotheses using the TST approach are opposite of what they are under the traditional hypothesis testing approach described in Section 1.1. In addition, the meaning of a and (3 are also opposite from what they represent in the traditional hypothesis approach. Under the TST approach, a is associated with false negatives, and (3 is associated with false positives. Statistical power using the TST approach is the ability to correctly classify the IWC as non-toxic (Table 2). The proportion or fraction of the control response that represents the toxicity threshold is denoted as b in the equations in Table 2 and is expressed as a decimal between 0.00 and 1.00. For example, 3 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 a b value set at 0.85 would mean that a response at the IWC that is at least 85 percent of the control response in the test (i.e., no more than a 15 percent effect) would be considered a lower risk for environmental impacts. Using the TST hypothesis approach in the NPDES WET Program has several benefits. By incorporating b in the hypothesis equation, using the TST approach, there is explicit acknowledgement of the fact that the organism response at the IWC can be less than the control organism response by a certain amount and still be considered acceptable (i.e., non-toxic). In that way, truly non-toxic samples (as defined in the TST approach) can be addressed in a clearer manner than is possible with the traditional hypothesis testing approach as practiced in the NPDES WET Program. A low false positive rate in the TST approach is further addressed by having a low J3 (|3 < 0.05), which means more statistical power to identify an acceptable effluent (as defined by EPA's regulatory management decisions [RMDs]) as non-toxic in the NPDES WET Program. In addition, because the null hypothesis in the TST approach is opposite to what is used in the traditional hypothesis testing approach, false negatives are explicitly addressed (a in the TST approach addresses the false negative rate). As mentioned previously, the current NPDES WET Program does not control for false negatives. Thus, the TST approach allows permitting authorities to minimize the occurrence of false negatives (i.e., declaring the IWC non- toxic when it is actually exhibiting unacceptable toxicity), while also minimizing the occurrence of false positives (i.e., declaring the IWC toxic when it is actually acceptable). The TST approach has the added advantage of providing permittees with a clear incentive to improve the precision of test results (e.g., decrease within-test variability and/or use more replicates within a WET test than the minimum required in the EPA WET test method) to reach a definitive conclusion as to whether unacceptable toxicity is observed in a test. Thus, using the TST approach, a permittee can in fact prove a negative, i.e., that their effluent is acceptable (non- toxic). Table 2. Expression of null and alternative hypotheses using the TST approach and relationships between error rates and resulting decisions based on this approach. Entries correspond to the probability decision given in parentheses. The probability of a false positive (i.e., rejecting a null hypothesis that should not have been rejected) is represented by a and the probability a false negative (i.e., failing to reject the null hypothesis when it should have been rejected) is represented by (3. Decision True condition Null hypothesis Treatment mean b« Control mean Sample is non-toxic Treatment mean b« Control mean Sample is non-toxic False negative (a) Correct decision (1-p) (power) 4 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 2.0 TST METHODOLOGY 2.1 Regulatory Management Decisions for the TST Approach Toxicity is not an absolute quantity but rather an effect that is determined relative to a control or reference sample using a given WET test method. In the TST approach, what is considered unacceptable or acceptable toxicity are explicit RMDs. For chronic testing in EPA's NPDES WET Program, the b value in the TST null hypothesis is set at 0.75, which means that a 25 percent effect (or more) is considered a demonstration of unacceptable toxicity in a given WET test. Using a 25 percent effect threshold as the b coefficient is consistent with EPA's use of a 25 percent inhibition concentration (IC25) as an acceptable WET endpoint for examining chronic WET data. Responses substantially less than a 25 percent effect would be interpreted as a lower risk potential. The unacceptable toxicity RMD threshold for acute WET methods is set higher than that for chronic WET methods because of the severe environmental implications of acute toxicity (lethality or organism death). Therefore, for acute WET tests, the b value in the TST approach is set at 0.80 (i.e., > 20 percent effect in the effluent in acute WET tests is considered unacceptable). For both acute and chronic WET test methods, the low-risk RMD threshold is set at a 10 percent mean effect at the IWC within a WET test. Thus, one can prove the negative (i.e., an effluent is acceptable or considered non-toxic under NPDES) if that condition is met in a WET test. For mean effect levels greater than 10 percent but less than the unacceptable toxicity RMD threshold (20 percent for acute and 25 percent for chronic WET tests), the TST approach will still declare the IWC non-toxic depending on within-test variability: the lower the variability in the WET test, the more likely the sample will be declared non-toxic on the basis of the mean responses observed under these test conditions. EPA's RMDs using the TST approach are used to specify unacceptable toxicity in WET tests most of the time when it occurs (i.e., a low false negative rate). As mentioned previously, under the traditional hypothesis testing approach currently used in the NPDES WET Program, the false negative rate was not controlled. Using the TST approach, the false negative rate RMD is 0.05 < a < 0.25, which translates to at least 75 percent probability that an effluent causing unacceptable toxicity will be declared toxic. As noted in the previous paragraph, the unacceptable toxicity RMD threshold is defined as > 20 percent effect of the IWC in acute WET tests and > 25 percent effect of the IWC in chronic WET tests. EPA also desires to minimize the probability that the IWC is declared toxic when in fact it is acceptable (i.e., low false positive rate). Under the traditional hypothesis testing approach currently used in the NPDES WET Program, the false positive rate is set at 0.05 or 5 percent. Therefore, in the TST approach, the desired false positive rate is also set at 0.05 or 5 percent ((3 < 0.05). A |3 = 0.05 in the TST approach means that 95 percent of the time, a truly acceptable effluent (< 10 percent mean effect at the IWC) will be declared non-toxic in the NPDES WET Program. Depending on the minimum WET test design required in the EPA methods (e.g., number of replicates and number of organisms per test concentration) and achievable laboratory control precision for a WET test method, a will be set between 0.05 and 0.25 while still 5 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 maintaining a J3 < 0.05. Extensive analyses were used to identify the lowest a for a given WET test method for which (3 = 0.05 and all other RMDs are met. The RMD thresholds above represent boundaries in terms of desired a and (3 rates. An a = 0.20 for a chronic test method, for example, means that the Type I error rate will be approximately 20 percent at a mean effect of 25 percent. At higher levels of effect in the IWC, actual Type I error rates would be lower; at lower mean effect levels in the IWC, Type I error rate would be somewhat higher, depending on the test method. Therefore, at mean effect levels between the 10 percent non-toxic RMD boundary and the unacceptable toxicity RMD boundary (20 percent for acute and 25 percent for chronic WET test methods), there are differing probabilities of an effluent being declared toxic depending on within-test variability and the difference in mean responses observed between control and IWC. As a result, there will be some instances in which TST will declare a test toxic, whereas the traditional hypothesis approach would declare that test non-toxic (particularly when within-test variability is high or the mean effect at the IWC is near 25 percent, as explained in Section 1.1). Similarly, there will be some instances in which TST will declare an effluent non-toxic but the traditional hypothesis approach would declare that test toxic (when within-test variability is low and the mean effect at the IWC is less than the 20 percent toxicity RMD threshold for acute test methods or 25 percent for chronic toxicity test methods, as explained in Section 1.1). WET test design and the types of WET endpoints measured influence test sensitivity (e.g., control coefficient of variation or CV). Therefore, TST a error rates are identified for different types of test designs. For example, all fish chronic WET test methods that use a similar test design and have the same type of test endpoints (e.g., growth and survival) would have the same a value. Varying a by WET test design is appropriate for the TST approach. Given the way that the hypotheses are formulated in the TST approach (see Table 2), a represents what is considered (3 in the traditional hypothesis testing approach, and an acceptable (3 error was not identified in the current EPA TSD's approach to the EPA NPDES WET Program. Setting a as well as (3 in the TST approach addresses both false positives and false negatives. 2.2 Setting the Test Method-Specific Alpha Level Several types of analyses were conducted to determine the appropriate a level for each WET test method. First, representative effluent and reference toxicant data meeting EPA WET test method's test acceptability criteria (TAC) were obtained from several state databases, which included multiple laboratories and wastewater effluents. Valid effluent WET data that met the following data selection requirements were considered to be a representative sample. • Cover a range of NPDES permitted facility types, including both industrial and municipal permittees • Represent many facilities for a given EPA WET test method (i.e., no one facility dominates the data for a given WET test method) • Cover a range of target (design) effluent dilutions on which WET reasonable potential (RP) and NPDES permit compliance are based, ranging from 10 percent to 100 percent effluent concentrations • Generated by several laboratories for a given EPA WET test method 6 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 • Cover a range of observed effluent toxicity for each EPA WET test method (e.g., NOECs range from <10 percent to 100 percent effluent) For each of the nine EPA WET test methods examined, control precision was calculated on the basis of valid WET data compiled in this project. A similar analysis was performed for the control response for each of the nine test methods (e.g., mean number of offspring per female in the chronic Ceriodaphnia dubia test method) to characterize typical achievable test performance in terms of control response. A Monte Carlo simulation analysis (a statistical method) was used to estimate the percentage of WET tests that would be declared toxic using the TST approach as a function of different a levels, within-test variability (control and effluent variability), and different effect levels. That analysis identified probable false positive error rates (i.e., declaring an effluent toxic when in fact it is not) under all WET test scenarios encountered. Using the RMDs defined above, an appropriate a level was then identified for each WET test design given a desired (3 error of < 5 percent (0.05) when there is a 10 percent mean effect at the IWC. By simulating thousands of WET tests for a given scenario (mean percent effect and control CV), the percentage of tests declared toxic under a given effluent assessment scenario could be calculated and compared with other scenarios. 7 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 8 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 3.0 USING THE TST APPROACH IN WET DATA ANALYSES 3.1 Summary of Test Method-Specific Alpha Values On the basis of all the analyses conducted in this project, EPA recommends the following alpha levels when using the TST approach in a two concentration (i.e., two treatments) data analysis comparison (e.g., IWC and control) (see Table 3). Table 3. Summary of alpha (a) levels or false negative rates recommended for different WET test methods using the TST approach EPA WET test method b value Probability of declaring a toxic effluent non-toxic False negative (a) error3 Chronic Freshwater and East Coast Methods Ceriodaphnia dubia (water flea) survival and reproduction 0.75 0.20 Pimephales promelas (fathead minnow) survival and growth 0.75 0.25 Selenastrum capricornutum (green algae) growth 0.75 0.25 Americamysis bahia (mysid shrimp) survival and growth 0.75 0.15 Arbacia punctulata (Echinoderm) fertilization 0.75 0.05 Cyprinodon variegatus (Sheepshead minnow) and Menidia beryllina (inland silverside) survival and growth 0.75 0.25 Chronic West Coast Marine Methods Dendraster excentricus and Strongylocentrotus purpuratus (Echinoderm) fertilization 0.75 0.05 Atherinops affinis (topsmelt) survival and growth 0.75 0.25 Haliotis rufescens (red abalone), Crassostrea gigas (oyster), Dendraster excentricus, Strongylocentrotus purpuratus (Echinoderm) and Mytilus sp (mussel) larval development methods 0.75 0.05 Macrocystis pyrifera (giant kelp) germination and germ-tube length 0.75 0.05 Acute Methods Pimephales promelas (fathead minnow), Cyprinodon variegatus (Sheepshead minnow), Atherinops affinis (topsmelt), Menidia beryllina (inland silverside) acute survival13 0.80 0.10 Ceriodaphnia dubia, Daphnia magna, Daphnia pulex, Americamysis bahia acute survival13 0.80 0.10 Notes: a. (1) declare a sample toxic at least 75 percent of the time (alpha < 0.25) when there is unacceptable toxicity (20 percent effect for acute and 25 percent effect for chronic test methods) and (2) declare an effluent non-toxic no more than 5 percent of the time (beta < 0.05) when the mean effect at the critical effluent concentration is 10 percent for both acute and chronic WET tests (including sublethal endpoints). For more discussion on the RMDs, see Section 2.1 of this document. b. Based on four replicate test design 9 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 3.2 Calculating Statistics for Valid WET Data Using the TST Approach Appendix A includes a step-by-step guide for using the TST approach to analyzing WET test data. The appendix also includes a statistical flowchart and several examples. Note that the WET test method should follow the test condition requirements as specified in EPA's approved WET methods (USEPA 1995, 2002a, 2002b, 2002c). The TST approach is used to statistically compare organism responses from two concentrations (i.e., treatments) of the WET test, the IWC and the control. Percent data (quantal data), such as percent survival or percent germination from a WET test, is first transformed as required in the EPA WET test manuals. Other types of WET data (e.g., growth or reproduction data) are not transformed. Data are then analyzed using Welch's t-test, a well-known modification of the standard t-test (Zar 1996), which is appropriate for the TST approach (see Appendix A). Appendix B lists the critical t values that apply to WET testing using the TST approach given the number of degrees of freedom and the a level that applies for a given WET test method from Table 3 of this document. If the calculated t value for the WET test is greater than the critical t value (see Table B-l), the null hypothesis is rejected, i.e., the test result is Pass and the effluent is declared non-toxic. If the calculated t value is less than the critical t value in Appendix B, the null hypothesis is not rejected, i.e., the test result is Fail and the effluent is declared toxic. Appendix A contains examples that demonstrate the formulae used in the TST approach and are designed to illustrate how the outcome is influenced by within-test variability and the mean effect of the IWC using the TST approach. Four different case examples are presented, three of which have equal variances between control and IWC: (1) Ceriodaphnia reproduction data having relatively high within-test variability, (2) Ceriodaphnia reproduction data having relatively low within-test variability and the same effect as in Example 1, (3) growth data from two fathead minnow chronic WET tests, both with relatively high within-test variability but small mean effect at the IWC; one test was conducted with the minimum number of replicates required in the EPA WET test method (four replicates) and the other test was conducted a priori with six replicates per concentration; and (4) calculations using the TST approach for an acute fathead minnow WET test. Case Example #1 in Appendix A: Demonstrates a benefit of the TST approach by addressing false negatives. A WET test that has relatively high within-test variability for a given WET test method and has an effect at the IWC approaching the RMD threshold (25 percent in this case because it is a chronic WET test) is declared toxic using the TST approach. Using the traditional hypothesis testing approach as recommended in the TSD, such test data typically lead to a conclusion that the effluent is not toxic (i.e., a false negative). Case Example #3 in Appendix A: Demonstrates the benefits of increased within-test replication using the TST approach. Increasing the replication before conducting the test, which thereby improves the precision and power of the WET test, increases the chances of rejecting the null hypothesis and declaring a truly acceptable effluent as non-toxic using the TST approach. That increases the ability to prove the negative, i.e., that an effluent is declared not toxic. 10 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 The TST approach can also be used for ambient toxicity (i.e., receiving water) tests and stormwater toxicity testing programs because the TST approach compares two treatments (for application of the TST approach to ambient toxicity testing, see Appendix C). 11 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 12 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 4.0 IMPLEMENTING THE TST APPROACH IN WET NPDES PERMITS The TST approach is an alternative approach for analyzing and interpreting valid WET data. Use of the TST approach does not result in any changes to EPA's WET test methods. WET limits are simpler to communicate and understand (for example permit language for acute and chronic WET monitoring using the TST statistical analysis approach, see Appendix D) than the TSD approach. EPA recommends that permitting authorities decide up front which approach (the 1991 TSD approach, the TST approach, or another scientifically defensible approach that is sufficient to meet the statutory and regulatory requirements) they will incorporate and consistently use in their state's NPDES implementation procedures, including their RP procedures. The permitting authority should use the selected WET statistical approach consistently in all of their state NPDES permits. 4.1 Reasonable Potential (RP) WET Analysis NPDES permitting authorities conducting an RP analysis must follow Title 40 of the Code of Federal Regulations (CFR) section 122.44(d)(1) to determine whether a discharge will "cause, have the [RP] to cause, or contribute to" an excursion of a numeric criterion or a narrative WET criterion. Some states have state-specific WET RP approaches in their water quality control plan or other NPDES policy or guidance. For RP calculations using the TST approach, EPA recommends that permitting authorities use all valid WET test data generated during the current permit term and any additional valid data that are submitted as part of the permit renewal application. The TST RP approach necessitates having at least a minimum of four valid WET tests to address effluent representativeness (see EPA's TSD, Chapter 3, p. 57, under Step 2 in the section Steps in Whole Effluent Characterization Process). EPA also recommends that states request that their permittees provide the actual test endpoint responses for the control (i.e., control mean) and IWC concentration (i.e., IWC mean) for each WET test conducted to make it easier for permit writers to find the necessary WET test results when determining WET RP. WET test data are then analyzed according to the TST approach using the IWC and control test concentrations for all the valid WET test data available. For data sets with fewer than four valid WET data points, RP should be assessed using EPA's TSD RP approach because it addresses small WET data sets by incorporating an RP multiplying factor (see Section 3.3.2 of the TSD, p. 54) to account for effluent variability in small WET data sets. If WET test data are available and the TST statistical approach indicates that the IWC is toxic in any WET test, RP has been demonstrated (40 CFR 122.44(d)( 1 )(i)). Similar to the TSD approach, the TST approach can establish the existence of RP for WET even when no tests have been declared toxic using the TST to address concerns regarding the "potential to cause or contribute to toxicity." Appendix E presents the approach used to determine RP using the TST approach. Note that using the TST approach might be to the permittee's advantage. If the permittee decides to incorporate additional replicates for the control and the IWC within a WET test, beyond the minimum required in the WET test method, the test power is increased. More test replicates increases test power, which means a higher probability of declaring a sample as non-toxic using the TST approach if the effluent is truly non-toxic. A demonstration is provided in Appendix A (Case Example #3), which illustrates that as an intended consequence of the TST approach 13 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 methodology. Thus, using the TST approach, a permittee has a greater ability to prove the negative (i.e., their effluent does not have RP). In those cases where the WET RP outcome is yes, a WET limit is expressed in the permit. In those situations where the RP outcome is no, WET monitoring requirements should still be incorporated in the permit. Also in the permit, a test result of Fail (i.e., sample declared toxic) during monitoring, would trigger additional steps in the permit. In either of those situations— either a WET limit or a WET monitoring requirement, if toxicity is demonstrated—states should specify an approach to address toxicity in the permit. Doing so often includes increased frequency of WET testing and additional permit requirements to perform a toxicity reduction evaluation. 4.2 NPDES WET Permit Limits Using the TST approach, WET NPDES permit limits would be expressed as no significant toxicity of the effluent at the IWC using the TST analysis approach. A test result of Pass is when the calculated t value is greater than the critical t value. A test result of Fail is when the calculated t value is less than the critical t value. Beyond assessing WET data for the NPDES Program, WET tests are used to assess toxicity of receiving water (watershed assessment for CWA section 303(d) determinations) and stormwater samples. Often as a first assessment of receiving or stormwater toxicity, researchers test a control and a single concentration (e.g., 100 percent receiving water or stormwater). In such cases, the TST approach can be used in the same way a t-test is used. Such analysis is used to determine whether organism response in a specified ambient concentration is significantly different than the control organism response (for further information, see Appendix C). 14 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 5.0 RECOMMENDATIONS FOR NPDES IMPLEMENTATION OF THE TSTAPPROACH 5.1 EPA Regions and NPDES States (Permitting Authorities) Permitting authorities should consider adding the TST approach to their implementation procedures for analyzing valid WET data for their current NPDES WET Program. Permitting authorities should consider the practical programmatic shift from the traditional hypothesis testing approach to the TST approach by opening a dialogue with their regulated community. In addition, they might want to begin to identify what changes might be needed to assimilate the TST approach into any regulations, policy, guidance, and training within their respective NPDES WET Programs. EPA also recommends that permitting authorities decide up front which RP approach (the 1991 TSD approach, the TST approach, or another scientifically defensible approach that is sufficient to meet the statutory and regulatory requirements) the permitting authority will incorporate and consistently use in their state's NPDES implementation procedures. The permitting authority should then use the WET statistical approach (either the TSD approaches or the TST data analysis approach) selected throughout all its state NPDES permits. Again, the traditional hypothesis testing approach recommended in EPA's TSD is still considered valid as applied; however, that approach can now be advanced through the TST approach by providing new incentives to permittees to generate valid, high quality WET data. The RMDs incorporated into the TST approach were selected on the basis of considerable research and analysis involving several of the EPA WET test methods. Lower b values (i.e., for chronic test methods using a 0.70 instead of 0.75 b is unacceptable) are not recommended because it would mean that a lower fraction of test control response (i.e., greater effect at the IWC) is considered acceptable. EPA chose the acute and chronic b values to minimize effects on aquatic ecosystems. Likewise, the alpha values identified by EPA using the TST approach were determined on the basis of the predetermined b values and therefore should not be altered. The permitting authority should consider carefully how the TST approach will be implemented in NPDES permits. Example permit language is shown in Appendix D. In consideration of maintaining NPDES WET Program implementation consistency, the TST approach should be used in place of, and not in addition to, the traditional hypothesis testing (NOEC) approach for WET analysis. 5.2 NPDES Permittees One of the intended benefits of the TST approach is that increasing the precision and power of the WET test increases the chances of declaring a truly acceptable effluent as non-toxic. The permittee has greater control over the interpretation of WET test results using the TST approach because the RMDs are transparent, and the level of WET data quality needed to obtain unequivocal results can be determined beforehand. For example, conducting tests with more test replicates improves the power of the WET test, which can then support and provide a defensible basis for a permittee's demonstration that its effluent is acceptable (i.e., in compliance with the permit) if the mean effect is truly within the RMDs as defined in the TST approach. Using the TST approach, there is a lower rate of WET tests declared toxic for tests that are truly acceptable because of the increased power of the WET test when the permittee increases its number of 15 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 replicates in a WET test or achieves better replication within a test through improved test method performance. Thus, the TST approach increases the ability of the permittee to prove the negative, that the effluent is non-toxic if it is truly acceptable. Where a permittee has concerns about WET data quality, EPA recommends increasing the number of replicates in tests, even if the permitting authority has not yet adopted the TST approach. 16 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 6.0 SUMMARY OF THE TST APPROACH EPA's TSD approaches are valid and can still be used by EPA Regions and their NPDES states. The TST approach is another statistical option for analyzing valid WET test data. The TST approach can be applied to acute (survival) and chronic (sublethal) endpoints and is appropriate to use for both freshwater and marine EPA WET test methods. The TST approach requires no more time or expertise than is presently expended when using the TSD hypothesis testing statistical approach and can be used with a well-recognized statistical test. Below is a brief outline of both the TST and TSD hypothesis testing approaches relevant to the information in this document and a short list of the benefits derived when using the TST approach. TST Approach • Considered additional guidance only—TST is a statistical approach for analyzing WET test data as an alternative option to the traditional hypothesis testing approach provided in EPA's TSD • Expresses NPDES WET permit limit "as no significant toxicity of the effluent at the in- stream waste concentration" using the TST analysis approach • Provides a positive incentive to NPDES permittees to generate valid, high quality WET data to the permitting authority by improving test performance or increasing the number of replicates within a WET test (which increases statistical power of WET test) • Addresses both false negative (declared non-toxic when actually toxic) and false positive (declared toxic when actually non-toxic) error rates in a WET test Traditional Hypothesis Test (EPA TSD) • Existing approaches remain valid and can still be used by NPDES permitting authorities • In existing guidance, WET permit limits are expressed as no observed effect concentration (NOEC) at the IWC • Provides relatively less incentive to permittees to generate high quality valid, WET data or to increase the number of replicates within a WET test to increase statistical power of a WET test • False negative error rate in a WET test is not addressed Benefits When Using the TST Approach in WET Data Analysis • The TST approach is similar to statistical concepts used in other EPA programs and at other federal agencies • Transparent RMDs. RMDs are transparent because they are incorporated into the WET data analysis process, e.g., what effect level is considered toxic and what effect level is considered acceptable. • WET test method-specific alpha and beta error rates. Both error rates are directly incorporated into the TST statistical approach, thereby increasing confidence in WET test interpretation. • High quality WET test data incentive. Provides a positive incentive for the permittee to generate valid, high quality WET data; better test performance (lower within-test 17 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 variability) helps ensure appropriate WET decisions using the TST approach (e.g., a truly acceptable effluent will be declared non-toxic). • Streamlined, simpler statistical analysis. Flowchart for analyzing valid WET data under the TST approach is much simpler because fewer statistical tests are needed. • RP analysis is simpler. Because the calculation of the individual test result, using the TST statistical approach, incorporates both error rates in the analysis, the RP determinations can rely on a direct calculation of the percent effect at the IWC. Thus, the RP procedures are much simpler to use than the RP statistical procedures recommended in the TSD. 18 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 7.0 LITERATURE CITED Anderson, S., and W. Hauck. 1983. A new procedure for testing equivalence in comparative bioavailability and other clinical trials. Communications in Statistics-Theory and Methods 12:2663-2692. Aras, G. 2001. Superiority, non-inferiority, equivalence, and bioequivalence-revisited. Drug Information Journal 35:1157-1164. Berger, R., and J. Hsu. 1996. Bioequivalence trials, intersection-union tests and equivalence confidence sets. Statistical Science 11:283-319. Erickson, W., and L. McDonald. 1995. Tests for non-inferiority of control media and test media in studies of toxicity. Environmental Toxicology and Chemistry 14:1247-1256. Hatch, J. 1996. Using statistical equivalence testing in clinical biofeedback research. Biofeedback and Self-Regulation 21:105-119. Shukla, R., Q. Wang, F. Fulk, C. Deng, and D. Denton. 2000. Bioequivalence approach for whole effluent toxicity testing. Environmental Toxicology and Chemistry 19:169-174. Streiner, D. 2003. Unicorns do exist: A tutorial on "proving" the null hypothesis. Canadian Journal of Psychiatry 48(11):756—761. Stunkard, C. 1990. Tests of proportional means for mesocosms studies. Technical Report. University of Maryland, Department of Measurement, Statistics, and Evaluation, College Park, MD. USEPA (U.S. Environmental Protection Agency). 1989. Methods for Evaluating the Attainment of Cleanup Standards, Vol. I - Soils and Solid Media. EPA 230/02-89-042. U.S. Environmental Protection Agency, Office of Policy, Planning and Evaluation, Washington, DC. USEPA (U.S. Environmental Protection Agency). 1991. Technical Support Document for Water Quality-based Toxics Control. EPA/505/2-90-001. U.S. Environmental Protection Agency, Office of Water, Washington, DC. USEPA (U.S. Environmental Protection Agency). 1995. Short-Term Methods for Estimating the Chronic Toxicity of Effluents and Receiving Waters to West Coast Marine and Estuarine Organisms. EPA/600/R-95-136. U.S. Environmental Protection Agency, National Exposure Research Laboratory, Cincinnati, OH, and Office of Research and Development, Washington, DC. USEPA (U.S. Environmental Protection Agency). 2000. Understanding and Accounting for Method Variability in Whole Effluent Toxicity Applications Under the NPDES Program. EPA/83 3-R-00-003. U.S. Environmental Protection Agency, Office of Water, Washington, DC. 19 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 USEPA (U.S. Environmental Protection Agency). 2002a. Short-Term Methods for Estimating the Chronic Toxicity of Effluents and Receiving Waters to Freshwater Organisms. 4th edition. EP A/821/R-02-013. U.S. Environmental Protection Agency, Office of Water, Washington, DC. USEPA (U.S. Environmental Protection Agency). 2002b. Short-Term Methods for Estimating the Chronic Toxicity of Effluents and Receiving Waters to Marine and Estuarine Organisms. 3rd ed. EPA/82l/R-02-14. U.S. Environmental Protection Agency, Office of Water, Washington, DC. USEPA (U.S. Environmental Protection Agency). 2002c. Methods for Measuring the Acute Toxicity of Effluents and Receiving Waters to Freshwater and Marine Organisms. 5th ed. EPA/82 l/R-02-012. U.S. Environmental Protection Agency, Office of Water, Washington, DC. Zar, J. 1996. BiostatisticalAnalysis 3rd ed. Prentice Hall Publishers, New Jersey. 20 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 APPENDIX A STEP-BY-STEP PROCEDURES FOR ANALYZING VALID WHOLE EFFLUENT TOXICITY DATA USING THE TEST OF SIGNIFICANT TOXICITY APPROACH A-1 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 A-2 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 APPENDIX A: STEP-BY-STEP PROCEDURES FOR ANALYZING VALID WET DATA USING THE TST APPROACH The following is a step-by-step guide for using the TST approach to analyze valid WET data for the NPDES Program. This guide is applicable for a two-concentration valid WET data analysis of an in-stream waste concentration (IWC) or a receiving water concentration (RWC) compared to a control concentration. For further information regarding conducting WET tests and proper quality assurance/quality control needed, see the EPA WET test method manuals. Refer to the flowchart shown in Figure A-l in this appendix as you proceed through this guide. Step 1: Conduct WET test following procedures in the appropriate EPA WET test method manual. That includes following all test requirements specified in the method (USEPA 1995 for chronic west coast marine methods, USEPA 2002a for chronic freshwater test methods, USEPA 2002b for chronic east coast marine test methods, and USEPA 2002c for acute freshwater and marine WET test methods). Step 2: For each test endpoint specified in the WET test method manual (e.g., survival and reproduction for the Ceriodaphnia chronic WET test method), follow Steps 3-7 below. Note that the guide refers to an effluent concentration tested, which is assumed to be the IWC as specified in the permit or a receiving water concentration for ambient testing. For example, if no mixing zone is allocated, the IWC is 100 percent effluent. Note: If there is no variance (i.e., zero variance) in the endpoint in both concentrations being compared (i.e., all replicates in each concentration have the same exact response), then skip the remaining steps in the flowchart and do the following. Compute the percent difference between the control and the other concentration (e.g., IWC) and compare the percent difference against the RMD values of 25% for chronic and 20% for acute endpoints. Percent mean effect is calculated as: , ti "t j/~~\ Mean Control Response - Mean Response at IWC % Effect at IWC = x 100 Mean Control Response If the percent mean response is > the RMD, the sample is declared toxic and the test is "Fair. If the percent mean response is < the RMD, the sample is declared non-toxic and the test is "Pass". Step 3: For data consisting of proportions from a binomial (response/no response; live/dead) response variable, the variance within the z'th treatment is proportional to P, (1 - P,), where P, is the expected proportion for the treatment. That clearly violates the homogeneity of variance assumption required by parametric procedures such as the TST procedure because the existence of a treatment effect implies different values of P, for different treatments, i. Also, when the observed proportions are based on small samples, or when P, is close to zero or one, the normality assumption might be invalid. The arcsine square root (arcsine VF) transformation is used for such data to stabilize the variance and satisfy the normality requirement. The square root of percent data (e.g., percent survival, percent fertilization), expressed as a decimal fraction (where 1.00 = 100 percent) for each treatment, is first calculated. The square root value is then A-3 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 arcsine transformed before analysis in Step 4. Note: Excel and most statistical software packages can calculate arcsine values. Step 4: Conduct Welch's t-test (Zar 1996) using Equation 1: • , Yt-bxY Equation 1 t= - 5 St2 b2Sc2 — + — nt nc where Y,c = Mean for the control ^ = Mean for the IWC sc2 = Estimate of the variance for the control st2 = Estimate of the variance for the IWC nc = Number of replicates for the control nt = Number of replicates for the IWC b = 0.75 for chronic test methods; 0.80 for acute test methods Note on the use of Welch's t-test: Welch's t-test is appropriate to use when there are an unequal number of replicates between control and the IWC. When sample sizes of the control and treatment are the same (i.e., nt = nc). Welch's t-test is equivalent to the usual Student's t-test (Zar 1996). Step 5: Adjust the degrees of freedom (df) using Equation 2: Equation 2 o= r 2 i2 n 2 ft | )2 nt nc r 2 i 2 n 2 ( nt , nc nt~l nc-1 For tests using Welch's t-test, df is the value obtained for v in Equation 2 above. Because v is most likely a non-integer, round v to the next smallest integer, and that number is the df. Step 6: Using the calculated t value from Step 4, compare that t value with the critical t value table in Appendix B using the test method-specific alpha values shown in Table A-l. To obtain the correct critical t value, look across the table for the alpha value that corresponds to the WET test method (for the alpha value, see Appendix A, Table A-l) and then look down the table for the appropriate df. Step 7: If the calculated t value is less than the critical t value, the IWC is declared toxic and the test result is Fail. If the calculated t value is greater than the critical t value, the IWC is not declared toxic and the test result is Pass. A-4 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 Conduct WET test Apply arcsine square root transformation for percent data (e.g., survival); do not transform other types of WET data (e.g., growth or reproduction) Calculate t value using TST Welch's t-test Calculated t value > critical t value? i 1 "Pass" "Fail" IWC is NOT Toxic IWC IS Toxic Figure A-1. Statistical flowchart for analyzing valid WET data using the TST approach for control and the IWC, receiving water, orstormwater. A-5 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 Table A-1. Summary of alpha (a) levels or false negative rates recommended for different WET test methods using the TST approach EPA WET test method b value Probability of declaring a toxic effluent non-toxic False negative (a) error3 Chronic Freshwater and East Coast Methods Ceriodaphnia dubia (water flea) survival and reproduction 0.75 0.20 Pimephales promelas (fathead minnow) survival and growth 0.75 0.25 Selenastrum capricornutum (green algae) growth 0.75 0.25 Americamysis bahia (mysid shrimp) survival and growth 0.75 0.15 Arbacia punctulata (Echinoderm) fertilization 0.75 0.05 Cyprinodon variegatus (Sheepshead minnow) and Menidia beryllina (inland silverside) survival and growth 0.75 0.25 Chronic West Coast Marine Methods Dendraster excentricus and Strongylocentrotus purpuratus (Echinoderm) fertilization 0.75 0.05 Atherinops affinis (topsmelt) survival and growth 0.75 0.25 Haliotis rufescens (red abalone), Crassostrea gigas (oyster), Dendraster excentricus, Strongylocentrotus purpuratus (Echinoderm) and Mytilus sp (mussel) larval development methods 0.75 0.05 Macrocystis pyrifera (giant kelp) germination and germ-tube length 0.75 0.05 Acute Methods Pimephales promelas (fathead minnow), Cyprinodon variegatus (Sheepshead minnow), Atherinops affinis (topsmelt), Menidia beryllina (inland silverside) acute survival13 0.80 0.10 Ceriodaphnia dubia, Daphnia magna, Daphnia pulex, Americamysis bahia acute survival13 0.80 0.10 Notes: a. (1) declare a sample toxic at least 75 percent of the time (alpha < 0.25) when there is unacceptable toxicity (20 percent effect for acute and 25 percent effect for chronic test methods) and (2) declare an effluent non-toxic no more than 5 percent of the time (beta < 0.05) when the mean effect at the critical effluent concentration is 10 percent for both acute and chronic WET tests (including sublethal endpoints). For more discussion on the RMDs, see Section 2.1 of this document. b. Based on four replicate test design A-6 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 Case Example 1: Chronic Ceriodaphnia Reproduction Test with High Within-Test Variability Step 1: Conduct WET test Replicate/statistic Control Treatment 1 27 32 2 38 28 3 27 25 4 34 28 5 37 20 6 35 15 7 30 27 8 31 31 9 36 31 10 39 30 Mean 33.4 26.7 Std. deviation 4.402 5.417 N (# of replicates) 10 10 Step 2: Follow Steps 3-7 for each endpoint required in the test method The following example is for chronic Ceriodaphnia dubia reproduction endpoint only. Step 3: Transform data using an arcsine square root transformation, if necessary Not necessary because reproduction is not percent data. Step 4: Conduct Welch's t-test Yt -b*Yc _ 26.7-(0.75x33.4) t = ¦ \S; ti, b2s; = 0.82 [29.34 (0.75)2 (19.38) 10 Step 5: Adjust the df v = f s; \n* b2S 10 29.34 (0.75)2(19.38) 10 10 (st2^ 2 f b2Sc2^ ( 29.34V U j -I- { nc J I 10 J I = 15 (0.75)2 (19.38) 10 nt -1 tb ~ 1 10-1 10-1 Step 6: Calculated t value > critical t value? 15 df and test method alpha = 0.20 (Table A-l) Critical lvalue = 0.87 0.82 < 0.87 Step 7: Declare effluent toxic or not Calculated t < critical t value. Therefore, effluent is declared toxic; test result is FAIL. A-7 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 Case Example 2: Chronic Ceriodaphnia Reproduction Test with Low Within-Test Variability Step 1: Conduct WET test Replicate/statistic Control Treatment 1 29 31 2 38 28 3 31 25 4 34 28 5 36 22 6 35 21 7 30 27 8 31 26 9 36 29 10 34 30 Mean 33.4 26.7 Std. deviation 2.989 3.268 N (# of replicates) 10 10 Step 2: Follow Steps 3-7 for each endpoint required in the test method The following example is for chronic Ceriodaphnia dubia reproduction endpoint only. Step 3: Transform data using an arcsine square root transformation, if necessary Not necessary because reproduction is not percent data. Step 4: Conduct Welch's t-test t = ¦ Yt -bxYc lSt2 b2S; nf tL 26.7-(0.75x33.4) 110.68 (0.75)2 (8.93) = 1.32 10 10 Step 5: Adjust the df f ct2 b2S2^ »t ac 10.68 (0.75r(8.93) 10 10 M 2 'b2Sc2> 2 f 10.68 V2 (0.75)2 (8.93) nt V — + - "c V I 10 J , 10 \ y nt-1 nP - 1 10-1 10-1 = 16 Step 6: Calculated t value > critical t value? 16 df and test method alpha = 0.20 (Table A-l) Critical lvalue = 0.86 1.32 > 0.86 Step 7: Declare effluent toxic or not Calculated t > critical t value. Therefore, effluent is declared Non-Toxic; test result is PASS A-8 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 Case Example 3: Benefit of Increased Replication in Chronic Fish Growth Test with Low Mean Effect and High Within-Test Variability Step 1: Conduct WET test Replicate/statistic Control Treatment 1 0.366 0.303 2 0.399 0.379 3 0.354 0.311 4 0.422 0.236 Mean 0.385 0.307 Std. deviation 0.031 0.058 N (# of replicates) 4 4 Step 2: Follow Steps 3-7 for each endpoint required in the test method Step 3: Transform data using an arcsine square root transformation, if necessary Not necessary because growth is not percent data. Step 4: Conduct Welch's t-test z-bxT t = S,2 b%2 0.307 -(0.75x0.385) 0.00342 | (0.75)2 (0.00096) ^ = 0.58 Step 5: Adjust the df 2? 2 V , rs, nt nc nt-1 nc-l 0.00342 | (0.75) (0.00096) - = 4 ' 0.00342 j 4-1 f 2 (0.75) (0.00096) 4-1 Step 1: Conduct WET test Replicate/statistic Control Treatment 1 0.366 0.303 2 0.399 0.379 3 0.354 0.311 4 0.422 0.236 5 0.343 0.364 6 0.407 0.247 Mean 0.382 0.307 Std. deviation 0.032 0.058 N (# of replicates) 6 6 Step 2: Follow Steps 3-7 for each endpoint required in the test method Step 3: Transform data using an arcsine square root transformation, if necessary Not necessary because growth is not percent data. Step 4: Conduct Welch's t-test f.-bxY t = s; b2s: M 2 'b2S/ 2 5 v. nc \ V 0.307-(0.75x0.382) 0.00342 (0.75)2(0.00101) 6 6 Step 5: Adjust the df r St2 | b2sc2^2 , = 0.79 "I n,. r ?\ st2 'b2S2<2 nt-1 nc -I 0.00342 (0.75) (0.00101) - + - = 7 Step 6: Calculated t value > critical t value? 4 df, alpha = 0.25 (Table A-l); Critical /'value = 0.74 0.58 < 0.74 Step 7: Effluent is declared toxic, test result is FAIL. 000342V 6 J 6-1 (0.75) (0.00101) + -^ 6-1 Step 6: Calculated t value > critical t value? 7 df, alpha = 0.25 (Table A-l); Critical /'value = 0.71 0.79 > 0.71 Step 7: Effluent is declared Non-Toxic; test result is PASS. A-9 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 Case Example 4: Fish Acute Toxicity Test Example Step 1: Conduct WET test Replicate/statistic Control Treatment 1 10 10 2 10 8 3 10 9 4 10 8 Mean 10 8.75 Std. deviation 0.000 0.958 N (# of replicates) 4 4 Step 2: Follow Steps 3-7 for each endpoint required in the test method The following example is for acute Pimephalespromelas survival endpoint only. Step 3: Transform data using an arcsine square root transformation Replicate/statistic Control Treatment 1 1.571 1.571 2 1.571 1.107 3 1.571 1.249 4 1.571 1.107 Mean 1.571 1.259 Variance 0.000 0.219 N (# of replicates) 4 4 Step 4: Conduct Welch's t-test t = ¦ Yt-bxYc IS,2 b2S; - + ^ 1.259-(0.80x1.571) 10.048 | (0.80)2 (0.000) = 0.02 nf tL Step 5: Adjust the df 2 rr 2 l)S nt V £ nt nt-1 f 1 1 ^ b2Sc2 nc"I f 0.048 (0.80)2 (O.OOO)^2 0.048 4-1 {(0.80)2 (O.OOO)^2 4-1 = 3 Step 6: Calculated t value > critical t value? 3 df, alpha = 0.10 (Table A-l) Critical lvalue = 1.64 0.02 < 1.64 Step 7: Declare effluent toxic or not Therefore, effluent is declared toxic; test result is FAIL. A-10 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 Literature Cited USEPA (U.S. Environmental Protection Agency). 1995. Short-Term Methods for Estimating the Chronic Toxicity of Effluents and Receiving Waters to West Coast Marine and Estuarine Organisms. EPA/600/R-95-136. U.S. Environmental Protection Agency, National Exposure Research Laboratory, Cincinnati, OH, and Office of Research and Development, Washington, DC. USEPA (U.S. Environmental Protection Agency). 2002a. Short-Term Methods for Estimating the Chronic Toxicity of Effluents and Receiving Waters to Freshwater Organisms. 4th edition. EP A/821/R-02-013. U.S. Environmental Protection Agency, Office of Water, Washington, DC. USEPA (U.S. Environmental Protection Agency). 2002b. Short-Term Methods for Estimating the Chronic Toxicity of Effluents and Receiving Waters to Marine and Estuarine Organisms. 3rd ed. EPA/82l/R-02-14. U.S. Environmental Protection Agency, Office of Science and Technology, Washington, DC. USEPA (U.S. Environmental Protection Agency). 2002c. Methods for Measuring the Acute Toxicity of Effluents and Receiving Waters to Freshwater and Marine Organisms. 5th ed. EPA/82 l/R-02-012. United States Environmental Protection Agency, Office of Water, Washington, DC. Zar, J. 1996. BiostatisticalAnalysis. 3rd ed. Prentice Hall Publishers, New Jersey. A-11 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 A-12 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 APPENDIX B CRITICAL t VALUES FOR THE TEST OF SIGNIFICANT TOXICITY APPROACH B-1 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 B-2 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 Table B-1. Critical values of the t distribution. One tail probability is assumed. Degrees of freedom Alpha 0.25 0.20 0.15 0.10 0.05 1 1 1.3764 1.9626 3.0777 6.3138 2 0.8165 1.0607 1.3862 1.8856 2.92 3 0.7649 0.9785 1.2498 1.6377 2.3534 4 0.7407 0.941 1.1896 1.5332 2.1318 5 0.7267 0.9195 1.1558 1.4759 2.015 6 0.7176 0.9057 1.1342 1.4398 1.9432 7 0.7111 0.896 1.1192 1.4149 1.8946 8 0.7064 0.8889 1.1081 1.3968 1.8595 9 0.7027 0.8834 1.0997 1.383 1.8331 10 0.6998 0.8791 1.0931 1.3722 1.8125 11 0.6974 0.8755 1.0877 1.3634 1.7959 12 0.6955 0.8726 1.0832 1.3562 1.7823 13 0.6938 0.8702 1.0795 1.3502 1.7709 14 0.6924 0.8681 1.0763 1.345 1.7613 15 0.6912 0.8662 1.0735 1.3406 1.7531 16 0.6901 0.8647 1.0711 1.3368 1.7459 17 0.6892 0.8633 1.069 1.3334 1.7396 18 0.6884 0.862 1.0672 1.3304 1.7341 19 0.6876 0.861 1.0655 1.3277 1.7291 20 0.687 0.86 1.064 1.3253 1.7247 21 0.6864 0.8591 1.0627 1.3232 1.7207 22 0.6858 0.8583 1.0614 1.3212 1.7171 23 0.6853 0.8575 1.0603 1.3195 1.7139 24 0.6849 0.8569 1.0593 1.3178 1.7109 25 0.6844 0.8562 1.0584 1.3163 1.7081 26 0.684 0.8557 1.0575 1.315 1.7056 27 0.6837 0.8551 1.0567 1.3137 1.7033 28 0.6834 0.8546 1.056 1.3125 1.7011 29 0.683 0.8542 1.0553 1.3114 1.6991 30 0.6828 0.8538 1.0547 1.3104 1.6973 inf 0.6745 0.8416 1.0364 1.2816 1.6449 B-3 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 B-4 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 APPENDIX C APPLICATION OF THE TEST OF SIGNIFICANT TOXICITY APPROACH TO AMBIENT TOXICITY PROGRAMS C-1 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 C-2 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 APPENDIX C: APPLICATION OF THE TST APPROACH TO AMBIENT TOXICITY PROGRAMS In ambient and stormwater toxicity testing, a laboratory control and a single concentration (i.e., 100 percent ambient water or stormwater) are often tested. In these two-concentration WET tests, the objective is to determine if a given sample or site water is toxic, as indicated by a significantly different organism response compared to the control. In the WET testing design, the determination of Pass or Fail (i.e., non-toxic or toxic) is ascertained using a traditional t-test (USEPA 2002c). EPA test methods recommend (USEPA 1995, 2002a, 2002b, 2002c) that the statistical significance (i.e., Pass/Fail) of a two-sample test design for ambient and stormwater toxicity testing be determined only using either a modified t-test (if homogeneity of variance is not achieved) or a traditional t-test (if homogeneity of variance is achieved). To demonstrate the value of the TST approach in ambient toxicity programs, ambient toxicity test data from California's Surface Water Ambient Monitoring Program (SWAMP) was used for 409 chronic tests for Ceriodaphnia dubia and 256 chronic tests for Pimephalespromelas using EPA's 2002 WET test methods (USEPA 2002a). Valid WET data for each EPA WET test method were subjected to the same statistical analyses as described in Section 2 of this document. Chronic Ceriodaphnia dubia Ambient Toxicity Tests Table C-l summarizes results of the 409 Ceriodaphnia dubia ambient toxicity tests analyzed and an a = 0.20 for this test method. Although the majority of the tests examined resulted in the same decision using either the TST or the traditional t-test approach, approximately 6 percent of the tests (24 tests) would have been declared non-toxic using the traditional t-test approach with mean effect levels > 25 percent. In addition, 2 percent of the tests (7 tests) would have been declared toxic using the traditional t-test approach at mean effect levels <15 percent and as low as 7 percent. Table C-1. Comparison of results of chronic Ceriodaphnia ambient toxicity tests using the TST approach and the traditional t-test analysis, a = 0.20 and b value = 0.75 for the TST approach, a = 0.05 for the traditional hypothesis testing approach Both approaches declare toxic Only TST declares toxic Only traditional approach declares toxic Both approaches declare non-toxic 19.8% 5.9% 1.7% 72.6% Figure C-l shows ranges of CV values observed in Ceriodaphnia dubia ambient toxicity tests for those samples declared toxic using either the TST approach or the traditional t-test, but not both approaches. As expected, within-test variability was relatively high (higher CVs) for those tests found non-toxic using a t-test but toxic using the TST approach. The results demonstrate the lack of control of false negative rates using the traditional hypothesis testing approach when control variability is relatively high. Under those conditions, the traditional t-test did not have the power to detect toxicity when it was present. Figure C-l also demonstrates that the TST approach recognizes a negligible effect as non-toxic when within-test variability is relatively low and the C-3 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 mean percent effect is well below the risk management level of 25 percent. Under such conditions, the traditional t-test declared some samples toxic using this WET test method, even when the mean effect was as little as 7 percent. The TST approach, however, declared all such samples non-toxic using the recommended a = 0.20. Thus, the TST approach reduces the number of tests declared as toxic when effects are actually well below the risk management decision. Chronic Ceriodaphnia ambient WET tests that are identified as non-toxic (Pass) using the traditional hypothesis approach (NOEC) generally have high within-test variability (high control CVs) as compared to using the TST approach. 0.8 -r 0.7 - 0.6 - o 0.5 - £ 0.4 - > ° 0.3 - 0.2 - 0.1 - 0 - Figure C-1. Range of CV values observed in chronic C. dubia ambient toxicity tests for samples that were found to be non-toxic using the standard t-test but toxic using the TST approach (NOEC Pass) and for those samples declared toxic using t-test but not the TST approach (TST Pass). California's SWAMP WET test data. Similar to the Ceriodaphnia ambient test data, within-test variability was higher in those chronic fathead minnow ambient tests found non-toxic using a t-test but toxic using the TST approach (Figure C-2). Similarly, those tests declared non-toxic by the TST approach but toxic using t-test had lower within-test variability and mean effect levels < 25 percent (Figure C-2). Thus, similar to the chronic Ceriodaphnia ambient tests, data from chronic fathead minnow ambient tests demonstrate that the TST approach can provide as much protection as the traditional t-test approach while also identifying those samples that are truly acceptable from a regulatory management decision. A ~ ~ 25th %ile ¦ MIN A Median -MAX ¦ 75th %ile IOEC Pass TST Pass C-4 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 Fish ambient WET tests that are identified as non-toxic using the traditional hypothesis approach (NOEC) generally have high within-test variability (high control CVs) as compared to using the TST approach. 0.4 0.3 (/> d> 3 « 0.2 > O 0.1 0 ~ 25th %ile ¦ MIN a Median x MAX x 75th %ile T NOEC Pass TST Pass Figure C-2. Range of CV values observed in chronic P. promelas ambient toxicity tests for samples that were declared to be non-toxic using the standard t-test but toxic using the TST approach (NOEC Pass) and for those samples declared toxic using t-test but not the TST approach (TST Pass). California's SWAMP WET test data. Literature Cited USEPA (U.S. Environmental Protection Agency). 1995. Short-Term Methods for Estimating the Chronic Toxicity of Effluents and Receiving Waters to West Coast Marine and Estuarine Organisms. EPA/600/R-95-136. U.S. Environmental Protection Agency, National Exposure Research Laboratory, Cincinnati, OH, and Office of Research and Development, Washington, DC. USEPA (U.S. Environmental Protection Agency). 2002a. Short-Term Methods for Estimating the Chronic Toxicity of Effluents and Receiving Waters to Freshwater Organisms. 4th edition. EP A/821/R-02-013. U.S. Environmental Protection Agency, Office of Water, Washington, DC. USEPA (U.S. Environmental Protection Agency). 2002b. Short-Term Methods for Estimating the Chronic Toxicity of Effluents and Receiving Waters to Marine and Estuarine Organisms. 3rd ed. EPA/82l/R-02-14. U.S. Environmental Protection Agency, Office of Science and Technology, Washington, DC. C-5 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 USEPA (U.S. Environmental Protection Agency). 2002c. Methods for Measuring the Acute Toxicity of Effluents and Receiving Waters to Freshwater and Marine Organisms. 5th ed. EPA/82 l/R-02-012. United States Environmental Protection Agency, Office of Water, Washington, DC. C-6 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 APPENDIX D EXAMPLE NATIONAL POLLUTANT DISCHARGE ELIMINATION SYSTEM PERMIT LANGUAGE USING THE TEST OF SIGNIFICANT TOXICITY APPROACH D-1 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 D-2 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 APPENDIX D: EXAMPLE NPDES PERMIT LANGUAGE USING THE TST APPROACH ACUTE WHOLE EFFLUENT TOXICITY (WET) NPDES PERMIT LANGUAGE xx. Acute Whole Effluent Toxicity (WET) Requirements 1. Monitoring Frequency The permittee must conduct monthly/quarterly/semiannual acute toxicity tests on 24- hour composite effluent samples. Once each calendar year, at a different time of year from the previous years, the permittee must split a 24-hour composite effluent sample and concurrently conduct two toxicity tests using a fish and an invertebrate species; the permittee must then continue to conduct routine monthly/quarterly/semiannual toxicity testing using the single, most sensitive species. Acute toxicity test samples must be collected for each point of discharge at the designated NPDES sampling station for the effluent (i.e., downstream from the last treatment process and any in-plant return flows where a representative effluent sample can be obtained). During years 1, 2, 3, 4, and 5 of the permit, a split of each sample must be analyzed for all other monitored parameters at the minimum frequency of analysis specified by the effluent monitoring program. 2. Freshwater Species and WET Test Methods Species and short-term WET test methods for estimating the acute toxicity of NPDES effluents are in the fifth edition of Methods for Measuring the Acute Toxicity of Effluents and Receiving Waters to Freshwater and Marine Organisms (EPA/82 l/R-02/012, 2002; Table IA, 40 CFR Part 136). The permittee must conduct 96-hour static renewal toxicity tests with the following vertebrate and invertebrate species, respectively: • Vertebrate: The fathead minnow, Pimephalespromelas (Acute Toxicity Test Method 2000.0) • Invertebrate: The daphnid, Ceriodaphnia dubia (Acute Toxicity Test Method 2002.0) 3. Acute WET Permit Triggers a. There are no acute toxicity effluent limits for this discharge. For this permit, the determination of Pass or Fail from a multiple-effluent concentration acute toxicity test at the IWC is determined using the Test of Significant Toxicity (TST) approach that is described in the National Pollutant Discharge Elimination System Test of Significant Toxicity Implementation Document (EPA/833/R-10-003). The acute WET permit trigger is any one WET test where a test result is Fail (during the monthly reporting period) at the acute in-stream waste concentration (IWC). For this discharge, the IWC is XXX percent (e.g., either is 100 percent or an effluent at the mixing zone to be determined at the time of permit issuance) effluent. To calculate D-3 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 either a Pass or Fail of a multiple-effluent concentration acute toxicity test at the IWC, follow the instructions in Appendix A in the National Pollutant Discharge Elimination System Test of Significant Toxicity Implementation Document. A Pass result indicates no toxicity of the multiple-effluent concentration test at the IWC, and a Fail result indicates toxicity of the multiple-effluent concentration test at the IWC. The permittee must report either a Pass or a Fail on the Discharge Monitoring Report (DMR) form. If a result is reported as Fail, the permittee must follow Section 6 (Accelerated Toxicity Testing and TRE/TIE Process) of this permit. -OR- 3. Acute WET Permit Limit b. There is an acute toxicity effluent limit for this discharge. For this permit, the determination of Pass or Fail from a multiple-effluent concentration acute toxicity test at the IWC is determined using the Test of Significant Toxicity (TST) approach which is described in the National Pollutant Discharge Elimination System Test of Significant Toxicity Implementation Document (EPA/833-R-10-003). The acute WET permit trigger is any one WET test where a test result is Fail (during the monthly reporting period) at the chronic in-stream waste concentration (IWC). For this discharge, the IWC is XXX percent (e.g., either is 100 percent or an effluent at the mixing zone to be determined at time of permit issuance) effluent. To calculate either a Pass or Fail of the multiple-effluent concentration acute toxicity test at the IWC, follow the instructions in Appendix A in the National Pollutant Discharge Elimination System Test of Significant Toxicity Implementation Document. A Pass result indicates no toxicity of the multiple-effluent concentration at the IWC and a Fail result indicates toxicity of the multiple-effluent concentration test at the IWC. The permittee must report either a Pass or a Fail on the DMR form. If a result is reported as Fail, the permittee must follow Section 6 (Accelerated Toxicity Testing and TRE/TIE Process) of this permit. 4. Quality Assurance - EPA WET Test Methods a. Quality assurance measures, instructions, and other recommendations and requirements are in the EPA 2002 WET test methods manual previously referenced. b. This permit is subject to a determination of Pass or Fail from a multiple-effluent concentration acute toxicity test at the IWC (for statistical flowchart and procedures, see Appendix A, Figure A-l of the National Pollutant Discharge Elimination System Test of Significant Toxicity Implementation Document). The acute in-stream waste concentration (IWC) for this discharge is XXX percent effluent. c. Effluent dilution water and control water should be prepared and used as specified in the EPA WET test methods manual Methods for Measuring the Acute Toxicity of Effluents and Receiving Waters to Freshwater and Marine Organisms (EPA/821/R- 02/012, 2002). D-4 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 d. If organisms are not cultured in-house, concurrent testing with a reference toxicant must be conducted. If organisms are cultured in-house, monthly reference toxicant testing is sufficient. Reference toxicant tests and effluent toxicity tests must be conducted using the same test conditions (e.g., same test duration). e. If either the reference toxicant or effluent toxicity tests do not meet all test acceptability criteria in the EPA WET test methods manual, the permittee must resample and retest within 14 days. f. If the discharged effluent is chlorinated, chlorine must not be removed from the effluent sample before toxicity testing without written approval by the permitting authority. 5. Initial Investigation TRE Work Plan Within 90 days of the permit effective date, the permittee must prepare and submit to the permitting authority a copy of its Initial Investigation Toxicity Reduction Evaluation (TRE) Work Plan (1-2 pages) for review. That plan must include steps the permittee intends to follow if toxicity is measured above an acute WET permit limit or trigger and should include the following, at minimum: a. A description of the investigation and evaluation techniques that would be used to identify potential causes and sources of toxicity, effluent variability, and treatment system efficiency. b. A description of methods for maximizing in-house treatment system efficiency, good housekeeping practices, and a list of all chemicals used in operations at the facility. c. If a Toxicity Identification Evaluation (TIE) is necessary, an indication of who would conduct the TIEs (i.e., an in-house expert or outside contractor). 6. Accelerated Toxicity Testing and TRE/TIE Process a. If an acute WET permit limit or trigger is exceeded and the source of toxicity is known (e.g., a temporary plant upset), the permittee must conduct one additional toxicity test using the same species and EPA WET test method. This WET test must begin within 14 days of receipt of WET test results exceeding an acute WET permit limit or trigger. If the additional toxicity test does not exceed an acute WET permit limit or trigger, the permittee may return to the regular testing frequency. b. If an acute WET permit limit or trigger is exceeded and the source of toxicity is not known, the permittee must conduct six additional toxicity tests using the same species and EPA WET test method, approximately every two weeks, over a 12-week period. This testing must begin within 14 days of receipt of WET test results exceeding an acute WET permit limit or trigger. If none of the additional toxicity tests exceed an acute WET permit limit or trigger, the permittee may return to the regular testing frequency. D-5 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 c. If one of the additional toxicity tests (in paragraphs 6. a or 6.b) exceeds an acute WET permit limit or trigger, within 14 days of receipt of this WET test result, the permittee must initiate a TRE using, according to the type of treatment facility, EPA WET TRE manual, Toxicity Reduction Evaluation Guidance for Municipal Wastewater Treatment Plants (EPA/833/B-99/002, 1999) or EPA WET TRE manual, Generalized Methodology for Conducting Industrial Toxicity Reduction Evaluations (EPA/600/2- 88/070, 1989). In conjunction, the permittee must develop and implement a Detailed TRE Work Plan that must consist of the following: further actions undertaken by the permittee to investigate, identify, and correct the causes of toxicity; actions the permittee will take to mitigate the effects of the discharge and prevent the recurrence of toxicity; and a schedule for such actions. d. The permittee may initiate a TIE as part of a TRE to identify the causes of toxicity using the same species and EPA WET test method and, as guidance, EPA WET TIE/TRE method manuals: Methods for Aquatic Toxicity Identification Evaluations: Phase I Toxicity Characterization Procedures (EPA/600/6-91/003, 1991); Methods for Aquatic Toxicity Identification Evaluations, Phase II Toxicity Identification Procedures for Samples Exhibiting Acute and Chronic Toxicity (EPA/600/R-92/080, 1993); Methods for Aquatic Toxicity Identification Evaluations, Phase III Toxicity Confirmation Procedures for Samples Exhibiting Acute and Chronic Toxicity (EPA/600/R-92/081, 1993). 7. Reporting of Acute Toxicity Monitoring Results a. The permittee must submit a full laboratory report for all toxicity testing as an attachment to the Discharge Monitoring Report (DMR) for the month in which the toxicity test was conducted; the laboratory report must contain the following: the toxicity test results, the dates of sample collection and initiation of each toxicity test; all results for effluent parameters monitored concurrently with the toxicity test(s); and progress reports on TRE/TIE investigations. b. The permittee must provide the actual test endpoint responses for the control (i.e., control mean) and IWC concentration (i.e., IWC mean) for each WET test conducted to make it easier for permit writers to find the necessary WET test results when determining WET RP. c. The permittee must notify the permitting authority in writing within 14 days of exceedance of an acute WET permit limit or trigger. Such notification must describe actions the permittee has taken or will take to investigate, identify, and correct the causes of toxicity; the status of actions required by this permit; and schedule for actions not yet completed; or reason(s) that no action has been taken. 8. Permit Reopener for Acute Toxicity In accordance with 40 CFR Parts 122 and 124, this permit may be modified to include effluent limitations or permit conditions to address acute toxicity in the effluent or receiving waterbody, as a result of the discharge; or to implement new, revised, or newly interpreted water quality standards applicable to acute toxicity. D-6 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 CHRONIC WET NPDES PERMIT LANGUAGE xx. Chronic Whole Effluent Toxicity (WET) Requirements 1. Monitoring Frequency The permittee must conduct monthly/quarterly/semiannual chronic toxicity tests on 24- hour composite effluent samples. Once each calendar year, at a different time of year from the previous years, the permittee must split a 24-hour composite effluent sample and concurrently conduct three toxicity tests using a fish, an invertebrate, and an alga species; the permittee must continue to conduct routine monthly/quarterly/semiannual toxicity testing using the single, most sensitive species. Chronic toxicity test samples must be collected for each point of discharge at the designated NPDES sampling station for the effluent (i.e., downstream from the last treatment process and any in-plant return flows where a representative effluent sample can be obtained). During years 1, 2, 3, 4, and 5 of the permit, a split of each sample must be analyzed for all other monitored parameters at the minimum frequency of analysis specified by the effluent monitoring program. 2. Freshwater Species and EPA WET Test Methods Species and short-term EPA WET test methods for estimating the chronic toxicity of NPDES effluents are in the fourth edition of Short-term Methods for Estimating the Chronic Toxicity of Effluents and Receiving Waters to Freshwater Organisms (EPA/82l/R-02/013, 2002; Table IA, 40 CFRPart 136). The permittee must conduct static renewal toxicity tests with the following: • Fathead minnow, Pimephalespromelas (Larval Survival and Growth Test Method 1000.0) • Daphnid, Ceriodaphnia dubia (Survival and Reproduction Test Method 1002.0); • Green alga, Selenastrum capricornutum (also named Raphidocelis subcapitata) (Growth Test Method 1003.0). 3. Chronic WET Permit Triggers a. There are no chronic toxicity effluent limits for this discharge. The chronic WET permit trigger is any one WET test (either biological endpoint of survival or sublethal) where a test result is Fail (during the monthly reporting period) at the chronic in-stream waste concentration (IWC). For this discharge, the IWC is XXX percent (e.g., either is 100 percent or an effluent at the mixing zone to be determined at time of permit issuance) effluent. To calculate either a Pass or Fail of the multiple- effluent concentration chronic toxicity test at the IWC, follow the instructions in Appendix A in the National Pollutant Discharge Elimination System Test of Significant Toxicity Implementation Document (EPA/833-R-10-003). A Pass result indicates no toxicity at the IWC, and a Fail result indicates toxicity at the IWC. The D-7 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 permittee must report either a Pass or a Fail on the DMR form. If a result is reported as Fail, the permittee must follow Section 7 (Reporting of Chronic Toxicity Monitoring Results) of this permit. -OR- 3. Chronic WET Permit Limits b. There is a chronic toxicity effluent limit for this discharge. The chronic WET permit trigger is any one WET test (either biological endpoint of survival or sublethal) where a test result is Fail (during the monthly reporting period) at the chronic in-stream waste concentration (IWC). For this discharge, the IWC is XXX percent (e.g., either is 100 percent or an effluent at the mixing zone to be determined at time of permit issuance) effluent. To calculate either a Pass or Fail of the multiple-effluent concentration chronic toxicity test at the IWC, follow the instructions in Appendix A in the National Pollutant Discharge Elimination System Test of Significant Toxicity Implementation Document (EPA/833-R-10-003). A Pass result indicates no toxicity at the IWC, and a Fail result indicates toxicity at the IWC. The permittee must report either a Pass or a Fail on the DMR form. If a result is reported as Fail, the permittee must follow Section 7 (Reporting of Chronic Toxicity Monitoring Results) of this permit. 4. Quality Assurance - EPA WET Test Methods a. Quality assurance measures, instructions, and other recommendations and requirements are in the EPA WET test methods manual previously referenced in this permit. b. This permit is subject to a determination of Pass or Fail from a multiple-effluent concentration chronic toxicity test at the IWC (for statistical flowchart and procedures, see National Pollutant Discharge Elimination System Test of Significant Toxicity Implementation Document, Appendix A, Figure A-l). The chronic in-stream waste concentration (IWC) for this discharge is XXX percent (e.g., either is 100 percent or an effluent at the mixing zone to be determined) effluent. c. Effluent dilution water and control water should be standard synthetic dilution water as described in the EPA WET test methods manual, Short-term Methods for Estimating the Chronic Toxicity of Effluents and Receiving Waters to Freshwater Organisms (EPA/82 l/R-02/013, 2002). If the dilution water is different from test organism culture water, a second control using culture water must also be used. d. If organisms are not cultured in-house, concurrent testing with a reference toxicant must be conducted. If organisms are cultured in-house, monthly reference toxicant testing is sufficient. Reference toxicant tests and effluent toxicity tests must be conducted using the same test conditions (e.g., same test duration). D-8 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 e. If either the reference toxicant or effluent toxicity tests do not meet all test acceptability criteria in the EPA WET test methods manual, the permittee must resample and retest within 14 days. f. Following Paragraph 10.2.6.2 of the freshwater EPA WET test methods manual, all chronic toxicity test results from the multi-concentration tests required by this permit must be reviewed and reported according to EPA guidance on the evaluation of concentration-response relationships in Method Guidance and Recommendations for Whole Effluent Toxicity (WET) Testing (40 CFR Part 136) (EPA/82l/B-00-004, 2000). g. If the discharged effluent is chlorinated, chlorine must not be removed from the effluent sample before toxicity testing without written approval by the permitting authority. 5. Initial Investigation TRE Work Plan Within 90 days of the permit effective date, the permittee must prepare and submit to the permitting authority a copy of its Initial Investigation Toxicity Reduction Evaluation (TRE) Work Plan (1-2 pages) for review. That plan must contain steps the permittee intends to follow if toxicity is measured above a chronic WET permit limit or trigger and should include the following, at minimum: a. A description of the investigation and evaluation techniques that would be used to identify potential causes and sources of toxicity, effluent variability, and treatment system efficiency. b. A description of methods for maximizing in-house treatment system efficiency, good housekeeping practices, and a list of all chemicals used in operations at the facility. c. If a Toxicity Identification Evaluation (TIE) is necessary, an indication of who would conduct the TIEs (i.e., an in-house expert or outside contractor). 6. Accelerated Toxicity Testing and TRE/TIE Process a. If a chronic WET permit limit or trigger is exceeded and the source of toxicity is known (e.g., a temporary plant upset), the permittee must conduct one additional toxicity test using the same species and EPA WET test method. This WET test must begin within 14 days of receipt of WET test results exceeding a chronic WET permit limit or trigger. If the additional toxicity test does not exceed a chronic WET permit limit or trigger, the permittee may return to their regular testing frequency. b. If a chronic WET permit limit or trigger is exceeded and the source of toxicity is not known, the permittee must conduct six additional toxicity tests using the same species and EPA WET test method, approximately every two weeks, over a 12 week period. This testing must begin within 14 days of receipt of WET test results exceeding a chronic WET permit limit or trigger. If none of the additional toxicity tests exceed a chronic WET permit limit or trigger, the permittee may return to their regular testing frequency. D-9 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 c. If one of the additional toxicity tests (in paragraphs 6.a or 6.b) exceeds a chronic WET permit limit or trigger, within 14 days of receipt of this WET test result, the permittee must initiate a TRE using as guidance, according to the type of treatment facility, the EPA TRE manual, Toxicity Reduction Evaluation Guidance for Municipal Wastewater Treatment Plants (EPA/ 833/B-99/002, 1999) or EPA TRE manual, Generalized Methodology for Conducting Industrial Toxicity Reduction Evaluations (EPA/600/2-88/070, 1989). In conjunction, the permittee must develop and implement a Detailed TRE Work Plan that must contain the following: further actions undertaken by the permittee to investigate, identify, and correct the causes of toxicity; actions the permittee will take to mitigate the effects of the discharge and prevent the recurrence of toxicity; and a schedule for such actions. d. The permittee may initiate a TIE as part of a TRE to identify the causes of toxicity using the same species and EPA WET test method and, as guidance, EPA WET TIE/TRE method manuals: Toxicity Identification Evaluation: Characterization of Chronically Toxic Effluents, Phase I (EPA/600/6-91/005F, 1992); Methods for Aquatic Toxicity Identification Evaluations, Phase II Toxicity Identification Procedures for Samples Exhibiting Acute and Chronic Toxicity (EPA/600/R-92/080, 1993); Methods for Aquatic Toxicity Identification Evaluations, Phase III Toxicity Confirmation Procedures for Samples Exhibiting Acute and Chronic Toxicity (EPA/600/R-92/081, 1993). 7. Reporting of Chronic Toxicity Monitoring Results a. The permittee must submit a full laboratory report as an attachment to the DMR for all toxicity testing for the month in which the toxicity test was conducted; the laboratory report must contain the following: the toxicity test results, the dates of sample collection and initiation of each toxicity test; all results for effluent parameters monitored concurrently with the toxicity test(s); and progress reports on TIE/TRE investigations. b. The permittee must provide the actual test endpoint responses for the control (i.e., control mean) and IWC concentration (i.e., IWC mean) for each WET test conducted to make it easier for permit writers to find the necessary WET test results when determining WET RP. c. The permittee must notify the permitting authority in writing within 14 days of exceedance of a chronic WET permit limit or trigger. The notification must describe actions the permittee has taken or will take to investigate, identify, and correct the causes of toxicity; the status of actions required by this permit; and schedule for actions not yet completed; or reason(s) that no action has been taken. 8. Permit Reopener for Chronic Toxicity In accordance with 40 CFR Parts 122 and 124, this permit may be modified to include effluent limitations or permit conditions to address chronic toxicity in the effluent or receiving waterbody, as a result of the discharge; or to implement new, revised, or newly interpreted water quality standards applicable to chronic toxicity. D-10 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 APPENDIX E WHOLE EFFLUENT TOXICITY REASONABLE POTENTIAL ANALYSIS USING THE TEST OF SIGNIFICANT TOXICITY APPROACH E-1 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 E-2 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 APPENDIX E: WET RP ANALYSIS USING THE TST APPROACH For reasonable potential (RP) calculations using the TST approach, EPA recommends that permitting authorities use all the valid WET test data generated during the current permit term and any additional valid data that are submitted as part of the permit renewal application. The permitting authority should be using at least a minimum of four valid WET tests to address effluent representativeness using the TST RP approach. WET test data are then analyzed according to the TST approach using the IWC and control test concentrations for all valid WET test data available. For the RP approach, data sets with fewer than four valid WET data points should be assessed using EPA's Technical Support Document (TSD) RP approach because it addresses small WET data sets by incorporating an RP multiplying factor (see Section 3.2.2 of the TSD, p. 54) to account for effluent variability in small WET data sets. EPA also recommends that states request that their permittees provide the actual test endpoint responses for the control (i.e., mean of control) and IWC concentration (i.e., mean of IWC) for each WET test conducted to make it easier for permit writers to find the necessary data with which to calculate WET RP with this approach. EPA recommends that permitting authorities decide up front which approach (the 1991 TSD approach, the TST approach, or another scientifically defensible approach that is sufficient to meet the statutory and regulatory requirements) they will incorporate and consistently use in their state's NPDES implementation procedures, including for their RP procedures. Permitting authorities should consistently use the selected WET statistical approach in all the state NPDES permits. All valid WET test data are then analyzed according to the TST approach using the IWC and control test concentrations. If WET test data are available and the TST statistical approach indicates that the IWC is toxic in any WET test ("effluent cause(s) toxicity"), RP has been demonstrated (40 CFR 122.44(d)(l)(i)). For example, if results of five WET tests are available using the TST approach and the results are Pass, Pass, Fail, Pass, Pass, because at least one test was a Fail (i.e., TST declared the effluent toxic in at least one test), RP has been demonstrated. To address concerns regarding the "potential to cause or contribute to toxicity," a second assessment is applied to determine whether the effluent has RP even if all test results are Pass using the TST approach. The current TST approach results in four outcomes with respect to RP at the IWC: 1. Caused (effluent is toxic): RP is demonstrated if any one test using the TST approach indicates a test result is Fail (i.e., using the statistical test (Appendix A) and t table (Appendix B), the test result is Fail; see Example A below in Table E-l); 2. Potential to Cause: Effluent has reasonable potential to cause (RP is demonstrated) if any test exhibits a mean effect at the IWC >10 percent as compared to the mean control response, even if the test result is Pass using TST (see examples B-D, Table E-l); and 3. No RP (effluent is non-toxic at the IWC): Effluent does not cause or have reasonable potential to cause if the tests are each a Pass using the TST approach and the mean effect at the IWC is always <10 percent. E-3 ------- NPDES Test of Significant Toxicity Implementation Document June, 2010 4. Insufficient valid WET data (fewer than 4 tests or no data): If fewer than four valid WET data are available, follow the TSD RP procedure for WET. The second outcome is where the determination of RP is critical to demonstrate that the discharge has the reasonable potential to cause an excursion above the state toxicity water quality standards. In the TST approach, the regulatory management decision threshold for non- toxicity in WET tests under the NPDES WET Program is 10 percent mean effect at the IWC. At or below that mean effect level, the TST approach is designed to declare a WET test as non-toxic (i.e., Pass) most (at least 95 percent) of the time to help control for false positives. For purposes of RP assessment then, a 10 percent mean effect level at the IWC is used as a threshold, above which potential to cause is indicated, and the effluent has demonstrated RP. Any test with a mean effect at the IWC >10 percent would demonstrate a potential for RP even if the TST test result is Pass. Equation E-l below demonstrates how the effluent effect is calculated at the IWC. , ti "t 7 Mean Control Response-Mean Response at IWC .• % Effect at IWC = x 100 Equation E-l Mean Control Response Table E-1. Examples illustrating the reasonable potential approach using TST and data from Ceriodaphnia chronic survival and reproduction WET tests Example Pass/Fail based on TST analysis Mean control response Mean response @ IWC % effect at IWC Reasonable potential? A Fail 26.3 17.0 35.4% Yes B Pass 26.3 23.4 11.0% Yes C Pass 28.6 22.0 23.1% Yes D Pass 22.4 20.9 6.7% No E-4 ------- |