Ecological Effects Test Guidelines OCSPP 850.4000: Background and Special Considerations-Tests with Terrestrial and Aquatic Plants, Cyanobacteria, and Terrestrial Soil-Core Microcosms


&EPA
United States
Environmental Protection
Agency
Office of Chemical Safety
and Pollution Prevention , ' ' zlr""' °
(7101)      January 2012
       Ecological Effects
       Test Guidelines

       OCSPP 850.4000:
       Background and
       Special
       Considerations-Tests
       with Terrestrial and
       Aquatic Plants,
       Cyanobacteria, and
       Terrestrial Soil-Core
       Microcosms

-------
NOTICE

This guideline is one of a series of test guidelines established by the United States
Environmental Protection Agency's Office of Chemical Safety and Pollution Prevention
(OCSPP) for use in testing pesticides and chemical substances to develop data for
submission to the Agency under the Toxic Substances Control Act (TSCA) (15 U.S.C. 2601,
et seq.), the Federal Insecticide, Fungicide and Rodenticide Act (FIFRA) (7 U.S.C. 136, et
seq.), and section 408 of the Federal Food, Drug and Cosmetic (FFDCA) (21 U.S.C. 346a).
Prior to April 22, 2010, OCSPP was known as the Office of Prevention, Pesticides and Toxic
Substances (OPPTS). To distinguish these guidelines from guidelines issued by other
organizations, the numbering convention adopted in 1994 specifically included OPPTS as
part of the guideline's number. Any test guidelines developed after April 22, 2010 will use
the new acronym (OCSPP) in their title.

The OCSPP harmonized test guidelines serve as a compendium of accepted scientific
methodologies and protocols that are intended to provide data to inform regulatory decisions
under TSCA, FIFRA, and/or FFDCA. This document provides guidance for conducting the
test, and is also used by EPA, the public, and the companies that are subject to data
submission requirements under TSCA, FIFRA, and/or the FFDCA. As a guidance
document, these guidelines are not binding on either EPA or any outside parties, and the
EPA may depart from the guidelines where circumstances warrant and without prior notice.
At places in this guidance, the Agency uses the word "should." In this guidance, the use of
"should" with regard to an action means that the action is recommended rather than
mandatory. The procedures contained in this guideline are strongly recommended for
generating the data that are the subject of the guideline, but EPA recognizes that departures
may be appropriate in specific situations. You may propose alternatives to the
recommendations described in these guidelines, and the Agency will assess them for
appropriateness on a case-by-case basis.

For additional information about these test guidelines and to access these guidelines
electronically, please go to http://www.epa.gov/ocspp and select "Test Methods &
Guidelines" on the left side navigation menu. You may also access the guidelines in
http://www.requlations.qov grouped by Series under Docket ID #s: EPA-HQ-OPPT-2009-
0150 through EPA-HQ-OPPT-2009-0159, and EPA-HQ-OPPT-2009-0576.

-------
OCSPP 850.4000: Background and special considerations: tests with terrestrial
and aquatic plants, cyanobacteria, and terrestrial soil-core microcosms.

(a) Scope—

       (1) Applicability. This guideline is intended to be used to help develop data to submit to
       EPA under the Toxic Substances Control Act  (TSCA) (15 U.S.C. 2601, et seq.),  the
       Federal Insecticide, Fungicide, and Rodenticide Act (FIFRA) (7 U.S.C. 136, et seq.), and
       the Federal Food, Drug, and Cosmetic Act (FFDCA) (21 U.S.C. 346a).

       (2) Background.  This guideline provides general information applicable to  conducting
       OCSPP  Series  850, Group D  toxicity tests with  terrestrial and  aquatic  plants and
       cyanobacteria (formerly referred to as blue-green algae).  The source materials used in
       developing this harmonized OCSPP test guideline include OPP 120-1  Overview, 120-2
       Definitions,  120-3 Basic Test Standards, and 120-4 General Evaluation and Reporting
       Requirements, Pesticide  Assessment Guidelines Sub-division  J—Hazard Evaluation:
       Nontarget Plants); ASTM E 1963-02, Standard Guide for Conducting Terrestrial Plant
       Toxicity Tests, the Pesticides Reregi strati on Rejection  Rate Analysis: Ecological Effects
       report;  and  the background materials in the OCSPP Series 850, Group  D  specific
       guidelines.

       (3) General.

             (i) The OCSPP Series 850, Group D provides guidelines applicable to  conducting
             laboratory   and  field   toxicity  tests   with   terrestrial  and   aquatic   plants,
             cyanobacteria,  and terrestrial  (soil-core)  microcosms.    Where appropriate,
             individual  guidelines in  the OCSPP Series  850, Group D are  harmonized with
             OECD test guidelines.   The  guidelines in the OCSPP Series,  Group D  are
             applicable  to evaluating the  hazards  and risks  of industrial  chemicals and
             pesticides  to  various  terrestrial  and aquatic  plant  species and  cyanobacteria
             resulting from direct or indirect exposure. Data concerning the determination of
             outdoor pesticidal effects on plants and cyanobacteria are used in evaluating risks
             to nontarget plants in ecological risk assessment of pesticides (40 CFR part 158,
             paragraph (k)(28) of this guideline).  These data are also of use in assessments of
             potential off-target injury to endangered and threatened plant species listed by the
             Fish  and Wildlife Service,  Department of Interior, and  when  phytotoxicity
             concerns arise from incidents or during  Special Review.  Phytotoxicity data  are
             also  occasionally requested in  order to  assess the potential hazard of  certain
             pesticides to plants within the pesticide treatment area (target area testing).

             (ii) Information is provided on the design and conduct of tests with terrestrial and
             aquatic plants, cyanobacteria,  and terrestrial plant microcosms, emphasizing  the
             importance of adequately  characterizing the  test  substance, use of  suitable
             experimental designs, and establishing the physical and chemical conditions of
             the test system in order to provide a scientifically sound understanding of how the
             test  substance  behaves under  test  conditions.    Also  considered  are  an
             identification of the factors that can affect the test outcome  and interpretation of
             test results. This general information is primarily applicable to the guidelines for
             laboratory toxicity tests and microcosms, since  field tests are designed on a case-
                                      Page 3 of 29

-------
by-case basis. However, the OCSPP 850.4000 guideline lists critical elements of
quality assurance and reporting standards common to all the guidelines in OCSPP
Series 850, Group D guidelines.

(iii) The OCSPP Series 850, Group D guidelines have generally been validated in
formal round-robin tests or through repeated use.

(iv) Each  submitted study should meet the data quality objectives for which the
test is designed.   Test validity elements critical  to determining the scientific
soundness and acceptability of the study have been listed for each guideline in the
OCSPP Series 850,  Group D.

(v) Certain aspects of the OCSPP 850.1000 guideline are also relevant to toxicity
studies with aquatic plants and microorganisms (the OCSPP 850.4400 guideline,
Aquatic plant toxicity test using Lemna spp.;  the OCSPP 850.4450 guideline,
Aquatic plants field study; and the OCSPP 850.4500  guideline,  Algal toxicity;
and the OCSPP 850.4550 guideline, Cyanobacteria toxicity).

(vi) The guidelines  contained in the OCSPP Series 850, Group  D  recommend
specific procedures to be used in almost all  circumstances in order to result in a
satisfactory study result, but also provide general guidance that allows for  some
latitude, based upon study-specific circumstances.  It is  recognized that certain
problems, some of which are unavoidable, may arise both before and during
testing and provisions have thus been made in the  guidelines for dealing with
those that are commonly encountered.  These guidelines  provide  for exceptions,
while at the same time maintaining a high level of scientifically sound, state-of-
the-art guidance so  that following this guidance will  provide ecological effect
information that is scientifically defensible for its intended use, while also taking
into consideration the chemistry and environmental fate of the test  substance. For
a  satisfactory  test,  the experimental  design,  execution  of the  experiments,
classification of the organism, sampling, measurement, and data analysis should
be accomplished by use of sound scientific techniques recognized by the scientific
community.   Uniformity of  procedures,  materials, and  reporting  should  be
maintained throughout  the  toxicity  evaluation process.   Refinements  of the
procedures to increase their accuracy and effectiveness are encouraged.  When
such refinements include major modifications of any test procedure, the Agency
should be consulted before implementation.  Also when in doubt, users of these
guidelines  should  consult  with  the  appropriate  regulatory  authorities for
clarification or additional information before proceeding.  All references supplied
with respect to protocols or other test  standards are provided as recommendations.

(vii) For pesticides, a tiered testing approach given in 40 CFR 158.540 for plant
protection data requirements provides for greater efficiency of testing resources
while assuring data development as  warranted for  hazard or risk  assessment.
Tests in the lower tiers (Tier I and Tier II) are designed to screen  test substances
to determine their potential to cause  adverse affects  on seedling emergence (the
OCSPP 850.4100 guideline), vegetative vigor (the OCSPP 850.4150 guideline),
and  aquatic  plant  and cyanobacteria  growth and  reproduction (the  OCSPP

                         Page 4 of29

-------
850.4500 and OCSPP 850.4550 guidelines).  For pesticides, a Tier I test, referred
to as a limit test in these Group D guidelines, tests a single concentration and
compares effects observed with appropriate controls.  Tier II testing for pesticides
(multiple-concentration definitive test in these Group D guidelines provides for
generation   of  dose-response  curves  for  test  substances   which  are  known
phytotoxicants  or which  in Tier I  testing demonstrated phytotoxicity.  Tier III
plant tests  currently include  the  Terrestrial  Plants  Field  Study  (the OCSPP
850.4300 guideline); Terrestrial Soil-Core Microcosm Test (the OCSPP 850.4900
guideline);  and Aquatic Plants Field Study (the OCSPP 850.4450 guideline) and
are designed on a  case-by-case basis to  address specific  objectives concerning
detrimental  effects  on nontarget plants. Progression to Tier III would occur on a
case-by-case basis to further refine  and characterize the estimate of phytotoxicity
risk.  For the evaluation of pesticides, the criteria to proceed from one tier to the
next are given in 40 CFR 158.540.  Testing of pesticides in any of these tiers can
be expanded by conducting additional tests that are normally  specific to industrial
chemicals (e.g., the OCSPP 850. 4800 Plant Uptake  and Translocation Test;  the
Rhizobium-Legume Toxicity Test (the OCSPP 850.4600 guideline)  and adapting
them to pesticide use as needed to address specific exposure and effect concerns.

(viii) Phytotoxicity  data may also be used to evaluate the potential hazard and risk
of industrial chemicals. Phytotoxicity data are requested when there are verified
terrestrial exposures or aquatic exposures and are also occasionally requested for
industrial chemicals to assess their potential hazard to crop  plant test species.
This testing is part of the Tier I (base set)  suite of tests in the OPPT testing
scheme developed  for determining environmental effects  (see the references in
paragraphs (k)(10), (k)(ll), (k)(16), (k)(17),  (k)(30),  and (k)(31)of this guideline
for further  details).  This testing scheme is deterministic  for  the  most part,
flexible, sequential, consistent, iterative, transparent,  discriminatory  of the extent
of toxicity, and applicable to all types  of chemicals.

(ix) For industrial chemicals,  dose-response testing occurs at Tier I  to determine
as quickly   and as  efficiently  as possible,  the potential  phytotoxicity of these
chemicals during early seedling emergence and  growth of terrestrial  plants (the
OCSPP  850.4230  guideline) and  to aquatic  plants  (the  OCSPP  850.4500
guideline).   In  contrast to pesticides, the potential phytotoxicity  of industrial
chemicals is often not characterized. Thus, range-finding tests, a preliminary step
to define dose-response  testing, are  more commonly conducted than limit  or
maximum challenge tests  that use only one test concentration.  If phytotoxicity is
noted  at Tier  I multiple-concentration  definitive testing,  additional  seedling
emergence  tests with more plant species are conducted at Tier II to better delimit
this toxicity to sensitive plant groups  or under specific exposure scenarios.  For
example, if monocot species exhibit the greatest sensitivity,  additional monocots
are tested;  if a wetland monocot is  found to be the most  sensitive, additional
wetland monocots (and also dicots) are then tested. Chronic  (reproductive effect)
tests may sometimes be part of Tier II (or more commonly part of Tier III). Tier
II plant tests for industrial chemicals are designed to investigate potential chronic
cumulative  toxicity and bioconcentration  potential of industrial chemicals. Tier
III plant tests currently are the Plant Uptake and Translocation Test (the OCSPP
                          Page 5 of29

-------
              850.4800 guideline), possibly the Terrestrial Soil-Core Microcosm  Test (the
              OCSPP  850.4900  guideline),  and the Rhizobium-Legume  Toxicity  Test (the
              OCSPP 850.4600 guideline).  Field testing is conducted at Tier IV, if necessary
              for widely distributed chemicals with high  exposure, which are toxic to plants.
              Testing  of industrial chemicals  in  any of these  tiers can  be  expanded by
              conducting  additional tests that are normally specific to  pesticides  (e.g.,  the
              OCSPP  850.4100, 850.4150, and 850.4300  guidelines) and adapting them to
              industrial chemicals as needed. These pesticide-specific tests should be viewed as
              supplements to the TSCA plant tests,  not as substitutes.  For industrial chemicals,
              the base set Tier I tests and requirement to proceed from one tier to the next are
              referenced in paragraphs (k)(10), (k)(ll), (k)(16), (k)(17), (k)(30), and (k)(31)of
              this guideline.

              (x) While performing field tests, all necessary measures  should be taken to ensure
              that nontarget plants and animals, especially endangered or threatened species,
              will not be adversely affected either by direct hazard or by impact on food supply
              or food chain.

(b) Definitions.  Terms used in the OCSPP Series 850, Group D guidelines have the meanings
set forth  in Section 3  FIFRA  regulations at  40 CFR  152.3 (Pesticide  Registration  and
Classification Procedures);  40  CFR 158.300 (Product Chemistry Definitions); 40  CFR part  160
(Good Laboratory Practice  Standards); the OCSPP 850.1000 guideline (background for aquatic
organism  testing); and in  TSCA Section 3 regulations 40 CFR part 792  (Good Laboratory
Practice Standards);  and the Agency's "Terms  of  Environment,  Glossary, Abbreviations  and
Acronyms" (see paragraph (k)(22) of this guideline). The definitions below apply to the OCSPP
Series 850 Group D test guidelines  and where applicable, the individual test guidelines contain
additional or test-specific definitions.

       Active ingredient (a.i.) is  any substance  (or group of structurally similar  substances if
       specified by the Agency)  that will prevent,  destroy, repel or mitigate any pest, or that
       functions as a plant regulator, desiccant,  or defoliant within the meaning of FIFRA (40
       CFR152.3).

       Adjuvant is a subsidiary ingredient or additive in a mixture which modifies, enhances or
       prolongs  by  physical  action the  activity of  the  active  ingredient(s).   Examples of
       agricultural chemical adjuvants include but are not limited to surfactants, crop oils, anti-
       foaming agents, buffering  compounds, drift control agents, compatibility agents, stickers
       and spreaders.

       Algae includes the green algae (Chlorophyta), golden algae and diatoms (Chrysophyta),
       brown algae (Phaeophyta), and red algae  (Rhodophyta).   Organisms formerly classified
       as blue-green algae (Cyanophyta) are now classified as Cyanobacteria.

       Aquatic plants includes those plants that are totally aquatic (free-floating  or attached,
       submersed or emerged) that may inhabit still or flowing water bodies.

       Axenic is a culture of one organism free from other organisms.

       Concentration-response curve is the graphical and mathematical relationship between the
                                       Page 6  of29

-------
concentration of a substance and a specific biological response produced from toxicity
tests when percent response (e.g., growth) values are plotted against concentration of test
substance for a given exposure duration. This is also referred to as the dose-response
curve or concentration-effect curve.

Control refers to test organisms exposed to test conditions and test matrix (water,
sediment, medium, etc.) in the absence of any introduced test substance as part of the test
design for the purpose of establishing a basis of comparison with a test substance for
known chemical or biological measurements.

Culture (noun) refers to the organisms which are raised on-site or maintained under
controlled conditions to produce test organisms through reproduction.

Culture (verb) is to grow, raise, or maintain organisms under controlled conditions to
produce test organisms through reproduction.

Desirable plants are those plants that are not to be detrimentally affected during pesticide
application. They may include crops, ornamentals, or native plants inside or outside of
the area of intended application.

Direct exposure refers to the direct application of a pesticide or industrial chemical to a
plant or the location where the plant would reside.

Effect concentration (ECX) is the experimentally derived concentration of a test substance
in test matrix (e.g., water, growth medium, soil, sediment) that would be expected to
cause a specified effect in x percent (x%) of a group of test organisms under specified
exposure conditions.

Effect concentration, median (ECso) is the experimentally derived concentration of a test
substance in a test matrix (e.g., water, growth medium, soil, sediment) that would be
expected to cause a defined effect in 50% of a group of test organisms under specified
exposure conditions.

Formulation, as used within these guidelines, is a packaged end use product (e.g., dust,
wettable powder, emulsifiable concentrate, ultra low volume, etc} of the test substance
and may contain one or more active ingredients and one or more inert ingredients.

Germination refers to the physiological events associated with re-initiation of embryo
growth and mobilization of reserve nutrients in seeds. The emergence of the seedling
radicle from the seed coat defines the end of germination and the beginning of early
seedling growth.

Hormesis refers to a stimulatory effect on a given response variable, occurring at a low
exposure concentration of an otherwise toxic test substance (e.g., where the higher
exposure concentrations produce adverse effects).

Indirect exposure refers to exposure of plants or cyanobacteria resulting from movement
of the pesticide or industrial chemical through the environment by runoff, soil erosion,
spray drift, etc.

Page 7 of29

-------
Inert ingredient is any substance (or group of structurally similar substances if designated
by the Agency), other than an active ingredient, which is intentionally included in a
pesticide product (40 CFR 152.3).

Inhibition concentration (ICX) is the experimentally derived concentration of a test
substance in a test matrix (e.g., water, growth medium, soil, sediment) that would be
expected to cause a given percent, x, inhibition or reduction in a non-quantal response
from the smoothed mean control response. For example, the IC25 for growth is the
concentration of test substance that would cause a 25% reduction in growth in a test
population from the control response and the ICso is the concentration of test substance
that would cause a 50% reduction in growth from the control response.

Limit of detection (LOD) is the analytic level below which the qualitative presence of the
material is uncertain. This is typically defined by the lowest concentration producing a
signal two standard deviations above the background noise from a matrix blank sample.

Limit of quantitation (LOQ) is the analytic level below which the quantitative amount of
the material is uncertain. This is typically defined by the lowest concentration of fortified
matrix successfully analyzed.

Limit test is a toxicity test performed with a single test substance concentration and a
control to establish that the value for the measurement endpoint of concern (e.g., ECso) is
greater than the test substance concentration (limit concentration).

Lowest observed effect concentration (LOEC) is the lowest concentration of a test
substance to which organisms are exposed under specified exposure conditions that
causes a statistically significant adverse effect as compared to the control(s). Throughout
these guidelines, the terms LOEC and lowest observed adverse effect concentration
(LOAEC) have the same meaning in these guidelines.

Maximum acceptable toxicant concentration (MATC) is the highest concentration at
which a test substance can be present and not be toxic to the test organism. The MATC
lies within the range between the LOEC and no observed effect concentration (NOEC).
Operationally, for industrial chemicals, the MATC is defined as the geometric mean of
these values. The MATC is also referred to (in the Pre-Manufacture Notification (PMN)
program of OPPT) as the chronic value or chronic no-effect-concentration (NEC).

Measured concentration is an analytically derived measure above the method detection
limit.

Measurement endpoint is a quantitative measurable response to a stressor that is used to
infer a measure of protection or evaluate risk to valued environmental entities. Examples
of measurement endpoints include, but are not limited to, number of organisms that
survive, visual phytotoxicity, growth measurements (e.g., algal population density, plant
height, plant dry weight, etc}. Each test-specific guideline identifies the measurement
endpoint(s) to be determined by the proscribed study. The term "measurement endpoint"
is used synonymously with the term "measures of effect."

Medium is the chemically-defined culture solution used in microcosms and in culturing
Page 8 of29

-------
and testing certain organisms such as aquatic plants.

Method detection limit (MDL) is operationally defined as the concentration of constituent
that, when processed through the complete method, produces a signal with 99%
probability that it is different from the blank. It is computed as the standard deviation
multiplied by the Student's t constant corresponding to the appropriate degrees of
freedom (n-1). Thus, for seven spiked samples prepared at the hypothetical LOQ, the
MDL is 3.143 times the standard deviation of the mean of the seven replicate
measurements.

Nominal concentration is, for aquatic tests, the calculated concentration of active
ingredient that would exist if all test substance added to the test solution was completely
dissolved and did not dissipate in any way. For terrestrial tests, it is the target
concentration intended for application to test plants.

Nontarget microorganism is any microorganism species to which the pesticide is not
directly applied. These species are not intended to be controlled, injured, killed, or
detrimentally-affected in any way by a pesticide.

No observed effect concentration (NOEC) is the highest concentration of a test substance
to which organisms are exposed under specified exposure conditions that does not cause
a statistically significant adverse effect as compared to the control(s). The NOEC is the
test concentration immediately below the LOEC and can only be defined in the presence
of the LOEC. Throughout these guidelines, the terms NOEC and no observed adverse
effect concentration (NOAEC) have the same meaning.

Nontarget plant is any plant species to which the pesticide is not directly applied. These
species are not intended to be controlled, injured, killed, or detrimentally-affected in any
way by a pesticide. Nontarget plants include desirable or undesirable plants outside of
the target area.

Pest-free refers to being as free of pests as reasonably possible. For all phytotoxicity
tests, damaging insects, pathogens, and surrounding weeds should be controlled so that
healthy desirable plants are available for testing. With this action detrimental effects can
be attributed to the test substance in question, not to another pesticide, or to weeds,
damaging insects, or pathogens.

Phytotoxicity or plant toxicity refers to unwanted detrimental deviations from the normal
pattern of appearance, growth, and function of plants in response to the test substance.
The phytotoxic response may occur during germination, growth, differentiation,
maturation, and reproduction of plants, and may be of a temporary or long-term nature.
Phytotoxic responses include adverse effects on growth habit, yield, and quality of plants
or their commodities to the extent that a relationship between cause and effect can be
established.

Plants comprise vascular and nonvascular plants (including algae and cyanobacteria).

Reagent water is water that has been prepared by deionization, glass distillation, or
reverse osmosis.
Page 9 of29

-------
Replicate is the experimental unit within a toxicity test.  It is the smallest physical entity
to which treatments can be independently assigned.

Solubility is the amount of chemical dissolvable in test water and is operationally defined
as the  amount  of test   substance  retained in  the  supernatant  of a  conventionally
centrifuged sample of test medium or dilution water.  This amount of test substance is
considered to represent a conservative measure of the most bioavailable fraction which
may include some colloidal  material not removed by centrifugation in addition to the
truly dissolved fraction.

Static renewal system is a static system in which the test solution is renewed at specified
intervals during the test.

Stock solution is the concentrated solution of the test substance which is  dissolved and
introduced into the dilution water or test medium.

Support medium is the matrix used to support the plant during growth.  There are three
types of support  media: natural, formulated,  and artificial  media.  Natural  support
medium is  derived entirely from a combination of natural soils found in the environment.
Formulated support medium is  derived from a combination  of both natural soils and
artificial media  (including components  such as sphagnum moss).  Artificial  support
medium is derived entirely from washed quartz sand or glass beads.

Target area is the  area intentionally treated with a pesticide when label use directions are
followed.

Target area plants are  all  plants located within  the target  area, and  includes both
desirable and undesirable species.

Test chamber is the container in which the test organisms are maintained during the test
period.

Test substance refers to  the specific  form of a chemical  substance or mixture being
evaluated (e.g., pesticide  active ingredient or formulation, industrial chemical).

Test solution refers to the test substance and the dilution water or growth medium in
which the test substance is dissolved or suspended.

Terrestrial plants  are plants that do not require an aquatic medium for growth and may
include plants that inhabit semi-aquatic areas such as swamps or wetlands.

Treatment group refers to the set of replicates that receive the same amount (if any) of the
test substance; controls are treatment groups that receive none of the test substance.

Typical end-use product  (TEP) is a term used to convey direction to a data producer to
use a commonly used end-use product, a pesticide formulation for field or other  end use
(excludes products with  labeling that allows  use of  the  product to formulate  other
pesticide products), as the test substance.  The term  includes any physical apparatus used
to deliver or apply the pesticide if distributed or sold with the pesticide.

                               Page 10 of 29

-------
Vehicle is any agent which facilitates the mixture, dispersion, or solubilization of a test
substance with a carrier (e.g., water, growth medium) used to expose the test organisms
(40 CFR 160.3, 40 CFR 792.3).

(1) Laboratory facilities and equipment. The type of facilities and equipment for
conducting the toxicity tests with the organisms in this group of guidelines varies
depending upon the nature of the test and the organism. In general, these toxicity tests
use normal laboratory glassware, supplies and equipment as well as equipment for
maintaining the organisms under the test conditions and controlling the test conditions
(e.g., temperature, humidity, and lighting). See specific OCSPP Series 850, Group D
guidelines for identification of any atypical facility, equipment, or supplies used in the
test. Construction materials and equipment that are toxic, may affect toxicity, or that may
sorb test substances should not be used.

(2) Maintenance and reliability. All equipment used in conducting the test, including
equipment used to prepare and administer the test substance, and equipment to maintain
and record environmental conditions, should be of such design and capacity that tests
involving this equipment can be conducted in a reliable and scientific manner.
Equipment should be inspected, cleaned, and maintained regularly, and be properly
calibrated. All materials that will come in contact with the test organisms and test
substance should be cleaned before use. Cleaning procedures should be appropriate to
remove known or suspected contaminants.

(3) Permits. Experimental use permits may be required for the terrestrial testing of
pesticides under field conditions involving more than 10 acres such as in studies
described in the OCSPP 850.4300 (terrestrial plant field testing) guideline. Permits may
be required for aquatic field testing of pesticides of more than 1 acre.

(4) Field equipment. If relevant, the application equipment used in testing products in
small field plot studies should be designed to simulate conventional application
equipment. This can be accomplished by using the basic components of commercial
application equipment in the design of the small-plot equipment. For example, nozzle
types, sizes, and arrangements on small plot sprayers can be identical to those used by
growers on commercial ground sprayers. Single-row commercial granular application
equipment mounted on a garden tractor for small plot trials should produce results
comparable to those from a multiple of such units on a large tractor. For large-scale field
trials, commercial application equipment should be used. Specific details as to
descriptions of equipment design, adjustment, and operation should be provided in test
reports.

(d) Experimental design and data analysis—

(1) Design elements. Elements of experimental design such as the number of test
treatments, progression factor between treatment levels, number of replicates, and
number of organisms per replicate and per treatment are based upon the purpose of the
test, variability expected in response measurements, and the type of statistical procedures
that will be used to evaluate the results. See the test-specific guidelines for specific
Page 11 of 29

-------
information relating to these aspects of test design.  General principles of test design are
set forth in this guideline.  General guidance on the statistical analysis of ecotoxicity tests
can be  found in the  references in paragraphs (k)(l), (k)(4), (k)(13), (k)(14),  (k)(15),
(k)(25), (k)(26), and (k)(27) of this guideline.

(2) Calculation of endpoints—

       (i) Background—

              (A) Qualitative data.  Some of the data generated in phytotoxicity tests
              are qualitative,  such  as  ratings  based  upon  visual  symptoms  of
              phytotoxicity.   Qualitative data  such as  ratings  are  not  statistically
              analyzed for these tests but may be used to report qualitative no-effect
              levels.

              (B) Quantitative data.

                     (1) Quantitative data  generated in phytotoxicity tests may be of
                     three types:

                           (a) Quantal (dichotomous), where the variable has only two
                           mutually exclusive  outcomes, (e.g., dead or  alive)—note
                           that quantal data are a special  case of discrete data;

                           (b)  Discrete, where there  is a finite number of values
                           possible or there is a space on the number line between two
                           possible  values (e.g., number  of fronds, fruits or seeds
                           produced); or

                           (c) Continuous, where the variable can assume a continuum
                           of possible outcomes (e.g., height, weight).

                     (2) These data may  be analyzed using regression-based techniques
                     or hypothesis-testing procedures depending on the objectives and
                     endpoints  of a  specific test  guideline.  For  information  on the
                     advantages  and disadvantages of  these approaches,  see  the
                     references in paragraphs (k)(7),  (k)(8), (k)(14), (k)(15), and  (k)(18)
                     of this  guideline.  Specific test guideline objectives, either point
                     estimate or hypothesis-based endpoints or both, are  identified in
                     each specific test guideline.

       (ii) Point estimates and concentration-response or dose-response tests.  This
       type of toxicity test is designed to allow calculation of a concentration- or dose-
       response curve (mathematical model) and to estimate one  or more specific points
       (point estimates) on the curve,  such as an EC25 or ECso.  Regarding terminology,
       the term ICX is more appropriately used for continuous endpoints, rather than ECX.
       Because of the normal variation in sensitivity of individuals within a group of test
       organisms, a measure of the degree of certainty in the model parameters and the
       point estimate  value(s)  should be determined.

                                Page 12 of 29

-------
       (A) No single statistical technique is appropriate for all data sets, and the
       assumptions and requirements of each method should be known before
       using (see paragraphs (k)(l), (k)(9), (k)(12), (k)(18),  and (k)(29) of this
       guideline).  Not all methods  suitable for continuous data are appropriate
       for quantal data (see paragraphs (k)(6) and (k)(12) of this guideline). For
       point estimate tests,  regression-based  methods  that  model  the full
       concentration- or dose-response relationship and provide error estimates of
       the model parameters and point estimate(s) are desired.  The regression
       model used to fit data should be  recorded,  and the error estimates of the
       model  parameters  (e.g.,  standard error  of slope and intercept), and
       goodness-of-fit should be calculated and recorded. For  a point estimate
       (e.g., ICso) the  95% confidence interval and standard  error are calculated
       and recorded.  If data do not fit a  regression-based  model, other point
       estimator methods (e.g., binominal, moving average, trimmed  Spearman -
       Karber,  linear  interpolation  (e.g., Bootstrap  ICp))  are available  (see
       paragraphs (k)(23), (k)(26) and  (k)(27) of this guideline).   The method
       used to estimate the endpoint and, if applicable, 95%  confidence interval
       for the point estimate, should be recorded.

       (B) To account for experimental variability, select a statistical method that
       retains  data  for  the  individual  replicates  through  the  regression
       calculations as opposed to pooling the replicates.  Where hormesis occurs,
       it is  preferable  to select a statistical  method that  provides for analysis of
       the impact of  stimulation on the calculated  endpoints as  opposed  to
       discarding stimulatory data.

       (C) Concentration-effect models  are good estimating tools  only for  the
       range of concentrations  used to  fit them; therefore,  endpoints  that  are
       extrapolated  beyond the  range  of the concentrations  tested would  be
       considered to  be  of  lower  confidence or  potentially, of such  low
       confidence that they would not be appropriate to estimate.

(iii) Hypothesis-based methods—

       (A) Multiple-concentration  or  multiple-dose definitive tests.   In this
       type of test, the purpose is to determine if the biological response to a
       treatment  level differs  from the  response  of  the control.   Hypothesis
       testing-based endpoints, expressed as the NOEC and LOEC (or NOEL and
       LOEL),  are calculated by determining  statistically significant differences
       from the control.  The null hypothesis  is that no  difference exists among
       the mean (or median if nonparametric), control and treatment responses.
       The  alternative hypothesis is that the treatment(s) result in an  adverse
       biological   effect  relative to  the  control  sample.     Parametric  and
       nonparametric  analysis  of  variance  (ANOVA)  tests  and  multiple-
       comparison tests  are often appropriate for continuous data and  may  be
       appropriate for some categorical  data (rank, order, score).   Contingency
       table tests  are usually appropriate for categorical data.  Parametric tests are
       based on  normal  distribution theory and  assume that  the data within

                         Page 13 of 29

-------
treatments  are  a  random  sample  from   an  approximately  normal
distribution  and that the  error variance is  constant  among treatments.
These assumptions should be examined using appropriate tests, and data
transformations  (see  paragraph  (d)(2)(v)(A))  of  this  guideline  or
nonparametric techniques  should be used where the assumptions  are not
met. Where possible multiple comparison tests that restrict the number of
comparisons  made  should  be used.    Generally, the  more  powerful
multiple-comparison  tests  are those  which assume  a dose-response
relationship  in the  data.  When  the  assumption of a monotonic dose-
response  holds,  Williams'  and  Jonckheere's  test,  respectively,  are
examples of parametric and nonparametric tests that can be used.  When
the assumption of a monotonic dose-response fails, Dunnett's t-test and
either Steel's  many-one rank  test or the Wilcoxon rank sum test with
Bonferroni adjustment, respectively,  are examples  of  parametric  and
nonparametric multiple comparison tests requiring no assumption about
the dose-response but which  restrict comparisons of the treatments to a
control.  A measure of the sensitivity of the test, such as the minimum
significant   difference  (parametric   tests),   should   be   calculated.
Alternatively,  a calculation of the  number of replicates necessary to
achieve data quality objectives given  the actual measured test responses
and variability should be  made.  At  a  minimum, the percent  reduction
from the control for each treatment should be calculated.

(B) Types of decisions and errors.

       (1) Table 1 presents the two possible outcomes and decisions that
       can be  reached in the statistical hypothesis  tests discussed in
       paragraph (d)(3)(ii)(A) of this guideline:

              (a) There is no difference among the mean control  and
              treatment responses; or

              (b) There is a difference among the  mean control  and
              treatment  responses  (concerned with  direction,   where
              response is adverse relative to the control).

       (2) Statistical tests of hypothesis can be designed to control for the
       chances of making incorrect decisions.  The types of incorrect and
       correct decisions that can be made in a hypothesis-based test and
       the probability of making these decisions are represented in Table
       1. For multiple comparison tests the Type I error rate is controlled
       to account for multiple test comparisons.
                  Page 14 of 29

-------
       Table 1.—Types of Errors and the Probabilities of Making Correct and Incorrect
Decisions Based on the Results of Testing
Test Decision Outcome:
Treatment Response = Control
Response
Treatment Response < Control
Response
Actual (or True) Condition:
Treatment Response =
Control Response
Correct Decision
probability = 1- alpha (a)
Type I error (False positive)
probability = a
Treatment Response < Control
Response
Type II error (False negative)
probability = beta (p)
Correct Decision
probability (Power of test) = 1-p
                    (C) Power of the test.  Power of the test versus percent reduction in
                    treatment response relative to the control mean at various coefficients of
                    variation  is  provided in  the  reference  in  paragraph  (k)(23) of  this
                    guideline.  Examples are specifically given for 5  and 8 replicates for a
                    one-tailed test alpha  (a) of  0.05 and 0.10.  Effects on the number of
                    replicates at  various  coefficients of variation  are also provided in the
                    reference in paragraph (k)(23) of this guideline for various low a and beta
                    (P) values (i.e., a + P = 0.25). See also the references in paragraphs (k)(9)
                    and (k)(24) of this guideline.

                    (D) Limit test. In a limit test it is only necessary to ascertain that: a fixed
                    point standard (such as the £€25 or IC25 for terrestrial plants or ECso or
                    ICso for aquatic plants) is greater than a given "limit" concentration (Tier I
                    test for pesticides); and the response at a given "limit" concentration does
                    not differ from  the  control response.   Only one  treatment, the  "limit"
                    concentration, and the appropriate control(s) are tested.  This is referred to
                    as a limit test or maximum challenge concentration test.

                           (1) Fixed point standard.  For a fixed point limit test, the  null
                           hypothesis is that the limit treatment mean response as compared
                           to the control response  is greater than  or equal to a fixed point
                           response  of  concern (e.g., 25%   for  terrestrial  plants).    The
                           alternative hypothesis is that the limit treatment mean response as
                           compared to the control  response is  less than  the fixed point
                           response  of concern. (Concerned with direction, where response  is
                           inhibition relative to the control switch hypotheses around.)

                           (2) Difference between two means (or medians).  For testing  if
                           the treatment level affects the test organism, the null hypothesis  is
                           that the  treatment  mean (or  median) response  is equal to the
                           control response mean (or median)  level  and the alternative
                           hypothesis is that the treatment mean response differs from the
                           control response.   The  direction   of the  alternative hypothesis
                           depends on what is considered  an adverse direction for the specific
                           response  being evaluated, such as decreased survival, shoot height
                           and biomass  or increased mortality as compared to the  control
                           response.  Examples of parametric  and nonparametric two-group
                           comparison tests are Student's t-test and Wilcoxon rank sum test,
                           respectively.
                                     Page 15 of 29

-------
       (iv) Transformations, outliers, and non-detects—

              (A) Transformations.  Transforma-tion of data (e.g., square root,  log,
              arcsine-square root) may be  useful for a number of statistical  analysis
              purposes.  The two main reasons are to satisfy assumptions for statistical
              testing and to derive a linear relationship between two variables, so that
              linear  regression analysis  can  be  applied.   Added benefits  include
              consolidating data that may be spread out or that have several  extreme
              values (see reference  in paragraph (k)(24) of this guideline).  Once the
              data have been transformed, all statistical  analyses are performed on the
              transformed data.

              (B) Outliers. Outliers are measurements that are extremely large or small
              relative to  the  rest  of  the data  and,  therefore,  are suspected  of
              misrepresenting the  population from which they were collected.  Unless
              there  is  a  known  documented  reason  for  the  outlier(s),   such  as
              measurement system  problems  or instrument breakdown, the statistical
              analyses performed  should at a minimum include results using the full
              data set (i.e.,  the suspected outlier(s) are not discarded).  Outliers should
              not be discarded based on  a  statistical  outlier test  (see  reference in
              paragraph  (k)(24) of this  guideline).   The  analyst  may  conduct  all
              statistical analysis of  the data with both a full and truncated  (presumed
              outliers are discarded) data set, however, so that the effect of the presumed
              outlier(s) on the conclusion may be assessed.

              (C) Nondetects.  Data generated from chemical  analysis that falls below
              the LOD  of the analytical  procedure  are  generally  described as not
              detected,  or  nondetects, (rather than as  zero or not present)  and the
              appropriate LOD should be  reported.  There are a variety of  ways to
              evaluate data that include  both detected  and non-detected values (see
              reference  in  paragraph  (k)(24) of  this  guideline).   However,  for  a
              satisfactory test  in a number of the  Group D guidelines, test substance
              concentrations should not be below the LOD (see specific OCSPP Series
              850, Group D guidelines), except in controls.

(3) Selection of test treatments—

       (i) Point estimate and concentration-response or dose-response test.  Toxicity
       tests where the objective is  the concentration-response curve and a specific point
       on the curve  (e.g.,  1C50) usually consist of one or  more control treatments and at
       least five test treatments which should bracket the  specific point(s) of concern for
       the test.  To obtain a reasonably precise estimate of the ICso using probit analysis
       for example,  one or more treatments  should be between, but not including, 0 and
       50% and one or more treatments should be between, but not including, 50 and
       100%. Where the objective is to derive the  concentration-response  curve and to
       determine more than one specific point response on the curve (e.g., IC25 and ICso),
       the use of additional treatment levels to ensure that both point response values are
       each bracketed is encouraged. The spacing between test treatments depends upon

                               Page  16 of 29

-------
the expected slope of the concentration- or dose-response curve, information
about which can be gained during a range-finding test. The minimum geometric
ratio between doses is 2. The maximum geometric ratio between doses should be
4, with adequate justification documented for using a ratio of 4. When the
objective of the test is to determine a regression-based estimate and sample size
constraints apply, the use of more treatment levels is preferable to the use of more
replicates. The inclusion of additional treatment levels rather than additional
replicates results in better characterization of the overall concentration- or dose-
response relationship.

(ii) Hypothesis-based test—

(A) Multiple-concentration or -dose definitive test. Each test usually
consists of one or more control treatments and at least five test treatments
which span the expected environmental concentrations and where at least
the lowest treatment level is the NOEC (or NOEL). The test treatments
are usually selected in a geometric series in which the ratio is minimally 2
to 4. A key assumption is that the response data are monotonic with
increasing concentration or dose (i.e., the degree of biological effect
increases as treatment concentration increases) or that there is a threshold
response such that a NOEC (or NOEL) for a given biological response
should not occur at a treatment concentration higher than one found to be
statistically different from the control for the given biological response.
Where these assumptions do not hold it is recommended that additional
concentrations be included to better characterize the relationship of the
biological response with exposure concentration. If the failure is
suspected to be due to high variability in a given response measurement,
the number of replicates should be increased.

(B) Limit test. A limit test consists of a single treatment level and the
appropriate control(s). Individual OCSPP Series 850 Group D guidelines
identify the concentration or dose that satisfies the limit treatment level
test for that guideline.

(4) Randomization. For test results to be satisfactory test treatments should be randomly
assigned to individual test chambers or the group of test chambers constituting a replicate
and the test chambers or replicates randomly assigned to locations. When practically
feasible, the locations should be randomly reassigned during the test. Randomized block
designs may be desirable for terrestrial plant tests, completely randomized designs may
also be used. For test results to be satisfactory, test organisms (for example, seeds or
seedlings) should ideally be randomly assigned to the test containers; where this is not
practical impartial assignment can be used. (Note: random assignment as used here
implies a mathematically-based unbiased assignment method and impartial assignment
implies a non mathematically-based unbiased assignment procedure.) All test chambers
should be treated as similarly as possible to eliminate potential bias in test results. The
methods used to randomize treatments among test chambers and test chambers among
locations should be recorded, as well as methods of impartial organism assignment to test
chambers.

Page 17 of 29

-------
(5) Number of replicates. The number of replicate test chambers for a given treatment
is dependent upon the objective of the specific guideline test. Except for field tests which
are designed on a case-by-case basis, the minimum number of replicates for a given test
is described in each individual OCSPP Series 850 Group D guideline. The number of
replicates selected should yield results that are statistically significant at the 90 to 95%
level of confidence with a significance level of less than 0.10. The sample size and
number of replicates for each tested plant species in the tiered test scheme should be
sufficient to calculate the IC25 or ICso (progression criteria for terrestrial and aquatic
plants, respectively) and to detect differences of concern when treatments are compared
to controls. For terrestrial plant tests, a replicate or experimental unit will usually consist
of more than one test chamber (pot or flat) because, for a satisfactory test, plants should
not be crowded in the test chambers. For a satisfactory test, the group of pots or flats
constituting an experimental unit or replicate should be kept together throughout the test
duration. For tests with algae, cyanobacteria, and aquatic vascular plants, the test
chamber constitutes the experimental unit or replicate. All test chambers within a
replicate and all replicates within a treatment should be treated as similarly as possible.
Except for field tests which are designed on a case-by-case basis, the minimum number
of replicates for a given test described in each individual OCSPP Series 850 Group D
guidelines.

(i) Regression-based test. When the objective of the test is to determine a
regression-based estimate and sample size constraints apply, the inclusion of
additional concentrations rather than additional replicates results in better
characterization of the overall concentration-response relationship. For pesticides
the objective of some OCSPP Group D guideline tests includes determination of
both a regression-based point estimate (e.g., K^s) and a hypothesis-based
endpoint (e.g., NOEC) in which case the minimum number of replicates will be
determined by the hypothesis-based method.

(ii) Hypothesis-based test. For hypothesis-based tests, the determination of the
test-specific number of replicates depends upon the objectives of the test, the
statistical method(s) that may be used, the coefficient of variation, the size of
effect to be detected, and the acceptable error rate. (Note: several of the non-
parameteric multiple-comparison tests recommended cannot be performed
without at least a minimum of 4 replicates). Individual testing facilities should
consider variability observed in their laboratory and adjust the number of
replicates upward where the minimum replication number identified in the test
specific guideline is not sufficient to provide the statistical power to detect
adverse effects to the test organisms or, if appropriate, identify and correct any
environmental, handling, and culturing conditions, etc. that are resulting in the
high variability.

(6) Controls. Control groups are used to ensure that effects observed are associated with
or attributed only to the test substance exposure. In phytotoxicity evaluations, all treated
plots, plants, and commodities should be compared directly to untreated control plots,
plants, and commodities in order for the study to be considered satisfactory. The
appropriate control group should be similar in every respect to the test group except for
exposure to the test substance. Within a given study, all test organisms including the

Page 18 of 29

-------
controls should be from the same source. To prevent bias, a system of random assignment
of the test plants to test and control groups should be used for an acceptable test. (In field
trials, it is useful to have one set of replicates assigned, in adjacent plots, to a control and
in order from lowest concentration to highest concentration for visual comparison, with
the remaining replicates randomly assigned). Where a carrier, vehicle, or adjuvant other
than water is used, appropriate experiments and controls should be included to
distinguish the possible action of the carrier, vehicle, or adjuvant. Untreated control
(check) plots should be treated and evaluated in the same manner as the treatment plots
with respect to other pesticides or chemical (fertilizers, etc} and cultural practices. A
vehicle control (solvent control) is also tested if a vehicle was used to prepare the test
substance. To demonstrate satisfactorily that the vehicle has no unacceptable effect, the
highest concentration of the vehicle that was added to any of the test chambers is used in
the vehicle control. It is recommended that the vehicle concentration be the same at each
treatment level. If either the control or vehicle control results are not satisfactory for the
test, indicating problems with test organisms or test procedures, the test results should be
considered unacceptable. If both the control and the vehicle control results verify test
organism health and status, the control and vehicle control results are compared using an
appropriate statistical method to determine if there is an effect of the vehicle on the test
organisms. If there is a statistically significant difference between the control and the
vehicle control, indicating either a positive or negative vehicle effect for any of the
measured response variables using an a-level of 0.05, the study may be considered
unacceptable.

(e) Test substance characterization—

(1) Background information on the test substance. The information in paragraphs
(e)(l)(i) through (e)(l)(vi) of this guideline should be known about the test substance
prior to testing:

(i) Chemical name; CAS number; molecular structure; source; lot or batch
number; purity and/or percent a.i.; identities and concentrations of major
ingredients and major impurities; radiolabeling if any, location of label(s), and
radiopurity; date of most recent assay and expiration date for sample.

(ii) Appropriate storage and handling conditions for the test substance to protect
the integrity of the test substance. (Note: health and safety precautions should
also be known. These considerations are beyond the scope of these guidelines
and depend upon the characteristics of the test substance).

(iii) Physical and chemical properties of the test substance, including solubility in
water and various solvents; vapor pressure; hydrolysis at various pH; pKa; soil
sorption behavior, etc. Of particular relevance are rates for processes such as
hydrolysis, photolysis, and volatilization.

(iv) Stability, and if relevant, solubility, under the test conditions (refer to OCSPP
850.1000 guideline).

(v) Physical and chemical properties and stability information for the analytical
standard (if applicable).
Page 19 of 29

-------
       (vi) Analytical method for quantification of the test substance in the test solutions,
       test matrix (support medium) or in the dosing solutions.  Analyses are conducted
       with the specific  media  which will be used during the test; i.e.,  under test
       conditions.

(2) Preliminary analyses.

       (i) The Agency recommends preliminary testing  of the  test substance.   The
       information about  stability  and  solubility  of the  test  substance  should be
       developed under actual test conditions.  This information can be gained while
       doing the range-finding studies.

       (ii) Information on  the  behavior of  a test  substance  should  be based on
       experiments conducted under the same conditions as those occurring during the
       test.  These include but are not limited to:

              (A) Test matrix characteristics (e.g., support medium, culture medium,
              soil, etc}.

              (B) Temperature, humidity, lighting, etc.

              (C) With test organisms in place (when practical).

              (D) Use of the same test containers.

       (iii) The tests in paragraphs (e)(2)(iii)(A) through  (e)(2)(iii)(D) of this guideline
       should be performed:

              (A) Stability trials conducted under actual test conditions.

              (B) If relevant, solubility trials conducted under test conditions.

              (C) Chemical analysis methods as  detailed in paragraph  (g)  of this
              guideline.

              (D) Determination of storage stability of the test substance in the samples
              to be collected for  chemical analyses should be determined. This includes
              determining whether and how samples can be stored for future analysis.

(3) Sample storage.  If samples of the treatment solutions  or other exposure matrices
collected for chemical analysis cannot be analyzed immediately, they should be handled
and stored appropriately to minimize loss of the test substance.  Loss could be caused by
such processes as microbial degradation,  hydrolysis, oxidation,  photolysis, reduction,
sorption, or volatilization.  Stability determination under  storage conditions, whether it
refers to storing the test substance before testing or storing samples awaiting analysis, is
required by GLP regulation.  Test substance stability under storage conditions  should be
documented.
                               Page 20 of 29

-------
(4) Analytical test substance determinations.

(i) For terrestrial tests, solutions to be applied should be measured at test initiation
for a satisfactory test. See specific guidelines for any additional testing.

(ii) For aquatic tests see the OCSPP 850.1000 guideline for guidance.

(iii) For field tests, media and frequency of testing depends on the objective of the
study and is determined on a case-by-case basis.

(5) Application of test substance. For a satisfactory study, application from equipment
should be made from lowest concentration to highest concentration so as to minimize
residue carryover.

(6) Mode of action. When they have been determined, the primary and secondary modes
of action with respect to plant morphogenic and biochemical levels should be reported.

(f) Preparation of test substances.

(1) Procedures for test substance preparation for studies with algae, cyanobacteria, and
aquatic vascular plants are addressed in the OCSPP 850.1000 guideline. In terrestrial
plant toxicity tests, exposure to the test substance is typically accomplished through
application of the test substance on soil (or other suitable growth media matrix) or to the
leaves and other above-ground portions of the plants. Test substance may be added by
weight or prepared in a concentrated stock solution. For pesticides, the testing of typical
end product (TEP) in plant tests generally obviates the need for addition of a vehicle
since a formulation may already contain a vehicle (see paragraph (f)(3) of this guideline).

(i) The preferred choice for preparation of a stock solution is to use reagent water
(deionized, distilled or reverse osmosis water), providing test substance can be
dissolved in water. If the test substance cannot be dissolved in reagent water,
vehicles are often used. If a vehicle, i.e. a solvent, is absolutely necessary to
dissolve the test substance, the amount used should not exceed the minimum
volume necessary to dissolve the test substance. If the test substance is a mixture,
formulation or commercial product, none of the ingredients is considered a
vehicle unless an extra amount is used in its preparation for testing.

(ii) If a vehicle is used to prepare the test substance, a vehicle control is also
included in the test, in addition to the untreated (negative) control. The same
batch of vehicle (solvent) control used to prepare the test treatments is used in the
vehicle control. For a valid test, the selected vehicle should not affect the test
plants at the concentration used. A vehicle should not interfere with the
metabolism (degradation) of the test substance, alter the chemical properties of
the test substance, or produce physiological or toxic effects to plants.

(2) All techniques used in stock solution preparation (shaking, stirring, sonication,
heating, solvent, etc) should be recorded. The appearance of the stock solution should be
observed and recorded.

Page 21 of 29

-------
(3) Generally, for terrestrial plant phytotoxicity testing of pesticides, the TEP is the actual
substance tested. The submitter should consult with the Agency if any form of the test
substance other than TEP is to be tested. Aquatic plant studies may be conducted using
the TEP or technical grade of the a.i.; however, if an overspray exposure is used for
emergent or floating aquatic vascular plants, the test substance should be the TEP. If an
adjuvant is recommended on the product label, representative adjuvants should be
included in the test at the recommended dosage for an acceptable test. The TEP selected
for testing should be the one with the highest percentage a.i. and/or the one most widely
used. The use of TEP testing should eliminate the need for a separate vehicle control as
the vehicle is part of the formulation. An untreated (negative) control is still included in
the test.

(g) Analytical methods and sampling for verification of exposure—

1) Method validation.

(i) The analytical method used to measure the test substance should be validated
before beginning the definitive test by appropriate laboratory practices. An
analytical method is not acceptable if likely degradation products of the test
substance give positive or negative interferences which cannot be systematically
identified and mathematically corrected, unless it is shown that such degradation
products are not present in the test system during the test.

(ii) Method validation is conducted for the purpose of determining the linear
range, detection limit, accuracy and precision (repeatability and reproducibility)
of the method for analysis of the test substance under the conditions of the test.
Thus, quality control (fortification) samples should be prepared at concentrations
spanning the range of concentrations to be used in the definitive test, using the
same procedures (vehicles, etc) and in the same matrix (water, soil, etc)
representative of what will be used in the test.

(iii) The method validation should include a determination of linearity between
detector response and test substance concentration, the LOQ, the MDL, accuracy
(average percent recovery) and precision (relative standard deviation). The
method validation should establish the acceptance criteria for the quality control
(QC) samples that will be prepared and analyzed during the test.

(2) Collection of samples. Samples should be collected in such a manner as to provide
an accurate representation of matrix being sampled. For toxicity tests with algae,
cyanobacteria and aquatic vascular plants, these will be aqueous samples and the OCSPP
850.1000 guideline should be consulted. For terrestrial plant toxicity tests, the samples
will generally consist of the dosing solutions, other dosing matrix, or the support
medium. Analyses, if conducted, are performed to confirm the initial concentration of the
test substance applied. Samples should be processed and analyzed immediately, or
handled and stored in a manner which minimizes loss of test substance through microbial
degradation, photodegradation, chemical reaction, volatilization, sorption or other
processes.
Page 22 of 29

-------
       (3) Analysis  of test samples.  Concurrent with each analysis of test samples, quality
       control (fortified) samples should be analyzed. Quality control samples are prepared by
       adding known amounts of the test  substance to the test matrix.  Minimally, one quality
       control sample  should be at the low end of the test concentration range and one quality
       control sample at the high end. A control (zero-level fortification) sample should also be
       included.   Test sample  recoveries  may be  corrected for  inherent method bias as
       determined from concurrent analysis of freshly fortified quality control samples.

(h) Reference toxicants.  Historically,  reference toxicity testing has been thought to provide
three types of information relevant to the interpretation of toxicity test data: first, an indication of
the relative "health" of the organisms used  in the test; second, a demonstration that the laboratory
can perform  the  test procedure  in a reproducible manner over  a  period  of time; and  third,
information to indicate whether  the  sensitivity  of a particular strain or population in use at  a
laboratory is  comparable to that of those used in other facilities and how sensitivity varies over
time.  However, performance of control organisms over time may be a better indicator of success
in handling and testing of at least some organisms.   Nonetheless,  periodic reference toxicant
testing can provide  an indication of the  overall comparability of results within and among
laboratories.  Although a positive control is not standard for each test, a quarterly or semiannual
positive control  (on a guideline-specific  basis) can  serve as  a  means of detecting possible
interlaboratory or temporal variation. A reference toxicant might also be desirable when there is
any significant change in source or maintenance of test organisms or in other test conditions.

(i) Monitoring of test conditions.  Test conditions are specified in each test-specific guideline in
the OCSPP  Series 850 Group D.   These conditions include  environmental  factors such as
temperature, humidity, and lighting.  Methods used for monitoring test conditions should be in
accordance with established methods (e.g., those published by the U.S. EPA, ASTM, APHA et
al., etc. (see  paragraphs  (k)(l), (k)(2), (k)(3) of this  guideline).  Environmental conditions of
specific field  sites should be recorded daily throughout the duration of the test.

       (1) Temperature.  For greenhouse,  growth  chamber and laboratory toxicity tests or
       studies it  is  desirable that temperature be monitored continuously (recorded at least
       hourly). Alternatively, the maximum and minimum should be measured daily (which is  a
       minimum  of  at least two measurements during each 24 hour period during the study).
       Temperature  measurements should be made in at least one representative location. For
       field experiments the maximum and minimum  air temperature should be measured daily.

       (2) Humidity.  Where applicable, humidity should be monitored continuously in at least
       one representative location.

       (3) Lighting.  Guidance for lighting in greenhouse, growth chamber  and laboratory
       toxicity tests  or studies can be found in the references given  in paragraphs (k)(3), (k)(5)
       and (k)(19) of this guideline.  The photoperiod  (day length) should be  recorded and  a
       photosynthetically active  radiation  (PAR) sensor (measures light energy in the 400-700
       nanometer wavelength range) should be used  to  measure light quality in micromoles of
       photons per  square meter  per second  (|imol/m2/s).    For  field experiments, the
       approximate light quantity (usually expressed in degree of cloudiness) and photoperiod
       (day length) should be observed and recorded.  Frequency of light quality monitoring for
       a test is detailed in each individual OCSPP Series 850 Group D guideline.

                                      Page 23 of 29

-------
       (4) Biological observations.

              (i) Where test substances are applied in the field, the effects of that test substance
              on nontarget plants in the system and along the immediate border should be
              observed and  recorded, including plant vigor, phytotoxicity  or other visible
              symptoms,   and  delay  or  acceleration  of  vegetative  growth, flowering or
              sporulation, and maturation.

              (ii)  Monitor all  variations, either inhibitory or stimulatory, between the treated
              test organisms  and the  untreated control  test  organisms for toxicity  tests
              conducted in greenhouses,  growth chambers or laboratories.  Such variations may
              be due to phytotoxicity (chlorosis, necrosis, and wilting), formative effects (leaf
              and stem deformation), and/or effects on growth or development rates.

              (iii) Uniform scoring procedures should be used to evaluate the observable toxic
              responses.

              (iv) At  least two methods of evaluation (such as quantitative and qualitative
              determinations) should be used in the evaluation of test  substance effects on
              growth,  reproduction, and yield of plants in  greenhouse and controlled growth
              chamber experiments.  When direct measurements cannot be made, such as in
              large field evaluations, a O-to-100 or O-to-10 rating scale should be used, where 0
              indicates no injury and 100 or 10 indicates a  total effect or kill produced by the
              test substance.  An explanation of the steps of the rating scale employed should be
              included with the report.
(j) Reporting—
       (1) Background information. In addition to the reporting requirements prescribed in the
       Good Laboratory Practices Standards (40 CFR part 792 and 40 CFR part 160), the report
       should include the information in paragraphs (j)0)(i) through (j)0)(v) of this guideline:

             (i) Test facility (name and location),  study dates, and personnel.  If conducted
             outside of a laboratory or greenhouse, report the geographic location and describe
             the relation of this location to the occurrence  or culture of the test species in the
             surrounding area.

             (ii) The name of the sponsor, study director, principal investigator, names of other
             scientists or professionals, and the names of all supervisory personnel involved in
             the study.

             (iii) Raw data sufficient to allow independent confirmation of the study authors'
             conclusions should be presented  with the study report.  Raw data includes all
             measurements recorded  during the study  including, but not limited to,  effects
             (mortality, growth, etc),  environmental conditions (temperature,  etc) and test
             substance concentration  or dose measured as  specified and are necessary for the
             reconstruction and evaluation of the report of that study.  The absence of raw data
             may make the study incomplete and impossible to review for scientific soundness
             and thus can lead to rejection of the study as scientifically sound.
                                      Page 24 of 29

-------
       (iv) The signed and dated reports  of each of the  individual scientists or other
       professionals involved in the study, including each  person who, at the request or
       direction of the testing facility or sponsor, conducted an analysis or evaluation of
       data or specimens from the study after data generation was completed.

       (v) The locations where all raw data and the final report are stored.

       (vi) The  statement prepared and signed by the quality assurance unit identifying
       whether  or not the study was conducted in compliance  with Good Laboratory
       Practices Standards (40 CFR part  792 or 40 CFR part  160). Alternatively the
       statement can  indicate it was  conducted under  OECD  Principles of Good
       Laboratory Practice, in accordance with the multilateral  agreement with  OECD
       member countries.

(2) Data elements. The test report should provide a complete and accurate description of
test procedures and evaluation of test results including but not limited to the material in
paragraphs (j)(2)(i) through (j)(2)(xv) of this guideline.

       (i) Objectives and  procedures stated  in  the  approved protocol,  including  any
       changes or deviations or occurrences which may have influenced the results of the
       test.

       (ii) Identification of the test substance (including source,  lot, batch number,  and
       purity), and known physical and chemical properties that are pertinent to the test.
       Provide  the  physical  state,  water solubility (if  relevant), pH,  stability  and
       degradation properties under test conditions and stability under storage conditions
       if stored prior to use  or prior to sample analysis.   Where appropriate a cross-
       reference to OCSPP  Series 830  (Product Properties  Test Guidelines) guideline
       study results can be used to report this data.

       (iii) Methods of preparation of the test substance and the concentrations or doses
       used  in limit  and definitive testing.  If vehicles  and adjuvants  (e.g. diluents,
       suspending agents, and emulsifiers) are used, the name, composition, and source
       of the vehicle and adjuvant, the nominal concentration of the test substance in the
       carrier, and the vehicle concentration(s) used in the test.

       (iv) Information about the test organisms (common  name, genus, species,  family,
       and cultivar or variety), a statement providing the  rationale for selection of the
       particular species used in the test, source, and life stage.

       (v) A description of the test system used in definitive, limit or any preliminary
       testing  this  includes  a  description  of  substrate   source,  properties  and
       characteristics;  description of the experimental unit  (test container, pots, flats,
       field  plot,  etc.), including number of test organism  per  test container and per
       experimental unit; number of replicates per treatment; controls; and method of
       test substance application including dosage rates or spray volume per unit area,
       application equipment (type,  nozzle,  orifice, pressure),  time (season, stage of
       growth) and number  of applications, tank mixture (if applicable), adjuvants (if
       applicable).
                                Page 25 of 29

-------
(vi) Results of measurements of test substance.  All analytical procedures should
be  described  and  documentation provided  such  that  a bench chemist  can
independently determine what equipment to use and perform the analysis.  Also
include the raw data, standards, quality control samples, and chromatograms from
samples taken during either definitive or range-finding tests, not of standard or
samples from recovery tests.  For a satisfactory test, the accuracy of the method,
LOD, MDL, and LOQ should be given.

(vii) Exposure may be expressed as units of test substance per unit of land area
treated (e.g for terrestrial plant studies,. Ib a.i./A), units of concentration  (e.g.,
milligrams per liter for aquatic plant studies), units per flow rate,  or units of test
substance per unit volume applied to obtain a specified degree of foliage coverage
(such as to runoff).

(viii) For  pesticides, test substance concentrations or dosages should be recorded
in units of a.i. or the acid equivalent of the a.i. as appropriate.

(ix) If a product is applied more than once within a year or growing season, each
rate and the interval between applications should be recorded.

(x)  Methods,  frequency, and  results  of environmental  monitoring  performed
during  the  study  (air  temperature,   light  quality,  humidity,  photo-  and
thermoperiods, etc.) and other records of test conditions such as description of
watering schedules and any cultural practices during the test (if applicable).

(xi)  Biological observations  should be reported  in  sufficient  detail to  allow
complete  independent evaluation of the results (see specific test guidelines in this
group for specific measures of effect).   For target area applications include a
description of the stage of growth or development of nontarget plants within or
adjacent to the target area.

(xii) The stage of plant development  and study  dates  when  adverse results
occurred  and  subsided (if plants  recovered) should be recorded.  Any lack of
effects by the test substance should also be recorded.

(xiii) All data developed during the study that is suggestive or predictive of toxic
effects and all concomitant gross toxicological manifestations.

(xiv) Calculated endpoints and a description of all statistical methods, including:
software used, handling of outlier data points, handling of non-detect or zero
values, tests to validate the assumptions of the analyses, level of significance, any
data transformations, for hypothesis tests  a measure of the sensitivity of the test
(either the minimum significant difference or the percent change from the control
that this minimum difference represents). Raw data should be reported to allow
independent verification of statistical procedures.

(xv) Methods used  for  test chamber and treatment  randomization as well as
methods for random or impartial assignment of test organisms to test chambers.

                         Page 26 of 29

-------
(k) References. The references in this paragraph should be consulted for additional background
material on this test guideline.

       (1) American Public Health  Association, American Water Works Association,  Water
       Environment Federation, 1998.  Standard Methods for the Examination of Water and
       Wastewater, 20* edition. Part 8010, Toxicity: Introduction.

       (2) American Society for Testing and Materials, 2002.  ASTM E 1963-02.  Standard
       guide for conducting terrestrial plant toxicity tests.  In Annual Book of ASTM Standards,
       Vol. 11.06, ASTM, West Conshohocken, PA.  Current edition approved December  10,
       2002.

       (3) American Society for Testing and Materials, 2002.  ASTM E 1733-95.  Standard
       guide for the use of lighting in laboratory testing. In Annual Book of ASTM Standards,
       Vol. 11.06, ASTM, West Conshohocken, PA.  Current edition approved September  10,
       1995; Reapproved 2002.

       (4) American Society for Testing and Materials, 2003.  ASTM E 1847-96.  Standard
       practice for statistical analysis of toxicity tests conducted under ASTM guidelines.  In
       Annual_Book of ASTM  Standards, Vol.  11.05, ASTM, West Conshohocken, PA. Current
       edition approved December 10, 1996; Reapproved 2003.

       (5) Anon., 1981. Photosynthetically active radiation.  In: Physiological Plant Ecology.
       Lange, O.L.,  P. Nobel, B. Osmond, and H. Ziegler, eds. Vol. 12A, Encyclopedia of Plant
       Physiology. Springer-Verlag. Berlin, Heidelberg, New York.

       (6) Bruce, R.D. and DJ. Versteeg, 1992.  A statistical procedure for modeling continuous
       toxicity data. Environmental Toxicology and Chemistry 11:1485-1494.

       (7) Chapman, G.A., B.S. Anderson, AJ. Bailer, R.B. Baird, R. Berger, D.T. Burton, D.L.
       Denton, W.L. Goodfellow, M.A. Heber, L.L. McDonald,  TJ. Norberg-King  and PJ.
       Ruffier, 1996. Methods and appropriate  endpoints.  In Whole Effluent Toxicity Testing,
       D.R. Grothe,  K.L. Dickson and D.K. Reed-Judkins, eds., SETAC Press, Pensacola, FL.

       (8) deBruijn, J.H.M. and M.  Hof,  1997. How to measure no effect.  Part IV: How
       acceptable is the ECX  from an environmental policy point of view?   Environmetrics
       8:263-267.

       (9) Fairweather, P.G., 1991.  Statistical power and design requirements for environmental
       monitoring. Australian Journal of Marine Freshwater Research 42:555-567.

       (10) Nabholz, J.V.,  1991. Environmental hazard and risk  assessment under the Toxic
       Substances Control Act. Science of the Total Environment, 109/110: 649-665.

       (11) Nabholz, J.V., P. Miller and M. Zeeman,  1993.  Environmental risk assessment of
       new chemicals  under  the  Toxic Substances Control  Act (TSCA)  Section  5.   In
       Environmental Toxicology and Risk Assessment, Landis, W.G., Hughes, J.S., and Lewis,
       M.A.,  eds., ASTM STP  1179, American  Society for Testing and Materials, Philadelphia,
       PA, pp. 40 -55

                                     Page 27 of 29

-------
(12) Nyholm, N., P.S.  Sorenson, K.O. Kusk,  and E.R.  Christensen,  1992.  Statistical
treatment  of data from  microbial  toxicity tests.   Environmental  Toxicology  and
Chemistry 11:157-167.

(13) Organization for Economic Co-operation  and Development, 1998.   Report of the
OECD Workshop on Statistical Analysis of Aquatic Toxicity Data.   OECD Series on
Testing and Assessment, No. 10. ENV/MC/CHEM(98)18

(14) Organization for  Economic Co-Operation  and Development,  2006.   Current
Approaches in the Statistical Analysis of Ecotoxicity Data: A Guidance to Application.
OECD Series on Testing and Assessment, No. 54. ENV/JM/MONO(2006)18

(15) Pack, S.,  1993.  A review of statistical data  analysis and experimental design in
OECD aquatic toxicology test guidelines. Report to OECD. Paris.

(16) Smrchek,  J.C.,  R. Clements, R.  Morcock,  and W.  Rabert, 1993.   Assessing
ecological hazard under TSCA: methods  and evaluation of data, In  Environmental
Toxicology_and Risk Assessment, Landis, W.G., Hughes, J.S., and Lewis, M.A., eds.,
ASTM STP 1179, American Society for Testing and Materials, Philadelphia, PA, pp. 22-
39.

(17) Smrchek, J.C. and M.G. Zeeman, 1998.  Assessing risks to ecological systems from
chemicals. In Handbook of Environmental Risk Assessment and Management, P. Calow,
ed., Blackwell Science, Ltd., Oxford, UK, pp. 24-90, Chapter 3.

(18) Stephan, C.E., 1997.  Methods for calculating an LCso-  In Aquatic Toxicology and
Hazard Evaluation, ASTM  STP 634, F.L.  Mayer and J.L. Hamelink, eds., American
Society for Testing and Materials, Philadelphia,  PA.

(19) Thimijan, R.W., and R.D. Heins, 1982.  Photometric, radiometric and quantum light
units of measure: a review of procedures for interconversion. HortScience 18:818-822.

(20) U.S.  Environmental Protection  Agency, 1982.  Pesticide Assessment Guidelines
Subdivision J, Hazard Evaluation: Nontarget Plants.   Office of Pesticides and Toxics,
Washington, D.C. EPA-540/9-82-020, October  1982.

(21) U.S. Environmental Protection Agency, 1994.  Pesticides Reregi strati on Rejection
Rate  Analysis:  Ecological  Effects.   Office   of Prevention,  Pesticides  and  Toxic
Substances, Washington, D.C.  EPA 738-R-94-035.

(22) U.S. Environmental Protection Agency, 1997.  Terms of Environment, Glossary,
Abbreviations,  and  Acronyms.   Communications, Education, and  Public  Affairs,
Washington, D.C. EPA 175-B-97-001.

(23) U.S. Environmental Protection Agency, 2000.  Methods for Measuring the  Toxicity
and   Bioaccumulation   of   Sediment-Associated   Contaminants  with  Freshwater
Invertebrates, Second Edition.  Office of Water, Washington, D.C. EPA 600/R-99/064,
March 2000.
                              Page 28 of 29

-------
(24) U.S.  Environmental Protection  Agency,  2000.   Guidance for Data  Quality
Assessment,  Practical  Methods  for  Data  Analysis.    EPA  QA/G9.    Office  of
Environmental Information, Washington, DC.  EPA/600/R-96/084, July.

(25) U.S.  Environmental  Protection Agency, 2002.  Methods for measuring the acute
toxicity of effluents and  receiving waters to freshwater  and marine organisms.  Fifth
edition, Office of Water, Washington, D.C. EPA 821-R-02-012

(26) U.S.  Environmental  Protection Agency,  2002.  Short-term methods for estimating
the chronic toxicity of  effluents and receiving waters to  freshwater organisms.  Fourth
edition, Office of Water, Washington, DC. EPA 821-R-02-013

(27) U.S.  Environmental  Protection Agency,  2002.  Short-term methods for estimating
the chronic toxicity of effluents and receiving waters to marine and estuarine organisms,
Third edition, Office of Water, Washington, DC. EPA 821-R-02-014.

(28) U.S.  Environmental  Protection Agency,  Code  of Federal Regulations (CFR) Title
40—Pesticide   Programs  Subchapter  E—Pesticide  Programs.    Part   158—Data
Requirements for Pesticides.

(29) VanEwijk, P.H. and J.A. Hoekstra, 1993.  Calculation of the ECso and its confidence
interval when a subtoxic  stimulus is present.  Ecotoxicology and Environmental Safety
25:25-32.

(30) Zeeman, M. and J.  Gilford,  1993. Ecological hazard  evaluation and risk assessment
under EPA's Toxic Substances Control Act (TSCA): an introduction. In Environmental
Toxicology and Risk Assessment, Landis, W.G.,  Hughes, J.S., and Lewis,  M.A., eds.,
ASTM STP 1179, American  Society for Testing and Materials, Philadelphia, PA,  pp. 7-
21.

(31) Zeeman, M.G., 1995. Ecotoxicity testing and estimation methods developed under
Section 5  of the Toxic Substances Control Act (TSCA), In Fundamentals of Aquatic
Toxicology,_2n  Edition, G.M. Rand, ed., Taylor and Francis, Washington, DC, pp. 703-
715.
                              Page 29 of 29

-------