United States Environmental Protection Agency Environmental Research Laboratory Gulf Breeze FL 32561 Research and Development EPA/600/S3-87/016 July I987 Project Summary Field Validation of Multi- Species Laboratory Test Systems for Estuarine Benthic Communities Robert J. Diaz, Mark Luckenbach, Sandra Thornton, Morris H. Roberts, Jr. Robert J. Livingston, Christopher C. Koenig, Gary L. Ray, and Loretta E. Wolfe The major objective of this project was to determine the validity of using multi-species laboratory systems to evaluate the response of estuarine benthic communities to an introduced stress. Over a 5-year period, experi- ments in Apalachicola Bay, Florida, and the York River, Virginia, sought to (1) develop criteria for microcosm tests to evaluate the capacity of microcosms to model natural communities in the presence and absence of pollution- induced stress, and (2) assess the validity of extrapolating test results from one location to another. Individual species response patterns in the micro- cosms were highly variable and seldom showed good agreement with patterns in the field. Species richness in the microcosms and field sites showed good temporal agreement and provided a conservative indicator of community response to a toxic stress. An ecolog- ically based guild approach to grouping species proved to be a powerful and reliable method of extrapolating from microcosm test results to responses of field communities. This Project Summary was deve- loped by EPA's Environmental Research Laboratory, Gulf Breeze, FL, to announce key findings of the research project that is fully docu- mented in a separate report of the same title (see Project Report ordering information at back). Introduction Objectives The principal goal of environmental toxicology is to understand pollution- induced changes in ecological systems, primarily in an effort to predict the environmental consequences of a toxi- cant. Recently, emphasis has been placed on using multi-species laboratory systems to evaluate the response of aquatic ecosystems to an introduced stress. The assumption has generally been made that multi-species systems provide more realistic estimates of the effects of toxicants on complex natural ecosystems than do single species tests. A requisite part in the development of a laboratory multi-species aquatic test system is field verification. The authors define field verification as the testing of the capacity of specific laboratory test systems to predict the responses of ecosystems to toxicants. The process of field verification raises methodological considerations concerning criteria for conducting tests and interpreting data. An essential part of this process is simultaneous investigation of commun- ity dynamics in the laboratory test system and in the natural communityfrom which it was derived. In a 5-year project designed to field verify multi-species laboratory systems (microcosms) for use with estuarine benthic communities, four major issues were addressed: (1) development of criteria for conducting microcosm tests ------- and for interpreting the data; (2) evalua- tion of the capacity of a microcosm to track natural field communities in the absence of a toxicant; (3) comparison of community response patterns of labor- atory and field communities to a pollution-induced stress; and (4) deter- mination of the validity of extrapolating from microcosm tests conducted in one location to natural communities in another. Focus has been on the infaunal macroinvertebrate communities from unvegetated, soft-sediment sites in Florida (Apalachicola estuary) and Virgi- nia (York River estuary). The approach was to synoptically conduct field and laboratory experiments at both locations over different seasons. Experimental Approach The study sites in the Apalachicola Bay system (East Bay and St. George Sound) were located in polyhaline and oligoh- aline areas, and those in the York River were located in the meso-polyhaline portion of the estuary. All sites were shallow (1-2 m), unvegetated areas. Sediments in the oligohaline sites were silty sand, and in the polyhaline and meso-polyhaline sites sediments were predominantly fine sands. Microcosms ranging from approxi- mately 0.1 to 1.0 m were used to evaluate the influence of microcosm size on the system response. Microcosms were constructed of a series of cores collected with hand-operated box cores (10 cm x 20 cm x 10 cm deep). Core samples were placed in trays on a seawater table in the same arrange- ments as the original field orientations. For the duration of the experiments, microcosms were maintained in flow- through seawater tables where condi- tions of light, temperature and salinity were similar to the field. Macroinverte- brate samples in both the field and laboratory were collected in random sampling designs with coring devices (5 cm diameter, VIMS; 7.5 cm, FSU). Samples were preserved, rinsed onto 500- and 250-//m-mesh sieves, and the organisms identified to species and enumerated. Over a 5-year period seven field- laboratory experiments were conducted during spring and fall (Table 1). These seasons represented periods of peak biological activity. While the basic protocol for the tests remained similar, several different field and laboratory treatments were employed in the various tests. In all tests, replicated laboratory microcosms maintained in flow-through systems were sampled simultaneously with field treatments located in the same sites from which the microcosm com- munities were derived. The various treatments included predator exclusion and inclusion cages, and both field and laboratory dosing with unpolluted, hydro- carbon polluted (from the Elizabeth River, VA) and pentachlorophenol (PCP) pol- luted sediments. Toxicant dosing proce- dures in the field and laboratory were investigated by testing both the total volume and application method of toxicant-laden sediments. Sampling interval and duration were similar in a field-laboratory tests, but an effort wa made to determine the appropriat sampling schedule to observe response in the dosed experiments. A major focus of the work was t identify response variables that ade quately reflect important community level responses in both the field am laboratory systems. Community-widi descriptors (total number of individuals number of species, species diversity an< evenness), numerical abundances o dominant species, and biomass ani productivity estimates were all used a: Table 1. Sampling Schedules for the Combined (FSU-VIMS) Experimental Program (1981 1985) Weekly samples A. FSU 1. oligohaline stations (11 /24/81-11/17/83) 2. polyhaline station (11 /2S/81-3/15/84) B. VIMS 1. polyhaline marine lab station (10/13/79-12/18/83) Microbiological data A. FSU 1. oligohaline stations (fall 1982; spring 1983) 2. polyhaline stations (spring 1982) B. VIMS 1. marine lab station (spring 1982) Combined (field-laboratory) experiments A. Spring 1982 1. Florida 2. Virginia B. Fall 1982 1. Florida 2. Virginia C. Spring 1983 1. Florida 2. Virginia D. Fall 1983 1. Florida 2. Virginia 3. Treatments included: a. Field controls b. Field predator exclusion cages c. Field predator inclusion cages d. Microcosm controls e. Field and lab treatments dosed with PCP E. Spring 1984 1. Virginia only F. Spring 1985 1. Florida (station ML) 2. Virginia 3. Treatments included: a. field controls b. microcosm controls c. replicate lab and field treatments dosed with PCP d. azoic sediments G. Fall 1985 1. Florida (station ML) 2. Virginia 3. Treatments as in F. 3. ------- measures of community response. A more fruitful approach was to categorize individual species into guilds based upon ecological similarities and to treat these groups as ecological units in assessing response to toxicant stress. Project Results The laboratory-field experiments coupled with the weekly monitoring data permitted us to address each of the objects stated above. Criteria for con- ducting microcosm toxicity tests are outlined below and detailed in the final report. Definition of ecologically relevant functional groupings and the classifica- tion of species into guilds provided the basis for successful comparisons between laboratory and field commun- ities and between the Florida and Virginia test systems. Comparisons of different microcosm sizes suggested that small benthic microcosms (approximately 0.1 m2) provide good laboratory systems. These small microcosms had similar commun- ity patterns to larger (0.8 - 1.0 m2) microcosms and are easier to construct and handle. Sampling of small micro- cosms is a destructive process and each replicate microcosm may be sampled at only one time, therefore requiring a large number of replicate microcosms. Experimental durations of up to 6 weeks and sampling intervals of 1-2 weeks were used in the various field- laboratory experiments. In one experi- ment, samples were taken within 24 hours after dosing. Experience suggests that early sampling after dosing (within 24-48 hours) followed by increasing intervals up to 5 weeks provided a good sampling regime. Beyond 5 weeks micro- cosms may begin to experience changes in sediment geochemistry that cause the laboratory to diverge greatly from the field. The addition and monitoring of toxic substances are critical steps in any laboratory microcosm test syrtem. The experiments suggest that a good dosing procedure was to apply a 1 -cm thick layer of toxicant-laden sediments to the surface. The authors found that this same amount of uncontaminated sedi- ments did not have noticable effects upon the community. Dosing with less sedi- ment (1 mm thick) and the same toxicant load proved to be less effective. In tests using benthic microcosms derived from other habitats, it is important to include a treatment that adds uncontaminated sediments. The microbiotic component in a labor- atory microcosm is highly variable, and its capacity to predict natural trends depends on a combination of habitat characteristics in the area of origin. Microbial communities in microcosms deviate progressively with time from field associations and extended equilibration periods are ill advised. In microcosms of sediments from polyhaline areas, microbes did not follow field conditions as closely as those in microcosms from oligohaline portions of the estuary. Without detailed knowledge of microbial ecology in the source area an interpre- tation of results from laboratory micro- cosms could be misleading. Comparisons of field and laboratory community dynamics of infaunal macroinvertebrates revealed much vari- ation between experiments and loca- tions, but some generalization emerged. The population dynamics of many dom- inant species in both the Apalachicola Bay and the York River estuary were similar in the microcosms and the field. This generalization was qualified by finding that some species occasionally underwent rapid population blooms in the laboratory microcosms. For instance, in Florida experiments the polychaete Mediomastus ambiseta sometimes exhi- bited large population increases in the laboratory relative to the field. A similar response was observed in Virginia for the oligochaete Paranais littoralis. These population increases are related to the ability of these organisms to reproduce in the microcosm where new individuals survive better than in the natural field site. This pattern has been experimen- tally observed in field cage treatments for M. ambiseta. The major population fluctuations in both the field and the laboratory (for undosed treatments) were associated with recruitment events. Since recruitment intensities for most species differ between the laboratory and the field, recruitment events may result in substantial differences between field and laboratory populations. Moreover, the year-to-year variability in the timing and intensity of recruitment for any given species introduces a stochastic element into microcosm testing when single- species fluctuations are emphasized. Using community-level parameters to describe field and laboratory system responses avoids some of the variability associated with individual species fluc- tuations. Species richness provided a good descriptor of the macroinvertebrate communities (Figure 1). In undosed treatments, species numbers in the field and the laboratory were often similar and very conservative. In dosing experiments with both hydrocarbon contaminated sediment and PCP-laden sediment, the species richness component showed similar negative responses in both the field and laboratory communities (Figure 1). By contrast total numbers of Individ uals in the system fluctuated widely, largely as a result of recruitment events, and were not particularly responsive to toxicant treatments. Species diversity and evenness measures reflect combi- nations of these two components and were variable in their correspondence between the laboratory and field sites. Species richness is an important com- ponent of natural systems which is modeled well in aquatic microcosms and proved to be a sensitive community-level response to stress by a toxicant. Grouping species into guilds based upon functional groups according to trophic, mobility, and dispersal modes proved to be a powerful approach for interpreting community responses. This approach served two purposes. First, it permitted identifying those guilds of organisms for which laboratory micro- cosm populations do not serve as good analogs of natural populations in the absence of any toxicant (e.g., those brooding or asexually reproducing spe- cies which have capability of blooming within the microcosm). These types of organisms may be excluded a priori horn analyses to assess toxic effects. The second advantage is that identifying types of organisms that act as similar ecological units facilities comparisons between microcosms and field sites from different locations. For instance, while the species composition varies between the Virginia and Florida sites, function- ally similar ecological groups are found in both sites and provide a basis for comparison. Figure 2 shows summary examples of this approach for one guild which was modeled well in the laboratory and one which was not. Comparisons between field-laboratory experiments in Florida and Virginia identified both similarities and differen- ces. The most notable difference is that major recruitment periods at each loca- tion occur in different seasons. The major recruitment period in the York River estuary is in the spring with only a minor fall recruitment; the pattern in the Apalachicola Bay is temporally reversed. Since, as noted above, recruitment events play a major role in the population ------- A. Florida B. Virginia 30 20 1 if <0 .5> 10 14 12 10 c •c •$ u Field Control 0135 Time Since Initiation (Weeks) 01359 Time Since Initiation (Weeks) Figure 1. Field High Dose Lab Control Lab High Dose Species richness in field/laboratory PCP-dosed treatments from fall 1985 in Apalachicola Bay, Florida and the York River. Virginia. fluctuations observed, this becomes an important issue when attempting to infer the responses of natural communities to a toxic stress in one location based upon microcosm experiments located in another. Conclusions The authors conclude that properly constructed and replicated multi-species laboratory test systems with estuarine macrobenthic invertebrates can serve as effective tools for predicting responses to toxicant stress. Several caveats apply. Variability in natural estuarine systems is high, necessitating large numbers of experimental replicates and samples to observe even major responses. Species richness measures provide a conserva- tive indicator of community response to pollution-induced stress which is not subject to much of the variation observed for abundance measures. However, this measure may also gloss over much of the ecologically relevant response to the stress. The population dynamics of individual species within laboratory microcosms are too variable to provide adequate models of field populations, but grouping species into ecologically similar guilds alleviates much of this problem. The detailed ecological data required to construct these groups may be difficult to obtain for many species. The authors emphasize the importance of good ecological characterization of the habitats from which the microcosms are derived and the habitats about which inferences are to be made. When con- siderations of different recruitment seasons are taken into account, similar community responses to a toxicant are found in both the Florida and Virginia experiments. Experiments with PCP dosing conducted in Florida during the spring of 1985 had a similar response (in species richness) to experiments in the fall of 1985 in Virginia, and the fall experiments in Florida resembled those from the spring in Virginia. At each location experiments conducted during the peak reproductive seasons resulted in blooms of single species in the microcosm. A functional guild approach to analyzing community response patt- erns enhances the ability to make predictions concerning toxic responses in one site based upon laboratory exper- iments conducted at another site. It was concluded that laboratory micro- cosms can provide a valuable tool for assessing natural benthic community responses to introduced toxicants, pro- vided that the caveats and conditions described above are heeded. The authors recommend using microcosms to provide realistic estimations of field effects. Recommendations The results of the field-laboratory comparison experiments indicate that multi-species laboratory aquatic micro- cosms may yield valuable information regarding the responses of natural communities to pollution-induced stress. However, several very important cau- tions are offered for conducting and interpreting microcosm toxicity tests and in extending the findings to natural systems: 4 ------- Guild: Deposit-Feeder, DETRIV/OMNIV, Mobile Borrower, Limited Dispersal. Apalachicola Bay, FL Guild. Deposit-Feeder, DETRIV/OMNIV, Mobile Burrower, Wide Dispersal York River, VA 300 200 § I I 1 roo \ i n> 50 40 30 20 10 123456789 Time Since Initiation (Weeks) 0 01234569 Time Since Initiation (Weeks) Field Lab Figure 2. Comparison of temporal patterns for 2 guilds in the laboratory and the field. Data are composites of the control treatments in all tests. Data for the guild with limited dispersal (for which only Florida data are shown) reveal that although lab and field abundances track one another well initially, individuals in this group may undergo population blooms in the lab The guild with wide dispersal (Virginia data shown) shows a consistent pattern through the first 5 weeks with some divergence between lab and field thereafter. 1. Close attention must be paid to physio-chemical characteristics of microcosms and it is important that these lie within realistic ranges for field values at the time the exper- iment is conducted. 2. Monitoring of toxicant levels and distribution within the microcosms throughout the experiment is necessary to evaluate dissipation and breakdown of toxicants. 3. The high spatial variability inherent in benthic communities necessi- tates that sufficient replicates be employed. 4. The temporal variation in recruit- ment adds a nearly random com- ponent to the community response in microcosm tests from year to year and site to site. To overcome this problem it is mandatory that microcosm tests, while being prop- erly timed to correspond with biologically important reproductive seasons, identify and exclude spe- cies with aberrant recruitment patterns in the laboratory test system. 5. Successful extension of toxicity test results from laboratory micro- cosms to the field sites from which they were derived and beyond to other sites requires detailed knowl- edge of the systems. Just as impor- tant as an understanding of the physical conditions of the habitats is a good knowledge of the repro- ductive seasons and modes, tro- phic types, and life history charac- teristics of the component fauna. Since species composition will vary between sites it is necessary to characterize the response of differ- ent "ecological types" (guilds) m microcosms if the results are to be useful ------- |