UnilH-d 3t^*
Environmental protection
/, . Genetic Diversity as an Indicator
of Ecosystem Condition and Sustainability
"^ f t
... • >
,
BIODIVERSITY
*•
[
.
Utility for Regional Assessments of Stream
Condition In the Eastern United States
* *
.-*i
-------
Condition and Sustainability
EPA/600/R-03/056
September 2002
Genetic Diversity as an Indicator of
Ecosystem Condition and Sustainability
Utility for Regional Assessments of Stream
Condition in the Eastern United States
Mark J. Bagley
Susan E. Franson
Suzanne A. Christ
Eric R. Waits
Greg P. Toth
U.S. Environmental Protection Agency
National Exposure Research Laboratory
Ecological Exposure Research Division
Molecular Ecology Research Branch
Cincinnati, OH 45268
Recycled/Recyclable
Printed with vegetable-based ink on
paper that contains a minimum of
50% post-consumer fiber content
processed chlorine free.
-------
Genetic Diversity as an
Ill I I
Notice
The research described in this report has been funded wholly or in part by the U.S. Environmental Protection
Agency. This document has been prepared at the EPA National Exposure Research Laboratory (Ecological
Exposure Research Division, Cincinnati, Ohio) based on data collected by EPA and Sobran, Inc. (Contract 68-C6-
0019) and with publication assistance provided by CSC, Inc. (Contract 68-WO-1032).
Mention of trade names or commercial products does not constitute endorsement or recommendation of use.
The correct citation for this document is:
Bagley, M.J., S.E. Franson, S.A. Christ, E.R. Waits, G.P. Toth. 2002. Genetic Diversity as an Indicator of
Ecosystem Condition and Sustainability: Utility for Regional Assessments of Stream Condition in the Eastern
United States. U.S. Environmental Protection Agency, Cincinnati, OH.
-------
Condition and Sustainability
Preface
In 1995, the U.S. Environmental Protection Agency's National Exposure Research Laboratory (NERL)
enhanced its ecological research and development efforts to support the delivery of the "next generation"
of biological indicators. These new ecological condition and stressor diagnostic indicators would follow
the risk paradigm organization of the U.S. EPA's Office of Research and Development (ORD). ORD's
Ecological Research Strategy (US EPA, 1998) called for the "development of a set of indicators for estu-
arine, stream, and lake systems that can be interpreted relative to status and changes in fundamental eco-
logical and hydrologic processes that influence and constrain the integrity and Sustainability of these sys-
tems." Genetic diversity indicator research was initiated in NERL to address this objective.
In 1998, ORD's National Center for Environmental Research (NCER) requested proposals from the aca-
demic research community for Science to Achieve Results (STAR) grants focused on "Ecological
Indicators". Research sponsored by these grants emphasized genetic diversity and landscape ecology, both
of which can be interpreted at a number of geographic scales, a requirement for the next generation of eco-
logical indicators. In May 2000, NCER sponsored a review of genetic diversity science including both
ORD scientists and STAR grant recipients during which a roundtable discussion delineated contributions
as unique to either academic or federal laboratories. Large, regional-scale evaluations of genetic diversi-
ty within species, deemed beyond the scope of academic laboratories, were seen to be appropriate for fed-
eral laboratories and easily incorporated into existing environmental monitoring studies such as the
Environmental Monitoring and Assessment Program (EMAP). These views were mirrored by recom-
mendations in the July 1999 NSF Task Force on the Environment document "Environmental Science and
Engineering for the 21st Century" which called for genetic diversity research within federal laboratories.
Current advances in molecular biological science and technology have converged with classical genetic
research, large-scale field biological monitoring, and remote sensing capabilities to provide unprecedent-
ed opportunities for multifaceted studies of species population structure and dynamics. These types of
studies are critical to understanding the integrity and Sustainability of ecosystems.
This report chronicles significant strides made in the development of an ecological indicator based on
genetic diversity that is suitable for environmental monitoring studies at a range of geographic scales.
Implementation of regional studies of genetic diversity required development of protocols for inclusion of
genetic sampling in field studies and large-scale laboratory throughput of these samples. Robust statisti-
cal methods facilitated the meaningful interpretation of genetic diversity DNA fingerprinting data in the
context of other environmental data collected concurrently. Documentation of these procedures for meas-
uring genetic diversity is presented herein, along with the background and rationale for employing genet-
ic diversity as an ecological indicator. Case studies are presented which demonstrate the application of
genetic diversity in two field-monitoring efforts. Finally, recommendations are given for genetic diversi-
ty study design and technology transfer based on field and laboratory experience with large-scale studies.
-------
Genetic Diversity as an
Ill I I
Summary
Genetic diversity is a fundamental component of biodiversity and is as critical to sustainability of our nat-
ural resources as are diversity of species and ecosystems. It encompasses all of the genetically determined
differences that occur between individuals of a species. Virtually all species are composed of populations
that exist somewhat independently of each other, and thus genetic diversity exists both within and among
populations. Levels of genetic diversity in any one population are determined primarily by four forces:
(1) mutation, the ultimate source of all genetic diversity; (2) migration, the exchange of individuals
between populations; (3) natural selection, the removal of "unfit" individuals from the population; and (4)
genetic drift, random changes in gene frequency each generation due to limited numbers of breeding
adults. The natural history of a species and the structure and dynamics of populations provide the arena
in which these forces interact to drive evolutionary adaptation of populations to their environments. Thus,
natural and anthropogenic environmental changes lead to changes in genetic diversity, both within and
among populations, and genetic diversity measurement can provide insights into the consequences of envi-
ronmental changes.
Genetic diversity can be measured by examining common morphological or morphometric traits. Such
observable characteristics often result from the interaction of many genes, the expression of which is influ-
enced by environmental factors. Assessments of molecular markers based directly on DNA have simple
inheritance patterns and are not influenced by environmental factors. This simplification of the genetic
system allows precise estimates of genetic diversity for any one marker and, by assessing many markers,
can provide more precise estimates of overall levels of genetic diversity within and among populations of
a species. Mathematical tools have been developed that allow diagnosis of the relative strengths of the
four genetic forces and, indirectly, properties of populations, such as population size, breeding structure,
and dispersal abilities.
Measurement of genetic diversity with molecular markers can add value to assessments of ecological con-
dition derived from other ecological indicators, such as landscape and species assemblage indicators.
Population parameters can be effectively estimated with molecular markers and used to characterize the
geographic structure and connectivity of populations critical to interpreting data for ecological assess-
ments. Genetic diversity also serves as an independent indicator of environmental condition.
Environmental stressors typically reduce genetic diversity, primarily through the forces of selection and
genetic drift, so that a recent reduction in genetic diversity is indicative of deteriorating environmental
condition. As an indicator of ecological condition, genetic diversity integrates the genetic effects of mul-
tiple sources and is cumulative over time. In addition, it is a naturally 'scalable' indicator, as the geo-
graphic structuring of genetic diversity at the population, watershed, and regional levels is easily inferred.
The importance of genetic diversity to long-term sustainability is widely accepted, although the efficacy
of molecular measures of genetic diversity for diagnosing extinction risk remains unclear and needs fur-
ther investigation. Standing levels of genetic diversity in populations contribute to long-term sustainabil-
ity in several ways. First, the ability of populations to adapt to changing environments is directly depend-
ent on the amount of genetic diversity they possess. Second, small populations that lose genetic diversity
may experience fitness reductions and increased extinction risk. Finally, populations that are adapted to
IV
-------
Condition and Sustainability
local conditions may become less fit if individuals from other areas that are adapted to different conditions
are allowed to interbreed with them; the ensuing reduction in genetic diversity between populations influ-
ences long-term sustainability of the species.
This report documents research undertaken to determine if the theoretical promise of genetic diversity as
an ecological indicator is realized in real-world applications. Results of two case studies confirm that
genetic diversity is a useful indicator of environmental condition. The first case study incorporated the
genetic diversity indicator in a larger Regional Environmental Monitoring and Assessment Program study
of the Eastern Cornbelt Plains Ecoregion, done in collaboration with US EPA Region 5 and Ohio EPA.
Genetic diversity of a small cyprinid minnow, the central stoneroller (Campostoma anomalum), was meas-
ured at 91 sites in nine watersheds using the RAPD (random amplified polymorphic DNA) fingerprinting
technique. Although the RAPD technique was chosen primarily for ease of technology transfer, experi-
ences with the technique suggested that it might not be robust to the normal variations in equipment and
technical skills that exist among different laboratories. Nonetheless, the genetic diversity data obtained
proved highly informative. Although sample sizes varied and were sometimes small (3-10 individuals per
site), large differences in genetic diversity within sites and among sites were detected. Significant differ-
ences in the average levels of genetic diversity within populations were observed among major river
drainage basins, leading to the conclusion that populations of stonerollers are highly differentiated within
the Eastern Cornbelt Plains Ecoregion and there is geographical structuring of these populations within
and among watersheds. Genetic diversity is related to environmental condition, particularly impacts from
urbanization, channelization, and impaired riparian zones. Expected relationships between genetic diver-
sity and existing ecological indicators such as the IBl and QHEI were seen, although the small degree of
correlation suggests that the genetic diversity indicator provides supporting and not highly redundant
information for environmental condition assessments.
The second case study examined the genetic diversity indicator applied to populations of the creek chub
(Semotilus atromaculatus) in a small region of western Pennsylvania and West Virginia underlain by coal-
bearing geology and for which the history of coal mining operations is known. Samples of between 9 and
28 creek chubs were collected from 10 sites within 4 watersheds. Two molecular methods were used: the
amplified fragment length polymorphism (AFLP) fingerprinting technique was used to assess diversity in
the nuclear genome, while a portion of the mitochondrial genome was assessed using DNA sequencing.
Mitochondrial DNA differences showed a strong spatial component. The nuclear DNA also differentiat-
ed the populations although the genetic structure was not as strong as that seen in the mitochondrial DNA.
Environmental factors (derived from principal components analysis of 25 key environmental measure-
ments) accounted for about half of the differences in mitochondrial DNA diversity and virtually all of the
differences in nuclear DNA (AFLP) diversity.
These two case studies clearly demonstrate that genetic diversity can serve as an indicator of environ-
mental condition. They also provided the practical experience upon which recommendations for future
implementation are based.
At present, genetic diversity indicators will be used most effectively if they are incorporated into multi-
indicator assessments at large, regional scales. Typically, reduced genetic diversity in particular popula-
-------
Genetic Diversity as an
Ill I I
tions is inferred from assessments of spatially separated populations, although it can be detected more eas-
ily through temporal monitoring. Thus, incorporation of genetic diversity indicators into monitoring pro-
grams at intensively studied index sites will be useful. Regional genetic diversity data can then serve as
baseline data for future monitoring of temporal patterns in genetic diversity at different spatial scales.
A number of different molecular technologies can be used for genetic diversity analysis, including
allozyme, DNA fingerprint, microsatellite DNA, and mitochondrial DNA fragment or sequence analysis.
At present, it appears that the most cost-effective strategy is to incorporate microsatellite markers into
existing or planned ecological assessments. It may be beneficial to supplement microsatellite analysis
with mitochondrial DNA analysis since mitochondrial DNA can yield complementary information. This
approach is the most technologically challenging of all the genetic diversity assessment options. Thus, the
recommendation is that a "three-laboratory approach" be used to obtain and interpret genetic diversity
data. The lead lab would be the regional field lab, which would be responsible for design of the assess-
ment, field collections, and preparation of DNA samples. A marker development laboratory would design
molecular markers specific for the target species identified by the regional lab. A genetic analysis labora-
tory would use the molecular markers and DNA samples obtained from the other labs to perform the genet-
ic diversity assessment and, together with the regional laboratory, derive the ecological assessment.
VI
-------
Condition and Sustainability
Table of Contents
Preface iii
Summary iv
Table of Contents vii
1 Introduction 1
1.1 What is Genetic Diversity? 1
2 Rationale for a Genetic Diversity Indicator 4
2.1 Relevance of genetic diversity to assessment of ecological condition 5
2.2 Relevance of genetic diversity to assessment of population sustainability 11
2.3 Measurement of Genetic Diversity 13
3 Case Studies 16
3.1 Genetics of Central Stonerollers in The Eastern Cornbelt Plains Ecoregion 16
3.1.1 Background 16
3.1.2 Key findings and their implications 19
3.2 Genetics of Creek Chubs in a Mining-Impacted Region 32
3.2.1 Background 32
3.2.2 Key Findings and their Implications 35
3.3 Generalizations from the Case Studies 40
4 Considerations when Implementing a Genetic Diversity Assessment 43
4.1 Sampling Design 43
4.2 Species Choice 45
4.3 Which Genetic Marker? 46
4.4 Sample Size Considerations 53
4.5 Personnel Training and Specialized Equipment 55
4.6 Information Management 55
4.7 Costs 56
4.8 Summary Recommendations 59
Glossary 62
References 65
Acknowledgements 72
Appendix 1: Laboratory and analytical procedures for RAPD analysis 73
Appendix 2: Laboratory and Analytical Procedures for AFLP Analysis 76
Appendix 3: Laboratory and Analytical Procedures for DNA Sequence Analysis 77
VII
-------
Genetic Diversity as an Indicator of Ecasjfijjll
ill
viii
-------
Condition and Sustainability
1. Introduction
Historically, the U.S. Environmental Protection Agency and other governmental agencies charged with
monitoring and safeguarding the quality of aquatic environments tended to concentrate their efforts on
assessment of the immediate toxicological effects of discharges from known pollution sources. Gradually,
as the more flagrant pollution sources were identified and removed, it became clear that much of the
remaining pollution in our waterways could not be ascribed to specific point sources but was generated at
low levels from many nondescript sources. In addition, there was concern that aquatic ecosystem health
was not a simple function of water quality and could not be fully measured based on the results of a series
of short, standardized laboratory tests. Following the publication of an influential report over a decade ago
(U.S. Environmental Protection Agency, 1988), the EPA began to shift its ecological research to focus
attention on ecosystem responses to cumulative, regional, and long-term anthropogenic disturbances. The
EPA's Environmental Monitoring and Assessment Program (EMAP) was born of this movement, and was
given the mission to evaluate the status and trends in the condition of the Nation's ecological resources
(Messer,etal, 1991).
A number of ecological indicators have since been developed that have, in one fashion or another, aided
in measuring the overall health of aquatic ecosystems. These include indicators based on types of assem-
blages of species present and landscape or land cover indices. The application of these indicators has
clearly expanded our toolbox for monitoring the status of aquatic ecosystems. Nonetheless, important
information about the current status and predicted future status of the Nation's aquatic ecosystems still
remain to be addressed. For example, current indicators tell us little about the biological independence or
connectivity of different geographic areas. How important to ecosystem health are the natural migration
barriers and corridors that separate and connect different populations? If populations of a species from
one region are extirpated, will individuals that are similarly adapted to that environment replace them?
How much of the evolutionary history of a species will be lost if the habitat of one distinct population is
destroyed to make room for a new development?
In addition, we know little about the many hidden effects of anthropogenic stressors on species and ecosys-
tems that do not manifest as gross deformities, dead or dying fish, or extinct species. Do these stressors
evoke heritable changes in organisms that may affect the Sustainability of populations many generations
into the future? If past or current stressors have altered the genetic characteristics of populations, have
these changes affected their ability to withstand the additional stresses of future human population growth,
global warming, and associated environmental modifications? Are some populations of a species geneti-
cally "pre-programmed" to withstand severe environmental modifications while others are not?
These are important questions that must be answered to truly understand the condition and Sustainability
of the Nation's ecological resources. Clearly, they cannot be answered using simple streamside assays or
laboratory tests. To answer these questions, we must come to understand the dynamics of populations and
how those dynamics will be altered in the face of anticipated environmental changes. To understand the
long-term effects of environmental changes on populations, we must learn how environmental modifica-
tions impact genetic diversity within and among populations and how changes in genetic diversity ulti-
mately influence the ability of populations to withstand further environmental change.
1.1 What is Genetic Diversity?
The term 'genetic diversity' has been defined in various ways to suit different purposes. Here, we restrict
our interests to intraspecific genetic diversity, which represents the range of heritable differences of a trait
-------
Genetic Diversity as an
Ill I I
or set of traits among individuals within a species and includes diversity among individuals within popu-
lations as well as variation among different populations (Figure 1-1). The term is essentially synonymous
with 'genetic variation' or 'genetic variance'. As defined, we consider genetic diversity a trait of popula-
tions and species, not of individuals. Examples include variation in the genetic determinants of eye color,
growth rate, and disease resistance within and among populations. Similar variation can be assessed in
specific stretches of DNA using molecular markers. Genetic diversity is one of the three components of
biodiversity, along with species diversity and ecosystem diversity. In a sense, it is the fundamental orga-
nizational component of biodiversity, since species diversity is built from genetic diversity, and ecosystem
diversity derives from species diversity. Thus, any goal to monitor or maintain biodiversity is incomplete
and superficial if it considers only species and ecosystems.
Genetic diversity is shaped by past population processes and affects the sustainability of species and pop-
ulations in the future. Physical, chemical, and biological environments are constantly changing over short
and long time scales. To keep up with changing environments, species must adapt or face extinction. The
ability of species to adapt to altered environments is directly related to the amount of genetic diversity
available to natural selection. For two populations that are under similar environmental pressures to adapt,
the population with the greater genetic diversity for fitness traits is expected to adapt more quickly. That
the maintenance of genetic diversity is key to the long-term survival of most species is a central paradigm
of the field of conservation genetics (Soule, 1987).
The main forces that determine current levels of genetic diversity within species are: mutation, migration,
selection, and genetic drift (Figure 1-2). Mutation is the ultimate source of all genetic diversity, but is usu-
ally a relatively weak force in comparison to the other three. Most new mutations are either neutral or
harmful with respect to fitness. A very small proportion of mutations is expected to increase fitness.
Migration or gene flow represents the movement of breeding individuals between populations. It is usu-
ally a very strong force on genetic diversity, although it can be weak for species with low dispersal abili-
ties. Disturbance of a species' migration regime may have substantial effects on fitness. In general, migra-
tion has the effect of increasing genetic diversity within populations and homogenizing genetic differences
among populations.
Natural selection, along with genetic drift, is one of the main forces that cause separate populations to
become differentiated. It is the primary force that increases the average fitness of populations. Selection
can have variable effects on genetic diversity, depending on the form of selection that is operating. In
cases where an environmental change has caused a different set of genes or genotypes to be optimal, selec-
tion is expected to decrease genetic diversity at selected loci, at least until the population mean has shift-
ed to the new genetic optimum.
Genetic drift represents the random change in gene frequencies in each generation due to a finite number
of breeders producing each new generation. It is the statistical equivalent of sampling error. Without the
input of new mutations, all genetic diversity within species would eventually be lost due to the effects of
genetic drift. Genetic drift tends to be a very strong force on genetic diversity in small populations, even
for loci that are under intense selection. As a consequence, adaptation is more difficult in small popula-
tions while random fixation of deleterious alleles that would normally be removed by selection is more
likely. The effect of genetic drift in large populations is weaker but may still be a dominant force for loci
that are not under selection. Genetic drift has the effect of decreasing genetic diversity within populations
and increasing genetic diversity among populations.
-------
r
Diversity as an Indicator of Ecosystem Condition and Sustainability
o.
(D p_o ^
°=0=!
ro .^ o
at?
" «g o
§ = is
- o
±^ 1- Q. O Q.
£ n > , o
-------
Genetic Diversity as an Indicator of Ecosystem Condition and Sustain
I I I
Migration (+)
Genetic drift (-)
Local adaptation (-)
Within
Populations
Selection (+/-)
Mutation / recombination (+)
Genetic drift (-)
Migration (-)
Genetic drift (+)
Local adaptation (+).
Among
Populations
o
Species
Figure 1-2. Forces that influence overall (species-level) genetic diversity, as well as components of genetic diversity
among and within populations. Forces that are annotated with a + generally increase genetic diversity; those anno-
tated with a - generally decrease diversity. Selection may either increase or decrease genetic diversity, depending
on specific circumstances.
2 Rationale for a Genetic Diversity Indicator
It is now a widely accepted practice to integrate genetic management ideas into resource management
planning (e.g, Hessel, 1992; Moritz, 1994). The primary goal of genetic management is to ensure that suf-
ficient genetic diversity is retained in order to maintain short-term fitness and long-term sustainability and
to ensure that local adaptations are not lost due to intermixing of previously distinct populations. An addi-
tional goal is to identify and protect those populations that represent highly distinct or important evolu-
tionary lineages, because if such populations become extinct, an important component of the genetic diver-
sity of the species would be lost. The idea that genetic monitoring can be used as an indicator of envi-
ronmental condition is less prevalent in resource management plans (salmon management in the Pacific
Northwest being a notable exception), primarily because most resource agencies still consider genetic
techniques to be novel, expensive, or technically challenging. This has created the somewhat paradoxical
situation in which resource managers pay close heed to conservation genetic principles, but have not
-------
ITTT
jjjU''&f.Ecosystem Condition and Sustainability
adopted policies to measure the success of man-
agement actions in maintaining genetic diversity.
Nonetheless, an enormous database of genetic
diversity studies exists in the scientific literature
and can provide guidance in the development and
evaluation of genetic diversity indicators.
of the of
a for
is
to the
of
I • •
2.1 Relevance of genetic diversity to
assessment of ecological condition
The genetic diversity of populations responds to
environmental heterogeneity via alterations in the
relative strengths of the four opposing genetic forces: mutation, migration, selection, and genetic drift.
The balance and cumulative history of these forces determines actual levels of genetic diversity at any one
time. The accuracy and sensitivity of measurements of genetic diversity has steadily increased with
advances in molecular marker technologies. An enhanced ability to describe patterns of genetic diversity
in space and time has, in turn, made it easier to diagnose the genetic mechanisms that produce these pat-
terns (Carvalho, 1998).
By far, the easiest genetic forces to measure in natural populations are equilibrium levels of migration
(gene flow) and genetic drift. Gene flow provides a direct measure of the evolutionary connectivity of
populations while genetic drift provides a measure of effective population size. Populations that have low
connectivity with others become genetically differentiated and more unique. These populations are often
assigned higher conservation 'value' because a large portion of the genetic diversity of the species would
be lost if they were extirpated (Moritz, 1995). Populations with low effective population size may be more
vulnerable (see section 2.2) and therefore may be targeted for immediate conservation efforts. The use of
genetic markers to measure connectivity and effective population size and to assign conservation values
and priorities is commonplace, as evidenced by numerous articles in scientific journals such as Molecular
Ecology, Evolution, Conservation Biology, and Conservation Genetics. Thus, the concept that genetic
diversity is a useful indicator of population structure and conservation status is well established. However,
in these studies, evaluations of genetic diversity are almost always focused narrowly on the status of par-
ticular populations residing in specific habitats; inferences to overall ecological condition based on genet-
ic diversity are not usually made.
For the purpose of performing environmental assessments, one of the most important potential uses of a
genetic diversity indicator is simply to characterize the natural population structure of component popula-
tions within ecosystems. The effective population size and connectivity of populations are fundamental
attributes of populations that can be diagnosed only with genetic markers. Because populations are the
basic biological units that respond to changes in the environment, delimitation of population boundaries is
critical to effective ecosystem monitoring and management.
Without this basic knowledge of population structure, it is impossible to judge the extent or range over
which local or regional environmental impacts will affect ecosystems. In addition, concordance of popu-
lation genetic structure across several species may indicate biogeographic boundaries not recognized
through analysis of species distributions alone (Moritz and Faith, 1998). By defining population bound-
aries, genetic diversity indicators provide fundamental data that enhance the value and interpretation of
other ecological assessment data, such as those obtained from landscape and species assemblage indica-
-------
Genetic Diversity as an Indicator of Ecosystem Condition and Sustainabilit
I I I
tors.
In addition to delineating these long-term, evolutionary characteristics of populations, a useful indicator
of ecological condition should be responsive to environmental change (US EPA, 1998). Genetic diversi-
ty responds to environmental change, primarily through the actions of genetic drift and selection, but also
through mutation and gene flow. The indicator
'remembers' population effects. In other words,
once genetic diversity is reduced it will remain low
until mutation or gene flow replenishes it (Figure
2-1). Because the effects of selection and drift are
cumulative across generations, there is often a lag
time before significant changes in genetic diversity
are revealed (Bickham et a/., 2000). Thus, a genet-
ic diversity indicator is expected to be useful pri-
marily for multigenerational exposures; the rele-
vant timescale for indicator response is years or
decades, not months. Exceptions occur when pop-
ulations are severely bottlenecked or severe selec-
tion pressures are imposed on populations, induc-
ing rapid and severe changes in genetic diversity.
Decreases in genetic diversity in response to experimental bottlenecks of laboratory populations are well
known (Hartl and Clark, 1997). In general, these studies show that declines in heterozygosity at marker
loci follow theoretical predictions regarding the loss of genetic diversity in relation to effective population
size. "Natural" population bottleneck experiments, such as those that occurred when elephant seals were
hunted to near extinction and when European starlings were introduced to North America, also show a
remarkable correspondence with genetic theory (Cabe, 1998; Hoelzel, 1999).
Genetic diversity indicators provide
fundamental data that enhance the
value and interpretation of other
ecological assessment data, such
as those obtained from landscape
and species assemblage indicators.
1.2
e
8 0.8
31 0.6
2
oj 0.4
o>
1 0.2
Ne
•2
•5
•20
•100
•5-100
°>
Generations
Figure 2-1. Predicted loss of heterozygosity over time from genetic drift. Heterozygosity (a measure of genetic
diversity within populations) decreases more rapidly in populations with smaller effective population sizes (Ne). For
the series labeled "5-100", the population started at an effective size of Ne=5 and remained there for five genera-
tions. Starting at generation 6 the effective size grew to Ne=100. Note that information about the past bottleneck
continues to be reflected by heterozygosity after the population has rebounded.
Numerous field studies have demonstrated a correlative relationship between levels of genetic diversity
and single or mixed contaminant exposures. Examples of changes in the genetic structure of plant popu-
lations exposed to mine wastes are well known, and were recently reviewed by Shaw (1998). Several
examples of correlations between genetic diversity and contaminant exposures for freshwater and marine
species are listed in Table 2-1. In most of these cases, the frequency of a particular allele or genotype shift-
-------
irrr
', Jii-^yf^
mtUEiCOsystem Condition and Sustainability
...
a
is
I I
ed dramatically in exposed populations relative to
frequencies in reference populations. Often, a sim-
ilar shift was observed for the same marker locus
(e.g., the allozyme locus GPI-2) in different
species, suggesting that selection was operating at
that particular marker (Gillespie and Guttman,
1999). In addition, genetic diversity of exposed
populations was often lower than for reference
populations, suggesting that the effective sizes of
exposed populations were reduced thereby induc-
ing strong genetic drift effects. In at least one case,
a significant difference in genetic diversity
between reference and exposed populations was
observed when indicators based on species assem-
blages suggested homogeneity (Roark and Brown, 1996).
Many factors can contribute to differences in allele frequencies so demonstration of population differences
does not, by itself, convincingly demonstrate that the specific contaminants directly impacted genetic
diversity. However, a causal link between susceptibility and resistance to specific contaminants and genet-
ic diversity has been demonstrated in several laboratory and mesocosm experiments (Table 2-1). For
example, Gillespie and Guttman (1989) found that central stonerollers with allozyme genotypes that were
common in contaminated environments had greater survival following laboratory exposure to copper than
those with other genotypes. While significant effects on genetic diversity have been observed for a large
number of organisms and experimental exposures, other studies have failed to identify significant genetic
changes, even in cases where correlations were observed in natural populations (e.g., Diamond et al.,
1989). Although a number of potential reasons can explain the lack of genetic response in these studies,
they demonstrate that genetic diversity is as yet an imperfect indicator of exposure.
Besides chemical inputs, other anthropogenic stressors that affect survival or selection would be expected
to impact genetic diversity as well. In practice, few examples of genetic diversity assessments relative to
recent environmental alterations have been reported. An exception is the rather large number of studies
that have documented the strong effects on genetic diversity of interspecific and intraspecific hybridiza-
tion due to species introductions (e.g., Campton and Johnston, 1985; Gyllensten et al, 1985; Leary et al,
1995; Williams et al., 1996; Ayres et al., 1999). The effect of hybridization on genetic diversity is imme-
diate and, if the hybridizing taxa are genetically distinct, easy to detect with genetic markers. The effect
of habitat fragmentation on genetic diversity also has been investigated. Increased genetic differentiation
of populations in relation to habitat shrinkage has been reported for woodpeckers (Haig et al., 1996),
lizards (Schneider et al, 1998), mint plants (Morden and Loeffler, 1999), and beetles (Knutsen et al.,
2000). The construction of a highway was reported to have increased genetic differentiation between vole
populations on each side of the road (Gerlach and Musolf, 2000). Smith et al. (1983) reported results of
a survey of genetic diversity in mosquitofish that suggested genetic differentiation due to water impound-
ments. If population size within fragments is decreased, fragmentation should result in lower genetic
diversity within populations as well, but this was either not observed or not reported in these studies.
Although there is much interest in measuring the ecological effects of environmental mutagens and car-
cinogens, studies of the heritable effects of these exposures on natural populations are extremely rare.
However, measures of genomic mutation rates and inferences to possible changes in mutation rates, both
globally and in specific environments, are extremely important. Population genetics theory suggests that
even a very small increase in the mutation rate may decrease the fitness of many populations enough to
-------
Genetic Diversity as an
p I1 I1
Table 2-1. Natural populations and laboratory or mesocosm studies of freshwater and marine taxa that indicate a
correlation between genetic diversity and exposure to contaminants (modified and extended from Gillespie and
Guttman, 1999).
Known
Stressors
Mercury, other
metals
Acidity, [Al]
Arsenate
Pesticides
PAH
(fluoranthene)
Radionuclides
Overall water
quality,
complex
effluents
Molecular Marker
Allozyme
Allozyme
____A21ozyjn^___
Allozyme
Allozyme
^^^AM5?XH!£^^-
RAPD, Allozyme
RAPD
Allozyme
CYP1A Sequence
MtDNA/RAPD
MtDNA
RAPD
RAPD
Allozyme
Taxa Natural Population References
(NP)Or
Laboratory /
Mesocosm (L/M)
Study
Gastropods
Shrimp
Fish
Fish
Shrimp
Gastropods
Insect
Fish
Amphipod
Shrimp
Gastropod
Mollusc
Fish
Fish
Fish
Amphipod
Fish
Fish
Fish
Bivalve
Fish
Amphipod
Fish
Fish
Fish
Fish
Fish
Fish
Copepod
Fish
Crayfish
Mussel/Barnacle
Fish
NP
NP
NP
NP
NP
NP
NP
L/M
L/M
L/M
L/M
L/M
L/M
L/M
L/M
L/M
L/M
L/M
L/M
NP
NP
NP
L/M
L/M
T/M
NP
L/M
L/M
L/M
L/M
NP
NP
L/M
NP
NP
NP
NP
NP
NP
NP
NP
NP
NP
Nevoe/a/., 1984
Bentone/a/., 1994
Nevoe/a/., 1984
Reagleretal., 1993
Keklake/a/., 1994
Bentone/a/., 1994
Roark and Brown, 1996
Diamonds a/., 1989
Heaglere/a/., 1993
Mulvey etal, 1995
Tatarae/a/., 1999
Ben-Shlomo andNevo, 1988
Lavie and Nevo, 1986b
Benton and Guttman, 1992a,b
Chagnon and Guttman, 1989
Schluetere/a/., 1995, 1997, 2000
Duane/a/.,2000a
Ben-Shlomo andNevo, 1988
Lavie and Nevo, 1982, 1986a,b
Moragaetal., 2002
Larno etal., 2001
Koppe/a/., 1992
Koppetal., 1992
Duane/a/.,2000a
Newman et al., 1989
Hughes et al, 1991
Brown Sullivan and Lydy, 1999
Tanguy etal., 1999
Schlueter, et al, 2000
Duane/a/.,2000b
Larno et al., 2001
Theodorakis and Shugart, 1997
Theodorakis and Shugart, 1998
Gillespie and Guttman, 1989, 1993
Fore et al., 1995a,b
Heithaus and Laushman, 1997
Roy etal., 1996
Murdoch and Hebert, 1994
Street and Montagna, 1996
Nadige/a/., 1998
Krone etal., 1999
Maet al, 2000
Roark et al, 2001.
..
-------
Condition and Sustainability
drive them to extinction (Lynch, 1996). Measurement of mutations with highly mutable molecular mark-
ers (e.g., microsatellites, minisatellites) can reveal differences in mutation rates at both the individual and
population levels. At the population level, an increase in the mutation rate is expected to result in a large
frequency of very rare alleles. In practice, it may be more straightforward to measure the accumulation of
new mutations directly, by assessing genotypes of parents and progeny, or by assessing the frequency of
novel genotypes in gametes of reproductive individuals. For example, analysis of multi-locus DNA fin-
gerprints of herring gulls and their offspring from an industrialized urban harbor indicated higher muta-
tion rates than for gulls originating from areas with lower anthropogenic inputs (Yauk and Quinn, 1996).
A similar analysis of microsatellite loci in barn swallows near Chernobyl, demonstrated two-to-tenfold
higher mutation rates near the radioactively contaminated area (Ellegren et a/., 1997).
The scientific literature reviewed in this section shows a clear relationship between genetic diversity and
ecological stressors that is consistent with genetic theory. In general, different stressors should have pre-
dictable consequences for measurements of genetic diversity. This is not to say that genetic diversity is
likely to be useful to diagnose specific stressors, but that an established cause and effect relationship exists.
Thus, loss of habitat that causes a decrease in the rate of migration between populations (fragmentation)
is expected to increase the component of genetic diversity among populations, which may be measured as
a decrease in the average similarity between populations or an increase in estimates of one of Wright's
(1978) F-statistics, FST (Figure 2-2). If fragmentation results in small breeding population sizes, then a
second predicted outcome would be reduced within-population genetic diversity or heterozygosity (H),
and reduced numbers of segregating alleles per locus (Na). Similarly, significant degradation of habitat
due to contaminant exposure, siltation, changes in flow regime, or other modifiers that greatly reduce pop-
ulation size are expected to decrease genetic diversity within populations (reduced H, Na) and greatly
reduce genetic diversity of genetic markers that are linked to loci affecting adaptation to the altered habi-
tat. The predicted effect of introduced species depends on whether they can successfully hybridize with
native populations. Thus, introduced species are expected to have an effect similar to habitat degradation
(decreased H, Na) due to competition or predation but will increase genetic diversity through hybridiza-
tion. Exposure to environmental mutagens is predicted to increase the average number of alleles at a mark-
er locus (Na) but, since each new mutation will be rare in the population, should have little effect on het-
erozygosity.
One of the principle advantages of genetic diversity measures is that they are highly scalable indices. For
aquatic populations, the basic units of measure are populations that reside in different stretches of streams
or rivers that are nested within watersheds that are nested within larger regions. Measures of genetic diver-
sity, such as Wright's F statistics or Nei's genetic diversity statistics (reviewed in Nei, 1987) are especial-
ly appropriate for analyzing this hierarchical structure. For example, these population statistics can be
used to assess the proportion of all genetic diversity that resides within populations, within local water-
sheds, within larger river drainages, and within regions. In fact, the only limitation to the number of hier-
archical levels of analysis that can be assessed are biological, as the range a species can occupy ultimate-
ly limits the regional application of genetic diversity indicators.
Another important advantage of a genetic diversity indicator is that it naturally lends itself to analysis of
temporal trends. In fact, temporal monitoring of genetic diversity, either regionally or at index sites, is the
preferred application of the indicator (see section 3.1.1). Temporal monitoring provides the most direct
and most effective measure of actual changes in the level or structure of genetic diversity. Analytical tech-
niques have been developed to estimate the effective number of breeding adults from temporal changes in
genetic diversity (Waples, 1990; Jorde and Ryman, 1996).
-------
Genetic Diversity as an
TTT
STRESSOR
EFFECT
MEASURE
Habitat Loss /
Fragmentation
Habitat
Degradation
Exotics /
Admixtures
Dispersal
"ST
Breeders
* H, N
Adaptation
Gene Flow/
Hybridization
Environmental
Mutagens
Mutations
*H, Na at
linked loci
Figure 2-2. Population-level effects of different stressors and their predicted effects on measurements of genetic
diversity. FST is a measure of population subdivision. Heterozygosity (H) and the number of segregating alleles
(Na) are measures of genetic diversity within populations.
Some characteristics of genetic diversity measures that support its use as an indicator of ecological condi-
tion are summarized in Table 2-2.
Table 2-2. Rationale for indicator of ecological condition based on genetic diversity
• Population-level measure rather than individual-level measure
• Measure of cumulative impact of multiple stressors on populations
• Integrative measure at multiple geographic and temporal scales
• Can be implemented as a nondestructive measure (e.g., DMA can be taken from fin
tissue or scales)
• May identify problems within species before species assemblage indicators
become significant
• Well-defined relationships between indicator and the size and connectivity
of populations
• Highly complementary to species assemblage and landscape indicators
• Useful indicator of population trends through temporal monitoring
• Quantitative measure of population and community "conservation worth"
10
-------
iror
ijjOffijcosystem Condition and Sustainability
2.2 Relevance of genetic diversity to assessment of population sustainability
Genetic diversity is both an input and an output of ecosystems. As shown in the previous section, its role
as an output tells us something about the processes occurring within ecosystems. As an input, genetic
diversity tells us something about the future sustainability of ecosystems. The importance of genetic diver-
sity for sustainability of ecosystems is due to its role in determining population fitness in the short term
and population sustainability in the long term.
Due to rapidly increasing human pressures, current rates of environmental modification may be greater
than those most modern species have experienced. For populations with little genetic variation, we expect
that the rate of adaptation to altered environments will be slow. If the rate of environmental change is too
great, then limitations on genetic diversity combined with high demographic costs of selection will not
allow populations to replace their numbers each generation and the populations will slowly slide towards
extinction (Lande and Shannon, 1996; Lynch, 1996; Lande, 1999). Rapid anthropogenic changes such as
global warming are expected to greatly increase the importance of genetic diversity to population persist-
ence in the foreseeable future (Lande, 1999).
is
the
of
i I I
Besides loss of adaptive potential, low genetic
diversity within populations can have an immediate
effect on overall fitness. Experimental evidence
suggests that most, if not all, diploid species harbor
recessive lethal alleles and partially recessive sub-
lethal alleles (Lande, 1999). If genetic diversity is
reduced rapidly, strong genetic drift will result and
many individuals will be homozygous for the
(sub)lethal alleles, allowing them to be fully
expressed. If the population size becomes very
small, sublethal alleles may actually become fixed
(all individuals homozygous for the deleterious
allele), causing a permanent decrease in fitness
and, consequently, increasing extinction risk. The
effect is often called "inbreeding depression" since it is associated with inbred populations in laboratory
experiments. The effects of inbreeding depression on population survival can be severe; most experi-
mental populations of animals that are purposefully inbred will eventually become extinct due to contin-
uous erosion of fitness (Soule, 1980; Frankham, 1995). On the other hand, a more gradual decline in
genetic diversity may allow (sub)lethal alleles to be purged by selection without greatly harming fitness.
A correspondence between loss of genetic diversity at marker loci and reduced fitness has been difficult
to document in wild populations, primarily because it is difficult to quantify population fitness in natural
populations. Nonetheless, evidence for a relationship between particular fitness components and het-
erozygosity has been shown in a number of species (reviewed in Allendorf and Leary, 1986). More recent
studies also suggest that a relationship between genetic diversity and fitness holds for a number of natural
populations (Table 2-3). In only one case, however, was a direct relationship between genetic diversity
and population extinction observed.
In addition to genetic diversity within populations, maintenance of genetic diversity among populations is
important to the long-term sustainability of many species. Populations of many species are adapted to
unique local conditions. In the long run, continued evolution of the species depends on these unique pop-
ulations and a major conservation effort is dedicated to seeking out and protecting these 'evolutionarily sig-
nificant' lineages. Removal of barriers to dispersal, either through habitat modification or translocation of
11
-------
Genetic Diversity as an
Ill I I
Table 2-3. Examples of genetic diversity studies of natural populations that demonstrated an association between
genetic diversity and population fitness.
Endpoint
Local population extinction
Lifetime breeding success
Colony growth and survival
Fertility, hatching rate
Mortality, growth, fecundity,
developmental stability
Developmental stability
Male reproductive success
Male reproductive success
Birth weight, neonatal survival
Molecular marker
Allozyme,
micro satellite
Micro satellite
Allozyme
Micro satellite
Allozyme
mtDNA,
microsatellites
Allozyme
Allozyme
Micro satellite
Taxon
Butterfly
Red deer
Ant
Prarie chicken
Topminnow
Elephant Seal
Butterflies
(2 species)
Gastropod
Seal
Reference
Saccheriefa/., 1998
Slate efa/., 2000
Cole and Wiemsasz, 1999
Westemeir etal, 1998; Bouzat
etal, 1998
Quattro and Vrijenhoek, 1989
Hoelzel etal., 2002
Carter and Watt, 1988
Rolan-Alvarez etal, 1995
Coltmanefa/., 1998
individuals, allows the transfer of locally maladaptive genes into populations, thereby reducing population
fitness. This effect is sometimes called "outbreeding depression". It is believed that the widespread prac-
tice of rearing fish stocks to supplement wild populations may have inadvertently reduced fitness and
caused local extinctions of many fish populations (Nehlsen etal., 1991). In addition, previously isolated
but reproductively compatible species have been allowed to come into contact, creating "hybrid swarms"
(Leary et a/., 1995). In the worst cases, native species diversity may be overrun by genes from hybridiz-
ing introduced species, effectively eliminating native species through dilution.
Long-term sustainability is affected by the ability of populations to adapt to changing environments.
Gilpin and Soule (1986) summarized the theoretical relationship between genetic diversity, adaptation,
demography, and population extinction in the "extinction vortex" model. Under normal circumstances, a
temporary decrease in population fitness due to a change in the environment is corrected via the effects of
selection acting on genetic diversity. The population is able to "track" the environment as it changes.
However, under circumstances of low genetic diversity where insufficient alleles are available for selec-
tion, adaptation will be slowed and inefficient (Figure 2-3). Instead, the population size will be reduced
due to poor fitness, which reduces genetic diversity and, in turn, reduces fitness. The separate processes
of fitness loss, population loss, and demographic effects feed off each other until the population finally
crashes.
While the concept that adaptation is dependent on genetic diversity is a cornerstone of evolutionary biol-
ogy, there is little empirical evidence that measurements of genetic diversity are correlated with long-term
sustainability or adaptive potential. Difficulty in defining and measuring adaptive potential may partly
explain the present lack of evidence. However, a more fundamental problem regarding inferences made
from measurements of genetic diversity with molecular markers also should be considered. Most molec-
ular markers are selected and analyzed because they show sufficient variation to elucidate patterns of
genetic variation within and among populations. With a few exceptions (e.g., some cases in Table 2-1),
little or nothing is known about the direct influence of these molecular markers on fitness, although the
12
-------
Condition and Sustainability
vast majority is believed to be effectively neutral (unselected). Neutral markers are preferable for sever-
al questions of interest, such as estimation of effective population sizes, levels of connectivity between
populations, and evolutionary distinctness of certain populations. They are problematic, however, when
the desired inference is to levels of genetic diversity at loci that directly influence fitness. These loci are
clearly under the influence of selection and therefore influences from neutral loci may not be accurate.
The issue is still under debate (Vrijenhoek, 1994; Lynch, 1996; Rodriguez-Clark, 1999) but, in general, it
is reasonable to conclude that if genetic diversity is high at neutral molecular markers then genetic diver-
sity at fitness loci also is probably high. If, however, genetic diversity is low at neutral markers then genet-
ic diversity at fitness loci is potentially depressed, but may be sufficient to avoid inbreeding depression
and loss of adaptive potential. A second way to approach the problem of extinction risk is to use genetic
diversity measures as estimates of effective population size. Estimates of the rate at which new genetic
variation in quantitative traits (e.g., fitness) is generated by mutation were used to suggest that populations
should be maintained at a minimum effective population size of 500 individuals for long-term sustain-
ability (Soule, 1980; Franklin, 1980). This number was estimated as the minimum needed to maintain
enough quantitative genetic variation to adapt to future environmental changes. Lande (1995) subse-
quently suggested that the minimum effective size estimate was too small by an order of magnitude. In
addition, at very low effective sizes, populations are at risk of immediate extinction from demographic
effects. Therefore, estimates of genetic diversity at neutral markers should be useful indirect measures of
both short and long- term sustainability because they can be used to estimate effective population sizes.
Some characteristics of genetic diversity that support its use as an indicator of ecological sustainability are
summarized in Table 2-4.
Table 2-4. Rationale for indicator of ecosystem sustainability based on genetic diversity.
• Indicator of long and short-term extinction risk when combined with demographics
• Measure of potential to adapt to modified environments (e.g., susceptibility to novel pathogens)
• Amenable to temporal monitoring, trends analysis
• Can be implemented as a nondestructive measure
2.3 Measurement of Genetic Diversity
A variety of methods have been developed to measure genetic diversity within species. The oldest and
most widely known is to simply measure morphological traits of individuals within populations. For traits
that are determined predominantly by single genes, such as flower color or seed color in Mendel's peas,
this provides a direct and straightforward measure of genetic variation. Fitness however, as well as most
other morphological, physiological, and behavioral traits, is influenced by many genes, by environmental
differences, and by interactions between different alleles, loci, and environments. In order to measure
genetic variation of these traits, it is necessary to first control for environmental variation, a decidedly dif-
ficult task when working with natural populations.
Over the last several decades, a number of molecular markers have been developed to aid in the measure-
ment of genetic diversity (see Avise, 1994). Since the 1970s, innumerable studies have assessed genetic
diversity in the electrophoretic properties of proteins, particularly blood group proteins and enzymes
(allozymes). The advantage of this system of analysis over measurement of morphological characters is
that the marker patterns are almost entirely free of environmental influences and one or, at most, a few
genes determine each pattern. A number of additional techniques emerged in the 1980s and 1990s for
measuring genetic diversity directly at the DNA level. One of the advantages of DNA-based analysis is
13
-------
Genetic Diversity as an
1 I I
Inbreeding depression
Heterozygosity
t Genetic Drift
Inbreeding
+ Adaptation
Environment tracking)
+ Population size
Density
A Replacement
rate
variability
Demographic randomness
Figure 2-3. Simplified diagram of Gilpin and Soule's (1986) Extinction Vortex model. Only the adaptation and
inbreeding vortices are shown. Two other vortices (population size and density) also interact with these vortices to
cause population extinction. Ne: effective population size.
that genetic diversity is measured directly at the most fundamental level, before intracellular modifiers
such as gene transcription, RNA splicing, and protein translation and post-translational processing can
modify the pattern. Another advantage of new DNA-based methods is that essentially all of the genetic
material within organisms is available for analysis. This has allowed researchers to target for analysis par-
ticular types of DNA that are most suited for answering specific questions. For example, mitochondrial
DNA rarely (if ever) recombines, has a relatively high mutation rate, and is usually only transmitted from
the maternal parent to offspring. This makes mitochondrial DNA particularly useful for tracking the
maternal pedigree of populations back through time (e.g., for humans, back to the 'mitochondrial Eve'),
and it makes genetic diversity in mitochondrial DNA particularly susceptible to the effects of small popu-
lation size (genetic bottlenecks). Other types of DNA may be targeted because they have exceptionally
high mutation rates and therefore generate many alleles, because they code for genes with known func-
tions that may respond to particular environmental modifiers, or because they are believed to be selectively
neutral so genetic diversity at these loci more accurately reflects evolutionary relationships between pop-
ulations and species.
14
-------
Condition and Sustainability
The mathematical foundations for measuring and interpreting genetic diversity and population genetic
structure were developed in the early part of the last century. However, progress in understanding genet-
ic processes in populations was limited until the recent explosion of molecular biological methods made
it possible to accurately apply and test genetic theory in the field. Presently, genetic diversity indicators
can be used to assess effective population sizes, past or present population bottlenecks, gene flow and,
under some circumstances, mutation rates and selection. The availability of new types of genetic markers
with different genetic properties, together with inputs from related efforts such as the human genome proj-
ect and DNA forensics, has fueled a new round of mathematical development that is continuing today. As
new methods to measure and interpret genetic diversity are rapidly increasing, the information about past
and present population processes that ultimately can be extracted from analyses of molecular markers is
almost certain to experience phenomenal growth.
15
-------
Genetic Diversity as an Indicator of Ecosystem Condition and Sustain
I I I
3 Case Studies
Two case studies are presented in this section to demonstrate the technical feasibility of incorporating
genetic diversity measurements into an ecological assessment and to help evaluate the environmental rel-
evance of this indicator. Many variables can affect the success of a genetic diversity indicator, ranging
from the quality of the genetic markers employed, to the technical abilities of the people involved, to the
appropriateness of the species surveyed. The two case studies described here utilized different approach-
es with different technical requirements. The first study represents a large regional survey in which a large
number of sites were assessed in order to create a regional genetic diversity profile for a single, moder-
ately sensitive minnow species. The study was based on the RAPD fingerprinting technique and utilized
very few samples per site. The second study included a much smaller number of sites, but evaluated more
fish per site. A moderately tolerant fish was evaluated in this study with both AFLP and mitochondrial
sequence data. Ideally, a genetic diversity assessment will evaluate multiple species over the study region.
In fact, analyses of additional fish species are underway in both study areas; this multi-species assessment
will be presented in a future document. Both case studies demonstrate the utility of integrating genetic
diversity information and other ecological indicator data into a single ecological assessment.
3.1 Genetics of Central Stonerollers in The Eastern Cornbelt Plains Ecoregion
3.1.1 Backgroun d
The Central Stoneroller (Campostoma anomalum). The study organism is a small minnow in the fam-
ily Cyprinidae (Figure 3-1). It is common and relatively abundant in runs or riffles over hard bottom in
clear running streams throughout much of the Eastern and Midwestern USA, exclusive of Southeastern
states. Stonerollers are bottom feeders, feeding primarily on algae and detritus they scrape off of rocks.
Males build nests in gravel pools at the tops of riffles in which females lay between 100 and 500 eggs.
Stonerollers are classified as moderately tolerant, preferring clean water in organically enriched streams
with thick growths of attached algae.
Figure 3-1. The central stoneroller (Campostoma anomalum). (Photo
courtesy of the Ohio Department of Natural Resources)
16
-------
Condition and Sustainability
The Study Area. The Eastern Cornbelt Plains (ECBP) Ecoregion (Omernick, 1995) encompasses most of
central and western Ohio as well as the central and eastern parts of Indiana (Figure 3-2). A finger of the
ecoregion extends into southern Michigan. The region once was composed of tall grass prairie, eastern
deciduous forest, and wetlands. Modern agriculture, industrialization, and urbanization have introduced
both point source (e.g., industrial discharge, waste water treatment plants, combined sewer overflows) and
non-point source (e.g., nutrient enrichment, toxic chemicals, sedimentation) stressors to the network of
streams in this ecoregion. Figure 3-3 provides an aerial view of the typical ECBP landscape. The genet-
ic study concentrated on watersheds in the eastern part of the ECBP, primarily in Ohio. Major watersheds
within this region include the Scioto, Great Miami, and Little Miami, which drain into the Ohio River, and
the Maumee, Sandusky, and Huron watersheds, which drain into Lake Erie.
Integration. This work was undertaken as part of a Regional Environmental Monitoring and Assessment
Program (REMAP) study initiated as a collaborative effort between USEPA Region 5, the Ohio EPA and
USEPA-NERL to answer questions about the overall ecological condition of the ecoregion. Specifically,
the REMAP project's purpose was to assess the stream resource status and to develop biological measure-
ments that would serve as quantitative indicators of the condition of those stream resources, the type and
magnitude of stress placed on the streams, and whether the resource condition changed over time.
Randomly selected sample sites along first through third order (wadeable) streams represented a variety
of land uses and impact sources (Figure 3-4) present in the ecoregion.
Field sampling. Fish were collected by electroshocking (Figure 3-5) using the EMAP protocols for wade-
able streams (USEPA, 1993). Central stonerollers were collected from 91 sites representing 10 large
watersheds in 1995. Sample sizes at each site were small (Table 3-1) because genetic diversity sampling
was not the primary purpose of the REMAP assessment. The caudal peduncle and caudal fin were removed
and placed into a cryovial labeled with the site name, frozen in liquid nitrogen, and shipped back to the
laboratory where the sample was stored at -80°C.
Laboratory methods and data analysis. A brief description of laboratory and analytical procedures is
provided here. We recommend that readers interested in designing a genetic study using this or related
techniques consult the more detailed description in Appendix 1.
DNA was extracted from each sample using standard procedures. For this study, a genetic "fingerprint"
or profile was constructed for each fish using the technique of randomly amplified polymorphic DNA
(RAPD) (Williams et a/., 1990; Welsh and McClelland, 1990). This is a relatively simple technique based
on agarose gel electrophoresis of anonymous PCR (polymerase chain reaction) fragments. A drawback of
the technique is that scoring of gels (detecting and sizing bands) has been found to be less repeatable than
for some other methods (e.g., AFLP, microsatellite analysis). To overcome this problem, bands were
scored semi-automatically using a fluorescence imaging system (Fluorimager 595, Molecular Dynamics)
and dedicated image analysis software (FragmeNT, Molecular Dynamics) (Figure 3-6). Additionally, each
sample was analyzed in triplicate. Bands were classified into size groups (bins) using cluster analysis.
These bins were further refined by discriminant analysis of band size and intensity characteristics. In
excess of 200 such bins were identified; however, 53 bins that were most repeatably scored (based on com-
parison of the three replicates for each individual) were selected for analysis. The genetic profile for each
individual was derived from the composite information from three replicates for these 53 fragment size
bins. These profiles were compared using a "similarity index" (Lynch, 1990; Leonard et a/., 1999).
Average genetic similarities within populations provide an inverse measure of genetic diversity while aver-
age genetic similarity between two populations provides an inverse measure of the "genetic distance"
between them.
17
-------
Genetic Diversity as an Indicator of Ecosystem Condition and Sustain
I I I
V
25 Q 25 SO Milts
ECBP StudySites
RF-1
ECBP
St. Joseph
Maumee
Sandusky
I Huron
Muskingum
Scioto
Great Miami
Little Miami
Wabash
White
Figure 3-2. Map showing sampling locations for the study. Major watersheds are highlighted. Borders between
USGS watersheds (8-digit hydrologic units or HUCs) also are indicated.
18
-------
r
Diversity as an Indicator of Ecosystem Condition and Sustainability
Figure 3-3. Aerial view of typical Eastern Cornbelt Plains landscape.
3.1.2 Key findings and their implications
Populations of stream fish are highly differentiated within the Eastern Cornbelt
Plains Ecoregion. A common statistic used to measure how different or similar populations are with-
in a geographic area is F^ (Section 2.1). Technically, FST estimates the "inbreeding", or increased likeli-
hood that the two copies of a gene within an individual are identical (homozygous), due to population
genetic structure. An estimate of F^ = 0 indicates that there is no evidence for geographic structure from
the samples analyzed and thus they may represent a single population. An estimate of FST =1 indicates
that none of the populations share the same alleles at polymorphic loci. For diploid sexual species, FST
values generally fall between 0 (no genetic structure and 0.3 (large genetic structure), with cases of FST >
0.4 being quite unusual. For stonerollers sampled from these ECBP sites, our estimate of FST was about
0.39 (Table 3-2) so we conclude that stoneroller populations are highly structured within the study region.
A nice feature of the FST statistic is that it can be used to estimate gene flow, or the average number of
migrants between sites each generation. Table 3-2 shows that over the entire study area, an average of just
0.4 individuals move between populations each generation (a generation is about 2-4 years for stonerollers
in this area). Of course, individuals should be much more likely to migrate to nearby streams, rather than
to streams that are more distant. Looking within major watersheds, and for large watersheds within USGS
8-digit hydrologic units (HUCs), we see that the average number of migrants between sites does tend to
be greater in the smaller areas. However, the pattern is not consistent for different watersheds, as some
(e.g., the Scioto) show very little gene flow, on average, while others (e.g., the Wabash) show very high
gene flow.
19
-------
Genetic Diversity as an Indicator of Ecosystem Condition and Sustain
I I I
Figure 3-4. A highly modified suburban stream. Urban run-off, non-native riparian growth,
bank erosion, and combined sewer overflows are common urban and suburban impacts to
streams.
Figure 3-5. Backpack electroshocking for central stonerollers.
20
-------
Condition and Sustainability
Table 3-1. Geographic location of 91 REMAP sites in the Eastern Cornbelt Plains Ecoregion from which central
stonerollers were sampled. The 11-digit USGS hydrological unit code (HUC) for the sub-basin is provided, along
with the REMAP site identification number, stream order classification, ecological impacts identified, Qualitative
Habitat Evaluation Index (QHEI), Index of Biotic Integrity (IBI), and number of stonerollers collected. KEY— RM:
River mile; A: agriculture/livestock; R: riparian degradation, C: channelization; U: urban/suburban/industrial; S: waste
water treatment plant or combined sewer overflow; n.c.: not calculated.
Basin
St. Joseph
Maumee
Sandusky
Huron
Muskingum
Scioto
USGS HUC
04050001 110
04100004010
04100006040
04100007010
04100007010
04100008030
04100008030
04100011 030
04100011 030
04100011 050
04100011 060
04100011 060
04100011 070
04100011 100
04100012010
04100012010
04100012030
04100012040
05040002010
05040003010
05040003010
05040006010
05040006 040
05060001 010
05060001 010
05060001 010
05060001 020
05060001 030
05060001 070
05060001 070
05060001 080
05060001 090
05060001 090
05060001 090
05060001 100
05060001 140
05060001 140
05060001 140
05060001 140
05060001 150
05060001 170
05060001 180
05060001 220
05060002010
Site Name
Pigeon Creek
E. Br. St. Marys R. (Clear Creek)
Beaver Creek
Dry Run
Auglaize River
Eagle Creek
Eagle Creek
Broken Sword Creek
Broken Sword Creek
Trib. to St. James Creek (RM 0.85)
Oak Run
LittleTymochtee Creek
Sycamore Creek
E. Br. Wolf Creek
Marsh Run
Walnut Creek
Trib. to trib. to Huron R.
Chappel Creek
Trib. to Black Fk. Mohican R. (RM 54.45)
Kokosing River
Kokosing River
Otter Fk. Licking R.
Trib. to S. Fk. Licking R.
Taylor Creek
Trib. to Taylor Creek (RM 7.60)
Trib. to Scioto River (RM 227.76)
Rush Creek
Panther Creek
Grassy Run
Trib. to Mill Creek (RM 41 .24)
Eversole Run
Flat Run
Trib. to Olentangy R. (RM 76.95)
Rocky Fork
Shaw Creek
Blacklick Creek
Blacklick Creek
Rocky Fk. Big Walnut
Rocky Fk. Big Walnut
Bunker Run
(none provided)
Lick Run
Hellbranch Run
Yellowbud Creek
Site
ID
399
173
403
268
270
283
284
436
446
280
273
275
448
431
440
443
428
426
435
142
423
145
151
168
172
263
170
277
155
171
123
140
437
438
138
129
130
146
148
144
152
338
126
308
Stream
Order
3
3
2
1
3
3
3
3
3
2
3
3
3
3
3
3
1
2
2
3
2
2
2
2
1
2
3
2
2
3
3
3
1
3
3
3
3
2
2
3
2
2
3
3
Impacts
A
A
A,R,C
A,R,C
A
A
A
A,R,C
A
A,R
A,R,C
A
A
A
A
A
U,R,C
none
A
none
A
R,C
none
A
A
A
A
A
A,C
A
none
none
A,R,C
R
A,C
U,S
U,S
U
A
none
n.c
A
U,S
A
QHEI
80
57
18.5
38
72.5
49.5
49
34
59.5
33
24
52.5
62.5
54.5
51
69
30.5
49
58
42.5
73
26
61
71.5
31
32
67.5
62.5
27
58.5
71.5
79
24.5
53.5
44
63.5
75
67.5
67
68.5
n.c
54
75
80.5
IBI
36
29
26
46
44
34
24
36
32
34
28
20
42
28
34
46
28
36
40
44
53
36
38
36
28
36
37
40
26
40
48
43
30
32
38
48
38
48
36
54
n.c
46
45
56
Sample
Size
10
10
9
10
10
10
10
7
10
9
10
8
10
10
10
10
9
10
9
10
10
10
10
10
3
9
9
10
10
9
10
9
8
10
10
10
6
3
8
10
10
10
18
20
(continues on next page)
21
-------
Genetic Diversity as an
Ill I I
Table 3-1. (Continued)
Basin
Great Miami
Little Miami
Wabash
White
USGS HUC
05060002010
05060002010
05060002010
05060002 030
05060002 030
05060002 040
05060003 020
05060003 040
05060003 040
05060003 050
05060003 050
05060003 060
05060003 090
05080001 010
05080001 020
05080001 030
05080001 030
05080001 040
05080001 150
05080001 170
05080002 030
05080002 040
05080002 040
05080002 040
05080002 040
05080002 050
05080002 060
05080002 060
05080002 070
05080002 080
05080002 080
05080003 080
05090202010
05090202010
05090202010
05090202 020
05090202 020
05090202 040
05090202 050
05090202 080
05090202 080
05090202 1 00
05120101 010
05120101 010
05120104070
05120107010
05120204020
Site Name
Hargus Creek
Scippo Creek
Congo Creek
Opossum Run
Bradford Creek
Hay Run
Sugar Creek
Bull Creek
Hardin Creek
Paint Creek
Sugar Run
Clear Creek
Trib. to Little Creek (RM4.22)
Trib. to S. Fk. Great Miami R. (RM 5.27)
Willow Creek
Brandywine Creek
Blue Jacket Creek
Little Indian Creek
Kings Creek
E. Fk. Buck Creek
Twin Creek
Trib. to Twin Creek (RM 18.29)
Toms Run
Bantas Fork
Toms Run
Millers Creek
Sevenmile Creek
Paint Creek
Fourmile Creek
Indian Creek
Indian Creek
Trib. to Dry Fk. Whitewater R. (RM 6.73)
Little Miami River
Trib. to N. Fk. L. Miami R. (RM 5.60)
Lisbon Fork
Clark Run
Little Beaver Creek
Painters Creek
N. Br. Caesar Creek
E. Fk. Todd Fork
E. Fk. Todd Fork
Turtle Creek
(none provided)
Wabash River
W Br Twelvemile Cr.
Middle Fork Creek
Sixmile Creek
Site
ID
341
342
343
125
294
309
292
302
330
305
306
329
307
160
271
162
163
236
166
217
248
254
255
258
259
324
256
257
1
353
355
358
220
221
223
224
253
320
297
313
314
318
177
178
87
100
35
Stream
Order
3
3
3
2
3
3
3
2
2
3
2
1
2
3
2
2
2
2
3
2
2
2
2
3
2
2
3
3
3
3
3
1
3
2
2
2
2
2
3
3
3
3
3
3
2
1
2
Impacts
u,s
A,S
A
R
A
A
A,R
none
A
A
A
A
none
A
A,R,C
A,R,C
A
A
A
A,R,C
A
A
A
A
none
none
A
A
A,R,C
S
none
none
A
A,R,C
A,R
A
U,S
A
A,R
A
A
S
n.c
A
A
A,C
None
QHEI
58
73.5
82
72.75
84.5
63.5
43
76
71
76.5
62
50
53
76.5
26
30.5
32
75
58
49
59
54
62
75
82
65.5
73.5
74
29
71.5
80.5
68.5
75.5
23.5
54
58
91
71.5
49
60
54.5
58.5
n.c
66
53.5
58
60
IBI
36
54
54
47
55
48
50
54
50
37
48
40
44
52
46
38
20
50
38
40
48
50
n.c
42
42
46
46
42
24
45
40
46
38
28
46
38
38
46
44
50
46
54
n.c.
31
31
40
48
Sample
Size
10
10
10
10
9
10
10
10
10
3
10
7
10
9
10
9
8
10
10
19
10
7
10
10
10
10
9
9
8
7
9
10
10
4
10
10
10
10
7
10
10
9
10
9
10
10
5
22
-------
r
Diversity as an Indicator of Ecosystem Condition and Sustainability
Populations of stream fish are geographically structured among and within water-
sheds. Another way to look at these data is to examine the patterns of relatedness between each of the
sampling sites. One can think of the populations as connected in a phylogenetic tree-like pattern, with the
caveat that, unlike relationships between different species, the patterns of relationships are determined by
both evolutionary descent and ongoing gene flow. Figure 3-7 shows a tree diagram constructed from a
matrix of genetic distance between each of the stoneroller collection sites using the neighbor joining
method. Populations in the same basin are colored similarly. The genetic distance between any two pop-
ulations is represented as the total length of the branch between them. Two things are immediately obvi-
ous from the figure. First, groups of populations in the same watershed tend to cluster together on the tree
(genetically similar). Second, some watersheds have more than one cluster of genetically related popula-
tions while some clusters include more than one watershed. For example, the Scioto basin (sites colored
green in Figure 3-7) is clearly separated into two highly distinct genetic groups. Examination of the geo-
graphic location of sites (Figure 3-2) indicates that the smaller genetic group is composed of populations
located in the extreme northern part of the basin. All are from a single USGS watershed (HUC 50600001),
but other populations from this same watershed are from the second genetic group. In contrast, popula-
tions in the Little Miami (sites shown in blue) and Great Miami (sites shown in purple) watersheds are not
highly distinct from each other.
Another way to visualize genetic relationships is with multidimensional scaling (MDS). In MDS analy-
sis, the relationships between each population and all other populations are represented in two dimensions
in order to more easily observe patterns. In Figure 3-8, the separation between the two genetic groups in
the Scioto basin is evident, as is the lack of strong separation between the Little Miami and Great Miami
Figure 3-6. Interpretation of marker patterns is aided by gel image analysis software
23
-------
Genetic Diversity as an Indicator
1 1 1
basins. However, it also is clear that many of the populations from streams in the Erie drainage are genet-
ically very similar, and are not strongly differentiated from streams in the Wabash basin. Based on this
genetic analysis, we can define at least 5 large genetic groups within the ECBP ecoregion.
The major significance of this analysis is that understanding the genetic structure of stream fish popula-
tions is requisite to drawing conclusions about the status of those species and, by extension, the habitats
that support them. Populations within each of the five genetic groups are likely to have some degree of
genetic interdependence among them, either due to ongoing gene flow or recent separation. These popu-
lations are likely to share common genetic traits (e.g, susceptibilities, behaviors) and have some degree of
demographic connectivity. Populations in different genetic groups, however, are likely to be highly inde-
pendent, demographically, genetically, and evolutionarily. These genetic groups, therefore, are appropri-
ate units for performing ecological analyses if the assessment question concerns the current and future sta-
tus of stoneroller populations. If central stoneroller population structure parallels that of other minnow
species, the geographic boundaries between these genetic groups delineate appropriate assessment units
for minnows in general. It should be noted that the boundaries of these genetic groups do not compare
well to watersheds defined by 8-digit USGS hydrologic unit codes (HUCs), although these are the typical
assessment units in current use.
A useful
method for measuring genetic diversity with DNA fingerprint data is to estimate the average genetic sim-
ilarity of individuals within populations (Sw; Lynch, 1990). This is an inverse measure of genetic diver-
sity, so populations with high genetic similarity are less diverse. Average genetic similarity within these
populations ranged from 58% to 84%. A somewhat surprising result of this analysis was that genetic diver-
sity within populations differed greatly among the five genetic groups (Figure 3-9). Populations in the
Great Miami and Little Miami basins (genetic groups 1 and 2) were slightly more genetically diverse (less
average genetic similarity within populations) than populations in the lower Scioto Basin (genetic group
3) and were much more diverse than populations in the Erie drainage (genetic group 5). These differences
in genetic diversity at geographically broad scales are not likely to reflect different anthropogenic impacts.
Instead, they probably reflect evolutionary differences in the amount of diversity within these groups. The
general trend of decreasing diversity with increasing latitude may reflect residual effects of the Wisconsin
glaciation, which would have wiped out fish populations over most of the study area. Regardless of how
the differences arose, these results suggest that the future sustainability of populations in the Little Miami
and Great Miami basins is much less in question than the sustainability of Erie drainage populations.
Table 3-2. Summary of population genetic structure analysis.
Geographic grouping
Overall
Maumee
Sandusky
Huron
Muskingum
Scioto
HUC 5060001
HUC 5060002
HUC 5060003
Little Miami
Great Miami
HUC 5080001
HUC 5080002
Wabash
Populations
91
6
7
4
5
34
20
7
7
10
19
7
11
4
FST
0.388
0.046
0.033
0.026
0.298
0.325
0.360
0.065
0.096
0.148
0.110
0.077
0.120
0.006
Average number of
migrants per generation
(Nem)
0.4
5.2
7.3
9.4
0.6
0.5
0.4
3.6
2.4
1.4
2.0
3.0
1.8
41.4
Significance
level
< 0.001
0.001
0.16
0.05
< 0.001
< 0.001
< 0.001
< 0.001
< 0.001
< 0.001
< 0.001
< 0.001
< 0.001
0.31
24
-------
I~TT
Genetic Diversity as an Indicator of Ecosystem Condition and Sustainability
140. , 306
123
263
172
. 438
342
152
305
178
446
273
280
448
275
436
341
130
338
Figure 3-7. Tree diagram depicting the estimated genetic relationships between populations at each site. This is an
unrooted tree which was made by employing the neighbor-joining algorithm to impose a tree structure on a matrix of
Nei's genetic distances between each site. Site IDs are color coded to match the map in Figure 3.2.
25
-------
Genetic Diversity as an Indicator of Ecosystem Condition and Sustain
I I I
I * $ * $ f
E E E
_:• ^ _:•
d1 O CJI
E E E -e
•3 5 J3 §
X :- ^k
2 ° ° Si
O O LJ *^-
w) w v) Q
OQOaaaQQQQOQQOQQIN — '-'-
OOOOOOOOOOOOOOOOOOOQ
1O O O O O Q O ^" "^J" ''J' ^C* 4O ^D "^O "90 "DO •OT iN CN CN
c^ i— ^~ i— T~ ^^* T"~ CT- o CTI c^> c^ >r^ -TTp ^r^ r^ r~i T~ i— ^~
^r «r ^y *r sr "^y ^3f u^ uft lA u^ ^^ ^b1 in I" u? >un ift irt irt
OOOOQOOQOQOOOOOOODOQ
o
a
26
-------
I~TT
Genetic Diversity as an Indicator of Ecosystem Condition and Sustainability
A
A
AB C
I
1
I
>W
Genetic Group
Figure 3-9. Box and whisker plot of genetic similarity within populations (Sw) for each of the five genetic groups
identified in Figure 3-8. Groups with low average genetic similarity have higher genetic diversity. Groups with differ-
ent letters have significantly different levels of genetic diversity (p < 0.05, Duncan's multiple range test).
Genetic diversity within populations is affected by environmental condition.
Correlations between the average genetic similarity within populations and various environmental meas-
ures are presented in Table 3-3. The environmental measures include the Qualitative Habitat Evaluation
Index (QHEI) and 8 metrics that are used to create the index. Other measures include tissue metabolite
assays for benzo(a)pyrene (BAP)-type compounds (generally associated with combustion by-products)
and naphthalene (NAPH)-type compounds (associated with oil contamination), and EROD assays which
measure enzyme activity induced by planar xenobiotics such as halogenated hydrocarbons and polycyclic
aromatic hydrocarbons (PAHs). The raw correlations were generally small. Among the QHEI metrics.
only the overall QHEI index and the cover and pool metrics were significant, although all correlations with
genetic similarity were negative, as would be expected (larger values of the metrics indicate better envi-
ronmental quality).
27
-------
Genetic Diversity as an
Ill I I
Table 3-3. Spearman (rank) correlations between average genetic similarity within populations (Sw) and various envi-
ronmental measures.
Environmental Variable
Qualitative Habitat Evaluation Index
Substrate QHEI metric
Cover QH El metric
Channel QHEI metric
Riparian QHEI metric
Pool QHEI metric
Riffle QHEI metric
Gradient QHEI metric
Gradient (m/km)
Average sampling depth (cm)
Latitude
Longitude
Stream Order (first-third)
Watershed Area (hectares)
Watershed perimeter (km)
Elevation (meters)
BAP (ug/mg protein)
NAPH (ug/mg protein)
EROD (pmol/min/mg protein)
N1
84
84
84
84
84
84
84
84
84
83
86
86
86
86
86
86
78
78
76
Mean
58.33
12.20
11.53
13.02
5.21
6.16
2.50
7.71
3.02
34.58
40.17
-83.63
2.47
8361
54.62
279.86
0.15
27.77
6.96
Std. Dev
16.85
5.31
3.99
4.27
1.96
2.64
1.95
2.20
2.98
17.96
0.62
0.77
0.63
3844
16.82
40.09
0.11
10.19
7.84
Spearman
Correlation
-0.229*
-0.166
-0.216*
-0.160
-0.022
-0.269*
-0.135
-0.121
-0.101
-0.302**
0.492***
0.268*
0.088
-0.050
-0.050
-0.170
-0.088
0.104
0.070
*p<0.05 **p<0.01 ***p< 0.001
Sample sizes vary because not all environmental measures were available for all sites.
As an exploratory analysis, variables that suggested a strong association with genetic similarity in simple
correlation analyses were included in a linear model to examine their explanatory power in determining
average genetic similarity [Sw was transformed (SW/(1-SW) to approximate normality]. Both quantitative
variables (Table 3-3) and categorical variables (genetic group and the impact factors from Table 3-1) that
were significant at the p < 0.1 level were included in the initial model. The least significant effects were
eliminated in a stepwise fashion until only effects that were significant at the p < 0.05 level were left in
the final model.
The final model (Table 3-4) was highly significant (F8 74 = 10.8, p < 0.0001) and explained about half of
the variation in Sw (R2 = 0.53). Significant effects in the final model included three impact factors: Urban,
Riparian, and Channelization, along with genetic group and average stream depth at the sampling location.
Latitude, though highly significant in the correlations analysis, was not significant in the linear model
because it was strongly associated with genetic group. It is interesting that the riparian and channeliza-
tion impact factors were significant while the QHEI riparian and channel metrics were not. The explana-
tion for this may be a nonlinear response of genetic diversity to habitat degradation, as only highly affect-
ed sites were classified as having impacts to the riparian zone or channelized habitat. Stream depth may
be significant as it is an indicator of stream size, and thus the carrying capacity of the habitat.
28
-------
nnr
Condition and Sustainability
Table 3-4. Final model explaining genetic similarity within populations following stepwise elimination of non-signifi-
cant effects.
Source
Impact factor- Urban
Impact factor - Riparian
Impact factor- Channelization
Major Genetic Group
Depth (covariate)
Degrees of
Freedom
1
1
1
4
1
Mean Square
5.369
4.882
2.452
6.311
1.974
F Value
11.20
10.19
5.12
13.17
4.12
Probability
0.001
0.002
0.027
< 0.001
0.046
We can use these relationships to derive a more precise ecological indicator based on genetic diversity.
Specifically, the deviation of the estimated genetic diversity within each site from its predicted value based
on historical biogeographic factors (genetic group) and habitat carrying capacity (stream depth) indicates
the relative genetic health of the population. Figure 3-10 shows a representation of this type of analysis,
in which the corrected genetic diversity at each site is classified as low, moderate, or high. As with any
ecological indicator, these results should be interpreted cautiously and only in relation to other indicators
of ecological condition.
is
dition, In addition to indicators that might directly affect genetic diversity, other environmental variables
were measured that are not expected to affect genetic diversity directly, but might be similarly affected. If
measures of genetic diversity are redundant with other indicators, then direct measures of genetic diversi-
ty may be unnecessary. Table 3-5 demonstrates that genetic similarity within central stoneroller popula-
tions is significantly correlated with two common indicators of environmental condition, the Index of Well
Being and Index of Biotic Integrity, but the correlation coefficients are small. Genetic similarity also is
correlated with several other environmental measures, however these correlation coefficients also are
small. We interpret these results to signify that the genetic similarity measure for central stonerollers is
related to but not highly redundant with these other indicators of ecological condition.
29
-------
Genetic Diversity as an Indicator of Ecosystem Condition and Sustain
I I I
• High genetic divecisty
Moderate genetk dtvernty
Not measured
Figure 3-10. Map depicting the relative genetic diversity at each site after correcting for genetic group and average
stream depth at the sampling site. Green: highest one third of sites; yellow: middle third of sites; red: lowest third
of sites
30
-------
Condition and Sustainability
Table 3-5. Spearman rank correlations of condition indicators with average genetic similarity.
Variable
Number of fish (per 300 m)
Weight of fish (kg/300 m)
Total fish species
Shannon Diversity Index (on number)
Shannon Diversity Index (on weight)
Index of Well Being
Index of Biotic Integrity
Number of sunfish species
Number of darter species
Number of sucker species
Number of Cyprinid minnows
Number of intolerant spp.
Percent tolerant fish
Percent top carnivores
Percent omnivores
Percent fish with anomalies
Number of sensitive or
moderately sensitive species
Number of sensitive species
Percent simple lithophils
Number simple lithophils
N
83
83
83
83
83
83
83
83
83
83
83
83
83
83
83
83
83
83
83
83
Mean Std.Dev. Spearman
Correlation
1454.00
14.73
17.88
1.93
1.79
8.32
40.71
2.40
3.57
1.98
7.03
0.76
45.79
1.72
18.10
0.28
822.59
4.31
27.63
6.34
1108.00
15.94
6.13
0.35
0.35
1.12
8.58
1.35
1.80
1.20
2.13
1.18
20.81
2.37
13.86
0.65
832.45
3.47
14.98
2.94
-0.235*
-0.243*
-0.310**
-0.025
-0.078
-0.228*
-0.287**
-0.168
-0.163
-0.270*
-0.321**
-0.130
0.137
-0.139
-0.096
0.049
-0.260*
-0.306**
-0.047
-0.302**
:p<0.05
p<0.01
p< 0.001
31
-------
Genetic Diversity as an Indicator of Ecosystem Condition and Sustain
I I I
3.2 Genetics of Creek Chubs in a Mining-Impacted Region
3.2.1 Backgroun d
The Creek Chub (Semotilus atromaculatus). Like the central stoneroller, this organism is a minnow in
the family Cyprinidae (Figure 3-11). It is common in small to moderate size hard bottom streams from
the Great Plains to the Atlantic coast. Creek chubs are omnivorous, eating primarily insects and plant
material. Some adults can reach substantial size (250-300 mm) and become significant fish predators.
Males build and defend nests in gravel beds where there is moderate current. Females lay a clutch of
approximately 25-50 eggs in the nest, which is covered with gravel by the male but not defended. Creek
chubs are fairly tolerant of turbidity and pollution.
Figure 3-11. The creek chub (Semotilus atromaculatus) (Photo courtesy of the Ohio
Department of Natural Resources)
The Study Area. The EPA initiated a regional demonstration project in the mid 1990s to evaluate the
condition of wadeable streams in the Middle Atlantic region of the United States. For this pilot study, we
focused on a smaller region within Pennsylvania and West Virginia underlain by coal-bearing geology and
for which coal mining operations are historically known. Five watersheds, as determined by USGS 8-digit
hydrologic units (HUCs) were represented: Upper Ohio, Connoquenessing, Upper Ohio-Wheeling, Little
Muskigum-Middle Island, and Little Kanawha (Figure 3-12); all are within the Western Allegheny Plateau
ecoregion. While this region is primarily forested, between 14 and 28% of the stream length in each of
the watersheds studied in this pilot has agricultural land cover in the riparian zone (USEPA, 1997).
Integration. This study was performed in conjunction with the EPA's Mid-Atlantic Integrated Assessment
(MAIA) project, under the Environmental Monitoring and Assessment Program (EMAP). EMAP is a
research program charged with developing the tools necessary to monitor and assess the status and trends
of national ecological resources and typically employs a range of ecological indicators in order achieve
this purpose.
32
-------
r
Diversity as an Indicator of Ecosystem Condition and Sustainability
• Sample Silts
Rf-l
States
K ffigi' III I
l)SH3flKI5
(HCGOIQI
y
97-074
9S-I32
97-UI6
WV
¥74*24
98-133
97-023
98-036
» II » 4U
Figure 3-12. Map indicating the location of sample sites within the five watersheds studied. The USGS 8-digit
hydrologic units are indicated. These watersheds are (from North to South): Upper Ohio, Connoquenessing, Upper
Ohio-Wheeling, Little Muskigum-Middle Island, and Little Kanawha. All sites fall within Pennsylvania and West
Virginia.
Field sampling. Fish were collected using the EMAP protocols for wadeable streams (USEPA, 1993).
Sites were chosen probabilistically in order to establish the overall condition of first through third order
streams in the area. Measurements of physical habitat, water and sediment chemistry, and biotic assem-
blage structure were used to develop quantitative indicators of the condition of stream resources and the
types and magnitudes of stresses placed on streams. Samples of between 9-28 creek chubs were collect-
ed in 1997-98 from 10 sites within the five watersheds (Table 3-6).
33
-------
Genetic Diversity as an Indicator of Ecosystem Condition and Sustain
I I I
Laboratory methods and data analysis. A brief description of laboratory and analytical procedures is
provided here. We recommend that readers interested in designing a genetic study using this or related
techniques consult the more detailed description in Appendices 2 and 3.
Total genomic DNA was extracted from the caudal fin of each sample using standard procedures. For this
case study, two analytical techniques were used: a genetic "fingerprint" was constructed for each fish
based on its nuclear DNA characteristics and a portion of its mitochondrial DNA was sequenced. The
DNA fingerprint was produced using the technique of amplified fragment length polymorphism (AFLP;
Vos et al. 1995). This is a relatively simple technique based on gel electrophoresis of anonymous PCR
fragments. Twenty percent of the creek chub samples were analyzed in duplicate. The genetic profile for
each individual was derived using the presence or absence of 109 polymorphic AFLP markers. As in the
Figure 3-13. Manual gel loading of samples in an acrylamide gel for electrophoresis.
first case study, these genetic profiles were compared using a similarity index approach (Lynch, 1990;
Leonard et al., 1999). A 590 base pair (bp) portion of the cytochrome B gene was sequenced in order to
provide a measure of genetic diversity in the mitochondrial genome.
To provide an ecological context for the genetic diversity patterns, a set of 25 environmental measures also
were examined. These environmental factors were analyzed and reduced to six independent "factors" that
explained most of the environmental differences between sites using principal components analysis.
Reducing the number of environmental variables and interdependence among them is very helpful in
understanding the difference between sites and performing statistical analyses. Relationships between
genetic diversity and these environmental factors were explored with multiple regression analysis.
34
-------
r
Diversity as an Indicator of Ecosystem Condition and Sustainability
—_
Figure 3-14. An AFLP fingerprint gel. The first lane and every 8th lane following it contains a 10 bp ladder to aid in
sizing the fragments. Lanes between these ladders contain AFLP fingerprints for different creek chub samples.
Polymorphisms are scored as the presence or absence of individual fragments in each lane.
3.2.2 Key Findings and their Implications
Mitochondrial DNA differences between creek chubs have a strong spatial compo-
nent. Among the DNA samples sequenced, 27 unique types of mitochondrial DNA (mitochondrial hap-
lotypes) were identified. By inspecting the pattern of mutational differences between these different
forms, a network or phylogenetic tree can be constructed that shows how one mitochondrial form has
evolved into another through time. One way to represent this data visually is in the form of a minimum
spanning network (Figure 3-15). In the figure, each colored circle represents one of the mitochondrial
DNA haplotypes identified. The frequency of the mitochondrial type in the overall sample is proportion-
al to the area of the circle, and its spatial distribution is represented by different colors. Lines connecting
circles represent mutations that differentiate different mitochondrial haplotypes; black dots represent hap-
lotypes that are inferred to exist (or have existed) but were not observed. The figure shows that two com-
mon, presumably ancestral haplotypes were identified, along with many related but infrequent haplotypes.
The five populations from the four northern watersheds have mitochondria that are from one group, while
the five populations from the Little Kanawha watershed (HUC 05030203) were predominantly of the other
35
-------
Genetic Diversity as an Indicator of Ecosystem Condition and Sustainabilit
I I I
Table 3-6. Geographic location of 10 sites within the Western Allegheny Plateau ecoregion from which creek chubs
were sampled. The name and 8-digit USGS hydrological unit code (HUC) for each watershed is provided, along with
the individual stream/site name, stream ID, stream order classification, and number of creek chubs used in nuclear
(AFLP) and mitochondrial genome analysis. Colors correspond to those of the watersheds depicted in Figure 3-12.
us
Watershed H
MAIA
« «.»*.
Stream
Order
Sample Size
for AFLP
analysis
Sample Size
for
mitochondrial
analysis
Upper Ohio
Connoquenes
sing
Upper Ohio-
Wheeling
Little
Muskigum-
Middle Island
05030105
05030101
05030106
05030106
Taylor Run
Millers Run
98-134
97-073
Short Run Creek 97-074
Upper Bowman Run 98-132
27
14
14
28
27
14
14
25
05030201 Little Fishing Creek 97-016
05030203
05030203
05030203
05030203
Neal Run
Spruce Creek
Walnut Fork
Oil Creek
Left Fork Steer
Creek
97-008
97-024
98-133
97-023
98-036
13
14
27
13
15
13
13
26
13
13
mitochondrial group. It is interesting to evaluate this geographically structured genetic pattern (or "phy-
logeographic" pattern) in relation to the geologic history of this area. Prior to the Pleistocene glaciations.
streams in the four northern watersheds drained into the Great Lakes/Laurentian basin while streams in the
Little Kanawha watershed drained into the Mississipi River via the ancestral Teays River. Current patterns
of mitochondrial genetic differentiation appear to reflect this historical structure.
A useful way to evaluate the geographic structure of mitochondrial DNA difference is with an analysis of
molecular variance (AMOVA, Excoffier et ctl, 1992), which is analogous to the common analysis of vari-
ance. Using this method, we can determine the proportion of all major variation that is due to differences
among regional groups (the northern four watersheds vs. the southernmost watershed), differences among
populations within watersheds, and differences among individuals within populations. The results of this
analysis indicate that 65% of all mitochondrial variability is associated with the regional differences, while
7% is associated with differences among populations within watersheds. Just 28% of mitochondrial vari-
ability occurs among individuals within populations (Figure 3-16). This information can be used to derive
an estimate ofFST of 0.72, indicating that mitochondrial haplotypes of creek chubs are very highly differ-
entiated in the study area.
36
-------
r
Diversity as an Indicator of Ecosystem Condition and Sustainability
Creek chub populations are strongly differentiated in the nuclear genome, though
the genetic structure is not as strong as for mitochondrial DNA. As was shown for
stoneroller DNA fingerprints in the first case study, the patterns of genetic similarities within and among
populations can be converted into genetic distances and subjected to a standard cluster analysis to reveal
the evolutionary relationships among populations. Figure 3-17 shows the pattern of evolutionary rela-
tionships among these ten populations. The length of the branch between any two populations is roughly
Figure 3-15. Minimum spanning network for 27 mitochondrial DNA haplotypes identified by sequencing a portion the
cytochrome B gene. Colors correspond to those of the watersheds depicted in Figure 3-12. Circle area is proportional
to the number of samples possessing that haplotype (the actual number is indicated if more than one. The letter "V"
indicates a transversion-type mutation. Black nodes represent hypothetical haplotypes not identified in the sample.
proportional to the genetic distance between them. The basic pattern observed for this nuclear DNA analy-
sis is similar to that observed for mitochondrial DNA: populations in the Little Kanawha watershed appear
to be genetically differentiated from the four northern watersheds. In addition, there is some suggestion
from the network diagram of a north-south cline.
An analysis of molecular variance can be constructed for the nuclear AFLP data, similar to the procedure
for mitochondrial data. The results of the nuclear DNA analysis are strikingly different than for the mito-
37
-------
Genetic Diversity as an Indicator of Ecosystem Condition and Sustainabilit
I I I
chondrial DNA. Just 5% of nuclear DNA variation is due to differences between the southern watershed
and the four northern watersheds. While differences among populations within each of these groups
accounts for 8% of the variation, similar to mitochondrial DNA, some 87% of all nuclear DNA variation
exists within populations (Figure 3-18). In contrast to the large mitochondrial FST statistic of 0.72, FST
calculated for nuclear AFLPs is just 0.13. This translates into estimates of effective migration rates
between populations of 0.2 based on mitochondrial DNA, and 1.7 based on nuclear DNA. Although we
cannot be sure of the cause of this discrepancy, it should be noted that the estimate based on mitochondr-
ial DNA tracks female gene flow (since mitochondria are maternally inherited) while nuclear DNA tracks
both male and female gene flow.
Among
Regions
64.8%
Within
Populations
27.8%
Among
Populations
within
Regions
7.4%
Figure 3-16. Distribution of mitochondrial DNA variation, as indicated by an analysis of molecular variance (AMOVA).
For this analysis, the four northernmost watersheds were considered one region while the southernmost watershed
was considered a second region. Each of the variance components was significantly greater than zero (p < 0.0001).
Environmental factors account for about half of the differences in mtDNA diversi-
ty within sites, and virtually all of the differences in nuclear (AFLP) diversity with-
in Sites. Measures of genetic diversity within each population, along with several environmental meas-
ures recorded for each site, are provided in Table 3-7. AFLP diversity was actually measured as average
pairwise similarity of individuals within the population (an inverse measure of genetic diversity), as was
done for RAPD fingerprint data in the first case study. Mitochondrial genetic diversity was measured as
the average number of mutational differences between two mitochondrial haplotypes in the population.
Six independent environmental "factors" (as determined by principal components analysis and varimax
rotation) accounted for about 98% of all the environmental variance in the 25 measurements taken.
Different types of measurements were related and were associated with different factors (Table 3-8).
Factor 1, for example, included many geochemical measurements (aluminum, calcium, chloride, etc)
while factor 4 was associated with spatial scale measurements (size of watershed, stream width and depth).
The relationship between mitochondrial genetic diversity (haplotype diversity) and these six environmen-
tal factors was explored by multiple regression analysis (forward selection model). Factor 2, which was
38
-------
r
Diversity as an Indicator of Ecosystem Condition and Sustainability
0.01
Figure 3-17. An unrooted neighbor joining network based on Nei's genetic distances (see Lynch, 1991), as estimat-
ed from AFLP polymorphisms. Populations in the same basin are colored similarly and correspond to the watersheds
depicted in Figure 3-12.
associated with environmental variables nitrate, total nitrogen, total phosphorous, and organic carbon, was
the only factor that was found to significantly explain mitochondrial diversity within populations. This
factor explained about 50% of the differences in mitochondrial diversity (R2 = 0.51, F = 8.4, p = 0.02).
Three environmental factors were found to significantly affect nuclear (AFLP) diversity within popula-
tions. Latitude and variables that were highly confounded with latitude, which included channel slope,
elevation, silica and zinc concentrations, (Factor 3) explains about 43% of the differences in AFLP diver-
sity (Table 3-9). Nitrogen, phosphorous, and organic carbon, (Factor 2) which was important in explain-
39
-------
Genetic Diversity as an Indicator of Ecosystem Condition and Sustainabilit
I I I
ing mitochondrial diversity, explains an additional 35% of AFLP diversity within sites. Finally, stream pH
and ammonium ions (Factor 5) explain an additional 18% of AFLP diversity. Together these environ-
mental factors explained almost all of the differences in AFLP diversity among sites (R2 = 0.98). Since
latitude and variables highly correlated to latitude accounted for 43% of differences, we conclude that at
least 53% of differences among sites in AFLP diversity are explained by non-geographic environmental
factors. Thus, it appears that both mitochondrial and nuclear genetic diversity of creek chubs in this area
are responsive to stream condition.
It should be noted that we hypothesized that geochemistry would have the strongest effect on creek chub
genetic diversity, since it represented the greatest environmental gradient and should be associated with
mining impacts. However, no association was observed between geochemistry and genetic diversity.
Interestingly, factor 2, which represented nitrogen/nitrate/phosphorus/organic carbon, was significantly
associated with both mitochondrial and nuclear genetic diversity. This may suggest that agricultural inputs
have a greater impact on stream condition in this area than mining inputs.
Among
Regions
64.8%
Within
Populations
27.8%
Among
Populations
within
Regions
7.4%
Figure 3-18. Distribution of nuclear (AFLP) DMA variation, as indicated by an Analysis of Molecular Variance
(AMOVA). For this analysis, the four northernmost watersheds were considered one region while the southernmost
watershed was considered a second region. Each of the variance components was significantly greater than zero
(p< 0.0001).
3.3 Generalizations from the Case Studies
Though quite different in scope, geographic area, and in the nature of the organisms examined, the two
case studies reveal several commonalities. First, the population structure of stream fishes in the Eastern
United States can be quite strong, even for very common, ubiquitous species. This structure is partly a
consequence of natural barriers to migration within stream networks, but the data indicates that it cannot
be entirely explained by watershed boundaries. In fact, defining the USGS watersheds (8-digit hydrolog-
ic units) as the basic assessment unit would appear to be misleading for both species studied. Future eco-
logical assessments that incorporate the population structure of stream fishes when measuring ecological
condition will result in more precise estimates of the condition offish populations.
40
-------
I~TT
Genetic Diversity as an Indicator of Ecosystem Condition and Sustainability
Table 3-7. AFLP and mtDNA diversity (mean ± S.D.) and some key environmental measures for 10 sites sampled.
Colors correspond to the 8-digit HUCs as depicted in Figure 3-12.
Stream
ID
98-134
97-073
97-074
98-132
97-016
97-008
97-024
98-133
97-023
98-036
AFLP
within population
similarity
0.828
±0.025
0.899
±0.007
0.882
±0.006
0.879
±0.005
0.868
±0.008
0.845
± 0.022
0.898
±0.006
0.899
±0.005
0.907
±0.009
0.911
±0.005
mtDNA
pair wise
differences
0.507
± 0.443
0.407
±0.398
1.000
± 0.722
0.581
± 0.484
1.643
±1.078
2.538
±1.458
1.198
±0.814
0.548
± 0.466
0.000
±0.000
0.308
±0.338
Latitude
(decimal
degrees)
41.054
40.347
40.160
39.786
39.606
39.254
39.073
39.054
38.870
38.763
Conductivity
(US)
672
1192
3590
308
213
339
153
162
188
158
Sulfate
(|leq/L)
2849
9222
28569
807
447
616
316
297
233
176
Total
Nitrogen
(Hg/L)
741
624
243
273
195
2011
223
111
260
368
Watershed
Area
(km*)
28
57
49
11
98
3
31
9
70
28
pH
8.31
6.61
8.31
8.01
7.77
7.69
7.73
7.48
7.44
7.50
In addition to strong genetic differences between populations, both assessments also found genetic diver-
sity within populations that varied in amount among populations. Both studies found strong associations
between the amount of this genetic diversity and various measures of environmental condition.
Unfortunately, the types of environmental information collected in the two studies were different so it is
not possible to generalize the types of environmental factors most likely to influence levels of genetic
diversity. However, the correlations that were observed should prove useful in focusing research to diag-
nose particular stressors that may impact fish populations. It is important to note that these inferences were
made for fish populations that appeared outwardly healthy, as the species were numerically abundant at
each of the sites measured. However, the genetic data revealed a scarcity of alleles in some populations,
presumably a consequence of differential environmental quality, that suggest concerns about future sus-
tainability.
41
-------
Genetic Diversity as an
Ill I I
Table 3-8. Environmental factors revealed by principal components analysis and the percent of total environmental
variation attributed to each factor. The mean and range of values for each variable within the factors are presented.
Factor 1
(Geochemistry)
(37.43%)
Conductivity
698 uS
(1 53-3590)
Aluminum
27 ug/l
(5-1 84)
Calcium
2,628 ueq/l
(940-7,394)
Chloride
1 ,044 ueq/l
(85-6,471)
Potassium
80 ueq/l
(39-179)
Magnesium
1 ,366 ueq/l
(341-6,066)
Sodium
3,669 ueq/l
(192-27,665)
Sulfate
4,353 ueq/l
(176-28,569)
Factor 2
(N/P/C)
(24.33%)
Nitrate
15 ueq/l
(1-81)
Total Nitrogen
505 ug/l
(111-2,011)
Total
Phosphorus
63 ug/l
(5-470)
Organic Carbon
80 mg/l
(39-1 79)
Factor 3
(Latitudinal
dines)
(14.14%)
Latitude
39.6°
(38.76-41 .05)
Elevation
274m
(240-350)
Channel Slope
1.0%
(0.10-2.10)
Silica
6 mg/l
(3-13)
Zinc
0.01 mg/l
(0.00-0.04)
Factor 4
(Spatial Scale)
(11.81%)
Watershed Area
38.4 km2
(3.2-98.1)
Stream Width
5.6 m
(2.4-11.1)
Stream Depth
29.9 cm
(13.2-59.4)
Factor 5 Factor 6
(pH-Ammonium) (Substrate
(6.71%) Condition)
(6.35%)
pH Percent
7.69 Substrate
(6.61-8.31) (>16mm)
68%
(20-96)
Ammonium Embeddedness
5.8 ueq/l 55%
(0.0-32.8) (10-86)
Percent Riffle
31%
(5-50)
Table 3-9. Forward stepwise multiple regression of AFLP within population similarity value as dependent variable.
Three variables met the 0.05 significance level for entry into the model.
Environmental
Factor
partial
R2
model
R2
F value Pr>F
PCA Factor 3
(Latitudinal clines) 0.4328
PCA Factor 2
(N/P/C) 0.3489
PCA Factor 5
(pH/Ammonium) 0.1841
0.4328
0.7917
0.9758
6.10
12.06
45.60
0.0387
0.0104
0.0005
42
-------
Condition and Sustainability
4 Considerations when Implementing a Genetic Diversity Assessment
Suggestions that genetic diversity should be used as an indicator of ecological health are not new (e.g.,
Beardmore et al, 1980; Nevo et al, 1988; Fore et al, 1995a, 1995b). However, specific guidance on
how genetic markers should be incorporated into an ecological assessment program is lacking. The
overwhelming majority of reported scientific studies of genetic diversity are at relatively small scales,
incorporating assays of at most a few dozen populations. Aspects of how to scale-up to region-wide
analyses with dense geographic sampling, including data management and quality assurance issues, have
not been considered. In addition, the focus of genetic diversity studies in the scientific literature is usu-
ally on the status of the species under study, not the ecosystem. The US EPA is currently engaged in
several studies designed to evaluate the utility and practicality of implementing genetic diversity indica-
tors as part of ecological assessment or monitoring. Based on our experiences with these studies and
relevant information from the scientific literature, several guidelines are suggested.
4.1 Sampling Design
Two different types of sampling designs have traditionally been employed for ecological assessments:
source-biased studies, in which sites with known exposures are compared to reference sites, and region-
wide studies, in which a large number of sites are sampled according to a defined sampling scheme in
order to create a regional profile. Some studies may have aspects of both designs, in which sites with
known exposures are compared to a relatively large number of "reference" sites within the region. Both
designs are amenable to genetic diversity analysis. Many examples of the source-biased design applied
to genetic diversity exist in the ecotoxicological literature (see Table 2-1). In addition, there are many
examples in the conservation genetics literature of genetic diversity analyses that incorporate regional
scales, although they rarely include large numbers of sample sites within the region. To our knowledge,
no examples yet exist of intensive regional ecological assessments that have utilized a genetic diversity
indicator.
The source-biased design has obvious cost advantages when the assessment question of interest is
whether a known, local exposure has an impact on the genetic diversity of resident populations.
However, considerable care must be exercised when implementing this design. Because the intent of
this design is to measure a recent genetic change, the reference population(s) will ideally be identical to
the test population(s) in all aspects except for the application of the specific exposure, yet independent
of the test population(s) following the exposure. Thus, the populations must have had similar genetic
diversity before the exposure, either because they recently diverged or because they experienced high
gene flow prior to the exposure. In addition, significant gene flow between the populations must have
stopped immediately following the exposure and any genetic differentiation that occurred must have
been due to the exposure and not to other population or environment-related factors. These standards
are likely to be difficult to meet. In practice, genetic diversity is often measured at a number of refer-
ence sites and compared to the exposed site. If genetic diversity at the exposed site is outside the norm
for the reference sites then the exposure is implicated as the cause of the change in genetic diversity.
Here too, there can be difficulties with interpretation. Often, reference populations are chosen to be geo-
graphically distant from the exposed population in order to ensure that they represent "reference condi-
tions". Typically, it is not clear that the reference populations are not each more closely related to each
other than any is to the exposed population and that any genetic diversity differences uncovered did not
predate the exposure.
Regional studies offer much greater ability to characterize patterns of intraspecific genetic diversity and
their possible causes than do source-biased studies. Genetic diversity will naturally vary among popula-
43
-------
Genetic Diversity as an Indicator of Ecosyiiijjf
I 1 1
tions for a variety of reasons, including variation
in the size of populations that different habitats
can support, as well as evolutionary relationships
and patterns of dispersal among populations.
These natural levels of genetic diversity are
indicative of fundamental population data, such as
effective population sizes and population bound-
aries (section 2.1). In addition, any attempt to
determine whether anthropogenic factors have
influenced present levels of genetic diversity must
be able to distinguish historical (evolutionary) pat-
terns from recent change. Since regional assess-
ments allow better characterization of the natural
variation in genetic diversity measures, they can
provide guidance for selecting specific areas for
more intensive study. For example, if genetic
diversity of one population is determined to be
low, it can be compared to evolutionarily similar
populations to determine whether a recent genetic
change is implicated.
ic to
of
is tte
to
af
• I I
The natural synergisms that genetic diversity data lend to landscape-level analyses and species assem-
blage studies suggest that incorporation of genetic diversity sampling into existing regional assessment
programs, including EMAP and Regional EMAP programs, is the preferred strategy to obtain genetic
diversity data at present. Sampling of target species for genetic diversity analyses can easily be accom-
plished within existing EMAP guidelines, and will help reduce costs of sample collection. One of the
advantages of DNA-based analyses is that sufficient DNA can be obtained from a single fish fin-clip to
perform a large number of genetic analyses. Thus, tissue can easily be collected in the course of normal
field identification procedures and, in most cases, specimens can be released back into the environment.
In the long run, the most efficient method to measure anthropogenic changes in genetic diversity is to
measure genetic changes directly through temporal monitoring. For this purpose, regional assessments
provide an excellent means to collect the necessary baseline genetic data for future comparisons. In
addition to regional-scale assessments, intensive temporal analyses of genetic changes at a network of
index sites will be extremely valuable. Index sites typically are assessed with a greater range of diag-
nostic and condition indicators, which will allow
for better integration of the genetic data. Since
index sites are intended for long-term monitoring,
they provide an excellent opportunity to measure
the temporal scale of genetic changes. For both
regional and index site studies, it will be critical
that DNA or tissue is archived for future analyses
as part of the assessment. Given the rapid
changes occurring in molecular methodologies, it
is likely that whatever marker is used to measure
genetic diversity initially will not be the optimal
strategy at later stages of the assessment. The
availability of archived samples will allow future
retrospective analyses to assess genetic changes
using the most appropriate technologies available.
In the the
to
in is to
i i i
44
-------
Condition and Sustainability
Scale-up issues. Several project management issues emerge when the scale of genetic analysis changes
from assessment of genetic structure at a few sites at a single point in time to assessment at regional
scales and/or over time periods that may encompass decades. Clearly, the greater management complex-
ity and cost of this type of project requires greater forethought in design of the genetic survey. The utili-
ty of a pilot study to guide project management cannot be overemphasized. The pilot study serves to
validate the choice of molecular markers and species in the study region and allows for initial assess-
ment of the feasibility of more intensive or large-scale sampling (Baverstock and Moritz, 1996). It is
very likely that the project plan will be redefined based on the results of the pilot study. For example, a
target species may be difficult to sample within the region or a number of molecular markers that were
found to be useful in other areas may lack polymorphism (and thus discriminatory power) within the
region. A pilot study also may determine whether the scale or intensity of sampling for a regional
assessment is appropriate.
Management of a large genetic diversity survey will be logistically simpler if it can be divided into
smaller units that are analyzed individually. For example, the EPA's pilot study of genetic diversity for
central stonerollers throughout the Eastern Cornbelt Plains Ecoregion (section 3.1) was divided into
analysis units that were equivalent to large watersheds or basins. In addition, analyses were completed
for central stonerollers before attempting to assess genetic diversity in additional species. Geographical
and species-stratified analysis has the additional advantage that fairly intensive genetic diversity assess-
ments can be completed for specific basins on a regular basis throughout the life of the project. While
the advantages of such compartmentalization of analyses are clear, it does lead to a potential for bias if
variation in laboratory procedures occurs over time (thus, between sample units). Inclusion of duplicate
samples from prior analyses as a type of positive control against temporal variation in laboratory proce-
dures should control for this type of bias.
Additional scale-up issues involve planning for adequate data management structure and are dealt with
in section 4.6.
4.2 Species Choice
Most genetic diversity studies in the scientific literature are focused on conservation or management of a
particular species, usually one that is threatened, endangered, or of some economic importance. In
choosing species as indicators of environmental health, other issues clearly come into play. Table 4-1
presents "optimal" species attributes for environmental monitoring. Many of these species attributes
stem from consideration of basic population genetics. Genetic diversity of species that are highly sensi-
tive to degraded habitat and that have short generation intervals will respond more quickly and more
dramatically than other species. Species with low dispersal ability will have higher exposure to specific
environments and may provide finer resolution of environmental differences between sites. Asexual
species, including microbes and algae, are not optimal mainly because the distinction between intraspe-
cific and interspecific genetic diversity is blurred. Valid use can be made of such species (e.g., Ford et
a/., 1998) but they become functionally equivalent to species assemblage indicators.
Other species characteristics listed in Table 4-1 help make collection, analysis, and interpretation of the
data simpler. Selection of species that are easily distinguished morphologically will ensure that genetic
diversity is measured at the intraspecific level, and that comparisons are not erroneously made between
different species (in fact, cryptic species complexes are readily diagnosed using molecular markers,
which is one advantage of combining analysis of genetic diversity with species assemblage assess-
ments). Selection of broadly distributed species allows simpler analysis of scale issues. Use of species
that are important to resource managers will allow easier integration of genetic diversity monitoring into
existing monitoring programs. Species that are moderately abundant within the study area are easier to
45
-------
Genetic Diversity as an
Table 4-1. Optimal characteristics of species assessed with a genetic diversity indicator.
Optimal Species Characteristics
• Short generation interval
• Moderate-high exposure to stressor(s)
• Moderate-high sensitivity to stressor(s)
• Low-moderate dispersal ability or highly philopatric
• Sexual reproduction
• Native species
• Broad distribution
• Moderate abundance
• Management importance
• Morphologically distinct
• Known life-history parameters (age structure, sex ratio, etc)
• Availability of comparative genetic and demographic data
• Availability of historical DNA or tissue samples
• Amenable to laboratory culture
collect, although these species are usually not management priorities. The population genetic structure
of species that have not been excessively moved around is generally easier to interpret than the structure
of introduced species. This is because the stocking history of introduced species is generally poorly
known; non-native species could be useful indicators in cases where the history of introductions and
their sources are well documented. Availability of historical data, including the availability of archival
samples (DNA, fins, scales, or whole preserved specimens) is useful for reconstructing changes in
genetic diversity that may have occurred prior to or during known exposures in the past. Finally, the
selection of species that can be cultured in the laboratory will aid in assigning causality to genetic
changes, if such studies are desired in the future.
Ideally, the indicator will be applied to several species, since a multi-species index should better predict
ecosystem status than a single-species index. Genetic responses of individual species are not always
predictable (Gillespie and Guttman, 1999), as different species "see" different aspects of the environ-
ment, and not always what we expect. If several species are selected then additional considerations
become relevant, such as sampling from phylogenetically and ecologically diverse taxa.
4.3 Which Genetic Marker?
Several books have been published in recent years that review the biological and analytical properties of
different molecular markers (e.g., Avise, 1994; Hillis et al, 1996; Caetano-Anolles and Gresshoff
1997). A general consensus is that no genetic marker is best for all applications and in the hands of all
investigators; each provides different insights and requires different levels of investment in equipment
and training. Here, we will summarize the properties of some of the most common genetic markers and
discuss their relative strengths and weaknesses as ecological indicators.
46
-------
r
Diversity as an Indicator of Ecosystem Condition and Sustainability
Allozymes. Allozyme electrophoresis is a simple and time-honored technique in the field of population
genetics. The principle of the technique is that allelic forms of enzyme proteins with different net
charges will have different mobilities when induced to move through a matrix by application of an elec-
tric current (electrophoresis). Alternative forms of the enzyme at the enzyme locus (alleles) migrate dif-
ferent distances through the matrix and are visualized by histochemical staining (Figure 4-1). Numerous
manuals have been developed that outline general equipments needs, procedures, and gel pattern (zymo-
gram) interpretations (see May, 1992; Murphy et al, 1996; and references therein).
Although allozymes are often thought of as "old technology", they have some clear advantages over
other genetic markers as ecological indicators. Few other markers can match allozymes in the simplici-
ty and economy of standard procedures. Allozymes have a much longer history than other genetic mark-
ers and have been analyzed in many more laboratories so the historical database of comparative popula-
tion genetic data is much larger for this marker than for any other. In fact, it is highly likely that
allozyme datasets can be found for any of the common stream fishes in the country. In comparison to
some other markers, allozyme techniques suffer from a limited number of marker loci available for
study and a limited number of alleles per locus. Less than a dozen polymorphic markers are assessed in
typical studies, and most of these markers only segregate for two or three variant alleles. As noted earli-
er, allozyme loci are more likely to be affected by natural selection than most DNA markers, which may
bias estimates of gene flow and genetic diversity. For example, allozyme markers suggested significant
gene flow in oysters along the eastern and Gulf coasts of North America, but both mitochondrial and
nuclear DNA markers indicated a sharp biogeographic boundary between northern and southern popula-
tions in northeastern Florida (reviewed in Avise, 1994).
Figure 4-1. A histochemically stained starch gel showing GPI allozyme loci of rainbow trout. Each vertical lane rep-
resents a different individual.
47
-------
Genetic Diversity as •anAriKiiSlJ
1 1 I
'' - ''-.
Mitochondrial DNA is a circular DNA molecule containing about 16,500 base pairs that is present in
multiple copies in the cells of eukaryotic organisms. Mitochondrial DNA has a number of properties
that lend it to ecological assessments. Mutation rates tend to be higher than for most nuclear DNA
regions, so large numbers of alleles (called haplotypes) are generated. Each individual usually only pos-
sesses one mitochondrial haplotype, which it typically inherits from its maternal parent. In addition,
recombination within the mitochondrial genome appears to be rare or absent. Unlike different allozyme
alleles, mitochondrial DNA sequences (Figure 4-2) can be analyzed to determine patterns of evolution-
ary relationships between different haplotypes. Thus, analysis of mitochondrial DNA sequences adds a
different dimension to the analysis of genetic diversity; one can move beyond asking whether two indi-
viduals are the same or different at a genetic locus to ask how different they are at that locus. This pro-
vides a straightforward method for assessing (maternal) genealogical relationships between individuals
of a population, and between individuals of different populations and different species.
Figure 4-2. Electropherogram of DNA sequence generated by an automated genetic analyzer. Identification of muta-
tional relationships among mitochondrial types (haplotypes) can elucidate patterns of evolutionary relationships
among populations.
Mitochondrial DNA is not without drawbacks, the most important of which is that the lack of recombi-
nation within the molecule causes the entire mitochondrial genome to behave effectively as a single
genetic marker; different mitochondrial genes are linked and therefore not independent. Since the histo-
ry of a mitochondrial lineage is not identical to the population history (most notably, it says nothing
about male contributions), interpretations made strictly from mitochondrial DNA may be erroneous.
The tremendous information content derived from DNA sequencing comes at a cost in terms of equip-
ment, supplies, and technical expertise (Figure 4-3). Constant technological improvements are leading
to rapid reductions in these costs. For example, the development of the polymerase chain reaction
(PCR) and "universal" PCR primers has negated the need to isolate mitochondrial DNA away from
nuclear DNA, or to clone specific fragments prior to sequencing (instead, the target sequence is simply
PCR amplified). Meanwhile, a number of techniques have been developed that reduce the need for
DNA sequencing or the number of required sequencing reactions. A common strategy is to screen for
restriction fragment length polymorphisms (RFLPs). RFLPs provide a coarse indicator of DNA
sequence variability, typically capturing less than one-eighth of the DNA sequence variability in a
region. However, RFLP patterns contain information about relationships between haplotypes and can be
analyzed relatively inexpensively. Other methods are used to "prescreen" mitochondrial DNA, so that
only unique or previously uncharacterized haplotypes are sequenced. These include single strand con-
formation polymorphism analysis (SSCP), denaturing gradient gel electrophoresis (DGGE) and various
commercial strategies. These techniques typically identify 80% to 100% of single-base mutations within
DNA, but usually say little about haplotype relationships. Still, when combined with DNA sequencing,
they can be more informative than RFLP analysis with only slightly more technical difficulty and cost
(Figure 4-3).
48
-------
I~TT
Genetic Diversity as an Indicator of Ecosystem Condition and Sustainability
Information per Marker
RAPD
AFLP
Minisatellite
VNTR
DMA
Sequencing
RFLP
SSCP DGGE
Cost/Equipment/Technical Inputs
Figure 4-3. Relative advantages and disadvantages of different molecular marker strategies in relation to information
per marker (darker colors in the figure being more informative), the number of markers per typical study, and costs per
study in terms of capital outlay and technical expertise. The figure is not based on quantitative data and is presented
for illustration purposes only.
Nuclear DNA Sequencing, RFLPs, and Prescreening Strategies. Strategies that are
available for analysis of mitochondrial DNA also are available for the analysis of nuclear DNA genes.
with similar advantages and limitations. The general strategy is often referred to as single-copy nuclear
DNA (scnDNA) analysis. Typically, non-protein coding intervening sequences (introns) or flanking
regions are targeted for analysis since they are usually more polymorphic than coding sequences. The
primary advantage of scnDNA analysis is that many more genetic markers that are independent are
available for analysis, so it can be highly complementary to mitochondrial DNA analysis. The develop-
ment of gene sequence databases for many organisms, combined with PCR technology, has made avail-
able a number of well-studied genes for population analyses. A number of "universal" PCR primers
have been published (e.g., Palumbi, 1996) to aid in the development of genetic markers for different
species, although a certain amount of primer modification is often required. Because many of the gene
sequences available for analysis are believed to have an impact on fitness, these, like allozymes, have
the potential to be developed as diagnostic indicators of natural selection (and thus population stress).
There are a number of disadvantages with scnDNA markers. In general, levels of polymorphism are low
compared to mitochondrial DNA and some other nuclear DNA markers. In addition, the analysis is rela-
tively intensive, even when using mutation-prescreening techniques, so relatively few scnDNA markers
are generally analyzed per study.
49
-------
Genetic Diversity as an Indicator of Ecosystem Condition and Sustain
I I I
Multi-locus DNA Fingerprints. DNA fingerprinting is a strategy that is in many respects the opposite
of scnDNA analysis. Instead of targeting single, relatively well-characterized genes, DNA fingerprints
target many anonymous chromosomal regions for analysis simultaneously. Typically, little is known
about these regions except that they possess a small region of similarity to specific probes or PCR
primers. Any region that has such homology presents as a distinct DNA fragment or "band" following
gel electrophoresis. For any one individual, the pattern of presence and absence of bands can be highly
complex, resembling a bar code. With the most variable markers, individuals can be uniquely identified
by these band patterns. The most common DNA fingerprinting technique is random amplified polymor-
phic DNA (RAPD) analysis (Figure 4-4a), in which short primers of arbitrary sequence are used to
amplify DNA fragments from 10-50 discrete regions in the genome. A newer method called amplified
fragment length polymorphism (AFLP) analysis (Figure 4-4b) is similar, but depends on amplification of
polymorphic fragments generated by restriction enzymes (RFLP) from arbitrary regions of the genome.
I 1 I I I I
= -.
Figure 4-4. Examples of multilocus DNA fingerprints. Both RAPD (A) and AFLP (B) fragments can be generated
without a priori knowledge of an organism's sequence. As dominant markers, homozygous and heterozygous indi-
viduals cannot be distinguished. The AFLP fingerprint here differs from Figure 3-14 because this is a multiplex
AFLP reaction generated with an automated genetic analyzer.
50
-------
r
Diversity as an Indicator of Ecosystem Condition and Sustainability
There are two principal advantages of multilocus fingerprints for ecological assessments. The first is
that no sequence information about the genome of the organism is needed in order to apply the methods.
Thus, marker development costs are minimal and species can be chosen for analysis based on ecological
or management criteria rather than the amount of sequence information already known. The second
advantage is that genetic differences between individuals can be distinguished with relative economy.
Dozens to hundreds of markers are analyzed in a typical DNA fingerprinting study.
The overriding disadvantage of DNA fingerprints is the poor quality of genetic information from each
individual fingerprint band. Individual fingerprint bands cannot reliably be assigned to independent
genetic loci. More importantly, RAPD and AFLP markers are dominant markers, which means that indi-
viduals that are heterozygous for a marker (i.e, only one chromosome of the pair has the marker) cannot
be distinguished from individuals that are homozygous for the marker (both chromosomes have the
marker), significantly decreasing the genetic information available. As a result of these ambiguities,
comparisons are generally made in terms of overall 'similarities', taking into account the proportion of
bands that are shared between individuals. Another concern with DNA fingerprinting methods is that
sophisticated procedures must be implemented to reliably sort through the complex fingerprint patterns
to identify homologous fingerprint bands from different individuals. Related to this is concern about the
overall reliability of fingerprinting methods, particularly RAPD fingerprints. The RAPD technique is
known to be very fickle, and adherence to exact protocols by different labs, often including use of the
same brand of equipment and reagents, is considered critical to repeatability.
Microsatellite DNA Markers. Microsatellite DNA, also called simple sequence repeats (SSRs) are
regions of repetitive DNA that consist of tandem repeats of a core sequence of two to five base pairs,
such as (CA), (TAGA), and (CAT). Different alleles at a microsatellite locus differ in the number of tan-
dem repeats of the core sequence. These sequences appear to be ubiquitous in the genomes of eukary-
otes, and thousands of potential microsatellite markers could be developed for most species.
Figure 4-5. Flourescently labeled microsatellites detected using an automated genetic analyzer. Use of flourescent-
ly labeled markers allows differentiation between multiple loci (illustrated above by blue, green and yellow labeled
markers) within the same reaction (multiplex PCR) thus reducing cost and increasing throughput.
51
-------
Genetic Diversity as an
Ill I I
Microsatellite DNA markers have some tremendous strengths for ecological assessments. They are sub-
ject to very high mutation rates relative to scnDNA, sometimes producing dozens of alleles (Figure 4-5).
Like allozymes and scnDNA markers, inheritance of microsatellite markers is codominant, so heterozy-
gotes can usually be reliably differentiated from homozygotes. The proportion of individuals that are
heterozygous in a population is much higher than for other nuclear loci, often approaching 100%.
Although heterozygosity at microsatellite loci is somewhat less sensitive to genetic bottlenecks than
mitochondrial DNA diversity, it is much more sensitive than other nuclear DNA markers due to the large
number of rare segregating alleles at these loci. Loss of these rare alleles actually provides a more sen-
sitive measure of population bottlenecks than does heterozygosity (Leberg, 1992). It appears that the
predominant mode of mutation is to an allele one repeat-unit different from the original allele [e.g., from
(CA)12 to (CA)13], thus genealogical information can be captured from allelic relationships of microsatel-
lite loci, although this information is less precise than that captured from DNA sequence analysis of
mitochondrial DNA and nuclear genes. The primary disadvantage of microsatellite DNA markers is
development cost. Technical expertise necessary for microsatellite marker development is greater than
for any of the other markers listed, although, like DNA sequencing, the technical demands are decreas-
ing. Microsatellite markers are developed from non-protein coding DNA regions and, therefore are not
conserved across taxa, so that microsatellite DNA markers developed for one species are often only use-
ful for very similar species. The number of organisms for which microsatellite markers have been
developed is increasing rapidly, so it is possible that microsatellite development will be less costly in the
near future. In addition, the very large number of alleles present at some microsatellite loci requires
large sample sizes be used to estimate allele frequencies accurately.
Recommendations. This report has considered only a subset of the available genetic markers, but these
are the most common and well supported in the scientific literature. The "best" marker for ecological
assessments will vary, depending on the specifics of each situation. Until reliable methods are devel-
oped that allow economical analysis of nuclear DNA sequences from more than just a few genes per
study, RFLP, SSCP, DGGE and sequence analysis of nuclear genes will likely be less informative than
allozyme or microsatellite studies per unit effort. Mitochondrial DNA can be highly informative and
offers insights not available from analysis of nuclear DNA. However, information from mitochondrial
DNA may be misleading if interpreted alone so it is recommended that mitochondrial DNA be assessed
only in conjunction with other markers.
Among the remaining markers, microsatellites undoubtedly offer the best combination of information
per genetic marker and potential for analysis of many genetic markers. Although most microsatellite
studies of natural populations to date have utilized relatively few microsatellite markers, there is now no
technical reason why dozens, even hundreds of microsatellite markers cannot be developed and applied
to genetic analyses. The technical challenges of microsatellite development can be overcome (for exam-
ple, several commercial laboratories will now develop microsatellite markers on contract), however, an
advanced laboratory is still required for microsatellite analysis. Reliable scoring of microsatellite mark-
ers requires the use of labeled PCR primers (either isotope or fluorescence) for visualization so, mini-
mally, laboratories must have the ability to detect these labels. For large-scale, regional analyses utiliz-
ing many microsatellite markers, automated laboratory analysis using commercial genetic analyzers
(automated sequencers) is essential.
The many advantages of allozymes, (economy, standardized methods, large existing database of infor-
mation, homology of loci across species) should not be overlooked, particularly when microsatellite
analysis is infeasible. Targeted analysis of specific allozyme loci (e.g., GPI) also may be useful as diag-
nostic indicators of specific stressors (e.g., heavy metals), and could complement analyses of other
genetic markers. However, the requirements for ultra-cold storage of tissue samples in the lab and in
52
-------
Condition and Sustainability
the field may make allozyme analysis impractical for regional analyses, particularly if the genetic collec-
tions are done as part of a multi-indicator assessment. In addition, if genetic diversity of allozymes is
too low overall in the indicator species (as determined, perhaps, by a small pilot study), or if lethal sam-
pling is not acceptable, then other methods should be explored. AFLPs and RAPDs, in that order,
should be considered if neither allozyme nor microsatellite studies are feasible. Both methods allow
assessment of many different genetic markers, but identification of AFLP markers is believed to be more
repeatable between laboratories. However, AFLP analysis, like microsatellites, requires a more
advanced laboratory able to detect isotopically or fluorescently labeled PCR products. Mitochondrial
DNA analysis may be combined with any of the above nuclear markers to provide semi-independent
genetic information. For example, analysis of evolutionary relationships among mitochondrial haplo-
types may provide information on the historical biogeography of the species. Such information can help
interpret patterns of genetic diversity in nuclear markers. A flow chart to aid in choosing genetic mark-
ers is provided in Figure 4-6.
4.4 Sample Size Considerations
Clearly, statistical power to detect differences in genetic diversity within populations and to detect genet-
ic differentiation among populations will increase with increasing sample size. Two different sample
sizes need to be considered: the number of individuals sampled per population and the number of mark-
ers assessed per individual. For any one marker, the ability to discriminate allele frequency differences
between populations is low so large numbers of individuals need to be sampled from each population.
\n general, as more markers are assessed per individual, fewer individuals need to be sampled from
each population. However, the balance between the number of markers and the number of individuals
varies with the marker chosen. Recommendations for determining appropriate sample sizes are
reviewed in current literature (Baverstock and Moritz, 1996).
In general, genetic markers that allow estimation of molecular relationships (DNA sequences, RFLP, and
microsatellites) should require smaller sample sizes to achieve the same statistical power as markers that
only consider allele frequencies, although this depends on the level and complexity of molecular rela-
tionships. Estimates of allele frequencies for microsatellite markers with high polymorphism (e.g., 10 or
more alleles) can have very high standard errors since few individuals in the sample will possess any
one allele.
For dominant markers (RAPD, AFLP) in which heterozygotes cannot be distinguished from one of the
homozygotes, less information is available per locus and therefore more individuals or more markers
must be sampled for the same statistical power as analysis of codominant loci. Often, logistical prob-
lems limit the number of individuals that can be sampled so statistical power is gained by sampling as
many loci as possible. Simulation studies by Mariette et al. (1999) suggest that at least four times as
many AFLP markers are needed as microsatellite markers to estimate genetic diversity within popula-
tions that are at equilibrium between migration and genetic drift. Comparatively more AFLP or RAPD
markers will be needed to measure genetic diversity of populations that are not at equilibrium.
Mitochondrial DNA, which is only transmitted from female parent to offspring (haploid inheritance),
also has less genetic information than a single codominant marker, based on analysis of haplotype fre-
quencies alone. Reduction in the standard error of estimates of haplotype frequencies can only be
accomplished by increasing the number of individuals sampled.
Although it is difficult to come up with specific numbers, as a rule of thumb, sample sizes of between
50 and 100 samples per site are typically targeted in population genetics studies, although actual sample
sizes are often lower. Analyses that utilize dominant marker systems should aim to assess between 100-
53
-------
Genetic Diversity as an Indicator of Ecosystem Condition and Sustainabilit
I I I
Are suitable microsatellite markers
available, or can they be developed or
purchased?
NO
YES
Is the lab equipped for
fluorescent DMA fragment
detection?
YES
NO
Is there
allozyme
diversity?
I
NO
YES
Consider
microsatellite
study
YES
Is lethal sampling
and -70° field
storage feasible?
NO L
Consider
allozyme
study
NO
Is the lab equipped for
fluorescent DNA
fragment detection?
Consider
RAPD study
Does the lab have a DNA
seauencina capability?
YES
Consider
AFLP study
YES
Consider combination of
mitochondria I and
nuclear genetic markers
Figure 4-6. Flow diagram for deciding the best genetic marker or combination of genetic markers in relation to avail-
able resources.
54
-------
Condition and Sustainability
200 polymorphic markers. Analyses that utilize codominant markers should aim for between 20 and 50
polymorphic loci.
4.5 Personnel Training and Specialized Equipment
Significant technical expertise is required for field sampling, laboratory analysis, and data analysis and
interpretation. The degree of field expertise is equivalent to that needed for species assemblage indica-
tors. Personnel must be able to operate sampling gear such as backpack electroshockers or seines effec-
tively. In addition, they must have the ability to identify target species in the field. In practice, species
discriminatory skills are probably less important than for assemblage indicators because the validity of
the field identification will likely be confirmed by the genetic analysis.
Laboratory analyses require varying levels of skills. DNA extraction and quantitation techniques have
become highly commercialized and many simple kits can be used that yield DNA of suitable quality and
concentration for PCR-based analyses. The primary skill necessary is accurate pipetting. PCR analysis
also is relatively straightforward and usually only requires good pipetting skills. However, a number of
factors can disrupt the PCR process, and troubleshooting problems is a common occurrence in PCR
assays. For this reason it is recommended that M.S.-level person with molecular biology training be on
hand to supervise or consult during the PCR phase. A similar skill-level is recommended for gel prepa-
ration, sample loading and electrophoresis, and for operating automated DNA sequencers of genetic ana-
lyzers. Interpretation of gel patterns to determine the allelic complement at each marker analyzed gener-
ally requires a skilled, M.S.-level molecular biologist. In general, improvements in technology, such as
highly robotic capillary electrophoresis systems for genetic analysis, have decreased the training require-
ments necessary to perform genetic analyses. Development of microsatellite markers requires sophisti-
cated molecular biology skills.
Equipment requirements to perform different laboratory operations are described in Table 4-2. A num-
ber of software packages are available for genetic analysis, and similar analyses can be performed with
general-purpose statistical software such as SAS®. Selection and interpretation of appropriate indices is
best done in consultation with a population geneticist or a statistician who is familiar with genetic data.
4.6 Information Management
A typical genetic diversity study will generate large amounts of data. Critical data sets include the loca-
tions and dates of sampling, the number of individuals of each species that are sampled per site, and the
genotype of each individual at each of the molecular markers analyzed. These data sets must be related
to other databases that may exist, including phenotypic data (age, size, developmental abnormalities)
biomarker data, chemical and physical habitat data, assemblage indicator data, and landscape-level data.
This suggests that use of relational database software will be useful to manage the data, particularly if
this software has already been incorporated to manage linked databases. However, simpler database
management tools, including spreadsheets and simple database software, can be appropriate for less
complex datasets.
Large numbers of tissue samples and DNA extractions will be collected that must be archived for valida-
tion purposes and to aid future retrospective assessments. Minimally, a database is needed that docu-
ments for each sample a unique sample ID, a population ID, the storage location of the tissue sample,
the storage location of the DNA sample, dates of collection, DNA extraction, DNA quantification, as
well as amounts and quality assessment of the archived material. If voucher specimens were collected
along with the genetic samples then the database should include this link as well. Field data collection
55
-------
Genetic Diversity as an Indicator of Ecosystem Condition and Sustain
I I I
forms and forms that document dates of laboratory manipulations and spatial orientations of samples
during those manipulations (DNA extraction, PCR, electrophoresis, marker scoring) will need to be
physically archived.
Metadata requirements include documentation of field collection procedures, as well as detailed labora-
tory methods (see appendices) and data analysis procedures. The procedure for translating genotypic
data for an array of genetic markers into a data file must be explained, including descriptors of each of
the fields in the data file. Similarly, documentation of the rationale and procedures for the statistical
analyses, including software documentation, are needed.
4.7
Costs
Monetary costs of implementation of a genetic diversity indicator are incurred during sampling, labora-
tory analysis, and data analysis. Costs of field sampling for DNA analyses are similar to costs to collect
species assemblage and biomarker data, and will generally be shared with those indicators. McCormick
and Peck (2000) estimate the cost to field a contracted crew of 4 people at $1200 per site visited. They
estimate the cost of field equipment, including a backpack electrofishing unit, to be $3515 per crew,
with a 15% annual maintenance and depreciation rate. Sampling for allozyme analyses requires the use
of special storage containers and a regular supply of dry ice or liquid nitrogen. This may add an addi-
tional $50 per site in supply costs, plus approximately $400 in cold storage equipment. One possible
consequence of the necessity for cold storage is that the crew may not be able to remain at remote sites
Figure 4-7. DNA quantitation is performed using a commercially available fluorescent nucleic acid stain that is
detected with a fluorescent scanner.
56
-------
r
Diversity as an Indicator of Ecosystem Condition and Sustainability
Figure 4-8. Thermal cyclers are used for fragment amplification (RAPD, AFLP, microsatellites) and DMA sequencing
assays.
Figure 4-9. A capillary-based, auto-loading genetic analyzer can perform both fragment analysis (AFLP, microsatel-
lite) and DMA sequencing.
57
-------
Genetic Diversity as an
Ill I I
Table 4-2. List of standard and specialized equipment for different types of genetic marker studies. * optimal but not
necessary.
All laboratories
1. Freezer-samples
2. Freezer- chemicals
3. Refrigerator
4. UV transilluminator
5. Ultrapure water source
6. Pipetters
7. Variable speed centrifuges
8. Microcomputer with statistical genetics analysis software
Standard Equipment - DNA laboratories
1. Separate sample preparation room/area (DNA extraction)
2. PCR room/area
3. Post-PCR room/area with photodocumentation and/or fragment analysis equipment
4. thermal cyclers
5. Agarose gel electrophoresis rigs
6. Microcentrifuges (10,000 RPM)
Specialized equipment- allozymes
1. -80 freezers
2. dry ice or liquid nitrogen and canisters
3. Starch gel electrophoresis rigs
4. Power supplies (to 350 V, 150mA)
5. Incubator oven.
6. Chiller (for cooling starch gels during run)*
Specialized Equipment - microsatellites
Microsatellite Development
1. Hybridization oven
2. Incubator oven
3. Automated DNA sequencer
4. Shaking incubator
5. Fluorescence detection system (see microsatellite screening equipment)
Microsatellite Screening
1. Fluorescence detection system, either
(a) Acrylamide gel electrophoresis rigs, 1000 V power supply, fluorescence scanner, fragment
analysis software, microcomputer
(b) Automated DNA sequencer with fragment analysis software
Specialized Equipment - RAPD
1. Power supplies (to 350 V, 150 mA)
2. Specialized agarose gel electrophoresis rigs for recirculating buffer
3. Chiller unit (to 0° C) for cooling agarose gels during run*
4. Microcomputer with fragment analysis software
Specialized Equipment - AFLP
1. Fluorescence detection system, either
(a) acrylamide gel electrophoresis rigs, 1000 V power supply, fluorescence scanner, or fragment
analysis software, microcomputer
(b) Automated DNA sequencer with fragment analysis software
58
-------
Condition and Sustainability
long before samples must be shipped to the laboratory, which may increase sampling costs.
Laboratory costs are more variable, and depend greatly on both the type of molecular marker assessed
and the technical skills of laboratory personnel. All estimates given here are provided with the caveat
that they are likely to change rapidly. As with field sampling, labor is the greatest cost. Based on
review of the EPA's pilot study that utilized the RAPD fingerprinting technique, it is estimated that a
crew of four laboratory technicians can extract DNA from 95 samples, perform nine RAPD PCR assays
per sample, electrophorese each PCR product individually, and size each of the fingerprint bands in a
period of approximately 9 days. Assuming atypical sample size of 50-100 individuals per site, this indi-
cates that a crew of four will require between one and two weeks to assay a single site. Supply costs,
including enzymes, agarose, plastics and chemicals, are estimated at $9 per sample for nine RAPD
assays. Equipment costs included two fluorimagers and associated computer hardware and software
($70,000 each) ten agarose gel units ($4000), chiller ($3000), and many smaller items (pipetters, stirrers,
centrifuges, incubators, approximately $10,000). We assume an average depreciation of approximately
20%. The fluorimagers could reasonably be replaced with cheaper photodocumentation systems for less
than $20,000.
Guidance on costs of other types of molecular marker studies can be obtained from estimates by com-
mercial laboratories. It is assumed that this is a maximum cost estimate because a profit margin is built
in, but depreciation costs are built into the estimates and these labs may experience higher efficiency
than other labs. Allozyme electrophoresis typically costs between $10 and $40 per sample, with
between 5 and 15 polymorphic loci (and often many more monomorphic loci) assayed. Costs to set up
an allozyme laboratory fall in the range of $10,000 to $20,000. DNA sequencing typically costs $25-
$40 per sample, for a sequence of up to 500 bp. It is often recommended that sequences be generated
for both the upper and lower DNA strands, doubling the cost. This cost does not include the cost of
DNA extraction and PCR amplification of the target locus, approximately $2 per sample. Automated
DNA sequencers vary greatly in price and quality, but useful models can be obtained for between
$70,000 and $130,000. Microsatellites and AFLP fingerprints that are assayed on automated DNA
sequencer cost between $2 and $5 per sample run. For microsatellites, a single automated run may
include between one and 8 distinct PCR reactions, increasing the efficiency of the assay. Costs of DNA
extraction ($2) are not included.
Costs of marker development range from essentially zero for RAPD, AFLP, and allozymes to several
thousand dollars for microsatellites. At least one commercial laboratory will guarantee production often
usable microsatellite markers for any species for a cost of $20,000.
4.8 Summary Recommendations
Among the myriad different ecological indicators available for environmental assessments, the optimal
niche of a genetic diversity indicator is to map patterns of population structure and to identify cumula-
tive genetic changes in populations through spatial and/or temporal comparisons. Like other indicators
of ecological condition, its value will be greatly enhanced if it is interpreted as part of a multi-indicator
assessment. By its nature, genetic diversity is a generalized indicator of long-term changes in popula-
tions; it will be difficult to assign causation to any specific stressor. There are exceptions to this rule.
For example, assays for specific allozyme genotypes may be developed as diagnostic indicators of par-
ticular classes of stressors (e.g., heavy metals) and, as more is learned about functional consequences of
nucleotide variation at specific genes, DNA-level diagnostic indicators also may be developed.
Presently, however, a genetic diversity indicator will be most useful as an integrative indicator of genetic
effects imposed by multiple stressors. This suggests that the genetic diversity indicator will prove most
59
-------
Genetic Diversity as an
useful as one component of a multi-indicator regional or index-site assessment.
Ill I I
Although the point can be debated, of the currently available technology, microsatellite analysis, perhaps
combined with mitochondrial DNA analysis, is likely to provide the most useful information per unit
effort for both regional genetic diversity assessments and temporal genetic diversity monitoring. The
US EPA's Molecular Ecology Research Branch employs this methodology for most current and planned
assessments. Unfortunately, this strategy also requires relatively advanced instrumentation and techni-
cal expertise compared to other strategies. The EPA's ECBP pilot genetic diversity study (section 3.1)
was conceived and implemented to use RAPD fingerprinting precisely because of concerns about the
ability to transfer genetic diversity indicator technologies to end-users. Because RAPD fingerprinting is
technically simple, it was assumed that it would have the greatest prospects for technology transfer.
However, during the course of the pilot study it became clear that the trade-off that comes with ease of
implementation is that of extreme sensitivity to minor variations in laboratory techniques. Similar con-
cerns are echoed in the scientific literature (see Perez et al., 1997). Concerns about repeatability
between different laboratories appear to be great enough to negate any perceived technology transfer
advantages. Current work with AFLP fingerprints indicates much less concern with repeatability, but
technology transfer is considerably more difficult than for RAPD analysis. Transfer of allozyme tech-
nology should not be difficult, but the needs for lethal sampling and methods to transport samples at
ultra-cold temperatures limit its general application.
While complete protocols for development and analysis of microsatellite markers may be difficult to
transfer to environmental labs, certain aspects of the analysis are relatively straightforward. As men-
tioned, methods for extracting DNA from animal tissues have been commercialized and are now sold by
several vendors as kits. The quality and quantity of DNA extracted using commercial kits is typically
very high. In addition, improvements in thermal cycler technology, together with packaging of PCR
reagents as standard assay kits have greatly eased technology transfer of PCR. Several vendors now sell
test kits for genotyping domestic animals and humans with fluorescent microsatellite DNA markers.
This suggests that regional field labs could accomplish major parts of the genetic diversity analysis
while one or two "core" molecular biology labs handle other aspects. A model for this approach is
shown in Figure 4-10. The regional lab, perhaps with some advice from a genetic analysis lab, would
design the assessment, which presumably will be performed in conjunction with other ecological indica-
tors. A marker development laboratory can then be employed to develop a panel of markers that are
appropriate to the assessment. The regional lab can then collect samples and prepare DNA for analysis.
The actual PCR reactions can be performed by the regional lab or by the genetic analysis lab. The
genetic analysis lab performs the genetic analysis and derives the genetic diversity interpretation. The
regional and genetic analysis labs can then assess the ecological significance of the genetic diversity
indicator data.
Under this scenario, the marker development laboratory could be a commercial laboratory or a laborato-
ry internal to the EPA. The EPA currently has the required expertise for such a laboratory in the
Molecular Ecology Research Branch of NERL. A laboratory internal to the EPA, possibly the same lab
that is used for marker development, or a laboratory under contract to the EPA could perform the genetic
analysis.
60
-------
I~TT
Genetic Diversity as an Indicator of Ecosystem Condition and Sustainability
Regional Laboratory
Marker Development Laboratory
Design Assessment
Field Sampling
DNA extraction
(PCR?)
• Develop and test
microsatellites, other
markers
• (PCR)
• Marker screening
• Genetic Diversity
assessment
Ecological
interpretation
Genetic Analysis Laboratory
Figure 4-10. A model showing how three different labs, the regional field lab, a genetic analysis lab, and a marker
development lab could interact to apply a genetic diversity indicator such as microsatellites or mitochondrial DNA
sequences to an environmental assessment.
61
-------
Genetic Diversity as an
Glossary
Ill I I
Adaptation: (noun) A genetically determined trait that enhances the ability of an organism to survive
and reproduce in its environment; (verb) the evolutionary process by which a population undergoes pro-
gressive genetic modification to increase its ability to survive and reproduce in a given environment.
Allele: one of several alternative forms of a gene that differs from other forms by a mutation in the
DNA sequence.
Codominance: the condition where both alleles at a locus are expressed and influence the observed
properties of the heterozygote.
Diploid: An organism that normally has two sets of chromosomes, and thus two copies of each gene.
Dominance: the condition in which one allele (the dominant allele) masks the expression of the other
(the recessive allele), so that heterozygotes cannot be distinguished from individuals that are homozy-
gous for the dominant allele.
Effective population size: The size of an "ideal population" (with a mathematically simplified breeding
structure) that loses genetic diversity at a rate equivalent to the population under study. The effective
population size is typically smaller than the census size because of differences in the breeding success of
individuals, skewed sex ratios, variation in population size, and other factors. The effective population
size provides a measure of how fast genetic diversity is being lost from the population under study.
Electrophoresis: The separation of DNA, RNA, or proteins by differential migration through a matrix
in the presence of electric current. Electrophoretic mobility is determined by differences among mole-
cules in size, charge, or shape.
Fitness: The genetic contribution by an individual's descendants to future generations of a population.
Individuals with greater fitness have genotypes that are better matched to the environment than those
with lower fitness.
Gene flow: The exchange of genes between generation via migration of individuals that will eventually
breed.
Genetic diversity: Variation among individuals for some heritable trait. Intraspecific genetic diversity
may be partitioned into at least two components: genetic diversity within populations and genetic diver-
sity among populations.
Genetic drift: Stochastic change in allele frequency due to random variations in the contributions of
breeding adults to the next generation. Since one consequence of genetic drift is that one or more alleles
will not be passed to the next generation, genetic drift results in a decrease in genetic diversity.
Genetic marker: an easily discerned attribute or probe that indicates the genotype of an individual at a
locus. Examples include molecular markers (usually DNA-based), cytological markers, and morpholog-
ical markers
Genome: The complete set of genetic information contained within an individual
62
-------
Condition and Sustainability
Genotype: The genetic make-up of an organism, typically with respect to one or a few genes of interest,
as distinguished from its appearance or phenotype.
Haplotype: A mitochondrial DNA genotype. One of several forms of the circular mitochondrial DNA
molecule that differs from other forms due to mutations in the DNA sequence.
Heterozygote: For diploid organisms, an individual whose cells contain two forms (alleles) of a gene,
one derived from each parent.
Heterozygosity: The proportion of individuals in a population that are heterozygous (possess more than
one allele) at a locus
Inbreeding: Mating among related individuals.
Inbreeding depression: The deterioration in a population's fitness and other traits due to a reduction in
genetic diversity within populations.
Loci: Plural of locus.
Locus: A site on a chromosome where a gene or other measurable variation resides. The DNA at this
chromosomal site may or may not have any functional significance to the organism.
Mitochondria: self-replicating organelles found in the cytoplasm of all eukaryotic cells that produce
energy via oxidative phosphorylation.
Molecular marker: An easily discerned measure of the genotype of an individual at a locus based on
molecular biological methods. Molecular markers are usually based on attributes of DNA, but may be
based on RNA or protein.
Monomorphic: State in which virtually all individuals have the same genotype at a locus. It is a prop-
erty of a locus in a population.
Neutral marker: A genetic marker derived from a locus that has a negligible effect on the ability of the
organism to survive and reproduce.
Outbreeding depression: The deterioration in a population's fitness or other traits due to immigration of
individuals from other populations that disrupts local adaptations.
Phenotype: The observable structural and functional characteristics of an organism that result from the
interaction of the genotype with the environment; the outward appearance of the organism.
Philoptatry: The tendency of species or groups to remain in or habitually return to their native regions
or territories. Organisms that are highly philopatric tend to stay in a defined geographic location.
Phylogeography: Reconstruction of biogeographic relationships, usually among populations within a
species, by identifying the phylogenetic relationship between a genetic trait or set of traits in relation to
geography.
63
-------
Genetic Diversity as an
Ill I I
Polymerase chain reaction (PCR): A process for the exponential amplification of a specific region of
DNA using DNA primers that flank the region of interest and a DNA polymerase to catalyze the reac-
tion.
Polymorphic: State in which individuals in a population have more than one genotype at a locus. It is a
property of a locus in a population.
Polymorphism: One of several forms of a genetic characteristic at a locus in a population. Also, the
presence in a population of two or more relatively common forms of a gene, chromosome, or genetically
determined trait.
Population: A group of conspecific organisms that occupy a more or less well defined geographic
region and exhibit reproductive continuity from generation to generation; it is generally presumed that
ecological and reproductive interactions are more frequent among these individuals than between them
and the members of other populations of the same species.
Recessive: An allele whose properties are not observed because they are masked by the expression of a
dominant allele. Thus, it is only expressed in homozygotes.
Trait: Any aspect of the appearance, behavior, development, biochemistry, or other feature of an organ-
ism.
64
-------
Condition and Sustainability
References
Allendorf, F.W., and Leary, R.F. (1986) Heterozygosity and fitness in natural populations of
animals. In Soule (ed.) Conservation Biology: The Science of Scarcity and Diversity.
Sinauer. Sunderland, MA. Pp. 57-76.
Avise, J.C. (1994) Molecular Markers, Natural History and Evolution. Chapman and Hall,
New York.
Ayres, D.R., Garcia-Rossi, D., Davis, H.G., and Strong, D.R. (1999) Extent and degree of
hybridization between exotic (Spartina altemiflord) and native (S. foliosd) cordgrass (Poaceae)
in California, USA determined by random amplified polymorphic DNA (RAPDs).
Molecular Ecology, 8, 1179-1186.
Baverstock, P.R., and Moritz, C. (1996). Project design. In: Hillis, D. M., Moritz, C. Mable,
B.K. (eds) Molecular Systematics. Second Edition. Sinauer Associates, Sunderland MA.
Pp. 17-27.
Beardmore, J.A., Barker, C.J., Battaglia, B., Berry, R.J., Crosby Longwell, A., Payne, J.F.,
and Rosenfield, A. (1980) The use of genetical approaches to monitoring biological effects
of pollution. Rapports et Proces-Verbaux des Reunions, 179, 299-305.
Ben-Shlomo, R. and Nevo, E. (1988) Isozyme polymorphism as monitoring of marine environments:
the interactive effect of cadmium and mercury pollution on the shrimp, Palaemon elegans.
Marine Pollution Bulletin, 19, 314-317.
Benton, M.J. and Guttman, S.I. (1992a) Allozyme genotype and differential resistance to mercury
pollution in the caddisfly, Nectopsyche albida \. Single-locus genotypes. Canadian Journal
of Fisheries and Aquatic Sciences, 49, 142-146.
Benton, M.J. and Guttman, S.I. (1992b) Allozyme genotype and differential resistance to mercury
pollution in the caddisfly, Nectopsyche albida II. Multilocus genotypes. Canadian Journal
of Fisheries and Aquatic Sciences, 49, 147-149.
Benton, M.J., Diamond, S.A., and Guttman, S.I. (1994) A genetic and morphometric comparison
oftfelisoma trivolvis and Gambusia holbrooki from clean and contaminated habitats.
Ecotoxicology and Environmental Safety, 29, 20-37.
Bickham, J.W., Sandhu, S., Hebert, P.D.N., Chikhi, L., and Athwal, R. (2000) Effects of chemical
contaminants on genetic diversity in natural populations: implications for biomonitoring and
ecotoxicology. Mutation Research, 463, 33-51.
Bouzat, J.L., Cheng, H.H., Lewin, H.A., Westemeier, R.L., Brawn, J.D., and Paige, K.N. (1998)
Genetic evaluation of a genetic bottleneck in the greater prarie chicken. Conservation Biology,
12, 836-843.
Brown Sullivan, K., and Lydy, M.J. (1999) Differences in survival functions of mosquitofish
(Gambusia affinis) and sand shiner (Notropis ludibundus) genotypes exposed to pesticides.
Environmental Toxicology and Chemistry, 18, 906-911.
Cabe, P.R. (1998) The effects of founding bottlenecks on genetic variation in the European
starling (Sturnus vulgaris) in North America. Heredity, 80, 519-525.
Caetano-Anolles, G and Gresshoff, P.M. (eds) (1997) DNA Markers: Protocols, Applications,
and Overviews. Wiley-Liss, Inc., New York.
Campton, D.E., and Johnston, J.M. (1985) Electrophoretic evidence for a genetic admixture of
native and nonnative rainbow trout in Yakima River, Washington. Transactions of the
American Fisheries Society, 114, 782-793.
Carter, PA., and Watt, W.B. (1988) Adaptation at specific loci. V. Metabolically adjacent enzyme
loci may have very distinct experiences of selective pressures. Genetics,\\9, 913-924.
Carvalho, GR. (1988) Molecular ecology: origins and approaches. In: Carvalho, GR. (ed.) Advances in
Molecular Ecology, IOS, Amsterdam. Pp. 1-24.
65
-------
Genetic Diversity as an
Ill I I
Chagnon, N.L. and Guttman, S.I. (1989) Differential survivorship of allozyme genotypes in
mosquitofish populations exposed to copper or cadmium. Environmental Toxicology
and Chemistry, 8, 319-326.
Cole, B.J., and Wiernasz, B.C. (1999) The selective advantage of low relatedness. Science,
285, 891-893.
Coltman, D.W., Bowen, W.D., Wright, J.M. (1998) Birth weight and neonatal survival of harbour
seal pups are positively correlated with genetic variation measured by microsatellites.
Proceedings of the Royal Society of London B, 265, 489-495.
Diamond, S.A., Newman, M.C., Mulvey, M., Dixon, P. M., and Martinson, D. (1989) Allozyme
genotype and time to death of mosquitofish, Gambusia affinis (Baird and Girard), during
acute exposure to inorganic mercury. Environmental Toxicology and Chemistry, 8, 613-622.
Duan, Y., Guttman, S.I., Oris, J.T., and Bailer, J.A. (2000a) Genotype and toxicity relationships among
Hyalella azteca: \. Acute exposure to metals or low pH. Environmental Toxicology and
Chemistry, 19, 1414-1421.
Duan, Y., Guttman, S.I., Oris, J.T., Huang, X., and Burton, GA. (2000b) Genotype and toxicity
relationships among Hyalella azteca: II. Acute exposure to fluoranthene-contaminated sediment.
Environmental Toxicology and Chemistry, 19, 1422-1426.
Ellegren, H., Lindgren, G, Primmer, C.R., and Moller, A.P. (1997) Fitness loss and germline
mutations in barn swallows breeding in Chernobyl. Nature, 389, 593-596.
Excoffier, L., Smouse, P.E., and Quattro, J.M. (1992) Analysis of molecular variance inferred from
metric distances among DNA haplotypes: Applications to human mitochondrial DNA
restriction data. Genetics, 131, 479-491.
Ford, T., Sorci, J., Ika, R., and Shine, J. (1998) Interactions between metals and microbial
communities in New Bedford Harbor, Massachusetts. Environmental Health Perspectives,
106 (Supplement 4),1033-1039
Fore, S.A., Guttman, S.I, Bailer, A.J., Altfeter, D.J., and Counts, B.V. (1995a) Potential use of
genetic diversity as an indicator of water quality. I. Pimephales notatus. Ecotoxicology
and Environmental Safety, 30, 24-35.
Fore, S.A., Guttman, S.I, Bailer, A.J., Altfeter, D.J., and Counts, B.V. (1995b) Potential use of
genetic diversity as an indicator of water quality. II. Campostoma anomalum. Ecotoxicology
and Environmental Safety, 30, 36-46.
Frankham, R. (1995) Inbreeding and extinction: A threshold effect. Conservation Biology, 9, 792-799.
Franklin, A.R. (1980) Evolutionary change in small populations. In Soule, M.E., and Wilcox,
B.A. (eds.) Conservation Biology: An Evolutionary-Ecological Perspective. Sinauer,
Sunderland, MA. Pp 135-149.
Gerlach, G. and Musolf, K. (2000) Fragmentation of landscape as a cause for genetic subdivision
in bank voles. Conservation Biology, 14, 1066-1074.
Gillespie, R.B. and Guttman, S.I. (1989) Effects of contaminants on the frequencies of allozymes
in populations of the central stoneroller. Environmental Toxicology and Chemistry, 8, 309-317.
Gillespie, R.B. and Guttman, S.I. (1993) Correlations between water quality and frequencies of
allozyme genotypes in spotfin shiner (Notropis spilopteris} populations. Environmental
Pollution, 81, 147-150.
Gillespie, R.B., and Guttman, S.I. (1999) Chemical-induced changes in the genetic structure of
populations: effects on allozymes. In: Forbes, V.E. (ed.) Genetics and Ecotoxicology. Taylor
and Francis, Philadelphia. Pp. 55-77.
Gilpin, M.E. and Soule, M.E. (1986) Minimum viable populations: processes of species extinction. In
Soule (ed.) Conservation Biology: The Science of Scarcity and Diversity. Sinauer. Sunderland,
MA. Pp 19-34.
66
-------
Condition and Sustainability
Gyllensten, U.R., Leary, R.F., Allendorf, F.W., and Wilson, A.C. (1985) Introgression between two
cutthroat trout species with substantial karyotypic, nuclear, and mitochondrial genomic
divergence. Genetics, 111, 905-915.
Haig, S.M., Bowman, R. and Mullins, T.D. (1996) Population structure of red-cockaded woodpeckers
in south Florida; RAPDs revisited. Molecular Ecology, 5, 725-734.
Hartl, D.L. and Clark, A.G. (1997) Principles of Population Genetics. 3rd edition. Sinauer,
Sunderland, MA.
Hessel, D.L. (1992) Genetic resource program strategic plan. U.S. Forest Service, Timber
Management. Washington, B.C.
Heagler, M.G, Newman, M.C. , Mulvey, M., and Dixon, P.M (1993) Allozyme genotype in
mosquitofish, Gambusia holbrooki, during mercury exposure. Temporal stability, concentration
effects and field verification. Environmental Toxicology and Chemistry, 12, 385-395.
Heithaus, M.R., and Laushman, R.H. (1997) Genetic variation and conservation of stream fishes:
influence of ecology, life history, and water quality. Canadian Journal of Fisheries and
Aquatic Sciences, 54, 1822-1836.
Hillis, D. M., Moritz, C. Mable, B.K. (1996) Molecular Systematics. Second Edition.
Sinauer Associates, Sunderland MA.
Hoelzel, A.R. (1999) Impact of population bottlenecks on genetic variation and the importance of
life-history; A case study of the northern elephant seal. Biological Journal of the Linnean
Society, 68, 23-39.
Hoelzel, A. R., Fleischer, R. C., Campagna, C., Le Boeuf, B. J., and Alvord, G (2002). Impact of
a population bottleneck on symmetry and genetic diversity in the northern elephant seal.
J. Evol. Biol. 15, 567-575.
Hughes, J.M., Harrison, D.A., and Arthur, J.M. (1991) Genetic variation at the PGI locus in the
mosquitofish Gambusia affinis (Poeclidae) and possible effect on susceptibility to an
insecticide. Biological Journal of the Linnean Society, 44, 153-167.
Jorde, P.E., and Ryman, N. (1996) Demographic genetics of brown trout (Salmo trutta) and estimation
of effective population size from temporal change of allele frequencies. Genetics, 143,
1369-1381.
Keklak, M.M., Newman, M.C., and Mulvey, M. (1994) Enhanced uranium tolerance of an exposed
population of eastern mosquitofish (Gambusia holbrooki Girard 1859). Archives of
Environmental Contamination and Toxicology, 27, 20-24.
Knutsen, H., Rukke, B.A., Jorde, P.E., and Ims, R.A. (2000) Genetic differentiation among
populations of the beetle Bolitophagus reticulatus (Coleoptera : Tenebrionidae) in a
fragmented and a continuous landscape. Heredity, 84, 667-676.
Kopp, R.L., Guttman, S.I., and Wissing, T.E. (1992) Genetic indicators of environmental stress in
central mudminnow (Umbra limi) populations exposed to acid deposition in the
Adirondack Mountains. Environmental Toxicology and Chemistry, 11, 665-676.
Krane, D.E., Sternberg, B.C., and Burton, GA. (1999) Randomly amplified polymorphic DNA
profile-based measures of genetic diversity in crayfish correlated with environmental impacts.
Environmental Toxicology and Chemistry, 18, 504-508.
Lande, R. (1995) Mutation and conservation. Conservation Biology, 9, 782-791.
Lande, R. (1999) Extinction risks from anthropogenic, ecological, and genetic factors. In L.F.
Landweber and A.P. Dobson (eds.) Genetics and the Extinction of Species. Princeton University
Press. Princeton, NJ. Pp 1-22.
Lande, R. and Shannon, S. (1996) The role of genetic variability in adaptation and population
persistence in a changing environment. Evolution, 50, 434-437.
Lavie, B., and Nevo, E. (1982) Heavy metal selection f phosphoglucose isomerase allozymes in
marine gastropods. Marine Biology, 71, 17-22.
67
-------
Genetic Diversity as an
Ill I I
Lavie, B. and Nevo, E. (1986a) Genetic selection of homozygote allozyme genotypes in marine
gastropods exposed to cadmium pollution. The Science of the Total Environment, 57, 91-98.
Lavie, B. and Nevo, E. (1986b) The interactive effects of cadmium and mercury pollution on
allozyme polymorphisms in the marine gastropod Cerithium scabridum. Marine Pollution
Bulletin, 17, 21-23.
Larno, V., Laroche, J., Launey, S., Flammarion, P., and Devaux, A. (2001). Responses of
Chub (Leuciscus cephalus) populations to chemical stress, assessed by genetic markers,
DNA damage and Cytochrome P4501A induction. Ecotoxicology. 10, 145-158.
Leary, R.F., Allendorf, F.W., and Sage, O.K. (1995) Hybridization and introgression between introduced
and native fish. In (Schramm, H.L. Jr. and Piper, R.G.) Uses and Effects of Cultured Fishes in
Aquatic Ecosystems. American Fisheries Society Symposium 15, Bethesda, MD.
Leberg, P.L. (1992) Effects of population bottlenecks on genetic diversity as measured by
allozyme electrophoresis. Evolution, 46, 447-494.
Leonard, A.C., Franson, S.E., Hertzberg, VS., Smith, M.K., and Toth, G.P. (1999) Hypothesis
testing with the similarity index. Molecular Ecology, 8, 2105-2114.
Lynch, M. (1990) The similarity index and DNA fingerprinting. Molecular Biology and Evolution,
75 478-484.
Lynch, M. (1991) Analysis of population genetic structure by DNA fingerprinting. In Burke, T, Dolf,
G., Jeffreys, A.J., and Wolff, R. (eds.) DNA Fingerprinting: Approaches and Applications.
Birkhauser Verlag, Boston.
Lynch, M. (1996) A quantitative-genetic perspective on conservation issues. In Avise, J.C. (ed.),
Conservation Genetics: Case Histories from Nature, Chapman Hall, New York. Pp 471-501.
Ma, X. L., Cowles, D. L., and Carter, R. L. (2000). Effect of pollution on genetic diversity in the
bay mussel Mytilus galloprovincialis and the acorn barnacle Balanus glandula. Marine
Environmental Research. 50, 559-563.
Mariette, S., Lecorre, V. and Kremer, A. (1999) Sampling within the genome for measuring
within-population diversity: trade-offs between markers. In Molecular Tools for Biodiversity.
Compendium of the Research Project: Development, optimization and validation of molecular
tools for assessment of biodiversity in forest trees (EM Gillet,ed.) European Union DGXII
Biotechnology FW IV Research Programme.
May, B. (1992) Starch gel electrophoresis of allozymes. In: Hoelzel, A.R. (ed.) Molecular Genetic
Analysis of Populations: A Practical Approach. 2nd Ed. Oxford University Press. New York.
Pp. 1-28.
McCormick, F.H., and Peck, D.V. (2000) Application of the indicator evaluation guidelines to a
multimetric indicator of ecological condition based on stream fish assemblages. Chapter 4 in:
U.S. Environmental Protection Agency. Evaluation Guidelines for Ecological Indicators. Office
of Research and Development. EPA/620/R-99/005. U.S. Environmental Protection Agency,
Washington, D.C.
Messer, J.J., Linthurst, R.A., and Overton, W.S. (1991) An EPA program for monitoring ecological
status and trends. Environmental Monitoring and Assessment, 17, 67-78.
Moraga, D., Mdelgi-Lasram, E., Romdhane, M. S., EL Abed, A., Boutet, I., Tanguy, A., and Auffret,
M. (2002). Genetic responses to metal contamination in two clams: Ruditapes decussates
and Ruditapesphilippinarum. Marine Environmental Research. 54, 521-525.
Morden, C.W. and Loeffler, W. (1999) Fragmentation and genetic differentiation among subpopulations
of the endangered Hawaiian mint Haplostachys haplostachya (Lamiaceae). Molecular Ecology,
8, 617-625.
Moritz, C. (1994) Defining 'evolutionarily significant units' for conservation. Trends in Ecology
and Evolution, 9, 373-375.
68
-------
Condition and Sustainability
Moritz, C. (1995) Uses of molecular phytogenies for conservation. Philosophical Transactions of
the Royal Society of London, Series B, 349, 113-118.
Moritz, C. and Faith, D.P. (1998) Comparative phylogeography and the identification of genetically
divergent areas for conservation. Molecular Ecology, 7, 419-429.
Mulvey, M., Newman, M.C., Chazal, A., Kelak, M.M., Heagler, M.G., and Hales, L.S. Jr. (1995) Genetic
and demographic responses of mosquitofish (Gambusia holbrooki Girard 1859) populations
stressed by mercury. Environmental Toxicology and Chemistry, 14, 1411-1418.
Murdoch, M.H., and Hebert, P.D.N. (1994) Mitochondrial DNA diversity of brown bullhead from
contaminated and relatively pristine sites in the Great Lakes. Environmental Toxicology and
Chemistry, 13, 1281-1289.
Murphy, R.W., Sites, J.W. Jr., Buth, D.G, and Haufler, C.H. (1996). Proteins: Isozyme electrophoresis.
In: Hillis, D. M., Moritz, C. Mable, B.K. (eds) Molecular Systematics. Second Edition. Sinauer
Associates, Sunderland MA. Pp. 51-120.
Nadig, S.G, Lee, K.L., and Adams, S.M. (1998) Evaluating alterations of genetic diversity in sunfish
populations exposed to contaminants using RAPD assay. Aquatic Toxicology, 43, 163-178.
Nehlsen, W., Williams, J.E., and Lichatowich, J.A. (1991) Pacific salmon at the crossroads: Stocks at
risk from California, Oregon, Idaho, and Washington. Fisheries, 16, 4-21.
Nei, M. (1972) Genetic distance between populations. American Naturalist, 106, 283-292.
Nei, M. (1987) Molecular Evolutionary Genetics. Columbia University Press, New York, NY, USA.
Nevo, E., Ben-Shlomo, R. and Lavie, B. (1984) Mercury selection of allozymes in marine organisms:
Predictions and verification in nature. Proceedings of the National Academy of Sciences USA,
81, 1258-1259.
Nevo, E., Lavie, B., and Ben-Shlomo,R. (1988) Population genetic structure of marine organisms
as detectors and monitors of marine pollution. FAO/UNEP Scientific Reports, 24, 37-49.
Newman, M.C., Diamond, S.A., Mulvey, M., and Dixon, P. (1989) Allozyme genotype and time
to death of mosquitofish, Gambusia afftnis (Baird and Girard), during acute toxicant exposure:
a comparison of arsenate and inorganic mercury. Aquatic Toxicology, 15, 141-156.
Omernick, J.M. (1995) Ecoregions: A framework for environmental management. In Davis, W. and
Simon, T. (eds.) Biological Assessment and Criteria: Tools for Water Resource Planning and
Decision Making. Lewis Publishers, Boca Raton, FL. Pp 49-62.
Palumbi, S.R. (1996) Nucleic Acids II: The polymerase chain reaction. In: Hillis, D. M., Moritz,
C. Mable, B.K. (eds) Molecular Systematics. Second Edition. Sinauer Associates,
Sunderland MA. Pp. 205-247.
Perez, T., Albornoz, J, and Dominguez, A. (1998) An evaluation of RAPD fragment reproducibility
and nature. Molecular Ecology, 1, 1347-1357.
Quattro, J.M. and Vrijenhoek, R.C. (1989) Fitness differences among remnant populations of the
endangered Sonoran topminnow. Science, 245, 976-978.
Roark, S., and Brown, K. (1996) Effects of metal contamination from mine tailings on allozyme
distributions of populations of Great Plains fishes. Environmental Toxicology and Chemistry,
15, 921-927.
Roark, S.A., Andrews, J. F., and Guttman, S. I. (2001). Population genetic structure of the
western mosquitofish, Gambusia affinis, in a highly channelized portion of the San Antonio river
in San Antonio, TX. Ecotoxicology, 10, 223-227.
Rodriguez-Clark, K.M. (1999) Genetic theory and evidence supporting current practices in captive
breeding for conservation. In L.F. Landweber and A.P. Dobson (eds.) Genetics and the
Extinction of Species. Princeton University Press. Princeton, NJ. Pp 47-74.
Rolan-Alverez, E., Zapata, C., and Alvarez, G (1995) Multilocus heterozygosity and sexual selection
in a natural population of the marine snail Littorina mariae (Gastropoda: Prosobranchia).
Heredity, 75, 17.25.
69
-------
Genetic Diversity as an
Ill I I
Roy, N.K., Konkle, B., Wirgin, I. (1996) Characterization of CYP1A1 gene regulatory elements in
cancer-prone Atlantic tomcod. Pharmacogenetics, 6, 273-277.
Saccheri, I., Kuussaari, M., Kankare, M., Vikman, P., Fortelius, W., Hanki, I. (1998) Inbreeding
and extinction in a butterfly metapopulation. Nature, 392, 491-494.
Schlueter, M.A, Guttman, S.I., Oris, J.T., and Bailer, A.J. (1995) Survival of copper-exposed
juvenile fathead minnows (Pimephales promelas) differs among allozyme genotypes.
Environmental Toxicology and Chemistry, 14, 1727-1734.
Schlueter, M.A, Guttman, S.I., Oris, J.T., and Bailer, A.J. (1997) Differential survival of fathead
minnows, Pimephales promelas, as affected by copper exposure, prior population stress,
and allozyme genotypes. Environmental Toxicology and Chemistry, 16, 939-947.
Schlueter, M.A, Guttman, S.I., Duan, Y, Oris, J.T., Huang, X., and Burton, G A. (2000) Effects of
acute exposure to fluoranthene-contaminated sediment on the survival and genetic variability
of fathead minnows (Pimephales promelas). Environmental Toxicology and Chemistry, 19,
1011-1018.
Schneider, C.J., Cunningham, M., and Moritz, C. (1998) Comparative phylogeography and the
history of endemic vertebrates in the Wet Tropics rainforests of Australia. Molecular Ecology,
75 487-498.
Shaw, A.J. (1998) The evolution of heavy metal tolerance in plants: adaptation, limits, and costs.
In. Forbes, V.E. (ed.) Genetics and Ecotoxicology. Taylor and Francis, Philadelphia. Pp. 9-30.
Slate, J., Kruuk, L.E.B., Marshall, T.C., Pemberton, J.M., Clutton-Brock, T.H. (2000)
Inbreeding depression influences lifetime breeding success in a wild population of red deer
(Cervus elaphus), Proceedings of the Royal Society Biological Sciences, SeriesB, 267,
1657-1662.
Smith, M.W., Smith, M.H, and Chesser, R.K. (1983) Biochemical genetics of mosquitofish. I.
Environmental correlates, a temporal and spatial heterogeneity of allele frequencies within a
drainage. Copeia, 1, 182-193.
Soule, M.E. (1980) Thresholds for survival: Maintaining fitness and evolutionary potential. In M.E.
Soule and B.A. Wilcox (eds.), "Conservation Biology" An Evolutionary-Ecological Perspective.
Sinauer, Sunderland, MA. Pp. 151-169.
Soule, M.E. (1987) Viable Populations for Conservation. Cambridge University Press, Cambridge, U.K.
Street, G.T. and Montagna, PA. (1996) Loss of genetic diversity in harpacticoida near offshore
platforms. Marine Biology, 126, 271-282.
Tanguy, A., Fernandez-Castro, N., Marhic, A., and Moraga, D. (1999) Effects of an organic
pollutant (tributytin) on genetic structure in the Pacific Oyster Crassostrea gigas.
Marine Pollution Bulletin, 38, 550-559.
Tatara, C.P., Mulvey, M., and Newman, M.C. (1999) Genetic and demographic responses of
mosquitofish (Gambusia holbrooM) populations exposed to mercury for multiple generations.
Environmental Toxicology and Chemistry, 18, 2840-2845.
Theodorakis, C.W., and Shugart, L.R., (1997) Genetic Ecotoxicology :II. Population genetic structure
in radionuclide-contaminated mosquitofish {Gambusia affinis). Ecotoxicology, 6, 335-354.
Theodorakis, C.W., and Shugart, L.R., (1998) Natural selection in contaminated environments: A
case study using RAPD genotypes. In: Forbes, V.E. (ed.) Genetics and Ecotoxicology. Taylor
and Francis, Philadelphia. Pp. 123-149.
U.S. Environmental Protection Agency (1988) Future Risk: Research Strategies for The 1990s.
Science Advisory Board. SAB-EC-88-040. US Environmental Protection Agency.
Washington, B.C.
70
-------
Condition and Sustainability
U.S. Environmental Protection Agency: Klemm, D.J., and Lazorchak, J.M. (1993) Environmental
Monitoring and Assessment Program. Surface Waters and Region 3 Regional Environmental
Monitoring and Assessment Program. 1993 Pilot Field Operation and Methods Manual.
Streams, USEPA, ORD, NERL-Cincinnati, Washington, D.C.
U.S. Environmental Protection Agency (1997) An Ecological Assessment of the United States
Mid-Atlantic Region. EPA/600/R-97/130. U.S. Environmental Protection Agency,
Washington, D.C.
U.S. Environmental Protection Agency (1998) Ecological Research Strategy. Office of Research
and Development. EPA/600/R-98/086. U.S. Environmental Protection Agency,
Washington, D.C.
Vos , P., Rogers, R., Bleeker, M. Reijans, M., van der Lee, T., Homes, M., Firjter, A., Pot, J.,
Peleman, J., Kuiper, M., and Zabeau, M. (1995) AFLP: a new technique for DNA fingerprinting.
Nucleic Acids Research, 23, 4407-4414.
Vrijenhoek, R.C. (1994) Genetic diversity and fitness in small populations. In Loeschcke, V., Tomiuk, J.,
and Jain, S.K. (eds) Conservation Genetics. Birkhauser Verlag Basel. Switzerland. Pp.37-53.
Waples, R.S. (1990) Conservation of Pacific salmon. III. Estimating effective population size.
Journal of Heredity, 81, 267-276.
Welsh, J. and McClelland, M. (1990) Fingerprinting genomes using PCR with arbitrary primers.
Nucleic Acids Research, 18, 7213-7218.
Westemeier, R.L, Brawn, J.D., Simpson, S.A., Esker, T.L, Jansen, R.W., Walk, J.W., Kerchner,
E.L. (1998) Tracking the long-term decline and recovery of an isolated population.
Science, 282, 1695-1698.
Williams, J.G., Kubelik, A.R, Livak, K.J., Rafalski, J.A., and Tingey, S.V. (1990) DNA
polymorphisms amplified by arbitrary primers are useful as genetic markers. Nucleic Acids
Research, 18,6531-6535.
Williams, R.N., Shiozawa, O.K., Carter, J.E., and Leary, R.F. (1996) Genetic detection of
putative hybridization between native and introduced rainbow trout populations of the
upper Snake River. Transactions of the American Fisheries Society, 125, 387-401.
Wright, S. (1978) Evolution and the Genetics of Populations. Vol. 4: Variability Within and
Among Natural Populations. University of Chicago Press. Chicago.
Yauk, C.L., and Quinn, J.S. (1996) Multilocus DNA fingerprinting reveals high rate of heritable
genetic mutation in herring gulls nesting in an industrialized urban site. Proceedings of
the National Academy of Sciences, USA, 93, 12137-12141.
71
-------
Genetic Diversity as an
Ill I I
Acknowledgements
A number of individuals and organizations were instrumental in collecting data and contributing to this
document. Tom Wessendarp and Tammy Goyke (USEPA) provided valuable contributions as part of the
genetic diversity team. Field crews from DynCorp, including Dan Williams, Scott Jacobs, Joseph
Loucek, Steve Harmon, Jason Jannot and Dennis McMullen helped to collect fish samples. The Ohio
EPA providing environmental data on the Eastern Cornbelt Plains study as part of a joint Ohio EPA and
USEPA Region 5 REMAP assessment. Environmental data for the Mid-Atlantic Region was provided by
field crews participating in the EMAP MAIA project. Laboratory analysis and data management was
provided by Rich Silbiger, Manju Garg, Ana Braam, Jodie Walker, Malika Humphries, Jared Smith, Paul
Weaver, Richard Converse, Sandra Millward, Shawn Mills and Tony Leonard of PAI/SoBran. Susan
Cormier, Bhagya Subramanian, and Annette Roth (USEPA) provided additional laboratory and data
support. Graphics support was provided by Keith Adams of Computer Sciences Corporation. Frank
McCormick, Jim Lazorchak and Kate Smith provided helpful discussions and support of these projects.
72
-------
Condition and Sustainability
Appendix 1: Laboratory and analytical procedures for RAPD analysis
DNA Extraction. For DNA purification, a portion of the frozen tail section (approximately 25 mg) was
homogenized in a 1.5 ml microfuge tube containing 100|il PBSET (standard PBS, lOOmM EDTA, 0.1%
Triton X). 400|il of PBSET, 21 \A Proteinase K (20mg ml -1) and 30|il of 20% SDS was added to the
homogenate. The tube was mixed by gentle inversion and incubated overnight at 65 °C. Following
digestion, the preparation was centrifuged at 12,000 rpm for 10 minutes to remove particulate and undi-
gested material. The supernatant was removed to a clean microfuge tube, incubated for 10 minutes at
91 °C to inactivate the Proteinase K, then allowed to cool to room temperature. 10 |il of RNAse(A) (20
mg ml •') was added to the supernatant and incubated for 30 minutes at 42 °C. 1 ml of ProCipitate was
added to the tube, which was then mixed by gentle inversion on a rotator for 5 minutes and centrifuged
at 14,000 rpm for 15 minutes at room temperature. To facilitate the separation of the DNA-containing
supernatant and the semi-solid, protein-containing pellet, a dab of light PhaseLock gel (5'-3', Inc.) was
added to the inside of the microfuge cap and the sample was centrifuged for an additional 15 minutes at
14,000 rpm at room temperature. The supernatant was carefully transferred to a 2.0 ml microfuge tube,
after which an equal volume of chloroform-isoamyl alcohol (24:1) was addedto the supernatant and
mixed by gentle inversion on a rotator for 15 minutes. The aqueous fraction was transferred to a new
1.5 ml microfuge tube and the DNA was precipitated by adding 1 |il of glycogen (20 mg ml'1) and an
equal volume of ice cold 100% isopropanol, mixing well by inverting, and incubating at -20 °C for 2
hours. The precipitated DNA was pelleted by centrifuging for 15 minutes at 15,000 rpm at 0 °C. The
supernatant was removed and the DNA washed with ice cold 70% ethanol. The DNA was again pelleted
by centrifuging for 15 minutes at 15,000 rpm at 0 °C. The supernatant was removed and the DNA
washed again with ice cold 95% ethanol. The DNA was pelleted one last time by centrifuging for 15
minutes at 15,000 rpm at 0 °C. The supernatant was removed and the DNA pellet was allowed to air
dry for 10 minutes at room temperature. After drying, the pellet was resuspended in 200 |il of IX PCR
buffer (lOmM Tris-HCl, 50mM Kcl, pH 8.3), warmed for 15 minutes in a wet block at 65 °C, then incu-
bated overnight at 37 °C in ensure complete resuspension. The DNA stock concentration was estimated
by comparing the fluorescence of 1000-fold dilutions of the stock with known standards both prepared
with PicoGreen fluorescent dye (Molecular Probes, Inc.) and scanned using a Fluorlmager 595
(Molecular Dynamics) with 488 nm excitation and 530df30 bandpass filter. Stock DNA was diluted to a
working concentration of 1.25 ng jil'1 with IX PCR buffer (lOmM Tris-HCl, 50mM Kcl, pH 8.3).
PCR. RAPD analysis used was modified slightly from the original descriptions (Williams et a/., 1990;
Welsh and McClelland, 1990). A total reaction volume of 25 |il was prepared by combining a 5 |il
aliquot of the PCR master mix (2mM Tris- HC1 (pH 8.5), 10 mM KC1, 3mM MgCl2, 1 pmole RAPD
primer, 2.0 units of Native Tag polymerase (Gibco BRL), 0.05% acetamide, ImM dNTP mix) with a 20
|ll aliquot of diluted DNA (1.25 ng (il'1) in a 0.2 ml PCR tube (individual or strip) and mixing well. All
PCR prep was done on ice. All media preparation and sample aliquoting was done using aerosol resist-
ant pipet tips. Samples were amplified by 34 cycles of 45 seconds at 94 °C, 1 minute at 41 °C, 1 minute
at 72 °C on a RoboCycler with Hot Top (Stratagene). Three decamer oligonucleotides (primers) were
used to generate RAPD profiles; OPF-04 (5'-GGTGATCAGG-3'), OPL-02 (5'-TGGGCGTCAA-3'),
OPL-05 (5'-ACGCAGGCAC-3'). Triplicate PCR reactions were performed, 95 samples plus 1 negative
control per replicate.
Gel Electrophoresis. A 15% Ficoll 400/0.25% bromophenol blue solution was added to the completed
RAPD products, along with 400 and 900 bp fragments and buffered salt solution; the additional DNA
fragments provided in-lane standards to aid in sizing. Samples were arranged on a 20-lane agarose gel
(1.65% w v-' containing IX TBE [89mM Tris Base, 89mM Boric Acid, lOmM Na2 EDTA-H20]) such
that each sample lane was placed adjacent to a molecular weight marker lane (500/100 bp ladder,
73
-------
Genetic Diversity as an
Ill I I
GenSura). Samples were electrophoresed at 7 °C, 6.8 V cm'1, for 4.5 hours with continuous buffer (0.6X
TBE) recirculation. The gels were subsequently stained in 1000ml of IX TBE with 4 |jg ml"1 of ethidi-
um bromide for 30 minutes with constant agitation and then destained in water for 15 minutes. The gels
were scanned using a Fluorlmager 595 (Molecular Dynamics) with 514 nm excitation and 590df30
bandpass filter, producing a visual image of the PCR products.
Image Analysis. Bands were declared using FragmeNT Analysis software (Molecular Dynamics) and
then further edited visually to eliminate both false bands and those bands below a level of optical densi-
ty that could be reliably scored. Sample band molecular weights were calculated by a point-to-point
logarithmic interpolation method within FragmeNT using the standard lane nearest to the unknown.
Marker Identification. Despite the large number of molecular weight ladders, sizing errors inevitably
occurred so fragment size alone is not a reliable indicator of band homology. Bands were classified
into size groups (bins) using cluster analysis. These bins were further refined by discriminant analysis
of band size and intensity characteristics. In excess of 200 such bins were identified; however, 53 bins
that were most repeatably scored (based on comparison of the three replicates for each individual) were
selected for analysis. The number of bins created ranged from 39 for primer F04 to 42 for primer L05.
Bins in which the average band intensity was low (less than 0.5% of total lane intensity) were excluded
from the analysis, as were bins in which the repeatability of scoring across replicates was poor. Once the
bins were created, each bin was assessed for each individual to determine whether the bins were "filled"
or empty. If a band was observed within the size bin for two of the three replicates scored then the bin
was considered filled for that particular individual.
Similarity Analysis. RAPD fingerprints were compared between each pair of individuals using Lynch's
(1990) similarity index (S), which is defined for two individuals, x and y, as Sxy=2Nxy/(Nx + Ny), where
Njy is the number of bins that are filled for both individuals, and Nx and Ny are the total number of bins
filled for individuals x and y, respectively. The average similarity of individuals within a population
(S w) provides an inverse measure of the genetic diversity within that population. Comparisons of aver-
age similarities among_individuals in the same populations to average similarities among individuals in
different populations (S^) provide a measure of population substructure. Hypothesis tests for significant
differences in Sw's were constructed and tested according to the method of Leonard et al. (1999).
Analogs of Nei's (1972) genetic distance estimate D' between each pair of populations were calculated
following the method of Lynch (1991). Population substructure was estimated using the estimator of
Wright's (1978) FST for DNA fingerprint data described by Lynch (1991). Estimates of the average
number of migrants among populations were made using the approximate equilibrium relationship
FST= !/(!+ 4Nem), where Ne is the effective size and m is the rate at which migrants are exchanged
between populations per generation. Nem, therefore, represents the average (effective) number of
migrants exchanged between populations each generation. Dendrograms relating genetic relationships
among populations were assessed using unweighted pair group means analysis (UPGMA).
Quality Assurance. Several quality assurance measures were employed throughout this study.
Incorporation of three replicates of each RAPD reaction into the design helped to ensure that RAPD fin-
gerprints were highly repeatable. Each set of 95 fish to be processed was assigned a blind sample num-
ber based on the tube assignment of a 96-well PCR plate (8 rows [A-H] x 12 columns [1-12]). This
sample number was carried through all steps of DNA extraction and quantitation, PCR, gel electrophore-
sis and scanning, FragmeNT analysis, and subsequent data handling. A SAS database was maintained to
record the dates and any comments of all aforementioned steps. To avoid any PCR block position bias,
the 95 samples and negative control within one PCR replicate were randomly assigned a position in the
96-well PCR block using a SAS-based random number generator. To avoid any gel lane position bias,
74
-------
Condition and Sustainability
all samples and the negative controls for one PCR replicate of 3 primers (3 primers x 96 DNA
samples/negative control = 288 PCR reactions per replicate) were randomly assigned a lane position on
a gel (288 PCR reactions / 12 reactions per gel = 24 gels). A pre-printed form listing the load order of
samples and specific gel numbers was used at the time of gel loading and any deviations in the assigned
order were noted and changes were made to the database. Following electrophoresis and gel scanning,
gel images were reviewed to verify that the samples were actually loaded in the lanes as assigned. The
appearance of bands in a gel lane designated as a negative control was used to indicate contamination
during the PCR preparation; any questionable replicate was discarded and repeated. Any PCR reactions
that failed to amplify any bands were noted and an additional replicate PCR was performed. Any gels
that produced a poor visual image due to problems associated with electrophoresis were run a second
time with the balance of the PCR reaction. SAS-based programs were employed to compare the original
gel load order assignments with those actually reported as an output from the FragmeNT Analysis; dis-
crepancies were flagged and reconciled where possible.
75
-------
Genetic Diversity as an
Appendix 2: Laboratory and Analytical Procedures for AFLP Analysis
Ill I I
DNA Extraction. DNA was extracted from a section of tail fin (25-50mg) using a commercial kit
(DNeasy, Qiagen) and quantitated using a fluorescently labeled nucleic acid stain (PicoGreen, Molecular
Probes).
AFLP digestion/ligation/amplification. 200 ng of genomic DNA was restriction digested and ligated
overnight at room temperature in a single 15 yd reaction as follows: IX T4 ligase buffer (50mM Tris
Hcl, lOmM MgCl2, lOmM dithiothreitol, ImM ATP), 50mM NaCl, 75 ng BSA, 5 units each Eco RI and
Mse I, 100 Weiss units T4 DNA ligase, 30 pmol Eco RI adapter and 300 pmol Mse I adapter. The
digestion-ligation products were diluted 10-fold with TLE [lOmM Tris-HCL, pH 7.6, O.lmM EDTA].
Preamplification reactions were performed using 5 (il of the diluted digestion-ligation product in a 20 (il
reaction with standard EcoRI (Eco+G) and Msel (Mse+C) adapter primers (30ng each), 20mM Tris
HCL, 50mM KC1, 1.5mM MgCl2, 0.2mM each dNTP, and 0.5 units DNATaq polymerase. The follow-
ing thermocycling profile was used for preamplification: 72°C for 2 min, 24 cycles of 94°C for 30 sec,
65°C for 30 sec, 72°C for 1 min, and ending with a final extension at 72°C for 30 min. The preamplifi-
cation products were diluted 10-fold with TLE (lOmM Tris-HCL, pH 7.6, O.lmM EDTA). Selective
amplification reactions were performed using a 5 (il of the diluted pre-amplification product in a 20 (il
reaction with 20mM Tris HCL, 50mM KC1, 1.5mM MgC12, 0.2mM each dNTP, 0.5 units DNA Taq
polymerase, 5 ng MseC+1 primer, 15 ng EcoG+2 primer with a 5' fluorescein attachment. Ten
EcoG+2/MseC+l primer combinations were used for selective amplification PCR conditions for the
selective amplification were as follows: 12 cycles of 94°C for 30 sec, 65°C for 30 sec (dropping 0.7°C
per cycle), 72°C for 1 min, followed by 24 cycles of 94°C for 30 sec, 56°C for 30 sec, 72°C for 1 min; a
final extension at 72°C for 30 min completed the PCR.
Gel Electrophoresis. Selective amplification products were diluted 1:1 with sequencing load dye
(deionized formamide, 0.5 mg/ml xylene cyanole); 28 samples and 5 lanes of a 10 bp size ladder were
electrophoresed on 5% denaturing polyacrylamide gels in 0.5X TBE buffer at 25W for 1.3 hours. Gels
were scanned using a Molecular Dynamics 595 Fluorlmager (488nm excitation, 530df30 filter) to visu-
alize the fluorescein-tagged amplification products.
Fragment Analysis. Each lane was assigned a score based on visual quality (0 = no amplification, 1 =
very few and/or light bands, 2 = only very intense bands are able to scored, 3 = dark bands are scorable,
4 = most bands are distinct over a range of intensity); lanes with a score <3 were not analyzed and those
samples were subjected to repeat amplification and electrophoresis. Images were evaluated using Cross
Checker (v 2.91) and a binary table indicating presence or absence for each sample/primer combo/mark-
er was generated. A second Cross Checker evaluation was made by another analyst and the resulting
binary scores were compared with the original interpretation. Replicates (DNA extraction through selec-
tive amplification) were done on 20% of the samples; binary scores from the replicate samples were
used to determine reliability of a particular AFLP marker. Those markers whose binary scores were in
disagreement, either between analysts or replicate samples were eliminated from the final analysis.
Within and among population similarities were calculated; pairwise t-tests of the within population simi-
larities were performed. A genetic distance matrix (Nei, 1987) based on the among- population similari-
ties was constructed to test for evidence of population subdivision. A molecular analysis of variance
(AMOVA, Excoffier et a/., 1992) was performed to partition genetic diversity within and among popula-
tions.
76
-------
Condition and Sustainability
Appendix 3: Laboratory and Analytical Procedures for DNA Sequence Analysis
Mitochondrial DNA amplification. Genomic DNA was amplifed with mitochondrial DNA primers for
part of the cytochrome B region. Reactions were performed using 1.5 yd of diluted genomic DNA (25
ng/(il) in a 20 (il reaction with 50 (il each of cytochrome B primers GLU (5'-TGA CTT GAA GAA
CCA CCG TT-3') and THR (5'ATC TTC GGA TTA CAA GAC CGA); 20mM Tris HCL, 50mM KC1,
1.5mM MgCl2, 0.2mM each dNTP, and 0.5 units DNA Taq polymerase (Gibco BRL). The following
thermocycling profile was used for preamplification: 94°C for 5 min, 28 cycles of 94°C for 40 sec, 52°C
for 30 sec, 72°C for 1 min, and ending with a final extension at 72°C for 5 min. The reaction products
were diluted 1:1 with buffered load dye containing bromophenol blue and were electorphoresed in 1%
agarose gels containing ethidium bromide in 0.6X TBE buffer at 250V for 2.5 hours. The band was
visualized under UV light, excised from the gel, and purified using QIAquick kit (Qiagen). The result-
ing amplification product was quantitated using using a fluorescently labeled nucleic acid stain
(PicoGreen, Molecular Probes) and diluted to a concentration of 2.5ng/ (il.
Mitochondrial DNA sequencing. Cycle sequencing reactions were performed in 20 (il using 1 (il of
template (2.5 ng/ (il purified cytochrome B reaction), 2 (il ABI Ready Reaction mix (ABI BigDye
Terminator [v.l] kit), 60mM Tris, 1.5mM MgCl2, and 0.8 M GLU primer. Positive control reactions
were similarly prepared using M13 primer and pGEM template. The following thermocycling profile
was used for cycle sequencing: 25 cycles of 94°C for 10 sec, 50°C for 5 sec, 60°C for 4 min. Excess
terminator dyes were eliminated by isopropanol precipitation of the product. The sample was resus-
pended in deionized formamide and capillary electrophoresed (Applied Biosystems 3100 Genetic
Analyzer; 50cm array, POP-6 polymer).
Sequence analysis. Sequences were reviewed in Sequence Analysis (v3.7, Applied Biosystems), edited
and aligned by hand. Genetic structure was analyzed by AMOVA and minimum spanning networks
using the software Arlequin (v2.0).
77
-------
Genetic Diversity as an Indicator of Ecasjfijjll
ill
78
-------
------- |