Seminar Publication National Conference on Environmental Problem Solving with Geographic Information Systems


&EPA
            United States
            Environmental Protection
            Agency
            Office of Research and
            Development
            Cincinnati, Ohio 45268
EPA/625/R-95/004
September 1995
           Publication
National Conference on
Environmental
Problem-Solving with
Geographic Information
Systems
            Cincinnati, Ohio
            September 21-23,1994

-------
                                                      EPA/625/R-95/004
                                                       September 1995
                    Seminar Publication

National Conference on Environmental Problem-Solving
          with Geographic Information Systems
                       September 21-23, 1994
                         Cincinnati, Ohio
                  U.S. Environmental Protection Agency
                  Office of Research and Development
              National Risk Management Research Laboratory
              Center for Environmental Research Information
                         Cincinnati, Ohio

-------
                                       Notice
This document has been subjected to EPA's peer and administrative review and has been approved
for publication as an EPA document. The views expressed are those of the authors and do not
necessarily reflect those of the Agency. Mention of trade names or commercial products does not
constitute endorsement or recommendation for use.

When an NTIS number is cited in a reference, that document is available from:

     National Technical Information Service
     5285 Port Royal Road
     Springfield, VA 22161
     703-487-4650

Please note that the Soil Conservation Service of the U.S. Department of Agriculture is  now the
Natural  Resources  Conservation  Service; "Soil Conservation Service" is  used throughout this
document because this was the name in use at the time of the conference.

-------
                                            Contents
                                                                                             Page

GIS Concepts

GIS Uncertainty and Policy: Where Do We Draw the 25-Inch Line?
       James E. Mitchell	  3

Data Quality Issues Affecting GIS Use for Environmental Problem-Solving
       Carol B. Griffin	  15

You Can't Do That With These Data! Or: Uses and Abuses of Tap Water Monitoring Analyses
       Michael R. Schock and Jonathan A. Clement	  31

Ground-Water Applications

Using GIS/GPS in the Design and Operation of Minnesota's Ground Water
Monitoring and Assessment Program
       Tom Clark, Yuan-Ming Hsu,  Jennifer Schlotthauer, Don Jakes, and Georgianna Myers.	  45

Use of GIS in Modeling Ground-Water Flow in the Memphis, Tennessee, Area
       James Outlaw and Michael Clay Brown	  50

MODRISI:  A PC Approach to GIS and Ground-Water Modeling
       Randall R. Ross and Milovan S. Beljin	  60

GIS in Statewide  Ground-Water Vulnerability Evaluation to Pollution Potential
       Navulur Kumar and Bernard A. Engel	  66

Verification of Contaminant Flow Estimation With GIS and Aerial Photography
       Thomas M. Williams	  74

Geology of Will and Southern Cook  Counties, Illinois
       Edward Caldwell Smith	  81

Watershed Applications

The Watershed Assessment Project:  Tools  for Regional Problem Area Identification
       Christine Adamus	  97

-------
Contents (continued)

Page

Watershed Stressors and Environmental Monitoring and Assessment Program
Estuarine Indicators for South Shore Rhode Island
John F. Paul and George E. Morrison 101

CIS Watershed Applications in the Analysis of Nonpoint Source Pollution
Thomas H. Cahill, Wesley R. Homer, and Joel S. McGuire 110

Using CIS To Examine Linkages Between Landscapes and Stream Ecosystems
Carl Richards, Lucinda Johnson, and George Host 131

Nonpoint Source Water Quality Impacts in an Urbanizing Watershed
Peter Coffin, Andrea Dorlester, and Julius Fabos 142

A CIS for the Ohio River Basin
Walter M. Grayman, Sudhir R. Kshirsagar, Richard M. Males, James A. Goodrich,
and Jason P. Heath 151

Nonpoint Source Pesticide Pollution of the Pequa Creek Watershed, Lancaster County, Pennsylvania:
An Approach Linking Probabilistic Transport Modeling and CIS
Robert T. Paulsen and Allan Moose 156

Integration of CIS With the Agricultural Nonpoint Source Pollution Model:
The Effect of Resolution and Soils Data Sources on Model Input and Output
Suzanne R. Perlitsh 164

XGRCWP, a Knowledge- and CIS-Based System for Selection, Evaluation, and
Design of Water Quality Control Practices in Agricultural Watersheds
Runxuan Zhao, Michael A. Foster, Paul D. Robillard, and David W. Lehning 175

Integration of EPA Mainframe Graphics and CIS in a UNIX Workstation Environment
To Solve Environmental Problems
William B. Samuels, Phillip Taylor, Paul Evenhouse, and Robert King 183

Wetlands Applications

Wetlands Mapping and Assessment in Coastal North Carolina:
A CIS-Based Approach
Lori Sutter and James Wuenscher 199

Decision Support System for Multiobjective Riparian/Wetland Corridor Planning
Margaret A. Fast and Tina K. Rajala 213

Design of CIS Analysis To Compare Wetland Impacts on Runoff
in Upstream Basins of the Mississippi and Volga Rivers
Tatiana B. Nawrocki 218
IV

-------
Contents (continued)

Page

Water Quality Applications

Vulnerability Assessment of Missouri Drinking Water to Chemical Contamination
Christopher J. Barnett, Steven J. Vance, and Christopher L Fulcher 235

Reach File 3 Hydrologic Network and the Development of CIS Water Quality Tools
Stephen Bevington 238

EPA's Reach Indexing Project: Using CIS To Improve Water Quality Assessment
Jack Clifford, William D. Wheaton, and Ross J. Curry 242

Environmental Management Applications

Ecological Land Units, CIS, and Remote Sensing: Gap Analysis in the Central Appalachians
Ree Brannon, Charles B. Yuill, and Sue A. Perry. 255

A CIS Strategy for Lake Management Issues
Michael F. Troge 261

A Watershed-Oriented Database for Regional Cumulative Impact Assessment and Land Use Planning
Steven J. Stichter 266

A CIS Demonstration for Greenbelt Land Use Analysis
Joanna J. Becker 273

CIS as a Tool for Predicting Urban Growth Patterns and Risks
From Accidental Release of Industrial Toxins
Samuel V. Noe 278

Integration of CIS and Hydrologic Models for Nutrient Management Planning
Clyde W. Fraisse, Kenneth L Campbell, James W. Jones, William G. Boggess,
and Babak Negahban 283

Other GIS Applications

Expedition of Water-Surface-Profile Computations Using GIS
Ralph J. Haefner, K. Scott Jackson, and James M. Sherwood 295

Small Is Beautiful: GIS and Small Native American Reservations—
Approach, Problems, Pitfalls, and Advantages
Jeff Besougloff 299

A CIS-Based Approach to Characterizing Chemical Compounds
in Soil and Modeling of Remedial System Design
Leslie L Chau, Charles R. Comstock, and R. Frank Keyser 302

-------
                                      Contents (continued)

                                                                                             Page

Polygon Development Improvement Techniques for Hazardous Waste Environmental Impact Analysis
       David A. Padgett	 308

Comparing Experiences in the British and U.S. Virgin Islands in Implementing CIS
for Environmental Problem-Solving
       Louis Potter and Bruce Potter	 313

Application of CIS for Environmental Impact Analysis in a Traffic Relief Study
       Bruce Stauffer and Xinhao Wang	 322
                                                VI

-------
A cknowledgments
The success of the conference and this document is due largely to the talents and efforts of many
individuals. Gratitude goes to each person involved.
Authors

A special thanks goes to the many authors of the papers presented in this document. Their efforts
in preparing papers made this document possible and led to the overall success of the conference.
Peer Reviewers

The following individuals peer reviewed this document:
• Jim Goodrich, U.S. Environmental Protection Agency, National Risk Management Research
Laboratory, Cincinnati, Ohio.
• Nancy Phillips, Hollis, New Hampshire.
• John Shafer, Earth Sciences & Resources Institute, University of South Carolina, Columbia,
South Carolina.

Editorial Review and Document Production

Heidi Schultz, Eastern Research Group, Inc., Lexington, Massachusetts, directed the editorial
review and production of this document.

Technical Direction and Coordination

Daniel Murray and Susan Schock, U.S. Environmental Protection Agency, Office of Research and
Development, National Risk Management Research Laboratory, Center for Environmental Re-
search Information, Cincinnati, Ohio, coordinated the preparation of this document and provided
technical direction throughout its development.

Special Thanks

Special thanks goes to the following individuals for their support:
• Tom Davenport, U.S. Environmental Protection Agency, Region 5, Chicago, Illinois.
• Mike Forrester, Urban and Regional Information Systems Association, Washington, DC.
• Bill French, American Society for Photogrammetry and Remote Sensing, Bethesda, Maryland.
• Carol Griffin, Idaho Falls, Idaho.
• Lyn Kirschner, Conservation Technology Information Center, West Lafayette, Indiana.
• Kevin Klug, Association of American Geographers, Washington, DC.
VII

-------
•   John Lyon, Ohio State University/American Society of Photogrammetry and Remote Sensing,
    Columbus, Ohio.
•   Mark Monmonier, Department of Geography, Syracuse University, Syracuse, New York.
•   Sam Noe, Joint Center for CIS & Spatial Analysis, University of Cincinnati, Cincinnati, Ohio.
•   Nancy Phillips, Hollis, New Hampshire.
•   Javier Ruis, U.S. Department of Agriculture, Soil Conservation Service, Fort Worth, Texas.
•   Don Schregardus, Ohio Environmental Protection Agency, Columbus, Ohio.
                                          VIM

-------
                                            Introduction
The  National Conference on Environmental Problem-
Solving with Geographic Information Systems was held
in  Cincinnati, Ohio, September 21 to 23, 1994. The
conference was a forum for over 450 environmental
professionals to exchange information and approaches
on how to use geographic information systems (CIS) to
define, assess, and solve environmental problems.
Cross-media pollutant transport and watershed-based
decision-making have made the process of solving en-
vironmental problems more complex. The application of
CIS  to environmental problem-solving has greatly in-
creased our ability to  manipulate and analyze relational
and spatial data, providing environmental decision-mak-
ers with a powerful tool for analyzing  multimedia envi-
ronmental data over  increasingly broader areas (e.g.,
watersheds, states, regions). While the approach  to
using CIS varies from application to application, a com-
mon, technically sound framework for applying  CIS to
environmental problems should  be developed and im-
plemented. This  conference was an initial step in defin-
ing this framework by examining the following areas:

• Problem identification and definition.
• Data requirements  (e.g., coverage, scale), availabil-
  ity, documentation,  reliability, and acquisition.
• Approaches considered  and  selected for problem-
  solving.
• Unique  challenges  and pitfalls encountered.

• Interpretation of results, including level of confidence
  achieved based on data quality and approach taken.
Presenters were requested to address one or more of
these areas in papers and  posters that focused on
applications of CIS to specific environmental problems.

This document presents peer-reviewed papers from the
conference. The papers have been organized by gen-
eral topic area as follows:

• CIS Concepts
• Ground-Water Applications

• Watershed Applications
• Wetlands Applications

• Water Quality Applications
• Environmental Management Applications

• Other CIS Applications

The purpose of this document is to share the information
presented at the conference with individuals who were
unable to  attend. This document will be useful to indi-
viduals who are currently applying CIS to environmental
situations  or considering CIS for application in environ-
mental problem-solving. These individuals include envi-
ronmental regulatory personnel at the federal, state and
local level; university professors, researchers, and stu-
dents; private sector personnel,  including industry rep-
resentatives and environmental consultants; and other
interested persons. The goal of sharing this information
with a broader audience is to help users apply CIS to
environmental  problem-solving with  a greater aware-
ness of the power and limitations of this very useful tool.
                                                  IX

-------
GIS Concepts

-------
GIS Uncertainty and Policy: Where Do We Draw the 25-Inch Line?
James E. Mitchell
Institute for Environmental Studies, Louisiana State University, Baton Rouge, Louisiana
Abstract

The growing availability of improved hardware and soft-
ware for geographic information systems (GIS) has out-
stripped most users' ability to identify and represent
uncertainty in the available data. In practice, the prolif-
eration and compounding of errors and uncertainty in-
crease as information becomes more easily handled
and combined from different sources.

Various stages of GIS database development and analy-
sis generate different forms and amounts of error and
uncertainty. In most cases, inherent uncertainty within
source data is simply ignored and its nature eventually
lost through subsequent processing. Both the location
of features and their attributes can include error and
uncertainty. By the time decision-makers receive mapped
information, it is typically represented as correctly lo-
cated and attributed.

The use of weather and climate information provided by
the National Climatic Data Center (NCDC) is a common
example of this scenario. Weatherstation locations pro-
vided by NCDC are reported to the nearest truncated
degree-minute. A minute is one-sixtieth of a degree of
arc. In the center of the continental United States, 1
minute of latitude averages approximately 6,000 feet
and 1 minute of longitude averages approximately 4,800
feet. Thus, the station location is only known to lie within
a box of approximately 1 square mile. Map repre-
sentations of these data should reflect this uncertainty.

Under the Municipal Solid Waste Landfill (MSWLF) Cri-
teria, the U.S. Environmental Protection Agency has
dictated that the 25-inch precipitation contour line be
used as a regulatory boundary for the level of protection
required at municipal landfill sites. The way in which
these lines are created and interpreted has important
policy implications. Indeed, the cost and practicality of a
given location must take this into account. If the 25-inch
precipitation figure is critical, characterizing its uncer-
tainty is also important.
In this work, uncertainty is considered a property of the
data (1). A Monte Carlo procedure is used to represent
the stochastic character of contour lines generated from
point data with known locational uncertainty. The 30-
year normal precipitation data for Kansas are used as
an example. The results of this study are compared with
the 25-inch contour used for regulatory purposes in
Kansas. This study demonstrates that the method of
interpolation greatly influences the resulting contours. In
addition, locational uncertainty changes the results un-
predictably using four different contouring methods. Fi-
nally, the differences have potentially significant policy
implications. The nature and origin of these factors are
discussed.

Problem Statement

The increasing power of geographic information sys-
tems (GIS) and the availability of digital data have en-
abled users and decision-makers to perform complex
spatial analyses for a great variety of environmental
applications (2). The rapid expansion of GIS has re-
sulted in a parallel growing concern about the quality of
data (3).

An understanding of error and uncertainty is critical for
proper use of spatial information. For the purposes of
this discussion, error is defined as a deviation between
the GIS representation of a feature and its true value (4).
For a location, this might arise from rounding or truncat-
ing digits. Attribute error can involve misclassification of
a feature or some other form of incorrectly accounting
for its nature. Error is a measurable value quantifying
these differences.

Furthermore, uncertainty shall refer to a characteristic
for which the exact location and/or quantity cannot be
calculated (5) or an attribute whose value represents a
distribution or some other ensemble (composite) meas-
ure. Locational uncertainty often arises when inappro-
priate measurement systems are used. An example of
this is the use of a Public Land Survey designation (often

-------
referred to as a legal location) to specify a point location.
This system is designed to represent a tract of land (an
area). It is not accurate for locating points (1).

The uncertainty associated with an attribute is an impor-
tant characteristic of that feature. It quantifies the preci-
sion of a stochastic  quantity;  that is, one  that is not
accurately represented by a single value. Annual pre-
cipitation is represented by a single number, typically the
30-year mean annual  precipitation (30-year normal).
This number varies each year, however, and that uncer-
tainty can be quantified by the  variance or other statis-
tical measures. In this sense, uncertainty is a known  or
calculable value that can be used in spatial analyses.

Unreliable CIS data and products may lead to adverse
environmental and legal consequences. The National
Center for Geographic Information Systems and Analy-
sis (NCGISA) chose data quality as the first initiative on
its CIS  research agenda (6). Many efforts  have been
made priorto this, and since, to  understand and manage
error and uncertainty in CIS applications.

CIS analyses are inherently subject to propagation  of
error and uncertainty (4, 7). No data set can represent
every spatial reality of a geographically dispersed phe-
nomenon. Monmonier (8) points out that as long as the
three-dimensional earth's surface is transformed to a
two-dimensional plane,  error and uncertainty of various
forms will be produced. Goodchild and Min-Hua (9) point
out two issues that are important when dealing with error
and uncertainty:

• Minimization of error  in the creation of CIS products.

• Measurement and  presentation of error and  uncer-
  tainty in a useful fashion.

CIS technology introduces error and uncertainty through
two major sources: (1) inherent error and (2) operational
error.  Inherent error is the error present in source data.
It is generated when the data are collected. Operational
error is generated during data entry and manipulation
(7,  10-13). Examples include  locational shifts due  to
projection  or  combining   information  from  different
source scales.

Most error and uncertainty contained in CIS data cannot
be eliminated. Instead, they are actually created, accen-
tuated, and  propagated through CIS manipulation pro-
cedures (14-16). Most operational errors are difficult to
estimate.

The selection by the  U.S. Environmental Protection
Agency (EPA) of a 25-inch per year local precipitation
limit as one of the criteria to determine whether small
municipal solid waste landfills (MSWLF) are subject  to
the provisions of Subtitle D provides an excellent exam-
ple of how uncertainty and errors enter into a CIS analy-
sis and its subsequent  products. It demonstrates all  of
the major forms and purveyors  of error and uncertainty:
• Spatial (locational) error

• Statistical (sampling) uncertainty

• Temporal (time domain) error

• Error proliferation (processing error)

• Analytical (choice of methodology) error

• Cartographic representation error

Many of these are avoidable;  some are known and
understood, yet they remain largely ignored by users of
CIS technology. This work presents each of these fac-
tors, discusses their origins, and shows how CIS could
have been used to better serve the policy and regulatory
processes. The Kansas example demonstrates that ig-
noring the factors influencing error and uncertainty can
result in incorrect conclusions and inappropriate policy
decisions.

Data Requirements  and Sources

To perform an analysis of precipitation, data are typically
obtained  from the  National  Climatic  Data  Center
(NCDC), located in Asheville, North Carolina. This is the
national repository for such data.  These data are also
available through state or regional climate centers. The
Kansas Weather Library at Kansas State University pro-
vided data for this study. The 1990  "normal precipitation"
data (17)  and locations were obtained and generated
into an ARC/INFO point coverage. Figure 1 displays the
locations of the precipitation stations used in this study.

Normal  precipitation is defined as the average annual
precipitation for a  three-decade  (30 years)  period  at
each station  for which reliable  data are  available. To
avoid "edge effects" (processing anomalies due to a lack
of data along edges of an area), all stations in  Kansas
and some from neighboring states were used. A total of
380 stations compose this data set. In addition, precipi-
tation contours from the "Availability of Ground Water in
Kansas  Map" (18) were digitized from a [paper] source
map. The Geohydrology Section of the Kansas Geologi-
cal Survey  provided  base map  coverages  of carto-
graphic features.

All data represent the best available information from the
source institutions noted above.  Those organizations
use the data  in  their analytical and  cartographic re-
search and production operations.

Methodology

To examine the influence of locational uncertainty on the
representation of three-dimensional, natural phenom-
ena, a Monte Carlo approach was adopted (1). Using
this technique, random realizations of point locations are
generated for each  rain gauge,  in each of 50 separate
simulations. From this, 50 possible representations of
the unknown locations of each gauge are used to create

-------
                        *     •
  •   •   .
                                                                    \  •
                                                                            • .'
Figure 1.  Location of rain gauges used in this study.

50 different sets of contours. All Monte Carlo calcula-
tions and data generation were performed using Statis-
tical Analysis System (SAS) (19, 20).

These 50 simulations were sequentially processed us-
ing the four different contouring methods available within
ARC/INFO. This provided a means to examine analyti-
cal error propagation.  The first of the  four methods  is
kriging (21, 22). This is referred to in the paper as the
UK method, for its use of linear universal kriging (23).
The other three are manipulations of the triangular-
irregular network (TIN) contouring algorithm available in
ARC/INFO. These differ by the number of interpolation
points used along the edges  of the elements in the TIN
data structure (24). The first used the default  1, the
second used 5, and the third  used 10 (the largest value
available). These are labeled D1, D5, and D10, respec-
tively.

CIS operations used in this work include overlay analy-
sis, areal calculation, and arc intersection. ARC/INFO
was  used for all  CIS and cartographic  production  in
this work.

Identifying the Sources of Uncertainty

Spatial Error

Data obtained from NCDC is provided with the knowl-
edge  that weather station locations are reported using
truncated degrees and minutes of longitude and latitude.
NCDC cannot provide  any better locational accuracy at
this time. Because each location is reported with error,
this clearly has the potential to affect any contours or
other three-dimensional features interpolated from the
data. The magnitude and  nature of this influence are
unknown and unpredictable (1).

In addition to the poorly defined station locations, exami-
nation of the data revealed other anomalies. The loca-
tions in the publication reporting normal precipitation
(17) were not identical to those identified by the Kansas
State climatologist and NCDC. Some of the discrepan-
cies were quite large. These anomalies were brought to
the attention of all parties  involved.  No resolution was
provided to this investigator's satisfaction, however.

The contours digitized from the "Availability  of Ground
Water in Kansas Map" are stated to originate from the
1960 normals (18). No documentation exists, however,
concerning the way the lines were derived or the number
of rain gauges used. Presumably, they were contoured
by hand.

Statistical Uncertainty

This is a sampling consideration based on the size of
the data, the nature of the  process being sampled, and
its variability. Unfortunately, precipitation is a particularly
"patchy"  phenomenon. That is, rain falls in a discon-
tinuous  fashion, and adjacent gauges  can depict very
different  patterns. This is confounded  by the fact that
most contouring algorithms and other approaches to

-------
represent three-dimensional surfaces  assume a rela-
tively smooth (locally) and continuous process.

Areal processes are almost always sampled  as point
information. Most contouring algorithms require a regu-
lar grid  from  which to interpolate surface features. In
practice, rain gauges, as well as other environmental
sampling programs, are  irregularly distributed.  Place-
ment often depends on factors other than grid sampling
(e.g., convenience,  access  to  communications,  fi-
nances, Congressional districts). This creates a "nonex-
perimental sampling" design  (25). Nonexperimental
sampling can contribute to uncertainty (26, 27).

Temporal Error

The  normals are recalculated each decade  and can
change  drastically in local areas. These changes arise
for various reasons. First, some stations enter and drop
from the database. Stations are deleted due to  changes
in location or extended periods of data collection prob-
lems. On occasion, new stations are added. Thus, the
size and areal coverage of the data set changes with time.

In  addition,  weather patterns change  with time. Ex-
tended periods of drought or excess rain or snow alter
measured precipitation. In turn, three-dimensional rep-
resentations change unevenly.

Error Proliferation

Once an error enters into the database and is  included
in CIS operations, spatial analysis, or spatial interpola-
tion,  its effect passes into the next stage of processing.
In the 1990 normal precipitation data for Kansas, two
stations are reported in  Garden City. Despite the fact
that the two are only a few miles apart, their annual total
precipitation differs by 2 inches! In consultation with the
state climatologist (Mary Knapp, Kansas State Univer-
sity), one was eliminated from the analysis. This process
was  repeated for an additional six stations where re-
ported values appeared to be anomalous compared with
nearby  stations  or the  previous  normal precipitation
(1951 to 1980).

Errors can also proliferate through the normal  handling
of data. With geographic data, this often occurs while
converting data from raster to vector and vector to raster
forms (28). Some CIS operations are best accomplished
in one form or another. As a result, transformations are
often "hidden"  from  the  user.  Commonly,  features
"move" slightly after each step in an analysis.

Analytical Error

Different techniques have been developed for  perform-
ing spatial interpolation, and an abundance of  software
is available for this purpose. All these methods have
strengths and weaknesses.  Each is based on a specific
set of assumptions about the  form and nature of the
data.  Some are more robust (less sensitive to data
anomalies) than others. Most importantly, some provide
additional information useful in data analysis.

Unfortunately, users often "take the defaults" when using
sophisticated techniques and  ignore the assumptions
behind the method. Parameters can be varied and their
effect evaluated, as in a sensitivity analysis (29).  Often,
the best approach is to try several methods and evaluate
their joint performance (30,  31).

Another difficulty is the need to assign values to  areas.
By definition, polygons in a CIS are considered to be
homogeneous. In reality, they bound areas that are a
gradation from  one characteristic to another. On the
other  hand, contours are  commonly used to depict sur-
face gradients but are useless (within a CIS) for analyti-
cal or modeling purposes. Ultimately, data sampling is
accomplished as a  point process (except, perhaps,  in
remote sensing), while many forms of data analysis and
processing require areal information.

Cartographic Representation Error

Communicating the uncertainty of map features is not a
trivial  endeavor.  Maps can be produced in two basic
forms: as a raster  (e.g., orthophotoquads  or satellite
images)  or a composition of vectors  (e.g., contour
maps). The printing process, however, often  reduces all
of this to a raster representation at a very fine pixel size.
Each  method poses its own problems in depicting un-
certainty.

Rasters can be used effectively in conjunction with color
information theory to produce  a continuum  of shading
within a thematic map layer (32). The choice of colors,
however, can influence the interpretation of the data,
and no universal scheme exists for  depicting thematic
variability. For example,  blue  shades often represent
water or cold, while yellow and/or red often represent
temperature or heat.

Vectors present a different suite of problems.  Contouring
is the  primary technique for using vectors to depict areal
variation. By definition, however, contour lines represent
an exact isoline or single value along its  length. Uncer-
tainty  cannot be represented  in a line. Rather, a com-
posite of lines can be displayed that represents a set of
possible  interpretations of the  data. This  is not a  practi-
cal solution for mapping, however, as it can create a
jumble of intersecting lines that makes interpretation
difficult and is not an aesthetic means of presentation.

Challenges Encountered in This Study

This work attempts to discover and account for sources
of error  and  uncertainty in CIS  analysis.  Given  this
information, the challenges are to find the best way  to
incorporate it into the analysis and to represent it in a
useful manner. Another challenge is finding ways to use

-------
CIS uncertainty to support policy and management de-
cisions. Addressing these manifold problems starts with
identifying the sources of error and uncertainty, the way
they enter the analysis, and the manner in which they
are propagated through the use of CIS.

This study includes  a number of known  sources of un-
certainty. In practice, this is not always the case. Users
of CIS data and technology should always assume that
the sources of uncertainty discussed in  this paper are
present and attempt to  determine their  nature. Uncer-
tainty should  be considered a property of the data and
appropriately represented  (1). This is  the  approach
taken in this work.

After examining these factors, a Monte Carlo simulation
was deemed  the most appropriate approach to capture
the nature  of the locational  uncertainty. Four different
methods of contouring were used to examine analytical
uncertainty (uncertainty due to the choice of a contour-
ing algorithm). In addition, the contribution of statistical
(sampling)  uncertainty  could have been addressed
through incorporating information about the standard
error of the point precipitation measurements (normals)
used as the base data. Time limitations precluded ex-
amining this dimension of the question. Comparing the
contours resulting from the 1960 and 1990 precipitation
normals demonstrates the effect of temporal variation.

The  greatest challenge is  communicating the uncer-
tainty in a manner useful to decision-makers. This paper
presents a series of maps, figures, and tables aimed at
addressing this problem. Some of the maps (see Fig-
ures 2 through 5) show the uncertainty resulting from
each of the contouring  methods. Figure 6 depicts the
union (overlay) of the four approaches and displays their
correspondence. The pie chart in  Figure 7 is a nonspa-
tial representation of this correspondence and the rela-
tive area represented within the different combinations
of overlapping regions of uncertainty. Tables 1 through
3 further compare these quantities. Figure 8 is the map
that the Kansas Department of Health and Environment
(KDHE) chose to define the  regulatory  boundary (the
25-inch contour). The contours resulting from this study
can be seen in Figure 9. Finally, the map in Figure 10 is
a cartographic comparison of the differences between
the contours  used by KDHE  (based on the 1960 nor-
mals) and those  generated  by the currently available
data (the 1990 normals).

There  is no  single  best approach for meeting these
challenges, and there may never be one. The real chal-
lenge to  address is  how to educate technical CIS pro-
fessionals  and the  users  of their work to look for
uncertainty and consider its influence on their decision-
making process.
Results

The zones of uncertainty defined by the results from the
four contouring methods used in this study are displayed
in Figures 2 through 5. For each method, these zones
represent the areal extent of the overlain contour lines
produced in the 50 simulations. Each region is bounded
by the furthest west or east contour generated along any
length  of the region. Table  1  shows the relative  area
falling within each of these zones as they traverse the
state of  Kansas. Clear differences exist between the
total areas of uncertainty. It is their placement and rela-
tive location, however, that  have policy and manage-
ment  implications. CIS is required to examine  these
questions.

Table 1.  Comparison of Absolute and Relative Area of
        Uncertainty Arising From Four Methods of
        Determining the 25-Inch Precipitation Contour


Method3
UK
D1
D5
D10


Total Area
(square
miles)
289.33
494.38
602.13
631.11
Difference
Between
This and
UK Method
(square
miles)
—
205.05
312.80
341 .78


Percentage
of UK
Method
—
170.87
208.11
218.13


Percentage
of Combined
Areab of
Uncertainty
19.65
33.58
40.90
42.87
 UK = universal kriging with linear drift, D1 = TIN interpolation with 1
 subdivision, D5 = TIN interpolation with 5 subdivisions, D10 = TIN
 interpolation with 10 subdivisions (23, 24).
bA union (overlay) of all four sets of regions of uncertainty creates a
 combined area of 1,472.07 square miles. This includes the zones of
 uncertainty  for each method of contouring and areas not included
 within any of the four regions of uncertainty (gaps between them).

Figures 2 through 5 clearly show differences in both the
extent of uncertainty in the 25-inch contour line and its
positional  interpretation. Each  method  has a  slightly
different bend or twist.  Islands (isolated regions where
the 25-inch  line  appears as  a  closed loop) are mani-
fested  differently   depending  on  the  interpolation
scheme. It is interesting to note the relative correspon-
dence between  the general shape of the D1  and UK
methods. In  the south-central border region, D1 and UK
represent the local uncertainty as a bulge, while  D5 and
D10 depict it as an island of lower precipitation.

Areal correspondence  and difference are  depicted in
Figure 6 and Table 2. Figure 7 is a pie chart visualizing
the information in Table 2. These results are somewhat
surprising  in that areas where none of the four methods
located  the 25-inch line ("None  Present") represent the
second  largest composite area. Because of the large
number of "sliver polygons," the graphic representation
of the overlay is  somewhat difficult to interpret. Table 2
clarifies these interrelationships by breaking down the
various  categories. The average area per polygon value

-------
Figure 2.  Regions of uncertainty produced by the UK method of contouring.
Figure 3.  Regions of uncertainty produced by the D1 method of contouring.

-------
Figure 4.  Regions of uncertainty produced by the D5 method of contouring.
Figure 5.  Regions of uncertainty produced by the D10 method of contouring.

-------
UK
D1
D5
D10
Not Present
Figure 6. Union of the regions of uncertainty from all four methods of contouring. The numerous "sliver" polygons make this a
difficult presentation to interpret at this scale. The black areas appear prominently, however. These represent areas where
no method placed contour lines.
adds to the interpretation of the relative areas by incor-
porating the number of polygons in each category. An
inspection of this column makes those categories with
a multitude of very small polygons stand out. It also
displays a number of large jumps in magnitude. As this
number increases, the significance of the correspondence
increases.

Table 2 highlights the correspondence between the D5
and D10 approaches and the D1 and UK methods of
interpolation. This relationship is interesting because the
algorithms used by the UK and D1 methods both are
forms of linear interpolation. The D5 and D10 algorithms
are designed to provide more "smoothing" and appear
to create increasingly more "bull's eyes." Only D5 and
D10 generate these features. Of the 50 simulations, a
particular bull's eye appears west of the 25-inch contour
(see Figures 3 and 4) four times using D5 and 39 times
using D10. The size and location of these anomalies
also vary with the input data. Polygons containing con-
tours from all four methods rank ninth in total area and
seventh in average area (out of 17 categories). This
supports the conclusion that the four chosen methods
have a relatively low spatial correspondence.

Table 3 breaks down the area of uncertainty by county.
Although the zones of uncertainty appearto be relatively
small when displayed on a statewide basis, they have
important impacts in local areas. In particular, combining
this information with soils, topography, ground water,
and other information can clearly indicate whether a
specific location is suitable for a landfill. Often, informa-
tion developed at one scale is used in another. In this
case, statewide information is being used fora site-specific
application.

Figure 8 is a copy of the map the KDHE used to deline-
ate the 25-inch precipitation contour. The results from
LRGflty
12.13%
D1,03. acd DIG
11.11%
Ml Oiling;
Norn Praianl
22.13%
Figure 7. Breakdown of the total area in each category result-
ing from the union (overlay) of the regions of uncer-
tainty from all four contouring methods. The "None
Present" category represents a surprisingly large
proportion among the 17 possible combinations.
10

-------
Table 2. Comparison of Absolute and
Relative Area
of
Table 3.
Uncertainty Arising From Four Methods of
Determining the

Methods of
Contouring
Found Within
Area

D5 and D10
None present
D1 only

UK only

D1, D5, and D10

D10 only

UK and D1

D5 only
Common to all

UK, D5, and D10

D1 and D5

D1 and D10
UK and D5

UK, D1, and D5

UK, D1, and D10

UK and D10
Combined total




N

28
28
37

27

24

69

19

73
10

14

38

38
19

12

12

13
461
25-Inch Precipitation Contour


Total Area
(square
miles)

350.13
326.06
221 .02

178.60

163.61

70.07

65.82

34.68
23.45

15.47

10.68

6.50
2.21

1.89

1.41

0.47
1472.07

Percentage
of Total
Combined
Area

23.78
22.15
15.01

12.13

11.11

4.76

4.47

2.36
1.59

1.05

0.73

0.44
0.15

0.13

0.10

0.03
100.00
Average
Area per
Polygon3
(square
miles)

12.50
11.64
5.97

6.61

6.82

1.02

3.46

0.48
2.35

1.11

0.28

0.17
0.12

0.15

0.12

0.04




County

Barber
Barton
Clark

Area of Uncertainty for Each County Arising From
Four Methods of
Determining the 25-Inch
Precipitation Contour

UK Area
(square
miles)

36.38
28.47
12.64

Comanche 44.30

Edwards

Ellis

Jewell

Kiowa
Osborne

Pawnee

Pratt

Rush
Russell

Smith

Stafford

Total
aA dash
method

— a

—

—

15.15
43.87

—

18.84

0.04
27.17

27.59

34.88

289.3

D1 Area
(square
miles)

24.29
96.84
11.49

52.94

—

6.14

6.65

14.32
99.82

—

11.30

8.51
22.71

68.47

70.90

494.4
indicates that no contours appeared
specified.


D5 Area
(square
miles)

28.05
56.16
12.97

51.43

—

9.27

23.05

17.81
168.2

0.11

11.56

31.32
28.30

67.38

96.49

602.1
D10
Area
(square
miles)

28.31
59.56
13.02

52.23

9.44

8.90

24.26

31.12
172.6

0.30

11.43

29.51
28.00

66.34

96.05

631.1
in that county for the


 Average Area per Polygon =
 to compare the relative size
(Total Area) / N. This a useful measure
of each polygon in each classification.
the UK method were selected as the best available
representation of normal precipitation across Kansas
(see Figure 9). The figure displays the undipped contour
lines generated from the data. This is done to point out
the importance of "edge effect."  Note the  incoherent
behavior of the contour lines at their termini.  If a smaller
window of data points were used, interpolation problems
would  have lain across the region of interest.  When
present, these features require more handling and time
for analysis. They often introduce additional error and
uncertainty.

The  policy implications of this example are demon-
strated in Figure 10. Here, the map shows the combina-
tion of the "official" KDHE map and the data  interpreted
in this study. The pattern of noncorrespondence is note-
worthy. The lightest areas are regions that currently
experience higher annual precipitation than forecast by
the 1960 normals (from the KDHE map). Black areas
are expected to have lower precipitation under current
climatic conditions.  Therefore, large  areas  of Kansas
that should be under regulation according to the MSWLF
regulations are not.

In  summary, the figures and tables clearly show  that
locational  uncertainty of data measured as points  is
propagated into contour lines. The nature and magni-
tude of that uncertainty varies with location and method
of interpolation and shows no regular (predictable) pat-
tern. Perhaps most importantly, uncertainty that appears
small at one scale can be relatively more significant at
another. In addition, seemingly small geographic feature
and uncertainty can be an important factor in decision-
making.

Discussion and Conclusions

CIS is an established and accepted technology, espe-
cially  in  applications  related  to natural  resource and
environmental management.  Despite the widespread
proliferation of CIS into these areas, the  available data
are not always appropriate for the  intended application.
Furthermore,  adequate documentation  is not always
available to determine whether the data are adequate
for a given use. The development of metadata standards
will play  an important role in  addressing this problem.
Errors and  uncertainty will always be present in CIS
data.  Recognizing their presence, incorporating  them
into the analysis, and representing them in CIS products
will remain a constant challenge.

This study demonstrates the influence that various
sources of CIS uncertainty can leverage on the results
of an analysis. The example of the 25-inch precipitation
                                                   11

-------
          Yield of Greater Than
          500 Gallons per Minute

         * Precipitation Contours in Inches Per Year
Yield of 100 to 500 Gallons
of Water per Minute
Yield of Less Than 100
Gallons of Water per Minute
                   100 Miles
Figure 8.  The map that the KDHE selected as the definitive source for the location of the 25-inch precipitation contour (18).
Figure 9.  Map of the precipitation contours resulting from applying the UK method with linear drift to the 1990 normals (17). Here,
          the contour interval is 5 inches. Note the incoherent behavior of the contours around the margins of the map. This is
          referred to as "edge effect."
                                                             12

-------

D
Higher Precipitation in
1990 Data
Lower Precipitation in
1990 Data
Figure 10. This map represents the union (overlay) of the information in Figures 8 and 9. The lighter regions represent areas
exhibiting higher precipitation in the 1990 normals (1961 to 1990) than was apparent in the 1960 normals (1931 to 1960).
The darker areas show the opposite relationship.
line in Kansas is a clear example of how the use of
inappropriate data can have far-reaching effects on pol-
icy and management. The regulatory agency, KDHE,
chose the wrong map upon which to base its regulatory
authority. As a result, numerous potential sites for small
municipal solid waste landfills will be considered that are
in violation of the letter and intent of the law.

Ultimately, the responsibility for proper use of CIS tech-
nology lies in the hands of practitioners. Technical staff
performing CIS analysis must be knowledgeable about
sources of error and uncertainty and ensure that users
of their work are aware of their influence on CIS output.

The problems demonstrated in the Kansas example
could have been avoided simply by investigating the
appropriateness of the data. Instead, a convenient
source was chosen without seeking any other sources
of "better" information. Indeed, familiarity with the nature
of the data (30-year normals) should have led the policy
analyst to select the most current data and not data that
are 30 years out of date! An understanding that contour
lines represent a generalization of the point precipitation
measurements should also have led to the conclusion
that locations near the boundary line ought to be moni-
tored for compliance. Both the temporal and spatial
characteristics of climate can change, as exemplified by
the difference in the 1960 and 1990 normals. The "Dust
Bowl" periods of the 1930s and 1950s significantly
influenced the 1960 normals (33). As a result, they are
not appropriate for this application.

Although a powerful tool, CIS does not hold all the
answers. The technical community and policy-makers
must work together to ensure its proper use. In reality,
no 25-inch precipitation line floats over Kansas. It is
merely the interpretation of scientists and policy-makers
who select its location. The only way to arrive at a
reasonable answer is to gather the best available infor-
mation and allow all parties to scrutinize it. CIS can be
a wonderful tool to do this.

Acknowledgments

The author wishes to express his gratitude to the Kan-
sas Geological Survey for its computing support before,
during, and after his move to Louisiana. Also, the author
wishes to thank Mr. M. Schouest, who made the move
both possible and tolerable. Mr. C. Johnson of Johnson
Controls in Baton Rouge, Louisiana, produced the ex-
cellent presentation graphics. Finally, this work could not
have been performed without the Internet.

References

1. Mitchell, J.E. 1993. The influence of (x,y) uncertainty on predic-
tion error and contour lines from a three-dimensional surface.
Water Resources Bull. 29(5):863-870.
13

-------
 2.  Northeast Waste Management Officials Association. 1992. Cata-
    logue  and directory of  New  England states and  regional  CIS
    organizations and  activities, and an assessment of their future
    CIS needs. EPA/G-D001456/90-2.

 3.  Campbell, W.G., and D.C. Mortenson. 1989. Ensuring the quality
    of geographic information system data: A practical application of
    quality control. EPA/600/J-89/306.

 4.  Chrisman,  N.R. 1991. The  error component in spatial data. In:
    Maguire, Goodchild, and Rhine, eds. Principles of geographic
    information systems, Vol. 1. pp. 164-174.

 5.  Isaaks, E.H., and R.M. Srivastava. 1989. An introduction to applied
    geostatistics. New  York, NY: Oxford University Press.

 6.  National Center for Geographic Information  and  Analysis. 1989.
    The research plan of the National Center for Geographic Infor-
    mation and Analysis. Int. J. Geo. Info. Sys. 3(2):117-136.

 7.  Vitek,  J.D., S.J. Walsh, and  M.S.  Gregory.  1984.  Accuracy in
    geographic information systems: An assessment of inherent and
    operational errors.  Proceedings of the PECORA IX  Symposium.
    pp. 296-302.

 8.  Monmonier, M.S. 1977.  Maps, distortion, and meaning. Associa-
    tion of American Geographers Resource Paper Number 75-4.

 9.  Goodchild, M.F., and W Min-Hua. 1988. Modeling  error in  ras-
    ter-based spatial data.  Proceedings  of the  Third  International
    Symposium on Spatial Data Handling, Sydney, Australia (August
    17-19). pp. 97-106.

10.  Walsh, S.J. 1989.  Inherent and operational error within a CIS:
    Evaluation from a  user perspective. In: Goodchild, M.F., ed. Ac-
    curacy of spatial databases, p. 290.

11.  Burrough, PA. 1986.  Principles of geographic information  sys-
    tems for land resources assessment. Oxford, England: Clarendon
    Press.

12.  Marble, D.F.,  and  D.J.  Peuquet. 1983. Geographic information
    systems and remote sensing:  Manual  of remote sensing, 2nd ed.
    Falls Church, VA: American Society of Photogrammetry and Re-
    mote Sensing, pp.  923-958.

13.  Mead, D.A. 1982. Assessing data quality in geographic informa-
    tion systems. In: Johannsen, C.J., and J.L. Sanders, eds. Remote
    sensing for resource management. Ankeny, OH: Soil Conserva-
    tion Society of America, pp. 51-59.

14.  Heuvelink, G.B.M., and PA.  Burrough. 1989.  Propagation of
    errors in  spatial  modeling with GIS.  Int.  J. Geo. Info.  Sys.
    3(4):303-322.

15.  Dunn, R., A.R. Harrison, and J.C. White. 1990. Positional accu-
    racy and measurement error in digital databases of land use: An
    empirical study. Int. J. Geo. Info. Sys. 4(4):385-398.

16.  Walsh, S.J.,  D.R.  Lightfoot, and D.R. Butler. 1987. Recognition
    and assessment of error in geographic information systems. Pho-
    togrammetric Eng.  and Remote Sensing 53(10):1,423-1,430.
17. Owenby, J.R., and D.S. Ezell. 1992. Climatography of the United
    States: Monthly station normals of temperature, precipitation, and
    heating and cooling; degree days  1961-1990, Kansas.  No. 81.
    U.S. Department of Commerce, National Oceanic and Atmos-
    pheric Administration, National Climatic Data Center, Asheville,
    NC (January).

18. Bayne, C.K. 1975. General availability of ground water and nor-
    mal annual precipitation in Kansas  (1:500,000 map).  Kansas
    Geological Survey Map Series M4-A.

19. SAS Institute, Inc. SAS/GRAPH Software: Reference, Version  6,
    Vol. 1, 1sted. Gary, NC: Statistical Analysis System Institute, Inc.
    1990.

20. SAS Institute, Inc. SAS/GRAPH Software: Reference, Version  6,
    Vol. 2, 1st ed. Gary, NC: Statistical Analysis System Institute, Inc.
    1990.
21. Olea,  R.A.  1992. Understanding  allays  intimidation.  Geobyte
    7(5):12-17.

22. Bastin, G., B. Lorent, C.  Duque, and M. Gevers.  1984.  Optimal
    estimation of the average areal rainfall and optimal selection  of
    rain gauge locations. Water Resour. Res.  20(4):463-470.
23. ESRI. 1991. Cell-based modeling with GRID. Redlands, CA: En-
    vironmental Systems Research  Institute, Inc.
24. ESRI. 1991. Surface  modeling  with TIN,  surface analysis, and
    display. Redlands, CA: Environmental Systems Research Insti-
    tute, Inc.
25. Davis, J.C. 1982. Statistics and data analysis in geology, 2nd ed.
    New York, NY: John Wiley and Sons.
26. Mitchell, J.E. 1991. A regionalization approach based on empiri-
    cal Bayes techniques with applications to hydrologic forecasting,
    Ph.D. dissertation. Duke  University, Durham, NC.
27. Mitchell, J.E., H. Sun, and S. Chakrabarti. 1992.  Predicting the
    form of a surface from random, nonexperimental samples using
    neural networks. Proceedings of the First  Midwest Electro-Tech-
    nology Conference, Iowa State University, Ames, IA (April 10-11).

28. Congalton, R.G., and  D.M.  Schallert.  1992. Exploring the effects
    of vector to  raster and  raster to  vector  conversion. EPA/600/
    R-92/166.
29. Mitchell, J.E., and H.  Sun.  1992. Generalization in higher order
    neural networks when predicting polynomial surfaces, implica-
    tions for the free-form surface problem. In: Intelligent engineering
    systems through neural  networks, Vol. 2.  Proceedings of the
    Artificial Neural  Networks in  Engineering (ANNIE '92)  Confer-
    ence, St. Louis,  MO (November  15-18).
30. Bates, J.M., and C.W.J. Granger. 1969. The combination of fore-
    casts. Operational Res. Quarterly 20:451-468.
31. Granger, C.J., and R. Ramanathan. 1984. Improved  methods  of
    combining forecasts. J. Forecasting 3:197-204.
32. Carr, D.B., A.R. Olsen, and D.  White. 1994.  Hexagon mosaic
    maps for display of univariate and bivariate geographical data.
    EPA/68-C8/0006.
33. Rule,  G.K. 1960. Dust bowl.  In: The world book encyclopedia,
    Vol. 4. pp. 315-316.
                                                              14

-------
     Data Quality Issues Affecting GIS Use for Environmental Problem-Solving
                                           Carol B. Griffin
                            Henry's Fork Foundation, Island Park, Idaho
"Abandon hope, all ye who enter here." Dante's quote
might well be the advice that experienced geographic
information system (GIS) users give to nonusers about
to confront data quality issues  associated  with GIS
use. Indeed, after reading this paper, some  decision-
makers might abandon attempts  to use a GIS because
of the error associated with it. Others may want to spend
an inordinate amount of time and money trying to elimi-
nate all error associated with GIS use. Neither option is
prudent.

Data quality is important because it affects how reliable
CIS-generated information is in  the  decision-making
process. Too often, the availability of inexpensive digital
data overshadows data quality concerns;  people fre-
quently use digital data because they are available, not
because they have the necessary accuracy.

A GIS can help decision-makers use spatial informa-
tion more fully than manual methods allow, but some-
times data quality issues cause concern about using
CIS-generated  outputs.  Making  environmental  deci-
sions without adequate consideration  to data quality
may lead to an erroneous decision, erode public confi-
dence, or cause an agency to incur liability. This paper
attempts to encourage  decision-makers  to become
more aware of data quality issues, including the sources
and magnitude of error.

GIS  error research  has necessarily progressed in a
linear fashion, beginning with  identifying and classifying
sources of error. This paper discusses both inherent
(source) error and the error that GIS operations  intro-
duce  (operational error)  during data  input, storage,
analysis/manipulation,  and output (1).  Strategies  for
coping with error and research into error reduction tech-
niques have only recently received attention. Unfortu-
nately, the answers to error management  questions
such as, "How will the error affect decision-making?" are
not clear.  The  end of this paper covers several  error
management suggestions and anticipated software im-
provements designed to reduce errors,  however.
Data Quality Concepts and Their
Importance

Data quality is a major issue for CIS-generated maps,
much more so than it is for paper maps. In part, this is
because a GIS can perform operations on spatial data
that would be nearly impossible without a GIS because
of scale, complexity, and generalization issues (2). Car-
tographers adjust for these problems when they manu-
ally manipulate and instantly combine  paper maps by
adhering  to  long-standing cartographic principles, but
GIS personnel may not be fully trained in these princi-
ples. A GIS enables an analyst, whether trained in car-
tographic principles or not,  to combine or manipulate
data in appropriate  or in inappropriate, illogical,  and
erroneous ways. Lack of training coupled with the speed
of spatial data manipulation can have serious conse-
quences for an agency whose personnel produce and
use CIS-generated maps.

Limited scientific understanding, limited ability to meas-
ure data,  sampling error, inherent variability, and inade-
quacy of mathematical representations all contribute to
uncertainties associated with spatial data. Uncertainty
about spatial data consists of two parts: ignorance and
variability. Ignorance means that variables have a "true"
value, but it is unknown to us, whereas variability means
one value cannot represent the variables.

Data quality defies a simple definition. For this paper,
data quality can roughly mean  how "good" the data are
for a given purpose. People usually think of data quality
in terms of error, but the term is broader and encom-
passes the six components outlined  in the next section.
Error can mean the difference between the observed
values and the "true" value. The "true" value of a variable
is usually unknown and unknowable, but for this paper's
purposes, "true" could  be the known value or the value
one would obtain from field measurements (the discus-
sion of data collection tries to dispel the notion that there
is one "true" value for many variables, such as soil type
in a given area orwatertemperature in a lake). Imperfect
                                                  15

-------
equipment or  observers  and  environmental  effects
cause spatial error. According to Thapa and Bossier (3),
errors fall into three categories:
• Gross errors and blunders (people or equipment).
• Systematic errors (which introduce bias).
• Random errors (due to imperfect instruments and
  observers).
In addition, another view divides spatial error into two
different components: accuracy and precision. Accuracy
means  how close a  value is to the "true"  value or  a
known standard (absence of bias). Precision can have
two definitions:  it can be a measure of dispersion  (stand-
ard deviation) of observations about a mean, or it can
refer to the number of decimal digits used to represent
a value (4). In  the first definition of precision, a meas-
urement of 6 feet  plus or minus 1 foot is more precise
than one of 6 feet plus or minus 3 feet. In the second
definition, a value of 6.1794 feet is more precise than
one of 6.1 feet. Figure 1 provides a graphic explanation
of the difference between error, accuracy, and precision.
                      Error
                  Accuracy
                                 Precision
  True Value
4  1    Mean  2  3
     Observations
Figure 1.  Relationship between error, accuracy, and precision.

Data are not accurate or inaccurate. Instead, data accu-
racy exists on  a continuum, ranging from low to high
accuracy. Although  people strive for accurate (error-
free) data, obtaining  100-percent  accurate data  is im-
practical. The list below provides some of the reasons
why total accuracy is not obtainable  (5):

• Objects to  be measured are often vaguely defined.

• Some phenomena are variable  in  nature.

• Classification schemes are imprecise.

• Measurements are inherently imprecise.

• Gross errors of a nonstatistical nature can occur during
  measurement.
• Attributes encoded on an ordinal  scale  (high, me-
  dium, low) are approximate.

• Data represent a past state of reality.

Users of geographic data should strive for data that are
only as accurate as they need. A variety of factors, of
course, can determine need:

• Intended use of the data

• Budget constraints

• Time constraints

• Data storage considerations

• Potential liability

The main  barrier to highly accurate data is lack of funds.
Male (6) suggests that  rather than abandoning a CIS
project because funds are not sufficient to achieve the
desired accuracy,  an agency should collect data at the
desired accuracy  from  smaller  areas, such as  areas
being developed or redeveloped. Overtime, data collec-
tion at the desired accuracy can expand to include areas
that lacked data due to budgetary constraints. Smith and
Honeycutt (7) outline the use of a value of information
approach  in  determining the need for more  data (or
more accurate data) based  on the expected costs and
benefits associated with data collection.  If the  benefits
of increased data accuracy are  greater than  the ex-
pected costs, additional funds should be  allocated to
obtain more accurate data.

The intended use of data affects the type of data, as well
as the data quality needed. Beard (8) divides CIS appli-
cations into six types (see Table 1). The specific type of
data quality one needs (e.g., positional accuracy, attrib-
ute accuracy) also varies with the intended application.
Analysts with inventory applications such as agricultural
production are less  concerned about positional  accu-
racy than with an accurate  assessment of anticipated
crop yields (attribute  accuracy).  Decision-makers must

Table 1.  Types of GIS Applications (8)

Application                  Example

Siting        Finding optimal location (fire station, waste site)
Logistic       Movement or distribution through space
            (emergency response, military movement)
Routing       Optimal movement through a known network (mail,
            school bus)
Navigation    Way finding; may or may not involve a known
            network (ground, sea, air)
Inventory     Count and location of objects for a given time
            (census, tax rolls)
Monitoring/    Examining processes over space and time
Analysis      (ecological, zoological, geological,  epidemiological
            studies)
                                                    16

-------
decide which data quality component is the most impor-
tant for their use because optimizing all six components
can be very expensive  (9). An obvious conflict arises
when local and state governments must meet multiple
application needs simultaneously and thus feel forced to
try to optimize several data quality components.

The nature of the decision may also help decision-makers
determine the data quality they need. Beard (8) lists sev-
eral of these factors (see Table 2). A political, high-risk
decision requires higher quality data than a nonpolitical,
low-risk decision because more public attention focuses
on the former decision.

Table 2.  Factors That May Affect the Data Quality Needed for
        Decision-Making (8)
Lower Data Quality
Possibly Needed
Higher Data Quality
Possibly Needed
Routine

Nonpolitical

Minimal risk

Noncontroversial

Indefinite

Local implication
Nonroutine

Political

High risk

Controversial

Immediate

Global implication
Components of Data Quality

The  National Committee for Digital Cartographic Data
Standards (9) identifies six components of digital carto-
graphic data quality. This section discusses each  of
these components:

• Lineage

• Positional accuracy

• Attribute  accuracy

• Logical consistency

• Completeness

• Temporal accuracy

Most components of data quality apply to both source
and operational error.

Lineage

Because uses and users of data change, those at the
national level have noted a recent push to include docu-
mentation when disseminating spatial data. Data  line-
age, also known as metadata or a data dictionary, is data
about data. Metadata consists of information about the
source data such as:

• Date of collection

• Short definition

• Data type, field length, and format
• Control points used

• Collection method, field notes, and maps

• Data processing steps

• Assessment of the reliability of source data

• Data quality reports

Access to this information can help CIS personnel de-
termine if the data are appropriate for their use, thereby
minimizing risks associated with using the wrong  data
or using  data  inappropriately. According to Chrisman
(10), the only ethical and probably best legal strategy for
those who produce spatial data is to reveal more infor-
mation about the data  (metadata)  so that users can
make informed decisions. Eagan and Ventura's article
(11) contains a sample of a generic environmental  data
lineage  report.  The  U.S. Environmental  Protection
Agency's  (EPA's) new locational data  policy requires
contractors to estimate data accuracy and provide infor-
mation about the lineage of the data (12).

Positional A ecu racy

Anyone who has used a  map has probably come across
features that are not located where  the map says  they
should be located and has experienced low positional
accuracy. (Undoubtedly, they have  also detected  fea-
tures that were not on the map, but that is a different
issue.) Positional accuracy,  frequently referred to as
horizontal error, is how close  a location on a map  is to
its "true" ground position. Features may be located inac-
curately on maps for many reasons, including (13):

• Poor field work.

• Distortion of the  original paper  map (temperature,
  humidity).

• Poor conversion from raster to  vector or vector to
  raster data.

• Data layers are collected at different times.

• Natural variability in data (tides, vegetation, soil).

• Human-induced  changes   (altering  reservoir water
  levels).

• Movement of features (due to scale of the map and
  printing constraints) so they can be easily discerned
  by the  map reader.

• Combining maps with different scales.

• Combining maps with different projection and coordi-
  nate systems.

• Different national horizontal datum  in source materials.

• Different minimum mapping units.

Positional accuracy has  two components: bias and pre-
cision. Bias reflects the  average positional error of the
                                                   17

-------
sample points and indicates a systematic discrepancy
(e.g., all locations are 7 feet east of where they should
be). Estimating precision entails calculating the stand-
ard deviation of the dispersion of the positional errors.
Usually, root mean square error (RMSE) is reported as
the measure of positional accuracy, but it does not dis-
tinguish bias from precision (14). RMSE  is  frequently
monitored during digitizing to minimize the introduction
of additional positional error into the CIS.

To determine positional accuracy, one must compare the
location of spatial data with an independent source of
higher accuracy. Federal agencies that collect data and
produce maps adhere to National  Map Accuracy Stand-
ards (NMAS)  for positional accuracy. Maps such  as
United States Geological Survey (USGS) topographic
maps that conform to NMAS carry an explicit statement
on them. Other groups also  have developed  standards
for large-scale mapping (15).

NMAS for positional accuracy require that not more than
10 percent of well-defined points can be in error by more
than one-thirtieth  of an inch for maps at a scale of
1:20,000 or larger. For smaller scale maps,  not more
than 10 percent of well-defined points can be in error by
more than one-fiftieth of an inch (16). Thus, less than 10
percent  of  the  well-defined locations on   a  USGS
1:24,000 map can stand more than  40 feet  from  their
"true" location; the other 90  percent of the well-defined
points must stand less than 40 feet from their "true"
location. Table 3 shows the  acceptable positional accu-
racy for commonly used maps. Note that as scale de-
creases from 1:1,200 to 1:100,000, positional accuracy
decreases.

Several  important issues relate to NMAS. First, not all
maps adhere to NMAS, which means their positional
accuracy may be lowerthan  NMAS or may be unknown.
Second, NMAS do not indicate the location of points in
error. Third, 10 percent of the well-defined points can
have a positional error greater than the standards allow,
but neither the location nor the  magnitude of these
errors are known. Fourth, NMAS apply to well-defined
points; therefore,  areas that are  not well defined  may

Table 3.  NMAS Horizontal (Positional) Accuracy
Scale
1:1,200
1 :2,400
1 :4,800
1:12,000
1 :24,000
1:63,360
1:100,000
1 Inch = x Feet
100
200
400
1,000
2,000
5,280
8,333
Horizontal
Accuracy +/- Feet
3.33
6.67
13.33
33.33
40.00
105.60
166.67
have even lower positional accuracy. The implication of
these errors in location is that users should use caution
in  making decisions that require high positional accu-
racy. Positional accuracy issues are particularly trouble-
some for CIS operations on small-scale maps or when
combining large-scale maps (1:1,200) with small-scale
maps (1:100,000).
Recently, global  positioning systems (GPS), which the
U.S. military developed, have  helped  to obtain more
accurate feature locations. GPS is not without error,
however. The  list below  notes  some  of the possible
sources of error associated with GPS use, some of
which can be controlled while others cannot (17):
• Errors in orbital information.
• Errors in the satellite clocks.
• Errors in the receiver clocks.
• Ionospheric  or tropospheric refraction.
• Deliberate degrading of the satellite signal.
• Obstructions that block the signal.
• Reflection of the GPS signal off buildings, water, or
  metal.
• Human error.
The importance of positional accuracy depends on the
intended use  of the  data.  In an urban area, a posi-
tional error of 1  foot  on a  tax map may be unaccept-
able because  1 foot may be worth millions of dollars.
In  a  rural  area, however, tax  boundaries mapped
within 10 feet of their surveyed location may be accu-
rate enough (6). Somers  (18)  reports that  positional
accuracy of 10 to 20 feet may be sufficient for envi-
ronmental  analysis.  She says  the cost of increasing
accuracy to 5 feet could increase the cost of data  col-
lection by a factor of  10. The decision-maker must de-
termine the needed positional accuracy.

A ttribute A ccuracy
Attribute accuracy refers to how well the description of
a characteristic of spatial data matches what actually
exists on the ground. For some spatial data, the location
does not change overtime, but the value of the attribute
does  (e.g.,  the  location of a  census  tract does  not
change,  but the population  within a  census tract
changes). Attribute accuracy is  reported differently for
continuous data (i.e., elevation, which has  an infinite
number of values) or discrete data (i.e., gender, which
has a finite number of values).
NMAS exist for elevation  contour lines on topographic
maps.  NMAS for vertical accuracy state that not more
than 10 percent of the points tested shall be  in error by
more than one-half of the contour interval (16). A well-
defined point on a USGS topographic map with a 10-foot
                                                  18

-------
contour interval could vary by 10 feet because the actual
elevation could be 5 feet higher or lower than the map
indicates. The implications of these errors are similar to
the ones for positional accuracy.  In addition, errors in
elevation are important because small changes in ele-
vation may significantly affect some CIS analysis opera-
tions  such  as the  determination  of  aspect,  slope,
viewshed, and watershed boundaries.

NMAS do not exist for discrete variables such as land
use derived from satellite imagery. Instead, a classifica-
tion matrix reports attribute accuracy. Field checking or
checking a portion of the classified image against a map
of higher accuracy determines  the accuracy of the land
use classification. The result of the comparison is a table
from which to calculate overall, producer's, and user's
accuracy. Table 4 is an example of a classification accu-
racy matrix.

Table 4.  Example of a Classification Accuracy Matrix (19)

            Reference Data ("Ground Truth")
                   Number of Cells
Classified Data
(Satellite Image)
Number of Cells
Forest
Water
Urban
Total
Forest
28
1
1
30
Water
14
15
1
30
Urban
15
5
20
40
Total
57
21
22
100
         Overall Accuracy (sum of the main diagonal)

                      63
                     100
                         = 63%
Producer's Accuracy
(column total)
OQ
Forest = ^ = 93%
oU
User's Accuracy
(row total)
28
Forest = - =
49%
Water   =   ^   =   50%
          oU

Urban   =   ^   =   50%
Water   =   ^

          20
Urban   =   —
71%

91%
Overall accuracy is the percentage of correctly classified
cells calculated as the sum of the main diagonal (19).
Producer's accuracy is the total number of correct pixels
in a category divided by the total number of pixels of that
category as derived from the  reference data (column
total). It corresponds to how well the person classifying
the image (the "producer") can  correctly classify or map
an area on the earth. In this example, the producer most
accurately classified forested land (93 percent).  User's
accuracy describes the probability that a sample from
the classified area actually represents that category on
the ground. The  map  "user" is concerned  about  the
map's reliability. In this example, the most accurately
classified land use from the user's perspective is urban
(91 percent).

The significance of overall, producer's, and user's accu-
racy depends on  the intended use of the data. As an
example, Chrisman (20)  says that the error in distin-
guishing wetland from pasture may not matter to some-
one estimating open space, but the difference is critical
if the person is estimating the amount of wildlife habitat
available. Story and Congalton (19) provide an example
of how  to  interpret a classification matrix. A  forester
looks at the classification matrix and sees that forest
classification is 93 percent accurate (producer's ac-
curacy); therefore, the analyst did not identify only 7
percent of the forest on the ground. Once the  forester
field checks the supposed forested area, she finds that
only 49  percent (28 cells) of the sites mapped as forest
are actually forest; the rest are water (14 cells) or urban
(15 cells) areas.

A report of overall, producer's, and user's  accuracy can
help decision-makers determine the appropriateness of
the classified image for their use by identifying potential
errors in classification. This can help  direct field work,
which can improve the classification  of the image and
perhaps subsequent images. Because CIS analysis fre-
quently  uses land use, decision-makers need to know
that significant variability  can result when several  ana-
lysts classify the same image. Bell and Pucherelli (21)
found that consistency in classification can improve by
having one person classify the entire image. McGwire (22)
even found  significant differences between analysts in
unsupervised classification of Landsat imagery. Com-
puters  primarily  perform  unsupervised  classification,
which implies that different analysts would classify the
same image in the same way.

Logical Consistency

Logical  consistency focuses on flaws in the logical  rela-
tionships among data elements. For example, a vector
CIS should  label  all polygons with only one label per
polygon, and  all  polygons should  be closed.  Logical
inconsistency can also occur by collecting  data layers at
different times or from different scale maps with different
positional accuracies. For example, the edge of a  lake
on  the  hydrology data layer should  coincide with the
edge of land in the land use data layer. If data on the
lake were collected  during a wet year  rather than a dry
year, the lake's volume would be higher than  normal,
affecting its location on the map. If land use data for the
same area were collected during a dry year, the bound-
ary of the lake on  the two  layers would not be the same.

Logical  inconsistencies usually do not appear until the
two maps are overlaid and the boundaries do not coin-
cide (see Figure 2). The user must determine the  "cor-
rect" location of the feature that appears misaligned on
one or more data layers. The inconsistency between the
                                                   19

-------
    Land Use
                     Hydrology
Overlay
Figure 2.  Logical inconsistency in lake and forest location.

location of the two layers resolves through a process
called  conflation. All  maps  are  adjusted  so that  the
feature on each data layer lines up with the same feature
on the base map.

Completeness

Completeness focuses on the adequacy of data collec-
tion procedures. Robinson and Frank (5) discuss two
kinds of uncertainty associated with collecting spatial
data that can lead to error. One type of uncertainty is the
inability to measure or predict an inherently exact char-
acteristic or event with certainty. Examples of this  are
blunders in data collection or measurement error, nei-
ther of which can be accurately predicted. The other kind
of uncertainty is associated with concepts that are inher-
ently  ambiguous. Crisp data  sets,  such as  property
boundaries, have little ambiguity; the only issue related
to error  is the positional  accuracy in  measuring  the
boundary. Because  land use data are not crisp data
sets, the challenge is to accurately represent an inher-
ently inexact concept.

Although we know spatial data are variable, our classi-
fication  systems generally ignore the second type of
uncertainty. Analysts map data as though all variables
had  exact boundaries and  all polygons consisted of
homogeneous data. Burrough (4) reports that spatial
variation of natural phenomena is "not just a local  noise
function or inaccuracy that can be removed by collecting
more data or by increasing  the precision of measure-
ment, but is often a fundamental aspect of nature that
occurs at all scales. . .  ."

Mapping spatial  data is a function of how humans  ag-
gregate and disaggregate data either in space, catego-
ries, quantities,  or time;  spatial  data seldom exist in
nature the way maps depict them (23). Data and rela-
tionships between data are  sensitive to the scale and
the zoning system in which  the data are reported (24,
25). The modifiable  area unit problem occurs because
an analyst can recombine a given set of units  or zones
into the same total number of units producing very dif-
ferent results (see Figure 3).

The scale problem occurs because an analyst can com-
bine a set of small units into  a smaller number of larger
units, which can change the  inferences that can be
               Figure 3.  Modifiable area unit. (Number of units is constant;
                       location of units changes.)

               made from the data.  In Figure 4, the area containing the
               highest values changes from the southwest corner in the
               first picture to the northern half in the second picture.
               For example, water quality data are  scale-dependent
               because they vary based on the size and location of the
               collection area (e.g., adjacent to a  point source dis-
               charge, a stream segment, the entire river, or the lake
               the river discharges  into).

               Kennedy (25) reports on a similar problem known as the
               small number problem. This problem occurs when cal-
               culations use a percentage, ratio, or rate for  a geo-
               graphic  area for which  the  population  of interest
               (denominator) is sparse or the numerator is a rare event
               (1 case of cancer  per 1 million people). CIS-generated
               maps may highlight a statistically insignificant change in
               rare events. Small, random fluctuations in the numerator
               may cause large fluctuations in the  resulting percent-
               age, ratio, or rate.  If policy-makers  use these  maps,
               priorities for public health policy may change because
               of the erroneous belief that an area is experiencing more
               unwanted rare events.

               Data can be collected using a tag- or count-based sys-
               tem, which affects their usefulness. The tag approach
               categorizes items  based  on the dominant or average
               attribute and  is  ideal for planners who want only one
               value for each area. For example, each polygon in a
               county soil survey is tagged with one soil type. Accord-
               ing to soil taxonomy rules, however, only about 35 per-
               cent of a delimited area on a soil survey must match its
               classification, and  up to 10 percent may be a radically
               different soil (26).  Although the  text  in the soil  survey
               sets limits on data accuracy by listing  major impurities
               found with each soil type, the CIS seldom carries that
               information because analysts only digitize soil bounda-
               ries and  label data  with  the dominant attribute. This
               Figure 4.  Scale problem (number of units changes).
                                                   20

-------
leads to the depiction of apparently homogeneous soil
units although the text specifies that the  data are not
homogeneous (27).

Some soil or land cover phenomena, even though pre-
sent in small quantities and thus not mapped, may have
great  significance for hydrologic models, which makes
the tag approach to data collection troublesome. Data
collected using the count system allow the analyst to
tabulate the frequency of occurrence or areal extent of
a particular phenomenon. Environmental modelers pre-
fer count data but are usually forced to use tag data,
which can  introduce  error into their models (26). The
new digital soils databases, STATSGO and SSURGO,
are collected and depicted using a count format, which
will help experienced analysts use the data more fully.
Figure 5 shows the difference between tag and count
methods of data collection.

Data are seldom complete because analysts use clas-
sification rules to indicate how homogeneous  an area
must  be before  it is classified  a particular way (e.g.,
more than 50 percent, more than 75 percent).  Another
decision an analyst must make is where to draw the
boundary between two different areas; it is  seldom clear
where a forest leaves off and a rural development be-
gins. Analysts must also decide how or if to show inclu-
sions  (e.g., a forested area in  the middle of agricultural
land uses).

Temporal A ecu racy

Collecting  data at different times introduces error be-
cause the variable may have changed since data collec-
tion. The  effect  of time,  reported as  the date of the
source material, depends on  the intended use of the
data.  Some natural resource  data have daily, weekly,
seasonal,  or annual cycles that are important to con-
sider.  For example, obtaining  land use data from re-
motely sensed imagery in November for North Dakota
produces a very different land use map than data ana-
lysts obtain during the July growing season.

In addition, demographic and land  use  information
changes quickly in a  rapidly urbanizing area. Data  col-
lected at several times can  produce logical  inconsis-
tency between data layers, forcing the analyst to adjust
the location of features to coincide with the base map.
Another problem with collecting data at different times
             Soil A
   Soil B
Soil B 55%'
Soil C 45%
Soil A 30%
Soil B 25%
Soil C 20%
Soil D 25%
                                     30%
          Tag                       Count

Figure 5.  Tag and count methods of data collection.
is that data may be collected using different standards,
which may not be apparent to the user (4).

Source Errors in a GIS

Source (or inherent)  error derives from errors  in data
collection. The amount of error present in collected data
is a function of the assumptions, methods, and proce-
dures used to create the source map (28).  Primary data
refers to  data  collected  from  field sampling or remote
sensing. Causes of the errors associated with this data
are (3, 4, 8, 14,29):

• Environmental conditions (e.g., temperature, humidity).

• Sampling system (e.g., incomplete or  biased data
  collection).

• Time constraints.

• Map projection.

• Map construction techniques.

• Map design  specifications.

• Symbolization of data.

• Natural variability.

• Imprecision due to vagueness (e.g., classifying a forest).

• Measurement error from unreliable,  inaccurate, or
  biased observers.

• Measurement error from unreliable,  inaccurate, or
  biased equipment.

• Lab errors  (e.g., reproducibility between lab proce-
  dures and between labs).

The process of converting primary data to secondary
data (usually a map) introduces additional error. Many
of the data layers that a GIS analyst acquires are sec-
ondary data. Some of the errors associated with map-
making are (3):

• Error in plotting control points.

• Compilation  error.

• Error introduced in drawing.

• Error due to map generalization.

• Error in map reproduction.

• Error in color registration.

• Deformation of the material (temperature, humidity).

• Error introduced due to using  a uniform scale.

• Uncertainty  in the definition of a feature (boundary
  between two land uses).

• Error due to feature exaggeration.

• Error in digitization or scanning.
                                                  21

-------
Converting paper maps to digital data for entry into a
CIS (tertiary data) introduces still more error (the errors
generated from converting  paper maps into a digital
format are discussed in the section on input error), in
part because the  purpose for which the data was col-
lected differs from the intended use of the data.

Many types of error are associated with data collection:

• Data for the entire area may be incomplete.

• Data may be collected  and mapped  at inappropriate
  scales.

• Data may  not be relevant for the intended application.

• Data may  not be accessible because  use is restricted.

• Resolution of the data may not be sufficient.

• Density of observations may not be  sufficient.

The following discussion explains these types of errors.

Data for the Entire Area May Be Incomplete

An incomplete data record may be due to mechanical
problems that interrupt recording devices, cloud cover
or other types of  interference, or financial constraints.
Possible solutions to this problem include collecting ad-
ditional data for the incomplete area, using  information
from a similar area, generalizing  existing  large-scale
maps to match  the less detailed data needed, or  con-
verting existing small-scale maps to large-scale maps to
obtain data  at the desired scale. Collecting additional
data may not be a feasible solution because of time or
money constraints. Extrapolating data from the surro-
gate area to the desired  area can cause problems be-
cause  the  areas  are  not  identical   and  the  scale,
accuracy, or resolution  of the surrogate area data
may be inappropriate for the intended use. The sec-
tion on analysis/manipulation of data within a  CIS
covers the  effect of generalization  on data quality
as well as the effect of converting small-scale maps
to large-scale maps.

Data May Be Collected and Mapped at a Scale
That Is Inappropriate for the Application

A variety  of guidelines suggest the appropriate  map
scale to use  for various applications (see Table 5). Also,

Table 5. Relationship Between Map Scale and Map Use (6)
      Map Scale               Map Use
      1:600 or larger

      1:720 to 1:1,200

      1:2,400 to 1:4,800

      1:6,000 and smaller
Engineering design

Engineering planning

General planning

Regional planning
some maps and digital databases suggest the type of
application  for which  they  are  appropriate (e.g.,  the
STATSGO digital soil database is suitable for state and
regional planning, whereas SSURGO is suitable for lo-
cal level planning). Tosta (30) cites an example of com-
bining wetland data with parcel boundaries to determine
ownership of the land  containing a wetland. If wetland
mapping was done to plus or minus 100 feet positional
accuracy and parcels are 40 feet wide, then the scale of
the  wetland map is inappropriate for determining  if a
wetland is located on a specific parcel.

Identifying the optimal scale of  the necessary data is
crucial because at some point, the cost of collection and
storage exceeds  the  benefits of  increasing  the map
scale.  Lewis  Carroll (1893) summed  up the quest for
data mapped at an ever larger scale and the problems
associated with large-scale maps:

    "What do you consider the largest map that would
    be really useful?"

    "About six inches to the mile."

    "Only six inches!" exclaimed Mein Herr. "We very
    soon got to six yards to the mile. Then we tried a
    hundred  yards to the mile.  And then came  the
    grandest idea of all! We actually made a map of the
    country, on the scale of a mile to the mile!"

    "Have you used it much?" I enquired.

    "It  has never been  spread out, yet," said Mein Herr.
    "The farmers objected: they said it would cover the
    whole country, and shut out the sunlight! So now we
    use the country itself,  as its own map, and I assure
    you it does nearly as well."

Data Collected May Not Be Relevant for the
Intended Application

Frequently,  using surrogate data is quicker or cheaper
than collecting  needed data (e.g.,  Landsat imagery
rather than  data field collection used to determine land
use) (4). The accuracy and classification scheme used
in collecting the data depends on the intended use of
the  data, which may  not coincide with the  analyst's
purpose. For instance, soil maps were developed to aid
farmers in determining what crops they should plant and
for estimating crop yield. Soil maps, however, see wide
use for very different  purposes (e.g., hydrologic and
other environmental models). In addition, STORETdata,
collected at points, are typically extrapolated to repre-
sent water quality in an entire stream stretch.

Data May Not Be Accessible Because
Use Is Restricted

An example of restricted data is Census data on individ-
ual households. An agency may not want to release data
that reveal the location of endangered species. Another
                                                  22

-------
example is that people may not even want the informa-
tion mapped. For example, some cavers do not want to
reveal the location of caves to the U.S. Forest Service,
which is charged  under the  federal Cave Resources
Protection Act with protecting caves, because they think
the best way to protect  the caves  is to not map them
(31). The National Park Service is putting the location of
petroglyphs in the Petroglyph National Monument into a
CIS. Making their location known to the public, however,
is  troublesome because this may,  in fact, encourage
their vandalism (32). Other problems in obtaining data
include  difficulty in acquisition even if access is  not
restricted, expensive collection or  input,  or unsuitable
format (4, 14).


Resolution of the A vailable Data
May Not Be Sufficient

Spatial resolution is the  minimum distance needed  be-
tween two objects for the equipment to record the  ob-
jects as two entities; that is, resolution is the smallest
unit a map represents. To obtain an approximation of a
map's resolution, divide the denominator of the map
scale by 2,000 to get resolution in meters; for instance,
a 1:24,000-scale map has a resolution of approximately
12 meters (33).

Resolution relates to accuracy  in  that different map
scales conform to different accuracy standards. Two air
photos shot from the same camera at the same distance
above the ground have the same scale. If one photo has
finer grain film,  however, smaller details are evident on
it, and this photo produces a map with higher resolution
(34). According to Csillag (33), analysts cannot simulta-
neously optimize attribute accuracy and spatial resolu-
tion. As spatial resolution increases, attribute complexity
increases (35). Also, the finer the spatial resolution, the
greater  the probability that random error significantly
affects a data value.

Resolution of the data is not necessarily  the same as
the size of a raster cell  in a database. Statistical sam-
pling theory suggests using a  raster cell size that is half
the length (one-fourth of the area) of the smallest feature
an analyst wishes to record. Raster data have a fixed
spatial resolution that depends on the  size of the  cell
employed, but a CIS analyst can divide  or aggregate
cells to  achieve a different cell size.  Frequently, an
analyst transforms data collected at one level of resolu-
tion to a higher level  of resolution than existed in  the
original  source  material. According to Everett  and Si-
monett (23), "Geographic analysis,  however, can be no
better than that of the smallest  bit of data which  the
system is capable  of detecting." Vector data are limited
by the resolution of input/output devices,  limits  on data
storage,  and the accuracy of the digitized location for
individual points (36). The spatial  resolution of the data-
base and the  processes that operate on it should be
reduced to a level consistent with the data's accuracy.

The spatial resolution needed depends on the intended
use of the data, cost, and data storage considerations.
As resolution increases, so does the cost of collection
and storage.  Resolution  sufficient to detect an object
means that an analyst can reveal the presence of some-
thing. Identification, the ability to identify the object or
feature,  requires three times the  spatial resolution of
detection. Analysis, a  finer level  of identification, re-
quires 10 to 100 times the resolution that identification
needs (23). Increasing resolution increases the  amount
of data for storage, with storage requirements increas-
ing  by the square of the resolution of  the data. For
example, if the resolution of the data needs to  change
from 10-meter to 1-meter pixels, file size increases by 102
or 100 times (14).

Density of Observations May Be Insufficient

The density of observations serves as a general indica-
tor of data reliability (4). Users need to know if sampling
was done at the optimum density to resolve the  pattern.
Burrough determined that boulder clay in The  Nether-
lands could be resolved by sampling at 20-meter inter-
vals or less, whereas coversand showed little variation
in sampling from 20- to 200-meter intervals.

Some strategies for reducing data collection errors are to:

• Adhere to professional standards

• Allocate enough time and money

• Use a  rigorous sampling design

• Standardize data collection procedures

• Document data collection procedures

• Calibrate data collection instruments

• Use more accurate  instruments

• Perform blunder checks to detect gross errors

Documenting data collection procedures and distribut-
ing them along with data allows potential users to deter-
mine if the data are suitable for their purposes. By not
documenting  procedures, errors in the source material
are essentially "lost" by inputting the data to a CIS, and
the errors become largely undetectable in subsequent
CIS procedures. The result is that agencies that make
decisions based on the CIS-generated map assume the
source data are accurate, only to discover later  that the
map contains substantial errors in part due to errors in
the source material.

Operational Errors in a GIS

Data input, storage, analysis/manipulation,  and output
can introduce operational errors.  Digital maps, unlike
                                                   23

-------
paper maps,  can accumulate  new  operational errors
through CIS operations (8). Even if the input data were
totally error-free, which the last section demonstrated is
not the case, CIS operations can produce positional and
attribute errors. The CIS operation itself determines to
a large extent the types of errors that result.

Input Errors

The  process of inputting  spatial and attribute data can
introduce error. The  major sources  of input error are
manual entry  of attribute  features and scanning or dig-
itizing spatial  features. Manual entry errors include in-
complete entry of attribute  data, entering the wrong
attribute data, or entering the right attribute data at the
wrong location. Digitizing errors originate from equip-
ment, personnel, or the source  material (see Table 6).

Digitizing errors, such as under- and overshoot of lines
and  polygons that are not closed, can  introduce  error
(see  Figure 6). CIS software  can "snap" lines together
that  really do  not connect. Depending on  the tolerance

Table 6.  Types of Digitizing  Errors (4, 14, 37)

Personnel Errors
   Changes in the origin
   Incorrect registration of the map on the digitizing table
   Creation of over- and undershoots
   Creation of polygons that are not  closed
   Incomplete spatial data when data are not entered
   Duplication of spatial data when lines are digitized twice
   Line-following error (inability to trace map lines perfectly with the
   cursor)
   Line-sampling error (selection of points used to represent the map)
   Physiological error (involuntary muscle spasms)
Equipment Errors
   Digitizing table (center  has higher  positional accuracy than the
   edges)
   Resolution of the digitizer
   Differential accuracy depending on cursor orientation
Errors in Source Material
   Distortion because source maps have not been scale-corrected
   Distortion due to changes in temperature and humidity
   Necessity of digitizing sharp boundary lines when they are gradual
   transitions
   Width of map  boundaries (0.4 mm) digitized with a 0.02-mm accu-
   racy digitizer
      Undershoot       Overshoot         Polygon Not Closed

Figure 6.  Common digitizing errors.
selected, this can result in the movement of both lines,
which can decrease the accuracy of the resultant map.

Despite the long  list of personnel errors associated with
digitizing, a good operator probably contributes the least
error in the  entire digitizing process  (38). Giovachino
discusses methods that can help determine equipment
accuracy, including checking the  repeatability, stability,
and effect of cursor rotation.  Digitizing accuracy var-
ies based on the width, complexity, and density of the
feature being digitized but typically varies from 0.01
to 0.003 (3).

One problem with digitized data is that the data  can
imply a false sense of  precision.  Boundaries on paper
maps are frequently 0.4 mm wide but are digitized with
0.02-mm accuracy. The result is that the lines are stored
with 0.02-mm accuracy, implying a level of precision that
far exceeds the original data.

Minimizing digitizing errors is  important  because  the
errors can affect subsequent CIS analysis.  Campbell
and Mortenson (39)  provide a list of  procedures they
used to  reduce  errors  associated  with digitizing and
labeling:

• Use log  sheets to  ensure consistency and  account-
  ability, and to provide documentation.

• Check for completeness in  digitizing  all  lines and
  polygons.

• Check for complete and  accurate polygon labeling.

• Set an acceptable RMSE term for digitizing  (usually
  0.003).

• Always  overshoot  rather than  undershoot  when
  digitizing.

• Overlay  a plot of the digitized  data with the  source
  map to check lines and polygons. If light passes be-
  tween the digitized  line  segment and  source map,
  redigitize it.

• Check digitized work immediately  to provide feed-
  back to the digitizer operator and to  help identify and
  correct systematic errors.

• Limit digitizing  to less than 4 hours  a day.

• Involve people in doing CIS-related jobs other than
  digitizing to decrease turnover and increase the level
  of experience.

Storage Errors

Data storage in a CIS  usually involves two main types
of errors.  First,  many  CIS systems  have  insufficient
numerical precision,  which can introduce error due to
rounding. Integers are  stored as 16 or 32 bits, which
have four significant figures. Real numbers are stored
as floating point numbers either in single precision (32 bit,
                                                      24

-------
7 significant figures) or double precision (64 bit, 15 or 16
significant figures).  If the data in  a CIS  range from
fractions of a meter to full  UTM coordinates, typical
32-bit CIS systems cannot store all the numbers. Using
double precision (64 bits) reduces this problem but in-
creases storage requirements.

Second,  CIS  processing and storage  usually ignore
significant digits (data precision). As a result, the preci-
sion of CIS processing frequently exceeds the accuracy
of the data (40). When a CIS converts a temperature
recorded and entered as 70 degrees Fahrenheit (near-
est degree) to centigrade, the CIS stores the tempera-
ture as 21.111 degrees rather than 21 degrees, which
the significant figures in the original temperature meas-
urement would dictate.  Using the accuracy of the data,
not the precision  of floating  point  arithmetic, partially
resolves  this but requires the user to make a special
effort  because  the  CIS does not  automatically  track
significant figures.

Analysis/Manipulation Errors

CIS analysis/manipulation functions, designed to trans-
form or combine data sets, also can introduce errors.
These errors  originate from  the measurement scale
used  or during data conversion (vector to raster and
rasterto vector), map overlay, generalization, converting
small-scale to large-scale maps, slope, viewshed, and
other  analysis functions. One of the  biggest problems
associated with CIS use is that data in digital  form are
subject to different  uses than data  in paper form be-
cause the user has access to multiple data layers.

Measurement Scale

Four measurement scales can depict spatial data: nomi-
nal, ordinal, interval, or ratio scales. A name or letter
describes nominal data (e.g.,  land use type, hydrologic
soil group C). Performing mathematical operations such
as addition and subtraction on nominal data is meaning-
less. Ordinal or ranked data have an order to them such
as low, medium, and high. Interval data have  a known
distance between the intervals such as 0, 1 to 5, 6 to 9,
more  than 9. Ratio  data  are similar to interval data
except ratio data have a meaningful zero (e.g., tempera-
ture on the Kelvin scale).

Often  during  CIS  operations, analysts convert interval
or ratio data  into nominal data (e.g., low slope is 0 to
3 percent, medium slope is 4 to 10 percent), resulting in
a loss of information. Analysts should preserve the origi-
nal slope values in the CIS in case the user later wants
to modify the  classification  scheme.  Robinson  and
Frank (5) describe the tradeoff between information con-
tent and the meaning that can be derived from it, which
partly helps explain why interval data are frequently
converted to nominal data. The authors identify a con-
tinuum progressing from nominal data at one end that is
highly subjective, has low information content, and high
meaning (low slope means something to the average
user) to ratio data that has low subjectivity, high infor-
mation content, and low meaning (a slope of 7 percent
may not mean much to the average user).

Data Conversion

Errors can occur in converting a vector map to a raster
map or a raster map to a vector  map.  For instance,
remotely sensed data are collected using a raster-based
system. Using a vector CIS, however, requires conver-
sion from raster to  vector data. The size of the error
depends on the conversion algorithm, complexity of fea-
tures, and grid cell size and orientation (13).

A line on a vector map converted to  a raster map has
lower accuracy in  the raster representation because
vector data structures store data  more accurately than
raster ones. When polygons in a vector CIS are con-
verted  to a  raster  CIS, the coding rule usually used
assigns the value that covers the largest area within the
cell of a categorical map to the entire cell (see Figure 7).
For example, when placing a grid over a vector map with
an urban land polygon adjacent to an agricultural poly-
gon, the  cell placement can include part of both poly-
gons. If the  resultant cell comprises  51 percent urban
and 49 percent agricultural land, the cell is assigned 100
percent urban.  Converting  a  numerical map between
raster and vector systems requires spatial interpolation
procedures.  CIS software packages use different inter-
polation methods that can  produce a different  output
even when using the same input data.
            Vector              Raster

Figure 7.  Polygon conversion from vector to raster data.

Map  Overlay

Map overlay, used extensively in planning and natural
resource management, is the combining of two or more
data layers to create new information. In a vector CIS,
slivers or spurious polygons can result from overlaying
two data layers to produce a  new map (slivers cannot
be formed in a raster-based CIS). When combining the
data layers, lines do not coincide, resulting in the crea-
tion of a new polygon or sliver that did not exist on either
layer (see Figure 8). Unfortunately, as accuracy in digit-
izing  increases,  so does the number of slivers (41).
Positional error in the boundaries can occur because of
                                                   25

-------
o

o

Sliver
Figure 8.  Sliver example.

mistakes in measuring or converting the data to digital
form, incremental expansion or recession of a real world
boundary over time, or the fact that certain boundaries
are  difficult  to determine and  thus are generalized
differently (42).

The  number of map layers, accuracy of each map layer,
and the coincidence of errors at the same position from
several map layers all  determine the accuracy of the
map overlay procedures (43). Using probability theory,
Newcomer and Szajgin determined that the highest ac-
curacy to expect from  a map overlay  is equal to the
accuracy of the least accurate map layer. The  lowest
accuracy in  map overlay occurs when errors in each
map occur at unique  points.

In the quest for more accurate  results,  CIS modelers
have increased the  complexity  of  their models and
therefore have increased  the number of data  layers
needed.  Guptill (44) states,  "Conventional  wisdom
would say that as you add more data to the solution of
a problem, the likelihood of getting an accurate solution
increases. However,  if each additional data layer de-
grades the quality of the combined data set,  and hence
the accuracy of the solution, then additional data sets
may be counterproductive."

Generalization

Monmonier  (45) provides  an extensive discussion of
geometric and content generalization procedures used
in map-making. Table 7 lists common types of generali-
zation. Generalizing data on a map  helps to focus the
user's attention on  one or two types  of information and
to filter out irrelevant details. Generalizing is performed
by reducing the scale of the data; a 1:24,000-scale map
can be generalized to a 1:100,000-scale map so that all
data layers  have the same scale. With generalizing,
areas on a large-scale map become point  or line
features  on  a small-scale map (35). Obtaining some
measurements from small-scale  maps, however,  re-
quires caution.  For  example, a map may depict a
40-foot wide road as  a single line one-fiftieth of an inch
wide. On a  1:100,000  map, one-fiftieth  of an inch
translates into a 160-foot wide  road—four times the
actual width  of the  road.

Several studies have pointed to  errors that can result
from generalization.  Wehde  (46) compared soil maps
generated from 0.017-acre grid cells and 11 progres-
sively increasing grid cell sizes. He  found that as grid
Table 7.  Generalization Operations (45)

               Geometric Generalization

Generalizing a Line
   Simplification
   Displacement
   Smoothing
   Enhancement
   Selection

Generalizing a Point
   Selection
   Displacement
   Graphic association
   Abbreviation
   Aggregation
   Area conversion

Generalizing an Area
   Selection
   Simplification
   Displacement
   Smoothing
   Enhancement
   Aggregation
   Dissolution
   Segmentation
   Point conversion
   Line conversion
                Content Generalization

Selection

Classification
cell size increased, map accuracy decreased. More re-
cently, Stoms (47) found that generalizing a habitat map
from  1 to 25, 100, and 500  hectares decreased the
number of habitat types and the number of species
predicted.
Transforming Small-Scale Maps to
Large-Scale Maps

Converting small-scale maps (1:250,000) to large-scale
maps (1:24,000) is advisable only if the analyst fully
appreciates the effect  of this procedure on map quality.
Data mapped at a small scale are subject to different
accuracy standards than data mapped at a large scale.
Connin  (48)  reports,  "Problems with accuracy arise
when positions are reported to decimal parts of a foot or
meter, but the method of data capture may cause the
positional error to be  as much as hundreds  of feet or
meters."  Yet when converting the data from small- to
large-scale, the data appear to have the accuracy of the
large-scale map. Theoretically, data should not be trans-
formed and used at a scale larger than the scale of the
document from which the data are derived (3).
                                                   26

-------
Slope and Viewshed

CIS software packages use  a variety of algorithms to
calculate slope and viewsheds and  can produce very
different results. Algorithms are an unambiguous set of
rules or a finite sequence of operations used to carry out
a procedure. Smith,  Prisley, and  Weih (49) used six
different CIS algorithms to determine slope on  5,905
acres of land in order to calculate the amount of land
deemed unsuitable for timber harvesting. They  found
that unsuitable land  varied from 175 to 1,735 acres,
indicating that different algorithms produce very different
results. Felleman and Griffin  (50) found  that CIS pack-
ages with different algorithms generate  alternate view-
sheds (the area that can be seen from a point).

Output Errors

A variety of errors are associated with data output:

• Output devices create error.

• Paper shrinks and  swells.

• Line implies  certainty that may not  exist because
  boundaries are gradual.

• A cell or polygon implies homogeneity.

• Scale can be modified to imply higher accuracy than
  exists in the source data.

• Precision can be modified to  imply higher precision
  than exists in the source data.

• Depiction  of symbols and colors may  not  follow
  convention.

An important problem associated  with  CIS-generated
maps is that users make informal assessments  about
data quality, partially based on how they perceive the
quality of the  output. A hand-drawn map connotes a
lower level of accuracy than a five-color, CIS-produced
map complete with scale  and  agency logo. Another
problem with output is that distinguishing highly  accu-
rate data from  less accurate data is impossible on a
CIS-generated map. Users want the  output from  a CIS
to  look like  maps they usually  see,  perpetuating the
notion that lines mark exact  boundaries and that poly-
gons or cells are homogeneous. Maps that federal map-
ping  agencies  produce frequently follow NMAS,  but
CIS-generated maps seldom adhere to  published map
accuracy standards. An agency  could require that CIS
map products meet NMAS, which  would establish and
maintain data standards from data  collection to output.

A pen stroke of one-fiftieth of an inch on an output device
translates to an error of 40  feet on the ground for a
1:24,000-scale map (6). Small changes  in paper maps
due to changes in temperature and humidity can  repre-
sent several feet on  the ground. As previously noted,
analysts can modify the scale of CIS maps to whatever
they desire. The basic rule of informational integrity is
that  the  implied precision of data output should not
exceed the precision (spatial, temporal, or mathemati-
cal) of the least precise input variable (26).

CIS-generated maps probably do not differ significantly
from paper maps in their implication that lines and  poly-
gons on the map represent certainty and homogeneity.
CIS-generated maps, however, may not depict standard
symbols, sizes, shapes, colors, and orientation. For ex-
ample, paper geological maps use dashed lines to show
inferred, rather than actual, field collected data, but geo-
logical maps in a CIS may not follow the same conven-
tion (27). Cartographers conventionally use blue lines to
indicate water, but a CIS map-maker can show water as
red rather than blue.

Even more troublesome  are the  color schemes that
some analysts use in depicting model output. Analysts
often give little thought to assigning the colors to model
results depicted as ordinal rankings. For example, areas
of high erosion might be blue, medium erosion might be
red,  and  low erosion might be green. This selection of
colors ignores the intuitive meaning that people assign
to colors. It has been suggested that the color ordering
used in stop lights might provide a better option. In that
case, areas of high erosion would be red, medium ero-
sion would be yellow, and low erosion would be green.


Error Reduction Techniques

Although CIS users and researchers develop error re-
duction strategies, ultimately users must rely on CIS
software  developers to  implement new error reduction
techniques in CIS packages. Error reduction techniques
range from simple software  warnings to  prohibiting a
user from performing selected CIS  procedures. Dutton
(51)  predicts  that future CIS programs  will automate
data manipulation (i.e.,  size, format,  and placement
of feature labels  on  maps)  in  keeping with  standard
cartographic principles.  Dutton (51) and Beard (8) also
predict that future CIS packages will enforce metadata-
based constraints such as operations that are illegal or
illogical (e.g., determining the average value of nominal
data such as  land use), or are inadvisable (e.g., over-
laying maps with widely different scales).

Another change Dutton anticipates is that software ven-
dors will  include information  in manuals that explains
how executing a specific command may affect the data-
base. Graphic techniques to  depict error are being de-
veloped for nonexpert users  while experts tend to use
spatial statistics. Felleman (52) and Berry (53) present
an interesting graphic portrayal of an error map that may
indicate the future of error maps. Additional  research
must determine what effect errors will have on decision-
making.
                                                  27

-------
Error Management

Ultimately, the decision-maker must determine what to
do with the information in this paper. A decision-maker
has a variety of possible courses of action, ranging from
prudent steps that attempt to minimize  error and  the
effect it has on decisions, to other less useful options.
Possible actions are to:
• Abandon use of a CIS.
• Ignore the error associated with CIS use.
• Attempt to collect "error-free" data.
• Determine  if the data are accurate enough for  the
  intended purpose.
• Develop and use data quality procedures.
• Obtain and  use an  error report with CIS-generated
  output.
• Ask that CIS-generated maps show potential errors.
• Continually educate users about the appropriate use
  of spatial data.
First, the decision-maker could abandon any attempt to
use a CIS because of the errors associated with its use.
At times, this may be the appropriate strategy, but this
approach ignores the potential benefits associated with
CIS  use.
Second, the decision-maker could ignore the error associ-
ated with CIS use and continue to  use  the CIS for deci-
sion-making. This type of "head in the sand" approach is
not advisable because of the potential liability associated
with  making decisions  based on inaccurate data.
Third, the decision-maker could engage in an expensive
and time-consuming effort to collect highly  accurate er-
ror in hopes that error becomes a nonissue. Depending
on the intended  use of the data, the cost  of collecting
more accurate data may exceed the benefit.
Fourth, the decision-maker could  assess  whether  the
information available  is accurate enough for the  in-
tended purpose. If data quality is too low, the decision-
maker may opt to collect new data  at the desired quality.
If collecting additional data is not possible, the decision-
maker can explore what types of decisions  are possible
given the attainable data quality.  For  instance, Hunter
and Goodchild (54) found that the  data they were  using
were suitable  only for initial screening rather than  for
regulatory and land-purchasing decisions.
Fifth, procedures to ensure high quality  data could be
developed and used in the data collection, input, and
manipulation stages of building a CIS database.
Sixth, the decision-maker could require a quantitative or
at least a qualitative report on the  sources, magnitude,
and  effects of errors.  The absence of an  error report
does not mean the map is error-free (36). Dutton (51)
predicts that in the near future users of geographic data
will demand error reports,  confidence limits, and sensi-
tivity analyses with CIS-generated output.

Seventh, the decision-maker could ask for CIS-generated
maps that adequately portray the error in the final map. For
example, areas where the uncertainty is high could appear
in red on maps. Another option is to place a buffer around
lines to indicate the relative  positional accuracy of a line or
to show transition zones. Finally, an analyst can present
the output in ways other than a  dichotomous yes or no;
instead, the analyst may use yes, maybe, or no depictions
or even more gradations.

Finally,  Beard (8)  introduced  the concept of  directing
efforts  toward educating  users about use error.  She
defines use error as the misinterpretation of  maps or
misapplication of maps to  tasks for which they are not
appropriate. "We can't assume that CIS  will  automat-
ically  be less susceptible to  misuse than traditional
maps,  and it may, in fact, exacerbate the problem by
expanding  access to  mapped information." Beard ar-
gues that money directed to reducing source and opera-
tional error, while important, may not matter if use error
is large.

Conclusions

CIS is  a powerful tool for analyzing spatial data. Every-
one who uses CIS-generated output, however, must be
aware  of  source errors and  operational errors  intro-
duced during data input, storage, analysis/manipulation,
and output. Increased awareness  of the sources and
magnitude of error can help decision-makers determine
if data are appropriate for their use. Decision-makers
cannot leave data quality concerns to CIS analysts be-
cause  efforts to improve  data quality are not without
cost, and the decision-makers typically control funding.

Decision-makers must not  get caught up in the glamour
of the  spatial analyses and  outputs that a  CIS can
produce. These attributes  may lead decision-makers to
ignore  issues associated with uncertainty, error, accu-
racy, and precision. Inexpensive digital data can make
analysts and decision-makers ignore data quality. If sub-
sequent management decisions are made based on
poor quality data, the resultant  decisions  may turn out
wrong. This would give decision-makers a jaded view of
the usefulness of CIS. An adequate  understanding of
data quality issues can  help decision-makers ask the
right questions of analysts and avoid  making decisions
that are inappropriate given the  data quality.

References

 1. Walsh, S.J., D.R. Lightfoot,  and  D.R. Butler. 1987. Recognition
   and  assessment of error in geographic information systems. Pho-
   togrammetric Eng. and Remote Sensing 53(10):1,423-1,430.
                                                   28

-------
 2. Openshaw, S. 1989. Learning to live with errors in spatial data-
    bases.  In: Goodchild, M., and S. Gopal,  eds. The accuracy of
    spatial databases. Bristol, PA: Taylor and  Francis, pp. 263-276.

 3. Thapa, K., and J. Bossier. 1992. Accuracy of spatial data used
    in geographic information systems. Photogrammetric Eng.  and
    Remote Sensing  58(6):835-841.

 4. Burrough, PA. 1986. Principles of geographical information sys-
    tems for land resources assessment. New York, NY: Oxford Uni-
    versity Press.

 5. Robinson, V.B., and A.U. Frank. 1985. About different  kinds of
    uncertainty in collections of spatial data. In: Proceedings of Digital
    Representations of Spatial Knowledge, Washington,  DC, Auto-
    Carto 7. Falls Church, VA: ASPRS and ACSM. pp. 440-449.

 6. Nale, O.K. 1992. Do traditional map standards conflict with a  CIS
    landbase? CIS World 5(7):50-53.

 7. Smith, G.R., and  D.M. Honeycutt. 1987. Geographic data uncer-
    tainty, decision-making, and the value of information. In: Proceed-
    ings of CIS'87, San Francisco, CA (October 26-30). Falls Church,
    VA: ASPRS and ACSM. pp. 300-312.

 8. Beard,  K. 1989.  Dimensions of use  and value of geographic
    information. In: Onsrud, H., H.W  Calkins, and N.J. Obermeyer,
    eds. In: Proceedings of use and value of geographic information:
    Initiative four specialist meeting, Tenants Harbor, ME  (August).
    Vol. 89-7. Santa Barbara, CA: NCGIA.

 9. National  Committee for  Digital  Cartographic Data Standards.
    1988. The proposed standards for digital cartographic data. Amer.
    Cartographer 15(1):11-140.

10. Chrisman, N.  1994. A vision of digital libraries for geographic
    information, or how I stopped trying to find the on-ramps to the
    information superhighway. Geo. Info. Sys. 4(4):21-24.

11. Eagan, P.O., and S.J. Ventura. 1993.  Enhancing value of envi-
    ronmental data: Data lineage reporting. J. Environ. Eng. 119(1):5-
    16.

12. U.S.  EPA. 1990. Information resources management policy man-
    ual. Washington,  DC.

13. Lunetta, R.S., R.C. Congalton, L.K. Fenstermaker, J.R. Jenson,
    K.C. McGwire, and L.R. Tinney. 1991. Remote sensing and geo-
    graphic information system data integration:  Error sources  and
    research  issues.  Photogrammetric Eng.  and  Remote  Sensing
    57(6):677-688.

14. Aronoff, S. 1989.  Geographic information systems: A manage-
    ment perspective. Ottawa, Ontario: WDL Publications.

15. ASPRS. 1990. ASPRS  accuracy standards for large-scale maps.
    Photogrammetric Eng. and Remote Sensing 56(7):1,068-1,070.

16. Thompson, M.M.  1987. Maps for America: Cartographic products
    of the U.S. Geological  Survey and others, 3rd ed. Washington,
    DC: U.S. Geological Survey.

17. Ferguson, J. 1993. What is it we  obtain from GPS? Professional
    Surveyor 13(4):3-4.

18. Somers, R. 1994. Is a  CIS department the answer?  Geo. Info.
    Sys.  4(6):29-32.

19. Story, M.,  and R.G.  Congalton. 1986.  Accuracy  assessment: A
    user's perspective. Photogrammetric Eng. and Remote  Sensing
    52(3):397-399.

20. Chrisman, N.R. 1987. The accuracy of map overlays: A reassess-
    ment. Landscape and Urban Planning  14:427-439.
21. Bell, A.D., and M.J. Pucherelli. 1992. Audubon  Lake island ero-
    sion study using aerial photography and geographic information
    systems.  Report No. R-92-12. Denver, CO:  U.S. Department of
    the Interior, Bureau of Reclamation.

22. McGwire, K.C. 1992. Analyst variability in labeling of unsuper-
    vised classifications. Photogrammetric Eng. and  Remote Sensing
    58(12):1,673-1,677.

23. Everett, J.,  and D.S. Simonett. 1976. Principles, concepts, and
    philosophical problems in remote sensing. In: Lintz, J., and D.S.
    Simonett, eds. Remote sensing of the environment. Reading, MA:
    Addison-Wesley.

24. Fortheringham, A.S. 1989. Scale-independent spatial analysis. In:
    Goodchild, M., and S. Gopal, eds. The accuracy of spatial data-
    bases. Bristol, PA: Taylor and Francis, pp. 221-228.

25. Kennedy,  S. 1989. The small number problem and the accuracy
    of spatial  databases. In: Goodchild, M.,  and S.  Gopal, eds. The
    accuracy  of spatial databases. Bristol, PA: Taylor and Francis, pp.
    187-196.

26. Brown, D.A., and PJ. Gersmehl. 1987. Maintaining relational ac-
    curacy of  geocoded data in environmental modeling. In: Proceed-
    ings of CIS '87, San Francisco, CA (October 26-30). Falls Church,
    VA: ASPRS and ACSM. pp. 266-275.

27. Fisher, PF.  1987. The nature of soil data in GIS—error or uncer-
    tainty. In:  Proceedings of the International Geographic Informa-
    tion Systems (IGIS) Symposium, Arlington, VA (November 15-18).
    Vol.  Ill, pp. 307-318. Arlington, VA:  Association of American Ge-
    ographers.

28. Walsh,  S.J. 1989. User considerations in  landscape charac-
    terization. In: Goodchild, M., and S. Gopal, eds. The accuracy of
    spatial databases. Bristol, PA: Taylor and Francis, pp. 35-43.

29. Stoms, D. 1987. Reasoning with uncertainty in intelligent  geo-
    graphic information systems. In: Proceedings of GIS '87, San
    Francisco, CA (October 26-30). Falls Church, VA: ASPRS and
    ACSM. pp.  693-700.

30. Tosta. 1993. The data wars:  Part II. Geo. Info. Sys. 3(2):22-26.

31. Anonymous. 1994. Between a rock and a hard place. Backpacker
    22(8):16-18.

32. Fletcher, M., and D. Sanchez. 1994. Etched in stone: Recovering
    Native American rock art. GPS World 5(10):20-29.

33. Csillag, F. 1991. Resolution revisited. In: 1991 ACSM-ASPRS
    Annual Convention, Baltimore, MD (March 25-28). Auto-Carto 10,
    Volume 6. Bethesda, MD: ACSM and ASPRS. pp. 15-28.

34. Star, J.L.,  and  J.E. Estes. 1990. Geographic information systems:
    An introduction. Englewood Cliffs, NJ: Prentice  Hall.

35. Goodchild, M.  1993. The state of GIS for environmental problem-
    solving. In: Goodchild, M.R.,  B.O. Parks, and L.T Steyaert, eds.
    Environmental modeling with GIS. New York, NY: Oxford Univer-
    sity Press, pp. 8-15.

36. Veregin, H. 1987. Error modeling in geographic information sys-
    tems: A review. Unpublished manuscript.

37. Keefer, B.J., J.L. Smith, and T.G. Gregoire.  1991. Modeling and
    evaluating the effects of stream mode digitizing errors on map
    variables. Photogrammetric Eng. and Remote Sensing 57(7):957-
    963.

38. Giovachino, D. 1993. How to determine  the accuracy of your
    graphic digitizer. Geo. Info. Sys. 3(3):50-53.
                                                             29

-------
39. Campbell, W.G., and D.C. Mortenson. 1989. Ensuring the quality
    of geographic information system data: A practical application of
    quality  control.  Photogrammetric  Eng.  and  Remote  Sensing
40. Goodchild, M., and S.  Gopal. 1989. Preface. In: Goodchild, M.,
    and S. Gopal, eds. The  accuracy of spatial databases. Bristol,
    PA: Taylor and Francis.

41 . Goodchild, M. 1 993. Data models and data quality: Problems and
    prospects. In: Goodchild, M.F., B.O. Parks, and L.T Steyaert, eds.
    Environmental modeling with GIS. New York, NY: Oxford Univer-
    sity Press, pp. 94-103.

42. Lester, M., and N. Chrisman. 1991.  Not all slivers are skinny: A
    comparison of two methods for detecting positional error in cate-
    gorical maps. In: Proceedings of GIS/LIS '91, Atlanta, GA (Octo-
    ber 28-November 1). Bethesda, MD: ASPRS. pp. 648-658.

43. Newcomer,  J.A., and J. Szajgin. 1984. Accumulation of thematic
    map errors  in  digital overlay analysis.  Amer. Cartographer
44. Guptill, S.C. 1989. Inclusion of accuracy data in a feature-based,
    object-oriented data model. In: Goodchild, M., and S. Gopal, eds.
    The accuracy of spatial databases. Bristol,  PA:  Taylor and  Fran-
    cis. pp. 91-98.

45. Monmonier, M. 1991. How to  lie with maps. Chicago, IL: Univer-
    sity of Chicago Press.

46. Wehde, M. 1982. Grid cell size in relation to errors in maps and
    inventories produced by  computerized map processing. Photo-
    grammetric Eng. and Remote Sensing 48(8): 1, 289-1 ,298.
47. Stoms, D.M. 1992. Effects of habitat map generalization in bio-
    diversity assessment. Photogrammetric Eng. and Remote Sens-
    ing 58(11):1,587-1,591.

48. Connin, R.  1994. The trouble  with data. Professional Surveyor
    14(1):24-25.

49. Smith,  J.L., S.P. Prisley, and R.C. Weih. 1991. Considering the
    effect of spatial data variability on the outcomes of forest man-
    agement decisions. In: Proceedings of GIS/LIS '91, Atlanta, GA
    (October 28-November 1). Bethesda, MD: ASPRS. pp. 286-292.

50. Felleman, J.,  and C.B.  Griffin. 1990.  The  role of  error  in CIS-
    based viewshed determination: A problem analysis. Report No.
    IEPP-90-2.  Syracuse, NY: Institute of Environmental Policy and
    Planning, State University of New York, College of Environmental
    Science and Forestry.

51. Dutton, G. 1991. Improving spatial analysis in GIS environments.
    In: 1991 ACSM-ASPRS  Annual  Convention,  Baltimore, MD
    (March 25-28). Auto-Carto 10,  Volume 6. Bethesda, MD: ACSM
    and ASPRS. pp. 168-185.

52. Felleman, J. 1992. Letter to the editor. Geo. Info. Sys. 2(8):12-13.

53. Berry, J.K. 1994.  The this, that, there rule.  GIS World 7(7):22.

54. Hunter, G.J., and M.F. Goodchild. 1993. Managing  uncertainty in
    spatial  databases: Putting theory  into practice. In: Proceedings
    of the  annual meeting of the  Urban and  Regional Information
    Systems Association, Atlanta, GA (July 25-29). Washington, DC:
    Urban and Regional Information Systems Association, pp. 1-15.
                                                              30

-------
You Can't Do That With These Data! Or: Uses and Abuses of
Tap Water Monitoring Analyses
Michael R. Schock
Drinking Water Research Division, U.S. Environmental Protection Agency, Cincinnati, Ohio

Jonathan A. Clement
Black & Veatch, Cambridge, Massachusetts
Introduction

Linkage between human health and drinking water qual-
ity has been an area of interest in the United States for
many years. Over the past approximately 10 years,
drinking water monitoring requirements have expanded
rapidly under the Safe Drinking Water Act (SDWA).
Growing public and governmental interest in this envi-
ronmental area makes the aggregation and consolida-
tion of data on the occurrence and distribution of many
organic and inorganic contaminants and background
constituents of drinking water an important process.
These data can then be made available for systemiza-
tion and visualization to regulators, municipalities, water
utilities, public interest groups, health researchers, con-
sulting engineers, and water treatment scientists.

Given a sufficient number of data points and a conven-
ient computerized database/mapping platform, a wide
variety of maps can be generated to use in research and
decision-making processes. The validity of doing so,
however, rests inseparably upon the basis of the sam-
pling plan and protocols, as well as the precision and
accuracy of the analytical methods used forthe constitu-
ents of interest. The well-known problem of matching the
proper scale of the source data to that employed in the
maps for interpretation is a critical problem with drinking
water sampling, where many unappreciated small-scale
variations render many, if not most, attempts to make
generalizations inaccurate or meaningless.

This paper introduces and describes many concepts re-
lated to what generates or controls the concentrations
of metals and other constituents in drinking water, ways
in which the sampling protocol affects apparent levels of
constituents, and the magnitude of temporal and spatial
variability present in both municipal and private water
supplies. Illustrations from water quality studies show in
practical terms how generalizations must be kept to a
minimum and how the data input into a geographic
information system (CIS) for interpretation and evalu-
ation must be carefully analyzed and screened to deter-
mine the appropriateness for various well-intended
purposes. The discussion and examples show how
many apparently significant trends and assessments of
exposures or occurrences turn out to be merely artifacts
of critical (yet subtle) inconsistencies or errors in the
planning and execution of the sample collection proc-
ess, or inconsistencies caused by the fact that regula-
tory (and not research) requirements govern the origin
of the data.

The concepts this paper covers are equally valid in
many other disciplines using or contemplating the use
of CIS for interpretation of all kinds of "field" data.

Why Maps Are Useful for Drinking
Water Studies

Maps and CIS databases could have wide applicability
to drinking water studies. For example, they could pro-
vide the basis for investigating the occurrence of regu-
latory contaminants or related constituents, either to
estimate the costs of compliance with a regulation or to
estimate human health effects. Mapping could be useful
to utilities and consultants investigating process changes
for a utility or determining the effectiveness of some
existing treatment such as corrosion control orchlorina-
tion. Use of CIS could also assist in assessing the
feasibility and impact of system expansion. Another
promising application would be CIS assistance in devel-
oping and implementing wellhead protection plans.
Many other areas of application may be possible now,
or will be discovered in the future, as CIS technology
and regulatory requirements continue to develop.
31

-------
Sampling Protocols for Data Usable in GIS

Several SDWA regulations have resulted or will result in
the collection of geographically diverse drinking water
quality data that may interest mappers. The  Lead  and
Copper Rule, the Surface Water Treatment  Rule, the
proposed Information Collection Rule, and the Disinfec-
tion/Disinfection Byproduct Rule are but four examples.
Many states have their own variations on federal drink-
ing water regulations, so their data  collection require-
ments may differ somewhat. Considerable data may
also be collected for specific research studies of either
academic or purely practical nature.

Chemical Factors in Constituent Behavior

For the purposes of this discussion,  chemical constitu-
ents in drinking water may be classified as being gener-
ally reactive or nonreactive. Reactive constituents may
change concentrations or chemical form for a variety of
reasons, such as:

• A result of interaction with the background composi-
  tion of the drinking water.

• By precipitation or dissolution reactions with pipe ma-
  terial used for the distribution system.

• By  chemical  reactions with disinfectants  added at
  water treatment plants.

• By slow chemical reactions started at water treatment
  plants.

Nonreactive constituents may play an important role by
providing a chemical background that indirectly influ-
ences the speed or extent of other chemical reactions
and transformations. Table 1 gives a summary of many
common constituents of  drinking water  and identifies
whether they function essentially as  reactive  or nonre-
active constituents.

Reactive Constituents

Clearly, chemical  species or compounds  that  can
change in concentration or transform into other species
or compounds during distribution make mapping on very
large scales difficult to justify. Reactive constituents may
also change concentration in the same place overtime,
such as water standing overnight in a home, school, or
building, which  is discussed in a later section.  Some
examples of reactions during water distribution follow:

• During lime softening  processes at  some  central
  water treatment plants,  a  supersaturated state  is
  used for the compound calcium carbonate to remove
  calcium  (and  sometimes magnesium)  ions from the
  water. This condition is sometimes maintained  into
  the distribution system as well to assist in maintaining
  chemical conditions useful for corrosion  control of
  lead and copper. Thus, calcium levels,  pH, and  car-
Table 1.  General Reactivity Trends for Common Drinking
        Water Constituents
Constituent
                 General Reactivity Tendency
PH
Dissolved oxygen
Calcium
Magnesium
Total carbonate
Total alkalinity
Chlorine residual
Temperature
Iron
Copper
Lead
Zinc
Silica
Sulfate
Orthophosphate
Polyphosphate
Total phosphate
Nitrate
Chloride
Fluoride
Trihalomethanes
Haloacetic acids
Highly reactive
Reactive
Nonreactive (reactive when cementitious
pipe linings are present)
Nonreactive
Nonreactive
Reactive, particularly with pH changes
Reactive
Either
Reactive
Reactive
Reactive
Reactive
Nonreactive
Nonreactive
Reactive
Reactive
Reactive
Nonreactive
Nonreactive
Nonreactive
Reactive
Reactive
  bonate concentrations (and consequently, alkalinity)
  drop as water passes away from the plant (1, 2).

• The  metals in pipe  materials, such  as iron, copper,
  zinc  (in galvanized pipe), and lead,  are oxidized by
  oxygen, free chlorine, chloramines, ozone, and other
  disinfectants,  which  renders them into a form  that
  water can transport, unless other chemical conditions
  are such that  a highly insoluble scale deposits on the
  pipe, immobilizing the metal (1,  3).

• Prolonged contact with chlorine disinfectant species
  converts a fraction of natural organic matter present
  in many distributed waters into regulated "disinfection
  byproduct"  compounds, such  as  trihalomethanes,
  chloroform, and haloacetic acids (4, 5).

• Following the addition of chlorine or after increasing
  pH to enable  some  corrosion control for copper and
  lead, iron present in well waters in dissolved  ferrous
  (Fe2+) form oxidizes into Fe3+  form, which is much
  less soluble. Obnoxious "red water"  results, as ferric
  oxyhydroxide  precipitate forms and clouds the water.

• Polyphosphate chemicals added to  "sequester"  iron
  or manganese in well waters break down into simpler
                                                  32

-------
  polyphosphate forms of shorter chain lengths, plus
  orthophosphate.  The orthophosphate frequently be-
  comes present at high enough concentration to aid
  in controlling lead or copper (1, 6-8).

• Water passes  through  newly installed cement mor-
  tar-lined pipes, or aggressive water passes through
  older asbestos-cement pipes. Because of the particu-
  lar chemical nature of the water, calcium carbonate
  and calcium hydroxide in the cement dissolve, raising
  the pH and hardness of the water (1).

• Free chlorine is added to disinfect water and  is such
  a strong oxidant that it is unstable in water at normal
  concentrations. Additionally, it reacts with miles of
  unlined cast iron pipe,  accelerating the decomposi-
  tion of hypochlorous acid or hypochlorite ion to chlo-
  ride. Consequently,  the overall redox potential of the
  water supply and  the  effectiveness of disinfection
  decrease.

• A concentration of 1 milligram per liter (as PO4) phos-
  phoric acid is added to a distributed water at pH 7.5
  to control lead  corrosion. The orthophosphate reacts
  with  exposed iron in the distribution main, however,
  and  the  residual concentration  of orthophosphate
  decreases  throughout  distribution passage  to the
  point where the level is no longer adequate to create
  the lead orthophosphate passivating film needed  (1,
  6, 8, 9).

Unless a constituent is known to be nonreactive, maps
may be falsely generated under the premise that the
concentration of  a constituent is essentially  a constant
over some geographic area. Following  the changes in
concentration or  chemical form of reactive constituents
would also seem to be a useful application of CIS tech-
nology. One major restriction applies to the viability of
that approach, however. Presuming that the analytical
techniques used can adequately quantify the concentra-
tion and concentration changes observed, the scale of
the variability or concentration change relative to the
scale  of the mapping perspective becomes critical to
accurate mapping. A latersection ofthis paper considers
this critical factor in more specific detail.

Nonreactive Constituents

Almost no inorganic constituents in natural or drinking
water are purely chemically inert. Under some condi-
tions, and at some  concentrations, significant reactions
can occur. Some constituents that are actually reactive
may act as if they are nonreactive constituents, how-
ever, because they are present in high enough concen-
trations relative  to the  extent of chemical  reactions
taking place that  no discernible change in their concen-
tration  results. An  obvious  example is the dissolved
inorganic carbonate (DIG = hteCOs* + HCOs- +  COs2-)
concentration (1, 10). Complexation  and formation of
passivating basic carbonate solid films of lead and cop-
per by carbonate  and bicarbonate ion  dominate the
corrosion control chemistry of copper(ll) and lead(ll) (1,
11). The concentration of DIG in water on either a molar
or weight basis, however, is normally a factor of 500 to
10,000 higher than the lead or copper concentrations.
Hence, changes in the DIG content from these reactions
normally are analytically undetectable.

Another example is fluoride ion, which is often used as
a distribution  system water  flow "tracer" because of its
relative inertness. Actually, fluoride ion can form strong
complexes with aluminum left in water following coagu-
lation treatment  with  alum.  The solubility of fluoride-
containing solids  with other major drinking water
components (such  as calcium and sodium) is very high,
however, and fluoride reacts only weakly with metallic
plumbing materials in the distribution system. Therefore,
total fluoride concentrations tend to remain constant.

Relatively accurate maps of the occurrence and distri-
bution of  nonreactive constituents can be  made, but
their usefulness depends on the scale of the mapping
relative to their occurrence  and the particular question
under investigation.  All of  this supports the need to
ensure that the question asked can be answered  cor-
rectly at the map scale.

Scale of Drinking Water  Constituent
Sources

More than 59,000  public water suppliers  exist in the
United  States (12). Of these,  approximately 660 are
considered large water systems, which  serve  over
50,000 in population.  These  municipal systems  use
source water supplies that can be ground-water wells,
"surface" waters (i.e.,  rivers,  reservoirs, lakes),  or a
combination of both. Some water suppliers perform
minimal water treatment of their  own  and purchase
water from another water system or systems to satisfy
their needs.

Surface Water Sources

Many water utilities use a single water treatment plant
to treat surface waters, which  could satisfy the  entire
water demand of the community all year. In many cases,
however,  utilities  combine  several  surface  water
sources and use a different treatment plant to treat each
water source. The  water plants usually discharge  into
the  distribution system at different points,  and system
hydraulics dictate  the  areas of the  system in which
waters mix. This  is  important because the water quality
characteristics,  which  often differ among treatment
plants, influence the corrosivity of the waters to various
plumbing materials in the distribution system. Different
water constituents also may affect the disinfection effec-
tiveness of the treatment and the formation of unwanted
disinfection byproducts.
                                                   33

-------
For surface water systems, the chemical composition of
the water depends on the upstream or watershed geo-
chemistry, the seasonal nature of the water body used
as the source, and the characteristics that the treatment
imparts, such as coagulation with ferric sulfate or alum
(aluminum sulfate), lime softening, filtration, pH adjust-
ment, corrosion control treatment, chlorination, etc.

The scale of the source water chemical data, therefore,
is large, driven by the geology, soil nature, land use, and
climate. The chemical nature of the treated water, how-
ever, may differ significantly from that of its source.

Ground-Water Supplies

Many water utilities use multiple ground-water wells. A
water supply of medium to large size usually uses mul-
tiple wells, instead of or in addition to the surface water
supplies. Wells  number from only two or three to more
than 100 for very large water systems. Wells normally
operate in different patterns, and only rarely do all wells
operate at the same time. The yield of the wells and their
water quality dictates the combination and number of
wells used at a particular moment. The wells may or may
not be from the same aquifer, and even if they are, local
inhomogeneities frequently exist in water composition
(especially with  iron and manganese) that  limit the use-
fulness of certain wells without substantial treatment.

Historically, utilities have treated  some (but not neces-
sarily all) wells with  a chemical  such as a polyphos-
phate or sodium silicate  to  sequester the  iron  and
manganese from wells.  Some utilities install  physical
removal processes such as ion-exchange softeners, re-
verse-osmosis  plants,  aeration  systems  for  iron  re-
moval, air stripping towers for volatile organic compound
or radon removal, or "greensand" filters for the removal
of iron and manganese. These facilities sometimes exist
at only certain well sites or at some point  where water
from multiple wells is combined.

The scale of chemical controls on ground-water sup-
plies, therefore, becomes only hundreds of feet. Con-
taminants of raw waters, such  as arsenic, nitrate, or
chromium, are geologically and geochemically control-
led. Therefore, their occurrence is geographically vari-
able on even a small scale, and the variability exists
vertically in  the subsurface as well as horizontally. A
municipality may use wells of different depths into differ-
ent aquifers, or even approximately the  same depth
spread out over hundreds of feet to many miles in the
same aquifer or a variety of geologic units.

The variability of individual ground-water wells overtime
(such as seasonally)  is usually less apparent than with
surface water sources, but the fact that many wells are
frequently used in different combinations and for differ-
ent lengths of time (hours to days, usually) makes char-
acterizing "influent"  water quality  complicated.  The
same observation applies to water systems that allow
different amounts of water to bypass treatment proc-
esses (e.g., ion-exchange, reverse osmosis) depending
on the levels of targeted undesirable contaminants (e.g.,
nitrate, sulfate, arsenic).

These characteristics of the nature of chemical compo-
sition,  use,  and treatment of ground-water supplies
clearly show that generalizations  over areas such as
states or geographic regions (e.g., New England, Upper
Midwest) are at least very gross and uncertain and at
worst,  entirely misleading when decisions are to be
made about risk and health assessments, or estimates
of the necessity for certain treatments or economic im-
pacts of different potential drinking water regulations.

Combination Systems

Some municipalities combine the use of surface water
supplies  and ground-water wells.  Therefore, general
water chemical characteristics vary throughout the sys-
tem in a regular manner in response to the location and
use of different sources, as well as relative amounts of
water that the different sources produce and deliver.

Distribution System Mains

The next lower level of scale is the distribution  system
network of pipes and storage. Common materials used
for distribution system piping include cast  iron, ductile
iron, cement mortar-lined iron, iron with organic coat-
ings, asbestos-cement (A-C), and various forms  of plas-
tic. Pipe diameters range from about 4 inches to many
feet, depending on size of the water utility and commu-
nity, size of the neighborhood fed  by the line, and  dis-
tance of travel for the water. Here, because of the large
volume of water involved relative to the pipe diameter,
the major chemical interactions involve such constitu-
ents as hardness (calcium  and  magnesium)  ions, pH,
iron, bicarbonate and carbonate ions, and  chlorine re-
sidual species, and possibly microbiological parameters
such as total plate counts,  heterotrophic plate  counts,
and assimilable organic carbon. Disinfection byproducts
(DBFs) may change in concentration and type because
of the time involved in the water traveling  through the
piping from the treatment plant. Trace metal contamina-
tion, such as lead and copper, is usually negligible from
this source,  unless it is  present when distributed from
the wells or water treatment plants.

Depending on prevalent economics  and construction
practices during  periods of water system  growth, the
materials will not be either randomly or uniformly distrib-
uted geographically within  system boundaries. Water
flow often varies greatly within the distribution system,
and water lines sometimes terminate in dead-end areas
with minimal flow rates. Water quality often differs sub-
stantially in these dead ends from that in the fully flowing
parts of the distribution system.
                                                  34

-------
Household Service Lines

Service lines represent the connection between the
house or building and the distribution main. Sometimes,
the service lines are joined to the mains by a flexible,
approximately 2- to 3-foot long pipe called a "goose-
neck" or "pig-tail." Historically, this connector was often
made of lead.  Recently, copper  has been the  most
widely used material, with plastic gaining in acceptance.
Service lines for homes are usually 0.75 to  1  inch in
diameter, with service lines for many commercial build-
ings or multifamily dwellings ranging in size from 1.5 to
3 inches in diameter. Service line for homes and build-
ings have usually been made  of lead, brass, copper,
galvanized steel, or plastic. The material used depends
on the age of the water connection and the construction
practices of the area involved. A recent report estimated
that approximately 6.4 million lead connections ("goose-
necks") still  exist in the United States, and about 3.3
million lead service lines still exist (13). In many commu-
nities, old lead service  lines remain a major source of
lead in  drinking water.

Like distribution system materials, service line materials
may vary greatly within a distribution system by space
and time. For instance, in large eastern cities, very old
neighborhoods may have many (or even mostly) lead
service lines. New neighborhoods likely have copper or
plastic  service  lines. Galvanized steel or copper pipes
may have been installed between the era when lead was
used and modern times. With the exception of Chicago,
where lead service lines were occasionally installed into
the 1980s, the  use  of lead for service lines  generally
stopped in the late 1940s or early 1950s. An example of
nonuniform  distribution  of service line  materials is
shown  by Figure 1, a map indicating Cincinnati sampling
sites for Lead  and  Copper Rule (14-17)  monitoring.
Erratic  clustering of different service line materials is
evident.

Rehabilitation of old houses or replacement of failed
piping results in a mixture of new and old material in
areas where houses are predominantly old. Following
completion of the construction,  maps of service line
material would show many clusters representing preva-
lent plumbing codes and economics.

Interior Plumbing

Interior plumbing of buildings and  houses reflects even
more variability than service lines. This is the dominant
contributor to lead and copper levels at most sites cov-
ered under the Lead and Copper Rule (14-17).  Interior
plumbing consists of piping, plus a  large number of
valves, connectors, fixtures, and perhaps soldered joints
and a water meter. Any or all of these components are
replaced at varying  intervals as a result of failures or
remodeling.  Therefore, even generalizations within a
small neighborhood  are risky, unless the neighborhood
is very new and uniformly constructed. When attempting
to  survey the  composition of plumbing materials that
might be the source of drinking water contamination,
merely asking for the age of the house or building is
insufficient. Questions must be asked to obtain the nec-
essary precise information on  the  age and type of
plumbing materials and components in the building.

Typical interior plumbing materials include lead, galva-
nized steel, copper, and different  plastics for pipes.
Some brass and black steel  have been used for short
times in some areas. Faucets are almost always made
with either brass or plastic internal parts, which differ in
composition from the exteriors, which are usually plated
with chrome or other metal. Interior faucet volumes typi-
cally range from about 30 milliliters  to 120  milliliters,
depending upon design. Valves and meters are also
frequently made of brass or bronze, which  are  cop-
per/zinc alloys usually containing 2 percent to 6 percent
lead. Until recently, solders used to join copper drinking
water pipe sections  were usually a tin and lead combi-
nation, containing 40 percent to 60 percent  lead. Occa-
sionally,  connector  lines to  fixtures  include copper,
stainless steel, aluminum, or flexible plastic sections.

Private  Water Systems

The many possible designs of domestic water systems
originating from wells or cisterns are too numerous to
illustrate. Figure 2 gives an example of one such system
layout. Private systems share many  features with do-
mestic systems  supplied by  water utilities,  however.
Interior plumbing shares most of the same configura-
tions and materials. For private water systems, addi-
tional plumbing that could cause  contamination or water
chemistry changes includes well casing material, sub-
mersible  pump casing and fittings,  pressure tank feed
and control plumbing, and nonsubmersible pump inte-
rior materials. Therefore, problems with determining the
frequency and distribution of levels of potential contami-
nants include those present  for domestic situations in
general, plus those complications arising from cycling of
the pumps, pressure tank system, or both.

Water Samples  Representing Distances

One  of the most  important  fundamentals of under-
standing  drinking water sampling is that  volumes of
water (e.g., 1-liter samples, 250-milliliter samples) rep-
resent the linear distance of plumbing material in contact
with the water sampled. Because of water mixing and
flow during use or sampling,  they are  also integrated
samples  of that volume. This understanding is at the
heart  of designing  accurate water sampling programs
and making viable interpretations of existing monitoring
data that may  be contained in (or mappable by) a CIS.

Table 2 summarizes some interesting and  important
relationships between pipes of different inside diameters
                                                  35

-------
-4 ii —
S_ • wL -
-Tr ' • i£s .,
""*j '
-V . : - .' Jf%£. '- .
:- : ..?";. •
>--.:-" '
-
. Lead Site
» Copper Site
W.Q.P. Site
• C.W.W. Facility
•*• Bolton D.S.
*** California D.S.
Not C.W.W. D.S.
— Service Limits
California Water
Treatment Complex
Figure 1. Cincinnati Water Works Lead and Copper Rule compliance monitoring, July to December 1992.
Figure 2. Distribution system.
(IDs) and the volumes of water they contain per unit of
length (7). Much domestic interior plumbing has an ID of
approximately 0.5 inches, depending upon the material.

Figure 3 shows schematically what parts of a plumbing
system would likely be represented by samples of differ-
ent volumes taken after water was allowed to stand in
the pipe for many hours. Faucets, bubblers, and other
terminating fixtures vary widely in volume. Kitchen-type
fixtures usually contain from 60 to 120 milliliters of water.
Bathroom-type fixtures may contain only about 30 to 60
milliliters of water. Bubblers, such as those frequently
found on school or office drinking fountains, are smaller
still. As can be seen schematically in Figure 3a, a small
volume such as 125 milliliters captures the faucet and a
short distance of pipe immediately leading to it. In many
plumbing systems, this volume catches water in contact
with numerous soldered joints. On the other hand, if a
single 1-liter first-draw sample is taken, the water in the
bottle represents a much longer distance back into the
plumbing system. In a situation where the source of lead
in drinking water is a new brass faucet, or soldered joints
of lead-tin solder, this larger volume usually gives a
lower lead concentration than the smaller volume be-
cause more water in the sample is not in intimate contact
with materials containing lead.

Other sampling schemes logically follow. For instance,
if examining copper pipe corrosion, discarding the first
36

-------
Table 2.  Interrelationships Among Pipe Length, ID, and
        Internal Volume for Selected Common Plumbing
        Materials and Pipe Sizes
Material
Copper
tubing
Copper
tubing
Copper pipe
Galvanized
steel pipe
Lead pipe
or tube
Lead pipe
or tube
PVCor
CPVC pipe
Length
for 1,000
Identification/ True True Milliliters
Type ID OD (Feet)
0.5-inch, type 0.545 0.625 22
L, annealed
0.5-inch, type 0.545 0.625 22
L, drawn
0.5-inch, 0.622 0.840 17
schedule 40
0.5-inch, 0.616 0.840 17
schedule 40
0.5-inch ID, 0.50 1.00 26
0.25-inch wall
0.75-inch ID, 0.75 1.25 11.5
0.25-inch wall
0.5-inch, 0.546 0.840 22
schedule 80
Milliliters
per Foot
46
46
60
59
39
87
46
                Plumbing Represented by Samples


          a.  1,000 Milliliters = 22 Feet @ 0.5-Inch ID Cu Type L
    Faucet
  1,000-Milliliter Sample
                                              Main
                               Service Line
                                                  125 or 250  milliliters  of water is  likely to give more
                                                  accurate information because it minimizes the effects of
                                                  the faucet material as well as piping that connects the
                                                  faucet to the interior line.  Often, this connecting piping
                                                  is not copper. If examining the corrosivity of the water to
                                                  lead service lines, wasting a volume of water corre-
                                                  sponding  to the distance from the outlet to the service
                                                  line better estimates the effect, although not without
                                                  uncertainty  (18).  Many other  sampling schemes  are
                                                  possible and useful, but users must be aware that the
                                                  sampling  protocol may have as much or more influence
                                                  on  the observed metal concentration than water quality
                                                  or other variables. Hence, incorporation of monitoring
                                                  data into a CIS database must be done only when the
                                                  source represents equivalent samples.

                                                  Because of turbulent mixing during flow, local high con-
                                                  centrations of lead (or other contaminant) may become
                                                  broadened and diluted by the time the  water to be
                                                  sampled reaches the  sample collection bottle (18). In
                                                  many  cases, therefore, numerous small-volume  se-
                                                  quential samples  can be taken and used to profile  a
                                                  plumbing  system  to  locate brass valves, connectors,
                                                  soldered joints, etc. Figure 4 illustrates sequential sam-
                                                  pling results  for one  room of a building. Peaks in the
                                                  distribution of samples physically correspond to the lo-
                                                  cation of  a chrome-plated brass faucet and to  a  later
                                                  concentration of fresh Sn:Pb soldered joints. Unfortu-
                                                  nately, even small-volume sequential tap water samples
                                                  must pass over other potentially contaminating or alter-
                                                  ing surfaces  on the way through the sampling tap.

                                                  Kuch and Wagner have shown how water  can dissolve
                                                  large amounts of lead simply by traveling through long
                                                  distances in lead pipes with small  IDs (9, 19). Although
                                                  this study specifically examined lead, the  principle ap-
                                                  plies to other metallic piping materials. This  phenome-
                                                  non is inseparable from the aspect of time, which is the
                                                  next subject.
    Faucet
          b.  125 Milliliters = 2.75 Feet @ 0.5-Inch ID Cu Type L
                        Household
                        Plumbing
         T
         Soldered
         Joints


125-Milliliter Sample
                                              Main
                        Meter
                      •1UF-
                               Service Line
Figure 3.  Schematic  diagram  of plumbing materials repre-
         sented by sample volumes of a) 1 liter and b) 125
         milliliters.
  300 -,

| 250 -




I 150-
o

1, 100 -

1  50-
                                                                                      Samples
                                                                                      1-2  60 Milliliters
                                                                                      3-12 125 Milliliters
                                                           1  2   3   4  5   6   7  8  9  10 11 12
                                                                     Sample Sequence, #


                                                  Figure 4.  Sequential sampling results from a room on the
                                                           ground floor of a building.
                                                    37

-------
Effect of Time

While  some chemical  reactions  are instantaneous,
many dissolution and precipitation reaction steps that
are important in controlling metal levels  in water take
many hours to many days to reach equilibrium. In fact,
passivation films and scales on pipes that inhibit corro-
sion and  reduce leaching of trace metals  may  take
months to decades to develop substantially. Some other
chemical  transformations, such as creation of triha-
lomethanes from chlorination  of natural organic matter
during disinfection, or processes such as inactivation of
pathogens, may occur over hours (20, 21).

Many  steps in  an  overall chemical reaction  process
could be rate-limiting. Figure  5 shows how lead levels
increase in 0.5-inch ID pipe given two different assump-
tions. The top  curve shows  how lead increases and
levels off after about 8 to 12 hours (9, 19). This curve is
closely applicable to any metal, as long as the limiting
factor on the rate of metal migration into the water is the
radial diffusion of the soluble  metal species away from
the pipe surface. The second curve shows,  schemati-
cally, the effect of a diffusion  barrier film (e.g., calcium
carbonate, adsorbed iron hydroxide mixed with organic
matter, aluminosilicate mineral deposits) or inhibition of
metal oxidation rates on lead  migration into water after
different amounts of time.

In some water  systems, significant chemical changes
can occur while the water is standing that can drastically
affect the oxidation or solubility of the plumbing material.
For example, dissolved  oxygen and free chlorine react
quickly in  new copper pipe or  brass. If the water stands
   0.01
                        Stagnation Level Controlled by
                        Metal Ion Diffusion
                      Stagnation in Presence of Diffusion
                      or Oxidation Barrier Film
           10
                 20
                      30    40    50

                      Time, Hours
                                       60
                                             70
Figure 5.  Comparison of lead concentrations that would be ob-
         served after water stands different amounts of time
         given different controlling chemistry factors.
sufficiently long, their concentrations may become neg-
ligible, which would significantly alter the redox condi-
tions  governing  metal  solubility.  In the  absence  of
oxygen or chlorine species, the dominant form of copper
in water and on plumbing material then becomes cop-
per(l) instead of copper(ll), resulting in different solubility
characteristics after consumption of the oxidant than at
initiation of the standing period (11).

Seemingly identical water samples collected from the
same taps in houses, schools,  or other buildings yield
different metal concentrations, depending on the time
the water was in contact with the faucet,  solder, or piping
material. Similarly, samples taken for disinfection bypro-
ducts after different chlorine contact times may produce
different concentrations and different speciation (e.g.,
trihalomethanes, haloacetic acids). This factor causes
considerable confusion in many investigations of con-
tamination of school or building drinking water taps and
water coolers and complicates estimating human expo-
sure for health-effects studies.

Interconnectedness of Distance and Time

In  innumerable situations, the effects of distance and
time are impossible to separate. Some generalizations
and examples follow.

Dead ends and slow rate areas produce long residence
times for the distributed water. This results in long con-
tact times with pipe materials, so reactive constituents
can change  considerably in concentration. The process
is totally interactive, in that concentration changes of
reactive constituents are in response to  contact with the
pipe materials, and in  turn, the materials respond to the
water composition.  Water may take hours to days to
reach a particular home  or building and  may traverse
many miles of distribution system piping of the same or
differing composition.  Water thoroughly run  through  a
household faucet for 5 or 10 minutes to purge the lines
is "fresh"  from the resident's perspective, but may be
"old" from the distribution system perspective.

The profile of the water line shown in Figure 4 was made
on the basis of filling small volume (60 or 125 milliliters)
sample bottles, one after another, without wasting any
water.  If the  objective  were to capture only the highest
risk of lead contamination from a lead service line after
some hours of stagnation, then the sampling  process
would  be  different.  Instead of collecting  all  water be-
tween the tap and the  service line, the water can be run
until either a target volume is wasted (representing the
linear plumbing distance to the service line) or can be
run at a given rate until, after the appropriate length of
time passes, the sample bottle can "intercept" the slug
of water residing in the service line. Beware  that differ-
ences  may exist in the peak concentration of the con-
taminant  and the  "width" of  the slug  of elevated
                                                   38

-------
contaminant level, depending upon the rate of waterflow
before and during sampling (18, 22, 23).

Other Sources of Variability in Water
Samples

Variability in water samples can stem from many sources
aside from those discussed in this paper (18). The na-
ture  of the errors and their likely magnitude may vary
with  each episode of sampling and analysis and is far
beyond the present scope of discussion. A brief listing
to consider,  however,  when drawing conclusions from
"field" data includes:

• Analytical  imprecision or bias.

• Flow rate of the water during sampling.

• Temperature.

• Particulate erosion  from plumbing materials during
  sampling.

• Effect of container material.

• Effect of air contact or other handling  effects during
  sample collection and shipment.

A Case Study of Easy Misinterpretation

Interpretations of water quality problems based on ag-
gregate monitoring data can be very misleading unless
analysis is performed at the appropriate scale. The situ-
ation of one  utility described below  provides a good
example  of  how using  a CIS  approach could have
helped solve the problem but also highlights how care-
fully  data would need  to be matched and consolidated
only  at the proper scale  if CIS were to  be employed for
evaluating some kinds of water quality problems.

The utility at Hopkinton, Massachusetts, found very high
lead  and  copper levels exceeding the regulatory action
levels under the Lead and Copper Rule (14-17). The
90th  percentile copper  levels even  exceeded 6  milli-
grams per liter, compared with an action level require-
ment of only 1.3 milligrams per liter. Some sites with lead
service lines are present in the system. Figure 6 shows
a schematic representation of the distribution system for
this utility. Five wells feed the system, and four (1, 2, 4,
and 5) are used regularly.

The  background  hardness, alkalinity, and  carbonate
concentrations are fairly similar for all wells. The pH of
the ground water from the wells is usually slightly above
6. Chlorine solution is dosed for disinfection. High  iron
levels are present in wells 1, 2, and 3, and high manga-
nese is also  present in well 3. Generally, high dosages
of a polyphosphate chemical were added to wells 1  and
2 to  respond to consumer complaints about the "red
water" that results from iron oxidation and precipitation.
Water from different wells mixes in the distribution sys-
tem,  but the water tends to partition into two zones as
                                      WelM
   Well 4
    Well 5
Figure 6.  Schematic representation of the utility's distribution
         system.

Figure 6 indicates. The lead and copper levels tended
to be distinctly lower in the section where the polyphos-
phate was dosed, marked as the "treated" part of the
system. The ongoing research study has employed ap-
proximately 22 monitoring sites.

From the information presented thus far, the  system
clearly cannot be characterized by a discrete value for
lead or copper contamination, as well as the chemical
background  of water throughout  the  system.  Hence,
putting data at the "whole system" scale into a statewide
or countrywide data system would be tempting, but it
could be very misleading in solving the treatment prob-
lem. Having accurate spatially distributed data for back-
ground water qualities, monitoring site characteristics,
and  metal levels at the subsystem scale, such as that
which could  be integrated into CIS, would have been
extremely convenient, however. Yet, even more informa-
tion at a smaller scale is necessary to understand and
solve the whole treatment problem.

The utility initially observed that because the lowest lead
and copper levels coincided with the area of the system
fed by the polyphosphate chemical, that chemical likely
caused the corrosion inhibition. Median lead levels,  for
example, were between about  200 and 300 milligrams
per liter in the "untreated" section, compared with about
10 to 15 milligrams  per liter in the "treated" section.
Median copper levels were approximately 4 to 5 milli-
grams per liter in the untreated section, but only about
0.3 to 0.5 milligrams per liter in  the treated section. The
utility and the researchers  wondered whether the poly-
phosphate chemical should also be added to the other
wells. This is a matter of significant concern because
some studies indicate polyphosphate chemicals can en-
hance lead corrosion (1, 6) and the  subject has rarely
been studied under statistically valid controlled conditions.

Additional site-by-site investigation,  however, first  re-
vealed that the sites with lead service lines all lay in the
untreated area of the distribution system. Because the
                                                   39

-------
research sampling  program included two successive
1-liter samples, the additional contamination from the
service lines was confirmed by higher lead levels in the
second  1-liter sample than in the first in many cases.
Therefore,  physical reasons, in addition  to  chemical
ones, explained the discrepancy in the lead levels. Fur-
ther, when considering only the treated system sites, the
lead levels were still high enough to be of concern.

Focusing on the copper sites resulted in the collection
of more important and  interesting small-scale informa-
tion. Figure 7 shows the  difference  between average
copper levels in the two sections of the system. Almost
all sites in both parts of the system had  copper interior
plumbing with 50:50 or  60:40 Sn:Pb soldered joints and
faucets with brass-containing internal materials. Though
the chemical added for iron control was ostensibly a
polyphosphate chemical,  it also contained an initially
present fraction of orthophosphate  and also tended
to partially break down to orthophosphate in the pres-
ence of iron and calcium (as most polyphosphates do).
Figure 8 shows the orthophosphate concentrations in
the two different parts of the system. While the levels of
orthophosphate present in the treated section would be
far too low to significantly inhibit lead leaching at the
background pH  (1, 6,  7,  9, 22), the orthophosphate
plausibly may significantly inhibit copper dissolution, in
concordance with recent research projections (11).

Having determined  through detailed small-scale sam-
pling and analysis  that the  chemistry affecting  metal
levels in the system is generally consistent with modern
knowledge, a new treatment plan is being implemented
to control copper and lead  levels through  pH adjustment
in conjunction with  iron control through a compatible
sodium  silicate/oxidation  treatment. Incorporation  of
system and monitoring  site physical characteristic data,
plus monitoring results, into CIS could have saved con-
siderable  investigatory effort. The importance of this
case history,  however,  is that the data must be of the
appropriate scale and highly documented to be useful in
problem-solving. Failure to use  data  meeting these re-

                               	Untreated Average
                               ----• Treated Average
               Oct.
Nov.
                               Dec.
Mar.
Apr.
0.6 -,

0.5-
0.4-
0.3-
0.2-
0.1 -


J
/
**

^
*»^
*<
/
t j_

— Untreated Average

	 Treated Average
\
T/ > 	 I
1 T • 	 .
!-!•-•-•


                                                             Sept.    Oct.
                                                      Nov.      Dec.     Mar.     Apr.
Figure 7.  Average copper levels in treated and untreated sec-
         tions of the system.
Figure 8.  Average orthophosphate levels in  treated and un-
         treated sections of the system.

quirements, as well as overgeneralization to a large
mapping scale, can lead to ineffective if not damaging
water treatment choices that could adversely affect pub-
lic health.

Conclusions

The  examples  and discussion above lead to several
general conclusions about the  use of CIS with drinking
water monitoring data:

• Temporal and spatial  variability stems from  many
  causes, down to a very small scale.

• Sampling protocols must be keyed to the precise
  questions under investigation.

• Regulatory sampling,  whose results  are generally
  readily available, is usually  inappropriate to assess
  human exposure to trace metals or other parameters
  of interest (such as DBPs).

• Generalizations on a large scale are often impossible
  because  of the geology and water chemistry variations.

Additionally, some  considerations  apply to the types of
mapping that could be employed by CIS. For example,
a mapping  technique such as contouring may  be espe-
cially inappropriate for  use with  drinking water data.
Major problems could result from:

• Discrete, small-scale  (such  as within  an individual
  house) variability in distributions of certain  contami-
  nants, such as lead and  copper.

• Physical  constraints of the distribution system network.

• The small number of monitoring sites in relation to
  the size  of the distribution network.

• Different chemical or hydraulic zones  in the distribu-
  tion system.

Employing  CIS could be very useful in solving a variety
of drinking water problems. Users must be extremely
conscious of the nature of the source information, how-
ever, to avoid abusive extrapolations and generalizations.
                                                   40

-------
Acknowledgments

Some of the examples cited were  investigated as part
of a U.S. Environmental Protection Agency cooperative
agreement  (Darren A. Lytle, Project Officer) with the
New England Waterworks Association. Jack DeMarco
of the Cincinnati Water Works kindly provided Figure  1
on distribution  of sampling sites for lead and copper
monitoring.

References

 1. Schock, M.R. 1990. Internal corrosion and deposition control. In:
    Water quality and  treatment: A handbook of community water
    supplies, 4th ed.  New York, NY: McGraw Hill.
 2. Benefield, L.D., and J.S.  Morgan.  1990.  Chemical precipitation.
    In: Water quality and treatment: A handbook of community water
    supplies, 4th ed.  New York, NY: McGraw Hill.
 3. Glaze, W.H. 1990. Chemical oxidation. In: Water quality and treat-
    ment: A handbook of community  water  supplies,  4th ed. New
    York, NY: McGraw  Hill.
 4. Cotruvo, J.A., and  C.D. Vogt. 1990. Rationale for water quality
    standards and goals. In: Water quality and treatment: A handbook
    of community water supplies, 4th ed. New York, NY: McGraw Hill.
 5. Tate, C.H., and K.F. Arnold. 1990.  Health and aesthetic aspects
    of water quality. In: Water quality and treatment: A handbook of
    community water supplies, 4th ed. New York,  NY: McGraw Hill.
 6. Schock, M.R. 1989. Understanding corrosion control strategies
    for lead. JAWWA 81:7:88.
 7. AWWARF. 1990.  Lead control strategies. Denver, CO: American
    Water Works Association  Research Foundation and  American
    Water Works Association.
 8. AWWARF. 1994.  Corrosion control in water distribution systems,
    2nd  ed. Denver,  CO: American Water Works Association  Re-
    search Foundation/Engler Bunte Institute.
 9. Schock, M.R., and  I. Wagner. 1985. The corrosion  and solubility
    of lead in drinking water. In: Internal corrosion of water distribution
    systems. Denver,  CO:  American Water Works Association  Re-
    search Foundation/DVGW Forschungsstelle.
10.  Pankow, J.F. 1991.  Aquatic chemistry concepts.  Chelsea, Ml:
    Lewis Publishers, Inc.

11.  U.S. EPA. 1995. Effect of pH,  DIG, orthophosphate and sulfate
    on drinking water cuprosolvency. Cincinnati, OH:  Office of Re-
    search and Development. In press.

12.  AWWA. 1993. Initial monitoring  experiences of large water utilities
    under U.S. EPA's Lead and Copper  Rule. Denver, CO:  Water
    Industry   Technical   Action   Fund/American   Water  Works
    Association.

13.  AWWA. 1990. Lead service line  replacement:  Benefit to cost
    analysis.  Denver,  CO:  Water   Industry  Technical  Action
    Fund/American Water Works Association.

14.  AWWA. 1991. Lead and copper: A working explanation  of the
    Lead and Copper Rule. Denver,  CO: American Water Works
    Association.

15.  Federal Register. 1991. Lead and copper: Final rule correction.
    Fed. Reg. 56:135:32,112 (July  15).

16.  Federal Register. 1991. Lead and copper: Final rule. Fed. Reg.
    56:110:26,460 (June 7).

17.  Federal Register. 1992. Lead and copper: Final rule correction.
    Fed. Reg. 57:125:28,785 (June 29).

18.  Schock, M.R. 1990. Causes of temporal variability of lead in
    domestic plumbing systems. Environ.  Monit. Assess. 15:59.

19.  Kuch, A., and I. Wagner. 1983. Mass  transfer model to describe
    lead concentrations in drinking  water.  Water Res. 17:10:1,303.

20.  Ireland, J.C. 1993. Alternatives to chlorine for disinfection of drink-
    ing water. In: Strategies and  technologies for meeting SDWA
    requirements. Lancaster, PA: Technomic.

21.  Miltner, R.J.  1993. Pilot scale treatment for control of disinfection
    byproducts.  In: Strategies and  technologies for meeting SDWA
    requirements. Lancaster, PA: Technomic.

22.  Sheiham,  I., and P.J. Jackson. 1981.  Scientific basis for control
    of lead in drinking water by water treatment. J. Inst. Water Engrs.
    and Scientists 35:6:491.

23.  Heumann, D.W.  1989. Los Angeles  Department of Water and
    Power: Solid lead gooseneck slug dispersion in consumer plumb-
    ing system. Proceedings of the  AWWA Water Quality Technology
    Conference, Philadelphia, PA.
                                                           41

-------
Ground-Water Applications

-------
     Using GIS/GPS in the Design and Operation of Minnesota's Ground Water
                           Monitoring and Assessment Program
                Tom Clark, Yuan-Ming Hsu, Jennifer Schlotthauer, and Don Jakes
                     Minnesota Pollution Control Agency, St. Paul, Minnesota

                                        Georgianna Myers
                        Water Management Consultants, Denver, Colorado
Abstract

Minnesota's Ground Water Monitoring and Assessment
Program (GWMAP) is administered  by the Minnesota
Pollution Control Agency (MPCA) to evaluate baseline
ground-water quality conditions regionally and state-
wide. The  program uses a systematic sampling design
to maintain uniform geographic distribution of randomly
selected monitoring stations (wells) for  ground-water
sampling and data analysis. In  1993, geographic infor-
mation system (CIS) and  global positioning  system
(GPS) technologies were integrated into GWMAP, auto-
mating the selection of wells and the field determination
of well locations.

GWMAP consists of three components: the statewide
baseline network, regional monitoring  cooperatives,
and a trends analysis component. In the statewide
baseline network, Minnesota is divided into over 700
121-square-mile grid cells,  each with a centralized,
9-square-mile sampling region.  Wthin each target area,
single-aquifer, cased and grouted wells are sampled for
about 125  metals, organic compounds, and major cat-
ions and anions. We are currently finishing the second
year of a 5-year program to establish the statewide grid.
When complete, the statewide baseline component will
consist of  about 1,600 wells representing Minnesota's
14 major aquifers.

In 1993, approximately 4,000 well construction records
were selected for geologic and hydrologic review, using
a CIS overlay, from a database of 200,000 water well
records maintained in the state's County Well Index
(CWI). Using GPS, 364 wells were sampled and field
located. The  semiautomatic well selection process uses
existing electronic coverage of public land survey (PLS)
data maintained in CWI in conjunction with the digitized
systematic sampling grid. CIS has greatly reduced the
time needed for selecting sampling stations. With the
combination of CIS and GPS, program costs have de-
creased, allowing more resources to be applied toward sam-
pling, while efficiency and quality of data have improved.

Introduction

Quantitative assessment of ground-water quality condi-
tions requires a highly organized  data collection pro-
gram that includes statistical  evaluation of monitoring
results (1, 2). States have difficulty providing the staff
and financial resources necessary to generate state-
wide quantitative  ground-water information. Wth the
use of geographic information system (CIS) and global
positioning system (GPS) technologies, however, states
have the potential to improve the quality of environ-
mental monitoring programs and to reduce the amount
of staff time needed to collect and evaluate data, thus
decreasing costs. The degree to which states realize
these potential benefits depends largely on how effec-
tively the technology can be incorporated into the design
of the monitoring  program. This paper describes how
CIS and GPS technologies are being integrated into the
design and operation  of Minnesota's Ground Water
Monitoring and Assessment Program (GWMAP) to im-
prove overall effectiveness.

The Minnesota Pollution  Control Agency (MPCA) has
sampled and analyzed ambient ground-water quality in
the state's 14 principal aquifers since 1978. In 1990, the
MPCA began a redesign of its ground-water monitoring
program to better assess water quality conditions state-
wide (3). Three program components resulted from the
redesign: a statewide  baseline network  for complete
geographic coverage, a trends analysis component for
intensive studies of how ground-water quality in specific
areas changes with time, and  a  regional monitoring
cooperative link to governmental units such as counties
                                                45

-------
to meet specific local ground-water assessment needs.
This paper describes the design and operation of the
statewide baseline network.

The design of the statewide network is geographically
and statistically based to automate well selection and
data interpretation. In 1993, the MPCA began integrat-
ing CIS and GPS technologies into this  part of the
program. The implementation of CIS  and GPS sur-
passed our expectations by reducing staff time re-
quired to select wells and evaluate analytical results
(see Table 1). In addition,  through  the elimination of
previously uncontrollable variables, the use of CIS and
GPS has increased the accuracy of GWMAP data.

Monitoring Program Description

Since 1992, GWMAP has selected 150 to 250 existing
water supplies yearly for ground-water sampling and
analysis of about 125 parameters, including major cat-
ions and  anions,  metals, and  volatile organic com-
pounds.  Well selection  is a  fundamental  element of
GWMAP that, if efficiently performed, supports the pro-
gram objectives by upholding the quality of the monitor-
ing data and minimizing the operating costs.

A key  to the interpretation of monitoring  data is  the
technique used to  select wells for sampling (2, 4, 5).
Minnesota  has over 200,000 active water wells with
approximately 10,000 new installations annually.  For
each well selected for GWMAP monitoring, a hydrologist
must individually review many well construction records.
An automated prescreening mechanism to facilitate well
selection can result in considerable time (and therefore
cost) savings. GWMAP chose CIS as the best tool for
this task. CIS enables the program to combine a sys-
tematic sampling technique with hydrogeologic criteria
to ensure an efficient and consistent selection process.
As Table 1  shows, CIS allowed us to more than triple
our geographic coverage and wells initially selected,
while  dramatically  reducing the records that must be
individually reviewed. We realized a time savings of 2
months compared  with  the time required  before CIS
implementation.

In general,  systematic sampling techniques use a ran-
domly  generated uniform grid to  determine sampling
locations in space and/or time (5). Systematic sampling
was initially implemented in  GWMAP in 1991  using  a
manually generated spatial grid defined by the public
land survey (PLS) (3). Although the PLS is not 100

Table 1.  Well Selection  in 1992 and 1993
percent geographically uniform,  it was selected for the
grid to expedite well selection from existing digital data-
bases in which wells are organized by PLS location.

Systematic Sample Site Selection
Using GIS
Systematic sample site selection  is a three-step process.
First, a database search of Minnesota's County Well Index
(CWI) (6), containing nearly 200,000 driller's records, is
conducted to include  all available water wells in the
region of interest. Second, the candidate pool is reduced
to those wells located within regularly spaced grid cells.
Third, further wells  are eliminated from the candidate
pool by applying geologic and well construction criteria
mandated in the GWMAP design (7).

Generating the Sampling Grid
The statewide sampling grid was generated from a randomly
selected origin (8). This grid consists of approximately 700
square cells,  11  miles on a  side  (see Figure 1). The
centroid of each cell is  consecutively numbered and was
extracted to produce the origin of the sampling zone.
Figure 1. Statewide baseline network sampling grid.
Year
1992
1993
Area
Covered
9 counties
26 counties
PLS Sections
Selected
500
1,659
Well Logs
Selected
3,000
11,000
Well Logs
Reviewed
3,000
834
Wells
Sampled
158
206
Time
Spent
6 months
4 months
                                                  46

-------
Establishing the Sampling Zone

Each sampling zone consists of a 3- by 3-mile box from
which potential sampling sites are selected. It is gener-
ated by computing the coordinates of the four corners of
the box using the grid cell's centroid as the origin. To link
the sampling zone and grid cell, both are identified with
the same numerical code.

These sampling "target" zones,  a series of regularly
spaced, 9-square-mile boxes, are then made into a CIS
coverage and overlaid on top of the PLS coverage to
extract those sections that are associated with each of
the sampling zones. Ideally, each sampling zone should
cover exactly nine PLS sections (3). Due to irregularities
in the PLS system, however, portions of 16 to 20 sec-
tions usually fall within the sampling zone of each  cell
(see Figure 2).
                Watonwan
    Legend

   / \ /  PLS Boundary
Sample Zone
                     Sample Grid
                              County Border
Figure 2.  PLS and the sampling grid, Watonwan County.

Selection of PLS Sections

The  PLS  coverage was derived from the Minnesota
Land Management Information Center (LMIC) "GISMO"
file. It was originally created in 1979 by digitizing every
section corner in  Minnesota from the U.S. Geological
Survey (USGS) 7.5-minute quadrangle map series.

The  PLS section  information is necessary in the well
selection process because the original well construction
logs, maintained by the Minnesota Geological Survey
(MGS), are organized by PLS. Although most of the well
selection  process can  be  automated,  manual  file
searches for well records are still necessary and require
the PLS information.

Well Selection

After identifying the PLS sections within the sampling
grid, the statewide well database is imported as a point
coverage and  overlaid with the selected  PLS section
coverage. Thus, all wells that fall within the 16 to 20
sections are selected as potential candidates. The ac-
curacy of the well locations in CWI varies; most of the
point  locations  are  approximated  to  four  quarters
(2.5 acres). The CWI does not contain all well construc-
tion  information,  however, requiring that copies  of
driller's logs be made for GWMAP files.

The final well  selection  is done after applying the
9-square-mile sampling zone over the potential pool of
candidates. For wells that fall within the zone, the well
construction records are pulled from MGS  files, copied,
and submitted for hydrologist review.  Depending on the
target cell location, the number of candidate wells requir-
ing review may range from a few to more than 100. For
newly installed water wells whose records have not yet
been digitized by LMIC, the PLS locations of the wells
are manually plotted onto a map to confirm whether they
fall into a sampling grid cell. Typically, from 5 percent to
as many as 20  percent of selected wells that meet the
location criteria are sampled. This accounts for the hydro-
geologic and well construction criteria and the coopera-
tion  of  well  owners participating  in  the program.
Currently, interest in ground-water protection programs
runs high in rural Minnesota, with an acceptance rate of
up to 80 percent.

The  implementation  of CIS  in  well selection helped
GWMAP excel in two major areas. First, the develop-
ment  of the statewide CIS grid eliminated previously
uncontrolled variables by removing the  PLS spatial in-
consistencies from the systematic grid. Second, the CIS
reduced the manual workload with the automation of two
important steps in the well selection  process: the  gen-
eration of PLS section information to facilitate the data-
base search, and the identification of wells that meet the
geographic location criteria. The success of GWMAP
relies largely on the ability to use existing CIS cover-
ages. In using coverages created by  other entities, this
program  identified the need for a uniform standard for
data conversion and transfer.

Application of Global Positioning
Systems in  Ground-Water Sampling

In 1991, the U.S. Environmental Protection Agency
(EPA) established a policy that all new data collected
after 1992 should meet an accuracy goal of 25 meters
or better (9). The purpose of EPA's Locational Data
Policy (LDP) is to establish principles for collecting and
                                                  47

-------
documenting consistently formatted locational data to
facilitate cross-programmatic, multimedia analyses. Ac-
curate geographic information is important to the spatial
analysis of  well sampling results. Any uncertainty in
sample location can compromise  hydrogeologic analy-
sis (10). GPS is an easy, cost-effective solution.

Global Positioning System Field Application

Beginning in October 1992, GWMAP employed GPS in
the field to assist in locating sample sites. Applying GPS
in the field has proven to be quite easy. The program
uses a multichannel C/Acode receiver with internal data
logging capability. Typically,  the receiver is placed di-
rectly on top of the wellhead and logs 100 to 150 GPS
readings into the receiver's internal memory in approxi-
mately 5 minutes.

The GPS is also used for navigation in the field to locate
sampling sites. Because sampling sites are predeter-
mined, their locations can  be extracted from a topo-
graphic map. The approximate coordinates can then be
loaded into a GPS receiver. In most cases, the receiver
successfully led the field team within visual range of the
sampling site.

Because of the inherent selective availability (SA) of the
GPS, raw field data must go through a differential cor-
rection process to  achieve the goal of 25-meter accu-
racy (9,  11).

Data Management and Processing

Once the GPS receiver  is brought back from the field,
data are downloaded to a personal computer (I486 proc-
essor at a speed of 50 MHz) and differentially corrected
(11). The average or mean of the 100 or more readings
collected onsite is  calculated and  reported  as the site
location.

The MPCAdoes not operate a GPS base station for the
purpose of differential  correction. The base station data
are obtained through a computer network (Internet) from
the Minnesota Department of Health (MDH) base station
located in  Minneapolis.

To facilitate  future data integration and  document data
accuracy for secondary application, GWMAP proposed
quality assurance codes for GPS data collected by the
MPCA. The value of the accuracy proposed is a nominal
value rather than an  absolute number (see Table 2).
Each of the seven processing methods is assigned a
separate code.

In the field experience of GWMAP, a nominal  accuracy
of 2  to 5 meters has been  consistently achieved after
the postdifferential correction and averaging have been
applied to the data. This technology is suitable for any
program that is designed to conduct either large-area or
intensive monitoring activities. It helps  to cut costs by
Table 2.  Proposed Nominal Accuracy Reference Table
Type of GPS
Receiver Used
Processing Method
Used To Correct Data
 Nominal
Accuracy
 (meters)
Navigational
quality C/A code
receiver
Navigational
quality with carrier
aid receiver
Survey quality
receiver (dual or
single frequency)
Postdifferential corrected        2-5

Real-time differential
corrected (RTCM)             2-5

Autonomous mode (no
correction)                 15-100

Postdifferential corrected        < 1

Real-time differential
corrected (RTCM)             < 1

Autonomous mode (no        15-100
correction)

Postdifferential corrected       <0.1
increasing efficiency and accuracy of the data. The data
collected by GWMAP can be used not only in a regional
study but could be used directly in a site-specific inves-
tigation as well.

GWMAP also found that GPS can be  used most effi-
ciently by separating the two roles of field operator and
data manager. The field operators receive only the brief
instructions necessary to operate a GPS receiver before
going into the field. The data manager handles the data
processing details. The field operators can then concen-
trate their efforts on obtaining  ground-water samples
and conducting the hydrogeologic investigation.
Conclusions
CIS  and GPS  technologies made  it  possible for the
MPCA to implement the statewide GWMAP project by
optimizing the available funding and staff time. CIS mini-
mized  staff time spent on identifying  sampling areas,
manipulating the sampling grid, and selecting monitor-
ing sites. In addition, CIS enabled GWMAP to integrate
a variety of databases and maps of different scales.

Using GPS to locate sampling sites enabled GWMAP to
efficiently  obtain accurate geographic locational data
with  relative ease. This eliminated the  degree  of uncer-
tainty that previously might have compromised the sta-
tistical evaluation of the hydrogeologic data.

GWMAP's success in integrating existing digital data to
automate  the  well  selection process  clearly  demon-
strated the importance of the ability to share information
with  others and the great need for a  broadly applied
standard for data conversion and transfer.
                                                   48

-------
Acknowledgments

The authors wish to thank Renee Johnson  of the Min-
nesota Department of Natural Resources for her work
to convert the PLS data layerto CIS coverage. Susanne
Maeder of LMIC supplied the statewide CWI coverage,
and Susan Schreifels of MPCA conducted research on
the LDP and  made valuable suggestions on  implement-
ing GPS.

References
 1.  Nelson, J.D., and R.C. Ward. 1981. Statistical considerations and
    sampling techniques for ground-water quality monitoring. Ground
    Water 19(6):617-625.
 2.  Ward, R.C. 1989. Water quality monitoring—A systems approach
    to design. Presented at the International Symposium on the De-
    sign of Water Quality Information Systems, Colorado State  Uni-
    versity, Ft. Collins, CO.
 3.  Myers, G., S. Magdalene, D. Jakes,  and E. Porcher. 1992.  The
    redesign of the  ambient ground  water monitoring program. St.
    Paul, MN: Minnesota Pollution Control Agency.
 4.  Olea, R.A. 1984. Systematic sampling of spatial functions. Series
    on Spatial Analysis,  No. 7. Kansas Geological Survey, University
    of Kansas, Lawrence, KS.
 5.  Gilbert, R.0.1987. Statistical methods for environmental pollution
    monitoring. New York, NY: Van Nostrand Reinhold.

 6.  Wahl, I.E., and R.G. Tipping. 1991. Ground-water data manage-
    ment—The county well index. Report to the Legislative Commis-
    sion on Minnesota  Resources. Minnesota  Geological  Survey,
    University of Minnesota, St. Paul, MN.

 7.  Clark, T, Y Hsu, J.  Schlotthauer, and D. Jakes. 1994. Ground-
    water monitoring and  assessment program—Annual  report.  St.
    Paul, MN: Minnesota Pollution Control Agency.

 8.  ESRI. 1993. ARC/INFO version 6.1, ARCPLOT command refer-
    ences. Redlands, CA: Environmental Systems Research  Insti-
    tute, Inc.

 9.  U.S. EPA.  1992. Global positioning systems technology and its
    application in environmental programs. CIS Technical  Memoran-
    dum 3. EPA/600/R-92/036. Washington, DC.

10.  Mitchell, J.E. 1993.  A characterization  of the influence  of (x,y)
    uncertainty on predicting the form of three-dimensional surfaces.
    Proceedings of the  AWRA Spring Symposium on Geographic
    Information Systems and Water Resources, Mobile, AL. pp. 559-
    567.

11.  Trimble Navigation, Ltd. 1993. GPS Pathfinder System,  general
    reference. Sunnyvale, CA: Trimble Navigation, Ltd.
                                                         49

-------
    Use ofGIS in Modeling Ground-Water Flow in the Memphis, Tennessee, Area
                              James Outlaw and Michael Clay Brown
                           University of Memphis, Memphis, Tennessee
Abstract

Memphis,  Tennessee relies solely on ground water for
its municipal and industrial water supply. Memphis Light,
Gas, and  Water (MLGW) Division owns and operates
over 160 water wells in 10 production fields throughout
Shelby County. MLGW produces an average of approxi-
mately 200 million gallons per day, excluding much of
the industrial demand. The city obtains its water from a
thick, prolific  aquifer known  as the Memphis Sand,
which was thought  to be separated  from  a surficial
aquifer by a thick confining layer.  In recent years, evi-
dence of leakage from the surficial aquifer to the Mem-
phis Sand has been found.

The University  of  Memphis  Ground Water  Institute
(GWI) is developing a hydrogeologic database of the
Memphis area to study the aquifer. The database serves
as the basis for several ground-water flow models that
have been created  as well as part of the  wellhead
protection  programs currently being developed for Mem-
phis and other municipalities in Shelby County. A geo-
logic database was developed and is constantly being
updated from borehole geophysical logs made in the
area. Well locations  are being field verified  using a
global positioning system (GPS).

Use of the database has  allowed the development of a
three-dimensional model  of the Memphis area subsur-
face. The database also contains locations of and infor-
mation  on both  private  and  public  production  and
monitoring wells, Superfund sites, underground storage
tanks, city and county zoning, land use, and other per-
tinent information. Procedures  for linking the database
to ground-water flow and  solute transport models have
been developed. The data visualization capabilities and
the ability to  link information  to geographic features
make geographic  information  systems (CIS) an ideal
medium for solving ground-water problems.

An example of CIS use in ground-water flow modeling
is the study of the Justin  J. Davis Wellfield. The water
quality  parameters of alkalinity, hardness, sulfate, and
barium have significantly  increased over the  past 10
years at this facility. To understand why these changes
are occurring, MLGW, the GWI, and the U.S. Geological
Survey (USGS) participated in a joint investigation of the
wellfield.

In the spring of 1992, a series of 12 monitoring wells was
drilled into the surficial aquifer nearthe production wells.
Geophysical logging and split-spoon sampling revealed
an absence of  the  confining layer, referred to as a
window, at  one of the monitoring wells. All other wells
penetrated  various thicknesses of clay. This window in
the confining layer suggests that the  water  quality
changes  could  be due to leakage  from the  surficial
aquifer to the Memphis Sand.

The CIS database was used to construct a flow model
of the  Davis area. Also,  using the  surface modeling
capabilities  of CIS,  the extent of the confining layer
window was estimated and used  to calculate  leakage
between the two aquifers. The results of these analyses
also  indicate  that further subsurface  exploration  is
needed to more accurately define the extent of the con-
fining layer window.

Introduction

Memphis, Tennessee, relies solely on ground water for
its municipal and industrial water supply.  The Memphis
Light, Gas,  and  Water (MLGW) Division  owns  and op-
erates  over 160 water wells in  10 production  fields
throughout Shelby County, as shown in Figure 1. MLGW
produces an average of approximately 200 million gal-
lons per day, excluding much  of the industrial demand.

The city obtains its water from a  thick, prolific aquifer
known as the Memphis Sand, which was thought to be
separated from the surficial aquifer by a thick confining
layer. In recent  years, evidence  of leakage from the
surficial aquifer to the Memphis Sand has been found.
The  University  of Memphis  Ground  Water  Institute
(GWI)  is developing a hydrogeologic database of the
Memphis area to study the  aquifer. Several ground-water
                                                 50

-------
  Mississippi Alluvia] Plain
                       N
                      A
                0     25     50
              Scale in Thousands of Feet
Figure 1.  Physiographic description and MLGW Wellfields in
         Shelby County.

flow models have been developed using the database.
Also, the database is an integral  part of wellhead pro-
tection programs being developed for Memphis and
other municipalities in Shelby County. A geologic data-
base was developed and is  constantly being updated
from borehole geophysical logs made in the area. Well
locations are being field verified using a global position-
ing system (GPS).

Use of the database has allowed the development of a
three-dimensional model of the subsurface of the Mem-
phis area. The database also contains  locations of and
information on both private and public production and
monitoring wells; Superfund sites; underground storage
tanks; city and county zoning; land use; and other per-
tinent  information. Water quality measurements for
every MLGW production well have been obtained, and
a history of water quality for the Memphis Sand is being
developed. Procedures have been developed for linking
the database to ground-water flow and solute transport
models (1). The  data visualization capabilities  and the
ability to link information to geographic features make
geographic information systems (CIS) an ideal medium
for solving ground-water problems.

GIS Database

The GWI has developed and is continuing to update a
hydrogeologic   database  for the   Memphis  area.
ARC/INFO (marketed  by Environmental Systems Re-
search Institute,  Redlands, California)  is the GIS pro-
gram that the  GWI is using.  The program  runs on a
network of 10 SUN SPARC stations. The capabilities of
ARC/INFO and the computational speed of the SPARC
stations allow very sophisticated ground-water analyses
to be performed and have allowed the development of
an extensive electronic database.
The basic unit of data storage in ARC/INFO is a cover-
age. A coverage is a digital representation of a single
type of geographic feature (e.g., points may represent
wells, lines may represent streets or equipotential lines,
and polygons may represent political boundaries or zon-
ing classifications). Information may be associated with
an individual  geographic feature in a feature attribute
table. This information may then be queried and used in
analyses. ARC/INFO also has its own macro language
that allows the customization and automation of many
ARC/INFO procedures.
A relatively new feature of ARC/INFO is address match-
ing. This procedure compares a file containing the street
address of a  particular feature with an address cover-
age. This  coverage is basically  a library of addresses
that are linked to a geographic coordinate. As the ad-
dresses from the input file are compared with the ad-
dress coverage, the matching points are written to a
second coverage. Any addresses in the input file that do
not match an address in the address coverage  are
written to a "rejects" file. These can be matched by hand
on a one-by-one basis.
Address matching serves as an alternative to digitizing,
as long  as a good address coverage for a specific area
exists. The GWI has used this capability extensively and
has developed a coverage of underground storage tank
(UST) locations inside Shelby County. A database of
private and monitoring wells is  also  being developed
and updated using address matching. The raw informa-
tion was obtained in an ASCII format from the appropri-
ate regulating agencies (i.e., the Tennessee Department
of Environment and Conservation  Division of Under-
ground Storage Tanks and the Memphis/Shelby County
Health Department).  The ASCII information  was im-
ported into ARC/INFO and address matched. The crea-
tion of a suitable address coverage and completion of
the address matching of the UST file has taken almost
2 years. The private well coverage is currently  being
updated from historical information provided by regulat-
ing agencies and local well drilling companies.
An important  part of the database is the geologic infor-
mation  obtained from geophysical  logs  in the  area.
Gamma logs, resistivity logs, and spontaneous potential
(SP) logs are three major types  of electric geophysical
logs. Gamma logs measure naturally occurring radiation
emitted  from soil in the borehole. Clays and shales emit
gamma  rays.  A high gamma count  indicates the pres-
ence of clay or shale, and a low gamma count implies
that little or no clay is present. Sand layers that contain
fresh water are located using resistivity logs. Maximum
values of  resistivity indicate  the possibility of a sand
                                                  51

-------
layer. Clays and sands that contain  salt water may
exhibit similar resistivities. SP logs are  used to differen-
tiate between the two (2).

Corroborating data  such as formation logs, geologic
studies, and available material samples should be con-
sulted when reading and interpreting geophysical logs.
The accuracy and reliability of an application  based on
well logs is completely dependent on a realistic interpre-
tation of the geophysical data. A sample interpretation
of a set of geophysical logs is shown  in Figure 2. The
results of interpretations like this are  entered into the
point attribute file of a well coverage.

The Triangulated Irregular  Network (TIN) module of
ARC/INFO is used to create  a three-dimensional sur-
face from information stored in a coverage. TIN creates
a surface from a set of nonoverlapping triangles defined
by a set of irregularly or regularly spaced points.  In this
study, the points defining the triangular TIN surfaces are
the locations of the wells in the model area.  TIN uses
various interpolation routines to estimate surface val-
ues. Once the surfaces have been developed, two-di-
mensional profiles can  be made that show the relative
thicknesses of the various soil strata, as shown in Figure
3. These profiles aid in selecting boundary conditions
and defining layers in ground-water flow models (3, 4).

In addition to the creation  of profiles, a  process that
extracts surface values for  use in a ground-water flow
model  has been developed. The GWI  uses the United
States Geological Survey (USGS)  flow model,  MOD-
FLOW (5). Being a cell-based model, MODFLOW re-
   Gamma
              SP
                    Resistivity
Figure 2.  Example interpretation of geophysical logs.
quires a value for each  hydrogeologic parameter for
each  cell in the model grid. A series of FORTRAN  pro-
grams and arc macro language (AMI) programs were
coupled to extract the required hydrogeologic data from
surface models. For example, piezometric surface val-
ues are required to set initial conditions for the model. A
coverage  of the piezometric surface of the Memphis
Sand  was created, converted to a TIN surface, and the
required values for each cell in the model were extracted
using  the procedure described above.

The results and hydrogeologic data from the calibrated
model can be read back  into the database and con-
verted into coverages. This allows piezometric contours
to be  developed and displayed with other information in
the database to aid in decision-making. Also, capture
zones forthe wells can be brought into the database and
compared with surface features like industries, landfills,
Superfund sites, LIST locations,  or other sites that  may
have  an impact. This has proved especially helpful in
developing wellhead protection programs where a com-
plete  contaminant source inventory must be performed
for the  capture zone  of each well and within  a fixed
radius around the well. The procedure used to develop
model data from  the CIS  database is summarized  in
Figure 4.

Me Cord Wellfield Wellhead
Protection Program

MLGWand the GWI performed a demonstration project
funded  by the U.S. Environmental Protection Agency
(EPA) for the C.M. McCord Wellfield Wellhead  Protec-
tion Program (6). This wellfield was selected because of
multijurisdictional problems that will be encountered  dur-
ing plan implementation. The City of Memphis owns all
the wells, but many of them, and all future well lots, are
located within the city limits of Bartlett.  A wellhead  pro-
tection plan will have to involve the cooperation of both
municipalities. The existing wellfield is shown in Figure 5.

Tennessee wellhead  protection regulations require the
delineation of two  zones of protection for a city the  size
of Memphis: a 750-foot radius around the wellhead  and
a 10-year capture zone for the well. The 10-year capture
zone  (called the Zone 2 area) was delineated using two
flow models and  information obtained  from the GWI
database.  Results were imported into the CIS database
and compared  with existing  information. The Zone 1
area was delineated by buffering each well point in the
coverage by the appropriate  radius. Each well location
was verified using a Trimble  GPS unit  and is accurate
to within 2 meters. A contaminant source inventory was
performed using the  coverages  developed  by address
matching.  The primary potential sources of contamina-
tion in this area are USTs.  A windshield survey  located
other  potential sources,  such as dry cleaners. These
locations were entered into the database also by using
                                                  52

-------
                 Surficial Aquifer
                 Confining Layer/Clay Lens
Transitional Layer


Memphis Sand
Figure 3.  Two-dimensional profile constructed from surface models.
                                Study Area Defined and
                                 Model Grid Developed
                                    Within ARC/INFO
                                         «s*
                                          *e
                 Model Input Files
                    Developed
                               V
                             Model Calibration
                              and Verification
 the address-matching capabilities of ARC/INFO.  The
 Zone 2 areas for present wells and future wells, along
 with the potential sources of contamination, are shown
 in Figure 6.

 Davis Wellfield Study

 An  example of CIS use in ground-water flow modeling
 is the study of the Justin J. Davis Wellfield. The Davis
 Wellfield is  one  of 10 producing fields operated by
 MLGW It  is located  in the  southwestern  corner of
 Shelby County and consists  of 14 wells, as  shown in
 Figure 1.  Production at  the  Davis  Wellfield  began in
 1971, and an estimated 13 million gallons per day are
 currently withdrawn from the Memphis Sand aquifer.
 Since  1972,  MLGW has  collected  water quality data
 from the wells at the Davis Wellfield, including values for
 alkalinity, hardness, chloride,  sulfate, iron, and barium.
 Water quality parameters of alkalinity, hardness, sulfate,
 and barium have significantly increased in the past 10
 years.1  A possible explanation for the change in water
 quality is water leakage from  the upper aquifer through
 the confining unit to the  Memphis Sand aquifer.1 The
 water chemistry from the two aquifers is noticeably dif-
 ferent. The surficial aquifers generally  have a higher
 total dissolved solids concentration, hardness, and alka-
 linity than water from the Memphis Sand.1
Figure 4.  Procedure for integrating GIS and flow model.
  Webb, J. 1992. Memphis Light, Gas, and Water Division. Personal
 interview.
                                                   53

-------
Shelby |
County
Well Location
Pumping
Station
Municipal Boundary
Figure 5. Existing McCord Wellfield.

MLGW, the GWI, and the USGS participated in a joint
investigation of the wellfield to determine why the water
quality changes are occurring. In the spring of 1992, 12
monitoring wells were drilled into the surficial aquifer
near the production wells, as shown in Figure 7.

Geophysical logging and split-spoon sampling revealed
an absence of the confining layer at one monitoring well,
GWI-3. All other wells penetrated various thicknesses of
clay. This "window" in the confining layer suggests that
the water quality changes could be due to leakage from
the surficial aquifer to the Memphis Sand. The logs for
these monitoring wells were combined with an existing
geophysical log coverage. The extent of the confining
layer window was estimated using CIS surface model-
ing capabilities, as shown in Figure 8.
Two-dimensional profiles were created to further show
the extent of the confining layer window. The locations
of the profiles in relation to various surface features are
shown in Figure 9. Profiles 1 and 2 were taken across
the river bluff, and Profiles 2 and 3 were taken across
the window. The profiles are shown in Figure 10.

Many important features of this area's geology can be
inferred by looking at the profiles. A connection of the
alluvial and fluvial aquifers is shown in Profile 2. Else-
where along the bluff, the connection of the two aquifers
is less prominent, as shown in Profile 1. The connection
of the two aquifers in Profile 2 may be the cause of a
peculiar mounding effect in the water table of the alluvial
aquifer in that area. The thinning of the top soil in Profiles
1 and 3 may indicate a local recharge area for the
54

-------
                 0      5      10

               Scale in Thousands of Feet
10 -Year Capture Zones
        Developed Wellfield
        Fully Developed Wellfield
        Underground Storage Tank
        Aboveground Storage
Figure 6.  Underground storage tanks and aboveground storage locations.
alluvial aquifer. Profiles 2 and 3 show the confining layer
window that suggests a connection between the surficial
aquifer and the Memphis Sand.

Following the convention of the USGS, a "window" is
defined as any area where the aggregate clay thickness
is less than 10 feet (8). Asurface of the thickness of the
confining layer was generated from the geophysical log
coverage. The surface model was converted to a con-
tour line coverage on a 5-foot interval. Using the ARC-
EDIT module of ARC/INFO, the contour line coverage
was converted  to  a polygon  coverage.  The area
bounded by the 10-foot  contour of the surface model
was calculated to be 840,000 square feet  (about 19
acres). The area was calculated by adding  the areas
between the 10- and 5-foot contours and the area within
the 5-foot contour.

A flow model of the  area was developed based on  the
hydrogeologic data contained in the database. A steady
state model was calibrated  to hydraulic conditions re-
corded during fall 1992 by the USGS and the GWI;  the
root mean square (RMS) error for this model was 1.76
feet. A second steady state model was developed to
simulate conditions recorded during spring 1993 (a  pe-
riod of high water levels in area lakes). The RMS error
for this simulation was 5.19  feet. This higher error may
 indicate that the high water levels in surface water bod-
 ies are not realistic for steady state boundary conditions.
 Realistically, monitoring wells that are relatively far from
 a surface water body  are affected  more  by average
 water levels over time ratherthan relatively short periods
 of highs and lows.

 Using average values of hi, h2 (head in upper and lower
 aquifers), / (vertical flow distance), and VCONT (a pa-
 rameter used in MODFLOWto allow for vertical conduc-
 tance), the estimated flow rate through the window for
 fall 1992 may be computed as:
      k = VCONT 1 = 1.76e-3 x 199.2 = 0.351
                                        ft
                                       day
             h!-h2   186.5-156.1
               I
                       199.2
                                  = 0.153
    Q = kAi = 0.351 x 840,000x0.153 = 45,111
                                           day
45,111 -- x 7.48
     day
                                      = 0.34 MGD
The flow rate calculated from average spring 1993 (a
period of high water levels in the surficial aquifers)
model results was computed as:
                                                  55

-------
 Horn Lake Cutoff
                         Sewanee Road
                                                              Horn Lake Cutoff
                                       Raines Road
                                     Shelby Drive
                       Legend
                  A      GWI Well

                 mi        Water Body

                 NUU^UU^  BlUff

                 	  Roads
                01234

               Scale in Thousands of Feet
Figure 7.  Location of GWI monitor wells.
      k = VCONT I = 176e-3 x 199.2 = 0.351 -^-
                                          day

          .  h!-h2  196.4-156.8
I
                         199.2
  Q = kAi = 0.351 x 840,000 x 0.199 = 58,673


58,673 -ft- x 7.48 gall°ns= 438,874 -  _,
      day         ff              day
                                            day
Using the GIS-delineated window, an estimated 0.34 to
0.44 million gallons per day flow from the alluvial aquifer
to the Memphis Sand aquifer. This variation in the flow
rate was due to seasonal variations of water level in the
alluvial aquifer.
Since the wellfield pumps approximately 13 million gal-
lons per day, the effect of the window on the entire Davis
Wellfield may not be significant. The window lies within
the  30-year capture zone of two wells in the field, however.
                                                                                             Raines Road
                                                                                             Shelby Drive
                                                                Legend
                                                         Water Body

                                                         Bluff
                                                                   A
                                                                                      GWI Well

                                                                                      Roads
                                                       Scale in Thousands of Feet


                                                        Thickness of Confining Unit
                                                                < 5 Feet
                                                                 5 to 10 Feet
                                                                10 to 20 Feet
                                                                20 to 30 Feet
                                                                30 to 40 Feet
                                                                > 40 Feet
                                       Figure 8.  Location and extent of window in confining unit.

                                       The flow rate through the window is approximately 20
                                       percent of the total production of these two wells. Addi-
                                       tionally, since the wells would probably not operate si-
                                       multaneously, the flow  rate through the window may
                                       account  for  approximately 40 percent of the flow  at
                                       either well.

                                       Particle tracking  in the  Memphis  Sand was developed
                                       using MODPATH (9) from MODFLOW results. Particles
                                       were placed at model  well screens  and tracked back-
                                       ward for 30 years. The output from MODPATH was read
                                       into the CIS database for comparison  with other data,
                                       as shown in Figure 11. The hole in the confining unit lies
                                                    56

-------
                                                 Legend
                               Bluff   	 Roads  —™~, profile

                                          0    5   10    15   20

                                          Scale in Thousands of Feet
                     Water Body
Figure 9.  Location of selected profiles.

within the 30-year capture zone of MLGW wells 418 and
419. Historically, 419 was the first well that experienced
water quality changes. The water quality  is becoming
similar to water found in the alluvial aquifer.

A change in the water quality  in well  418 is  not as
immediately noticeable as in  well 419.  This inconsis-
tency in  data may indicate that the  window does  not
extend northward from GWI-3, as the TIN model pre-
dicted. To determine which  capture zone (418 or 419)
encompasses GWI-3, particles were  tracked backward
for 40 years, as shown in Figure 12.  GWI-3 lies on  the
edge of the  capture zone for 419. The flow lines from
418 and 419 move toward the northwest  and southwest
in the Memphis Sand, pass up through the window, and
emerge in the upper aquifer.

Conclusions

The explanation of the Davis Wellfield investigation  ad-
dressed some limitations of the database. The utility that
this hydrogeologic database provides greatly outweighs
the disadvantages, however. Without the ability to map
and define hydrogeologic features, this project may not
have been completed in the allotted time frame or may
not have been completed in the same level of detail. CIS
greatly enhances the  development and evaluation of
ground-water flow models.

Specific conclusions that can be drawn from the analy-
sis performed in this project are:

• The delineation of a window in the confining layer
  using  a CIS  database is possible.

• Based on  the CIS-generated window, an estimated
  0.34 to 0.44 million gallons  per day flow from  the
  upper aquifer  to  the  Memphis  Sand,  which may
  account for  as  much  as 40  percent of the flow at
  either well 418 or 419.

• The drilling  of more monitoring wells north,  south,
  east,  and  west of GWI-3 may  provide for a more
  accurate delineation of the window.
                                                  57

-------
                                               East
                        Profile 1
                                               East
   West
                        Profile 2
                        Profile 3
         01234
                                   Top Soil
Horizontal Scale in Thousands of Feet    r^™. gg^ an(j Qrave|

                          g|| Confining Clay
             Q1234
      Vertical Scale in Hundreds of Feet
                                   Memphis Sand Aquifer
Figure 10.  Selected subsurface cross sections in the Davis
          Wellfield.
                           Sewanee Road
                                     N
                                     A
                       Legend
* MLGW Production Well — -
U Window in Confining Unit — — —
01234 __


— Flow Path
— Water Body
- — Bluff


General conclusions that may be drawn from this dis-
cussion are:

• CIS provides a convenient method of viewing flow
  model input and output.

• A flow model may be developed  and evaluated in  a
  relatively short time using CIS.

• CIS provides a convenient means of compiling and
  managing the information required to develop a well-
  head protection program.

Some CIS disadvantages that have been noted are:

• The time required to develop a database and learn
  to apply the CIS program in a particular situation may
  be  prohibitive.

• CIS-generated results from a limited database may
  be  misleading and should be corroborated with other
  analysis methods.

Acknowledgments

The authors would like to acknowledge the efforts of the
professors and students of the GWI, both present and
past,  for their  contributions  to the  database  and this

                                 Sewanee Road
                                                           Particles From Well
                                                           418 Emerging in  ^
                                                           the Upper Aquifer >|
                                                               GWI-3
                                                                            Legend
•
*•
Sc:
GWI Well 	
Window in Confining Unit 	
n 1 •> n a


	 Flow Path
	 Flow Path (#41 9)



Figure 11.  Backward tracking for 30 years.
                                                   Figure 12.  Backward tracking for 40 years.
                                                     58

-------
project, especially: Dr. John W. Smith, Director, and Dr.
Charles  V.  Camp  for their patience and  guidance  in
interpreting the model results and hydrogeologic condi-
tions  of the  area; and  David W.  Kenley,  Brian  A.
Waldron, and Robert B. Braun for their help in creating
the well  log database.

References

1.  Camp,  C.V.,  J.E. Outlaw,  and  M.C. Brown. 1994. CIS-based
   ground-water modeling. Microcomputers in Civil Eng. 9:281-293.
2.  Driscoll, F.G. 1986. Groundwater and wells. St. Paul, MN: Johnson
   Division.
3.  Camp, C.V., and M.C. Brown. 1993. A CIS procedure for develop-
   ing a three-dimensional subsurface profile. J. Computing  in Civil
   Eng. (July).
4.  Camp, C.V., and J.E. Outlaw. 1993. Constructing subsurface pro-
   files using CIS. Adv. in Eng. Software. 18:211-218.
5.  McDonald,  M., and A. Harbaugh. 1988. A modular three-dimen-
   sional finite-difference ground-water flow model. Open File Report
   83-875. U.S. Geological Survey.

6.  Palazolo, P.J., J.W. Smith, J.L. Anderson, and C.V.  Camp. 1994.
   C.M.  McCord  wellhead  protection  demonstration  grant. Final
   report.  Herff College  of  Engineering Ground  Water Institute,
   University of Memphis, Memphis, TN.

7.  Richardson, G. 1989. A study of potential sources of leakage into
   the Memphis sand aquifer beneath the Davis well field in Memphis,
   Tennessee. M.S.  thesis. Memphis State University, Memphis, TN.

8.  Parks, W.S. 1990. Hydrology and preliminary assessment of the
   potential for contamination of the Memphis aquifer in the Memphis
   area,  Tennessee. Water Resources  Investigation Report 90-4092.
   U.S. Geological Survey.

9.  Pollock, D.  1989. Documentation of computer programs to com-
   pute and display  pathlines using results from the U.S.  Geological
   Survey  modular three-dimensional finite difference ground-water
   flow model. Open File Report 89-381. U.S. Geological  Survey.
                                                           59

-------
MODRISI: A PC Approach to GIS and Ground-Water Modeling
Randall R. Ross
Robert S. Kerr Environmental Research Laboratory, U.S. Environmental Protection Agency,
Ada, Oklahoma

Milovan S. Beljin
University of Cincinnati, Cincinnati, Ohio
Abstract

It is widely accepted that ground-water contamination
problems cannot be adequately defined or addressed
until the governing physical, chemical, and biological
processes affecting the transport and fate of contami-
nants are adequately characterized. Recent research
has led to a better understanding of these complex
processes and their effect on the movement of contami-
nants in the subsurface. The compilation and application
of such information has yet to be accomplished at many
hazardous waste sites, however. Too often, copious
quantities of data are collected, only to be stored, ig-
nored, or misplaced, rather than used for problem-solving.
Geographic information systems (GIS) are computer-
based tools that are relatively new to many environ-
mental professionals. GIS allows the manipulation,
analysis, interpretation, and visualization of spatially re-
lated data (e.g., hydraulic head, ground-water velocity,
and contaminant concentration). GIS is more than a
cartographic utility program, however. The analytical ca-
pabilities of GIS allow users to display, overlay, merge,
and identify spatial data, thereby providing the basis for
effective environmental decision-making.

IDRISI is a widely used PC-based raster GIS system
that provides numerous analytical capabilities that are
directly applicable to hydrogeologic studies. Raster sys-
tems are particularly well suited for analysis of continu-
ous data such as elevation (e.g., water table, land and
bedrock surfaces), precipitation, recharge, or contami-
nant concentrations and may be readily integrated with
finite-difference ground-water models. Because the for-
mats for IDRISI and ground-water model input data sets
are different, a need exists for a program to integrate
these two types of robust tools.

MODRISI is a collection of utility programs that allows
easy manipulation and transfer of data files between
IDRISI and ground-water models (e.g., MODFLOW,
ASM, MOC). In addition, MODRISI integrates other
widely used commercial and private domain software
packages, such as SURFER, Geopack, GeoEas, Auto-
CAD, CorelDraw, and various spreadsheet programs.
Two-dimensional arrays of models' input data sets can
easily be created from IDRISI image files. AutoCAD
vector files obtained by digitizing model boundaries, well
locations, rivers and streams, or U.S. Geological Survey
digital elevation model (DEM) files can also be trans-
lated into model input file formats. MODRISI can proc-
ess model output files and prepare GIS image files that
can be displayed and manipulated within IDRISI. Thus,
MODRISI is more than a pre- and postprocessor for
ground-water models; it is a complete GIS/ground-water
modeling interface that is accessible to most ground-
water hydrologists.

Introduction

Hydrogeologists collect and analyze large volumes of
data during a ground-water modeling process. These
data are stored and presented in many different forms
such as maps, graphs, tables, computer databases, or
spreadsheets. To most hydrogeologists, geographic in-
formation systems (GIS) are relatively new tools. They
have been developed and applied in other natural and
social science fields for over two decades, however, and
can also be used in the ground-water modeling process.

GIS represents a new, powerful set of tools that can
significantly improve the usefulness of results obtained
during the ground-water modeling process. Bridging the
disciplines of ground-water modeling, computer graph-
ics, cartography, and data management, GIS represents
a computer-based set of tools to display and analyze
spatial data (e.g., water level elevations, ground-water
quality data, modeling results, ground-water pollution
potential). Efficient use of increasingly large volumes of
60

-------
data can be achieved only with powerful systems capa-
ble of acquiring information from a variety of sources,
scales, and resolutions.

CIS can  be defined as a computer-assisted system for
the efficient acquisition, storage, retrieval, analysis, and
representation of spatial data. Most CIS platforms con-
sist of numerous subsystems that perform the listed
tasks. The subsystems have the ability to query spatially
related information and incorporate statistical analyses
and modeling  of relations and their temporal changes
within the database. More than just a mapping system,
CIS allows the user to analyze spatially related data and
visualize results in either paper map form or graphically
on screen. The data to be analyzed are a collection of
spatial information  represented by points, lines,  and
polygons and their associated attributes (characteristics
of the features such as elevation or concentration). The
cartographic tools of CIS allow the analyst to display,
overlay, measure, merge, and identify the data to sup-
port a particular analysis. By allowing spatial data analy-
sis and display, CIS provides the means necessary for
effective environmental decision-making and implemen-
tation of environmental management plans.

CIS uses two  basic  map representation  techniques:
vector and  raster. Vector  representations describe fea-
tures with a number of connected points. Raster repre-
sentations subdivide a study area into a mesh of grid
cells, each cell containing either a quantitative attribute
value or feature identifier. Raster systems are well suited
for analysis of continuous data (e.g., water level eleva-
tions, infiltration and recharge rates). This makes raster-
based systems ideal for integration with ground-water
models that use regularly spaced nodes. The objective
of this paper is to illustrate such an  integration of MOD-
FLOW (1),  a widely used  U.S.  Geological  Survey
(USGS) finite-difference ground-water flow model, and
IDRISI (2), a raster-based CIS.

Previous Studies

To enhance understanding of a hydrogeologic system,
and also to develop a credible ground-water model of
the system, hydrogeologic features such as lithological
logs, recharge and withdraw rates,  estimates of spatial
distribution of hydraulic conductivity, or specific storage
can be plotted using CIS capabilities of data retrieval
and overlay options to interactively define an area of
interest (3-6). Two  previous studies that combined CIS
and ground-water modeling are briefly described.

Torak and McFadden (7) used CIS to facilitate finite-element
modeling of ground-water flow. Complex aquifer geometry
and irregularly distributed aquifer-system characteristics
that influence ground-water flow affect the design of the
finite-element mesh. CIS systems represent the com-
plex arrangement of nodes and elements and the dis-
tribution  of aquifer properties to provide input to the
flow model. Point-data coverages of pertinent aquifer
characteristics are rated from a relational database and
are displayed using CIS.

Contoured surfaces based on point-data coverages pro-
duce triangulated irregular networks (TINs) that are su-
perposed on the finite-element mesh to delineate zones
of elements  having  similar aquifer properties.  Zone
boundaries are identified using the contoured TIN sur-
face and by manually determining  where boundaries
align with the  element sides. The allocation of well
pumping rates  to nodes  in the finite-element mesh  is
performed  efficiently with CIS for  model input. Well
pumping rates  are accumulated  by element  from the
combined coverages of the pumping data and the mesh,
and  element data are distributed to the node points for
input. CIS is also used to prepare data for model input
and to assess the adequacy of the data priorto simulation.

Three-dimensional perspectives  showing TIN  cover-
ages of aquifer-property data are used to analyze and
interpret complexities  within the  flow  system  before
zonation. Additionally,  CIS is used to display computed
hydraulic heads over the finite-element mesh to produce
contour maps of the simulated potentiometric surface.
Because the node points in the finite-element mesh are
not arranged in an orthogonal fashion, such as a finite-
difference grid, a map display of the computed  values of
hydraulic head at the nodes is prepared for efficient and
accurate interpretation of simulation  results.

Harris  et al. (8)  conducted the Remedial Investiga-
tion/Feasibility Study (RI/FS) of the San Gabriel basin.
Vast amounts of  hydrogeologic data have been gath-
ered, and a comprehensive systematized CIS database
has been developed. The identified hydrologic bounda-
ries, recharge basins, stream locations, well locations,
and  contaminant distributions are some of the features
considered in developing a base map. The CIS-gener-
ated base map  has allowed development of a finite-ele-
ment grid for the basin. For each finite element, the initial
estimates  of the  hydraulic  conductivity, specific yield,
recharge rates, and other input parameters were provided.

Using  simple interfacing programs, the  retrieval CIS
nodal and elemental data were converted to required
formats for the  input files of the Couple Fluid, Energy,
and Solute Transport (CFEST) code. Simulated ground-
water levels  were compared with  the CIS-generated
potentiometric surfaces. In areas of wide variations be-
tween simulated and observed data, the zonal distribu-
tion  of  controlling   parameters  was  reevaluated,
analyzed, and updated. Data processing,  development
of input files for computerized analysis of ground-water
flow, and analysis of simulation  results with different
alternative  conceptualizations  is  time consuming and
tedious. Efficient use of CIS and CFEST not only eased
the  burden  of  conducting  multiple  simulations but
                                                   61

-------
reduced the probability of errors as well as the amount
of time and effort required for each simulation.

IDRISI
IDRISI is a grid-based geographic information and im-
age  processing system developed by the  Graduate
School of Geography at Clark University and supported
by the United Nations  Institute  for Training and  Re-
search (UNITAR) and the United Nations Environment
Programme Global Resource  Information  Database
(UNEP/GRID) (9).  IDRISI is a  collection of over  100
program  modules  that  are  linked  through a  menu
system. These programs are organized into several
groups:

• The  core modules provide data entry and database
  management capabilities.

• The  geographic  analysis  modules provide tools for
  database analysis.

• The statistical analysis modules allow statistical char-
  acterization of images.

• The  peripheral modules provide a series of utilities.

IDRISI and other raster-based systems divide data sets
into map layers; each layer contains data for a  single
attribute.  For the  example of a ground-water  model,
these layers could  correspond to the MODFLOW two-
dimensional arrays (e.g., initial water levels, transmis-
sivity distribution, IBOUND arrays, computed hydraulic
heads). IDRISI provides many analytical tools that are
useful in hydrogeologic studies.

Three of the  most  important categories of these tools
are database  query, map algebra, and context operator.
Asemihypothetical  case described below illustrates the
use of these analytical tools. IDRISI provides an  exten-
sive set of tools for image processing, geographic and
statistical analysis,  spatial decision support, time series
analysis,  data display, and  import/export and conver-
sion. In addition, as a set of independent program mod-
ules linked to  a broad set of simple data structures, the
system is  designed such that researchers may readily
integrate into  the system their own modules, written in
any programming language.

IDRISI uses three types of data files: image, vector, and
attribute.   Image files  contain  rasterized  information
relating to  a  spatial variable. Vector files contain the
coordinates of points, lines, and polygonal features. An
attribute file lists the identifiers of features and the asso-
ciated  attribute values.  Values  files can  be  extracted
from the existing  image files, or image  files can be
created from  existing values files. The values files can
be combined and stored  in a dBASE format. Each image,
vector, or attribute file has a corresponding documentation
file that contains information about the data file (e.g.,
title, number of rows and columns).
MODRISI: MODFLOW/IDRISI Interface

MODRISI  is a set of utility programs that allows the
transfer of data files between MODFLOW, IDRISI, Gold-
en Software SURFER, GeoEas, and other software.
Preparation of two-dimensional arrays  for the MOD-
FLOW input files is generally tedious and time consum-
ing. The arrays can  be  created easily from the IDRISI
image files, however. Thus, MODRISI serves as a pre-
processor for MODFLOW. For example, when the val-
ues of a variable are available only for irregularly spaced
points, interpolation  routines in SURFER or GeoEas
may be used to estimate the values of the variable on
regularly   spaced  grid-nodes.  MODRISI  translates
SURFER  or GeoEas files  into IDRISI  image files  for
manipulation,  analysis,  and display. IDRISI  recognizes
either latitude  and longitude geodetic  coordinates or
arbitrary Cartesian plane coordinates. IDRISI assigns
the lower left grid-block of a raster image as a zero-row,
zero-column block.

Vector files, such as model boundaries,  well locations,
and rivers, may be created within IDRISI  and translated
into a MODFLOW input file format. For example, the
location of a river may be digitized on screen in IDRISI.
The vector-to-raster function may be invoked, assigning
all blocks through which the river passes as river nodes.
Similarly,  the  positions  of wells may be digitized and
translated into the row-column positions and saved as a
MODFLOW input  file for the well  package. Once the
MODFLOW input files are prepared, MODFLOW simu-
lations may be initiated. The MODFLOW hydraulic head
output files may be read by MODRISI and modified to
create IDRISI  image files. Again, the image files may be
displayed and evaluated within IDRISI. Thus, MODRISI
is used as a postprocessor for MODFLOW.

Case Study

The utility of MODRISI was demonstrated at a hazard-
ous waste site. Previous investigations  provided site-
specific information,  including water level and bedrock
and land  surface elevations, which was  then analyzed
using GeoEas, a public domain geostatistical software
program.  These data were kriged to produce a grid of
regularly  spaced data. These data were imported into
MODRISI  and converted to IDRISI image files. IDRISI
was used to visualize the surfaces that GeoEas generated.

Several prominent features are obvious upon inspection
of the kriged bedrock topography (see Figure 1). Abed-
rock ridge trending northwest  to southeast is flanked by
a minor trough to the east and a  major trough to the
southwest. The outline of the site is visible on all figures.
The  kriged water  level elevation map  (see Figure 2)
illustrates  the general  hydraulic gradient to the west.
Land  surface elevations (see  Figure 3)  range  from
greater than 200 feet in the northeastern portion of the
site to a  low of 166 feet  on the western boundary.
                                                 62

-------
                                                                                                     70-80
                                                                                                     80-90
                                                                                                    90-100
                                                                                                   100-110
                                                                                                   110-120.
                                                                                                   120-130
                                                                                                   130-140
                                                                                                   140-150
                                                                                                   150-160
                                                                                                    (Feet)
                                      	                 --     -        -   fcfeMss&SSt:          310 Feet

Figure 1.  Bedrock elevation contour map derived from kriged data and transformed by MODRISI into IDRISI image file format.
                                                              ,.-,=.- - _f#>*
                                                              &^- "
                                                                                              168
                                                                                              1 69
                                                                                              1 70
                                                                                              -|y-|
                                                                                              -| 72
                                                                                              175 -
                                                                                              176 :
                                                                                             (Feet)
                                                        iiialK                                 310 Feet
Figure 2.  Water level contour map derived from kriged water level data and transformed by MODRISI into IDRISI image file format.

                                                          63

-------
165-170 i

170-175

175-180

180-185 :

185-190 .

190-195

195-200

(Feet)
310 Feet
Figure 3. Land elevation contour map generated from kriged data and transformed by MODRISI into IDRISI image file format.
Additionally, the value (e.g., elevation) and (x,y) coordi-
nates may be queried for any point, line, or area of an
image file.

The analytical capabilities of IDRISI are illustrated in the
following example. The saturated thickness was deter-
mined by subtracting the bedrock surface from the water
level surface. The results illustrate the spatial variability
of saturated thickness of the overburden aquifer (see
Figure 4). The results correspond favorably with the
general bedrock topography, as would be expected
given the relatively low hydraulic gradient across the
site. This OVERLAY (subtract) function may also readily
be used to evaluate the adequacy of model calibration.
For example, predicted values may be compared with
observed values graphically, allowing the modeler to
quickly visualize and identify areas of the model domain
requiring additional consideration and manipulation.

Conclusions

One of the most tedious tasks in a ground-water mod-
eling process is preparing input data and postprocess-
ing model results. CIS allows rapid incorporation and
evaluation of new site characterization information. The
proposed combination of IDRISI, a raster-based CIS
system, and MODRISI, a set of utility programs, could
significantly reduce the amount of time necessary for
entering data in required array formats. The visualization
capabilities of IDRISI in conjunction with MODRISI and
MODFLOW allow project managers to better under-
stand the three-dimensional nature of subsurface envi-
ronmental problems.

Acknowledgments

The authors wish to thank Chet Janowski, U.S. Environ-
mental Protection Agency Region I, for his support on
this project.

References

1. McDonald, M., and A. Harbough. 1988. A modular three-dimen-
sional finite-difference ground-water flow model: USGS techniques
of water resources, Book 6, Chapter A1.

2. Eastman, J.R. 1992. IDRISI: Technical reference. Clark University,
Worcester, MA (March).

3. Van Metre, P.C. 1990. Structure and application of an interface
program between a geographic-information system and a ground-
water flow model. USGS Open File Report 90-165. Denver, CO:
U.S. Geological Survey.

4. Hinaman, K.C. 1993. Use of a geographic information system to
assemble input-data sets for a finite-difference model of ground-
water flow. In: Proceedings of the AWRA Symposium on Geo-
graphic Information Systems and Water Resources, Mobile, AL,
(March 14-17). pp. 405-412.

5. Rifai, H.S., L.A. Hendricks, K. Kilborn, and P.B. Bedient. 1993. A
geographic information system (CIS) user interface for delineating
wellhead protection areas. Ground Water 31 (3):480-488.

6. Orzol, L.L., and T.S. McGrath. 1992. Modifications of the USGS
modular three-dimensional finite-difference, ground-water flow
model. U.S. Geologic Survey Open File Report 92-50.
64

-------

                                                                                                                 < 20

                                                                                                                20-30 &:.

                                                                                                                30-40 •  :

                                                                                                                40-50 r:v.;

                                                                                                                50-60

                                                                                                                60-70

                                                                                                                70-80

                                                                                                                80-90

                                                                                                               90-100 53

                                                                                                              100-110

                                                                                                               (Feet)
     lisis&s&BliifisiiSifi                  -, -----;--' :."•':- : •" .".-.'• -.-   -  •:=;•, -.6.;"i'•••':.;--,; "                                 310 Feet


Figure 4.  Saturated thickness contour map developed by subtracting bedrock image file from water level image file within IDRISI.
7.  Torak, L.J., and K.W. McFadden. 1988. Application of a geographic
   information  system  to finite-element  modeling of ground-water
   flow. In:  Proceedings of the CIS Symposium Integrating Technol-
   ogy and  Geoscience Applications, Denver, CO (September 26-30).

8.  Harris, J., S. Gupta, G. Woodside, and N. Ziemba. 1993. Integrated
   use of  CIS  and  three-dimensional,  finite-element model: San
   Gabriel  basin ground-water flow analyses.  In:  Goodchild, M., B.
   Parks, and  L. Steyaert, eds. Environmental modeling with CIS.
   New York, NY: Oxford University Press.
9.  Eastman, J.R., PA.  Kyem, J. Tolendano, and W. Jin.  1993. CIS
   and decision making. Explorations in geographic information sys-
   tems technology, Vol. 4. United Nations Institute for Training and
   Research, Geneva, Switzerland.
                                                              65

-------
   GIS in Statewide Ground-Water Vulnerability Evaluation to Pollution Potential
                               Navulur Kumar and Bernard A. Engel
       Department of Agricultural Engineering, Purdue University, West Lafayette, Indiana
Abstract

The ground-water vulnerability of Indiana to pollution
potential was evaluated using a geographic information
systems  (GIS)  environment.  The  Geographic  Re-
sources Analysis Support System (GRASS) and the
GRID submodule of ARC/INFO were used to conduct
the analysis and to identify and display the areas sensi-
tive to ground-water pollution potential. The state soils
geographic (STATSGO) database was employed to re-
trieve statewide soils information required for the analy-
sis. The information from the STATSGO database was
used in two models, DRASTIC (acronym representing
the following  hydrogeologic settings: Depth to water
table, aquifer Recharge, Aquifer media, Soil media,
Topography, Impact of vadose zone,  and  hydraulic
Conductivity of the aquifer) and SEEPAGE (System for
Early  Evaluation of Pollution Potential of Agriculture
Ground-Water Environments). These models employ a
numerical  ranking system and consider various hydro-
geologic settings that affect the ground-water quality of
a region.  Ground-water vulnerability  maps were pre-
pared for the state of Indiana based on DRASTIC and
SEEPAGE results. Continuing work is planned to deter-
mine the accuracy of the results by comparing the ex-
isting well-water quality data. The DRASTIC  Index and
SEEPAGE Index number (SIN) maps show great poten-
tial  as screening tools  for policy decision-making in
ground-water management.

Introduction

Ground-water contamination due to fertilizer and pesti-
cide use in agricultural management systems is of wide
concern. In 1989, reports of ground-water contamination
in New York wells led the U.S. Environmental Protection
Agency (EPA) to conduct a nationwide survey on well
contamination in the United States. These wells were
tested for presence of nitrate, pesticides, and pesticide
breakdown products (1). Statistically, the wells selected
represent  more  than 94,600 wells in  approximately
38,300 community water systems. Over 52 percent of
the community water systems and 57 percent of the
rural domestic wells tested contained nitrates (2).

Indiana has abundant ground-water systems providing
drinking water for 60 percent of its population. A study
on well-water quality detected pesticides in 4 percent of
wells tested in Indiana. Also, 10 percent of private wells
and 2 percent of noncommunity wells contained exces-
sive nitrate levels (3).

Statewide  maps showing  the areas  vulnerable  to
ground-water contamination have many potential uses
such as implementation of  ground-water management
strategies to  prevent degradation of ground-water qual-
ity and  monitoring  of ground-water systems.  These
maps will be helpful in evaluating the existing and po-
tential  policies for  ground-water protection.  Ground-
water models such as  SEEPAGE  (System for Early
Evaluation of Pollution Potential of Agriculture Ground-
Water Environments)  and  DRASTIC  (acronym repre-
senting the following hydrogeologic settings: Depth to
water table, aquifer  Recharge, Aquifer media, Soil
media, Topography, Impact of vadose zone, and hy-
draulic Conductivity of the aquifer) can be applied on a
regional scale to develop such maps.

The data layers required forthese models are commonly
available data such as pH and organic matter content.
For most states, the statewide ground-water vulnerabil-
ity maps generated using  DRASTIC were produced
from 1:2,000,000-scale  data  (4). EPA (2)  found that
these maps did not correlate well with the water quality
analysis performed for the national survey of pesticides
in drinking water wells. States need more detailed and
accurate maps to implement ground-water management
programs.  The state  soils geographic (STATSGO)
database at the 1:250,000-scale might be useful for
studies at a  larger scale.

The geographic information  systems (GIS) environment
is widely applied for diverse applications in resources
management and other areas. It offers the facilities to
store, manipulate, and analyze data in different formats
                                                66

-------
and at different scales. The DRASTIC and SEEPAGE
models can be integrated within the CIS environment to
produce the final ground-water vulnerability maps.

Objectives
The purpose of the study was to prepare maps showing
areas in  Indiana vulnerable to ground-water pollution.
This goal was accomplished  by considering  hydro-
geologic  factors in each region that affect the mobility
and leaching  of the  contaminant reaching the aquifer.
The prime objectives of this research were to:
• Evaluate Indiana's ground-water vulnerability to
  pollution potential using the DRASTIC and SEEP-
  AGE models:
  - Integrate and evaluate the models in a CIS envi-
    ronment (Geographic Resources Analysis Support
    System [GRASS]  ARC/INFO).
  - Develop a graphic user interface (GUI) in ARC/INFO
    to conduct the analyses.
• Compare  the  pollution  potential  map  from the
  DRASTIC model with the map developed using the
  SEEPAGE  Index  number (SIN).
• Validate the accuracy of the  present  approach by
  comparing the vulnerability maps with  the existing
  well-water quality  data sampled across the state.

DRASTIC
DRASTIC is a ground-water quality model for evaluating
the pollution potential  of large areas using the  hydro-
geologic settings of the region (4-6). EPA developed this
model in  the 1980s.  DRASTIC includes different hydro-
geologic  settings that influence a region's  pollution po-
tential. A hydrogeologic setting is a mappable unit with
common  hydrogeologic characteristics. This model em-
ploys a numerical ranking system that  assigns relative
weights  to  parameters that help  evaluate  relative
ground-water vulnerability to contamination.
The hydrogeologic settings that make up the acronym
DRASTIC are:
• [D] Depth to water table: Compared with deep water
  tables,  shallow water tables pose  a  greater chance
  for the  contaminant to reach the ground-water surface.
• [R] Recharge (net): Net recharge  is the amount  of
  water per unit area of soil that percolates to the aqui-
  fer. This is  the principal vehicle that transports the
  contaminant to the ground water.  Higher recharges
  increase the chances of the contaminant being trans-
  ported  to the ground-water table.
• [A] Aquifer media: The material of the aquifer deter-
  mines  the   mobility  of  the  contaminant traveling
  through it. An increase in travel time of the pollutant
  through the  aquifer increases contaminant attenuation.
• [S] Soil media: Soil  media is the uppermost portion
  of the unsaturated zone/vadose zone characterized
  by significant biologic activity. This, in addition to the
  aquifer media, determines the amount of water per-
  colating to the ground-water surface. Soils with clays
  and silts have larger water holding capacity and thus
  increase the travel time of the contaminant through
  the root zone.

• [T] Topography (slope): The  higher the slope, the
  lower the pollution potential  due to higher runoff and
  erosion  rates, which include pollutants that infiltrate
  the soil.

• [I] Impact of vadose zone: The unsaturated  zone
  above the water table is  referred to as the vadose
  zone. The texture of the vadose zone determines the
  travel time of the contaminant. Authors of this model
  suggest using the layer that  most restricts water flow.

• [C] Conductivity (hydraulic): Hydraulic conductivity of
  the soil  media determines the amount of water per-
  colating through the  aquifer to the ground water. For
  highly permeable soils, the travel time of the pollutant
  is decreased within the  aquifer.

The major assumptions outlined  in DRASTIC are:

• The contaminant is introduced at the surface.

• The contaminant reaches ground water by precipitation.

• The contaminant has the  mobility of water.

• The area of the study site is more than 100 acres.

DRASTIC evaluates pollution potential  based on the
seven hydrogeologic settings listed above. Each factor
is assigned a weight based on  its relative significance in
affecting the pollution potential. Each factor is also as-
signed a rating for different ranges of the values. Typical
ratings range from 1  to 10, and weights range from 1  to
5. The DRASTIC Index, a measure of pollution potential,
is computed by summation of the products of rating and
weights of each factor  as follows:

DRASTIC Index =
DrDw + RrRw + ArAw + SrSw + TrTw + Irlw + CrCw

where:

Dr  = Ratings for the depth  to water table
Dw = Weights for the depth to water table
Rr  = Ratings for different ranges of aquifer recharge
Rw = Weights for the aquifer recharge
Ar  = Ratings for the aquifer media
Aw = Weights for the aquifer media
Sr  = Ratings for soil media
Sw = Weights for soil media
Tr  = Ratings for topography (slope)
Tw = Weights for topography
Ir   = Ratings for the vadose zone
                                                  67

-------
Iw  = Weights for the vadose zone
Cr  = Ratings for different rates of hydraulic conductivity
Cw = Weights for hydraulic conductivity

DRASTIC assigns two different weights depending upon
the type of contaminant. Pesticides are given different
weights than general contaminants. In  assigning the
weights, DRASTIC considers the different properties of
pesticides as they travel through the vadose zone and
root zone of the soil media.

The higher the  DRASTIC Index, the greater the relative
pollution potential. The DRASTIC Index  is divided into
four categories: low, moderate, high, and very high. The
sites with high  and very high categories are more vul-
nerable to  contaminations and  hence should be re-
viewed by the site specialist. These weights are relative,
however. Low pollution potential does not necessarily
indicate that a  site is  free from ground-water contami-
nation. It indicates only that the site is less susceptible
to contamination than sites with high  or very high
DRASTIC ratings.

SEEPAGE

The SEEPAGE model is a combination of three models
adapted  to meet  the Soil Conservation  Service's
(SCS's) need to assist field personnel (7, 8). SEEPAGE
considers  hydrogeologic settings and physical proper-
ties of  the soil  that affect  ground-water vulnerability to
pollution potential. SEEPAGE is also a numerical rank-
ing model that  considers contamination from both con-
centrated and dispersed sources.

The SEEPAGE model  considers the following parameters:

• Soil slope

• Depth to water table

• Vadose zone  material

• Aquifer material

• Soil depth

• Attenuation potential

The attenuation potential further considers the following
factors:

• Texture of surface soil

• Texture of subsoil

• Surface  layer pH

• Organic matter content of the surface

• Soil drainage class

• Soil permeability (least permeable layer)

Each factor is assigned a numerical weight ranging from
1 to 50 based  on its  relative significance, with the pa-
rameter that has the most significant effect on water
quality assigned a weight of 50 and the least significant
assigned a weight of 1. The weights are  different for
concentrated or site-specific sources, and dispersed or
nonspecific sources.

Similar to DRASTIC, each factor can be  divided into
ranges and ratings, varying from 1 to 50. The ratings of
the aquifer media and vadose zone are subjective and
can be changed for a particular region. Once the scores
of the six factors are  obtained, they are  summed to
obtain the SIN. These values represent pollution poten-
tial, where a high SIN implies relatively more vulnerabil-
ity of the ground-water system to  contamination. The
SIN values are arranged into four categories of pollution
potential: low, moderate, high, and  very high. A high or
very high SIN category indicates that the site has signifi-
cant constraints for ground-water quality management (7).

GIS

CIS  has been  widely used for natural resources  man-
agement and planning, primarily during the past decade.
A GIS  can be combined with a ground-water quality
model to identify and rank the areas vulnerable to pol-
lution potential  for different scenarios and land use prac-
tices. Many GIS software packages are  available.
GRASS is a raster-based public domain software devel-
oped  by the U.S.  Army  Construction  Engineers Re-
search Laboratory (9). This software can assign different
weights to, or reclass, the data layers and combine map
layers, and is suitable for implementing the DRASTIC
and SEEPAGE models. ARC/INFO is a GIS  software
developed by Environmental Systems Research  Insti-
tute  (ESRI)  in Redlands, California.  The GRID sub-
module of ARC/INFO facilitates the handling  of raster
data. Also, the  capability to develop a menu-based GUI
helps users  easily implement the  models. The GRID
submodule also  can reclass and manipulate  the map
layers suitable  for conducting the analyses.

Methodology

Developing the Data Layers in GRASS and
ARC/INFO

The STATSGO database from SCS comes  at a scale of
1:250,000 and is distributed in different data formats.
This study  used  the  STATSGO database  in the
ARC/INFO format. The database is organized into map
units that have up to 21 components. These map com-
ponents have information  assigned to layers of soil ho-
rizons. Each layer is attributed various soil properties
such as pH or  organic matter content (10). Each  prop-
erty is assigned a high and a low value for a map unit.
The STATSGO map for Indiana is available  in the vector
format. This map was exported to GRASS as a vector
coverage (11) and was converted into a raster coverage
within the  GRASS GIS environment. This was used as
                                                 68

-------
the base map for the DRASTIC and SEEPAGE analy-
ses. The hydrogeologic parameters required for the
models  were identified from the corresponding INFO
data tables  and were exported into an ASCII file. Code
was developed to generate a GRASS  reclass file as-
signing the weighted values of the parameters to the
corresponding map units in the base map. The STATSGO
base map imported  into GRASS was reclassed for each
hydrogeologic setting (e.g., topography, pH) to create the
data layers required for DRASTIC and SEEPAGE analyses.

The map layers  of the  hydrogeologic parameters in
GRASS were then  exported to ARC/INFO as  raster
coverages.  A GRASS command  was  developed that
allows the output ASCII file from GRASS to be imported
into ARC/INFO directly without further modifications to
the header in the ASCII file.
Developing a Graphic User Interface in
ARC/INFO

The dynamic form-menu option (12) was used to develop
a GUI for both DRASTIC and SEEPAGE analyses (see
Figures 1 and 2). Because ratings for some parameters
are subjective, the GUI provided an option to change the
weights assigned to hydrogeologic settings. The cover-
ages must already be assigned ratings before using the
interface, however. The  interface also allows users to
reclassify the final vulnerability maps qualitatively (13) into
four categories (low, moderate, high, and very high) after
viewing the range of DRASTIC Index or SIN values.
Conducting the Analyses

The data layers were developed separately for the high
and low values of the hydrogeologic settings. Once all
the data  layers were compiled, the corresponding rat-
ings and weights were assigned and the analyses were
conducted using the GUI. The data layers aquifer re-
charge, aquifer media, and vadose zone media were not
available, so the analyses were conducted without these
base maps. The SEEPAGE analysis was performed for
concentrated/point sources of pollution. The final vulner-
ability indexes from the analyses were  classified  into
four categories (low, medium, high, and very high) (see
Table 1) to generate the final statewide vulnerability maps.

Table 1. Pollution Potential Categories Using SEEPAGE and
       DRASTIC Indexes
               Range of DRASTIC/SEEPAGE Index
Analysis
            Low
Moderate   High
Very High
SEEPAGE
DRASTIC
1-24
30-70
25-48
71-100
49-70
101-110
>70
> 110
Validating the Accuracy of the
Vulnerability Maps

The  ground-water vulnerability  maps  produced  by
DRASTIC analysis were compared with those generated
using the SEEPAGE model. The final statewide ground-
water vulnerability maps from either approach  were
compared with the well-water quality data sampled from
over 2,500 wells (see Figure 3), and the number of wells
falling into each vulnerability category was tabulated.

Results and Discussion

Statewide analysis of ground-water vulnerability to pol-
lution potential was conducted using the DRASTIC and
SEEPAGE analyses at a scale of 1:250,000 in the raster
format. The analyses were conducted for both the high
and  low values of the hydrogeologic  settings, and the
final  vulnerability maps were prepared for the state of
Indiana (see Figures 4 and 5). The vulnerability maps
from both  approaches were compared in the GRASS
environment.

In both analyses, the low values of hydrogeologic set-
tings resulted in more areas being classified as high and
very  high categories, compared with the  high values of
hydrogeologic settings in a map unit. The DRASTIC
analysis placed more areas in the very high vulnerability
category, compared with the SEEPAGE analysis, which
categorized the  same  areas as high  vulnerability.  The
nitrate-nitrogen concentrations observed in the wells
were compared with the final vulnerability maps, and the
number of wells falling into each of the four vulnerability
categories (low, moderate, high, and very high) were sum-
marized (see Tables 2 and 3).

Approximately 80  percent of the wells with concentra-
tions less than 5 parts per million are classified under
the moderate vulnerability category in SEEPAGE analy-
sis. Overall, the results from the analyses did not corre-
late satisfactorily with the observed well-water quality
data. Unavailability of the data layers aquifer media,
aquifer recharge,  and vadose zone media may  have
caused these results. The well-water quality data of
nitrate-nitrogen contaminations was considered only for
testing map accuracy, whereas the  analyses  do not
account for the type of contaminant, its severity, and its
volume in  the generation of vulnerability maps. Other
limitations  of these approaches, including that the fac-
tors influencing aquifer contamination (e.g., direction of
water flow, land use, population at risk, point sources of
pollution) are not considered in the ground-water vulner-
ability evaluation, might also have led to the observed
results.

The DRASTIC and SEEPAGE analyses can be improved
by incorporating data layers such as land use and nitrate
loadings in computing the DRASTIC and SEEPAGE
Indexes.  The STATSGO database can be used for
                                                 69

-------
Figure 1.  GUI for DRASTIC analysis.
                                                       70

-------
                "•*    fff   VS   r     t .   tf        f   f   f      rr,
        ,„'„ fff'f f f  ff/'sf ,f",,, ', „'$    '{  f" f x"x*.   ' •.,•.  f , f f f'ff f, ixxxi  ;
     *-  t*$   \tr   ^   fff  ^,«'  «-   »>  sv  ^;   ,  , w f fit   ff   ff
       *   jf f    A-J:   ' *sf  •. fffff   j-j-f   fSff jff ^  t  v. f f •. fj?   ff ^ fff *& fffe

       -'& ,^r~  -^'/;-^/   ,
     T* ' «.'% **
     e*  4^.<  .• .
               frfft   AV ^
                                                                   ^ %*'$• ^V*^^.;|^r^ *" f *H '  ^"

                                                                  '*  'x,,''^  i'«^Si*t'^ry'fl«ifeu- 1T'»   '"'
                                                                       v  f $z&f*~. M5^^f ^^ ^""   ''"
                                                                         5   ^  „  ^   /A'  f,  "f,f  ,%.,
                                                                         •",'  » vsxj-x ^,xx   ,. -,;   ^
                                                                  B  #  ~, ^  'xi*'  x  ,"ff  ff   ;;_  -5, ^ ';,

                                                                  \^~*:*.::*z'i  *:.\  *.
Figure 2.  GUI for SEEPAGE analysis.
                                                    71

-------
Figure 3.  Sampling sites (wells) for water quality data.
                 Low


                 Moderate
High


Very High
                                      ^ Low


                                      1H Moderate
High


Very High
                                                              Figure 5.  Ground-water vulnerability  map  using SEEPAGE
                                                                        analysis.
                                                              Table 2.  Comparison of SEEPAGE Results With Observed
                                                                       Nitrate-Nitrogen Concentrations in Wells

                                                                                        Nitrate-Nitrogen Levels
Figure 4.  Ground-water  vulnerability  map  using  DRASTIC    yerv hjqh
          analysis.
Category
Low
Moderate
High
Very high
Table 3.
Category
Low
Moderate
High
Very high
0-5 5-10
7 2
1 ,322 76
249 11
194 2
>10
8
384
1
64
Comparison of DRASTIC Results With Observed
Nitrate-Nitrogen Concentrations in Wells
Nitrate-Nitrogen Levels
0-5 5-10
11 2
541 41
720 39
500 9

> 10
9
290
213
65
                                                           72

-------
developing most of the data layers required for the
analyses.  CIS is a useful tool  for integrating  ground-
water quality models and facilitates testing the models for
different scenarios.  The GUI helps users easily conduct
analyses and facilitates changing the weights for subjec-
tive hydrogeologic settings. DRASTIC and SEEPAGE ap-
proaches show great potential as screening tools for policy
decision-making in ground-water management.

Summary

Ground-water pollution from  agricultural management
systems  is  of wide  concern.  Few  models   address
ground-water  vulnerability on  a  regional  scale.  The
DRASTIC and SEEPAGE models are numerical ranking
models that consider various hydrogeologic settings af-
fecting the contamination of a  region. The data  required
for these models are commonly available data, and the
STATSGO database at 1:250,000-scale was used in this
study. These models were integrated in the CIS environ-
ment of GRASS and ARC/INFO in the raster format. A
menu-based GUI was developed in ARC/INFO for con-
ducting the analyses. The vulnerability maps generated
from DRASTIC  and SEEPAGE  analyses were com-
pared. The statewide vulnerability  maps also were com-
pared  with the well-water quality data  to  validate the
accuracy of the models.

References
 1. U.S. EPA. 1990. National pesticide survey: Phase I report. Wash-
   ington, DC.
 2. U.S. EPA. 1992. Another look: National survey of pesticides in
   drinking water wells, Phase II report. EPA/579/09-91/020. Wash-
   ington, DC.
 3. Department of Environmental Management, Groundwater Sec-
   tion. 1989. Indiana ground-water protection: A guide book (June).

 4. U.S. EPA. 1987. DRASTIC: A standardized system for evaluating
   ground-water  pollution  potential using  hydrogeologic  settings.
   EPA/600/2-87/035. Washington, DC.

 5. U.S. EPA. 1985. DRASTIC: A standardized system for evaluating
   ground-water  pollution  potential using  hydrogeologic  settings.
   EPA/600/2-85/0108. U.S. EPA, Robert S. Kerr Environmental Re-
   search Laboratory, Ada, OK.

 6. Deichert, L.A., and  J.M. Hamlet.  1992. Nonpoint ground-water
   pollution potential in Pennsylvania. ASAE Paper No.  922531.
   American Society of Agricultural Engineers International Winter
   Meeting, Nashville, TN.

 7. Richert, S.E.,  S.E. Young, and C. Johnson. 1992. SEEPAGE: A
   GIS model for ground-water pollution potential. ASAE Paper No.
   922592. American Society of Agricultural Engineers International
   Winter Meeting, Nashville, TN.

 8. Engel, B.A., and D.D. Jones. 1992. Technique for  developing
   ground-water  vulnerability to nitrate maps for large  areas: Re-
   search proposal 12-1992.

 9. U.S.  CERL. 1990. GRASS: Geographical Resources  Analysis
   Supporting System  user's manual.  Champaign, IL:  U.S. Army
   Construction Engineers Research Laboratory.

10. SCS. 1992. State Soils Geographic Database (STATSGO) user's
   guide. Soil Conservation Service Publication No. 1492.

11. ESRI. 1992. ARC/INFO command references: Arc command ref-
   erences. Redlands, CA: Environmental Systems Research Institute.

12. ESRI. 1992. ARC/INFO  user's guide: AML user's guide. Red-
   lands, CA: Environmental Systems Research Institute.

13. ESRI. 1992. ARC/INFO user's guide: Cell-based modeling with
   GRID. Redlands, CA: Environmental Systems Research  Institute.
                                                       73

-------
   Verification of Contaminant Flow Estimation With GIS and Aerial Photography
                                        Thomas M. Williams
                         Clemson University, Georgetown, South Carolina
Abstract

Estimation of contaminant movement in ground water
requires interpolation of data from sampling wells that
represent a very small sample of aquifer volume. Spatial
statistics and kriging provide the best unbiased estima-
tor  of interpolated  concentrations.  Hurricane  Hugo
provided an  opportunity to compare these estimators
with actual forest mortality caused by saltwater inunda-
tion associated with the tidal  surge. During the  9- to
15- month period after the hurricane, salt from the tidal
surge moved within the shallow water table aquifer,
causing widespread tree mortality on Hobcaw Forest in
eastern Georgetown County, South  Carolina. A small
watershed (12 acres) was instrumented with 24 multi-
level sampling wells. Piezometric potential and samples
for salt concentration  were collected for  12 months
(months 18 to 30 after the tidal surge).  These data
produced three-dimensional estimations of flow direc-
tions and two-dimensional maps of chloride concentra-
tion.  These maps led to the identification of important
heterogeneities in the  water table aquifer. Apparently,
the infiltrated salt water moved to the  bottom of the
aquifer (15 feet) and emerged,  killing the forest, where
aquifer heterogeneity resulted in upward  movements of
ground water.

Georgetown  County implemented  a geographic  infor-
mation system (GIS) for tax mapping in  1988 and pre-
pared 1:400-scale orthophotographs of the entire county
with  true ground accuracy of less  than 5  feet. Color
infrared aerial photographs were taken from a Cessna 150
platform annually after the hurricane. ERDAS GIS soft-
ware and the accurate  photo base allowed removal  of
scale irregularities and distortion that resulted from  using
a small aircraft. Scanned images, using a 10-square-foot
pixel, were compared with kriged chloride concentration
maps,  also using a 10-foot cell size. Grid cells with
estimated chloride concentration of more than 500 mil-
ligrams  per liter  also exhibited low reflectance  in the
infrared-enhanced color band, indicating tree mortality.
Here, a small number of sampling wells accurately pre-
dicted ground-water movement of a contaminant (NaCI),
and GIS and remote sensing verified this movement.

Introduction

Estimating contaminant flow in ground water is difficult
because  we cannot "see" the aquifer. We know that
aquifers comprise  sediments that vary from  place to
place, that changes in  hydraulic conductivity determine
the rate of water movement, and that the spatial variabil-
ity of the aquifer sediment determines the hydraulic
conductivity. Our inability to accurately represent spatial
variability  of the  aquifer limits  our ability to predict
ground-water flow and, thereby, contaminant transport.

A large variety of prediction models are available  (1-3),
and stochastic methods of estimating spatial heteroge-
neity have been developed (4, 5) and tested (6, 7). On
well-characterized field sites, these techniques can pro-
duce predictions of tracer movements that accurately
predict experimental plumes in terms of mass behavior.
Even at these  research sites, the spatial distribution of
hydraulic conductivity is not known well enough to pre-
dict behavior at any particular point.

Ground-water  measurements generally  derive  from
wells that are single points. To understand movement of
an entire plume, these single point samples  must be
extended to represent areas. The geostatistical ap-
proach allows quantitative estimation of the spatial vari-
ation of point estimates (8). Kriging is  a technique that
uses spatial covariance to estimate values at points
where no measurement exists (9). It produces the best
linear unbiased estimator of nonmeasured points  (8).

Following Hurricane Hugo, these techniques were used
to  study saltwater movement in the water table aquifer
in forested stands of eastern Georgetown County, South
Carolina. Clemson University received a grant from the
U.S. Forest Service to examine forest mortality and
regeneration success within the forest zone covered by
salt water during the tidal  surge.  In this study,  we used
a small sample of ground water to estimate the direction
                                                  74

-------
and concentration of salt moving in the  aquifer. Geo-
graphic information systems (CIS) proved to be a useful
tool to verify conclusions based on the  small sample
size. Onsite sampling, aerial photography, vector and
raster CIS, and spatial statistics were combined into one
analysis system. The system  estimated and verified
directions of salt movement within the aquifer. CIS and
remote sensing of forest mortality  produced an inde-
pendent indicator of salt movement that could be com-
pared with the geostatistical technique.
Problem Statement
The main goal of the research project was to evaluate
problems for forest regeneration in areas covered by salt
water during the hurricane. In  many of these areas, the
mature trees died during the summer following the hur-
ricane. These areas have very low elevation, little relief,
and abundant rainfall, causing the water table to remain
near the soil surface.  The hypothesis was that salt
movement within the aquifer killed the mature trees and
could  limit regeneration success. We divided the prob-
lem into three tasks: to determine if salt concentrations
in the aquifer were high in areas where mature trees
died, to determine pathways of salt movement within the
aquifer that could explain high salt concentrations, and
to predict regeneration success from the pattern of salt
movement.

CIS contributed both to testing the initial hypothesis and
to extending predictions to areas not  initially studied.
CIS has been used primarily to store and display spatial
data in a way that  preserves  and presents the  spatial
relationships as well as the data. For  this project, we
collected two dissimilar data types. To determine salt
movement, we  measured salt concentrations and pie-
zometric pressures in  a series of  wells.  CIS had to
represent the well  data and the domain of the  kriging
procedures in a coordinate system compatible with the
mortality data. We determined forest mortality data from
infrared-enhanced  color aerial photography. CIS also
had to allow separation of the infrared  signature of the
photography, transform the signature into data that were
comparable  with the well  data,  and  ensure that  the
coordinate systems of the well data and the mortality
data represented the same true ground positions.

To use the CIS ability  for this project, we needed to
choose several CIS parameters. In this case, we esti-
mated  ground-water  chloride   concentration   using
kriging, which produced data on a grid comparable with
mortality interpreted  from  photographs. A raster CIS
representation could be compared  with individual grid
chloride values. Each grid cell was 10 square feet so
that each cell would be  within  a single tree crown.
Methods

Study Location

The study was located on 12 acres of a small watershed
located on the eastern  side of Hobcaw Forest, an ex-
perimental forest managed by Clemson University, De-
partment of  Forest Resources.  Hobcaw  Forest is
located on the end of a peninsula between the Winyah
Bay  and the Atlantic Ocean in  eastern Georgetown
County, South Carolina. The study watershed is located
immediately west of the salt marsh and barrier island
separating the forest from the Atlantic Ocean and is in
Pleistocene-aged beach sediment. Watershed divides
were created by former low dune lines, and the stream
is within a small depression between these former dune
lines. Divides are from 7 to 8 feet above sea level and
the stream from 4 to 6 feet above sea level.

The  study watershed is 50  miles northeast of Char-
leston, South Carolina, where the eye of Hurricane
Hugo struck the U.S. coastline. Along this portion of the
South Carolina coast, the tidal surge was approximately
10 feet above mean sea level (10), covering the entire
watershed. Afterthe hurricane, shallow auger holes con-
tained water with sodium concentrations of 4,000  milli-
grams  per liter  (11). The  hurricane  winds did  little
damage to the watershed forest,  but 25 percent of the
large  oaks were windthrown  (12).  Beginning in the
spring of 1990, however, many hardwoods and pines
began dying. By the winter of 1990 and 1991, a large
portion of the forest on the watershed had died.  Tree
mortality did not correspond with high salinity measured
by the initial auger-hole method, suggesting movement
in the water table aquifer.

Well Installation

The water table aquifer is about 20 feet thick, consisting
of fine sand similar to the present beach, with thin beds
of shells 10 feet beneath the stream. The bottom of the
aquifer is a bed of clay up to 3 feet thick over a leaky
artesian aquifer  composed of shell and sand.  Local
rainfall recharges the water table aquifer. Recharge for
the lower aquifer is provided by leakage from the water
table aquifer beneath the center of the peninsula, about
2 miles west of the watershed, where land  elevations
are 15 to 25 feet above  sea level. Piezometric potential
in the lower aquifer is generally a few inches above the
water table aquifer, making it only weakly artesian  (13).

We installed 24 multilevel ground-water samplers (14)
in the water table aquifer. Five samplers were located
in  regeneration  measurement plots (15) placed within
the stream.  Two samplers, one  at each edge of the
hardwood wetland, formed a line perpendicular to the
stream at the regeneration plot. Two  more samplers
were located along these lines near the watershed di-
vides on each side of the watershed. The 24 samplers
                                                  75

-------
formed five transects across the stream (see Figure 1).
Piezometric potential and ground-water chloride con-
centrations were measured  from these samplers from
March  31,  1991, through April 1,  1991. Williams  (16)
provides a complete description of samplers, sampling
procedures, and laboratory analysis.

GIS Implementation and Measures

A GIS system for Hobcaw Forest management  was
developed  in 1987 using Environmental Systems Re-
search Institute's PC ARC/INFO software (17). The in-
itial  system  consisted  of  forest  stand   boundaries
digitized onto 1:100,000 digital line graphs (DLGs) pur-
chased from the U.S. Geological Survey. These rela-
tively crude maps were combined with  stand  records
and used for management decisions that did not require
exact locations of stand  boundaries. Later, management
of the endangered red-cockaded woodpecker's habitat
required that mapped  stand  lines be closer to  true
ground locations than the original  DIG data scale al-
lowed. A program of ground  surveys and aerial photog-
raphy was conducted in the late 1980s to locate stand
boundaries more accurately (18).

In 1988, Georgetown County began a program to con-
vert  county tax mapping to computer-based systems.
The  first step was to acquire survey grade orthopho-
tography. Copies of 1:400-scale orthophotographs, with
guaranteed ground accuracy of plus  or minus 5 feet,
            Location of Sample Points
    554740
    554576  -
    554412  _|
D)
C
!c
t
    554248
    554084 -
    553920
                             Transect 5
                     Transect 4
                                     Transect 3
                               Transect 2
                   Transect 1
         2548470    2548634   2548798  2548962

              Easting (feet) SC State Plane 1983 Datum


Figure 1.  Position of sampling wells in rectangle defined for
         estimation of salinity movements.
became available in 1990. Roads and stand boundaries
were  digitized from these  photographs into  the  PC
ARC/INFO database. The new, accurate map was com-
bined with stand records of previous coverages to create
stand record coverages on a map that was true to the
ground within 5 feet, plus or minus 0.3 percent.

In 1991, the GIS programs ERDAS VGA and LIVE LINK
were  obtained. ERDAS VGA programs allow image
processing and raster GIS to be done on personal com-
puters with VGA and some Super VGA monitor adapt-
ers. The LIVE LINK program allows  display  of  both
ARC/INFO and ERDAS images on the same  monitor
screen.  Orthophotographs  were  scanned  using  5-
square-foot pixels  and  rectified with less  than  1-pixel
mean error, giving accurate ground locations of plus or
minus 8 feet.

Ground surveys from a nearby benchmark provided accu-
rate locations of the sampling wells. PC-TRAVERSE per-
formed coordinate geometry from the survey notes, which
was then  plotted on the ARC/INFO forest stand data-
base. Accuracy was checked by plotting the recogniz-
able   points   on  the   survey   with   the  scanned
orthophotograph  using the LIVE LINK software.

In February  1991, the Hobcaw Forest  was  photo-
graphed with  infrared-enhanced  color film. In this  film,
the red layer is sensitive to near infrared radiation that
is strongly reflected by chlorophyll. Red colors in result-
ing prints indicate living  vegetation. The color photogra-
phy was not corrected  for scale variation (from small
fluctuation in aircraft altitude) or for distortion (caused by
slight variations in the aircraft attitude). One photograph
(1:1,320  scale)  covering the study watershed  was
scanned  into the ERDAS program. This image  was
rectified to the 1988 orthophotograph image using  con-
trol points visible on both. A 10-square-foot pixel  was
used to sample individual tree crowns.  The mean error
of rectification was  1.5 pixels for a ground location, plus
or minus 15 feet.

Ground-water  chloride values at any one  point varied
over three orders of magnitude in all three dimensions
and over  two  orders of magnitude with time. Annual
averages of piezometric potential and chloride concen-
tration, however, yielded interpretable results that were
also statistically significant (16). Averaged values could
then be combined with the surveyed sampler locations
in the ARC/INFO system. The GIS also calculated the
corner coordinates  for a rectangle that would include all
the sampler locations.

Geostatistical  calculations were performed  using  the
GS+ software. The data input to this program is an ASCII
file of sample point  locations in x,y coordinates and  data
values. The  program allows calculation of semivario-
grams with various combinations of active and maxi-
mum  lag distance and fitting of various model types to
                                                  76

-------
best fit the semivariogram (19). An active lag of 65 feet
and a Gausian model produced the best fit:

i(h) = 0.001  + 1.337 (1 - exp [-h2/20736]),  r2  = 0.616

where i(h) is the semivariance at lag distance  h.

This best fit model was then used in a block kriging (9)
procedure. The procedure used eight nearest neighbors
and calculated average values for 10- by 10-square-foot
blocks. Values were  calculated within the rectangle de-
fined  by the corner coordinates from the ARC/INFO
procedure described above.

Finally, the rectangle defined around the sampler posi-
tion formed the region of comparison between the rate
of mortality, as sampled by infrared  reflection, and esti-
mated average chloride concentrations. The first com-
parison involved mapping reflection in the red band of
the aerial photograph as a gray-scale map and compar-
ing it with the contoured map of chloride concentration.
The rectangle coordinates were used  in  the  ERDAS
software to create a subset of the scanned aerial  pho-
tograph that  included  only the red band in the 5,145
pixels defined by the  rectangle surrounding the  sam-
plers. In this  subset, the infrared reflection was scaled
as  a  gray-scale value between  0  and 255 for  each
10- by 10-square-foot block defined in the concentration
map. In addition to  mapping, a regression  of chloride
concentration to gray-scale value was performed  using
the individual blocks.

Results

Ground Water

Ground-water chloride data reflected a consistent expla-
nation of salt movement.  Initial  auger-hole data col-
lected within a month of the hurricane  indicated most
salt was near the surface of the pine  ridges, where,
presumably,  salt water had filled  the aquifer to the soil
surface during the hurricane. Data collected  30 months
after the hurricane indicated the bulk of the  salt had
moved to the bottom of the aquifer underthe pine ridges.
Figures 2 and 3 represent the most significant results
interpreted from the piezometric  potential and chloride
concentration measurements.

Figure 2 represents a cross section of the chloride con-
centrations and directions of ground-water flow in tran-
sect 4, the second most northern transect. The  common
information represented  in this cross section  was the
west to east movement of ground water, representing
the regional flow toward the forest edge. Also, there is
an area of upwelling just east of the stream at the bottom
of the aquifer, representing a leaky spot in the underlying
clay layer. Upwelling causes the west to east stream-
lines to rise toward the surface along the western edge
of the wetland. Chloride concentrations indicate  large
                                                                            Transect 4
    Pine
                 Chloride Concentration

    [] < 50 Milligrams per Liter     •  500 to 750 Milligrams per Liter
    PI 50 to 100 Milligrams pel
f Liter
       100 to 250 Milligrams per Liter
                             750 to 1,000 Milligrams per Liter
                             1,000 to 2,000 Milligrams per Liter
    [>] 250 to 500 Milligrams per Liter M  > 2,000 Milligrams per Liter

Figure 2.  Cross section of aquifer at transect 4; gray scale rep-
         resents chloride concentration, boxes at sampler po-
         sitions represent 95-percent confidence limits  of
         average chloride concentration in same gray scale,
         and arrows are perpendicular to contours  of pie-
         zometric  potential and represent two-dimensional
         vectors of streamlines.

reservoirs of salt beneath each of the pine  ridges and
small pockets of fresher water near the surface,  prob-
ably the  result  of rain infiltration during the  30 months
since the hurricane. The water upwelling  from the arte-
sian  aquifer was  consistently  fresh.  Flow  passing
through  the  concentrated  zone beneath the western
ridge was pushed to the surface beneath the stream,
where evaporation  caused  chloride concentrations  to
average  more than  1,000 milligrams per liter.

Figure 3  represents two plan views of the site at depths
of 4 and 12 feet below the surface. The 12-foot plan view
indicated that the east to west flow in transect 4 is only
the east vector of a southeast flow.  Other areas  of
upwelling exist beneath the stream. Chloride concentra-
tions are highest beneath both ridges and lowest in the
stream center. At the 4-foot depth, the east  and south-
east flows are also obvious. Also, as deeper flows from
the western ridge are turned to the surface, high con-
centrations  of chloride are  present near the surface.
High concentrations within the wetland result from water
being carried to the surface due to upwelling within the
wetland.

GIS Evaluations

The ground-water interpretations show a  consistent ex-
planation of salt movement. These interpretations are
                                                    77

-------
Plan View 12-Foot Depth
Plan View 4-Foot Depth
Figure 3. Plan views of aquifer at depths of 4 and 12 feet below the surface with same chloride scale and two-dimensional plan
vectors of streamlines.
based on only 24 sample points. Interpolation was linear
using the nearest neighbor. The samples removed from
the aquifer represent only 0.000014 percent of the aqui-
fer volume. Interpretation of a three-dimensional flow
regimen from such small sampling does not produce
great confidence in the validity of the interpretation.

Salt at the 4-foot depth is most likely to interact with tree
roots, and concentrations at this depth were used for
kriging. Kriged results, mapped in the same manner as
the aerial photography, show general agreement of high
mortality and predicted chloride concentrations over 500
milligrams per liter (see Figure 4). A regression of chlo-
ride concentration (chloride) to gray-scale value (G) for
the individual points yielded a significant negative cor-
relation. The regression line G = 169-0.12 (chloride)
explained only 27 percent of the variation in gray-scale
value, however.

Conclusions

CIS was successfully used to verify interpretations of
ground-water flow and salt movements in a shallow
water table aquifer. A variety of computer software com-
bined to create a system of analysis that allowed inte-
gration of field sampling, aerial photography, vector and
raster CIS, and spatial statistics. Using this system, we
compared chloride movement, measured by subsurface
samplers, with remotely sensed tree mortality caused by
soil salinity. The overall pattern of mortality was pre-
dicted by a 500-milligram-per-liter chloride contour esti-
mated by kriging averaged concentrations. Estimation
of mortality on a single tree basis was less successful,
with a regression of chloride to infrared reflection ex-
plaining only 27 percent of the variation in reflection. The
regression did not fit values of high reflection well but
did predict reflection values of 100 or below (regions of
high mortality) for concentrations above 500 milligrams
per liter.

The most important factor in the success of this project
was the availability of large-scale orthophotography.
Georgetown County's investment in accurate mapping
allowed creation of a map base that made scale correc-
tion of less costly aerial photography possible. Without
assurance that pixels on the aerial photograph corre-
sponded to the same locations as the subsurface sam-
plers, correlations would have been meaningless.

Another factor that contributed greatly to the research
was the fact that most of the computer software ex-
ported or imported data from simple ASCII files. The
standard (x coordinate, y coordinate, data value) format
in ASCII allowed files to be manipulated with spreadsheets
78

-------
                  Kriged Chloride Concentrations
                                                           Forest Mortality (Dark)
         554740
         554576 -
         554412
         554248
         554084 -
         553920
               2548470 2548634  2548798  2548962
                                Easting (feet)
                                                                   554740
                                                                   554576
                                                                   554412
                                                                   554248
                                                                   554084
                                                  553920
                                                       2548470  2548634  2548798 2548962
                                                                        Easting (feet)
    <200
<500
<700
<1,000
<1,200
Figure 4.  Plan view of chloride concentrations from krig analysis and gray scale of infrared reflection from aerial photograph. Lighter
          tones represent greater infrared reflection and less forest mortality.
or word processors. Creation of headers, positioning of
columns, or changing order of rows or columns could be
done for import into the next program. Although more
difficult than point and  click file  transfers of the modern
software, the simple standard format creates freedom to
use the software in ways not anticipated by the software
developers.

Finally, a clear problem statement aided in selecting the
most applicable CIS techniques.  CIS software allows
several methods of data representation. In this example,
we chose a raster  representation with a cell the size of
a tree  crown. Criteria  for choosing these  parameters
included physical  dimensions  of  the  phenomenon  of
interest, dimensions of CIS  accuracy, and  a desire for
automated determination of values for individual com-
parisons. A careful review of the problem to be solved,
data available, and capabilities  of the CIS software are
all necessary ingredients for a useful problem statement.


References

 1.  Van  Genutchen, M.T., and W.J. Alves. 1982. Analytical solutions
    of the one-dimensional convective-dispersive solute transport
    equation. U.S. Department of Agriculture Technical Bulletin 1661.
 2.  Wang, H.F., and M.P. Anderson. 1982. Introduction to ground-
    water modeling: Finite difference and finite element methods. San
    Francisco, CA: W.H. Freeman.
 3.  Bear, J., and A. Verruijt. 1987. Modeling ground-water flow and
    pollution. Dordrecht, The Netherlands: D. Reidel Publishing Co.
                                             4. Gelhar, L.W. 1993. Stochastic subsurface hydrology. Englewood
                                               Cliffs, NJ: Prentice Hall.

                                             5. Dagan, G. 1987. Theory of solute transport by ground water. Ann.
                                               Rev. Fluid Mech. 19:83-215.

                                             6. Garabedian, S.P., D.R. LeBlanc, L.W. Gelhar, and M.A. Celia.
                                               1991. Large-scale natural gradient tracer test in sand and gravel,
                                               Cape Cod, Massachusetts,  2. Analysis of spatial moments for a
                                               nonreactive tracer. Water Resour. Res. 19(1): 1,387-1,397.

                                             7. Rehfield, K.R., J.M. Boggs, and L.W. Gelhar. 1992. Field study
                                               of dispersion in a heterogeneous aquifer, 3. Geostatistical analy-
                                               sis  of hydraulic conductivity. Water Resour. Res. 28(12):3,309-
                                               3,324.

                                             8. de Marsily, G.  1986. Quantitative  hydrogeology. San Diego, CA:
                                               Academic Press.

                                             9. Krige, D.G. 1966. Two-dimensional weighted  moving  average
                                               trend surfaces for ore-evaluation. J. South  African Instit. Mining
                                               and Metallurgy 66:13-38.

                                            10. Coch, N.K., and M.P.Wolfe.  1991. Effect of Hurricane Hugo storm
                                               surge in coastal South Carolina. J. Coastal Res. SI 8:201-208.

                                            11. Gardner, L.R.,  W.K Michner, E.R. Blood, T.M. Williams,  D.J. Lip-
                                               scomb, and W.H. Jefferson.  1991. Ecological impact of Hurricane
                                               Hugo—salinization of a coastal forest. J. Coastal Res. SI 8:301-318.

                                            12. Gresham, C.A., T.M. Williams, and D.J. Lipscomb. 1991. Hurri-
                                               cane Hugo wind damage to southeastern U.S.  coastal tree spe-
                                               cies. Biotropica 23:420-426.

                                            13. Nittrouer, PL.  1988. Ground-water flow patterns in  a  forested
                                               beach ridge-scale system  adjacent to a  South Carolina salt
                                               marsh. Master's thesis, Department of Geological Sciences, Uni-
                                               versity of South Carolina.
                                                         79

-------
14.  Williams, T.M., and J.C. McCarthy. 1991. Field-scale tests of col-
    loid-facilitated transport. In: National Research and Development
    Conference on the Control of Hazardous Materials, Anaheim,
    California.  Greenbelt, MD: Hazardous Materials  Control  Re-
    search  Institute.

15.  Conner, W.H. 1993. Artificial regeneration of bald cypress in three
    South Carolina forested wetland areas after Hurricane Hugo. In:
    Brissette, J.C., ed. Proceedings of the Seventh Southern Silvicul-
    tural  Research Conference. U.S. Department of Agriculture For-
    est Service, General Technical Report SO-93. Southern Forest
    Experiment Station, Mobile, AL.

16.  Williams, T.M. 1993. Salt water movement within the water table
    aquifer  following Hurricane Hugo. In: Brissette, J.C.,  ed. Proceed-
    ings of  the Seventh Southern Silvicultural Research Conference.
    U.S.  Department of Agriculture Forest Service, General Technical
    Report  SO-93.  Southern Forest  Experiment Station, Mobile, AL.
17.  Lipscomb D.J., and T.M. Williams. 1988. A low-cost, map-based
    data retrieval system for forest managers.  In: Proceedings of the
    American Congress of Surveying and Mapping/American Society
    for Photogrammetry and Remote Sensing, Fall Convention. Falls
    Church, VA: American Congress of Surveying and Mapping.

18.  Lipscomb, D.J., and T.M. Williams. 1990.  Developing a CIS for
    forest  management in the 1990s. In: Resource technology 90:
    Second  international  symposium  on advanced technology in
    natural resource management. Bethesda, MD: American Society
    of Photogrammetry and  Remote Sensing.

19.  Rossi, R.E.,  D.J.  Mulla,  A.G. Journel,  and E.H. Franz.  1992.
    Geostatistical tools for modeling and interpreting ecological spa-
    tial dependence. Ecol. Monographs 62:277-314.
                                                             80

-------
                   Geology of Will and Southern Cook Counties, Illinois


                                       Edward Caldwell Smith
                        Illinois State Geological Survey, Champaign, Illinois
Introduction

The Silurian dolomite aquifer is the primary source of
ground water in  northeastern  Illinois. It is overlain by
glacially derived  sands  and gravels or tills. The sands
and gravels within the glacial drift hydrologically interact
with the fractured and creviced dolomite bedrock.

The purpose of this study was to define the extent of
major glacial drift aquifers and their relationship to the
shallow bedrock  aquifer surface. The study succeeded
in identifying two principal sand and gravel aquifers: an
"upper" drift aquifer within the glacial tills and a "basal"
drift aquifer overlying the bedrock. Bedrock topography,
drift thickness, thickness  of the  Silurian dolomite, and
thickness of major sand and gravel units were mapped
to help define the geologic and hydrologic system and
the interaction of the upper bedrock aquifer and the drift
aquifers.

The data collected to create the various maps came
from well records, engineering borings, oil and gas tests,
and structure tests on file at the Illinois State Geological
Survey  (ISGS).  Reviewing published  reports,  manu-
scripts, and unpublished reports on open file at the ISGS
provided an overall  perspective of the geology of the
study area. Previously, no detailed studies of the hydro-
geology of the entire area had been conducted. Incor-
porating water well and  other  data into a computer
database greatly facilitated map construction. Prelimi-
nary maps were developed using Interactive Surface
Modeling (ISM) software and a geographic information
system (CIS).

Past regional geologic studies of the northeastern Illi-
nois  area that have encompassed this  study area in-
clude Thwaites (1), Bretz (2), Bergstrom et al. (3), Bretz
(4),  Suter et al. (5), Hughes et al. (6), and Willman (7).
Bogner (8) and Larsen (9) included interpretive maps of
the surficial geology of the area as a part of planning
studies for northeastern Illinois.

Map Construction

Creating the database used in the construction of the
maps for this project entailed inputting information from
well driller's logs into a PC-based computerized spread-
sheet (Quattro Pro). Well logs were primarily from water
wells and engineering borings. Data items input into the
spreadsheet included:

• Well identification (ID) number

• Owner name

• Location of well

• Thickness of drift

• Depth to top and bottom of the bedrock

• Depth to top and bottom of each sand unit

The ground surface elevation of each well was interpo-
lated  from United  States Geological  Survey  (USGS)
7.5-minute quadrangles. Elevations of the top of bed-
rock and top and bottom of sand bodies were calculated
based on the  interpolated elevations. Locations were
verified wherever possible using plat books by matching
either landowner names or the address location from the
well log. After  compilation, the data were converted  to
ASCII text and transferred into an ARC/INFO (Versions
5.0.1 and 6.0) database on a SUN SPARC workstation.
ARC/INFO is a product of Environmental Systems Re-
search Institute, Inc., of Redlands, California.

Of the more  than 10,000 records  reviewed for this pro-
ject,  over 5,100 were  input into the database. Sub-
sequently, numerous data quality checks ensured that
duplicate well  ID  numbers were corrected, locations
were corrected, thicknesses were checked so  that the
sand thickness data reported did not exceed drift thick-
ness,  and elevations were checked so that elevation  of
a sand body was not below the bedrock surface. After
running the data quality checks and removing questionable
data from the  database, approximately  5,000  records
remained.

ISM,  a contouring  package  from Dynamic Graphics,
Inc., of Alameda,  California, helped to  create two di-
mensional grid representations of:

• Surface topography

• Drift thickness
                                                  81

-------
• Bedrock topography

• Bedrock isopach

• Intermediate sand body isopach

• Basal sand isopach

ISM also allowed for the creation of contoured output of
the grids. Grids are regularly spaced rectangular arrays
of data points (nodes) that allow for efficient mathemati-
cal  calculations and contouring.  ISM uses a minimum
tension  gridding  technique,  allowing for the curvature
(change in slope) of the surface to be spread throughout
the surface rather than being concentrated at the input
data points. The ISM program uses a biharmonic in-
verse cubic spline function (algorithm) to  assign data
values to grid nodes. This function assumes that for any
grid node assignment, input  data points farther away
from the node being evaluated have less influence on
that node's value than nearer data points. To determine
each grid node value, ISM calculates an average value
from the surrounding scattered input data (up to 15 input
data points) and finds the standard deviation. ISM con-
tinues to refine the values of the grid  nodes until the
standard deviation is minimized (10).

Several grid spacings were reviewed to determine which
would best represent the density of the data. The grids
that ISM uses, as described above, determine the fine-
ness to which the data control the resultant contours.
Experimentation  was  necessary to  determine  a grid
spacing that adequately represented the data. Too fine
a grid spacing can exaggerate or overly weight individ-
ual  points, causing the resultant contours to be overly
jagged.  With too large of a grid  spacing, the contours
can become overgeneralized  and become much less
data  dependent  because the  calculated grids are
ove rave raged.

The two-dimensional grid of the land surface topography
was based on surface topography lines and spot eleva-
tions digitized from USGS 7.5-minute quadrangles. The
linework for each quadrangle was converted to ASCII
files of data points. The ASCII files contained x and y
coordinates and  the  elevation value of each data
point. After inputting the  ASCII  files  into ISM,  a
two-dimensional grid for each quadrangle was created.
ISM also generated contour lines from each grid. Com-
paring plots of the generated lines with USGS 7.5-minute
topographic maps allowed for the correction of errors and
ensured that the grid  elevation values were within 10
feet of the elevations shown on the USGS maps. An ISM
two-dimensional grid of the entire area's surface topog-
raphy was created by combining the grids. After creating
a contoured surface of the grid, an ARC/INFO coverage
of the output was produced. ARC/INFO was used to edit
the coverage and produce the final map.
The two-dimensional grid of the bedrock surface topog-
raphy was based on data from water well and engineer-
ing boring  logs, ISGS field  observations  of  outcrop
locations, and previous ISGS mapping (9). An ASCII file
of x and y coordinates and the elevation of each bedrock
top was input into ISM. Subtracting the bedrock topog-
raphy grid from the land surface grid produced a grid  of
the drift thickness.  A  contoured output  of the grid was
produced, and an ARC/INFO coverage of the output was
created. Again, ARC/INFO was used to edit the cover-
age and produce the final map.

Creating the isopach  maps entailed subtracting the top
and  bottom elevations of each unit to calculate the
thickness of each unit. ASCII files of the xand y coordi-
nates and the thickness values for each data point were
input into ISM. ISM then created two-dimensional grids
of each isopach. Contoured  output of each grid  was
produced, which allowed for the creation of ARC/INFO
cove rages of the output. ARC/INFO was used to edit the
coverages and produce the final maps.

Bedrock  Geology of the Study Area

All the sedimentary bedrock units are of the Paleozoic
Era.  The Paleozoic bedrock  comprises sequences  of
sandstones, dolomites, limestones,  and shales.  The
stratigraphic column of Figure 1 illustrates the vertical
succession of the bedrock. Major tectonic activity of the
area includes the formation of the Kankakee  Arch  in
Ordovician  time (11)  and  faulting along the  Sandwich
Fault  Zone. Faulting  along the  Sandwich Fault  Zone
(see Figure 2) may have occurred coincidentally with the
formation of the Lasalle Anticlinorum in early Pennsyl-
vanian time (12). No further faulting has  been noted
since deposition of glacial sediments.  Bedrock  units
gently dip to the east (7). The majority  of the area lies
on the Niagara  cuesta, a south and west facing scarp
that comprises the resistant Silurian strata that have an
eastward dip of roughly 15 feet per mile (13). The Silu-
rian strata are absent west of the Kankakee River as
well as in an area west of the Des Plaines River  in
west-central Will County  (see Figure 2). This study re-
lates to the hydrogeology of the  Silurian strata  and the
drift materials, and  details only the uppermost bedrock
units. The report, however, does briefly summarize units
below  the  Maquoketa Group using  information  from
Hughes et al. (6) and  Visockey et al. (14).

Precambrian Bedrock

Granites  or granitic  rock  compose  the  Precambrian
basement of northern Illinois.  Few details  about the
nature of the basement rocks are known because few
wells have completely penetrated the sedimentary bed-
rock of the  region. The elevation of the top of the Pre-
cambrian basement probably stands at 4,000 feet below
mean sea level in the study area.
                                                  82

-------
SYSTEM
QUATER-
NARY
PENNSYL-
VANIAN
z
<
(r
^
to

ORDOVICIAN
CAMBRIAN
PHE-
CAMBRIAN
SERIES
PLEISTOCENE
DESMOINESIAN
NIAGAHAN
ALEXANDRIAN
CINCINNATIAN
CHAMPLAINIAN
CANADIAN
CROIXAN


GROUP OR
FORMATION

Spoon and
Carbondale
Racine
Sugar Run
Joliet
Kankakee
Elwood
Wilhelmi
Maquoketa
Galena
Platteville
Glenwood
St. Peter
Shakopee
New
Richmond
Oneota
Gunter
Eminence
Potosi
Franconia
Iron ton
Galesville
Eau Claire
Elmhurst
Member
Mt. Simon

AQUIFER
Sands
and
Gravels

Silurian

Galena-
Platteville
Glenwood-
St. Peter
Prairie du
Chien
Eminence
Potosi
Franconia
Ironton-
Galesville

omite aquifer system
•8
S
I
\
\
\
\
\


brian-Ordovician aquifer system
$




Elmhurst-
Mt. Simon
aquifer
system

LOG
•<.:,'.•>:• x
S&t
• a O "o •• D". »
&S&S
*^"V •

A
'£
WW
rWi
f
i\
/
/
/
— /.
/
/
f —
/-
~T
7-L
-AZ
TA
-7
7—
L 	 / T^fl 	 /•
:£^sn/»
— — ^\f"i

— /-
/• —
•^^«/


17 — _zr
i ~i ' —
/ /
/ / i
. . A. A,...
'/"VV
7 /
< i "V
• 1 • • 1 A' 1 '
* • / • • / • •
'•:s:':''-~^/:.
^ '•'•••'
/ •-' • •
/ • • • .

YA • • •
/ •:• .•
/ / ••••
/••.•• .
. •/ • • *
/• ' •

/ • •.•..
•••/ •••.
/ -A-
•ilfcA-
4%^
fr
i
/
—f~
/A/
/
/ /
•/••
• •/:
/
/

/• ..
• /
7-'-
•/
|r-h'%s'^.-
W&&
£!TiiT* *-~ * r\ •"*"
-/. —S—Y* -/*--{•

— /• ~~/1m /, ~ /,'t
*• * * • _ *f fj^t
t * * *^^ ~ •
^^^\
x^/X^/
THICKNESS
(FT)
0-260
0- 110

- reef
0 350
0-100
80-250
310-380

125-600
0-410
0-280
110- 160


390 - 570
2200

DESCRIPTION
Unconsolidated glacial deposits-pebbly clay
(till), silt, sand and gravel
Alluvial silts and sands along streams
Shale, sandstone, clay, limestone, and coal
Dolomite, very pure to argillaceous, silly,
cherty; reefs in upper part
Dolomite, slightly argillaceous and stlty
Dolomite, very pure to shaly and shale,
dolomitic; white, light gray, green, pink,
maroon
Dolomite, pure lop 1 ' - 2', thin green shale
partings, base glauconitic
Dolomite, slightly argillaceous, abundant
layered white chert
Dolomite, gray, argillaceous and becomes
dolomitic shale at base
Shale, red to maroon, oolites
Shale, silty, dolomitic, greenish gray, weak
(Upper unit)
Dolomite and limestone, white, light gray,
interbedded shale (Middle unit)
Shale, dolomitic, brown, gray (Lower unit)
Dolomite, and/or limestone, cherty (Lower
part)
Dolomite, shale partings, speckled
Dolomite and/or limestone, cherty, sandy
at base
Sandstone, fine anH coarse-grained; little
dolomite; shale at top
Sandstone, fine to medium-grained; locally
cherty red shale at base
Dolomite, sandy, cherty (oolitic); sandstone
Sandstone interbedded with dolomite
Dolomite, white to pink, coarse-grained
cherty (oolitic)
Sandstone, medium-grained, slightly dolomitic
Dolomite, light colored, sandy, thin sandstones
Dolomite, fine-grained gray to brown,
drusy quartz
Dolomite, sandstone and shale, glauconitic,
green to red, micaceous
Sandstone, fine to coerse- grained, well
sorted; upper part dolomitic
Shale and siltstone, dolomitic, glauconitic;
sandstone, dclomitic. glauconitic
Sandstone, coarse-grained, white, red in lower
half; lenses of shale and siltstone, red,
micaceous
Granitic rocks
Figure 1.  Generalized stratigraphic column of rock units and aquifers in northern Illinois (prepared by M.L. Sargent, ISGS).
                                                            83

-------
Cambrian

The Elmhurst-Mt. Simon Sandstone comprises the old-
est sedimentary units in Illinois and consists of medium-
grained  sandstones.  It  has  a  total  thickness  of
approximately 2,500 feet. The upper part of this unit has
acted as an aquifer in the Chicago region  in the past;
ground-water mining of the aquifer (a nonreplenished
lowering of the static water level), however,  has led to a
discontinuation  of its use for that purpose. The Eau
Claire Formation, the Basal Sandstone Confining Unit
(14), consists of dolomitic shale and siltstone with thin
beds of sandstone. It has a thickness of 300 feet to 400
feet and separates the Elmhurst-Mt. Simon aquifer from
the Ironton-Galesville Sandstones. The Ironton-Galesville
Sandstones have a thickness of 150 feet to 250 feet and
serve as a source of ground water in northern Illinois (6).
The Galesville Sandstone is fine-grained, while the Iron-
ton Sandstone  is  coarser grained and contains more
dolomite. The Knox Megagroup, the Middle Confining
Unit (14), comprises all the bedrock units between the
Ironton-Galesville Sandstones and the Ancell Group. It
includes the:

• Cambrian Franconia Formation

• Potosi  Dolomite

• Eminence Formation

• Jordan Sandstone

• Ordovician Prairie du Chien Group

The Knox Megagroup is primarily dolomitic in  compo-
sition,  though it contains thin sandstones. Its thick-
ness ranges from 400 feet in the northern portion  of
the study area to about 700 feet in the southernmost
tip of Will County. The sandstones tend to be some-
what discontinuous and, where present, offer a local-
ized  source of ground  water. The group as a whole
acts  as a confining unit between the Ironton Sand-
stone and the Ancell Group.

Ordovician

The Ancell Group, which  contains the St. Peter Sand-
stone and  Glenwood Sandstone, has a thickness  of
roughly 200 feet throughout the study area except in
north-central Will County where it is over 400 feet. The
thickness  of the Ancell  Group varies considerably in
northern Illinois because it rests on an  erosion surface.
The Ancell Group is the shallowest aquifer present in
this area below the Silurian dolomite aquifer. The eleva-
tions of the top of the Ancell Group range from just over
sea level in the northwest corner of Will County to 500
feet below mean sea level in the southwestern corner.
The Galena and Platteville Groups provide a sequence
of carbonate  rocks that are primarily dolomitic in com-
position. The Platteville Group conformably overlies the
Ancell Group. The two units have a combined thickness
of 350 feet throughout this part of the state. The Galena
and  Platteville Groups,  combined  with the  overlying
Maquoketa Shale Group, act as an aquitard between the
Ancell aquifer and the Silurian dolomite aquifer.

Maquoketa Shale Group

The study area has three subaerially exposed bedrock
units. The oldest  of these that  this report details are
Ordovician-aged strata comprising the Cincinnatian Se-
ries  Maquoketa Shale Group.  The thickness  of the
Maquoketa Group ranges from 260 feet in eastern Will
County to 120 feet in the northwestern corner of Will
County and is unconformably overlain by Silurian strata
(15). The Maquoketa  Group comprises four formations:

• Scales Shale

• Fort Atkinson Limestone

• Brainard Shale

• Neda Formation

The Scales Shale forms the lowermost unit and consists
of gray to brown dolomitic shale. Thin layers with phos-
phatic nodules  and pyritic fossils  occur near the top and
base of the unit. The Scales Shale may attain a thickness
of up to 120 feet in this  region (15). The Fort Atkinson
Limestone, a coarse-grained crinoidal limestone to fine-
grained dolomite, may range up to 60 feet thick (15). The
Brainard Shale comprises greenish gray  dolomitic shale
and has a thickness of generally  less than 100 feet (16).
The  Neda Formation, the youngest  formation  in the
Maquoketa Group,  is relatively thin  with  a thickness of
usually less than 10 feet. In some places, it may attain a
maximum thickness of 15 feet. The Neda is exposed along
the Kankakee River, and the Silurian-aged Kankakee For-
mation typically overlies it. The Neda Formation consists
mostly of red and green shale with  interbedded goethite
and hematite oolite beds (7, 16).

Silurian System

Silurian-aged rocks consist almost  solely of dolomites
and dolomitic limestones. The Silurian is divided into the
Alexandrian and Niagaran Series. The Alexandrian Se-
ries is  about 25 feet thick and  is represented  by the
Kankakee, Elwood, and Wilhelmi  Formations. These for-
mations are  a  fine- to medium-grained,  white, gray to
pinkish gray dolomite. The Kankakee Formation is exposed
along the Kankakee River in southern Will County (17).

The  Niagaran  Series comprises much of the bedrock
surface of this  area and includes three formations. The
Joliet Formation has  a lower member of dolomite with
interbedded red and green shale, and two upper mem-
bers with an increasing  purity of dolomite toward the
top  of  the formation  (7). The Sugar Run  Formation,
formerly termed the Waukesha  Formation  (17), is an
argillaceous, fine-grained, medium- to thick-bedded,
                                                  84

-------
brownish gray dolomite (7). The Racine Formation is the
thickest unit in the Niagaran Series, attaining a thick-
ness of as much as 300 feet (17). The Racine Formation
contains  large reefs that are as high as 100 feet and
consist of vugular gray  dolomite. The  inter-reef rock
consists  of dense, cherty  gray dolomite. The Racine
Formation is exposed in the bluffs along the Des Plaines
River from Joliet to Blue Island, Illinois (17).

Figure 2  is an isopach of the Silurian dolomite indicat-
ing the thickness  of the unit in the study area and the
boundary of the Silurian rocks. The Silurian dolomite
aquifer has a maximum thickness of just over 500 feet
in  the  southeast corner of Will County and becomes
thicker to the east and south. It rapidly increases in
thickness from its margin along the western border of
Will  County,  where  it has eroded. The contact  be-
tween  the  Silurian dolomite  and  the  underlying
Maquoketa  Shale Group has  relatively little relief.
Thus,  the major differences  in thickness of the unit
result from erosion of the bedrock surface. Joints and
fracture patterns within the upper bedrock have a domi-
 nantly northwest-southeast  and  northeast-southwest
 orientation (18).

 Pennsylvanian System

 Pennsylvanian-aged bedrock is found in the southeast-
 ern portion of Will County west of the Kankakee River
 with an outcropping at the confluence of the Des Plaines
 and Kankakee Rivers.  The lowermost unit,  the Spoon
 Formation, is very thin and consists of clay beds with
 scattered occurrences  of coal formed in channel-like
 depressions (19). The Spoon Formation overlies the
 Maquoketa  Shale Group. The overlying  Carbondale
 Formation may attain a thickness of over 100 feet in the
 southwestern corner of Will County. The  Carbondale
 Formation consists of shale with thin  limestone beds.
 The lowermost unit, the  Colchester (Number 2)  Coal
 Member, outcrops in this area and attains a thickness of
 up to 3 feet. It has been extensively mined along the
 Will-Grundy-Kankakee  County border where large  ar-
 eas of strip-mined land are evident. Most of the available
 coal has been mined out, and numerous gob piles exist
                   R9E
                   -  -
                                                                             R14E
                                                                                     R15E
                      Scale of Miles
                           5        10
Legend
•300"'  Thickness in Feet
      Contour Interval 50 Feet
. _-"•  Western Boundary of
      Silurian Dolomite
***-*  Fault
it
Figure 2.  Thickness of the Silurian dolomite.
                                                  85

-------
in the area of Braidwood. The Francis Creek Shale
Member, which overlies the Number 2 coal, constitutes
the remainder of Pennsylvanian units in the study area.
The Francis Creek Shale is gray with numerous flat-
tened concretions that contain the Mazon Creek flora of
Pennsylvanian-aged fossils (19). Weathering of the
mine slag materials may have exposed fossiliferous
concretions in the gob piles (7).
Bedrock Topography

The highest bedrock elevations are in east-central Will
County where the bedrock rises to over 700 feet above
mean sea level (see Figure 3). Bedrock uplands occur
as a broken curved ridge from the southeast to the
northwest with bedrock elevations consistently rising
over 650 feet above mean sea level. The bedrock sur-
face slopes from the bedrock upland high westward to
the Des Plaines River. It also has a regional downward
slope to the south into Kankakee County, Illinois, to the
northeast into the Lake Michigan basin, and to the east
into Indiana. West of the Des Plaines River, the bedrock
surface rises to over 650 feet above mean sea level in
northeastern Kendall County. Elsewhere, the surface
has relatively low relief.
The dominant features of the bedrock surface are the
river valleys. The Des Plaines River valley is better
expressed than the Kankakee River valley. This is true,
in part, because it is older and acted as a drainageway
for glacial meltwater where it may have become en-
trenched in the present valley. The Kankakee River
valley may be less expressed partly because of the
amount of scouring that occurred over a large area
during the Kankakee flood event such that the river is
not entrenched in most places. Also, smoothing of the
study's contour maps has generalized some of the de-
tail.

The buried Hadley Bedrock valley, described initially by
Horberg and Emery (20), probably existed prior to
glaciation and concurrently with the preglacial Des
Plaines River. The valley may have acted as a drainage-
way for glacial meltwaters until the time that glacial
debris buried it. Glacial scouring was originally believed
to have formed the valley, but evidence presented by
McConnel indicated a fluvial origin of the valley (21).
Also, the base of the Hadley valley does not overhang
or lie much below the Des Plaines valley but rather joins
it at a smooth juncture.

The bedrock surface contains a number of sinkholes
or closed depressions that are expressions of karst
Legend
^o*" Bedrock Elevation in Feet
Above Mean Sea Level;
—^ Contour Interval 25 Feet
^- Western Boundary of
x^ Silurian Dolomite Aquifer
s\ Bedrock Quarry
Figure 3. Topography of the bedrock in the study area.
86

-------
development that formed prior to continental glaciation.
Karst, a terrain developed on limestone or dolomite by
solution or dissolving of the rock,  is characterized by
closed depressions and cavity development along joints
and fractures.  Fischer (22) first noted karst features in
the Joliet area where early Pennsylvanian sediments of
shale and  clay  filled cavities in the  upper bedrock.
Buschbach and Heim (23) indicated closed depressions
in the Silurian  dolomite surface in their bedrock topog-
raphy map for the  Chicago  region. They speculated
these depressions were expressions of karst develop-
ment. McConnel  (21) demonstrated the existence  of
sinkholes  in the area  of the  buried Hadley  Bedrock
valley northeast  of  Joliet by using seismic  refraction
survey data.

Glacial Geology

The  sediments overlying the bedrock comprise  tills,
sands  and  gravels, lacustrine deposits from glacial
lakes, and surficial eolian deposits of loess and sand.
The  unconsolidated deposits are over 150 feet thick
along the crest  of  the  Valparaiso  Morainic System.
Figure 4,  adapted  from Willman  (24),  indicates the
principal moraines. In the area where the Hadley Bed-
rock valley is present, the deposits attain a thickness of
over 175 feet. Bedrock is mainly exposed along the Des
Plaines River valley and its tributaries. It is also exposed
in isolated areas in southeastern Cook County. The drift
thickness  map (see Figure 5) indicates the distribution
of the  earth materials overlying the bedrock and the
locations of bedrock outcrops. The  bedrock outcrop in-
formation for this map was derived from Piskin (25) and
Berg and Kempton (26).

Erosion of the glacial sediments was a major factor in
controlling the drift thickness of the area. Succeeding
glaciers scraped off previously deposited sediments, but
glacial  meltwaters, which came from the east and north
along the  river channels,  caused much of the erosion.
Both the Kankakee River and Des  Plaines River acted
as meltwater channels as  the glaciers melted. The Du
Page River acted as a minor drainageway and was most
active during large-scale flooding events. The thickness
of the drift varies in the area also because of the topo-
graphic control that the bedrock on the overlying sedi-
ments  exercises. The crest of the Valparaiso moraines
coincides  with  the  topographic high  in  the bedrock
                                                                          End Moraines
                                                         Scale in Miles
                                                     0               10
Figure 4.  End moraines (late Wisconsin) in Will and southern Cook Counties, Illinois.
                                                   87

-------
j
Legend
Drift Thickness in Feet; Contour
Interval 25 Feet Between 0 to 50
Feet and 50 Feet Between 50 to
150 Feet
^Sy Bedrock Outcrop Area
A Isolated Bedrock Outcrop
2^1 Strip-Mined Areas
Figure 5. Thickness of the glacial drift in the study area and bedrock outcrop information (25, 26).
surface. The cross sections in Figure 6 also show this.
The bedrock high may have caused late Woodfordian
glaciers to stall repeatedly in the same area, causing
moraines to build atop one another sequentially (27).

Descriptions by well drillers note few variations in the
character of the unconsolidated sediments; therefore,
we did not attempt to correlate these deposits. The drift
materials present in the study area are late Wisconsinan
or younger. Though this region experienced glaciating
repeatedly prior to the Wisconsinan glaciation, no Illi-
noisan or pre-lllinoisan deposits have been identified
(28). The drift units divide into three main units (29):

• The Lemont Drift

• The Yorkville Drift

• The Wadsworth Drift

The three drift units are all part of the Wedron Formation
of Wisconsinan age. The Lemont Drift has a dolomitic
character because the source material for the diamicton
was glacially eroded Silurian dolomite. The Lemont Drift
is the oldest of the three units and is found only under-
lying the Wadsworth Drift. The Yorkville Drift is the only
drift unit present west of the Valparaiso Morainic System
boundary within the study area. It overlies the bedrock
surface wherever the basal sand unit is not present. The
Wadsworth Drift comprises silty and clayey diamictons
and is the youngest of the drifts (29). It overlies the
Lemont Drift and the upper sand unit. In the cross
sections (see Figure 6), where the upper sand unit is
present, it roughly indicates the boundary between the
Lemont and Wadsworth Drifts. The gradation between
the different drift units at the Valparaiso System bound-
ary is not well defined. The Wadsworth Drift appears to
grade into the Yorkville Drift because they are very
similar in composition near the boundary (9).

The large Kankakee flood left extensive deposits of
sand and gravel and lacustrine sediments along the
Kankakee River and Des Plaines River. The flood oc-
curred as glacial meltwaters built up behind a constric-
tion at the Marseilles Morainic System to the west (30).
Large glacial lakes, which developed during the flood,
subsequently emptied into the Illinois River valley after
a breach in the moraines developed. The force of the
flood waters eroded the glacial deposits along the river
valleys, flattened the surface of the drift, and, in places,
88

-------
                        North
                                                                                                      South
                   800-1
                    Legend
                    Drift, Undifferentiated
                    Sand
                    Silurian Dolomite
                    Potentiometric Profile of
                   ' Silurian Dolomite Aquifer
               Vertical Exaggeration: 167x
                                                    450 J
                                                                                                              -450
           West
       800! D  R10E
                                                                                                                     rSOO
                                                                                                                     -750
Figure 6.  Geologic cross sections of the glacial drift and potentiometric profile of the Silurian dolomite aquifer.
                                                             89

-------
exposed the underlying bedrock. The flood event formed
thin, dispersed lake plain deposits of silt, clay, and sand
in southwestern Will County. Some lacustrine deposits
lie between morainic ridges in southern Cook County
where small glacial lakes developed as the Valparaiso
Moraines were being deposited (8).

Figure 7 shows the locations of some of the  surficial
materials. Sands and gravels were also deposited along
tributary creeks and in abandoned  channels that once
connected the Du Page River and  Des Plaines River
north of their present juncture. Wind has  reworked the
surficial sand deposits forming  low dunes  along  the
Kankakee  River in southern Will County.  Masters (31)
classified the sand  and gravel deposits of the  area by
their origin, indicating that most of the deposits present
in the valley of the Des  Plaines  River formed  as well-
sorted valley train deposits. In the Kankakee River val-
ley, the sands and gravels were primarily deposited as
riverine sediments during the Kankakee flood event.
                   Sand and Gravel Isopachs

                   The sand and gravel isopach maps (see Figures 8 and
                   9) indicate the variations in thickness of the upper and
                   basal sand and gravel units. The most extensive depos-
                   its  of both  exist throughout the area overlain by the
                   Valparaiso Morainic  System. This may be associated
                   with  bedrock control on the  formation of the moraines
                   and associated deposits referred to earlier.

                   The thickest deposits lie in the buried Hadley Bedrock
                   valley where thicknesses of both units can exceed over
                   100 feet. The upper sand unit may be found in the glacial
                   drift  within  a wide  range of elevations. For  mapping
                   purposes, we defined the upper sand unit as a sand unit
                   greater than 1 foot thick that occurs between two fine-
                   grained layers. The basal sand unit includes all coarse-
                   grained materials that overlie the bedrock surface. Most
                   of the basal sands present west of the Des Plaines River
                   were formed as valley train deposits along the river
              T
             32
              N
                          Legend
                   Surficial Glacial Geology
                      Scale of Miles
                            5
10
n     Equality Fm - Glacial Lake  I
     Plain Deposits          I

pv-pa Erosional Meltwater Channel [
K&-F-I Cut Through Equality Fm

ffff, :j Henry Fm - Glacial Outwash f
*'•«'-'•* Sand and Gravel Deposits

     Geologic Cross Section
                                              'A*
Wendron Fm - Wadsworth Drift
- Tinley Moraine

Wedron Fm -Wadsworth Drift
- Valparaiso Morainic System

Wedron Fm - Yorkville Drift
- Minooka Morainic System

Silurian Dolomite

Strip-Mined Areas
Figure 7.  Surficial glacial geology (23).
                                                    90

-------
R8E R 1QE
•R13E RUE
°%N
-*v-
R15E
Scale of Miles
5 10
Legend
Thickness of Upper Sand

0 Isolated areas > 10 Feet

10 to 50 Feet

50 to 100 Feet

> 100 Feet
Figure 8. Thickness of the upper sand unit.

channels as the glaciers melted back. The origin of the
extensive deposits underlying the Valparaiso Moraines
is not clear. They may have been formed during early
Wisconsinan glacial events as outwash plain deposits
or they may have been deposited subglacially. The
cross sections (see Figure 6) can reveal the variability
and complexity of the sand and gravel layers as they
occur within the drift. The sand and gravel deposits very
seldomly act as aquifers in this region because almost
all wells are completed in the Silurian dolomite aquifer.
Clearly, Figures 8 and 9 indicate that some ground-
water resource potential may exist within these depos-
its.

References
1. Thwaites, FT. 1927. Stratigraphy and geologic structure of north-
ern Illinois. Illinois State Geological Survey Report of Investiga-
tions 13.
2. Bretz, J.H. 1939. Geology of the Chicago region, Part I: General.
Illinois State Geological Survey Bulletin 65.
3. Bergstrom, R.E., J.W. Foster, L.F. Selkregg, and W.A. Pryor.
1955. Ground-water possibilities in northeastern Illinois. Illinois
State Geological Survey Circular 198.

4. Bretz, J.H. 1955. Geology of the Chicago region, Part II: The
Pleistocene. Illinois State Geological Survey Bulletin 65.

5. Suter, M., R.E. Bergstrom, H.F. Smith, G.H. Emerich, W.C. Wal-
ton, and T.E. Larson. 1959. Preliminary report on the ground-
water resources of the Chicago region, Illinois. Illinois State
Geological Survey and Illinois State Water Survey Cooperative
Ground-Water Report 1.

6. Hughes, G.M., P. Kraatz, and R.A. Landon. 1966. Bedrock aqui-
fers of northeastern Illinois. Illinois State Geological Survey Cir-
cular 406.

7. Willman, H.B. 1971. Summary of the geology of the Chicago
area. Illinois State Geological Survey Circular 460.

8. Bogner, J.E. 1976. Geology for planning in northeastern Illinois,
Vol. V: Cook County. Illinois State Geological Survey Open-File
Report.

9. Larsen, J.I. 1976. Geology for planning in northeastern Illinois,
Part VI: Will County. Illinois State Geological Survey Open-File
Report.
91

-------
         R9E
R10E
                                                                                       R15E
             Scale of Miles
       0          5          10
                                     Legend
                                     Basal Sand Thickness, Feet
                                     OtolO
                                     10 to 50
                               HI  50 to 100
                               •I  >100
Figure 9.  Thickness of the basal sand unit.

10. Dynamic Graphics, Inc. 1991. Interactive surface modeling users
    guide. Alameda, CA.

11. Ekblaw, G.E. 1938.  Kankakee Arch in Illinois. Illinois State Geo-
    logical  Survey Circular 40.

12. Kolata,  D.R., T.C. Buschbach,  and  J.D. Treworgy. 1978.  The
    Sandwich Fault Zone of northern Illinois. Illinois State Geological
    Survey Circular 500.

13. Horberg, L. 1950. Bedrock topography in  Illinois.  Illinois State
    Geological  Survey Bulletin 73.

14. Visocky, A.P., M.G.  Sherrill,  and K. Cartwright. 1985. Geology,
    hydrology,  and  water quality of the  Cambrian  and Ordovician
    systems in northern  Illinois.  Illinois Department of Energy  and
    Natural Resources Cooperative  Groundwater Report 10.

15. Kolata, D.R., and A.M. Graese. 1983.  Lithostatigraphy and depo-
    sitional environments of the Maquoketa  group (Ordovician) in
    northern Illinois.  Illinois State Geological Survey Circular 528.

16. Willman,  H.B., et al.  1975. Handbook of Illinois stratigraphy. Illi-
    nois State Geological Survey Bulletin  95.

17. Willman,  H.B. 1973. Rock stratigraphy of the Silurian system in
    northeastern and northwestern  Illinois. Illinois State Geological
    Survey Circular 479.
                                              18. Foote, G.R. 1982. Fracture analysis in northeastern Illinois and
                                                  northern Indiana.  Master's thesis, University of Illinois.

                                              19. Smith, W.H. 1968. Strippable coal reserves of Illinois, Part 6:  La
                                                  Salle, Livingston,  Grundy, Kankakee, Will,  Putnam, and parts of
                                                  Bureau and Marshall Counties. Illinois State Geological  Survey
                                                  Circular 419.

                                              20. Horberg,  L., and K.O.  Emery. 1943. Buried bedrock valleys east
                                                  of Joliet and their relation to water supply. Illinois State Geological
                                                  Survey Circular 95.

                                              21. McConnel,  D.R. 1972. Bedrock topography and paleogeomor-
                                                  phology northeast of Joliet, Illinois. Master's thesis, University of
                                                  Illinois.

                                              22. Fischer, D.J. 1925. Geology and mineral resources of the Joliet
                                                  quadrangle. Illinois State Geological Survey Bulletin 51.

                                              23. Buschbach, T.C., and G.E. Heim.  1972. Preliminary geologic in-
                                                  vestigations of rock tunnel  sites for flood and pollution control in
                                                  the greater  Chicago area. Illinois State Geological Survey  Envi-
                                                  ronmental Geology Notes 52.

                                              24. Wilman, H.B.,  and J.C. Frye.  1970. Pleistocene  stratigraphy of
                                                  Illinois. Illinois State Geological Survey Bulletin 94.

                                              25. Piskin, K. 1975. Thickness of glacial drift in Illinois. Illinois State
                                                  Geological Survey  1:500,000 scale map.
                                                                92

-------
26. Berg, R.C., and J.P. Kempton. 1988. Stack-unit mapping of geo-
    logic materials in  Illinois to a depth  of 15 meters. Illinois State
    Geological Survey Circular 542.

27. Hansel, A.K., and W.H. Johnson. 1987. Ice marginal sedimenta-
    tion in a  late Wisconsinan end  moraine  complex, northeastern
    Illinois, USA. In: van der Meer, J.J.M., ed. Tills and glaciotecton-
    ics. Proceedings of an INQUA Symposium on genesis and lithol-
    ogy of glacial deposits.

28. Horberg,  L. 1953. Pleistocene deposits below the Wisconsinan
    drift in northeastern Illinois. Illinois State  Geological Survey Re-
    port of Investigations 165.
29. Johnson, W.H., and A.K. Hansel. 1985. The Lemont section. In:
    Johnson, W.H., A.K. Hansel, B.J. Socha, L.R. Follmer, and J.M.
    Masters,  eds.  Depositional  environments and correlation prob-
    lems of the Wedron Formation (Wisconsinan) in  northeastern
    Illinois. Illinois State Geological Survey Guidebook  16.

30. Gross, D.L., and  R.C.  Berg. 1981.  Geology of the Kankakee
    River system in Kankakee County, Illinois. Illinois State Geologi-
    cal Survey  Environmental Geology Notes 92.

31. Masters,  J.M. 1978. Sand and gravel resources in  northeastern
    Illinois. Illinois State Geological Survey Circular 503.
                                                               93

-------
Watershed Applications

-------
                The Watershed Assessment Project: Tools for Regional
                                  Problem Area Identification
                                         Christine Adamus
                   St. Johns River Water Management District, Palatka, Florida
The St. Johns River Water Management District of Flor-
ida recently completed a major water resources plan-
ning effort. As part of this planning effort, the St. Johns
River Water Management District created a geographic
information systems (CIS)  project called the Watershed
Assessment, which included a nonpoint source pollution
load model. This paper introduces the planning project
and the Watershed Assessment, and describes how the
results of the model are  being  used to guide  water
management activities in northeast Florida.

Background

The St. Johns River Water Management District (Dis-
trict), one of five water management districts in Florida,
covers 12,600 square miles  (see  Figure 1).  The St.
Johns River starts at the southern end of the District and
flows north; it enters the Atlantic Ocean east of the city
       District Boundary
       Major Drainage
       Basin Boundaries
Figure 1. St. Johns River Water Management District, Florida.
of Jacksonville. The cities of Orlando, Daytona Beach,
and Jacksonville  are partially or entirely within the Dis-
trict boundaries. Ad valorem taxes provide primary fund-
ing for the District.

The  District  boundaries are  somewhat  irregularly
shaped because Florida water management districts are
organized on hydrologic, not political, boundaries, which
greatly improves the District's  ability to manage  the
resources.  On the north, the  District shares the St.
Mary's River with the state of Georgia and on the south,
shares the Indian River Lagoon with another water man-
agement district.  Most of the water bodies the District
manages, however, have drainage basins that are en-
tirely contained within the District's boundaries.

Water management districts in  Florida have amassed
extensive CIS libraries, which they share with local and
statewide agencies. These libraries include basic data
layers such as detailed land use, soils,  and drainage
basins.  Districts  also  coordinate data collection and
management to ensure data compatibility.

District Water Management  Plan

All activities and  programs of the water management
districts  are related to one  or  more  of the following
responsibilities: water supply,  flood  protection, water
quality management, and  natural systems  management.

Each water management district recently completed  a
district water management plan (Plan). The main pur-
pose of these Plans is to provide long-range guidance
for the resolution of water management issues. The
Florida Department of Environmental Protection will use
these five Plans as the basis for a state water manage-
ment plan.  Each water management district used the
same format, which comprised the following components:

• Resource assessment: What  are  the  problems and
  issues  related  to each of the four responsibilities
  listed above?
                                                 97

-------
• Options evaluation:  What options are available for
  addressing the problems?
• Water  management policies: What existing District
  policies influence the decisions that must be made?
• Implementation  strategy: What is the best plan for
  addressing the problems?

The Watershed Assessment Project

The District created the Watershed Assessment project
as part of is resource assessment. This  CIS project
examines the entire District to identify problems related
to flood protection, ecosystems protection, and surface
water quality.
The flood protection component is the only part of the
Watershed Assessment that is  not complete. It will in-
volve  simple overlays of floodplain boundaries with ex-
isting  and future land  use. Floodplain boundaries  are
defined as  Federal  Emergency Management Agency
(FEMA) flood insurance rate map 100-year flood hazard
areas. In many areas, these designations are not very
accurate, yet we  decided to  proceed with their use
because  they  are the best available  information for
many parts of the District. In areas where little hydrologic
information is available and where the District has not
conducted any  related studies, the FEMA  data are a
helpful starting point. This echoes a theme of the Water-
shed Assessment project: the assessment  is primarily
intended  to fill in gaps where we have  not performed
previous resource assessments, not to supplant existing
information.
The ecosystems protection component of  the Water-
shed Assessment is based heavily on a project identify-
ing priority habitat in Florida, conducted by the Florida
Game and Fresh Water Fish Commission (1). It is similar
to gap analyses that the U.S. Fish and Wildlife Service
currently is conducting in many parts of the country. For
the Watershed Assessment, we modified the data some-
what and examined ways to protect the habitat in coop-
eration with local agencies.
The surface water quality component of the Watershed
Assessment has two main parts. The first uses water
quality data from stations that have been spatially refer-
enced so that we can map them and combine the infor-
mation with other information, such as the second part
of the water quality component. This second part is a
nonpoint source pollution  load  model,  which is dis-
cussed in more detail below.

The Pollution Load Screening Model

The nonpoint source pollution load model is the Pollu-
tion Load Screening Model (PLSM), a commonly used
screening tool in Florida.  It is an empirical  model that
estimates annual loads to  surface waters  from storm-
water runoff. Our goal in  designing this model was to
identify pollution load "problem areas" for examination in
the Plan.

In these types of models, annual pollutant loads are a
function of runoff volume and mean pollutant concentra-
tions commonly found in  runoff. Runoff volume varies
with soil and  land  use, while  pollutant concentrations
vary with land use. For the PLSM, pollutant concentra-
tions were derived from studies conducted solely in Flor-
ida. A report describing the model in detail is available (2).

Usually, this kind of model combines CIS with a spread-
sheet: the CIS supplies important spatial information
that is input into a spreadsheet where the actual calcu-
lations are made. The PLSM is different, however,  be-
cause  we  programmed  it entirely within CIS.  The
District's  CIS software is  ARC/INFO, and the  model
employs an ARC/INFO module called GRID, which uses
cell-based processing and has analytical capabilities
(3). All the model calculations  are done in  the CIS
software, resulting in a more flexible model with useful
display capabilities.

Model input consists of grids, or data layers,  with a
relatively small cell size (less than 1/2 acre). We chose
this cell size based on the minimum mapping unit of the
most detailed input data layer (land use) and the need
to retain the major road features. The model has four
input  grids:  land use, soils,  rainfall,  and watershed
boundaries. For any given cell, the model first calculates
potential annual runoff based on the land use, soil, and
rainfall in that cell. It then calculates annual  loads by
applying  land-use-dependent  pollutant concentrations
to the runoff.

For this model:

• Land use is from 1:24,000-scale aerial photography
  flown in 1988 and 1989. The model incorporates 13
  land use categories.

• Soils  are  the  Soil  Conservation  Service  (SCS)
  SSURGO database, which corresponds to the  county
  soil  surveys. The PLSM uses  the hydrologic group
  designation of each soil type.

• Rainfall was taken from a network of long-term  rainfall
  stations  located throughout the District.

• Watersheds  were delineated  by the United  States
  Geological  Survey  (USGS)  on  1:24,000-scale,
  7.5-minute  maps and digitized.

Model output  consists  of a runoff grid and six pollutant
load grids. We  calculated  loads for total phosphorus,
total nitrogen,  suspended  solids,  biochemical oxygen
demand,  lead,  and zinc. We chose these pollutants
because reliable data were available and because they
characterize a broad range of nonpoint pollution-gener-
ating land uses,  from urban to agricultural. The  model
                                                  98

-------
calculates runoff and loads for any point in space, allow-
ing the user to see the spatial distribution of loads. An
example of a total phosphorus load grid for one sub-
basin in the Jacksonville, Florida vicinity is shown in
Figure 2.

The grids themselves provide a detailed view of model
output. Model results can also be summarized by water-
shed, using the watershed boundary grid, and the infor-
mation can be examined from a basinwide perspective.

We have applied PLSM results in other useful ways at
the District. For example, District staff felt that previous
sediment sampling sites were not appropriately located,
so the District water quality network manager used
model results to locate new sampling sites, focusing on
problem areas as well as areas where we expect to see
little or no nonpoint impact.
Figure 2. Distribution of total phosphorus loads, Ortega River
subbasin (darker areas represent higher loads).

Application of Model Results in the Plan

Because the goal of the model was to identify potential
stormwater runoff problem areas, we needed to simplify,
or categorize, the model results for use in the Plan. We
calculated the per acre watershed load for each pollut-
ant and defined "potential stormwater runoff problem
areas" as those individual watersheds with the highest
loads for all pollutants. Problem areas for one major
basin in the District, the lower St. Johns River basin, are
depicted in Figure 3.

We also ran the model with future land use data ob-
tained from county comprehensive plans. Because the
Potential Stormwater
Runoff Problem Areas
Figure 3. Potential stormwater runoff problem areas, lower St.
Johns River basin.
county maps are guides to future development, and not
predictions of actual development, we exercised caution
when using the results. Problem areas were defined as
those watersheds with projected loads greater than or
equal to existing problem areas. Also, District planners
combined model results with information about individ-
ual counties' regulations and policies to evaluate where
problems are most likely to occur.

Prior to compiling the Plan, the District conducted work-
shops in each county in the District, in which problem
areas identified by the PLSM were discussed with local
agency staff, officials, and the public. We provided large,
hard copy maps depicting stormwater runoff problem
areas combined with results of a separate water quality
analysis on county-based maps. These maps proved to
be powerful tools for initiating discussions and gathering
feedback.
99

-------
In the Plan, stormwater runoff problem areas were re-
ported for each of the 10 major drainage basins in the
District. The information was also repackaged  in  a
county-based format to create a quick reference for local
agencies. District planners recommended strategies for
addressing problems; these strategies vary as appropri-
ate for each county. Examples include the need to as-
sess  compliance with  existing stormwater  permits,
encourage stormwater reuse  during the stormwater and
consumptive use permitting processes, coordinate with
municipalities that are implementing stormwater man-
agement plans,  encourage and assist significantly af-
fected  municipalities to create stormwater utilities, and
improve monitoring in problem areas that do not have
sufficient water quality data.

In conclusion, the Watershed Assessment CIS project
has proved to be useful not only to the St. Johns River
Water Management District, but also to local govern-
ments. Large  projects such as this could not be com-
pleted in a reasonable time without the use of CIS. Also,
for ARC/INFO users who have been restricted to vector
processing, the cell-based processing available in GRID
is  a powerful modeling tool.

References

1.  Cox, J., R. Kautz, M. MacLaughlin, and T.  Gilbert. 1994. Closing
   the  gaps in  Florida's wildlife habitat conservation system. Florida
   Game and Fresh Water Fish Commission, Office of Environmental
   Services, Tallahassee, FL.
2.  Adamus, C.L., and M.J. Bergman. 1993. Development of a non-
   point source pollution load screening model. Technical Memoran-
   dum No. 1. Department of Surface  Water Programs, St. Johns
   River Water Management District, Palatka,  FL.
3.  ESRI. 1992. Cell-based modeling with GRID. Redlands, CA: En-
   vironmental  Systems Research Institute, Inc.
                                                    100

-------
   Watershed Stressors and Environmental Monitoring and Assessment Program
                    Estuarine Indicators for South Shore Rhode Island
                               John F. Paul and George E. Morrison
           Environmental Research Laboratory, U.S. Environmental Protection Agency,
                                    Narragansett, Rhode Island
Abstract

The U.S. Environmental Protection Agency has initiated
the Environmental Monitoring and Assessment Program
(EMAP),  a  nationwide ecological research, monitoring,
and assessment program whose goal is to report on the
condition of the nation's ecological resources. During the
summers of 1990 through 1993, data were collected from
approximately 450 sampling locations in estuarine waters
of the Virginian  Biogeographic Province  (mouth of the
Chesapeake Bay to Cape Cod). During  this period, sam-
pling stations were located  in the coastal ponds and
coastal area of south shore Rhode Island.

One  objective of EMAP is to explore associations be-
tween indicators of estuarine condition and stressors in
the watersheds of the sampled systems. Extensive wa-
tershed  information for south shore  Rhode Island is
available in geographic information system (CIS) for-
mat.  Watershed  stressors  along  south  shore  Rhode
Island were  compared with EMAP indicators of estu-
arine conditions using CIS analysis tools.  The indicator
values for coastal EMAP stations  (those offshore  from
coastal ponds) were associated with all of the aggre-
gated south shore watershed stressors. The coastal
pond indicator values were associated  with stressors in
the individual coastal pond watersheds. For the total south
shore watershed,  the major land use categories are resi-
dential and forest/brush  land, followed by agriculture.
Closer to the coast, residential land use is more preva-
lent,  while further from the coast, forests/brush lands
dominate. All coastal EMAP stations, with one exception,
exhibited unimpacted  benthic conditions,  indicating no
widespread problems.  For the individual watersheds, the
major land use categories are residential and forest/brush
land. The population density (persons  per square  mile)
shows an increasing trend from west to east. Impacted
benthic conditions were observed at EMAP sampling sites
in two coastal ponds. These two impacted benthic sites
appear to be organically enriched.

Introduction

Since its inception in 1970, the U.S. Environmental Pro-
tection Agency (EPA) has had the responsibility for regu-
lating, on a national scale, the use of individual and
complex mixtures of pollutants entering our air, land, and
water. The Agency's focus  during this  period centered
primarily on environmental  problems attributable to the
use of individual  toxic chemicals. Regulatory policy,
while continuing to control new and historical sources of
individual chemicals (i.e., "end of the pipe") and remedi-
ate existing pollution problems, will have to address the
cumulative  impacts from multiple stresses  over large
spatial and temporal scales.
In this decade, the focus of environmental problems, or
"scale of concern," has shifted  from point-source and
local scales to regional and  global  scales. Concurrently,
the focus  has shifted  from chemical  to  nonchemical
stressors. The threat posed by nonchemical stresses
(e.g.,  land  use, habitat alteration and fragmentation,
species  loss and  introduction) presents a substantial
risk to the  integrity of both specific populations and
ecosystems, and entire watersheds and landscapes.
The shift in the scale of concern for environmental prob-
lems presents a unique challenge for environmental
decision-making. Traditionally, environmental informa-
tion has  been collected  over local spatial  and short
temporal scales, focused on addressing specific prob-
lems, limited in the number of parameters  measured,
and collected with  a variety of sampling designs that
were neither systematic nor probabilistic. It  is not sur-
prising, then, that several scientific reviews  concluded
that the information needed to assess, protect, and man-
age marine and estuarine resources was either insuffi-
cient  or  unavailable  and  recommended a  national
                                                 101

-------
network of regional monitoring programs (1, 2). Two key
recommendations resulted from these reviews: (1) the
need  for a national monitoring program designed to
determine the status and trends of ecological resources,
and (2) the need for an assessment framework for syn-
thesizing and  interpreting the  information  being  pro-
duced in a timely manner and in a form that the public
can understand and decision-makers can use. EPA's
response to these recommendations was to institute a
long-term monitoring program, the Environmental Moni-
toring and Assessment Program (EMAP), and to adopt
a risk-based strategy for decision-making.

EMAP is a nationwide ecological research, monitoring,
and assessment program whose goal is to report on the
condition of the nation's ecological resources. During
the summers of 1990 through 1993, data were collected
from approximately 450 sampling locations in estuarine
waters of the  Virginian Biogeographic Province (mouth
of the Chesapeake Bay to Cape Cod) (3-5). During this
period, some  of the sampling stations were located in
the coastal ponds and coastal area of south shore
Rhode Island. One objective of EMAP  is to  explore
associations between indicators of estuarine condition
and stressors  in the watersheds of the sampled sys-
tems. Extensive watershed information  for south shore
Rhode Island is available in geographic information sys-
tems (CIS) format.

The intent of this paper is  to compare watershed stres-
sors with EMAP indicators of estuarine  condition along
south shore Rhode Island  using CIS analysis tools. The
indicator values for  coastal EMAP stations (those off-
shore from coastal ponds) are associated with all of the
aggregated south  shore  watershed  stressors.  The
coastal  ponds  indicator  values are associated with
stressors in the individual coastal  pond watersheds.
The project reported on in this  paper served as a pilot
for integrating watershed information with wide-scale
ecological data collected  to assess condition of estu-
arine waters.

Ecological  Risk Assessment Context

Robert Huggett, EPA's Assistant Administrator for Re-
search and Development,  is using the risk assessment-
risk  management  paradigm  as  a  framework to
reorganize the EPA research laboratories (6). Huggett is
also  reorienting the  research that EPA conducts to be
risk based  (both  human  and  ecological). The major
thrust of the research to be conducted in the EPA labo-
ratories  will be directed toward  reducing the uncertain-
ties in the risk assessment process. In this way, the risk
assessment context provides the "why" for the research
conducted.

Ecological risk assessment is defined as a process for
evaluating the  likelihood that adverse ecological effects
have occurred, are occurring, or will occur as a result of
exposure to one or more stressors (7). The value of the
risk assessment framework lies in its utility as a proc-
ess for ordering and analyzing exposure and effects
information, and in its flexibility for describing past, pre-
sent, and future risks.

One way of depicting the  ecological risk assessment
process is shown in Figure 1 (8). The key points are that
the process is continuous; the process can be oriented
in either direction, dependent upon the form of the ques-
tion or issue being addressed; and monitoring is at the
hub, providing information to all activities. The end result
of the effort is to provide better information for making
environmental management decisions.
Figure 1.  Ecological risk assessment framework (8).

Overview of EMAP and Estuarine Results

EMAP has been described as an approach to ecological
research, monitoring, and assessment (9). It is not the
only approach but is an  approach that is driven by its
goal to monitor and assess the condition of the nation's
ecological resources. The objectives of the  program to
address this goal are to:

• Estimate the current status, trends, and changes in
  selected  indicators of the condition of the nation's
  ecological resources on a regional basis with known
  confidence.

• Estimate the geographic coverage and extent of the
  nation's ecological resources with known confidence.

• Seek associations among selected indicators of natu-
  ral and anthropogenic stress and  indicators  of eco-
  logical condition.

• Provide annual statistical summaries and periodic as-
  sessments  of the nation's ecological resources.
                                                 102

-------
The approach used by the program to meet its objec-
tives and address its goal includes:

• Use of a large, regional scope that encompasses the
entire county but provides information on the scale that
federal and regional environmental managers require.

• Emphasis on ecological indicators to provide the in-
formation to assess condition (i.e., collect information
on the ecological systems themselves to determine
their condition or "health").

• A probability-based sampling design to produce sta-
tistically unbiased estimates on condition and to pro-
vide uncertainty bounds for these estimates.

• A vision of the program as long-term, continuing into
the next century, which is consistent with the large,
regional spatial scale being addressed.

• Development through partnerships with other agencies
that have natural resource stewardship responsibility.

The estuarine component of EMAP was initiated in
1990, with monitoring in the estuarine waters of the
Virginian Biogeographic Province (mouth of
Chesapeake Bay northward to Cape Cod) (10). Figure 2
depicts the biogeographic provinces of estuarine re-
sources of the country. These provinces have been deline-
ated based upon major climatic zones and the prevailing
offshore currents (11). This is comparable with the
ecoregion approach used to describe the distribution of
terrestrial ecosystems (12). The biogeographic province
is the comparable approach for coastal ecosystems.
Monitoring in the Virginian Province continued through

Columbian
1993; monitoring was conducted in the Louisianian
Province from 1991 to 1994; monitoring was initiated in
the Carolinian Province in 1994; and monitoring will be
initiated in the \Afest Indian Province in 1995.

A suite of measurements was collected at each of the
EMAP-Estuaries sampling sites that were selected with
a probability-based sampling design (13, 14). As indi-
cated above, the measurements emphasized ecological
conditions indicators, which included biotic indicators
such as benthicand fish abundance, biomass, diversity,
and composition, and also included abiotic indicators
such as dissolved oxygen, sediment contaminant con-
centration, and sediment toxicity (15).

In the Virginian Province, approximately 450 probability-
based sampling sites were visited during the summer
periods in 1990 through 1993 using consistent indica-
tors and collection and analysis procedures. An example
of the results is shown in Figure 3, which presents the
condition of benthic resources (16). The benthic condi-
tion is reported using a benthic index, which is an ag-
gregate of individual benthic measurements that were
combined using discriminant analysis to differentiate
impacted from unimpacted sites (3, 17). The figure pre-
sents results for values of the benthic index that were
determined to be impacted. The bar chart is the stand-
ard EMAP format for results: province-scale results with
95-percent confidence intervals about estimates. The
large, small, and tidal categories refer to the strata used
in the probability-based sampling design: large systems
are the broad expanses of water such as in Chesapeake
Bay, Delaware Bay, and Long Island Sound; small
Acadian
Californian
Lousianian
West Indian
Figure 2. Biogeographic provinces used by EMAP-Estuaries (13).
103

-------
            Impacted EMAP Site
       Philadelphia
       Baltimore
     Washington

      1
•,**
l£**~   H<  ^
$T      V^p
                                         \ yf
               jS&&*>*~   ^x-,J   v_   l^jP*
            ')
                               ia jf*
    V*%&
                                                              Province    Large
                                                                                 Small
                                                                                          Tidal
Figure 3.  Condition of benthic communities in Virginian Province.

systems include the bays and harbors along the edges
of the major systems and embayments along the coast;
and large tidal rivers include the Potomac, James, Rap-
pahannock, Delaware, and Hudson Rivers.

The results indicate that 24 percent + 4 percent of the
estuarine waters  of the Virginian  Province have  im-
pacted benthic communities. The  small and tidal river
systems have proportionately more impacted area than
the large systems.

All the EMAP data are geographically referenced; there-
fore, the data can  be spatially displayed to explore
patterns. The spatial display  of the impacted benthic
community information is a simple spatial analysis of the
EMAP data. This analysis shows that the impacted ben-
thic resources are distributed across the entire province,
with  more impacted sites  in the vicinity of the major
metropolitan areas.

In addition to analyzing the EMAP results at the regional
scale, analyses have been conducted at the watershed
scale (see  Figure 4).  The probability-based sampling
design permits the data to be aggregated (poststratified)
in ways other than the way  the  original design  was
stratified. The only restriction to the aggregation is the
number of available sample sites for the aggregation; a
                                    small number of sites leads to large uncertainties in the
                                    results. Figure 4 shows the aggregation for four major
                                    watersheds: Chesapeake Bay, Delaware Bay, Hudson-
                                    Raritan system, and Long Island Sound. This watershed
                                    scale is close to the  practical scale at which environ-
                                    mental  management  decisions are  implemented.  The
                                    data need to be analyzed at smaller scales, however, to
                                    focus on environmental management of the smaller wa-
                                    tersheds (e.g., contaminated sediments). This leads into
                                    the  need to conduct the pilot project addressing water-
                                    shed information.

                                    South Shore Rhode Island Pilot Project

                                    EMAP's third objective relates to exploring associations
                                    between indicators of estuarine condition and water-
                                    shed stressors.  Note that the  word "watershed"  was
                                    added.  One way to address environmental management
                                    remediation strategies is to look at the watershed activi-
                                    ties that could possibly be modified or changed to im-
                                    prove estuarine conditions.

                                    Watershed stressors for the estuarine environment in-
                                    clude land-based  sources of pollution, such as point
                                    sources of pollution, and land use activities (i.e., how the
                                    land is actually used, including landscape  patterns).
                                                  104

-------
\
Figure 4.
LIS
Condition of benthic communities in major watersheds in Virginian Province (Chesapeake Bay, Delaware Bay, Hudson-
Raritan system, and Long Island Sound).
Which of these stressors is more important for a particu-
lar situation depends on the types of estuarine impact
(localized or systemwide) and the management ques-
tion that is being addressed.

The specific objective of the south shore Rhode Island
pilot project was to compare watershed stressors with
EMAP indicators of estuarine condition using CIS analy-
sis tools. This project was not intended to be a definitive
study by itself of south shore Rhode Island butto explore
the process necessary to undertake the comparisons, to
investigate the feasibility of pulling the necessary infor-
mation together, and to identify potential problems before
undertaking this comparison on a much larger scale.

The south shore Rhode Island study area is depicted in
Figure 5. This coastal area drains into the coastal waters
of Block Island Sound. The project was intentionally
restricted to a limited geographic area to avoid being
overwhelmed with the tremendous volumes of data that
could have been encountered.

All data sources used in this project were available
electronically. Digitized U.S. Geological Survey quad
maps were available from the Rhode Island Geographic
Information System (RIGIS) at the University of Rhode
Island. The National Pollutant Discharge Elimination
System (NPDES) was available for major dischargers
from the National Oceanic and Atmospheric Administra-
tion's National Coastal Pollution Discharge Inventory
(18). The 1990 census was also available from RIGIS.
The EMAP 1990 through 1993 estuarine data were
available from the EMAP-Estuaries Information System
at the EPA Environmental Research Laboratory in Nar-
ragansett, Rhode Island. The RIGIS data were already
available as ARC/INFO coverages. The NPDES and
EMAP data had to be converted to ARC/INFO point
coverages.
Two approaches were used to conduct spatial analyses.
Buffer zones at 1, 3, 5, 10, and 20 kilometers from the
south coast of Rhode Island were created and used to
clip the south shore area coverages (e.g., land use,
population, point sources). The watershed boundaries
of three south shore coastal ponds were manually de-
lineated and used to clip the south shore area cover-
ages. The ponds were Quonochontaug, Ninigret, and
Point Judith (west to east).
105

-------
                                                                Point Judith Pond . Vji
                                                  Ninigret Pond
                              Quonochontaug Pond
                                                            Block Island Sound


                                                                   1
Figure 5.  South shore Rhode Island study area.

South Shore Rhode Island Pilot
Project Results

The results for land use by distance from the coast are
presented in Figure 6. These results give the broad-scale
coastal perspective. For the total south coast watershed,
the major land  use categories are residential and  for-
est/brush land, followed  by agriculture. Closerto the coast,
residential land use is more prevalent, while farther from
the coast, forests/brush lands dominate.  Population (see
Figure 7) increases with  distance from the coast, but popu-
lation  density does not appear to be a function of distance
from the coast. Only one out of five coastal EMAP stations
exhibited impacted benthic conditions, indicating no wide-
spread benthic problems in  the coastal waters. The one
station that was classified as impacted was dominated by
an extremely high number of individuals of one species.

A smaller scale  view can be  gained by looking at the
results for  the  individual watersheds.  This  view  also
provides an east-west perspective compared with the
south-north  perspective  with  the  distance from  the
coast. Again, for the individual  watersheds, the major
land use categories are residential and forest/brush land
(see Figure 8). The population increases from west to
east, and population density shows an increasing trend
from west  to  east (see  Figure  9). Impacted benthic
conditions  were observed at the EMAP stations  in
Quonochontaug and Point Judith  Ponds. These stations
exhibited organic enrichment (total organic carbon in the
sediments  exceeding 2 percent), possibly from histori-
cally improperly treated sewage.  No benthic data were
available for the Ninigret Pond  station; however, dis-
solved oxygen was observed to  be low at this station.
No major NPDES point sources are located in the coastal
pond watersheds, although two are located on the eastern
edge of the Point Judith Pond watershed boundary.

Discussion

A pilot project was conducted for south shore Rhode
Island to compare watershed stressors with EMAP indi-
cators  of estuarine condition. The results indicate that
such a comparison can be  accomplished, with the wa-
tershed information providing a  qualitative link to the
estuarine conditions observed. One potential problem is
the need to delineate the watershed boundaries for  all
                                                   106

-------

CO
(U
CO
•5
-------
2,500 -
2,000
1,500
in

-------
4. U.S. EPA. 1994. Statistical summary: EMAP-Estuaries Virginian
Province (1991). EPA/620/R-94/005. Narragansett, Rl.

5. U.S. EPA. 1994. Statistical summary: EMAP-Estuaries Virginian
Province (1992). EPA/620/R-94/019. Narragansett, Rl.

6. Newman, A. 1995. A new direction for EPA's Office of Research
and Development. Environ. Sci. and Technol. 29(3): 126A-129A.

7. U.S. EPA. 1992. Framework for ecological risk assessment.
EPA/630/R-92/001. Washington, DC.

8. Gentile, J.H., J.F. Paul, K.J. Scott, and R.A. Linthurst. 1995. The
role of regional monitoring in ecological risk assessment: An
EMAP example for watersheds. Manuscript in review.

9. Messer, J.J., R. Linthurst, and W. Overton. 1991. An EPA program
for monitoring ecological status and trends. Environ. Monitor, and
Assessment 17(1):67-78.

10. Paul, J.F., K.J. Scott, A.F. Holland, S.B. Weisberg, J.K. Summers,
and A. Robertson. 1992. The estuarine component of the U.S.
EPA's Environmental Monitoring and Assessment Program.
Chemistry and Ecology 7:93-116.

11. Bailey, R.G. 1983. Delineation of ecosystem regions. Environ.
Mgmt. 7:365-373.
12. Omernik, J.M. 1987. Ecoregions of the conterminous United
States. Annals of the Assoc. of Amer. Geographers 77:118-125.

13. U.S. EPA. 1990. Near coastal program plan for 1990: Estuaries.
EPA/600/4-90/033. Narragansett, Rl.

14. U.S. EPA. 1991. Design report of the Environmental Monitoring
and Assessment Program. EPA/600/3-91/053. Chervils, OR.

15. U.S. EPA. 1990. Environmental Monitoring and Assessment Pro-
gram: Ecological indicators. EPA/600/3-90/060. Washington, DC.

16. U.S. EPA. EMAP-Virginian Province four-year assessment (1990-
1993). EPA report in review.

17. Engle, V.D., J.K. Summers, and G.R. Gaston. 1994. A benthic
index of environmental condition of Gulf of Mexico estuaries.
Estuaries 17(2):372-384.

18. NOAA. 1993. Point-source methods document. NOAA technical
report. Silver Spring, MD: Strategic Environmental Assessments
Division, National Ocean Service, National Oceanic and Atmos-
pheric Administration.

19. Comeleo, R.L., J.F. Paul, P.V August, J. Copeland, C. Baker, S.
Hale, and R.W Latimer. 1995. Relationships between landscape
stressors and sediment contamination in Chesapeake Bay estu-
aries. Manuscript in review.
109
-------
GIS Watershed Applications in the Analysis of Nonpoint Source Pollution
Thomas H. Cahill, Wesley R. Horner, and Joel S. McGuire
Cahill Associates, West Chester, Pennsylvania
Introduction

Geographic information systems (GIS) have been used
to evaluate the impact of nonpoint source (NPS) pollu-
tion in a variety of watersheds and drainage systems
over the past 20 years (1-6). During that period, our
understanding of the sources and hydrologic transport
mechanisms of NPS pollutants, both in particulate and
soluble forms, has greatly increased (7-9). Our ability to
create and manipulate land resource data, however, has
advanced at a far more dramatic rate. Whereas 20 years
ago, both computer system capabilities and peripheral
hardware limited the process of encoding, storing, and
displaying spatial data, today we can encode land re-
source data, analyze it, and produce stunning visual
displays at a relatively low cost.

The question is: what has this experience told us regard-
ing the yet unresolved problem of water quality degra-
dation from NPS pollution in our streams, lakes, and
coastal waters (10, 11)?

The purpose of this paper is to report on several recent
studies of this nature that created a GIS as a tool to
analyze NPS pollution. This paper will not cover all
aspects of these studies; detailed reports on each pro-
ject are available from the authors or respective clients.
The objects of these studies were:

• A medium-sized lake draining a fairly small watershed

• A riverine system with multiple use impoundments

• A 100-mile stretch of Atlantic coastal estuary

These water bodies all have one common ingredient:
NPS pollution significantly affects them. While the pri-
mary focus of these studies was to understand the
dynamics of surface water quality, and specifically the
NPS component, the further objective was to document
the causal link between identified water resource prob-
lems and the watershed-wide management actions
needed for their remediation. Thus, GIS serves not only
as a mechanism for analysis of NPS pollution sources
but also as the tool by which to evaluate alternative
methods that would reduce or prevent this pollution.
Study Concepts

These three studies illustrate different approaches to
both aspects of this problem. In the 93-square-mile Up-
per Perkiomen Creek watershed (UPW) study, the ob-
jective was to develop a management program that
would reduce nutrient load in a system of reservoirs at
the base of the watershed. An essential element in the
analysis underlying GIS design (ARC/CAD) was to be
able to differentiate and evaluate pollution sources in the
watershed, while providing the technical basis for an
innovative and far-reaching management program on all
levels of government; that is, GIS was used not only to
analyze the problem but to help formulate the solution.

In the more focused Neshaminy Creek study, Cahill
Associates (CA) designed a detailed pixel/raster format
for GIS to support detailed hydrologic modeling (12) and
NPS loading analysis. This study, carried out under
Pennsylvania's Act 167 stormwater management pro-
gram, was under a legal requirement to translate tech-
nical findings into subdivision regulations that all 30
watershed municipalities would adopt. This mandate
required much more geographically specific rigor in the
GIS approach and in the management recommenda-
tions the law stipulated.

These two projects (see Figure 1), when taken together,
illustrate the critical relationship between understanding
the appropriate level of detail in GIS system design, GIS
development with modeling and other analytical require-
ments, and ultimately, the proposed management ac-
tions for watershed-wide implementation.

In the New Jersey Atlantic Coastal Drainage (ACD)
study, the objective was to document more completely
the magnitude and sources of NPS pollutants, espe-
cially nutrients, entering New Jersey estuarine coastal
waters. The GIS design placed special attention on the
role of urban or developed land uses situated along the
coastal fringe, particularly the maintained or landscaped
portions of developed sites. Most previous studies have
largely ignored this factor. Instead, they have focused
water quality analysis typically on NPS loadings as a
110
-------
Upper Perkiomen
Watershed
Pennsylvania
N. HAMPTON

-x ^ Vl
LEHIGH

Perfiomen Creek Watershed^"y
Figure 1. Regional location of Upper Perkiomen and Neshaminy
basins.
function of impervious area coverage, with the assump-
tion that loadings increase as imperviousness increases.

On the contrary, the CA thesis states that certain pollut-
ant loadings, such as nutrients, maximize in areas with
relatively moderate densities (1/2- to 1-acre lots) and
percentage impervious cover but with large maintained
lawnscapes. Because sandy soils allow soluble NPS
pollutants to pass as interflow to points of surface dis-
charge with surprising ease, they exacerbate the prob-
lem of nutrient applications in typical coastal drainage
areas. CIS application in this case enabled estimation
of the nutrient loading to coastal waters. Existing fertil-
ized lawn areas were calculated to be a significant
source of nutrient pollution, with loadings from new land
development posited as an even more serious problem
for New Jersey's coastal waters. CIS was then applied
to evaluate the suitability of various best management
practices (BMPs), based on the physical and chemical
properties of the soil mantle and the existing and antici-
pated land use.

The Upper Perkiomen Creek
Watershed Study

Background

The UPWin southeastern Pennsylvania is a tributary of
the Schuylkill River in the Delaware River basin (see
Figure 2). Serious eutrophication problems occurring in
the system of reservoirs lying at the base of this rela-
tively rural watershed prompted the study. The study
New Jersey

HUNTERDON
Upper Perkiomen Creek Water:
-\,—.-—
SCHUTLKIU.
Scale in Miles
Figure 2. The Perkiomen Creek watershed in the Delaware River
basin.
effort evolved from concerns on the part of the Delaware
Riverkeeper, a private nonprofit environmental organi-
zation dedicated to promoting the environmental well-
being of the Delaware River watershed. The Upper
Perkiomen Creek has experienced various water quality
problems, especially the eutrophication of Green Lane
Reservoir, a large raw water supply storage reservoir
(see Figure 3). Green Lane's highly eutrophic condition
has been a constant since shortly after initial construc-
tion over 35 years ago, but the relative importance of
NPS inputs has dramatically increased. Whereas 10
years ago point source input was the major source of
phosphorus, elimination of some point sources and ad-
vanced waste treatment for others has greatly reduced
that component of pollutant loading, while NPS sources
have remained constant or increased. Current analysis
indicates that NPS pollution constitutes over 80 percent
of the annual load of phosphorus (see Figure 4) into the
Green Lane Reservoir and is well in excess of the
desired loading to restore water quality (see Figure 5).

Nonpoint Source Analysis

Calculating the NPS load was an essential ingredient in
the study and relied on developing accurate measure-
ment of NPS transport during stormwater runoff periods.
Certain pollutants, specifically those associated with
sediment and particulate transport such as phosphorus,
have produced a "chemograph" that parallels but does
not exactly follow the traditional form of the hydrograph
(see Figure 6). The pollutant mass transport associated
with this runoff flux frequently constitutes the major frac-
tion of NPS discharge in a given watershed (8, 13). In
111
-------
T3 ^
CD &1
si ro
(/) T3

a i
-'a i
i
':

•
•f
:/ ' ^
.."V
•" »
o
Z

D)
0)
'•8
(D
Q.
W

OJ
CL
'Jb-
...... „
S
•1 v
£•
ro
T3
CD
X
Figure 3. Green Lane Reservoir in the Upper Perkiomen watershed, 814 acres, 4.3 BG.
112
-------
Dustfall
2.50%
500 Pounds
per Year
Waterfowl
2.50%
500 Pounds
per Year
Nonpoint Sources
Dry Flow
5,604 Pounds
perYi
Nonpoint Source
Total = 84%
Point Sources
Direct 2,486
Pounds per Year
Point Sources
to Tributaries 592
Pounds per Year

Direct Drainage
1,172 Pounds
Per Year
Nonpoint Sources
Storm Flow 8,036
Pounds per Year
Total Load = 18,889 Pounds per Year
Figure 4. Sources of total phosphorus mass transport into the
Green Lane Reservoir from the Upper Perkiomen wa-
tershed (71 square miles) in an average flow year—in
pounds per year.
20,000

18,000
Average Year, Riverkeeper-
1993 = 18,889 Pounds per Year
Eutrophic = 8,416 Pounds per Year
Oligotrophic = 4,208 Pounds per Year
Riverkeeper -1993 Eutrophic
Oligotrophic
the UPW study, operating continuous sampling stations
at two key gage locations above the reservoir allowed
the measurement of stormwater chemistry of this type
and produced estimates of wet weather transport of phos-
phorus and sediment. Surprisingly, the NPS transport
during dry weather, calculated by subtracting the point
sources, was also significant and is attributed to live-
stock discharges and septage drainage.

But the wet weather proportion of NPS pollution still
dominates lake water quality. Many have said that water
quality in a given watershed is a function of land use,
but that statement is as unsatisfying as saying that
runoff is a function of rainfall. Experience has taught us
that neither process is quite that simplistic, nor does
either follow a direct linear relationship of cause and
effect. The causal mechanisms that generate a certain
mass load of pollutant in a drainage basin certainly
result from how much mass of that pollutant is applied
to the landscape within the drainage, which in turn is
scoured from the landscape during periods of surface
saturation, transported in, and diluted by runoff. The end
result is a concentration of pollutant in the stormwater
that might be several orders of magnitude greater than
during dry weather flow, the hydrologic period tradition-
ally used to measure and define water quality.

Developing NPS analysis or algorithms for stormwater
quality modeling requires replicating the specific hydro-
graph and its associated chemograph, as well as defin-
ing the mechanisms by which pollutants are scoured
from the land surface, transported in runoff, and pass
through the river system. Total phosphorus (TP), for
example, is transported with the colloidal soil particles
(see Figure 7), so sediment transport and deposition
constitute a key mechanism.
Figure 5. Reduction in annual phosphorus load required to
achieve improved trophic level.
3/5/930:00 3/5/9312:00 3/6/930:00 3/6/9312:00
Figure 6. Storm hydrograph in the Upper Perkiomen watershed
illustrating the dramatic increase in total phosphorus
and suspended sediment during runoff.
Adding to these complications is the question of whether
to model single or multiple events. Is the chemodynamic
process one in which the transport takes place over a
series of storm events, so that each storm moves the
pollutant mass a given distance in the drainage and then
allows it to settle in the channel only to resuspend it with
the next peak of flow? Or does the total mass transport
occur in one single dynamic, from corn field or suburban
lawn to lake, estuary, or other sink, that is hours or days
downstream in the drainage? The issue of how storm-
water transport of pollutants takes place is of paramount
importance in current planning and regulatory imple-
mentation (11) because many of our current BMPs are
relatively ineffective in removing NPS pollutants. This
understanding is critical even as we attempt to intervene
in the pollutant generation process by changing the way
we cultivate the land, fertilize our landscapes, or for that
matter, how we alter the land surface during growth.
113
-------
3,500
3,000
2,500
Q_
Q_
Q_
Q.
2,000
1,000
500
Main Branch rA2 =
0.96 65 Samples
Regression
500 1,000 1,500

Suspended Solids (PPM)
2,000
2,500
Figure 7. The relationship between total phosphorus and suspended sediment concentrations during runoff is strong but varies
with different watersheds.
GIS Evaluation

The GIS data files on land use/land cover that were
created for the UPW show that the bulk of the area is
still quite undeveloped and rural (see Figure 8), with the
steeply sloped and igneous rock areas in the headwa-
ters in forest cover (38 percent) and the valleys in mixed
agriculture (44 percent). The urban/suburban land com-
poses the remaining 18 percent and largely consists of
several older, historic boroughs linked together in a
lineal pattern with widely scattered, low-density residen-
tial areas. Much of the existing housing is turn-of-the-
century at quite high densities, mixed with a variety of
commercial and other uses. This pattern contrasts
sharply with typical large-lot suburban subdivisions. In
fact, these watershed boroughs resemble the "village"
concepts that innovative planning theorists advocate in
a variety of important ways.

The watershed (see Figure 9) is blessed, or cursed,
depending upon one's perspective, with a multiplicity of
local governments including four different counties and
18 different municipalities. This arrangement poses spe-
cial challenges for management program implementa-
tion. Population projections indicate that additional
development will occur at moderate rates throughout the
watershed, reflecting recent trends.

Farming, both crop cultivation and dairying, is a major
existing land use in the watershed, although agriculture
is not especially robust and appears to be declining. This
lack of agricultural vibrancy becomes a major factor in
determining how to impose additional management
measures on agricultural pollution sources. GIS tabula-
tion of agricultural land totals some 19,000 acres above
the reservoir, which can be compared with the estimated
TP and suspended solids (SS) mass transport reaching
the lake. Considering only the agricultural land to be
the source of this NPS input (not quite true) suggests
an average annual yield of 180#/acre/year-SS and
0.22#/acre/year-TP.

This sediment/phosphorus yield is more than sufficient
to maintain a eutrophic condition in the reservoir system.
The problem with this yield, however, is that it is two
orders of magnitude less than commonly accepted
methodologies of soil erosion, such as the universal soil
loss equation (14), would suggest might come from such
a watershed. Analysis of the cultivation practices taking
place on farmland in the watershed estimates soil ero-
sion to be approximately 5 to 10 tons per acre or more
per year, far more than is observed passing out of the
basin into the reservoir. The phosphorus applications on
both cultivated and maintained residential landscapes
also appear much greater than the mass transport actu-
ally measured in the flowing streams, which represent
perhaps 7 percent or less of the annual land application.

The implication for NPS analysis is that the standard
shopping list of either agricultural or urban BMPs might
only reduce the mass transport by a relatively small
fraction, even if successfully applied throughout the
drainage. As Figure 7 illustrates, most of the phosphorus
transport occurs on the colloidal fraction of sediment
particles, which tend to remain in suspension as storm-
waters pass through conventional detention structures,
terraces, or grassed swales.

To consider more radical measures, GIS was used to
determine possible alternatives, such as creating a
stream buffer system (see Figure 10) with various set-
back distances from the perennial stream network, and
to evaluate how great an impact this might have on
agricultural land use and urban development. Land
114
-------
--•-•"" ~ = •"' -3%i JHr =" ~™>i-^T. -—,-, , , " ' - -- .
„„•-'""" *""' "" ~~ - —% „ SJ".•P«':!*I*~_,,.,™ .-.—, «««™,-™Bu "" ~- „

~ -zf^^^r'T^ * -,m *1; -' ^^~,-: --fgr' -T::-
s*f-r ^" .,-• -sjsf-c " '• T". _"- --». '*S*^!"Sil*7-5, -;••-^•'j^^ r55*"
.iL^~= -nJ8" ^lii*HBii'£r... " „ == . iTTl.:..• HillliMlllllLLlI' ^hmun iLTnnn-- — '"" """'"' '"'" = - ?Sfc4llhfc«fca ,"
Agricultural Land
Forested Land
Topography
Figure 8. GIS data files showing land use/land cover characteristics for the Upper Perkomien watershed.
115
-------
Fallow\Pasture

r j
[_ I Cropstrip

Orchard

Urban\Suburban

Trailerpk

Resrural

Resfarm

Golfcourse

Quarry
Figure 9. Existing land use/land cover CIS file for the Upper Perkiomen watershed. The 95-square-mile basin includes portions of
four counties and 18 municipalities.
116
-------
Legend
Forest
Meadow
Water
Developed \
\
Trailer Park
Resrural/Farm \
I
.y
i
--i-----\^^^f^p^-'V-^ - "iX "^----v-T-^v.^€!--
:--l - --•-%-^;^>:\-l|.-..-::>:..«::;:-':- .?- .-=• •, ^, .:, -j*- .- -,T- —. -.-;.™r. ~~Jr ™ "*
••¥•-•-••-*-7n-r&- • f
\ :-^ ;^r ^ii ^^,
T:- - -

/ '• y^,/^^M*SaS^i*l^tft^^
jr /...- :"u«P^"" =r =;—;--— rj&#c?r.-.v"•"•..•"".•: '; "= "-:- -:'^
Stream
Structures
Area

Detail
Figure 10. GIS analysis of stream corridors allows evaluation of riparian buffer systems, potential agricultural land loss, potential

septic system discharges, and related NFS reduction with selected best management practices.
117
-------
use at varying distances (100 feet, 200 feet, and 1,000
feet) from streams was tabulated, including all land area
in the "active" agriculture categories. This CIS documen-
tation allowed estimation of the significant NPS reduc-
tion in loadings that a riparian corridor management
program could achieve.

In the same way, CIS analysis helped estimate pollutant
loadings from malfunctioning onsite septic systems.
Counts of structures in nonpublicly sewered areas
within varying distances from the stream system were
developed using CIS data files. The nearly 300 potential
systems within a 200-foot radius of those streams drain-
ing into the Green Lane Reservoir identified in this man-
ner, with pollutant generation factors applied, became
the basis of a dry weather pollutant estimation. Although
this approach was dependent on a variety of assump-
tions, alternative approaches of evaluating the problem,
such as field visits to actual onsite systems throughout
the watershed, would not have been feasible.

For urban and suburban development, the management
focus was to estimate NPS loadings from future land
development. CIS was used to demonstrate NPS pollut-
ant load implications of future growth envisioned in the
watershed's keystone municipality, Upper Hanover
Township. Here, an increase of 15,000 residents would
convert 1,772 acres into residential, commercial, and
industrial uses. Nonpoint pollutant loadings generated
by this new land development constituted significant
increases in phosphorus, suspended solids, metals,
oil/grease, and other pollutants, and would reverse any
improvements in Green Lane Reservoir water quality
that recent wastewater treatment plant upgrades
achieved.

From a water quality perspective, future alternative land
use configurations that concentrate development and
minimize ultimate disturbance of the land surface
yielded would substantially reduce NPS pollutant load-
ings into the reservoirs. This entire process of testing
land use implications of different management ap-
proaches for their water quality impacts indicated that
pollutant loads could be minimized far more cost effec-
tively through management actions, both structural and
nonstructural, which varied from the areawide to the
site-specific.

Neshaminy Creek Watershed Stormwater
Management Study

Background

The Neshaminy Creek watershed, including 237 square
miles of mixed urban and rural land uses, lies primarily
in Bucks County, Pennsylvania, and flows directly into
the Delaware River (see Figure 1). The 1978 Pennsyl-
vania 167 Stormwater Management Act, which required
that counties prepare Stormwater management plans for
all 353 designated watersheds in the state, mandated
the Neshaminy study. This act further stipulated that
municipalities then needed to implement the watershed
plans through adopting the necessary municipal ordi-
nances and regulations. In fact, the Neshaminy study
had three water resource management objectives:

• Prevent worsened flooding downstream caused by
increased volumes of runoff from land development.

• Increase ground-water recharge.

• Reduce NPS pollutant loadings from new development.

In the initial study design, water quality and NPS issues
were secondary to flooding concerns. When Pennsylva-
nia's Stormwater management program was conceived,
the state focused on preventing watershed-wide flood-
ing. Clearly, detention basins have become the primary
mode of managing peak rates of Stormwater discharge
site-by-site in most communities. Because detention ba-
sins only control peak rates of runoff and allow signifi-
cantly increased total volumes of water discharged from
sites, however, the increased Stormwater volumes can
theoretically combine and create worsened flooding
downstream. Consequently, most Act 167 planning has
focused on elaborate hydrologic modeling designed to
assess the seriousness of potential cumulative flooding
in watersheds under study.

In the case of the Neshaminy, however, the record sug-
gested that although localized flooding could be an is-
sue, an existing network of eight multipurpose flood
control structures constructed during the 1960s served
to prevent significant flooding. Water quality certainly
was a serious Stormwater concern, however, especially
in the areas flowing into the reservoirs where recrea-
tional use had become intense. Several of the existing
impoundments were multipurpose, their permanent
pools providing critical recreational functions for a bur-
geoning Bucks County population. At the same time, the
proliferation of development in the watershed, with its
increased point and nonpoint sources, had degraded
streams and seriously affected the reservoirs. While the
total stream system in the watershed was of concern,
the future of the reservoirs came to be particularly im-
portant in developing the total Stormwater management
program for the Neshaminy watershed.
The Neshaminy lies at the heart of Bucks County, Penn-
sylvania's primary population and employment growth
county (see Figure 11). Although the Neshaminy water-
shed has already experienced heavy development, es-
pecially in the lower or southern portions, farmsteads
and large areas of undeveloped land still exist, espe-
cially in headwater areas. Agriculture has been a major
land use in the past, but farms rapidly are converting to
urban uses as the wave of urbanization moves outward
from Philadelphia and from the Princeton/Trenton met-
ropolitan areas. Growth projections indicate continu-
118
-------
HESHAMJNY CRKKK WATERSHED
STORM WATER MANAGEMENT PLAN
i^w^
?fejM&
EXISTING LAND COVER
Figure 11. Land use/land cover in the Neshaminy basin of Bucks County, Pennsylvania. The watershed covers 237 square miles

in southeast Pennsylvania.
119
-------
ation of this rapid growth and a continuing change in
existing land use/land cover, together with projected
development with in the required 10-year planning horizon.

Physiographically, the watershed spans both the Pied-
mont and Atlantic coastal plain provinces, with rolling
topography and relatively steep slopes underlain by Tri-
assic formation rock, including the Lockatong, Bruns-
wick, and Stockton formations. This bedrock ranges
from being a poor aquifer (Lockatong) to an excellent
aquifer (Stockton) where the many rock fractures allow
for considerable ground-water yields. Soils are quite
variable, ranging from good loam (hydrologic soil group B)
to clays and other types with poor drainage charac-
teristics (e.g., high water table, shallow depth to bed-
rock). A large proportion of the soils in the watershed are
categorized as hydrologic soil group C, which is mar-
ginal for many stormwater management infiltration tech-
niques (see Figure 12) and produces a relatively large
proportion of direct runoff. With an annual rainfall of 45
inches, base flow accounts for about 12 inches and
direct runoff accounts for 10 inches.

The system of eight stormwater control structures,
which were built over the past three decades under the
federal PL 566 program, have altered the hydrology of
the watershed (15). In addition, in heavily developed
portions of the watershed, impervious surfaces com-
bined with numerous detention basins prevent the bulk
of the precipitation from being recharged, and the vol-
ume of total runoff proportionally increases. An elabo-
rate system of municipal and nonmunicipal wastewater
treatment plants also adds to this alteration of the hy-
drologic cycle. These plants discharge wastewater efflu-
ent that, in some cases, constitutes the bulk of the
stream flow during dry periods. While the impact of NPS
was evident throughout the drainage, it was of special
interest in the impoundment network, especially those
impoundments that were conceived as multipurpose in
function and constitute major recreational resources in
the watershed.

CIS Design

Act 167 requirements and the needs of the hydrologic
and other modeling used in planning both heavily influ-
enced the CIS developed for the Neshaminy. Spatial
data files, including existing land use, future land use,
and soil series aggregated by hydrologic soil groups,
were created by digitizing at a 1-hectare (2.5-acre) cell
resolution. The encoding process that helped design the
CIS used a stratified random point sampling technique
that similar studies had developed and applied (1, 3).
The encoding process used a metric grid of 5-kilometer
sections, subdivided into 2,500 1-hectare cells (100 me-
ters on a side), aligned with the Universal Transverse
Mercator (UTM) Grid System. This grid appears in blue
on U.S. Geological Survey (USGS) topographic maps.
These maps served as the framework of reference for
all data compilation. Within each 100-meter cell, a ran-
domly located point was chosen (see Figure 13) at
which the specific factor was encoded as representative
for the cell, using a digitizer tablet. This approach al-
lowed extraction of the data from the respective source
documents with some rectification necessary for many
types of source maps and photographs.

The combination of soil series and cover in each cell
helped to calculate the curve number and unit runoff per
cell. The 45,000-cell data file was then used to calculate
total runoff for a range of events in each of 100 subbas-
ins that averaged 1.95 square miles each. The resultant
hydrographs, used in combination with a separate linear
data file in CIS describing the hydrographic network of
stream geometry, routed and calibrated the hydrologic
model (TR-20). NPS mass transport loadings were es-
timated on an annual basis by cell, again using the land
use/land cover data file, and total loads summed by
groups of subbasins above critical locations. This issue
was particularly important with respect to the drainage
areas above the impoundments, where NPS pollutants
were of greatest concern.
The soil properties data file was especially useful in
evaluating certain management objectives, such as the
opportunity for recharging ground-water aquifers. The
spatial variation in relative effectiveness of infiltration
BMPs was considered for both quantity and quality miti-
gation because the best methods for NPS reduction
usually include recharge where possible. The soil series
corresponding with new growth areas were classified
regarding their suitability for these BMPs, which are
most efficient on well-drained or moderately well-drained
soil. Thus, the alternative impacts of future growth could
be considered in terms of potential generation (or man-
agement) of NPS loads. A BMP selection methodology
(see Figure 14), which was developed forthe 30 munici-
palities within the watershed, focused on new land de-
velopment applications and considered both water
quantity and quality management objectives. BMP se-
lection is a function of several factors, including:
• The need for further peak rate reduction.

• The recharge sensitivity of the project site (defined as
a function of headwaters stream location, areawide re-
liance on ground water for water supply, or presence
of effluent limited streams).

• The need for priority NPS pollution controls (location
within reservoir drainage).

Development of two "performance" levels of BMP selection
techniques gave municipalities some degree of flexibility
in developing their new stormwater management pro-
grams. This system required only the minimally acceptable
techniques but recommended the more fully effective
ones, hoping that municipalities would strive to incorporate
120
-------
HVDKOLiu.U SUM <;ko|ips
Figure 12. GIS file of hydrologic soil groups in the Neshaminy basin. The 31 soil series are digitized in 45,000 pixels of 1-hectare size.
121
-------
Region With Labeled Zones
29
(4428000mN)
28

A
5

46 47 48 49
(446000mE)
290
288
286
284
282
Zone With Labeled Cells
280
460 462 464 466
468
470
Figure 13. Raster/pixel design of GIS for Neshaminy modeling
study. Each pixel is 1 hectare (2.47 acres).

recommended management measures wherever possi-
ble. The BMP selection methodology also was sensitive
to type of land use or proposed development, assigning
typical single-family residential subdivisions different
BMPs than, for example, multifamily and other nonresi-
dential proposals (including commercial and industrial
proposals). The selection process also determined size
of site to be a factor, differentiating between sites of 5
acres or more because of the varying degrees of cost
and effectiveness of different BMP approaches. The
methodology, if properly and fully implemented, should
achieve the necessary stormwater-related objectives—
both quantity and quality—that the analysis had deemed
necessary (16).

GIS was especially important in its ability to test how
reasonable the BMP selection methodology was. Such
tests included the ability to evaluate, for each municipal-
ity, the following factors:

• The nature and extent of the projected development.

• The size of development/size of site assumptions.

• Other vital BMP feasibility factors such as soils and
their appropriateness for different BMP techniques.

GIS also enabled analysis of the water quantity and
quality impacts of projected growth on a baseline basis,
assuming continuation of existing stormwater manage-
ment practices. Water quality loadings to individual res-
ervoirs and to the stream system could be readily
demonstrated. Because overenrichment of the reser-
voirs was so crucial, researchers could estimate phos-
phorus and nitrogen loadings from projected
development assuming existing stormwater practices,
even on a municipality by municipality basis.

New Jersey Atlantic Coastal
Drainage Study

Background

The third study considered a much larger coastal water-
shed in New Jersey (see Figure 15). The New Jersey

Development —
Application

Required
I See Map A I
No Peak
Rate
Factor

Hmpoundment
Drainage
1 Recharge 1
| Sensitive \
4 Non Impoundmen
Drainage
.
1 Impoundment
r~| Drainage
U Non Rechargel
| Sensitive |~
1 Non Impoundm
Drainage

1 Impoundment
|~~| Drainage
_| Non Recharge) |

\ — -

?nt ^

]

jNon Impoundment!
"I Drain age T
Not Applicable
Required: Multi-Resi and Non-Resi Overs Acres, Porous Pave. With
Underground Recharge Beds for Paved Areas and Infiltration Devices for
Non-Paved Areas, Sized for Peak; Other Uses, Infiltration Devices for
Paved and Nonpaved Areas, Sized for Peak

Not Applicable
Required: Multi-Resi and Non-Resi Overs Acres, Dual Purpose Detention
Basins for Paved/Nonpaved Areas, Sized for Peak; Other Uses, Detention
Basins Sized for Peak
Recommended: All Uses/Sizes, Porous Pave. With Underground Recharge
Beds for Paved Areas; Minimum Disturbance or Wet Ponds/Artificial
Wetlands for Nonpaved Areas, All Sized for Peak
Required: All Uses and Sizes, Porous Pave. With Underground Recharge
for Paved Areas; Minimum Disturbance for Nonpaved Areas

Required: Multi-Resi and Non-Resi Overs Acres, Porous Pave. With
Underground Recharge Beds for Paved Areas and Infiltration Devices for
Nonpaved Areas
Recommended: All Uses/Sizes, Porous Pave. With Underground Recharge
for Paved Areas; Minimum Disturbance and/or Infiltration Devices After
Site Stabilization
Required: All Uses/Sizes, First-Flush Settling Basins for Paved Areas; for
Nonpaved Areas, Minimum Disturbance/Wet Ponds/Artificial Wetlands
Recommended: for All Uses/Sizes, Porous Pave. With Underground
Recharge Beds for Paved Areas; Minimum Disturbance for Nonpaved
Areas
Required: Multi-Resi and Non-Resi Overs Acres, First-Flush Settling
Basin; Other Uses, Detention Basins (No Change)
Recommended: Porous Pave. With Underground Recharge Beds for Paved
Areas; for Nonpaved areas, Minimum Disturbance/Wet Ponds/Artificial
Wetlands
Figure 14. BMP selection methodology used with the GIS database in the Neshaminy basin modeling study.
122
-------
Atlantic Coastal Drainage (ACD) includes an area of
2,086 square miles, with barrier islands (50 square
miles), wetlands/bays/estuaries (285 square miles), and
a unique scrubby pitch pine-cedar forest, known as the
Pine Barrens, largely covering the 1,750 square miles
of mainland interior (see Figure 16). This flat coastal
plain comprises a series of unconsolidated sedimentary
deposits of sand, marl, and clay, which increase in thick-
ness toward the coastline. Over the past 16,000 years,
as the ocean level has risen, the water's edge has
progressed inland to its present position. Ocean cur-
rents and upland erosion and deposition have created a
long, narrow series of barrier islands that absorb the
energy of ocean storms and buffer the estuary habitats
from the scour of waves and currents. Between the
mainland and barrier islands are embayments and es-
tuaries of different sizes and configurations. Inland erosion
and marine sediments have gradually filled many of
these areas, creating extensive wetlands (17).

In this ACD region, new land development and popula-
tion growth have caused significant degradation of water
quality from an increase in both point source and NPS
pollution. Although the array of pollutants is ominously
broad, increased nitrogen and phosphorus loadings
have resulted in enrichment of back bays, estuaries, and
nearshore waters, contributing to algal blooms, declining
finfish and shellfish populations, diminished recreational
Atlantic
Ocean
Legend
Atlantic Coastal Drainage

Cafra Management Area

Pine Barren Region Including
'-1 ~ Cedar-Pine Fringe

Figure 15. The ACD of New Jersey includes approximately 2,000 square miles of land area from the Manasquan River to Cape May.
123
-------
Figure 16. Aerial photograph of New Jersey illustrating the Barrier Islands and estuary system situated along the Atlantic coast.
124
-------
opportunities, and a variety of other problems (18). A
major source of these nutrients is point source sewage
treatment plants (STPs), but the effluent outfalls of almost
all these STPs discharge into nearshore ocean waters
beyond the barrier islands. Thus, NPS pollutants almost
totally dominate the water quality in the estuaries and
back bays (19, 20).

These NPS pollutants, which rain scours from the land
surface and flushes into coastal waters with each rain-
fall, comprise a largely unmeasured and unmanaged
flux of contaminants. Prior research on coastal water
quality has given considerable attention to NPS pollution
generated from paved or impervious surfaces, particu-
larly roadways and parking lots where hydrocarbons,
metals, suspended solids, biologic oxygen demand
(BOD), and other pollutants have been measured.

Although these NPS pollutants are certainly of concern
in New Jersey's coastal waters, the enrichment issue
has led to a focus on NPS pollution produced when
creating large areas of pervious and heavily maintained
landscape, such as lawns and other landscaped areas,
in the sandy soil context of the coastal area. Typically,
significant quantities of fertilizer and other chemicals,
which are applied on these new pervious surfaces, are
naturally low in nutrients. Although a modest portion of
the applied fertilizer runs off directly into surface waters,
larger quantities of soluble pollutants, such as nitrates
and herbicides, quickly percolate down through the
sandy soil, then move rapidly as interflow to the estuary
system.

In this coastal drainage of unconsolidated sediments,
the hydrologic cycle differs from inland watersheds. Of
the 45-inch average annual rainfall, only a small fraction
(2.5 inches per year) becomes direct runoff, with the
balance rapidly infiltrating into the sand strata (21).
Most of the infiltration that reaches the ground water
(20 inches per year) discharges to surface streams
(17 inches per year) within a few hours following rainfall,
producing a lagging and attenuated hydrograph. This
rapid infiltration, combined with the sand texture of the
soil, has a major bearing on the water quality implica-
tions of new land development. Thus, urbanization of
coastal regions has dramatically altered hydrologic
response, with every square foot of new impervious
surface converting what had been approximately 41.5
inches of infiltration into direct runoff to bays and estu-
aries, with a turbid soup of NPS pollutants.

Even in areas that have maintained infiltration, the
coastal soils do not remove NPS pollutants as efficiently
as other areas of New Jersey that overlie consolidated
formations with heavier clay soils. These soils provide a
much more thorough removal of NPS pollutants through
physical, chemical, and biologic processes, as rainfall
percolates through the soil mantle.
With development of coastal areas, increased impervi-
ous areas and changing flow pathways (inlets and storm
sewers) convey nonpoint pollutants introduced by devel-
opment (from both pervious and impervious surfaces)
directly to the coastal waters. In addition, freshwater
recharge to the underlying aquifer decreases with the
increase in impervious surfaces, with resulting in-
creases in saltwater intrusion into the sand aquifers and
contamination of ground-water supply wells along the
coast. Further compounding the loss of the stormwater
for ground-water recharge are increased ground-water
withdrawals necessary for new watersupply. In sum, urban
growth within the ACD, with its 1.13 million permanent
residents (and still growing) and an additional 1.5 million
summertourists, has dramatically altered the natural drain-
age system (and landscape) in a way that significantly
increases the discharge of NPS pollutants (22).

GIS Approach
New Jersey's Department of Environmental Protection
already had developed a computerized GIS system
(ARC/INFO) for environmental analysis and resource
planning, so this study aimed to use existing GIS work
and to refine this GIS system. Although data files for
municipal boundaries, watershed areas, and a variety of
other factors already existed, land use/land cover data
had not been developed and constituted a major work
task. The subsequent land use/land cover file included
the entire 2,000 square miles of the ACD, but this fo-
cused on the urbanized area (212 square miles) that
occupied about 11 percent of the coastal fringe. The end
product was a polygon file that described about 2,500
polygons of urban/suburban land, each averaging about
0.1 square miles (see Figure 17).
Using aerial photographs combined with USGS base
maps and extensive field reconnaissance, each polygon
was classified by:
• Land use type.
• Percentage of impervious cover and maintained areas.
• Degree of maintenance (fertilization) being provided
to these maintained areas.
Although classifying land use type and extent of imper-
vious cover/maintained areas was a relatively straight-
forward evaluation process (rated within one of 11
categories by percentage, 0 to 5 percent, and so forth),
the third variable, degree of maintenance, required spe-
cial treatment and data development procedures. De-
gree of maintenance was translated into high, medium,
and low categories, with high maintenance exemplified
by golf courses or other intensively maintained areas.
Medium maintenance assumed chemical application
rates comparable with those recommended by Rutgers
University state agronomists. Finally, low maintenance
was typified by a wooded or otherwise naturally vegetated
125
-------
yw.'V MLAMTK COOT*. ORMHWE
• couuir BOUHOWIES
OCEANIC I15Y '.)
Figure 17. Urban land use polygons digitized for the New Jersey coastal drainage. The 2,500 polygons shown cover approximately
212 square miles (11 percent) of the ACD area of 2,000 square miles.
126
-------
lot and assumed little or no regular chemical application.
Research staff executed considerable field reconnais-
sance to objectify this judgment-based rating technique
(see Figure 18).

Nonpoint Source Analysis
Because the drainage is almost entirely estuarine, the
hydrologic aspects in this study were almost irrelevant
except as a tool to describe the pollutant transport proc-
ess. Such coastal drainage systems do not allow the
measurement of hydrographs and chemographs (see
Figure 6), except on inland riverine segments or se-
lected infrastructure points of discharge (storm sewer
outfalls). Thus, the NPS analysis focused on the pollut-
ant production and transport process, especially the
nutrients applied to the maintained landscapes, which
are a major part of coastal urbanization.
This study required a great deal of effort to produce an
indextable relating urban cover characteristics (percent-
age impervious, amount of chemical application) to NPS
production potential. For each of the 2,500 urban land
polygons CIS described, estimates of the NPS loading
for a number of pollutants were generated. Potential
loadings were then aggregated by subwatershed. Total
NPS loadings could then be compared with point source
loadings for the entire coastal drainage. Interestingly,
the NPS loading dominated water quality in the estu-
arine drainage while the point sources, discharged by
ocean outfalls to nearshore waters beyond the barrier
islands, were the major source of nutrient pollution in
this portion of the coastal environment (see Figure 19).
Given the estimates of NPS pollution, the major ques-
tion involves how to control or reduce these loads. The
suitability of selected BMPs for the reduction/prevention
of pollutant generation was then evaluated and spatially
identified within the drainage (see Figure 20). This figure
evaluated the use of constructed wetland systems as a
structural measure. That is, CIS allowed state regulators
to identify not only what works best in terms of water
quality protection measures, but also where these meth-
ods could work successfully. This analysis was driven
by a detailed evaluation of the combinations of natural
conditions CIS identified within the study area. For ex-
ample, certain BMPs can be applied on soils that have
a certain set of characteristics (permeability, depth to
seasonal high water table) and that are presently in a
given land cover and planned for urbanization.

CIS also aided in evaluating alternative BMP tech-
niques, including reduction in nutrient applications and
land management BMPs such as elimination of artificial
landscapes, again using a combination of natural fea-
tures and land use patterns (see Figure 21). The result
of this analysis considered the relative proximity of ur-
ban land uses to the coastal waters as significantly
increasing the potential for NPS transport. State regula-
tory programs establish minimum setback criteria for
development in sensitive areas, and these criteria may
be modified to consider pollutant production potential
based on CIS delineation of pollutant production.
New Jersey has been striving to develop NPS manage-
ment programs for coastal areas to reduce existing
sources of pollution as well as prevent the creation of
Legend

R = Residential Use
C = Commercial Use

# = Percent Impervious

H = High Maintenance
M = Medium Maintenance
L = Low Maintenance
Figure 18. Classification of urban polygons by land use, percentage impervious cover, and degree of land fertilization.
127
-------
\
Maryland \
Wildwood
N = 224 Tons
P = 52 Tons r!eVe"'V'^e Ocean City
N = 142 Tons N = 151Ton;
P = 33 Tons
Atlantic County
N = 1,183 Tons
P = 276 Tons
Ocean County Monmouth, Bayshore
Ocean County North Monmouth, NE
Ocean County Centra| N = 859 Tons Long Branch, Deal
N = 871 Tons P = 201 Tons Ocean Asbury Park
P = 203 Tons Soutn Monmouth
South
N = 267 Tons
P = 62 Tons
Atlantic Ocean
N = 1,966 Tons
P = 458 Tons
Figure 19. Point and NFS discharges to the ACD. Data shown are in tons of TP and NOs-N per year.
Legend
BMP Suitability
g- (1) Suitable
H- (2) Generally Suitable
M~ (3) Limited Suitability
Unsuitable Areas
D- (4) Unsuitable Soils
^~ Open Water
E3~ Urbanized Areas
Figure 20. BMP analysis using GIS. Files consider soil suitability, current vegetative cover, and BMP criteria on vacant and
developed land parcels.
128
-------
pollution. As most regulatory agencies have discovered,
NPS management programs can be difficult to imple-
ment, especially when confronting issues of land use
management. To substantiate the need for new manage-
ment programs amidst these controversies, the ability to
document causal linkages (i.e., to generate data and
statistics that make the case for NPS pollutant gener-
ators and resultant water quality degradation) is very
important. The need for documentation of various types
is especially great given the less than perfect data re-
cord of water quality in coastal and other waters. All of
these factors come together to make the value of a CIS
system for water quality management very real.

Conclusion

This CIS-driven analysis indicates that NPS pollutants,
especially the nutrients phosphorus and nitrogen, gen-
erated from fertilized fields or maintained landscapes
surrounding new residential, commercial, and other
types of development in drainage systems, contribute
significantly to water quality degradation. In effect, the
particulate-associated phosphorus and the soluble ni-
trates serve as surrogates for the full spectrum of NPS
pollutants that each rainfall washes from the land. A
comprehensive water quality management program
must include structural measures to remove pollutants
this runoff conveys, as well as management of the con-
tributing landscape to reduce (and perhaps eliminate)
the application of these chemicals within the drainage.
In planning new development, management actions
should occur on a variety of levels or tiers. On an
areawide basis, growth should proceed (with guidance
and management) in a manner that would reduce total
pollutant discharges; therefore, the total amount of
maintained area being created should be as concen-
trated as possible. On the more site-specific level,
measures and construction techniques that reduce the
quantity of pollutants generated are essential. Required
development guidelines must include, but not be limited to:

• Prevention of excessive site disturbance and ongoing
site maintenance (described as a policy of minimum
disturbance and minimum maintenance).

• Use of special materials for reduction of storm-
water runoff (porous pavement and ground-water
recharge).

• Use of stormwater treatment systems (water quality
detention basins, artificial wetlands).

In sum, the regulatory framework must contain both
"how to build" guidelines, as well as "where not to build"
guidelines. CIS can be a powerful tool in both of these
processes.

While inland lakes serve as nutrient traps for these NPS
pollutants, perhaps the greatest potential impact is the
gradual process of excessively enriching our coastal
waters. As population continues to migrate to coastal
areas, the importance of protecting this fragile ecosystem
Figure 21. For certain regulatory criteria, the proximity of land uses to the water's edge was a consideration in BMP selection.
129
-------
increases. The pollution that new land development
generates, including the discharge of point source
wastes, should not be allowed to enter coastal waters;
it should not be allowed to destroy the natural balance
that exists between land and water. The concept of
stormwater management takes on an entirely different
meaning when viewed as one of the basic mechanisms
of this NPS pollution transport. For centuries, engineer-
ing of the shoreline has intensively focused on protect-
ing human developments from the ravages of ocean
storms. Now, however, the converse seems to be
emerging: ocean waters need protection from the im-
pacts of human development.

References

1. Bliss, N., T.H. Cahill, E.B. MacDougall, and C.A. Staub. 1975.
Land resource measurement for water quality analysis. Chadds
Ford, PA: Tri-County Conservancy of the Brandywine.
2. Cahill, T.H., R.W. Pierson, Jr., and B.R. Cohen. 1978. The evalu-
ation of best management practices for the reduction of diffuse
pollutants in an agricultural watershed. In: Lohr, R.C., ed. Best
management practices for agriculture and silviculture. Ann Arbor,
Ml: Ann Arbor Science.
3. Cahill, T.H., R.W. Pierson, Jr., and B.R. Cohen. 1979. Nonpoint
source model calibration in the Honey Creek watershed. #R-
805421-01. Athens, GA: U.S. EPA Environmental Research
Laboratory.
4. Cahill, T.H. 1980. The analysis of relationships between land use
and water quality in the Lake Erie basin. Burlington, Ontario:
International Association of Great Lakes Research.
5. Cahill, T.H., J. McGuire, and C. Smith. 1993. Hydrologic and
water quality modeling with geographic information systems. Pro-
ceedings of the Symposium on Geographic Information Systems
and Water Resources, AWRA, Mobile, AL (March).
6. U.S. EPA. 1991. Remote sensing and GIS applications to nonpoint
source planning. Proceedings of the U.S. EPA Workshop for Re-
gion 5 and Northeast Illinois Planning Commission, Chicago, IL
(October 1990).
7. Baker, D.B., and J.W Kramer. 1973. Phosphorus sources and
transport in an agricultural basin of Lake Erie. Proceedings of the
16th Conference, Great Lakes Research, Ann Arbor, Ml (Septem-
ber).
8. Cahill, T.H., and T.R. Hammer. 1976. Phosphate transport in river
basins. Proceedings of the International Joint Committee on Flu-
vial Transport Workshop, Kitchener, Ontario (October).
9. Cahill Associates. 1989. Stormwater management in the New
Jersey coastal zone. Trenton, NJ: State of New Jersey, Depart-
ment of Environmental Protection, Division of Coastal Resources.
10. Delaware Riverkeeper. 1993. Upper Perkiomen Creek watershed
water quality management plan. Lambertville, NJ: Delaware
Riverkeeper/Watershed Association of the Delaware River.

11. U.S. EPA. 1993. Guidance specifying management measures for
sources of nonpoint pollution in coastal waters. EPA/840/B-
92/002. Washington, DC.

12. Soil Conservation Commission. 1982. TR-20, project formula-
tion—Hydrology. Technical Release No. 20. PB83-223768. Land-
ham, MD: Soil Conservation Service.

13. Cahill, T.H., M.C. Adams, and W.R. Horner. 1990. The use of
porous paving for groundwater recharge in stormwater manage-
ment systems. Presented at the 1988 Floodplain/Stormwater
Management Symposium, State College, PA (October).

14. Soil Conservation Commission. 1974. Universal soil loss equa-
tion. Technical notes, Conservation Agronomy No. 32. Portland,
OR: West Technical Service Center.

15. Cahill, T.H., M. Adams, S. Remalie, and C. Smith. 1988. The
hydrology of flood flow in the Neshaminy Creek basin, Pennsyl-
vania. Jamison, PA: The Neshaminy Water Resources Authority
(May).

16. BCPC. 1992. Neshaminy Creek watershed stormwater manage-
ment plan, Vol. 1: Policy document, and Vol. II: Plan implemen-
tation. Doylestown, PA: Bucks County Planning Commission
(January).

17. Clark, J. 1977. Coastal ecosystems: Ecological considerations for
management of the coastal zone. Washington, DC: The Conser-
vation Foundation.

18. New Jersey Department of Environmental Protection. 1988. The
state of the ocean: A report by the blue ribbon panel in ocean
incidents. Trenton, NJ.

19. Cahill, T.H., M. Adams, C.L. Smith, and J.S. McGuire. 1991. GIS
analysis of nonpoint source pollution in the New Jersey coastal
zone. With Whitney, S., and S. Halsey, New Jersey Department
of Environmental Protection, Division of Coastal Resources. Pre-
sented at the National Conference on Integrated Water Informa-
tion Management, Atlantic City, NJ (August).

20. Cahill, T.H., M. Adams, C.L. Smith, and J.S. McGuire. 1991.
Living on the edge: Environmental quality in the coastal zone.
With Whitney, S., and S. Halsey, New Jersey Department of
Environmental Protection, Division of Coastal Resources. Pre-
sented at the International Conference on Integrated Stormwater
Management, Singapore (July).

21. Martin, M. 1989. Ground-water flow in the New Jersey coastal
plain. Open File Report 87-528. West Trenton, NJ: U.S. Geologi-
cal Survey.

22. National Oceanic and Atmospheric Administration. 1989. Se-
lected characteristics in coastal states, 1980-2000. Rockville, MD:
U.S. Department of Commerce, National Oceanic and Atmos-
pheric Administration, Strategic Assessment Branch, Ocean As-
sessment Division (October).
130
-------
Using GIS To Identify Linkages Between Landscapes and Stream Ecosystems
Carl Richards, Lucinda Johnson, and George Host
University of Minnesota, Duluth, Minnesota
Introduction

Factors that operate on a variety of temporal and spatial
scales influence the structural and functional compo-
nents of stream ecosystems (1). Quantifying the effects
of factors that operate across multiple scales has chal-
lenged aquatic scientists over the last several decades.
Recently, scientists have recognized that they cannot
successfully protect or restore ecosystem integrity with-
out taking into account all appropriate scales; therefore,
they are focusing on understanding interactions between
terrestrial and aquatic components of entire watersheds
(2). Although awareness of the importance of watershed
and landscape-scale influences on streams is growing,
the tools to examine these influences are still in their
infancy.

Most watershed and landscape studies to date have
focused on the role watershed-scale parameters play on
water chemistry (3-5). These studies usually examined
nutrient and sediment inputs from various watershed
land covers. Methods for evaluating the patterns in the
terrestrial segment of the watershed were awkward and
laborious, involving use of planimeters or cutting and
weighing maps. More recent watershed studies have
attempted to integrate both longitudinal and lateral influ-
ences of the terrestrial ecosystems on water quality in
streams and wetlands (6-8). This approach takes advan-
tage of newly available tools (geographic information
systems and multivariate statistics) for quantifying land-
scape structure.

Relatively few studies have examined how watershed
features influence biological communities. Most studies
examining stream biota have concentrated on single
land use types (9, 10) or on the relationship between
watershed land use and stream physical habitat (11,12).
Typically, study designs have not addressed questions
concerning variability of stream communities over rela-
tively large geographic scales.

Our own work centers on identifying linkages between
landscape features (watershed scale) and stream reach
environments (physical habitat, chemistry), and relating
these parameters to major patterns of community vari-
ation. In this manner, landscape and reach environment
interactions probably control the influence that specific
landscape components have on biological communities
(13). This is the general premise when using biological
communities to assess watershed status. To represent
stream biota, we examine benthic macroinvertebrates,
which have been used extensively for biomonitoring
numerous environmental stresses (14). Macroinverte-
brates are sensitive to watershed conditions and exhibit
sufficient stability in assemblage structure over time to
make them useful as long-term monitors of stream
health (15).

This paper presents an overview of our attempts to
identify the relative strengths of landscape variables on
macroinvertebrate communities. We classify landscape
variables into two general categories. The first category,
geology and landscape structure (GEOS), considers
variables that are fixed on the landscape and are largely
uncontrollable by management activities. The second
category, land use (LU), includes variables that have
anthropogenic origins and may be influenced by land
management activities. By understanding the relative
strengths these two sets of variables possess in deter-
mining community structure, we hope to identify specific
species groups that can act as land use and land form
indicators. We also hope to identify ways to predict the
outcome of specific large-scale land management activi-
ties (e.g., silviculture, agriculture) or other large-scale
environmental changes (e.g., global warming) on stream
ecosystems.

Study Area

This study was conducted in the Saginaw River basin,
a 22,562-square-kilometer watershed in east-central
Michigan (see Figure 1) that flows into Lake Huron.
The Saginaw River watershed was chosen for this
study because its component drainages range from
heavily affected agricultural to relatively pristine areas.
131
-------
Figure 1. The Saginaw basin study area.

Dominating the soils in the lake plain are medium- and
fine-textured loams to clays, with sand found in the
outwash plains and channels. Artificial drainage and tile
systems extensively drain the clay regions. Glacial fea-
tures such as ground moraines and outwash plains are
common. The western sector is characterized by rolling
plains with coarse-textured ground moraines. This re-
gion contains a high percentage of the forested land,
while agricultural land use dominates the eastern sector.

The Saginaw basin covers 16,317 square kilometers,
including four major subbasins: the Tittabawassee
(6,734 square kilometers), Shiawassee (3,626 square
kilometers), Flint (3,108 square kilometers), and Cass
(2,331 square kilometers) Rivers. The Tittabawassee
subbasin further divides into three principal water-
courses—the Chippewa, Pine, and Tittabawassee Riv-
ers. Watersheds adjacent to Lake Huron (Kawkawlin
and East basins) are characterized by low topographic
relief and elevations averaging 203 and 206 meters,
respectively. The Flint and Chippewa/Pine basins aver-
age about 278 meters in elevation. These drainages
also exhibit the greatest variation in topography.

Study Design

The analysis covers 45 stream sites within the study
area. These sites reflect a gradient of land use and
physiographic conditions in the Saginaw River drainage.
Researchers obtained biological, chemical, and physi-
cal samples at one 200-meter stream segment at each
site. In addition, a geographic information system (CIS)
database was compiled reflecting a number of land-
scape parameters for the watershed of each stream
segment.

Sampling Methods

Macroinvertebrates

At each sampling site, we deployed Hester-Dendy arti-
ficial substrate samplers (16) for macroinvertebrate
community characterizations twice, in early summer and
during base flow conditions in late summer and fall of
1991 or 1992. We allowed samplers to colonize for 6 to
8 weeks. In the laboratory, macroinvertebrates were
counted and identified to genus whenever possible. A
series of derived variables from the original species
abundance tables was used to describe community
characteristics. We chose metrics based on their rela-
tive utility for examining macroinvertebrate communi-
ties, as suggested by Barbour et al. (17) and Karr and
Kearns (18). Because macroinvertebrate assemblages
are relatively stable through time (15) and preliminary
analysis indicated no significant differences between
sampling years at stations for which we had 2 years of
data (unpublished data), we combined macroinverte-
brate data into one database.

Chemistry

We assessed nutrients and other chemical properties
related to water quality at each stream site during sev-
eral periods in the summer and fall of 1991 or 1992.
Stream flow during fall sampling was typically less than
median flow rates and was considered to represent base
flow levels. We used the maximum values of samples
taken in June and July to represent summer conditions
and the maximum values from September and October
to represent fall base flow conditions. The nutrients
measured were ammonium (NHs), nitrate-nitrogen, total
nitrogen (TN), orthophosphate (PO4), and total phos-
phorus (TP). In addition, we assessed alkalinity (ALK),
conductivity, total dissolved solids, and total suspended
solids (TSS). Standard methods were used for all
measurements (19).

Physical Habitat

We assessed physical habitat during base flow condi-
tions at each stream site in a stream reach that is at least
8 to 12 times the width of the stream segment. A suite
of quantitative habitat structure measurements and ob-
servations was made at each site. We derived values
for six general habitat attributes:

• Substrate characteristics

• Instream cover

• Channel morphology

• Riparian and bank conditions
132
-------
• Riffle/Run quality

• Pool quality

Landscape Descriptors

Land use patterns, surficial geology, hydrography, and
elevation databases helped to quantify landscape char-
acteristics in the study area (see Table 1). Land use
patterns were derived from existing digital data at the
Michigan Department of Natural Resources (Michigan
Resource Information System [MIRIS] database) (see
Table 2). We based classification of land use/cover cate-
gories on a modified version of the Anderson (20)
scheme, which was constructed specifically for natural
resource applications. The result was the following nine
land use/cover categories:

• Urban

• Row crop/agriculture

• Other agriculture

• Herbaceous range land

• Shrubby range land

• Nonforested wetlands

• Forested wetlands

• Mixed hardwood forests

• Deciduous forests

In this region, nonrow-crop agriculture is largely repre-
sented by pasture, and range lands are predominantly
abandoned fields (old fields).

The U.S. Department of Agriculture (USDA) STATSGO
soils database enabled the compilation of soil data. The
database consists of U.S. Soil Conservation Service
(SCS) soil surveys and includes information on domi-
nant texture and drainage in large landscape units. We

Table 1. Spatial Data Used for Landscape Characterization
Data Layer Source
aggregated soils into simplified categories based on
glacial landform.

We delineated watershed boundaries for each sampling
station on United States Geological Survey (USGS)
topographic maps and digitized them using ARC/INFO
(Environmental Systems Research Institute [ESRI],
Redlands, California). We identified stream order for
each stream segment and coded it as an attribute of the
stream reach file. All databases were transformed into a
common digital format as necessary and projected into
a common coordinate system. We stored data in vector
format and analyzed them in ARC/INFO.

Table 2 lists the landscape variables we derived for each
watershed. Land use/cover values were reported and
analyzed as a percentage of the total watershed area.
Patch heterogeneity measured landscape fragmenta-
tion and was reported as the number of patches per
hectare. We derived slope from elevation data using
ARC/INFO. The standard deviation of elevation was
used as a surrogate measure of topographic variability.

Statistical Analysis

Using redundancy analysis (RDA), a canonical exten-
sion of principal component analysis (PCA), we de-
tected relationships among the individual multivariate
data sets. RDA is a form of direct gradient analysis that
describes variation in a multivariate data set (e.g., habi-
tat variables or macroinvertebrate metrics) based upon
environmental data (21). In RDA, the station scores from
a PCA are regressed on a specified set of environmental
variables with each iteration, and the fitted values of the
regression become new station scores (22). Thus, envi-
ronmental or predictor variables constrain PCA. Two
important outputs from this method are the interset cor-
relations of environmental variables with the RDA axes,
which indicate the environmental variables that have the
strongest influence in the ordination, and the fraction of
Scale
Format Received
Hydrology
Elevation
Land use/cover
Land use/cover
Watershed boundary
Station locality
Soils
Major basin
Quartenary geology
U.S. EPA stream reach
USGS-DEM
USGS
MIDNR
USGS topographic maps
USGS topographic map
USDA SCS STATSGO
USGS topographic
University of Michigan
1:100,000
1 :250,000
1:100,000
1 :24,000
1 :24,000
1 :24,000
1 :250,000
1 :24,000
1 :250,000
ARC/INFO
Digital elevation model
Digital line graph
Intergraph
Manual delineation
Manual digitizing
ARC/INFO
Manual digitizing
Manual digitizing
Key: USGS = United States Geological Survey
DEM = Digital elevation model
MIDNR = Michigan Department of Natural Resources
USDA SCS = United States Department of Agriculture Soil Conservation Service
133
-------
Table 2. Landscape Variables Measured or Derived for Each
Watershed
Results
Variable
Units
Land use/cover

Slope

Standard deviation elevation

Patch heterogeneity

Soils

Total area
Proportional area

Degrees

Meters

Patches/hectare

Proportional area

Hectares
total variance of each predicted variable that is ex-
plained by the RDA axes (22).

Performing Monte Carlo permutation tests determined
the statistical validity of the association between predic-
tor and predicted variables. Tests were conducted by
random permutation of the site numbers in the predictor
variables. We randomly linked the predictor data to the
predicted data. We conducted 99 simulations to approxi-
mate a normal distribution with which to compare our
data with random combinations.

We first determined which of the reach variables had
strong influences on macroinvertebrate distributions by
conducting separate RDAs with physical habitat and
chemistry variables as environmental descriptors. We
then examined the ability of the landscape data to pre-
dict the variation in the important reach variables.

To determine the relative influences of LU and GEOS
landscape variables on stream chemistry and physical
habitat, we used partial RDA, where one landscape
variable type was held constant and variation due to the
other landscape set was examined independently. Us-
ing this approach, total variation in a multivariate data
set can be decomposed in a manner analogous to
analysis of variance (23, 24). For this analysis, we at-
tributed variation in the reach variables (habitat and
chemistry) to four separate components:

• The variation in reach variables that LU variables
explained independently of GEOS variables.

• The variation in reach variables that GEOS variables
explained independently of LU variables.

• The variation in reach variables that both GEOS and
LU variables shared. This shared variation could
have been due to both the dependence of one type
of variable on the other as well as noncausal rela-
tionships (e.g., the types of soil found in a watershed
determine in large part the types of agriculture that
can be practiced).

• The variation in reach variables that were unexplain-
able. This may have been attributable to sampling
error, stochastic variation, or other variables not
sampled.
Regional Characteristics
Land Use

Land use within the study region was dominated by
row-crop agriculture (see Table 3). Individual water-
sheds ranged from 14 to 99 percent in agricultural land
uses, with the East basin watersheds exhibiting the
greatest proportion of agricultural land use and the Flint
having the lowest proportion of agricultural land use.
The Chippewa/Pine and Kawkawlin watersheds exhib-
ited the greatest diversity of land use and cover types
within the study region.

Wetlands represented a minor land use component with
most watersheds having between 0 and 15 percent land
area. The Cass and Kawkawlin basins had the greatest
proportion of wetlands, with a median of 6.8 percent for
individual watersheds.

Macroinvertebrates

Considerable variation existed among the major basins
with respect to the 15 macroinvertebrate community
metrics during summer (see Table 4). Metric values for
the Flint, Shiawassee, and Chippewa watersheds were
similar. Sites within the Kawkawlin and East basins
differed considerably from the Flint, Shiawassee, and
East basins in several of the metrics. The Kawkawlin
watershed was notable for high shredder and filterer
proportions and a low proportion of detritivores. The
East basin also had a high proportion of shredders. Both
the East and Kawkawlin basins had lower proportions of
strictly erosional taxa and higher proportions of deposi-
tional taxa than the other major basins.

Taxa at the East and Kawkawlin basins also exhibited
lower oxygen tolerance than at other major basins. In
addition, their Hilsenhoff Biotic Index (HBI) scores
(which are sensitive to oxygen availability) were higher
than other basins, and they had the lowest EPT (Ephe-
meroptera, Plecoptera, Trichoptera) richness. Total rich-
ness at Kawkawlin was relatively high, however.
Richness was highest in the Chippewa/Pine watershed
and lowest in the East basin.

In general, fall patterns of macroinvertebrate metrics
resembled those of summer. The Kawkawlin and East
basins had high HBI scores, low EPT scores, low pro-
portions of erosional taxa, and high proportions of depo-
sitional taxa. The proportion of predators was
exceptionally high in the Kawkawlin basin due to the
abundance and trophic classification of one chironomid
genus.
134
-------
Table 3. Summary of Landscape Metrics in Six Major Basins of the Saginaw River Drainage

Landscape Variables East Basin Cass Flint Shiawassee Chippewa/Pine Kawkawlin
n
Row crops
Other agricultural land
Urban
Deciduous forest
Mixed hardwood
Range: Herb
Range: Shrub
Forested wetlands
Non-forested wetlands
Slope (degrees)
Elevation (meters)
Patch heterogeneity
Watershed area
(hectares)
8
86.4
79.7-98.0
0.2
0.0-1.2
0.8
0.1-2.0
6.4
0.5-12.1
0.03
0.0-0.3
1.0
0.1-3.6
1.3
0.0-2.3
0.1
0.0-0.5
1.7
0.0-5.3
0.15
0.07
206.5
19.8
241.5
102.0
14,968.9
11,132.0
7
58.4
45.1-73.3
0.6
0.4-1.4
1.9
0.4-3.7
16.9
5.8-33.4
0.3
0.01-1.7
6.7
2.1-12.7
4.7
3.2-7.7
0.1
0.0-0.3
6.7
1.9-14.5
0.29
0.12
239.8
6.7
711.4
250.6
38,117.7
58,321.9
8
38.0
25.6-65.5
3.1
0.6-4.1
8.56
1 .4-23.2
17.5
10.2-20.5
2.6
0.0-3.8
13.0
6.6-14.5
6.5
2.6-10.5
2.3
0.3-3.2
3.9
0.4-5.2
0.41
0.11
277.2
22.5
950.4
228.6
28,926.1
19,236.7
5
43.5
26.7-71 .8
0.4
0.1-2.0
10.3
2.4-15.0
19.2
16.4-30.4
0.7
0.4-0.9
12.6
0.9-17.3
7.3
2.2-10.1
0.7
0.0-3.7
3.6
0.7-5.0
0.27
0.10
252.6
47.1
762.7
223.6
46,530.0
50,798.0
15
48.2
13.9-91.3
4.0
0.3-6.6
2.1
0.8-4.0
23.2
3.2-42.1
5.4
0.1-8.5
6.9
1 .0-9.0
5.6
0.7-9.0
1.0
0.0-2.5
4.3
0.1-7.9
0.35
0.10
278.7
34.9
703.6
185.9
53,704.2
44,872.8
3
26.4
18.5-74.9
1.9
0.2-2.3
1.2
1.0-6.8
56.8
13.2-64.5
0.3
0.1-0.3
2.5
0.7-3.1
3.0
2.7-3.2
1.8
0.1-2.2
5.0
1.3-5.9
0.14
.03
203.1
9.8
519.6
156.7
22,240.2
3,848.0
Land use/cover variables
(slope through watershed
(agricultural land through nonforested wetlands) are reported as median and
area) are reported as mean and standard deviation. Land use/cover represents
range; landscape structure variables
proportional areas of each watershed.
Identification of Important
Reach-Scale Variables

Chemistry

RDA showed that chemical variables explained 26 per-
cent of the variation in macroinvertebrate data in sum-
mer and 33 percent in fall. The most important variables
in summer were TN and TSS (see Table 5). Fall
macroinvertebrate communities were influenced by a
greater number of variables, including NHs, TP, ALK,
and TSS.

Physical Habitat

The 13 physical habitat variables explained 37 percent
of the macroinvertebrate data in summer and 46 percent
of the macroinvertebrate data in fall. In summer, the
percentage of deep pools and canopy extent along with
channel dimensions, such as bank full width (BFW) and
bank full depth (BFD), were the most important vari-
ables. In fall, the percentage of fines and deep pools as
well as canopy extent were among the most important
variables (see Table 6).
Landscape Influences on Surface
Water Chemistry

In summer, the landscape data explained 55 percent of
the variation in chemical variables. The proportion attrib-
utable to LU was larger than that attributable to GEOS
(see Figure 2). The two data types shared 12 percent of
100
Physical
Habitat
' Land Use

• Geology/
Structure
Summer Fall
Chemistry Chemistry
\T] Shared

| | Unexplained
Figure 2. Results of variance decomposition from partial RDA.
135
-------
Table 4. Mean and Standard Deviation of Macroinvertebrate Metrics Calculated for Summer Collection Periods for Six Major
Basins of the Saginaw River Drainage

n
Chironomidae

Omnivores

Detritivores

Shredders

Gatherers

Filterers

Grazers

Predators

2 Dominants

Total abundance

HBI

Erosional taxa

Depositional
taxa
Species
richness
EPT taxa
richness
East Basin
8
59.1
35.3
19.4
13.7
57.1
34.1
30.3
33.9
59.8
32.9
27.4
38.1
32.2
34.4
1.5
2.2
64.5
25.5
2077
4951
7.1
1.4
25.9
12.4
35.5
13.2
17.2
4.5
5.0
2.7
Cass
7
57.9
20.3
19.1
7.1
69.7
9.7
18.7
5.1
18.7
5.1
23.4
18.5
13.6
16.2
1.2
1.2
54.3
6.1
650
739
5.6
?
36.1
5.5
23.7
9.6
18.3
9.6
5.7
2.8
Flint
8
45.9
27.9
18.1
13.9
75.3
16.0
10.6
6.0
64.3
12.8
22.4
14.1
26.2
21.0
1.5
2.0
50.3
15.2
574
622
5.6
0.8
35.5
9.5
27.5
11.5
22.1
8.2
7.3
3.0
Shiawassee
5
32.2
28.5
14.4
3.8
79.9
6.4
7.7
6.8
65.0
14.1
18.9
12.8
40.1
22.7
1.0
1.0
54.2
9.5
325
91
6.0
0.5
38.9
14.7
27.0
6.6
20.6
4.7
8.0
2.3
Chippewa/Pine
15
45.5
29.6
21.5
9.8
70.4
9.8
14.4
15.6
65.8
17.2
17.6
15.1
25.5
21.1
1.4
0.8
51.9
11.6
497
230
5.1
1.1
36.1
11.0
25.4
10.7
26.6
3.0
10.0
3.7
Kawkawlin
3
67.1
18.6
22.0
16.7
29.0
26.1
51.0
26.9
39.8
30.1
39.9
35.7
25.4
22.9
1.9
0.8
60.0
21.7
433
297
8.1
0.8
14.9
5.3
52.4
6.5
23.3
4.9
3.3
0.5
the variation. The relationship between LU variables and
chemistry was significant (p < 0.05), and the relationship
between GEOS variables and chemistry was not
significant (p > 0.05).

In fall, variation explained by LU was proportionally less
than during summer (see Figure 2). GEOS landscape
variables explained 25 percent of the total variation
while LU variables accounted for less than 10 percent
of the total variation. LU and GEOS variables shared
approximately 8 percent of the variation. In contrast with
summer, GEOS variables were significant and LU vari-
ables were not significant when examined with the
Monte Carlo test (p < 0.05).

The importance of GEOS and LU variables in explaining
variation in the chemistry variables, as well as the total
amount of variation explained, differed considerably
among the chemistry variables. Figure 3 shows only
summer data. For example, the landscape variables
explained almost 80 percent of the total variation in TN
(see Figure 3). The largest proportion of variance was
explained by the shared influences of GEOS and LU.
Alkalinity, which was also well predicted, however, was
much more influenced by variation attributable to LU
variables. In comparison, LU variables explained less
than 45 percent of the total variance in TP, and the
majority of this variance was attributable to GEOS.

To further examine the influence of specific landscape
variables, we compared the various axes in the signifi-
cant partial ordinations (see Figure 4). In summer, when
we observed a significant effect of LU variables on water
chemistry, forested land covers and nonrow-crop agri-
culture had their greatest influence on TSS and ALK. LU
heterogeneity and shrub vegetative cover most strongly
influenced both TN and NHs. In fall, when GEOS vari-
ables had a significant relationship to water chemistry,
ALK and NHs were more influenced by peat land soils
and watershed size. The proportion of sand and gravel
soils, as well as clays, explained much of the variation
in TN and TSS.
136
-------
Table 5. Chemistry Variables That Had a Correlation (r) of at Least 0.30 With One of the RDA Axes With the Summer or Fall
Ordinations; a Monte Carlo Analysis Indicated That Both Summer and Fall Ordinations Were Significant (p < 0.05)
RDA 1
RDA 2
RDA 3
Variable
Summer
Fall
Summer
Fall
Summer
Fall
TN
NH3
TP
P04
TSS
ALK
0.44
0.05
0.12
-0.04
0.01
-0.08
0.03
0.04
0.40
0.19
-0.31
-0.36
-0.04
0.09
0.07
0.01
0.43
-0.16
-0.07
0.52
0.26
-0.08
0.36
0.09
0.05
0.07
0.25
0.26
0.22
0.15
-0.14
-0.09
0.14
0.34
-0.13
0.2
Table 6. Physical Habitat Variables That Had Correlations (r) Over 0.3 With the Ordination Axes; Results of the Monte Carlo
Simulation Indicated That the Fall but Not the Summer Ordinations Were Significant (p < 0.05)
RDA1
RDA 2
RDA 3
Variable
Percentage of
fines
Percentage of
shallows
Wood
Percentage of
deep pools
Erosion
Maximum
depth
Canopy extent
BFW
BFD
Flood ratio
Summer
0.27
-0.06
0.09
0.50
0.16
0.08

0.75
-0.10
-0.09
0.03
Fall
0.47
0.17
0.34
0.54
0.30
-0.14

0.21
-0.23
-0.13
-0.30
Summer
0.04
-0.28
-0.1
0.16
-0.3
0.08

-0.47
0.52
0.32
0.07
Fall
0.28
-0.11
0.02
0.09
-0.31
0.16

-0.42
0.36
0.02
0.23
Summer
0.18
0.32
0.10
-0.17
0.01
-0.39

0.21
0.07
-0.08
-0.41
Fall
-0.09
-0.38
-0.17
0.26
0.27
0.13

-0.25
0.03
0.22
0.02
Landscape Influences on Physical Habitat

GEOS landscape variables attributed for the largest
portion (22 percent) of the explained variation in physical
habitat variables (see Figure 2). LU variables accounted
for 16 percent of the explained variance. The partial
ordination for GEOS but not LU was significant as the
Monte Carlo procedure determined.

As noted with chemistry variables, there were consider-
able differences in the ability of the landscape variables
to predict individual habitat characteristics. Landscape
variables were best at predicting BFW and least power-
ful for predicting the percentage of deep pools (see
Figure 5). BFWwas influenced predominantly by GEOS
and only minimally by LU. The most influential GEOS
variables for BFW related to watershed area (see Figure 6).
Woody debris was predominantly influenced by LU vari-
ables. The most influential LU variables for woody debris
related to forested wetlands. Flood ratio was intermedi-
ate to these examples. Both sets of landscape variables
shared the largest proportion of explained variance for
this parameter.

Discussion

Our studies demonstrate the distinct influences land-
scape features have on stream macroinvertebrate com-
munities through modifying surface water chemistry and
stream habitat. Land use most strongly influences
stream chemistry during summer months when surface
runoff and soil leaching are greatest. In addition, fertil-
izer application in row-crop agriculture is highest in the
first part of the growing season. The strong relationship
between some aspects of land use and stream water
chemistry were similar to those observed in other stud-
ies (4, 6, 7). The specific mechanism by which stream
chemistry influences macroinvertebrates is not clear.
The addition of nutrients can significantly affect stream
productivity (25-27); however, light often limits primary
production in agricultural areas (28-30). Nutrients may
137
-------
Summer Chemistry
Fall Chemistry
120

100
I 80
60
1 20
o

-------
100
60
.i 40
20

-------
Land Use
Geology/Structure
50,
45
-a 40
\ \

B RDA 1 Watershed Area, SD Elev.
i — i RDA 2 Outwash Sand and Gravel

81 RDA 3 Lacustrine Clays

-
\\ Jf « K . n J

J
Fine Shal Wood Deep Erod MaxD Cnpy BFW BFD Flrt
Figure 6. Factors influencing physical habitat parameters.
10. Jones, R.C., and C.C. Clark. 1987. Impact of watershed urbani-
zation on stream insect communities. Water Resources Bull.
23:1,047-1,055.

11. Bisson, P.A., R.E. Bilby, M.D. Bryant, C.A. Dolloff, G.B. Grette,
R.A. House, M.L. Murphy, K.V. Koski, and J.R. Sedell. 1987.
Large woody debris in forested streams in the Pacific Northwest:
Past, present, and future. In: Salo, E.O., and T.W. Cundy, eds.
Streamside management: Forestry and fishery interactions. Con-
tribution 57. Institute of Forest Resources, University of Washing-
ton, Seattle, WA. pp. 143-190.

12. Everest, F.H., R.L. Beschta, J.C. Scrivener, K.V. Koski, J.R.
Sedell, and C.J. Cedeholm. 1987. Fine sediment and salmonid
production: A paradox. In: Salo, E.O., and T.W. Cundy, eds.
Streamside management: Forestry and fishery interactions. Con-
tribution No. 57. Institute of Forest Resources, University of
Washington, Seattle, WA. pp. 98-142.

13. Richards, C., and G.E. Host. 1994. Using CIS to examine the
influence of land use on stream habitats and biota. Water Re-
sources Bull. 30:729-738.

14. Rosenberg, D.M., and V.R. Resh. 1993. Freshwater biomoni-
toring and benthic macroinvertebrates. New York, NY: Chapman
and Hall.

15. Richards, C., and G.W. Minshall. 1992. Spatial and temporal
trends in stream macroinvertebrate species assemblages: The
influence of watershed disturbance. Hydrobiologia 241:173-184.

16. Ohio EPA. 1987. Biological criteria for the protection of aquatic
life, Vol. II. User's manual for biological assessment of Ohio sur-
face waters. Columbus, OH.

17. Barbour, M.T., J.L. Plafkin, B.P. Bradley, C.G. Graves, and R.W
Wisseman. 1992. Evaluation of EPA's rapid bioassessment ben-
thic metrics: Metric redundancy and variability among reference
stream sites. Environ. Toxicol. Chem. 111:437-449.
18. Karr, J.R., and B.L. Kearns. 1992. Components of biological in-
tegrity: Their definition and use in development of an invertebrate
IBI. In: Simon, T.P., and WS. Davis, eds. Proceedings of the 1991
Midwest Pollution Control Biologists Meeting: Environmental in-
dicators: Measurement and assessment endpoints. EPA/905/R-
92/003. U.S. EPA Region 5, Environmental Sciences Division,
Chicago, IL. pp. 1-16.

19. APHA. 1989. Standard methods for the examination of water and
waste water. American Public Health Association.

20. Anderson, J.R., E.E. Hardy, J.T Roach, and R.E. Witmer. 1976.
A land use and land cover classification system for use with
remote sensor data. Geological Survey Professional Paper 964.
U.S. Government Printing Office, Washington, DC.

21. ter Braak, C.J., and I.e. Prentice. 1988. A theory of gradient
analysis. Advances in Ecol. Res. 8:271-317.

22. Jongmann, R.H., C.J. ter Braak, and O.F. van Tongeren. 1987.
Data analysis in community and landscape ecology. Wageningen,
The Netherlands: Pudoc.

23. Borchard, D., PL. Legendre, and P. Drapeau. 1992. Partialling
out the spatial component of ecological variation. Ecology
73:1,045-1,055.

24. ter Braak, C.J. 1990. Update notes: CANOCO Version 3.0. Agri-
cultural Mathematics Group, Wageningen, The Netherlands.

25. Petersen B.J., J.E. Hobble, T.J. Corliss, and K. Kriet. 1983. A
continuous-flow periphyton bioassay: Tests of nutrient limitation
in a tundra stream. Limnology and Oceanography 28:583-591.

26. Grimm, N.B., and S.G. Fisher. 1989. Stability of periphyton and
macroinvertebrates to disturbance by flash flooding. Ecol.
Monogr. 52:93-110.

27. Pringle, C.M. 1987. Effects of water and substratum nutrient sup-
plies on lotic periphyton growth: An integrated bioassay. Can. J.
Fish. Aquat. 44:619-629.

28. Munn, M.D., L.L. Osborne, and M.J. Wiley. 1989. Factors influ-
encing periphyton growth in agricultural streams of central Illinois.
Hydrobiologia 174:89-97.
140
-------
29. Bushong, S.J., and R.W. Bachmann. 1989. In situ enrichment
experiments with periphyton in agricultural streams. Hydrobiologia
178:1-10.

30. Wiley, M.J., L.L. Osborne, and R.W. Larimore. 1990. Longitudinal
structure of an agricultural prairie river system and its relation-
ship to current stream ecosystem theory. Can. J. Fish. Aquat.
47:373-384.

31. Vannote, R.L., G.W Minshall, K.W. Cummins, J.R. Sedell, and
C.E. Gushing. 1980. The river continuum concept. Can. J. Fish.
Aquat. 37:130-137.
32. Statzner, B., J.A. Gore, and V.H. Resh. 1988. Hydraulic stream
ecology: Observed patterns and potential applications. J. North
Am. Benthological Soc. 7:307-360.

33. Osborne, L.L., and D.A. Kovacic. 1993. Riparian vegetated buffer
strips in water-quality restoration and stream management.
Freshwater Biol. In press.

34. Sedell, J.R., G.H. Reeves, F.R. Hauer, J.A. Standford, and C.P.
Hawkins. 1990. Role of refugia in recovery from disturbances:
Modern fragmented and disconnected river systems. Environ.
Mgmt. 14:711-724.
141
-------
Nonpoint Source Water Quality Impacts in an Urbanizing Watershed
Peter Coffin, Andrea Dorlester, and Julius Fabos
University of Massachusetts at Amherst, Amherst, Massachusetts
Abstract

As part of the larger Narragansett Bay Estuary Project,
the University of Massachusetts Cooperative Extension
Service contracted with the university's METLAND re-
search team to develop a geographic information sys-
tem (CIS) database, generate watershed-wide maps,
perform analyses, and develop a modeling procedure.
The objective was to educate local officials about the
impacts of development on water quality and to help
local boards minimize the effect of nonpoint sources of
pollution.

Because the receiving waters of the Narragansett Bay
are located far downstream in Rhode Island, the up-
stream communities in Massachusetts are reluctant to
enact measures to improve water resources outside of
their jurisdictions. A CIS was used to create awareness
of existing downstream problems and to show the up-
stream communities how development will ultimately
affect water resources in their own backyards.

To nurture this awareness, a "buildout" analysis was
conducted for an entire upstream subwatershed, the
Mumford River watershed, containing parts of four
towns, and roughly 50 square miles. This buildout was
coupled with a loading model using Schueler's Simple
Method to illustrate the potential impacts of future devel-
opment, and encourage local boards to minimize future
nonpoint sources of pollution.

CIS proved its usefulness by developing customized
maps for each town, by generating several "what if
scenarios showing the impacts of different zoning
changes, by facilitating long-range planning for small
towns without professional staff, and by encouraging a
regional perspective on development issues. The entire
planning process was most successful in creating a
series of partnerships that will continue after the grant
expires. The university shared coverages with the state
CIS agency, creating new coverages not previously
available, specifically soils, ownership, and zoning.
Small towns learned about the potential of the new
technology. Students gained from hands-on experience
with real-world problems. State agencies saw their ef-
forts understood at the local level, especially as they
reorganize on a basin approach and begin to implement
a total mass daily loading (TMDL) procedure to coordi-
nate permitted discharges and withdrawals.

As greater emphasis is placed on controlling nonpoint
sources of pollution, more attention needs to be focused
on local boards, who control land use decisions in New
England.

Introduction

Project Description

Narragansett Bay is a vital resource for southern New
England. The health of its waters is critical to the re-
gional economy, supporting fisheries, tourism, and qual-
ity of life. Increased development along the bay's
shorelines and throughout its drainage basin threatens
the quality of these waters, however. The U.S. Environ-
mental Protection Agency (EPA) recognized the threats
to this important water body and designated the Narra-
gansett Bay under its National Estuary Program in 1985.

Completing a Comprehensive Conservation and Man-
agement Plan (CCMP) for Narragansett Bay took 7
years. The CCMP identified seven priority areas for
source reduction or control, including the reduction of
agricultural and other nonpoint sources of pollution. The
nonpoint source strategy identified United States De-
partment of Agriculture (USDA) agencies, conservation
districts, and other public and private organizations as
having principal roles in nonpoint source management.

Whereas the vast majority of Narragansett Bay lies
within the boundaries of Rhode Island, a significant
portion of its pollution load originates in Massachusetts.
Recognizing that the watershed extends beyond state
boundaries, the USDA provided 3 years of funding to
Cooperative Extension and the Soil Conservation Serv-
ice (SCS) in both Massachusetts and Rhode Island to
142
-------
coordinate their efforts in an innovative attempt to re-
duce the impact of nonpoint sources of pollution on
Narragansett Bay. While water quality is a relatively new
focus for Cooperative Extension, it fits well with the
historic mission of extending the knowledge base of the
land-grant colleges out into the community, and provid-
ing training and capacity building for local officials and
community organizations.

With such a large area of concern, the management
team decided to focus on a smaller subwatershed area
in each state for the first 2 years. The strategy was first
to develop a program for the mitigation of nonpoint
source pollution on the smaller scale of a watershed of
roughly 50 square miles, then take the lessons learned
and apply the most appropriate efforts throughout the
larger watershed. By using similar strategies in Rhode
Island and Massachusetts, but choosing subwatersheds
that differ in terms of location relative to the receiving
water, size, staffing, and sophistication, the two states
gained from each other's experience, sharing the suc-
cessful techniques and avoiding each other's mistakes.

For its pilot study, Rhode Island chose Aquidneck Island,
home of Newport, Portsmouth, and Middletown, with a
special focus on protecting surface water supply reser-
voirs. Massachusetts chose an upstream watershed in
the Blackstone Valley, somewhat rural in character, but
rapidly undergoing a transformation to suburbia.

Watershed Description

The Blackstone River drops 451 feet in its 48-mile jour-
ney from Worcester, Massachusetts, to Pawtucket,
Rhode Island. In the 19th century, this drop of roughly
10 feet per mile was ideally suited to providing power to
mills during the early years of the industrial revolution.
By the Civil War, every available mill site was developed,
earning the Blackstone River the name "The Hardest
Working River."

The Blackstone has a long history of pollution. First, the
textile industry, then steel, wire, and metal finishing in-
dustries used the river for power, in their manufacturing
process, and for waste disposal.

In Massachusetts, the Blackstone River is the major
source of many pollutants to Narragansett Bay. Based
on total precipitation event loading calculations, the
Blackstone River is the principal source of solids, cad-
mium, copper, lead, nitrate, orthophosphate, and PCBs
to the bay (1). The Blackstone River has an average flow
of 577 million gallons per day or 23.2 percent of the
freshwater input to the bay.

The watershed area in Massachusetts equals 335
square miles; with a population of 255,682, this results
in a density of 763 people per square mile. The Black-
stone Valley has 9,000 acres in agricultural use, with
more land in hay (4,500 acres) than crops (3,700 acres)
to support its 4,400 animals.

Based on aerial mapping flown in 1987, the Blackstone
Valley has lost 5 percent of its cropland, 9 percent of its
pasture, and 21 percent of its orchards since 1971. The
valley remains more than 60 percent forested, but that
represents a decrease of 5 percent. The forest and
farmlands were lost to development as low density
housing grew by 45 percent, commercial use grew by
15 percent, and transportation grew by 54 percent.
Waste disposal grew 52 percent to 582 acres, and min-
ing, which in this region represents gravel pits, grew 22
percent to 1,100 acres.

Watershed soils consist mainly of compact glacial till on
rolling topography, with 3 to 15 percent slopes. The river
and stream valleys are underlain by glacial-derived sand
and gravel outwash, which provide drinking water to all
towns in the area except Worcester and support the
large gravel pits. The high clay content in the till soils of
the uplands makes for a high water table, which is
beneficial for growing corn but causes problems for
septic systems.

Following a preliminary study of the subwatersheds, the
Mumford River in the Blackstone Valley was selected as
the focus watershed based on its size, location, land
use, and existing water quality (see Figure 1). The Mum-
ford River watershed has an area of 57 square miles,
with a length of 13 miles, and lies within the towns of
Douglas, Northbridge, Button, and Uxbridge. These
towns share the attributes of small, rural communities
undergoing rapid development, with no professional
planning staff (see Figure 2). According to the 1990
Census, Douglas grew 46 percent in 10 years to 5,438;
Uxbridge experienced 24 percent growth to 10,415; Sut-
ton increased 17 percent to 6,824; and Northbridge grew
9 percent to 13,371.

Project Strategy

Because the generation of nonpoint sources of pollution
is so closely tied to land use, and because local boards
composed of citizen volunteers have principal control
over land use in New England, the key focus of this
program is to train local boards to recognize and begin
managing the threat that nonpoint sources of pollution
pose to water quality. Local planning boards, conserva-
tion commissions, and boards of health address land
use issues and can regulate and shape existing and
proposed development. By developing a program to
train local officials, Cooperative Extension can focus its
outreach where it will have the greatest impact in both
the short and long term. Local boards have the strongest
opportunity to comment on how land is to be used as it
undergoes development. Therefore, this project focused
on preventing future deterioration as opposed to fixing
143
-------
Figure 1. Map of Mumford River watershed study area.
existing problems. This is especially appropriate in a
rapidly urbanizing setting.

Both Massachusetts and Rhode Island chose to utilize
CIS technology because of its ability to store, analyze,
transform, and display geographic, or spatial informa-
tion. Its database management and analytical capabili-
ties make it a useful tool for pollution load modeling and
buildout scenario development, while its mapping capa-
bilities make it an excellent tool for sharing information
with local officials. This paper documents a case study
on how CIS technology was used to apply a watershed-
wide pollution loading model and to develop buildout
scenarios for demonstrating to local officials the poten-
tial impacts of future development on water quality.

This project used CIS in four different applications:

• Printing customized, large-scale maps: This most ba-
sic application of a CIS proved the most useful for
local officials. It was a revelation for some officials to
see how their current zoning related to actual land
use. In one town, these maps inspired a change in
zoning to protect the area of a future water supply
reservoir. These maps helped officials see how their
towns fit into the regional picture and how their zoning
and land use affected the adjoining towns, and vice-
versa.

• Performing "buildout" analysis: A "buildout" analysis
demonstrates the consequences of existing zoning.
It assumes that all land that can be developed will be
developed at some future date. In essence, it is a
spreadsheet that divides the land available for develop-
ment in each zone by the required lot size, subtracting
Northbridge
Legend

I I Agriculture/Open Land
HU Developed Land
V/\ Forest
gg Unforested Wetlands
fH Lakes and Ponds
Scale = 1:90,000
Douglas

Figure 2. Land use/land cover map of Mumford River watershed.
144
-------
a certain percentage for the road network and steep
slopes. It is best used to evaluate different develop-
ment scenarios, substituting different zoning require-
ments.

• Applying a watershed-wide pollutant loading model:
CIS provided the input needed to apply the "Simple
Method" for estimating existing and potential pollutant
loads. Future pollution loading was estimated using
a buildout with existing zoning and again assuming
the implementation of cluster zoning. The Simple
Method was compared in one subwatershed with the
Galveston Bay Method, which accounts for the hy-
drologic class of the soils.

• Promoting planning for a greenway: Land use maps
were overlaid with parcel ownership to show the
existing network of preserved open space and to
identify those parcels of land having significant wild-
life habitat and recreational value. In one town, these
maps were used to gain funding for planning a river
walk.

Database Development

The most daunting aspect of using a CIS is the prospect
of spending a great deal of time and money creating a
useful database. Fortunately for Massachusetts, many
of the basic coverages needed for regional planning are
housed in a state agency, MASS CIS, and are available
for a small processing fee. These coverages include
most of what appears on the standard United States
Geological Survey (USGS) map: roads, streams, town
boundaries, as well as watershed boundaries and land
use data generated from the interpretation of aerial pho-
tographs. The university entered into an agreement
whereby we gained access to this data at no charge, in
return for sharing the new coverages that the project
would generate.

New coverages needed for the study included: zoning,
soils, sewer and water lines, and land ownership, or
parcels taken from the assessor's maps. The soils maps
were obtained from the SCS, digitized by hand, then the
scale was converted with a computer program, "rubber-
sheeting," to achieve a uniform scale of 1:25,000. All
other new coverages were transferred onto a USGS
topographical map at a scale of 1:25,000, then digitized
directly into the computer. We obtained elevation data,
but the triangulation process used to convert elevation
data to slopes would require so much time and memory
that, for our purpose, deriving a slope map from the four
classes identified on the soils map was sufficient.

While CIS computer programs are powerful enough to
perform most overlay and analysis functions necessary
in nonpoint source pollution load modeling, database
development and accuracy issues can limit the effec-
tiveness of such modeling. The choice of which model
to use is a function of which data are available for input.
Physics-based distributed models are more precise but
require detailed input parameters, beyond the scope of
this project. The extent of our database limited us to
lumped-parameter empirical models. We chose two
such models, the Simple Method and the Galveston Bay
Method.

GIS Applications

The Simple Method

Schueler (2) developed the Simple Method, one of the
simplest lumped-parameter empirical models. The input
data necessary to compute pollutant loading with the
Simple Method are land use, land area, and mean an-
nual rainfall. Land use determines which event mean
concentration (EMC) values and percentage of impervi-
ousness to use in the computation. The amount of rain-
fall runoff is assumed to be a function of the
imperviousness of various land uses. More densely de-
veloped areas have more impervious surfaces, such as
rooftops and paving, which cause stormwaterto run off
the land rather than be absorbed into the soil. The
Simple Method can generate rough figures for annual
pollutant loading within a watershed and can effectively
show relative increases in pollutant levels as land is
developed.

The formula used in the Simple Method is as follows:
(C) * (A) * (2.72)
(load) = (runoff) * (EMC) * (area)

where:

L = pounds of pollutant load per year
P = rainfall depth (inches) over the desired time
interval (1 year)
Pj = percentage of storms that are large enough to
produce runoff (90 percent)
Rv = fraction of rainfall that is converted into runoff
(Rv = 0.05 + 0.009 (I), where I represents the
percentage of site imperviousness)
C = flow-weighted mean concentration (EMC) of
the targeted pollutant in runoff (milligrams per
liter)
A = area (in acres) of the study region

The Simple Method can be applied using a hand-held
calculator or a computer spreadsheet program. For this
project, the calculations were performed entirely within
the ARC/INFO GIS environment, where the input data
were stored. Results were exported to the Excel spread-
sheet program for presentation purposes.

The application of the Simple Method consists of three
major steps.
145
-------
Step 1: Aggregate Land Uses and Obtain Area Fig-
ures for Land Use Categories Within Each Subbasin

The land use coverage in our database has 21 catego-
ries. For the purpose of applying the Simple Method,
these were aggregated into the following six major cate-
gories, based on development density: undeveloped
forest and other open land, large-lot single-family resi-
dential, medium-density residential, high-density resi-
dential, commercial, and industrial. The aggregated land
use categories were matched with study basins from the
Nationwide Urban Runoff Program (NURP) for the pur-
pose of assigning EMC values.

Step 2: Enter Percentage Imperviousness and Event
Mean Concentrations for Each Land Use Type

The TABLES module of ARC/INFO was used to assign
percentage of imperviousness and EMC values to indi-
vidual land use polygons within the watershed's subbas-
ins. The estimated percentage of imperviousness was
obtained from Schueler's guide to using the Simple
Method (2). EMC values for three pollutants—phospho-
rous, nitrogen, and lead—were taken from selected
NURP study basins and were assigned to the aggre-
gated land uses within the watershed.

Step 3: Input the Simple Method's Mathematical
Loading Formula, Calculate Loading Results for
Each Distinct Land Use Area, and Sum Results by
Watershed Subbasin

Finally, the pollutant load was calculated for each dis-
tinct land use area within the Mumford River watershed
by inputting the loading formula through the TABLES
module of ARC/INFO. The mean annual rainfall figure
was assumed to be that of Worcester, Massachusetts,
or 47.6 inches. After calculating loading figures for phos-
phorous, nitrogen, and lead for each distinct land use
area, these numbers were summed for each watershed
subbasin, using the ARC/INFO frequency table report-
ing capability.

The Galveston Bay Method

As an experiment, we applied the Galveston Bay
Method to one of the subbasins to compare results with
the Simple Method. The slightly more sophisticated
Galveston Bay model considers soil drainage charac-
teristics in addition to land use/imperviousness to deter-
mine rainfall runoff. This method is similar to the Simple
Method, in that amount of rainfall runoff and EMCs for
particular land uses are multiplied by land area to deter-
mine total pollutant load (3). Runoff in this method,
however, is calculated using the USDA SCS's TR 55
runoff curve model. The SCS model calculates runoff as
a function of both land use and soil type. Runoff equals
total rainfall minus interception by vegetation, depres-
sion storage, infiltration before runoff begins, and con-
tinued infiltration after runoff begins (4).
The formula used with the Galveston Bay Method has a
structure similar to that of the Simple Method and is as
follows:

P-0.2[(1000/CN)-10]2
P + 0.8[(100(yCN)-10]
(load) =

where:
(runoff
(EMC)
(area)
L = milligrams of pollutant load per year
P = mean annual rainfall amount
CN = runoff curve number, which is a function of
soil type and land use
EMC = event mean concentration
A = area (in acres) of the study region

The application of the Galveston Bay Method consists
of four major steps.

Step 1: Aggregate Land Uses and Obtain Area Fig-
ures for Land Uses Categories. Aggregate Soils Ac-
cording to Drainage Classes

Land use types were aggregated into the same six major
categories as the Simple Method in orderto match EMC
values and to allow for later comparison of the two
pollution loading methods. Soils were aggregated ac-
cording to drainage classes for use with the USDA SCS
TR 55 runoff formula. The SCS identifies four classes of
soils according to their drainage capacity:

Class A = excessively to well-drained sands or
gravelly sands.
Class B = well to moderately drained, moderately
coarse soils.
Class C = moderately to poorly drained fine soils.
Class D = very poorly drained clays or soils with a
high water table.

Step 2: Overlay Soils Data With Land Use Data and
Clip This New Coverage Within the Subbasin

The ARC/INFO CIS overlay capability was used to over-
lay land use and soils maps for the Mumford River
watershed on top of each other. This created new, dis-
tinct areas of different land use and soils combinations.
Because we were only applying this model in one sub-
basin, the subbasin boundary was used in conjunction
with the ARC/INFO "clip" command to cut out (like a
cookie cutter) that portion of the watershed within the
subbasin.

Step 3: Assign Runoff Curve Numbers and EMC
Values to Each New Land Use/Soils Polygon Within
the Subbasin

EMC values were assigned to each distinct land
use/soils area in the same manner as they were as-
signed to land use areas using the Simple Method.
146
-------
Runoff curve numbers were assigned to each distinct
land use/soils area within the subbasin according to
values established by the USDA SCS.

Step 4: Calculate Loading Results for Each Distinct
Land Use/Soils Area and Sum Results for the Entire
Subbasin

Finally, the pollutant load was calculated for each dis-
tinct land use/soils area within the subbasin by inputting
the loading formula through the TABLES module of
ARC/INFO. After calculating loading figures for phos-
phorous, nitrogen, and lead for each distinct land
use/soils area, these figures were then summed for the
subbasin using the ARC/INFO frequency table reporting
capability. Results of this modeling were converted from
milligrams per acre peryearto pounds per acre peryear
to facilitate later comparisons.

Buildout Scenarios

For planning purposes, CIS is most useful in its ability
to quickly generate alternative scenarios. When these
development scenarios are coupled with a pollutant load
model as described above, alternative scenarios can be
evaluated according to their impact on water quality.
This project generated two different scenarios for each
of the four towns in the watershed: a maximum buildout
with existing zoning and a maximum buildout with clus-
tered development.

Maximum Buildout

A maximum buildout scenario was used to show the
worst case for development according to current zoning
regulations (see Figure 3). The result of this buildout is
expressed both in the number of new residential units to
be built and in the area of land to be converted from
undeveloped to residential and other urban uses.

Step 1: Eliminate From Consideration All Land That
Is Already Developed

Step 2: Eliminate From Consideration All Land That
Is Under Water

Step 3: Eliminate From Consideration All Land That
Is Protected From Development

These protected lands included cemeteries, parks, and
all land permanently restricted from development.

Step 4: Reduce the Remaining Amount of Land by
20 Percent To Account for New Roadways and Ex-
tremely Steep Slopes

The remaining land was considered to have "developa-
ble" status. Wetlands were included in this category
because while a house probably would not be built on a
wetland, wetlands can and often do constitute portions
of the required lot size of large residential lots.

Step 5: Overlay the Land Use Coverage With Zoning
and Minimum Lot Size Information

This created new land use areas as a function of zoning.
All forests and fields were converted to a developed
status.

Step 6: Divide Net Developable Land Area Within
Each Zone by Minimum Lot Size Allowed To Obtain
the Number of New Units

Results from the buildout are expressed in the number
of new units. Results can also be shown spatially by
shading in areas on the map according to future density
Legend

Already Developed

<1/4-Acre Lot Size

1/4-to 1/2-Acre Lot Size

1/2- to 1-Acre Lot Size

1-to2-AcreLotSize

>2-Acre Lot Size

! I Protected/Public Land

Scale = 1:90,000
Figure 3. Maximum buildout scenario within the Mumford River watershed.
147
-------
of development (darker shades for higher density, lighter
shades for lower density).
opment can reduce future levels of water pollution, es-
pecially from nutrients (see Figures 4 and 5).
Clustered Buildout

Another alternative development scenario was gener-
ated assuming the implementation of clustered develop-
ment. All areas zoned for lots larger than 1 acre were
changed to cluster zones, where three-fourths of the
land area remains undeveloped, and the remaining one-
fourth of the land area is developed at a density of
1/2-acre lot size. With clustering, an area zoned for
2-acre house lots still supports the same number of new
units, but three-quarters of the land area remains open
space for passive recreation, protected wildlife habitat,
and as a buffer zone to filter runoff.

Step 1: Select All Land Available for Development
Zoned for 1-Acre Lots or Larger

Step 2: Multiply Selected Land by 0.75 and Add to
the Category of Protected Land

Step 3: Multiply Selected Land by 0.25 and Change
the Minimum Lot Size to One-Half an Acre

Step 4: Divide Step 3 by 20 Percent To Allow for New
Roads and Steep Slopes

Step 5: Divide Step 4 by 21,780 (One-Half an Acre)
To Determine Number of New Housing Units

Results

Lumped-parameter empirical models were chosen for
this project and were applied to watershed subbasins
ranging in size from 1 to 20 square miles and having an
average of 4 square miles. The application of the Simple
Method to existing land use conditions allowed for a
comparison of the Mumford River watershed's subbas-
ins for the purpose of identifying the subbasins that
contribute the highest levels of pollutants per acre per
year. The development of a maximum buildout scenario
identified those areas within the watershed that will
sustain the greatest amount of new growth. The appli-
cation of the Simple Method to this maximum buildout
scenario revealed that pollutant levels in surface water
runoff would increase substantially for all subbasins in
the watershed. This finding supports the theory of a
positive relationship between development and in-
creased pollutant levels from surface water runoff.

The development of a customized buildout scenario for
future development identified those areas that are cur-
rently zoned for large-lot residential "sprawl" and that
can support higher development density under cluster
zoning, while protecting a significant amount of open
space that can support a variety of beneficial uses. The
application of the Simple Method to the customized
buildout scenario revealed that the use of cluster devel-
Results determined by applying the Galveston Bay
Method to one subbasin were compared with those
obtained using the Simple Method. The predicted pollut-
ant loading from current conditions differed significantly
Nitrogen Loading Estimates
Total
Whitin Reservoir
West Sutton
Wallis Pond
Tuckers Pond
Stevens Pond
Rivulet Pond
Mouth
Morse Pond
Manchaug Pond
Lackey Pond
East Douglas Jl
Cross Street
Badluck Pond
50 100 150
Percent Change
200
n 1985-Cluster
Buildout
1985 - Maximum
Buildout
Figure 4. Chart showing difference in simple method results
for nitrogen loading between maximum and custom-
ized buildout scenarios.
Phosphorus Loading Estimates
Total
Whitin ._-
Reservoir
West Sutton
Wallis Pond
Tuckers Pond
Stevens Pond
Rivulet Pond
Mouth
Morse Pond
Manchaug Pond
Lackey Pond
East Douglas
Cross Street
Badluck Pond
I
-10 0
10
20 30 40
Percent Change
50
60
70
1985-Cluster
Buildout
1985 - Maximum
Buildout
Figure 5. Chart showing difference in simple method results
for phosphorus loading between maximum and cus-
tomized buildout scenarios.
148
-------
between the two methods; the Simple Method consis-
tently predicted five times the amounts generated by the
Galveston Bay Method.

When the two methods were applied to both the maxi-
mum and customized buildout scenarios, however, the
percentage growth of predicted pollutant loadings was
remarkably similar for both methods; the Simple Method
consistently predicted loadings 10 to 15 percent greater
than the Galveston Bay Method. This indicates that
while the Galveston Bay Method may provide more
accurate results in predicting actual pollutant loading,
the Simple Method is adequate enough for evaluating
and comparing different development scenarios (see
Figures 6 and 7).

Discussion

As states begin to implement a TMDL approach to regu-
lating water quality, they face the quandary of how to
determine the extent of nonpoint source pollution in the
rivers. The crudest method is to subtract from the total
load those quantities generated by point sources and
call all the rest nonpoint source. While this is appropriate
in some settings, it is unacceptable in a watershed with
a long history of pollution because a significant source
of pollution is the resuspension of historical sediments
stirred up by storms. The situation demands the devel-
opment of a model to predict the loading from nonpoint
sources. Only a computer can handle the multiple fac-
tors that interact to generate nonpoint sources of pollution.

As greater emphasis is placed on watershed planning,
the abilities of a CIS to input, store, manipulate, analyze,
and display geographic information become indispensa-
ble. As the scientific community improves its knowledge
base for determining the critical factors influencing non-
point source pollution, CIS technology is improving in
its ability to store and handle large amounts of data.

While a detailed, physics-based distributed model would
be more accurate than the lumped-parameter models
used for this project, they are difficult to apply at the
watershed scale. The real limiting factor is the provision
of all the data coverages needed to apply complex mod-
els. Lumped-parameter models, such as the Simple
Method and the Galveston Bay Method, are ineffective
for accurately predicting pollutant loads, but they are
suitable for comparing and evaluating alternative devel-
opment scenarios.

Time, and the development community, will not wait until
all the answers are known. Local officials continue to
approve development with no thought to the impacts on
water quality. These officials need to be informed about
the implications of haphazard growth. A CIS, with its
ability to generate customized maps and quickly evalu-
ate alternative development scenarios, is a powerful tool
to help local officials visualize how the decisions they
Nitrogen Loading
Total
Uxbridge
Sutton
Northbridge
50 100 150
Percent Change
200
D Galveston Method Percent
Change 1985 to Cluster
ED Simple Method Percent
Change 1985 to Cluster
Galveston Method Percent
Change 1985 to Maximum Buildout

Simple Method Percent
Change 1985 to Maximum Buildout
Figure 6. Chart showing difference between Simple Method re-
sults and Galveston Bay Method results for nitrogen
loading.
Lead Loading
Total
Uxbridge
Sutton
Northbridge
50 100 150
Percent Change
200
D Galveston Method Percent
Change 1985 to Cluster

E3 Simple Method Percent
Change 1985 to Cluster
Galveston Method Percent
Change 1985 to Maximum Buildout

Simple Method Percent
Change 1985 to Maximum Buildout
Figure 7. Chart showing difference between Simple Method re-
sults and Galveston Bay Method results for lead
loading.

make on paper today will have an impact on the land
tomorrow.
References

1. Metcalf & Eddy, Inc. 1991. Assessment of pollutions in Narragan-
sett Bay. Draft report to the Narragansett Bay Project.

2. Schueler, T. 1987. Controlling urban runoff: A practical manual for
planning and designing urban BMPs. Washington, DC: Metropoli-
tan Washington Council of Governments.
149
-------
3. Newell, C., et al. 1992. Characterization of nonpoint sources and
loadings to Galveston Bay, Vol. I, Technical Report: Galveston Bay
National Estuary Program (March).
4. U.S. Department of Agriculture (USDA) Soil Conservation Service.
1975. Urban hydrology for small watersheds. Technical Release
55. Springfield, VA: U.S. Department of Agriculture.

Additional Reading

Arnold, C.L., et al. 1993. The use of Geographic Information System
images as a tool to educate local officials about the land use/ water
quality connection. Proceedings of Watershed Conference, Alexan-
dria, VA.

Chesebrough, E. 1993. Massachusetts nonpoint source management
plan. Massachusetts Department of Environmental Protection, Office
of Watershed Management (October).

Joubert, L, et al. 1993. Municipal training for water quality protection.
Contribution #2845. College of Resource Development, University of
Rhode Island/Rhode Island Agricultural Experiment Station.
Massachusetts Geographic Information System. 1993. MassGIS
Datalayer descriptions and guide to user services. Boston, MA: Ex-
ecutive Office of Environmental Affairs.

McCann, A., et al. 1994. Training municipal decision-makers in the
use of Geographic Information Systems for water resource protection.
Contribution #2927. College of Resource Development, University of
Rhode Island/Rhode Island Agricultural Experiment Station.

Narragansett Bay Project. 1992. Comprehensive conservation and
management plan: A summary (January).

U.S. EPA. 1983. Results of the Nationwide Urban Runoff Program
(NURP), Vols. I and II. Final report prepared by Water Planning Divi-
sion. NTIS PB84185537. Springfield, VA: National Technical Informa-
tion Service.

U.S. EPA. 1992. Compendium of watershed-scale models for TMDL
development. E PA/841 /R-92/002. Washington, DC (June).
150
-------
A GIS for the Ohio River Basin
Walter M. Grayman
W.M. Grayman Consulting Engineers, Cincinnati, Ohio

Sudhir R. Kshirsagar
Global Quality Corporation, Cincinnati, Ohio

Richard M. Males
RMM Technical Services, Inc., Cincinnati, Ohio

James A. Goodrich
Risk Reduction Engineering Laboratory, Office of Research and Development,
U.S. Environmental Protection Agency, Cincinnati, Ohio

Jason P. Heath
Ohio River Valley Water Sanitation Commission, Cincinnati, Ohio
Abstract
Much of the information used in the management of
water quality in a river basin has a geographic or spatial
component associated with it. As a result, spatially
based computer models and database systems can be
part of an effective water quality management and
evaluation process. The Ohio River Valley Water Sani-
tation Commission (ORSANCO) is an interstate water
pollution control agency serving the Ohio River and its
eight member states. The U.S. Environmental Protec-
tion Agency (EPA) entered into a cooperative agreement
with ORSANCO to develop and apply spatially based
computer models and database systems in the Ohio
River basin.
Three computer-based technologies have been ap-
plied and integrated: geographic information systems
(GIS), water quality/hydraulic modeling, and database
management.
GIS serves as a mechanism for storing, using, and
displaying spatial data. The ARC/INFO GIS, EPAs
agencywide standard, was used in the study, which
assembled databases of land and stream information for
the Ohio River basin. GIS represented streams in hydro-
logic catalog units along the Ohio River mainstem using
EPAs new, detailed RF3-level Reach File System. The
full Ohio River basin was represented using the less
detailed RF1-level reach file. Modeling provides a way
to examine the impacts of human-induced and natural
events within the basin and to explore alternative strate-
gies for mitigating these events.

Hydraulic information from the U.S. Army Corps of En-
gineers' FLOWSED model enabled EPAs WASP4 water
quality model to be embedded in a menu-driven spill
management system to facilitate modeling of the Ohio
River mainstem under emergency spill conditions. A
steady-state water quality modeling component was
also developed under the ARC/INFO GIS to trace the
movement and degradation of pollutants through any
reaches in the RF1 representation of the full Ohio River
basin.

Database management technology relates to the stor-
age, analysis, and display of data. A detailed database
of information on dischargers to the Ohio River mainstem
was assembled under the PARADOX database manage-
ment system using EPAs permit compliance system as
the primary data source. Though these three technolo-
gies have been widely used in the field of water quality
management, integration of these tools into a holistic
mechanism provided the primary challenge of this study.

EPAs Risk Reduction Engineering Laboratory in Cincin-
nati, Ohio, developed this project summary to announce
key findings of the research project, which is fully docu-
mented in a separate report of the same title.
151
-------
Introduction

During the past 25 years, computers have been actively
used in water quality management, demonstrating their
potential to assist in a wide range of analysis and display
tasks. Technologies such as geographic information
systems (CIS), database management systems (DBMS),
and mathematical modeling have been applied in the
water quality management field and have proven to be
effective tools. For computers to achieve their full poten-
tial, however, they must become integrated into the
normal programmatic efforts of agencies and organiza-
tions in the planning, regulation, and operational areas
of water quality management.
Recognizing this need for routine use of computer-
based tools, the Ohio River Valley Water Sanitation
Commission (ORSANCO) and the Risk Reduction En-
gineering Laboratory (RREL) of the U.S. Environmental
Protection Agency (EPA) commenced a study in 1990.
The goals of the study included the adaptation, devel-
opment, and application of modeling and spatial data-
base management (DBM) tools that could assist
ORSANCO in its prescribed water quality management
objectives. These goals were consistent with EPAs on-
going programs involving the use of CIS and modeling
technology. The study's goals also coincided with EPAs
Drinking Water Research Division's work over the past
decade, which applied similar technology to study the
vulnerability of water supplies on the Ohio and Missis-
sippi Rivers to upstream discharges.

Methodology Overview

To address the goals of this project, three basic tech-
nologies have been applied and integrated: CIS, water
quality/hydraulic modeling, and DBM. CIS serves as a
mechanism for storing, using, and displaying spatial
data. Modeling provides a way to examine the impacts
of human-induced and natural events within the basin
and to explore alternative strategies for mitigating these
events. DBM technology relates to the storage, analysis,
and display of data. Though these three technologies
have been widely used in the field of water quality
management, integration of these tools into a holistic
mechanism provided the primary challenge of this study.

CIS Technology

The guiding principle in developing the CIS capability
was to maximize the use of existing CIS technology and
spatial databases. The study used ARC/INFO CIS,
EPAs agencywide standard. Remote access of
ARC/INFO on a VAX minicomputer facilitated the initial
work. Subsequently, both PC ARC/INFO and a worksta-
tion-based system were obtained.
EPA has developed an extensive spatial database re-
lated to water quality and demographic parameters. This
served as the primary source of spatial data for the
study. Following is a summary of spatial data used in
this study:

• State and county boundaries.

• City locations and characteristics.

• Water supply locations and characteristics.

• Locations and characteristics of dischargers to
water bodies.

• Toxic loadings to air, water, and land.

• Dam locations and characteristics.

• Stream reaches and characteristics.

The primary organizing concept for the water-related
information was EPAs Reach File System (1). This sys-
tem provides a common mechanism within EPA and
other agencies for identifying surface water segments,
relating water resources data, and traversing the nation's
surface water in hydrologic order within a computer envi-
ronment. A hierarchical hydrologic code uniquely identi-
fies each reach. Information available on each reach
includes topological identification of adjacent reaches,
characteristic information such as length and stream
name, and stream flow and velocity estimates. The origi-
nal reach file (designated as RF1) was developed in the
early 1980s and included approximately 70,000 reaches
nationwide. The most recent version (RF3) includes
over 3,000,000 reaches nationwide.

As part of this project, an RF1-level database was es-
tablished for the entire Ohio River basin. The RF3 reach
file was implemented for the Ohio River mainstem and
lower portions of tributaries. River miles along the Ohio
River were digitized and established as an ARC/INFO
coverage to provide a linkage between the reach file and
river mile indexing used by ORSANCO and other agen-
cies along the river. Figure 1 shows the RF1 reach file
representation of the Ohio River basin along with state
boundaries.

The study incorporated several EPA sources of informa-
tion on dischargers to water bodies. The industrial facil-
ity discharger (IFD) file contains locational and
characteristic data for National Pollutant Discharge
Elimination System (NPDES) permitted discharges. De-
tailed permit limits and monitoring information was ac-
cessed from the permit compliance system (PCS). The
toxic release inventory (TRI) system includes annual
loading of selected chemicals to water, land, air, and
sewer for selected industries based on quantity dis-
charged. All water data are referenced to the NPDES
permit number, which is spatially located by reach and
river mile, and by latitude and longitude.
152
-------
0 meter,
147884
Figure 1. RF1 reaches in the Ohio River basin.

Spill Modeling

An important role that ORSANCO fills on the Ohio River
relates to the monitoring and prediction of the fate of
pollutant spills. Typically, ORSANCO serves as the over-
all communications link between states during such
emergency conditions. ORSANCO coordinates and par-
ticipates in monitoring and serves as the information
center in gathering data and issuing predictions about
the movement of spills in the river. In the past, a series
of time-of-travel nomographs, based on National
Weather Service flow forecasts, Corps of Engineers
flow-velocity relationships, and previous experience,
were used to predict the movement of spills. This project
combined a hydraulic model with a water quality model
to serve as a more robust method for making such
predictions.

The U.S. Army Corps of Engineers' FLOWSED model
was selected as the means of predicting daily flow quan-
tities and water levels along the mainstem and portions
of major tributaries near their confluence with the Ohio
River (2). The Ohio River Division of the Corps of Engi-
neers applies FLOWSED daily as part of its reservoir
operations program. The Corps can generate 5-day
forecasts of stage and flow for 400 mainstem and tribu-
tary segments, and ORSANCO can access the results
via phone lines.

EPAs WASP4 water quality model was selected for use
in the project (3). WASP4 is a dynamic compartment
model that can be used to analyze a variety of water
quality problems in a diverse set of water bodies. Because
the primary use of the model in this project is quick
response under emergency situations, only the toxic
chemical portion of the model with first order decay is
being used. The FLOWSED and WASP4 models have
been combined into a user-friendly spatial decision sup-
port system framework described later in this project
summary.

Discharger Database Management System

EPA's PCS and historical records maintained by
ORSANCO furnish a rich source of data on dis-
charge information for the Ohio River. To organize
these data and make them available for analysis, a
database was developed using the PARADOX DBM
system.

The database was established using a relational struc-
ture with a series of related tables (two-dimensional flat
files). Individual tables contain information on facilities,
outfalls, permit limits, monitoring data, and codes used
in the other tables. The NPDES permit number is used
as the primary key in each data table. A mechanism for
downloading and reformatting data from the national
PCS database has been developed along with custom
forms for viewing and editing data, and custom reports
for preparing hard copy summaries. Latitude and longi-
tude values for each facility can provide the locational
mechanism for use of this data in conjunction with CIS.

Integration of GIS/Modeling/Database
Technologies

A major objective of this study was the integration of
CIS, modeling, and DBMS technologies into a holistic
tool for use by ORSANCO. Several integration mecha-
nisms were implemented as summarized below.

Steady-State Spill Tracing

The NETWORK component of the ARC/INFO CIS pro-
vides a steady-state, transportation-oriented routing ca-
pability. This capability was used in an arc macro
language (AML) program to construct a routing proce-
dure for determining downstream concentrations and
travel times. The pollutant may be treated as a conser-
vative element or represented by a first order expo-
nential decay function. This capability has been
implemented for use with the RF1 reach file repre-
sentation of the full Ohio River basin. The user may
select from six flow regimens: average flow, low flow,
and four multiples of average flow ranging from one-
tenth to 10 times average flow. This system gives
ORSANCO the ability to estimate the arrival time of a
spill from any RF1 tributary to the Ohio River mainstem.
153
-------
Sp/7/ Management System

A PC-based spatial decision support system (SDSS)
was built as a spill management system to be a quick
response tool for analyzing and displaying the results of
pollutant spills into the Ohio River. The schematic in
Figure 2 illustrates the components in this computerized
spill management system. The system is implemented
in the C language using a commercial menuing system
and a series of graphic display routines developed at
EPA. Custom, written routines have been used to read
the output from the U.S. Army Corps of Engineers'
FLOWSED model, to generate input files for EPAs
WASP4 model, to create output reports and output plots,
and to provide an animated representation of the con-
centration profiles moving down the river. Figure 3 pre-
sents an example of a graphic output the system
generated. Additionally, the system generates a file in
5-DAY RIVER FLOW
AND STAGE FORECAST
PREPARED
DAILY BY CORPS OF
ENGINEERS USING
FLOWSED MODEL
hooocH
via phone line
ARC/INFO CIS
DATABASE
SPATIAL DECISION SUPPORT SYSTEM
FOR TOXIC SPILL MODELING
Menu driven user interface linked to
EPA WASP 4 water quality model.
Tabular /graphical screen and hard
copy outputs, spill animation and
CIS DBF format output file
Arc View Spatial data
base display system
GRAPHS
CONCENTRATION
REPORT
by
Segment & Time
REPORTS
Figure 2. Schematic representation of spill modeling system process.
Figure 3. Graphic output from the basinwide network spill model.
154
-------
DBF format that may be read by ARC/VIEW (the com-
panion software to ARC/INFO for user-friendly viewing
of spatial data).

Hardware Platform

Within the study, the initial hardware platform was a
combination of local PCs (in Cincinnati) and a remote
access terminal to a VAX computer located at EPA's
National Computer Center in Research Triangle Park,
North Carolina. The final platform, and the one on which
the completed system was installed, comprised a UNIX-
based Data General workstation and a PC workstation.
The full hardware configuration is shown schematically
in Figure 4.

Conclusions

The application of computer-based display, analysis,
and modeling tools in conjunction with CIS technology
proved to be an effective strategy for water quality man-
agement. This study used an existing CIS package and
DBMS in conjunction with existing water quality and
hydraulic models. The study focused primarily on as-
sembling available spatial and relational databases and
integrating the systems to provide a usable, effective tool.

References
1. Horn, R.C., and W.M. Grayman. 1993. Water-quality modeling with
EPA reach file system. J. Water Res. Planning and Mgmt.
119(2):262-274.
2. Johnson, B.H. 1982. Development of a numerical modeling capa-
bility for the computation of unsteady flow on the Ohio River and
its major tributaries. Vicksburg, MS: U.S. Army Engineer WES.
3. Ambrose, R.B., T.A. Wool, J.P. Connolly, and R.W Schanz. 1988.
WASP4, a hydrodynamic and water quality model: Model theory,
users' manual, and programmers' guide. Athens, GA: EPA Envi-
ronmental Research Laboratory. NTIS PB88185095.
UNIX WORKSTATION
1.44MB
FLOPPY
DRIVE
DUAL
CPU
/

150MB
TAPE
DRIVE

8 MM
DAT

2GB
HARD
DRIVE

1/4" TAPE
DRIVE
650MB
HARD
DRIVE
Figure 4. Hardware configuration.
155
-------
Nonpoint Source Pesticide Pollution of the Pequa Creek Watershed,
Lancaster County, Pennsylvania: An Approach Linking
Probabilistic Transport Modeling and GIS
Robert T. Paulsen
The Paulsen Group, Bowie, Maryland

Allan Moose
Southampton College, Southampton, New York
Abstract

The U.S. Environmental Protection Agency (EPA) has
mandated that each state prepare a state management
plan (SMP) to manage pesticide residues in the state's
environment. One aspect of an SMP involves identifying
specific soils and sites that may be vulnerable to the
transport of pesticides into water resources. A recently
developed system identifies vulnerable areas by cou-
pling probabilistic modeling that uses the Pesticide Root
Zone Model (PRZM) with a desktop geographic informa-
tion system (GIS-MAPINFO). A limited test of this sys-
tem succeeded in identifying and mapping individual soil
series in a watershed that were shown to have trans-
ported atrazine to surface and ground water.

During this project, various digital data sources were
evaluated for availability and ease of use, including:

• STATSGO.

• U.S. Geological Survey (USGS) digital line graphs
(DLGs).

• National Oceanic and Atmospheric Administration
(NOAA) climate data.

This study documents hands-on hints and tricks for
importing and using these data.

From 1977 to 1979, the USGS measured the movement
of atrazine off fields of application into water resources
in the Pequa Creek basin in Lancaster County, Pennsyl-
vania (1). Atrazine in surface water appeared at levels
exceeding 20 parts per billion in storm flow and above
the 3 parts per billion maximum contaminant level (MCL)
during base flow from Big Beaver Creek, a tributary to
Pequa Creek. Each soil series in the subbasin was
digitized into a GIS. PRZM allowed simulation of runoff,
erosion, and leaching of atrazine (applied at 2.24 kilo-
grams per hectare in conventionally tilled corn) for each
soil. This process included simulating each soil under
different slopes for an 11 -year period from 1970 to 1980.
Interpreting the results for each soil series determined
the probability distribution of atrazine in kilograms per
hectare for each mode of transport. GIS used these data
to thematically map each soil series for atrazine loss.

The results of this demonstration project suggest that
the Manor silt loam, with slopes varying from 6 percent
to 20 percent, had a high potential to transport atrazine
residues to surface water. This type of analysis could
suggest that this soil series be:

• Farmed using conservation tillage.

• Managed to install grass waterways or buffer strips
to stop runoff.

• Set aside from production to protect water resources.

Digital databases were available for the study area, but
many technical problems were encountered in using the
data. Researchers embarking on these types of model-
ing and GIS projects should prepare themselves for
significant expenditures of time and finances.

Introduction

A significant volume of published literature documents
pesticide residues in ground water, and the volume of
investigations of residues in surface water is expanding.
The growing acceptance of immunoassay techniques
for the determination of pesticide residues in water has
given the field of pesticide monitoring an accurate and
economical analytic methodology. This will result in an
increase in monitoring capability at the federal, state,
156
-------
local, and university levels. These increases in monitor-
ing capability have documented and will continue to
document the occurrence of pesticides in water re-
sources as the result of past transport through the soil
profile. The U.S. Environmental Protection Agency
(EPA) has mandated that each state prepare a state
management plan (SMP) to manage pesticide residues
in the state's environment. Lacking, however, is a reli-
able pesticide screening technique to indicate which
soils, on a countywide scale, may be sensitive to the
transport of a specific pesticide to deep within the soil
profile or to the surface water resources. These assess-
ments would greatly supplement the usability and valid-
ity of SMPs.

Electronic databases such as State Soils Geographic
(STATSGO), Data Base Analyzer and Parameter Esti-
mator (DBAPE), or the SOILS subsets found in Nitrate
Leaching and Economic Analysis Package (NLEAP)
provide easy access to detailed soil data and model
input estimator subroutines, thereby simplifying data en-
try to numerical models. Two groupings define soils: soil
series and soil associations. Soil series are the individ-
ual soil taxa found in a field. Soil associations represent
groups of soil series, usually three or four soil series
occurring together in an area, and are mapped as a
single unit on a county scale. Mapping of most soil
associations across the United States is complete, with
open access to the county scale maps. A digital soils
mapping data set called SSURGO contains many of the
soil series maps for the United States. Climatologic
databases also provide easy access to long-term data
from the National Oceanic and Atmospheric Administra-
tion (NOAA) weather stations, allowing a user the op-
portunity to input realistic climate data to pesticide
transport models.

Many numerical pesticide transport models, such as the
Pesticide Root Zone Model (PRZM), Ground-Water
Leaching Effects of Agricultural Management Systems
(GLEAMS), and Leaching Estimation and Chemistry
Model (LEACHM), can produce transport estimates for
specific pesticides in specific soils. Each model has its
own strengths and weaknesses, and detailing these
characteristics is beyond the scope of this paper. Sev-
eral authors, however, have described comparisons be-
tween models (e.g., Smith et al. [2], Mueller et al. [3],
and Pennell et al. [4]). These numerical models all gen-
erally require extensive site-specific soil, agronomic,
and climatologic databases. The results from these
models are extremely detailed. Their pesticide transport
estimates, however, are only valid for those locations for
which site-specific data are sufficient to allow calibration
of the model. Applying such site-calibrated model results
to larger scales (county scales) is inappropriate.

In one procedure, a user could identify soils and use
transport models that may have a limited ability to retard
pesticides from reaching water resources. This type of
modeling has recently been called probabilistic model-
ing (5). The concept behind this procedure is to use an
existing transport model, such as PRZM, and vary cer-
tain input parameters (e.g., slope, organic content, pes-
ticide Koc) to produce a probability of a given output
being equaled or exceeded. For example, a PRZM
model could be created for a soil series with an average
organic content of 1 percent and a slope of 8 percent in
the eastern corn belt. The model would use the 30 years
of historical climate data for a nearby station. The model
would vary the organic content and surface slope within
given ranges for the soil series and run 1,000 simula-
tions. The analysis could then entail plotting the results
(i.e., monthly runoff loads, erosion loads, and leaching
through the root zone) in a frequency diagram and gen-
erating probability curves. This analysis would allow the
user to estimate the anticipated pesticide losses, runoff,
erosion, and leaching for any given soil in the county.
The soils with greater probabilities for pesticide loss
could be identified and mapped using CIS.

Recent advances in computing speed and efficiency
have reduced the amount of time and expense needed
to run numerical pesticide transport models. This makes
it possible, in a relatively short amount of time, to quan-
titatively model not just one soil series, but the hundreds
of major soil series that occur in an entire state (e.g.,
357 major soil series combined into 464 different soil
associations in Wisconsin). This type of model can be
very useful to the development of SMPs as well as to a
variety of users, including pesticide registrants, bulk
pesticide handlers, custom appliers, county agricultural
extension agents, and individual growers.

Objective

The objective of this study was to use probabilistic mod-
eling analyses and a geographic information system
(CIS) to determine which soil series in a watershed may
contribute to nonpoint source pollution through runoff of
agricultural chemicals. Specifically, the study aimed to
locate a watershed with historical atrazine runoff, map
the soils, and perform transport modeling using histori-
cal precipitation. The results of this procedure would
determine which soil series had a high potential to con-
tribute to the nonpoint source pollution of the watershed.
Once the transport modeling was completed, a CIS
would help map the distribution of the sensitive soil
series. The mapping would act as a base for implement-
ing best management practices (BMPs) to reduce non-
point source pollution.

Background

Ward (1) described the water quality in the Pequa Creek
basin in Lancaster County, Pennsylvania, for the years
1977 through 1979. Flow from Pequa Creek (154-square-
157
-------
Figure 1. Location of Pequa Creek basin, Lancaster County,
Pennsylvania.

mile drainage area) eventually discharges into the
Chesapeake Bay (see Figure 1). The data collection
efforts (6) documented the occurrence of atrazine
(2-chloro-4-ethylamino-6-isopropylamino-s-triazine),
a commonly used herbicide for weed control in corn-
growing regions, and other agrichemicals in both base-
flow and storm-flow conditions of the Pequa Creek. A
subbasin of Pequa Creek, Big Beaver Creek, had the
greatest reported atrazine concentrations during the
sampling period. The maximum reported atrazine con-
centrations at the Big Beaver Creek sampling station,
near Refton, Pennsylvania, were 0.30 parts per billion
during base-flow conditions and 24.0 parts per billion
during storm-flow conditions. The Big Beaver Creek
basin is 20.4 square miles in area, and agriculture con-
stituted about 66 percent of the land use in 1979. Corn
was grown on 26.6 percent of the agricultural lands in
this subbasin. The average rainfall forthis basin is about
37 inches annually (1).

As noted, agriculture represented the major land use in
the area. The primary agricultural soils in Lancaster
County are silt loams (Typic Hapludults and Hapludalfs)
in texture with slopes that range from 0 percent to 8
percent (7). Upon inspection of the air-photo soil se-
ries maps found in the county soil survey, however,
agricultural crops grew on lands with slopes of up to
and exceeding 15 percent, with soils such as the
Manorsilt loam, Pequa silt loam, and Chestersilt loam
(7).
Natural soil organic contents in the agricultural soils
range from 0.1 percent to 2.0 percent. Water contents
of the agricultural soils range from 10 percent to nearly
30 percent. The soil Erosion Factor (K) for the surface
layer ranges from 0.17 for the relatively stable Lingers
Series to 0.43 for the Pequa Series. The greater the
value, the greater the susceptibility to sheet and rill
erosion. The soil Erosion Factor (T in tons/acre/year) for
the entire soil profile ranges from 2 for the relatively
stable Clarksburg Series to 5 for the Elk Series.

Methods

Determining which soil series in the Big Beaver water-
shed may contribute to nonpoint source pollution
through runoff of agricultural chemicals entailed per-
forming a combination of probabilistic modeling analy-
ses and a CIS data manipulation.

Physiographic and Soil Series Boundaries

The orientation of Pequa Creek, Big Beaver Creek, and
other surface water bodies was digitized directly from
the 1:50,000-scale county topographic map for Lancas-
ter County (8). This map also provided the basis for
digitizing the Pequa Creek drainage divide, location of
urban areas, and roadways. The MAPINFO CIS allows
for the creation of boundary files by tracing the boundary
off the topographic map with a digitizing tablet config-
ured to the latitude and longitude coordinates of three
points on the map. The latitude and longitude are dis-
played while the boundary is being traced, allowing the
user to verify the accuracy of the boundary against
known coordinates on the map. CIS contains a self-
checking boundary closure program to ensure that the
polygons are closed and that the boundary contains no
extraneous line segments. These boundary data are
already available from the USGS in a digitized format,
digital line graphs (DLGs). Because digitizing is an easy
task, however, and to minimize costs, the project used
manual digitizing rather than purchase the data.

The roadways and urban areas were digitized to allow
use of standard control points, such as road intersec-
tions and benchmarks, to configure the U.S. Depart-
ment of Agriculture (USDA) Soil Conservation Service
air-photo-based, 1:15,840-scale soil series maps for the
Big Beaver Creek watershed. Known land grid coordi-
nates were noted on the air-photo maps (7). We con-
cluded, however, that using known reference points,
such as roadways and towns, allowed for a better con-
figuration of the digitizing tablet to the air photos and
158
-------
eliminated concerns over scale distortions sometimes
common in air photos.

After configuring the air photos to the digitizing tablet, a
2-square-mile area around the surface water sampling
points was digitized. The next step entailed digitizing all
the mapped soil series units within this area. The loca-
tions of crop areas, as plowed fields, were noted. In
noting forested areas, it became apparent that only
minor acreages were not in agricultural production.
Those mapped units were generally the Manor and
Pequa Series soils with slopes exceeding 25 percent.

Pesticide Transport Modeling

The PRZM pesticide transport model helped to quantify
the ability of several soil series to retard the transport of
atrazine through the root zone as leachate, dissolved in
surface runoff and adsorbed on sediment that moved
during erosion. The PRZM model performed in an un-
calibrated or screening model mode. The input values
for soil properties came from both the EPA DBAPE
database and the Lancaster County Soil Survey (7). The
modeled soil profile was 150 centimeters thick and di-
vided into 5-centimeter compartments. The soil half-life
of atrazine was set at 57 days in accordance with values
that the PRZM manual listed (9). The primary soil prop-
erty that varied in this demonstration project was surface
slope. All other parameters, such as soil organic content,
moisture content, and bulk density, appeared as mid-
point values for the ranges listed in DBAPE.

The agronomic scenario that the model simulated was
for corn grown continuously for 10 years using conven-
tional tillage practices and planted on May 7 of each
year. Atrazine was surface applied at a rate of 2 pounds
per acre (2.24 kilograms per hectare) on May 1 of each
year. For climatic input, the model used the historical
precipitation regimen from 1970 through 1980, as meas-
ured at the Harrisburg, Pennsylvania, station.

PRZM simulations were made for each of the following
soil series:

• Chester

• Conestoga

• Elk

• Glenelg

• Glenville

• Hollinger

• Letort

• Manor

• Pequa

Monthly values were calculated for leachate, runoff, and
erosion per hectare. Unfortunately, no data for the
Pequa Series were available in the DBAPE database;
therefore, this portion of the analyses omitted it. In
addition, analyses of the Manor Soil Series included
more detailed probabilistic modeling where the surface
slope held constant (6-percent slope) and the surface
soil organic content varied to include the high, average,
and low organic contents as listed in DBAPE. PRZM
also calculated the volume of water as evapotranspira-
tion, runoff, and recharge through the root zone.

Results

The results of this study should demonstrate the appli-
cation of transport modeling to the possible protection
of water resources. Regulatory decision-makers should
not consider these results in their current form because
such decisions would require a much more rigorous
simulation strategy to increase the level of confidence in
the data. As a demonstration study, however, the results
do show the usefulness of this approach. Table 1 con-
tains the cumulative frequency data for the simulated
atrazine residues in runoff, erosion, and leaching that
occurred under 30 years of historical precipitation. The
data cover the 12 soils mentioned, with the surface
slope held constant at 6 percent.

Atrazine in Runoff and Erosion

The results of this analysis suggest that the Hagger-
stown Series had the greatest potential for yielding
atrazine in runoff; approximately 50 percent of the simu-
lated monthly atrazine in runoff values equaled or ex-
ceeded 0.0001 kilogram per hectare. Conversely, the
Elk Series yielded the least atrazine to runoff; 50 percent
of the runoff data were at residue levels of 1 x 10~6
kilograms per hectare. Within the Big Beaver Creek
subbasin, the Manor Series had the greatest potential
to yield atrazine in runoff.

As with the runoff data, the Haggerstown Series had the
greatest potential to yield atrazine in eroded sediments,
and the Elk Series yielded the least atrazine in erosion.
Within the Big Beaver Creek subbasin, the Manor Series
had the greatest erosion potential regarding atrazine.

GIS Analyses

After entering the results from the transport modeling
into a database, GIS could produce maps showing the
location of soils with high runoff potentials. Figure 2a
shows the orientation of soil series around the surface
water sampling points in Big Beaver Creek. Figure 2b
represents the same scene but fills in the soils with high
runoff potential. Using this type of analysis can help
areas that may be sources of nonpoint source runoff
contamination. Once identified, these soils can be tar-
geted for alternative management practices that may
reduce the amount of runoff and the degree of nonpoint
source contamination.
159
-------
Table 1. Cumulative Frequency of Simulated Atrazine Residues in Runoff for 12 Major Soils in Lancaster County, Pennsylvania
(values are percentage of data)
Load
(kilograms
per
hectare)
IE-10
IE-9
IE-8
IE-7
IE-6
IE-5
IE-4
IE-3
0.00
0.01
0.10
1.00
10.00
25.00

Bucks
83.33
78.79
74.24
66.67
56.06
47.78
36.36
24.24
15.91
6.82
0.00
0.00
0.00
0.00

Chester
84.85
80.30
76.52
68.94
46.97
35.61
21.97
15.15
6.82
0.00
0.00
0.00
0.00
0.00

Clymer
84.09
80.30
75.00
66.67
56.06
46.21
32.58
21.21
14.39
6.06
0.00
0.00
0.00
0.00
Soil
Connestoga
84.08
81.82
78.03
72.73
61.36
50.00
35.61
21.21
15.12
6.06
0.00
0.00
0.00
0.00
Series
Elk
78.79
75.76
69.70
59.09
50.00
43.18
31.82
21.21
15.15
6.82
0.00
0.00
0.00
0.00
in Lancaster County, Pennsylvania
Glenelg
84.85
80.30
76.52
68.18
56.06
47.73
34.85
21.21
13.64
3.79
0.00
0.00
0.00
0.00
Haggerstown
93.94
93.94
93.94
91.67
84.85
74.24
53.79
37.12
19.70
7.58
0.00
0.00
0.00
0.00
Hollinger
83.33
81.06
77.27
68.94
59.09
47.73
34.85
21.21
15.15
6.06
0.00
0.00
0.00
0.00
Lansdale
88.64
85.61
80.30
75.00
65.15
51.52
40.15
24.24
15.91
6.82
0.00
0.00
0.00
0.00
Letort
85.61
84.85
84.09
83.33
76.52
67.42
46.97
31.82
17.24
4.55
0.00
0.00
0.00
0.00
Manor
86.36
84.61
84.85
75.52
66.67
46.97
31.82
16.67
4.55
0.00
0.00
0.00
0.00
0.00
Lingers
84.09
84.09
81.82
75.76
67.42
53.03
39.39
24.24
15.15
4.55
0.00
0.00
0.00
0.00
This table reads as follows: Given the Elk Series, the first value
IE-10 kilograms per hectare. Similarly, within the same column, 6
than 0.1 kilograms per hectare.
reads that 78.79 percent of the simulated data were greater than or equal to
.82 percent of the data were greater than 0.01 kilograms per hectare but less
Pequa Creek
Pequa Creek
0 0.5 1

Figure 2a. Soil series in the Big Beaver Creek basin.

Detailed Modeling of the Manor Series

Performing an introductory probabilistic modeling exer-
cise allowed further investigation of the potential of the
Manor Series to release atrazine into runoff. The exist-
ing 30-year climate data and the stated agronomic data
were retained from the previous modeling. The organic
carbon content of the surface soil layer, however, was
allowed to vary between the published low, average, and
maximum values found in the DBAPE database. This
exercise followed the principles set forth by Laskowski
et al. (5) and others who describe probabilistic modeling
0.5
Figure 2b. Soils sensitive to atrazine runoff in the Big Beaver
Creek basin.
approaches. In essence, by varying input parameters
within known endpoints, the probabilistic approach can
generate a distribution of pesticide residue values that
statistically reflects the anticipated residues. Parame-
ters to vary may include:

• Organic carbon content

• Surface slope

• Kd (distribution coefficient)
160
-------
• Moisture content

By allowing input variables to vary according to a normal
distribution, this approach thereby eliminates some of
the uncertainty associated with pesticide transport mod-
eling. The probabilistic modeling approach requires the
creation of a significant database by performing many
runs (e.g., 1,000 model runs that generate 12,000
monthly values for each soil).

This study included a limited probabilistic modeling ex-
ercise. Table 2 lists the results for the mean atrazine
residues in runoff, erosion, and leaching for the Manor
Soil Series during the:

• Entire year

• Growing season

• Winter months

The surface slope was held constant at 6 percent, but
the soil organic carbon content varied within the publish-
ed range. The means for all months show limited vari-
ation in mean residues. Runoff was by far the major
Table 2. Summary Statistics for Detailed Modeling of the
Manor Soil Series in the Pequa Creek Watershed
(statistics based on 1,080 values)

Mean Atrazine Residue
(kilograms per hectare)
Percent
Organic
Carbon3
All Months
Low
Average
High
Growing Season
Low
Average
High
Winter Months
Low
Average
High
Runoff

0.01381
0.01229
0.01105

0.03290
0.02881
0.02540

0.00014
0.00045
0.00075
Erosion

0.00024
0.00042
0.00057

0.00058
0.00100
0.00130

< 0.000002
< 0.000001
< 0.000003
Leaching

0.00028
0.00012
< 0.000006

0.00017
< 0.000008
< 0.000004

0.00036
0.00014
< 0.000007
Data taken from DBAPE soils database as low, midpoint, and maxi-
mum reported organic contents.
source of atrazine. Erosion and leaching values were on
similar scales (trace amounts).

The greatest runoff and erosion values occurred during
the growing season. The greatest leaching, however,
occurred during the winter months. These results sup-
port the general observations that surface residues run
off during spring and summer but that as the crops grow
and evapotranspiration increases, recharge to ground
water decreases, subsequently limiting pesticide trans-
port to ground water. Conversely, during the winter
months, the surface soil pesticide residues generally
decrease because of exposure to months of photolysis,
hydrolysis, and biodegradation. Subsurface residues
have been protected from degradation, however, and
increased ground-water recharge, due to great reduc-
tions in evapotranspiration, transports the residues
through the soil column.

This limited exercise provided a valuable learning expe-
rience regarding probabilistic modeling. As computing
techniques and hardware advance, the cost in time and
money for each simulation should decrease dramati-
cally. Although researchers tend not to have great faith
in pesticide transport modeling, the advances in this
field will reduce uncertainty and instill greater confi-
dence in the modeling process.

GIS Pitfalls

CIS is a powerful tool and has great promise for use
in environmental problem-solving. Several points or
pitfalls, however, hinder broad acceptance of GIS. As
with most new technologies, cost is the overriding con-
cern in using GIS. Although technical staff and project
scientists understand the power of GIS and the effort
that data preparation requires, management and corpo-
rate staff often do not see the benefits for the costs.
Many managers assume that current GIS systems re-
semble those seen on "Star Trek," and when reality
becomes apparent, managers tend to discard GIS as
too costly and complex. Several points need considera-
tion when contemplating the use of GIS. Although vari-
ous products exist, this discussion focuses on
ARC/INFO and MAPINFO products.

Hardware

Computer hardware is plentiful if the available budget
can support a purchase. Many high- powered GIS pack-
ages (e.g., GRASS, ARC/INFO, INTERGRAPH, IDRISI)
run best on mainframes or minicomputers. Most techni-
cal staff, however, only have access to PC machines.
Corporate purchasing departments more readily expend
funds for PC technology because they will eventually
find use forthese machines even if they are not used for
GIS. A recent ARC/INFO advertisement (August 1994)
lists costs for SUN SPARC minicomputer systems with
ARC/INFO software at $12,000 to $15,000 depending
on configuration.

Minicomputers and mainframes require specialized staff
to configure and maintain the hardware. Today, many
staff level personnel can open and augment their PC
machines with a minimum of external support. GIS per-
formance reflects the tradeoff in hardware, particularly
161
-------
when a considerable amount of data manipulation is
required. For example, if linking discrete depth soil se-
ries data to STATSGO soil associations is necessary,
then a minicomputer system may be best. The postproc-
essed data could, however, be exported to a format that
will run on PC-based systems. If the user wants to
import and manipulate remote sensing imaging (e.g.,
SPOT or Landsat data), then minicomputers are recom-
mended. If the user wants to display already edited
images and preprocessed CIS data, then PC-based
computing may be sufficient. The ultimate use of CIS
drives the hardware selection.

Software

A great number of CIS software packages are available
to meet almost any level of use and expertise. Software
runs under both UNIX and DOS/Windows (denoted as
DOS forthe remainder of this paper) operating systems.
The UNIX-based software tends to be more powerful
and flexible than the DOS-based software. UNIX-based
packages require more specialized staff to optimize
CIS, however.

UNIX-based software packages include GRASS,
ARC/INFO, INTERGRAPH, and IDRISI. Costs vary
from public domain charges for GRASS and IDRISI to
vendor supplied ARC/INFO and INTERGRAPH, which
can cost several thousand dollars each.

DOS versions of ARC/INFO (e.g., PC ARC/INFO,
ARCAD, ARC/VIEW) are also available and provide the
userwith various levels of data editing and manipulation
abilities. Generally, PC ARC/INFO is the same as the
UNIX version, varying in speed of processing. ARCAD
is a CIS engine that uses AutoCAD for drawing and
displaying, giving the user most of the abilities of the
UNIX-based version. ARC/VIEW I was primarily a dis-
play and simple analysis tool. It allowed the user to view,
display, and manipulate existing arc data but did not
support image editing. Currently, ARC/VIEW II provides
more support for image editing and data manipulation.
Costs range from about $3,000 for PC ARC/INFO and
ARCAD (AutoCAD also costs about $2,000) to around
$500 for the ARC/VIEW products.

MAPINFO is a DOS-based CIS that was designed for
marketing and demographic applications. Several re-
searchers, however, have used MAPINFO for environ-
mental applications. The most outstanding feature of
MAPINFO is that it easily imports data layers as it reads
dBASE type files directly. MAPINFO V3 also reads da-
tabase files and recreates them as *.TAB files. In con-
trast to the "coverage and entity" concepts of the
ARC/INFO line of programs, MAPINFO reads latitude
and longitude coordinates and displays the results. This
simplifies data management because many researchers
who have already created custom databases can easily
import those data as long as latitude and longitudes
coordinates are present.

As with the ARC/INFO line of programs, many common
data layers can be purchased for use in MAPINFO.
These layers can be expensively priced, costing ap-
proximately $1,000 per county for roadway, census, and
demographic data. One major lapse is the poor library
of environmental layers, USGS topography, hydrogra-
phy, soil boundaries, or climate stations. MAPINFO does
sell a module that allows users to convert to and from
ARC/INFO coverages so that common data layers can
be established. Experience shows, however, that con-
version programs do not always work as advertised. For
example, large boundary files (STATSGO data for Indi-
ana) do not readily convert from ARC/INFO to MAP-
INFO. Third-party vendors may be needed to convert
data for use in MAPINFO.

One very important factor supporting the use of MAP-
INFO is that it has a business application slant; there-
fore, it is slightly easier to convince corporate
management to invest in CIS because marketing and
sales data (territories) can be relatively easily overlain
onto environmental data.

Finally, some packages that are add-ons to spreadsheet
programs tend not to be powerful or versatile enough for
use in environmental CIS work. These software pack-
ages may be valuable as an introduction to CIS tech-
niques, however.

Data A vailability and Format

After compiling the hardware and software into CIS, the
next step entails accessing data layers such as:

• State and county boundaries

• Land use covers

• Water boundaries

Currently, USGS DLGs for hydrography, land use, trans-
portation, and cultural features are available for minimal
costs. Shareware programs can convert the USGS
DLGS formats into DXF (data transfer files) for import to
CIS packages. These data require conversion to DXF
or ARC coverage type formats for use in either
ARC/INFO or MAPINFO.

The USDA Soil Conservation Service produces digital
data for soil types (STATSGO and SSURGO) that users
can import to ARC/INFO relatively easily. The STATSGO
data cost approximately $1,000 per state and are avail-
able for most states. The detailed soil series maps,
SSURGO, cost approximately $500 per county and are
not available for every county in the United States. Many
data layers are available for direct use by GRASS. As
of yet, however, no convenient conversion utilities exist
to move GRASS data to ARC/INFO or MAPINFO. The
162
-------
U.S. Fish and Wildlife Service now distributes data
layers from the National Wetlands Inventory on the
Internet (enterprise.nwi.fws.gov).

Other data sources available through private vendors
are listed in the MAPINFO and ARC/INFO user guides
and in any issue of CIS World. The user should be
prepared to absorb significant costs if purchasing all the
required data layers.

Conclusion

This study shows that the technology and software exist
for a water resource manager to couple pesticide trans-
port modeling with CIS to identify areas or individual
soils that may contribute to nonpoint source pollution of
water resources. The study used PC-based computing
system and software. Soils maps and hydrographic
maps can be easily digitized for limited cost. A skilled
scientist or technician, without being a CIS expert, can
run CIS to answer specific questions. The technologies
this study demonstrated may be extremely valuable to
managers responsible for producing SMPs.

The pesticide transport modeling performed during
this study was intended for illustrative purposes. More
detailed analyses, and additional simulations, would be
necessary to use these data for regulatory actions or
land use management. The study did, however, succeed
in identifying the Manor Soil Series, with slopes exceed-
ing 6 percent, within the Big Beaver Creek subbasin of
the Pequa Creek basin, as a potential source for
atrazine in runoff.

References

1. Ward, J.R. 1987. Surface-water quality in Pequa Creek Basin,
Pennsylvania, 1977-1979. U.S. Geological Survey. Water Re-
sources Investigation Report 85-4250.

2. Smith, M.C., A.B. Bottcher, K.L. Campbell, and D.L. Thomas. 1991.
Field testing and comparison of the PRZM and GLEAMS models.
Trans. Amer. Soc. of Agric. Eng. 34(3):838-847.

3. Mueller, T.C., R.E. Jones, P.B. Bush, and PA. Banks. 1992. Com-
parison of PRZM and GLEAMS computer model predictions with
field data for alachlor, metribuzin, and norflurazon leaching. Envi-
ron. Toxicol. and Chem. 11:427-436.

4. Pennell, K.D., A.G. Hornsby, R.E. Jessup, and P.S.C. Rao. 1990.
Evaluation of five simulation models for predicting aldicarb and
bromide behavior under field conditions. Water Resour. Res.
26(11):2,679-2,693.

5. Laskowski, D.A., P.M. Tillotson, D.D. Fontaine, and E.J. Martin.
1990. Probability modeling. Phil. Trans. Royal Soc. of London
329:383-389.

6. Ward, J.R., and D.A. Eckhardt. 1979. Nonpoint-source discharges
in Pequa Creek Basin, Pennsylvania, 1977. U.S. Geological Sur-
vey. Water Resources Investigation Report 79-88.

7. Custer, B.H. 1985. Soil survey of Lancaster County, Pennsylvania.
Washington, DC: U.S. Department of Agriculture, Soil Conserva-
tion Service.

8. U.S. Geological Survey. 1977. Lancaster County, Pennsylvania:
County map series (topographic) 1:50,000 scale.

9. Carsel, R.F., C.N. Smith, L.A. Mulkey, D.J. Dean, and P. Jowise.
1984. Users manual for Pesticide Root Zone Model (PRZM), Re-
lease I. EPA/600/3-84/109.
163
-------
Integration of GIS With the Agricultural Nonpoint Source Pollution Model:
The Effect of Resolution and Soils Data Sources on Model Input and Output
Suzanne R. Perlitsh
State University of New York, Syracuse, New York
Abstract
The assessment of agricultural nonpoint source pollu-
tion has been facilitated by linking data contained in a
geographic information system (GIS) with hydrologic
models. One such model is the Agricultural Nonpoint
Source (AGNPS) Pollution Model, which simulates run-
off, nutrients, and sediment from agricultural water-
sheds. Vector-based (ARC/INFO) and raster-based
(IDRISI) GIS systems were used to generate AGNPS
input parameters.
The objectives of this project were to generate AGNPS
input parameters in GIS format from GIS data at differ-
ent resolutions and different levels of detail (soil survey
soils data versus soils data currently available in digital
format from the United States Department of Agricul-
ture). Differences in the AGNPS model sediment out-
put based on the variations in CIS-generated AGNPS
model input were evaluated.
The study also evaluated the influence of cell size reso-
lution and soils data on sediment generated within each
cell in the watershed (SGW), sediment yield from each
cell in the watershed (SY), sediment yield at the water-
shed outlet, and peak flow. Model output was validated
by comparison with measured values at the watershed
outlet for a monitored storm event. Results of this study
indicate that the use of different resolution GIS data and
different soils data sources to assemble AGNPS input
parameters affects AGNPS model output. Higher reso-
lution data do not necessarily provide better results.
Such comparisons could affect decision-making regard-
ing the level and type of data analysis necessary to
generate sufficient information.

Introduction
Agricultural runoff is a major contributor to nonpoint
source pollution. Fifty-seven percent of the pollution in
impaired lakes and 64 percent of the pollution in im-
paired rivers of the United States can be attributed to
agricultural nonpoint source pollution (1). Sediment is
one of the most common agricultural nonpoint source
pollutants and is the largest pollutant by volume in the
United States (2). More than 3 billion tons of sediment
enter surface waters of the United States each year as
a result of agricultural practices (1).

Accurate assessment of the effects of agricultural activi-
ties on water quality within a watershed is vital for
responsible watershed management and depends on
our ability to quantify the spatial variability of the water-
shed and the complex interactions of hydrologic proc-
esses (3). Computer models have been developed to
simulate these hydrologic processes to provide esti-
mates of nonpoint source pollutant loads. Adequate
simulation of a watershed's spatial variability helps pro-
vide the best representation of hydrologic processes
within the watershed.

Preservation of spatial variability within hydrologic mod-
els can be accomplished using a distributed parameter
model. The distributed parameter model is more advan-
tageous than lumped parameter models, which gener-
alize watershed characteristics, because distributed
parameter models provide more accurate simulations of
the systems they model (4). One of these models is the
Agricultural Nonpoint Source (AGNPS) Pollution Model.
AGNPS is a distributed process model because it pro-
duces information regarding hydrologic processes at
grid cells within the watershed, thus enabling preserva-
tion of the spatial variation within the watershed. Distrib-
uted parameter models integrate well with GIS because
GIS can replicate the grid used in a distributed parame-
ter model. Manual compilation of AGNPS input parame-
ters required to evaluate small areas at low resolution
(large grid cells) is relatively easy. Manually assembling
data to evaluate larger areas at finer resolutions be-
comes tedious, however. The integration of GIS data
with the AGNPS model facilitates data assembly and
manipulation (5).
164
-------
Several researchers have integrated AGNPS with CIS
(4-11). Smaller cell sizes within distributed parameter
models are thought to best represent spatial variability
within a watershed (5, 10). Certain AGNPS input
parameters show sensitivity to changes in grid cell size,
affecting sediment yield output (11). The use of CIS to
generate input parameters for the AGNPS model en-
ables analysis of watersheds at higher resolutions than
would be practical using manual methods (5).

Research Hypotheses

The project investigated the following research hypotheses:

• AGNPS output at the highest resolution will better
approximate sediment yield at the watershed outlet.

• AGNPS output for sediment generated within each
cell in the watershed at highest resolution will best
reflect the watershed processes.

• AGNPS output generated from the more detailed soil
survey data will better estimate watershed processes.

The project also investigated other questions: will cer-
tain AGNPS input parameters (cell land slope, soil erodi-
bility [K], the cropping factor [C], and the U.S.
Department of Agriculture [USDA] Soil Conservation
Service [SCS] curve number [CN] show sensitivity to
changes in grid cell size? How does slope affect model
output? Does a qualitatively significant difference exist
between model input parameters and output calculated
from data sets generated at different resolutions with
different levels of detail in soils output?

Objectives/Tasks

The research in this project included analyses of:

• Certain AGNPS input parameters generated at differ-
ent resolutions (10- x 10-, 30- x 30-, 60- x 60-, and
90- x 90-meter resolutions).

• AGNPS output for sediment yield (SY) and sediment
generated within each cell (SGW) at different resolu-
tions (center cells of 10- x 10-, 30- x 30-, 60- x 60-,
and 90- x 90-meter resolutions).

• AGNPS output generated from different levels of de-
tail in the soils input data (soil survey versus STAT-
SGO data sources).

Significance

Version 4.03 of AGNPS was released in June 1994. Ver-
sion 4.03 allows for evaluation of 32,767 cells. This version
allows for cell sizes from 0.01 to 1,000 acres (approxi-
mately 6.36-x 6.36-meter resolution to 2,012-x 2,012-me-
ter resolution). Previous versions of AGNPS limited the
number of cells to 3,200 and the cell size resolution to 0.4
hectares (or 63.25 x 63.25 meters) (12). Reviewed litera-
ture provides no evidence that AGNPS has been used
to evaluate a watershed at 10- x 10-meter resolution.

The soils data in this study were compiled at two differ-
ent levels of detail. Soils data at the 1:20,000 soil survey
level were generated in digital format. This level of detail
was compared with soils data at the State Soil Geo-
graphic (STATSGO) database level with a scale of
1:250,000. Reviewed literature mentions no previous
studies comparing AGNPS output with input generated
from these two different levels of detail in soils input.

Technology for collecting and processing geographic
data is continuously improving. Currently, the United
States Geological Survey (USGS) 1:24,000 digital ele-
vation models (OEMs) are available at 30- x 30-meter
resolution. New satellite technology will enable DEM
data to be available at 10- x 10-meter resolution, or
higher. Certain satellites currently provide land cover
data at 10- to 30-meter resolution (13).

An important objective of this project was to determine
whether higher resolution data provide different results
when routed through AGNPS. Does spatial data at
higher resolutions provide better information? This pa-
per describes the results of an analysis of AGNPS out-
put based on different levels of both resolution and soils
detail in CIS data input sources.

Materials

The Study Area

An ongoing effort is underway to clean up Onondaga
Lake, Onondaga County, New York. To accomplish this
effort, areas contributing agricultural nonpoint source
pollution to the lake are being evaluated.

The Onondaga Lake watershed is approximately 287.5
square miles, with 40 subwatersheds. The subwater-
sheds in the agricultural portion of the Onondaga Lake
watershed (south of Syracuse, New York) have been
isolated for study of their potential nonpoint source con-
tributions to Onondaga Lake (see Figure 1). The study
area watershed (1.84 square miles, 1,177.5 acres) is
one of these agricultural subwatersheds. CIS data were
collected within the Otisco Valley quadrangle (USGS
1:2,400), which includes the southern portion of the
Onondaga Lake watershed. Elevations in the study wa-
tershed range from 1,820 feet to 1,203 feet, with an
average elevation of 1,510 feet. The watershed perime-
ter is approximately 6.5 miles (34,505 feet). The streams
in the watershed flow from south to north to Rattlesnake
Gulf, with a stream length of approximately 3.08 miles
(16,265.4 feet). The stem fall of the main stream stem
is quite steep at 283 feet per mile. The drainage density
of the watershed is 1.67 miles of stream persquare mile.
Land use in the watershed is predominantly agricultural
(82.8 percent).
165
-------
Study Area Watershed
Canty Hill Rd. ! Bar*ell Rd
Figure 1. Onondaga Lake watershed and study area (not to scale).

The AGNPS Model

AGNPS was developed to analyze and provide esti-
mates of runoff water quality, specifically to evaluate
sediments and nutrients in runoff from agricultural
watersheds for a specific storm event (11). To use
AGNPS, a watershed is divided into cells of equal area.
Calculations for each of the model output values are
made within each cell based on the watershed data
contained in each cell. Approximately 1,000 people in 46
different countries use the AGNPS model. Users include
students, university professors, government agencies,
lake associations, and environmental engineers.1
AGNPS was developed in 1987 by the Agricultural Re-
search Service (ARS) in cooperation with the Minnesota
Pollution Control Agency (MPC) and the SCS. The
model runs on an IBM-compatible personal computer.

Data Sources

The CIS packages of ARC/INFO Version 3.4D (14) and
IDRISI Version 4.1 (15) were used to prepare input
parameters for the AGNPS model. AGNPS input pa-
rameters were derived from three base maps—land
use, a DEM, and soils. Table 1 shows the 22 input
parameters that AGNPS required, and the base source
for the data.

The land use map was obtained from a classified
ERDAS image (resolution of 28 x28 meters). The image
was converted to IDRISI, brought into ARC/INFO, and
regridded based on the resolution requirements of each
data set. USGS could not provide a DEM for the study
Personal communication from AGNPS Technical Support, September
1994.
area, so the OEMs were interpolated from points digit-
ized in ARC/INFO based on the Clarke method (16). The
OEMs were interpolated in IDRISI on 10- x 10-, 30- x30-,
60- x 60-, and 90- x 90-meter resolution surfaces.

Soil survey data were obtained from Onondaga County
Soil Survey air photographs. An orthophoto of the
7.5 minute quadrangle was obtained from the USGS
and was used with a zoom transfer scope to ortho-cor-
rect the soil survey data. The corrected soil polygons
were then digitized in ARC/INFO. The Otisco Valley
quadrangle comprises 79 soils mapping units. Thirty-
eight different mapping units occur in the study area
watershed.

The USDA SCS (now the Natural Resource Conser-
vation Service [NARCS]) provides digital soils data
from its STATSGO database. The mapping scale of
STATSGO data is 1:250,000, thus it is best suited for
broad planning and management uses. The number of
soil polygons per quadrangle is between 100 and 400,
and the minimum area mapped is 1,544 acres. The
STATSGO soil data used in this project were obtained
from the Onondaga County Soil Conservation Serv-
ice. Approximately seven STATSGO soil groups were
identified for the Otisco Valley quadrangle. Only one
STATSGO soil type occurs in the study area watershed
(Honeyoe silt loam).

Methods

AGNPS input parameters that showed sensitivity to
changes in grid cell size in previous studies were com-
pared between the resolutions. The AGNPS model was
run eight times using precipitation values from the actual
166
-------
Table 1. AGNPS Input Parameters
# AGNPS Parameter
Root Data Source
General Derivation of Input
1.
2.
3.
4.
5.
6.
7.
8.
9.
10.
11.
12.
13.
14.
15.
16.
17.
18.
19.
20.
21.
22.
Cell number
Cell division
Receiving cell number
Receiving cell division
Flow direction
SCS curve number
Land slope percentage
Slope shape factor
Average slope length
Manning's n coefficient
USLE K factor
USLE C factor
USLE P factor
Surface condition constant
Chemical oxygen demand
Soil texture
Fertilizer indicator
Pesticide indicator
Point source indicator
Additional erosion
Impoundment indicator
Channel indicator
Watershed map
Not applicable
Aspect map from DEM
Not applicable
Aspect map from DEM
Land use and soils coverage
Slope map from DEM
Algorithm
Table of values
Literature values
SCS and soil survey
Literature values
Literature values
Land use coverage
Land use coverage
Soil survey
Land use coverage
Land use coverage
USGS 1:24,000 map
Field survey, known gullies
1:24,000 map, field survey
Streams coverage
Program written to determine #
No cell division, assumed 0
Program written to determine #
No cell division, assumed 0
Reclassed 1-8 from azimuth map
SML written to determine CN
Provided in slope percentage from IDRISI
Assume uniform slope
Obtained from SCS
Attached to land use database
Attached to soils database
Attached to land use database
Attached to land use database
Attached to land use database
Attached to land use database
Attached to soils database
Assumed for agricultural land class
Assumed for agricultural land class
Points in ARC/INFO and IDRISI
Assume no additional erosion
Assume no impoundments
Assume no significant channel
storm that was monitored. Each time, the model was run
using an input file created with the different input data
sources as follows:
1. 30- x 30-meter resolution-soil survey data.
2. 30- x 30-meter resolution-STATSGO data.
3. 60- x 60-meter resolution-soil survey data.
4. 60- x 60-meter resolution-STATSGO data.
5. 90- x 90-meter resolution-soil survey data.
6. 90- x 90-meter resolution-STATSGO data.
7. Center cells of the 10- x 10-meter resolution—soil survey
data.
8. Center cells of the 10- x 10-meter resolution—
STASGO data.
As grid cell size increases, the time required to assemble
data as well as the space required to store the data files
increase. If a cell size resolution is cut in half, the
number of cells in that coverage quadruples. In the study
area watershed, increasing grid cell size from 90- x 90-
meter resolution to 60- x 60-meter resolution created
778 more cells within the watershed. Moving from
60- x 60-meter resolution to 30- x 30-meter resolution
added 3,646 cells to the watershed, and moving from
30- x 30-meter resolution to 10- x 10-meter resolution
added 40,115 cells to the watershed data set (see
Figure 2). Due to the 32,767-cell limitation of AGNPS
Version 4.03, AGNPS output for SY and SGW at the
10- x 10-meter resolution (which contains 45,104 cells)
could not be obtained. Input parameters at 10- x 10-meter
resolution, however, could be compared with input
parameters at 30- x 30-meter resolution.
Methodology of Data Analysis
The input parameter "maps" were converted to IDRISI
files and combined in a format that could be routed
through the AGNPS model. AGNPS model output for
soil generated within each cell and for sediment yield
was assembled. The 30- x 30-meter resolution maps
were compared with the 60- x 60-meter resolution maps;
the 60- x 60-meter resolution maps were compared with
the 90- x 90-meter resolution maps; and the 30- x 30-
meter resolution maps were compared with the 10- x 10-
meter resolution maps.
A method for comparing maps with different grid sizes
was developed so that maps of different resolutions
167
-------
45,104
90x90
Figure 2. Effect of resolution on study area data set.
60x60
30x30
10x 10
Resolution (meters)
could be compared using the root mean square (RMS)
statistic. This method was selected to provide a means
for statistical analysis that could accommodate the spa-
tial variability of this data set. The 60- x60-meter resolution
maps were "expanded" in IDRISI. This duplicated each
grid cell so that the original 60- x 60-meter resolution
map was equivalent to the 30- x 30-meter resolution
map. The IDRISI command "RESAMPLE" was used to
bring the 60- x 60-meter resolution map data onto the
same grid size as the 90- x 90-meter resolution map.
The center cells of the 10- x 10-meter resolution map
were selected for comparison with the 30- x 30-meter
grid cells. This comparison is based on the fact that the
center cell of the 10-meter resolution is the cell that
best corresponds to the entire cell of 30-meter reso-
lution (see Figure 3).

Once maps were registered on comparable grids, the
RMS difference between the maps of differing resolution
was used to compare the difference between the maps
and the effect of cell size resolution. The RMS statistic
is a measure of the variability of measurements about
their true values. The RMS is estimated by comparing
values in one grid system with the values in the com-
parison grid system. The difference between corre-
sponding values in each grid system is squared and
summed. The sum is then divided by the number of
measurements in the sample to obtain a mean square
deviation. Finally, the square root of the mean square
deviation is calculated. The RMS difference quantifies
the discrepancy between two data sets.
RMS:
(grich -grid2)2
Expand
Resample
Center Cell
30x30
laoVsoV

\
«' 60x60',

\
10x 10

Figure 3. Method for comparing cells of differing resolution.
Results

The Storm Event

The storm that was monitored for the purposes of this
field validation occurred on August 28,1994, at approxi-
mately 8:45 p.m.; the storm duration was approximately
1.5 hours. It was a high intensity, short duration thunder-
storm (see Figure 4). A global flow probe (Model FP101)
was used to obtain discharge velocity measurements in
feet per second. These values were then multiplied to
obtain cubic feet per second. The peak discharge occurred
at 11:30 p.m. on August 28, 1994, with a flow discharge
of 11.15 cubic feet per second. The average runoff for
the period was 4.458 cubic feet per second, or 2.42 cubic
168
-------
feet per second per square mile. The runoff volume per
day was 0.09 inches.

Total sediment yield was derived from the analysis of total
solids measured in field samples throughout the 24-hour
storm period. The samples were processed to evaluate
total suspended solids (TSS) using the vacuum filtration
procedure (17). A total of 1.204 tons of suspended sedi-
ment was predicted at the watershed outlet from field
data samples. A LaMotte field nutrient test kit was used
to measure nitrate and phosphate concentrations in the
stream. Nutrient values in this watershed for this storm
event were so small (phosphorous below 0.1 parts per
million and nitrogen 0.3 parts per million), they were not
selected as parameters to be used in evaluating and
validating the AGNPS model. The AGNPS predicted
nutrient output for the storm was not measurable (0.00
parts per million). The low levels of nitrogen and phos-
phorous in the stream channel during the storm event
can be attributed to the time of year in which the stream
was monitored. At the time of field validation, agricultural
activities were not operating.

Results at the Watershed Outlet: Peak Flow

The peak flow values that AGNPS calculated are largest
at the highest resolution and decrease as cell size in-
creases. The peak discharge from the watershed during
the monitored storm event was 11.15 cubic feet per
second. Comparisons of the AGNPS predicted peak
flow and the actual field-validated peak flow showed that
the 30- x 30-meter resolution cells best approximate the
peak flow of the watershed for the sampled storm event.
As grid cells increase from 30 x 30 to 60 x 60 and from
60 x 60 to 90 x 90, the peak flow is underestimated. As
grid cells decrease from 30 x 30 to 10x10, the peak
flow is grossly overestimated (see Table 2).

Results at the Watershed Outlet:
Sediment Yield

Sediment yield at the watershed outlet was determined
to be 1.204 tons. In all of the resolutions, the amount of
sediment deposited at the watershed outlet cell in-
creased as cell resolution increased (see Table 2). For
this particular watershed in this particular storm, the
AGNPS model overestimated the sediment yield pre-
dicted at the watershed outlet at the 10- x 10-, 30- x 30-,
and 60- x 60-meter resolutions and underestimated
sediment yield at the 90- x 90-meter resolution. Table 2
includes the information that AGNPS predicted for the
cell designated as the watershed outlet within each
resolution. (The results reported include output from the
center cells of the 10- x 10-meter resolution data set,
routed through the AGNPS model. Although these val-
ues are reported, the results from this data set cannot
be assumed to approximate the sediment output that
would result had the entire 10- x 10-meter resolution
data set been simulated.)

So/7 Survey Versus STATSGO Data

The Kappa statistic (14,18) was used as an indicator of
similarity to describe the differences between the
AGNPS output for SY and SGW generated from STATSGO
and soil survey data. Results (see Table 3) indicate that no
significant difference exists between the output derived
from the STATSGO and soil survey data inputs within
Relationship Between Streamflow Discharge
and Suspended Sediment
O)
"c
CD
o
c
o
o
"c
CD
E
T3
CD
600 T
500 --
400 --
300 --
200 ••
100 -•
M-l t"T I I M 1 1 |ifT'M'"M"l"i'l"1"t'l"i"t'"t"i"t'r">ll'l"l1l"l'"l
CD
O)
ooooooooooooooooooooooooooooo
ococococococococococococococococococococococococococococo
(Nco^mcDr^coajo-^-cNco^-^-cNco^incDr^coajo-^-cNco^incD
CN CN CN CN CN
Time (8/29/94 to 8/29/94)
Figure 4. Storm event hydrograph and pollutograph.
169
-------
Table 2. AGNPS Results at the Watershed Outlet Versus
Actual Field Values

Predicted
Soil Difference Predicted
Resolution Survey From STATSGO
(meters) Data Actual Data

10 x 10
(center cells)
Peak runoff 29.88 +18.73 29.88
rate (cfs)a
Total sediment 4.48 +3.49 4.69
yield (tons)
30x30
Peak runoff 11.27 +0.12 11.27
rate (cfs)
Total sediment 2.84 +1.64 3.11
yield (tons)
60 x 60
Peak runoff 9.96 -1.19 9.96
rate (cfs)
Total sediment 2.11 +0.91 2.12
yield (tons)
90x90
Peak runoff 8.87 -2.28 8.87
rate (cfs)
Total sediment 0.86 -0.34 0.87
yield (tons)
acfs: cubic feet per second.

Table 3. Kappa Coefficient of Similarity
Resolution (meters)

10 x 10 (center)
Soil survey versus STATSGO SY
Soil survey versus STATSGO SGW

30 x 30
Soil survey versus STATSGO SY
Soil survey versus STATSGO SGW
60 x 60
Soil survey versus STATSGO SY

Soil survey versus STATSGO SGW
90x90
Soil survey versus STATSGO SY
Soil survey versus STATSGO SGW

Difference
From
Actual

+18.73

+3.28

+0.12

+ 1.91

-1 .19

+0.92

-2.28

-0.33

Kappa

0.9866
0.9703

0.9859
0.9785

0.8960

0.7542

0.8743
0.6574
Table 4. RMS for AGNPS Sediment Loss Output

Description SGW Pounds SY Pounds
Soil Survey Data Constant
10 centers to 30-meter resolution 168.38 344.22
30- to 60-meter resolution 125.43 739.75

60- to 90-meter resolution 93.50 498.01
30- to 90-meter resolution 29.91 711 .49

STATSGO Data Constant
10 centers to 30-meter resolution 164.45 312.48
30- to 60-meter resolution 125.37 782.07
60- to 90-meter resolution 97.07 501 .81

30- to 90-meter resolution 122.04 747.51

Moving to higher cell resolutions increasingly affects
sediment generated within each cell; the largest differ-
ence in sediment generated within each cell occurs as
cell resolution increases from 30 x 30 to 1 0 x 1 0 meters.
Sediment generated within each cell is least affected by
moving from 60 x 60 to 90 x 90 meters. Sediment yield
per cell is most affected when cell resolution increases
from 60 x 60 to 30 x 30 meters and least affected by
increasing resolution from 30 x 30 to 10x10 meters.

These results prompted an assessment of the methods
used to compare resolutions, to determine whether the
effect on sediment yield between the 30- x 30- and
60- x 60-meter resolutions could result from the method
used in comparing the resolutions (expansion of the
60- x 60-meter resolution). The procedure of comparison
between the 30- x 30- and the 60- x 60-meter resolu-
tions was repeated; however, rather than expanding the
60 x 60 data file, the 60 x 60 data file was "resampled"
onto the 30- x 30-meter resolution grid, then the files
were compared. The RMS results (see Table 5) show
that both methods for comparing data between the reso-
lutions provide essentially the same results. The effect
of resolution on sediment yield per cell is, in fact, greatest
as resolution increases from 60 x 60 to 30 x 30 meters.

Results: AGNPS Input Parameters of Concern
Previous AGNPS analyses have shown sediment yield
(and sediment-associated nutrient yields) to be most
the same resolutions. This may be due to the homoge-
neity of the soil textures in both soils data sets (both
dominated by silty soils).

Effects of Resolution on SGW and SY

AGNPS output for SGW and SYwas evaluated for every
cell within the watershed. The RMS difference was ap-
plied to determine the relative effect of input data reso-
lution on SY and SGW output (see Table 4 and Figure 5).
affected by AGNPS inputs for cell land slope, the soil
erodibility factor (K), the Universal Soil Loss Equation's

Table 5. Comparison of RMS for 30 x 30 to 60 x 60

Method RMS SGW Pounds RMS SY Pounds
Expansion of 60 x 60
to 30 x 30
Resampling 60 x 60
to 30 x 30
Difference
125.43
125.37
0.06
739.75
739.30
0.45
170
-------
RMS Difference
Sediment Generated in Each Cell—Soil Survey Data
10 to 30
20 40 60 80 100 120140160180
RMS Difference
Sediment Yield per Cell—Soil Survey Data
60 to 90
30 to 60
10 to 30
0 100 200 300 400 500 600700 800
RMS Difference
Sediment Generated in Each Cell—STATSGO Data
60 to 90
30 to 60
10 to 30
0 20 40 60 80 100 120140160180
RMS Difference
Sediment Yield per Cell—STATSGO Data
60 to 90
0 100 200 300400 500 600700 800
Figure 5. RMS for AGNPS sediment output.

(USLE) cropping management factor (C) and the SCS
curve number (CN). To address the concerns regarding
the influence of these parameters on sediment yield, the
RMS differences (see Table 6 and Figure 6) and general
statistics (see Table 7) for these parameters were computed.

Discussion

When evaluating the RMS as an indicator of the effect
of resolution on input parameters and output sediment
values, looking at the overall trend between resolutions,
rather than focusing on specific values, is important. The
RMS statistics for the soil erodibility factor (K), the crop-
ping management factor (C), and the SCS curve number
(CN) are least affected by a decrease in cell size reso-
lution from 10x10 meters to 30 x 30 meters. These
parameters are most affected when cell size resolution
decreases from 30- x 30-meter to 60- x 60-meter reso-
lution. As resolution decreases further from 60 x 60
meters to 90 x 90 meters, the effect on RMS decreases.
The small-large-smaller trend in the RMS for these pa-
rameters is the same trend seen in the RMS for sedi-
ment yield throughout the watershed. The sediment
Table 6. RMS Difference: AGNPS Input Parameters of
Concern
Parameters of Concern
K value
Cropping factor
SCS curve number
Slope
10 to 30
0.0058
0.03
2.11
8.17
30 to 60
0.054
0.173
11.40
5.59
60 to 90
0.051
0.015
8.40
3.68
yield within each cell therefore seems to be most af-
fected by these input parameters. The general statistics
for each of these parameters of concern show that very
little difference exists in the values within each resolu-
tion, with the exception of slope. Slope values are higher
at the higher resolutions and decrease as resolution
increases. This is related to the method in which the CIS
calculates slope.

The RMS statistics comparing resolutions for sediment
generated within each cell follow the same trend as the
RMS statistics for slope percentage. As resolution in-
creases, so do the discrepancies between the compared
171
-------
Soil Erodibility (K)
60 to 90
30 to 60
10 to 30
0.01 0.02 0.03 0.04 0.05 0.06
USLE Cropping Management (C)
60 to 90
30 to 60
10 to 30
0 0.02 0.04 0.06 0.08 0.1 0.12 0.14 0.16 0.18
SCS Curve Number (CN)
Slope %
60 to 90
30 to 60
10 to 30
60 to 90
30 to 60
10 to 30
10 12
0123456789
Figure 6. RMS difference for AGNPS input parameter of concern.

data sets. This trend between resolutions indicates that
slope values influence sediment generated within each
cell in the watershed.

The results from the 10- x 10-meter resolution data set
were obtained by selecting the center cells of the
10- x 10-meter resolution data set and routing the data
from this set through AGNPS using the flow pathways
developed for the 30- x 30-meter resolution. The results
do not provide the same information as would be pro-
vided had the entire 10- x 10-meter data set been routed
through AGNPS. The RMS values obtained from com-
parisons of the 10- x 10-meter resolution input parame-
ters with the 30- x 30-meter resolution input data reveal
that little difference exists between the data in these
resolutions. Comparison of the center cell 10- x 10-meter
data set output with field monitored data shows that the
10x10 center cell data overestimates both peak flow
and sediment yielded at the watershed outlet. This can
be attributed to the larger slope values in this resolution.

Conclusion

This study used CIS to generate data files for application
to the AGNPS model. The objectives of this project were
to evaluate the effect of different levels of detail used in
generating the input files on selected input and output
parameters. The results show that, for a watershed with
characteristics equivalent to those of the study area
watershed, differences exist in model output based on
the cell size resolution of the watershed.

The selected cell size resolution directly affects slope
values. The influence of the slope parameter dominates
AGNPS predictions for sediment generated within each
cell and sediment yield at the watershed outlet in the
study area watershed. The indicated parameters of con-
cern have the most influence on sediment yield for each
cell in the watershed. The greatest variation in the indi-
cated parameters of concern and thus the sediment
yield output occurs between the 30- x 30-meter and
60- x 60-meter resolutions. AGNPS estimates for sedi-
ment yield in files generated from STATSGO data were
larger than sediment yields from files generated with soil
survey soils data in the 30- x 30-, 60- x 60-, and
90- x 90-meter resolutions. For this watershed, how-
ever, no significant difference existed between data gen-
erated from soil survey and STATSGO data sources as
indicated by the kappa coefficient of similarity.

Results predicted by the AGNPS model at the water-
shed outlet were compared with results from an actual
storm monitored at the watershed outlet. The 30- x 30-
meter resolution data set provided the most accurate
172
-------
Table 7. Statistics for AGNPS

Value 10 x 10
Input Parameters of Concern

30 x 30 60 x 60 90 x 90
K Value (units of K)
Average
Maximum
Minimum
Standard deviation
C Factor (units of C)
Average
Maximum
Minimum
Standard deviation
CN (units of CN)
Average
Maximum
Minimum
Standard deviation
Slope (%)
Average
Maximum
Minimum
Standard deviation

0.2989
0.49
0.17
0.0453

0.0306
0.076
0
0.0212

71 .004
100
55
9.13

34.13
567
0
33.75

0.2989
0.49
0.17
0.0453

0.0306
0.076
0
0.0211

71.003
100
55
9.08

30.153
224
0
23.45

0.2882
0.37
0.17
0.0513

0.0295
0.076
0
0.0213

70.75
100
55
8.97

27.67
152
0
18.97

0.2889
0.49
0.17
0.0456

0.0295
0.076
0
0.0208

70.68
100
55
8.84

26.38
99
0
16.44
prediction for peak flow at the watershed outlet. AGNPS
output in the 10- x 10-meter center, 30- x 30-meter, and
60- x 60-meter resolutions overestimated the actual
sediment yield recorded at the watershed outlet for the
validated storm event.

For the study area watershed, cell size resolution of
30 x 30 meters seems appropriate based on the accurate
AGNPS model prediction for peak flow when validated
with the field-monitored storm. The steep slopes created
in the 10- x 10-meter resolution data set may lead to an
overestimation of sediment output, rendering data at this
resolution unreliable. At this time, the 10- x 10-meter
resolution is both impractical and infeasible for use with
the AGNPS model.

AGNPS output at the highest resolution does not pro-
vide a better approximation of sediment yield at the
watershed outlet. AGNPS output for sediment gener-
ated within each cell in the watershed at the highest
resolution does not accurately simulate the watershed
processes. AGNPS output generated from the more
detailed soil survey data is not significantly different from
data generated by the STATSGO digital soils database.

This study raises the following questions:

• What level of detail is both practical and acceptable
for policy-making and decision-making?
• What constitutes a cost-effective analysis?

Ultimately, these questions are best answered on a
case-by-case basis and should be determined based on
the size of the study area and on how the results of the
analysis will be used (i.e., to make a direct land use
decision or for broader planning). For broad planning
analyses on large watersheds, the benefit of digitizing
the soil survey data is outweighed by the cost in time
and effort to generate this detailed database. STATSGO
data may be sufficient. If a direct land management
decision is being made for a small area such as a farm
within a watershed, however, the analysis should use
the most detailed soils data.

Recommendations for Future Work

The original intent of this study was to use the capabili-
ties of AGNPS Version 4.03 to evaluate a watershed
using data generated at a high cell size resolution—
10x10 meters. AGNPS h

ad never been used to evaluate data at such a high
resolution. As discovered during this project, the newest
version of AGNPS is not, at this time, capable of han-
dling a data set that has more than 32,767 cells (19).
Once this limitation with the AGNPS model is remedied,
the entire 10- x 10-meter resolution data set should be
routed through the model so that definite conclusions
regarding the applicability of such a detailed data set
can be made.

References

1. U.S. Department of Agriculture (USDA). 1991. Riparian forest
buffers: Function and design for protection and enhancement of
water resources. N1-PR-07-91.

2. Ashraf, M.S., and O.K. Borah. 1992. Modeling pollutant transport
in runoff and sediment. Trans. Amer. Soc. Agric. Eng. 35:1,789-
1,797.

3. Barten, P., and K. Stave. 1993. Characterization of streamflow
and sediment source areas for the Little Beaver Kill Watershed,
Ulster County, NY (July).

4. Vieux, B., and S. Needham. 1993. Nonpoint-pollution model sen-
sitivity to grid cell size. J. Water Resour. Planning and Mgmt.
119(2).

5. Kumar, V. 1993. Geographic information system application for
nonpoint source pollution management. Logan, UT: Utah State
University.

6. Yoon, J., L. Padmanabhan, and L. Woodbury. 1993. Linking Ag-
ricultural Nonpoint Source Pollution Model (AGNPS) to a geo-
graphic information system. Proceedings of the AWRA
Conference on Geographic Information Systems and Water Re-
sources (March).

7. Haddock, G., and P. Jankowski. 1993. Integrating nonpoint
source pollution modeling with a geographic information system.
Department of Geography, University of Idaho.
173
-------
8. Mitchell, J., B. Engel, R. Srinivasan, and S. Wang. 1993. Valida-
tion of AGNPS for small watersheds using an integrated
AGNPS/GIS system. Proceedings of the AWRA Conference on
Geographic Information Systems and Water Resources (March).

9. He, C., J. Riggs, and Y. Kang. 1993. Integration of geographic
information systems and a computer model to evaluate impacts
of agricultural runoff on water quality. Proceedings of the AWRA
Conference on Geographic Information Systems and Water Re-
sources (March).

10. Freezor, D., M. Hirschi, and B. Lesikar. 1989. Effect of cell size
on AGNPS prediction. Paper No. #89-2184. ASAE/CSAE Meet-
ing, St. Joseph, Ml.

11. Young, R., C. Onstad, D. Bosch, and W Anderson. 1989.
AGNPS: A nonpoint-source pollution model for evaluating agri-
cultural watersheds. J. Soil and Water Conserv. 44(2).

12. U.S. Department of Agriculture (USDA). 1987. Agricultural Re-
search Service, AGNPS: Agricultural nonpoint-source pollution
model, a watershed analysis tool. Conservation Research Report
No. 35.
13. Star, J., and J. Estes. 1990. Geographic information systems: An
introduction. Englewood Cliffs, NJ: Prentice Hall.

14. ESRI. 1987. PC ARC/INFO user's guides. Redlands, CA: Envi-
ronmental Systems Research Institute, Inc.

15. Eastman, R. 1993. IDRISI technical reference. Worcester, MA:
Clark University.

16. Clarke, K.C. 1990. Analytical and computer cartography. Engle-
wood Cliffs, NJ: Prentice Hall.

17. Clesceri, L, A. Greenberg, and R. Trussell. 1989. Standard meth-
ods for examination of water and wastewater, 17th ed. pp. 2-75
to 2-76.

18. Rosenfield, G.H., and K. Fitzpatrick-Lins. 1986. A coefficient of
agreement as a measure of thematic classification accuracy.
Photogrammetric Engin. and Remote Sensing 52(2).

19. U.S. Department of Agriculture, Soil Conservation Service. 1995.
AGNPS Newsletter.
174
-------
XGRCWP, a Knowledge- and GIS-Based System for Selection, Evaluation,
and Design of Water Quality Control Practices in Agricultural Watersheds
Runxuan Zhao, Michael A. Foster, Paul D. Robillard, and David W. Lehning
Penn State University, University Park, Pennsylvania
Abstract

The Expert CIS Rural Clean Water Program (XGRCWP)
integrates a geographic information system (CIS), a
relational database, simulation models, and hypertext
mark language documents to form an advisory system
that selects, evaluates, sites, and designs nonpoint
source pollution control systems in agricultural water-
sheds. Its major features include:

• Customized CIS functions to obtain spatial and attrib-
ute data and feed them to a rule-based expert system
for selecting feasible control practices.

• A user interface for examining the field-specific con-
ditions and recommended control practices on the
screen by clicking on the displayed field boundary
map.

• A direct linkage between the CIS spatial data and the
relational attribute data, which allows users to exam-
ine data on the screen interactively.

• A graphic user interface to CIS functions, which en-
ables users to perform routine watershed analyses.

• Linkage to hypertext reference modules viewable by
Mosaic Internet document browser.

• Dynamic access to other models such as the Agricul-
tural Nonpoint Source Simulation Model.

The software environment of XGRCWP is GRASS 4.1
and X-Windows on SUN OS 4.3.1. Its major functions
have been tested for the Sycamore Creek watershed in
Ingham County, Michigan.

Introduction

In 1981, the U.S. Environmental Protection Agency
(EPA) and the U.S. Department of Agriculture (USDA)
initiated the Rural Clean Water Program (RCWP) in 21
agricultural watersheds. This program represents the
most intensive water quality monitoring and implemen-
tation and evaluation of nutrient, sediment, and pesti-
cide reduction practices ever undertaken in the United
States (1). More than a decade of research efforts has
resulted in a wealth of experiences and lessons on
selection, siting, and evaluation of nonpoint source con-
trol practices.

The storehouse of knowledge gained from RCWP is of
little use, however, unless it is properly integrated and
packaged in an easily accessible form. Technology
transfer of this knowledge is therefore critically impor-
tant. To integrate and synthesize the lessons learned
from RCWP, Penn State University initiated an RCWP
expert project. The hypertext-based version of the
RCWP expert system, completed in 1993, can select
and evaluate nonpoint source control systems at a sin-
gle site. Although the hypertext-based version is still
suitable for users who do not have access to geographic
information systems (CIS) data, it is inadequate for the
comprehensive selection and evaluation of control sys-
tems on a watershed basis. It does not provide the user
the spatial reference of a site and requires the user's
subjective judgment for the model input.

The Expert CIS Rural Clean Water Program (XGRCWP)
is the UNIX and X-Window version of the RCWP expert
system, which integrates CIS and the RCWP expert
system to provide decision support at multiple spatial
scales from single fields to subwatersheds to the water-
shed scale. This paper presents the major features of
XGRCWP, including design of the expert system, inter-
face to CIS functions, and linkages to a relational data-
base and simulation models.

Overview

XGRCWP comprises five major components (see
Figure 1):

• An expert system for recommending control practices
based on site-specific information.
175
-------
HTML Reference
Modules:
Contaminants
Monitoring
Transport
Case Studies
Expert System:
Recommending
Control Practices
X/Motif
Graphic User
Interface
GIS Functions:
Customized and
Existing Ones for
Watershed Analysis
Relational Database:
Relate Spatial and
Attribute Data
Simulation Models:
AGNPS
GLEAMS (To Be
Added)
Figure 1. Major components of XGRCWP and their relationships.

• Custom and existing GIS functions for watershed
analysis and estimation of contaminant loading
potential.

• Linkage to fields, soils, and land use databases.

• Linkage to the Agricultural Nonpoint Source Simula-
tion Model (AGNPS) (2).

• Hypertext mark language (HTML) reference modules.

The X/Motif graphic user interface (GUI) integrates the
five components and allows the user to navigate flexibly
among them. The components are also internally con-
nected in different ways. For example, the expert system
can use the customized GIS functions to retrieve site-
specific information from Geographical Resource Analy-
sis Support Systems (GRASS) (3) data layers and
INFORMIX relational database tables. In addition, the
expert recommendations of control practices can be
displayed and examined using GRASS functions. Fi-
nally, the GIS functions can help generate input to the
AGNPS model, and its output can be converted to GIS
format for additional analyses.
Design of the Expert System

The objective of the expert system is to recommend
feasible control systems (i.e., complementary sets of
control practices to reduce nonpoint source pollution
based on site-specific conditions). One distinct feature
of this system is the combination of two modes of data
acquisition: direct user input and GIS functions.
XGRCWP also has two modes for deriving the expert
recommendations: batch or interactive. This section dis-
cusses these aspects of the expert system as well as its
knowledge base.

Rules for Control Practice Selection

The knowledge base of the expert system includes the
following six site-specific characteristics:

• Contaminant of interest and its adsorption characteristic.

• Potential level of contaminant loading (low, medium,
or high).

• Potential level of contaminant leaching (low, medium,
or high).

• Soil hydrologic group (A, B, C, or D).
176
-------
• Time of year (during or outside the growing season).

• Type of land use (cropland, animal waste, or critical
area).

The user first chooses a contaminant of interest from a
list consisting of four kinds of pesticides (strongly, mod-
erately, or weakly adsorbed, and nonadsorbed) and
eight other contaminants (ammonia, bacteria, sediment,
total nitrogen, total phosphorus, nitrate, orthophospho-
rus, and viruses). The values of other characteristics,
some of which vary with the contaminant specified, can
then be input either directly by the user or by custom
GRASS functions as discussed in the section of this
paper on data acquisition.

The RCWP used 14 general categories of control prac-
tices (see Table 1). Many suitable conditions were es-
tablished for each general category. For example,
conservation tillage is recommended to reduce runoff for
cropland under conditions otherwise favoring loss
through sediment transport, such as a contaminant
strongly adsorbed to the soil (e.g., total phosphorus), the
nongrowing season, and soils with a relatively high run-
off potential (soil group C or D) (see Figure 2). Each
general category includes several specific control prac-
tices. When a general practice category is recom-
mended, the user must decide which specific practice
within that general category to evaluate further by con-
sulting the nonpoint source database (NPSDB) for the
reported research data about this practice or by running
the AGNPS simulation model.

Data Acquisition

The expert system recommends one or more control
systems based on site-specific conditions that are either
directly input by the user or calculated by the customized
GRASS functions. The user always specifies the con-
taminant of interest and the season, while a GRASS
function (R.HYDRO-GRP) always determines the soil
hydrologic group of each field. For the other factors
(loading potential, leaching potential, and application

Table 1. The Best Management Practices Used in the Rural
Clean Water Program
Source control
practices
Nutrient Management (NUTR)
Pesticide Management (PEST)
Structural control Animal waste systems (AWS)
practices Diversion systems (DIV)
Sediment retention and water control (SED)
Terrace systems (TERR)
Waterway systems (WATW)
Vegetative control
practices
Conservation tillage (CT)
Critical area treatment (CAT)
Cropland protection systems (CPS)
Grazing land protection (GLP)
Permanent vegetative cover (PVC)
Stream protection (SP)
Stripcropping (SCR)
class), however, the user has two alternative ways to
decide input values. For example, after the user selects
a contaminant of interest, the program displays the con-
taminant loading potential window (see Figure 3). The
potential level of the selected contaminant can be indi-
cated if the user knows it. Otherwise, the user can let
the GRASS functions derive loading potential from ex-
isting field data.

The direct input option can also be used to help the user
address "what-if questions. When the user selects the
CIS functions to determine the loading potential,
XGRCWP makes a series of calls to appropriate cus-
tomized GRASS functions according to the current con-
taminant of interest. For example, if the contaminant is
total nitrogen, the functions R.MANURE, R.FERT, and
R.B.CONCENTRATION are called to estimate total ni-
trogen from manure, fertilizer, and soil base concentration,
respectively. Another GRASS function, R.NP.LOADING,
is then called to translate the quantitative measure of
loading potential into the qualitative classification (low,
medium, or high) as input to the expert systems. These
GRASS functions generate the inputs by searching and
converting the data from INFORMIX relational data ta-
bles that are associated with the GRASS spatially refer-
enced data, such as field boundary and soil map. Table
2 lists the customized GRASS functions developed for
data acquisition.

Control System Recommendation

XGRCWP derives the expert recommendations for con-
trol systems in two ways: in a batch mode for every field
in a watershed and in an interactive mode for a user-
specified field.

In batch mode, an existing GRASS function, R.INFER,
is used to create a raster data layer for each general
practice category of control practice according to a rule-
set prepared for that general category. For example, the
contents and formats for the conservation tillage prac-
tice are documented in Table 3. The raster data layer for
representing the conservation tillage recommendations
(CTree) is generated by running R.INFER with the ap-
propriate rule. The category value of CTree is 1 at each
point in the data layer where the conservation tillage is
recommended, or 0 otherwise. The R.INFER function is
similarly called for other general practice categories.
Additional GRASS functions can then display or further
analyze the resulting map layers. The batch mode pro-
vides the user the overall picture with a watershed-wide
view of feasible control systems.

In the interactive mode, the field boundary map is dis-
played and the user can specify any field of interest by
clicking the mouse on it. The recommendations and the
site-specific conditions of the field are displayed on the
right half of the screen. The recommended control prac-
tices are also displayed in a popup window for further
177
-------
Adsorb
Nonadsorb
Contaminant
A
B
C | D
Soil Group

Low
Medium
High
Loading

Low
Medium
High
Leaching

Growing] Nongrowing
Season

CL
AW
HSA
Application
Figure 2. Dependency network (AND-OR diagram) for site-specific recommendation of conservation tillage.
examination, such as the specific practices within each
general category, the feasible control systems for non-
point source pollution control, and research data on the
practices. The interactive mode is implemented through
the integration of a Bourne shell script, structured query
language (SQL) commands, a customized GRASS
function (R.RCWP.EXPERT), and GRASS display func-
tions with the Motif GUI. Interactive mode is intended for
detailed consideration of a specific farm.

Interface to GIS Functions

XGRCWP provides a GUI to most of the customized
GRASS functions and some of GRASS'S existing func-
tions (see Figure 4). This interface shields the user from
complex syntax so the user can focus on the subject
(LOW
High
•% loitf determined by GIS fanctivn s
\ ™ i™,,,™™,.™.,™™™™™™,™.™,*,,.. *,„„,.,,,„„„. *>
-Tip
Figure 3. The popup window for the potential level of contami-
nant loading.
matter. The GUI makes it easier for the user to perform
routine operations such as estimation of contaminant
loading, identification of critical areas, erosion and runoff
calculation, and other watershed analysis tasks. It also
helps the user make full, effective use of all custom and
some existing GRASS functions.

Linkages to Database and Other Models

Data Structure

The GRASS functions used to generate inputs for the
expert system use the same soils and fields relational
databases as the Water Quality Model/GRASS Interface
under development by the Soil Conservation Service
(SCS) (4). XGRCWP and our custom GRASS functions
were tested forthe Sycamore Creek watershed, Ingham
County, Michigan. In this data structure, spatial data
(e.g., field boundaries, watershed boundaries, soils map
unit boundaries, and elevation data) are saved as
GRASS raster data layers while attribute data (e.g., crop
information, fertilizing schedule, soil information) are
stored in INFORMIX relational database tables. Each
field or soil map unit is assigned a unique identification
(ID) number. The field attribute (INFORMIX) data also
contain this ID number. The linkage between the
GRASS raster map and the INFORMIX data is accom-
plished with a GRASS category label (see Figure 5).

Linkage to Database

To allow the interactive examination of field data from
GRASS raster layers and the associated relational da-
178
-------
Table 2. Summary of the Customized GRASS Functions Developed by Nonpoint Source Agricultural Engineering Research
Group at Penn State University To Generate Inputs for the RCWP Expert System
Name
Descriptions
R.FERT

R.MANURE

R.B.CONCENTRATION

R.NP.LOADING

R.EROSION

R.LEACHING.P

R.HYDRO-GRP
Produces raster maps of total nitrogen or total phosphorous from the scheduled fertilizer applications for
different crops by dynamically retrieving information from a GRASS data layer and INFORMIX data tables

Calculates the total manure on each farm according to animal numbers and types (e.g., dairy cow, beef
cow, horses, swine), allocates manure to the fields on a farm by a user specified strategy (uniformly
spreading or inverse distance weighted distributing method), and finally estimates nitrogen and
phosphorous loading from manure application rate, conversion factor, percentages of transportation
losses, and volatile losses

Estimates nitrogen and phosphorous concentration in parts per million within different types of soils
according to the organic matter contents

Classifies the loading potential of nitrogen or phosphorous into three categories (low, medium, and high)
based on the actual loading from fertilizer and manure and the N or P concentration in soils

Obtains a relative measure of soil erosion severity by dividing the amount of erosion by the tolerance
values of the soils and then reclassifying them into three categories (low, medium, and high)

Estimates leaching index from soil hydrologic group and annual and seasonal precipitation and classifies
it into three categories (low, medium, and high)

Retrieves soil hydrologic group from the INFORMIX database and reclassifies the soil map into soil
hydrologic groups
tabase tables, XGRCWP calls our custom function,
D.WHAT.FIELD.SH, a Bourne shell script that dynami-
cally links GRASS raster layers and the INFORMIX
database tables. When the user clicks on a field, for
example, this function extracts field-specific information
from INFORMIX tables such as field information, fertili-
zation schedule, crop operation schedule, and soil infor-
mation. The D.WHAT.FIELD.SH function then displays
all related soils and fields information for the given field.
It also marks the field boundary map to remind the user
which fields have already been examined.

Linkage to Reference Modules

At any stage of the selection, evaluation, siting, and
design procedure for control practices, the user can
consult reference modules that provide information,
guidance, and data about contaminant properties, trans-
port variables, and examples of applications from
RCWP projects. Four reference modules are available
in the Macintosh version of RCWP expert system: con-
taminants, monitoring, transport, and case studies. We
are currently converting these reference modules into
Mosaic-viewable HTML documents so that they can be
accessed from XGRCWP. Mosaic is a public domain,
Internet-aware document browser that is available for
X-Windows, Macintosh, and Microsoft Windows.

All four modules use graphics to demonstrate design
procedures and contaminant control processes. The
contaminant module provides information about 11 cate-
gories of contaminants cited in RCWP projects and their
impacts on surface and ground-water resources. The
monitoring module describes different aspects of water
quality sampling and analysis systems. The transport
module describes contaminant pathways in surface
and ground water. The case studies module presents
detailed examples from key RCWP projects. These
examples cover both practice selection and implemen-
tation aspects of control systems. The reference modules
serve as a complementary component of XGRCWP.

Linkage to AGNPS

AGNPS is a distributed-parameter, storm event-based
model that estimates runoff, sedimentation, and nutrient
loss in surface runoff within agricultural watersheds (2).
The prototype version of the Water Quality Model/GRASS
Interface developed by SCS conveniently generates an
AGNPS input file for all cells in a watershed from the
spatial and relational soils and fields databases. The
UNIX version of AGNPS can then use this input file.
XGRCWP can call AGNPS directly from its X-Window
interface and convert standard AGNPS model outputs
for all cells in the watershed into GRASS raster format
for display and analysis.

Discussion

The literature on software systems for managing non-
point source pollution in agricultural watersheds is di-
verse and rapidly growing. With few exceptions (5-7),
these decision support systems are purely model-
based, CIS-based (8), or hybrid systems with models
running within a CIS framework (9-14). The addition of
expert system components can overcome some of the
difficulties in primarily model-based systems:

• Overly intensive input data requirements.

• Inability to handle missing or incomplete data.

• Requirements that all inputs be numerically expressed.
179
-------
Table 3. The Rule File for Recommending Conservation Tillage
IFNOTMAP app.class 3
ANDIFMAP contam.feature 2
ANDIFMAP leaching.p 1
ANDIFMAP soil.g 1 2
ANDIFMAP contam.load 1
ANDIFMAP season 2
THENMAPHYP 1 yes, CT
is recommended
i
IFNOTMAP app.class 3
ANDIFMAP contam.feature 2
ANDNOTMAP leaching.p 3
ANDNOTMAP soil.g 1
ANDNOTMAP contam.load 3
ANDIFMAP season 1
THENMAPHYP 1 yes, CT
is recommended
i
IFNOTMAP app.class 3
ANDIFMAP contam.feature 1
ANDNOTMAP contam.load 3
ANDIFMAP season 1
THENMAPHYP 1 yes, CT
is recommended
i
IFNOTMAP app.class 3
ANDIFMAP contam.feature 1
ANDNOTMAP leaching.p 3
ANDIFMAP soil.g 3 4
ANDNOTMAP contam.load 3
ANDIFMAP season 2
THENMAPHYP 1 yes, CT
is recommended
!application class is not high-source area
!contaminant is nonadsorbance
teaching potential is low
!soil groups are A or D
!contaminant loading is low
!nongrowing season
!application class is not high-source area
!contaminant is nonadsorbance
teaching potential is not high
!soil groups are not A
!contaminant loading is not high
!growing season
!application class is not high-source area
!contaminant is strong adsorbance
!contaminant loading is not high
!growing season
!application class is not high-source area
!contaminant is strong adsorbance
teaching potential is not high
!soil groups are C or D
!contaminant loading is not high
!nongrowing season
Manure M/ffffmf I ttati
Alamtre J
Options:
Loatl
_.r. i'' f>i--'-f:P I i~ fj i -:.'.
•>!. f - "ill • r i
Figure 4. The GUI to the R.MANURE function.
• High degree of expertise needed to structure model
input and explain model output relative to the user's
problem context.

The expert system component of XGRCWP also re-
duces the number of model runs needed for decision
support through preliminary, rule-based screening of
control systems at each site of interest in the watershed.

Conclusions

XGRCWP incorporates several kinds of expertise for the
user's benefit:

• Subject matter expertise in siting and selecting non-
point source control systems in agricultural water-
sheds.

• Expertise in configuring AGNPS model input from the
soils and fields databases.

• Expertise in interpreting, explaining, and visualizing
expert system and model input.

The integration of the expert system and the GRASS
CIS makes input to the expert system easier and more
objective. It enhances the expert system's capability
for recommending effective control practices at the
field level to achieve watershed contaminant loading
objectives. XGRCWP is designed as an open structured
180
-------
Category Value
1173
Category Label
FDID1173
Fnin

fert_sch_id
FTID01
fert_name
13-13-13
appl_rate
200.0

fert_name
13-13-13
total_n
13.000
total_p
5.700

Figure 5. Data structure of Sycamore Creek watershed, Ingham County, Michigan.
program and has great potential to be improved easily
and continually according to users' feedback. Ongoing
efforts to enhance the program include:

• Developing more rules that incorporate topographical
factors such as slope and slope length for the expert
system so that more site-specific control practices
can be recommended.

• Adding dynamic hypertext-based help and reference
to the program.

• Establishing intelligent linkages among the expert
system, the CIS functions, and other simulation or
design models for nonpoint source control practices.
Acknowledgments

We thank Vicki Anderson, Ruth Shaffer, and Brent Stin-
son of the Michigan USDA SCS and John Suppnick of
the Michigan Department of Natural Resources for their
assistance in working with the underlying databases in
the Sycamore Creek watershed. This project is sup-
ported by U.S. EPA Grant No. X-818240.
References

1. U.S. EPA. 1992. Extending the RCWP knowledge base to future
nonpoint source control projects. In: Proceedings of the National
RCWP Symposium. EPA/625/R-92/006. Washington, DC.

2. Young, R.A., C.A. Onstad, D.D. Bosch, and WP. Anderson. 1989.
AGNPS: A nonpoint-source pollution model for evaluating agri-
cultural watersheds. J. Soil and Water Conserv. 44(2):168-173.

3. USACERL. 1993. GRASS 4.1 users reference manual. Champaign,
IL: U.S. Army Corps of Engineers Construction Engineering Re-
search Laboratory.

4. USDA. 1993. Users' guide to water quality model/GRASS inter-
face. Fort Collins, CO: U.S. Department of Agriculture, Soil
Conservation Service, Technology Information Systems Division.

5. Barnwell, T.O., L.C. Brown, and W. Marek. 1989. Application of
expert systems technology in water quality modeling. Wat. Sci.
Tech. 21:1,045-1,056.

6. Ford, D.A., A.P. Kruzic, and R.L. Doneker. 1993. Using GLEAMS
to Evaluate the Agricultural Waste Application Rule-Based
Decision Support (AWARDS) computer program. Wat. Sci. Tech.
28(3-5):625-634.

7. Yakowitz, D.S., J.J. Stone, L.J. Lane, P. Heilman, J. Masterson,
J. Abolt, and B. Imam. 1993. A decision support system for evalu-
ating the effects of alternative farm management systems on
water and economics. Wat. Sci. Tech. 28(3-5):47-54.
181
-------
8. Hamlett, J.M., D.A. Miller, R.L. Day, G.A. Peterson, G.M. Baumer,
and J. Russo. 1992. Statewide CIS-based ranking of watersheds
for agricultural pollution prevention. J. Soil Water Cons.
47(3):399-404.

9. Srinivasan, R., and B.A. Engel. 1994. A spatial decision support
system for assessing agricultural nonpoint source pollution.
Water Resources Bull. 30(3):441-452.

10. Srinivasan, R., and J.G. Arnold. 1994. Integration of a basin-
scale water quality model with GIS. Water Resources Bull.
30(3):453-462.

11. Engel, B.A., R. Srinivasan, J. Arnold, and S.J. Brown. 1993.
Nonpoint source (NPS) pollution modeling using models inte-
grated with geographic information systems (GIS). Wat. Sci. Tech.
28(3-5):625-690.
12. He, C., J.F. Riggs, and Y. Kang. 1993. Integration of geographic
information systems and a computer model to evaluate impacts
of agricultural runoff on water quality. Water Resources Bull.
29(6):891-900.
13. Kiker, G.A., G.M. Cambell, and J. Zhang. 1992. CREAMS-WT
linked with GIS to simulate phosphorus loading. ASAE Paper
No. 92-9016. St. Joseph, Ml: American Society of Agricultural
Engineers.
14. Tim, U.S., S. Mostaghimi, and V.O. Shanholtz. 1992. Identification
of critical nonpoint pollution source areas using geographic infor-
mation systems and water quality modeling. Water Resources
Bull. 28:877-887.
182
-------
Integration of EPA Mainframe Graphics and GiS in a UNIX Workstation Environment
To Solve Environmental Problems
William B. Samuels
Science Applications International Corporation, McLean, Virginia

Phillip Taylor
Tetra Tech, Inc., Fairfax, Virginia

Paul Evenhouse
Martin Marietta, Inc., Research Triangle Park, North Carolina

Robert King
Assessment and Watershed Protection Division, U.S. Environmental Protection Agency,
Washington, DC
Abstract

The Assessment and Watershed Protection Division of
the Office of Wetlands, Oceans, and Watersheds has
developed water quality analysis software on the U.S.
Environmental Protection Agency (EPA) mainframe
computer. This software integrates national on-line en-
vironmental databases and produces maps, tables,
graphics, and reports that display information such as
water quality trends, discharge monitoring reports, per-
mit limits, and design flow analyses.

In the past, this graphic software was available only to
users connected to the mainframe with IBM graphics
terminals or PCs with graphics emulation software. Re-
cently, software has been developed that can be used
to: 1) access the EPA mainframe from a UNIX worksta-
tion via the Internet, 2) execute the Water Quality Analy-
sis System (WQAS) procedures, 3) display WQAS
graphics in an X-Window on the workstation, and 4)
download data in a geographic information system (GIS)
format from the mainframe. At the same time, this work-
station can execute ARC/INFO and ARC/VIEW applica-
tions in other X-Windows. This capability allows analysts
to have the power of GIS, the mainframe databases
(e.g., Permits Compliance System [PCS], STORET,
Reach File, Industrial Facilities Discharge File, Daily
Flow File, Toxic Chemical Release Inventory), and the
retrieval/analysis/display software (Environmental Data
Display Manager, Mapping and Data Display Manager,
Reach Pollutant Assessment [RPA], PCS-STORET In-
terface, UNIRAS) available to them on one desktop.
This capability extends the tool set available to GIS
analysts for environmental problem-solving.

This paper discusses application of these tools and
databases to several problems, including EPAs water-
shed-based approach to permitting, and the RPA, an
automated method to identify priority pollutants in water-
sheds.

Introduction

The purpose of this paper is to explore new geographic
information systems (GIS) data integration tools that are
applicable to a wide range of environmental problems,
including the U.S. Environmental Protection Agency's
(EPAs) watershed-based approach to permitting and
the Reach Pollutant Assessment (RPA), an automated
method to identify priority pollutants in watersheds. The
ultimate goal is to make these tools and databases
accessible to a wide range of users.

Understanding aquatic resource-based water quality
management depends on access to and integration of
diverse information from many sources. To date, the
techniques to perform this integration, and thus yield
meaningful analyses supporting environmental deci-
sion-making, are neither fully developed nor docu-
mented. New tools and information resources are now
available, but not used to their full potential, for more
valuable water quality and watershed analyses. EPA
183
-------
headquarters is responsible for ensuring that integrated
data management tools are available for water quality
analyses and data reporting as well as making national
data systems more useful. EPA will accomplish this by
upgrading and crosslinking systems, developing interac-
tive data retrieval and analysis mechanisms, and provid-
ing easy downloading of data to client workstations.

The Assessment and Watershed Protection Division
(AWPD) of the Office of Wetlands, Oceans, and Water-
sheds (OWOW) has developed water quality analysis
software on the EPA mainframe computer (1). This soft-
ware integrates national on-line environmental data-
bases and produces maps, tables, graphics, and reports
that display information such as water quality trends,
discharge monitoring reports, permit limits, and design
flow analyses. In the past, this graphic software was
available only to users connected to the mainframe with
IBM graphics terminals or PCs with graphics emulation
software. Recently, software has been developed that
can be used to:

• Access the EPA mainframe from a UNIX workstation
via the Internet.

• Execute the Water Quality Analysis System (WQAS)
procedures.

• Display WQAS graphics in an X-Window on the
workstation.

• Download data in a CIS format from the mainframe.

At the same time, this workstation can execute
ARC/INFO and ARC/VIEW applications in other X-Win-
dows. This capability allows analysts to have the power
of CIS, the mainframe databases (e.g., Permits Compli-
ance System [PCS], STORET, Reach File, Industrial
Facilities Discharge File, Daily Flow File, Toxic Chemical
Release Inventory), and the retrieval/analysis/display
software (Environmental Data Display Manager, Map-
ping and Data Display Manager, RPA, PCS-STORET
Interface, UNIRAS) available to them on one desktop.
This extends the tool set available to CIS analysts for
environmental problem-solving. This paper discusses
how these tools and databases have been applied to
two examples: 1) a watershed-based approach to per-
mitting and 2) the RPA, an automated procedure for
identifying watersheds with priority pollutants.

Mainframe Databases and Tools

The EPA IBM ES9000 mainframe computer, located in
Research Triangle Park, North Carolina, contains a
large volume of digital water quality and environmental
data available on-line through a number of data retrieval
and display tools (see Figure 1). Other documents de-
scribe these databases and tools in detail (2, 3).

This effort focused on showing how these databases
and tools can complement CIS activities. In some cases,
data can be directly downloaded to a workstation in CIS
format. An example of this is accessing EPAs Reach
File (Version 1) from GRIDS (4). In other cases, data-
bases are accessed by mainframe tools, the data are
processed, and a CIS data set is produced that can be
downloaded to a workstation. An example of this case
is the RPA (RPA3) tool that integrates data from the
Reach File, STORET, and PCS to identify priority pollut-
ants in watersheds (5).

The mainframe can be accessed through several paths:
Internet, PC dialup, or dedicated line into a terminal
controller (see Figure 2). In the applications presented
here, the Internet connectivity is emphasized because
this is the mechanism that makes these databases and
tools available to CIS analysts at their workstations.
Figure 3 shows the hardware and software require-
ments for Internet access to the water quality data inte-
gration tools. The basic components are a UNIX
workstation with an X-Window manager, the X3270 soft-
ware, and Internet connectivity. The X3270 software is
required to emulate an IBM 3270 full-screen terminal.
This software is publicly available through EPAs Na-
tional Computer Center User Support. In addition, an
account on the mainframe computer is required. Once
this account is established, an additional software mod-
ule, GDDMXD, is required to map IBM host-based
graphics to the workstation's X-Window. The GDDMXD
software resides on the mainframe and is loaded when
the user logs in. Once the hardware and software are
set up, a single UNIX workstation can provide access to
mainframe and workstation tools and databases on one
desktop (see Figure 4).

Applications

To illustrate how these mainframe and workstation
tools/databases can work together to solve environ-
mental problems, we present two applications. The first
shows a watershed-based approach to permitting; the
second describes the RPA.

Watershed-Based Approach to Permitting

The watershed approach is a process to synchronize
water quality monitoring, inspections, and permitting to
support water quality protection activities on a geo-
graphic basis. It is a coordinated and integrated method
to link science, permits, and other pollution control and
prevention activities to meet state water quality stand-
ards. Numerous local, state, and federal agencies have
recognized watershed approaches as the best way to
manage natural resources effectively and efficiently. Es-
tablishing a schedule for data collection, permit issu-
ance, and other elements of this approach affords the
opportunity to coordinate and integrate other natural
resource management efforts to make better use of
184
-------
NATIONAL DATABASES
1
Reach File FRDS
STORET USGS DLG
PCS LU/LC
TRI Soils
IFD Population
Gage Dun &
Drinks Brad street
Daily Flow ODES
BIOS
V J

^
' \
MAINFRAME TOOLS
EDDM GRIDS
MDDM DFLOW
RPA3 DESCON
IPS5 RF3MSTR
UNIMAP Bookmanager
Sitehelp Pathscan
v }

GIS
Figure 1. EPA mainframe databases/tools and linkage to GIS.
INTERNET
PC or UNIX workstation
with GIS software
DIALUP
DEDICATED LINE
EPA MAINFRAME
Databases
Tools
Figure 2. Access to the EPA mainframe databases and tools.
185
-------
LOCAL WORKSTATION

HARDWARE
UNIX workstation (with X-Windows) on Internet (i.e., DG Aviion)

SOFTWARE
X3270 Software - creates an X-Window, which emulates an IBM 3270
full-screen session (provided by EPA)

EPA MAINFRAME ACCOUNT
GDDMXD software provided on the mainframe to map GDDM
graphics into X-Windows
Figure 3. Hardware and software requirements for accessing water quality data integration tools via INTERNET.
EPA MAINFRAME
WINDOW- X3270
ARC/INFO
WINDOW
ARC/VIEW
WINDOW
DATA DOWN-
LOADING (FTP)
Figure 4. A UNIX workstation environment provides access to mainframe and workstation tools and databases on one desktop.
186
-------
limited local, state, and federal financial and human
resources (6).

This application illustrates how the watershed approach
used CIS and EPA mainframe databases and tools. As
an example, a four-step approach (see Figure 5) has
been developed and applied to an impaired watershed
(Saluda River basin) in South Carolina. Steps one and
two identified watersheds of concern through their
nonattainment of designated uses (see Figure 6) and
highlighted the cause of nonattainment, in this case
pathogens (see Figure 7). The data sets used were U.S.
Geological Survey (USGS) hydrologic unit boundaries,
Soil Conservation Service (SCS) watershed bounda-
ries, and data from the EPA waterbody system, which
were indexed to the SCS watersheds.1 These data sets
were integrated into an ARC/INFO arc macro language
(AMI) to allow users to pose queries and prioritize
watersheds for further investigation.

Once priorities were set, the third step was to evaluate,
in detail, the sources and causes of nonattainment. The
Saluda River basin, which had pathogens as its cause
of nonattainment, was selected for further analyses. In
this step, the mainframe tools supplement the worksta-
1 Clifford, J. 1994. Personal communication with Jack Clifford, U.S.
EPA, Washington, DC.
tion CIS capabilities illustrated so far. A STORET re-
trieval was performed for ambient water quality stations
monitoring for fecal conforms. The STORET stations
were partitioned into three categories (low, medium, and
high) according to the state fecal coliform standard (7),
which reads:

not to exceed a geometric mean of 200/100 ml,
based on five consecutive samples during any 30
day period; nor shall more than 10% of the total
samples during any 30 day period exceed 400/100 ml

The categories in Figure 8 correspond to the standard
as follows:

low: < 200/100 milliliters

200/100 milliliters < medium < 400/100 milliliters

high: > 400/100 milliliters

Figure 8 illustrates the use of ARC/VIEWto visualize the
location of fecal coliform "hot spots" in the Saluda River
basin. Figure 9 focuses on one SCS watershed
(03050109-040) where pathogens cause nonattain-
ment. The locations of industrial and municipal dis-
chargers are plotted, and facilities with fecal coliform
limits and their respective permit expiration dates (cap-
tured from the PCS) are shown in a table included with
the figure. The CIS capabilities used to generate Figure 9
1. IDENTIFY WATERSHEDS OF CONCERN
- Nonattained waters
2. PRIORITIZE WATERSHEDS FOR PERMITTING
• Toxic vs. Nonconventiona! vs. Conventional
- Point source vs. Nonpoint source
- Existing/Designated use(s)
- Environmental equity; populations; endangered/sensitive species; etc.
3. EVALUATE SOURCES AND CAUSES
- Ambient water quality
- Location of point and nonpoint source discharges
- Existing controls
4. DEVELOP CONTROLS
- TMDL/WLAs
- NPDES permit limits/controls
• Synchronize issuance

Figure 5. Four steps illustrating an example approach to permitting on a watershed basis.
187
-------
Step 1: Identify Watersheds of Concern
n 305b Clean Water Act Support

SsJect Use Category
Overall Use flgua^j-Ht) I Swimming j^CTndafyjCgntactgec
Select SiiBpert Category
(fptscfting I Partially Sue&ortlng • Threatened I NotSupssrtlni
^JiBiBct Support Heasiire

FFerceni 5uuport Number of Miles [
aier Quality stations; o
tosic Releass IfiMSMory: 0
Figure 6. Identification of watersheds of concern—watersheds where at least 10 percent of the reaches are not fully supporting
overall designated use (data and ARC/INFO AMLs provided by Jack Clifford).

Step 2: Evaluate Priorities
Figure 7. Watersheds where the cause of nonattainment is pathogens (data and ARC/INFO AMLs provided by Jack Clifford).3

2 See note 1.
3 See note 1.
188
-------
Step 3: Evaluate Ambient Water Quality—Low, Medium, and High Levels of Fecal Coliforms
Figure 8. Map of the Saluda River basin showing the location of STORET monitoring stations and fecal coliform levels.

Step 3: Existing Controls—Dischargers That Have Limits for Fecal Coliforms
ij Dl rhargm In SCS bs in 41

S^M ,BD OlBljt, OtralltV)
VI Dischargers InSCS t>
1 *•_ ^_
y tsea* coiiKusa levels
: I R^th File t -loath ^
_j DnrtUng Waiar Syppt* ^

*
3 Water Quaffly S
! Toxic Chewe.il selaasa iR
'/* . " •*
Vf fiaOSOl 03 - SSLODftl
JAv^H'
Figure 9. Focus on SCS watershed 03050109-040 (shaded in gray). Locations of STORET stations, municipal and industrial dis-
chargers are also shown. The table in the upper right highlights dischargers with fecal coliform limits and their respective
permit expiration dates.
189
-------
show that permit issuance is not synchronous, which is
a key element in the watershed approach.

Within this local workstation CIS environment, the attrib-
utes associated with the STORE! monitoring stations
and the PCS dischargers are limited with respect to the
large amount of time series sampling data that exists in
these databases. Figures 10 through 12 illustrate how
an EPA mainframe procedure, the Environmental Data
Display Manager (EDDM), can be accessed from an
X-Window on the CIS workstation to query the entire
STORET and PCS databases and thus provide additional
data analysis and display capabilities to the CIS work-
station. In Figures 10 and 11, a water quality inventory was
performed for a STORET station, and a time series plot
of fecal coliform levels is displayed. In Figure 12, the
limits and discharge monitoring report (DMR) data were
accessed from PCS for a sewage treatment plant. Ex-
cursions beyond the PCS limits for fecal conforms are
easily visualized in the plot.
The fourth and final step in the watershed approach was
to develop controls for achieving water quality stand-
ards. This might include the development of total maxi-
mum daily loads (TMDL), waste load allocations (WLA),
and the synchronization of permit issuance. Another
mainframe tool, the PCS-STORET INTERFACE (re-
ferred to as IPS5 on the mainframe), can be used to
access and compute design flows for TMDL develop-
ment (see Figure 13) and to find all facilities discharging
to a particular reach (see Figure 14), an initial step in the
synchronization of permit issuance.

RPA

The RPA is a procedure on the EPA mainframe that
automates identification of reaches where priority pollut-
ants have been detected. It can be run for a user-
selected state or USGS hydrologic unit.

Section 304(1) of the Clean Water Act (CWA) identifies
water bodies impaired by the presence of toxic substances,
Step 3: Using EDDM To Evaluate Ambient Upstream Fecal Coliform Levels—Regulation
of Upstream Dischargers Is Necessary
ENVIRONMENTAL RISK ASSESSMENT-BRflNDYWlNE RIVER
ENVIRONMENTAL DATA DISPLAY MANAGER (EDDM)
Figure 10. Using EDDM to perform a water quality inventory for STORET station 21SC06WQ S-084 in the Saluda River basin, South
Carolina.
190
-------
Step 3: Using EDDM To Visualize Trends in Ambient Coliform Levels Upstream
of a Discharger
WATERSHED BASED APPROACH TO PERMITTING
SALUDA RIVER WATERSHED -ARCVIEW
HAPPING AND DATA DISPLAY MANAGER (MDDM)
storet_ulm-id
statioa
frequency
mean-long
mean-lat
mean-yalue
ENVIRONMENTAL DATA DISPLAY MANAGER (EDDM)
PCS -STORET INTERFACE
REACH POLLUTANT ASSESSMENT
TMDL DEVELOPMENT - CLARK FORK RIVER
ENVIRONMENTAL RISK ASSESSMENT-BRANDVW1NE SHIS
ENVIRONMENTAL DATA DISPLAY MANAGER (EODM)
STORET RETRIEVAL S-OB4
TIME SERIES - FECAL COLIFORM
PCS FACIL TV LEVELDATA SUMMARY
PCS PIPE LEVELDATA SUMMARY
PLOT OF LIMITS/DMR
Figure 11. From the EDDM water quality inventory table, the fecal coliform parameter (31616) was selected for a time series plot.
identifying point source dischargers of these substances
and developing individual control strategies for these
dischargers. To meet these requirements, the EPA Of-
fice of Wetlands, Oceans, and Watersheds prepared
guidance identifying criteria to be used in reviewing
state reports.

The RPA was designed to address the requirements
under criterion 7 of Section 304(1): identification of state
waters with likely presence of priority toxic pollutants.
This assessment was accomplished by identifying and
summarizing reaches with point source dischargers of
priority pollutants and water quality stations with priority
pollutant data.

Information on the state's waters is summarized using
the USGS hydrologic unit naming convention and the
Reach Structure File (Version 1). Numerous databases
were accessed and analyzed, including the Reach
Structure and Reach Trace File (Version 1), industrial
facilities discharge (IFD) file, STORET parameter file,
PCS, and the STORET water quality file. The IFD file
and PCS provided the facility information. Comparing
information from both data sources identified active fa-
cilities and generated a complete list of facilities by their
assigned reach numbers. Water quality data from
STORET were summarized on reaches with priority pol-
lutant monitoring data. Stations were retrieved with the
following restrictions:

• Stations located within the state or hydrologic unit of
interest.

• Ambient monitoring stations located on streams,
lakes, or estuaries.

• Stations sampled for at least one priority pollutant in
either water, sediment, or fish tissue on or after January
1, 1982.
191
-------
Step 3: Using EDDM To Evaluate Existing Controls—PCS Limits and DMR
PFJ-ENO

PF4-DI1R

PF5-STDRET

PFb-flPERTURE

PF7-UMI TS
•••
PF8-FLUU
ENVIRONMENTAL DATA DISPLAY MANAGER (EDDM
PCS FACILITY LEVEL DATA SUMMSRY

PC5_PIPEJlEWEL_DA™_SUMMfl_RV _
PLOT OF LIMITS/DMR
Figure 12.
Using EDDM, PCS data is accessed for discharger SC0023906, Piedmont sewage treatment plant. The windows show
plant location (upper left), facility and pipe summary data (upper right) and time series plot (lower left).
Recently, the RPA program was modified to output files
compatible with CIS. An example of this is shown in
Figure 15. The data in the table portion of this figure are
written to two files as follows:

• For each reach in the hydrologic unit, the geographic
coordinates are written to a file in ARC/INFO GEN-
ERATE format.

• The attributes associated with each reach (e.g.,
name, length, number of water quality stations) are
written to a delimited ASCII file.

A third file is also automatically generated. This file is an
AMI that GENERATES the line coverage of reaches,
defines and populates the INFO table of attributes, then
joins the attributes in the INFO table to the line coverage
of reaches. Once these three files are created, they are
downloaded to the CIS workstation (via ftp) and proc-
essed by ARC/INFO. In ARC/VIEW, the user can identify
a reach and determine:

• The number of water quality stations with priority pol-
lutant monitoring data.

• The number and type of industrial facilities with pri-
ority pollutant discharge.

• The number of publicly owned treatment works
(POTW) with and without indirect dischargers (see
Figure 15).

In addition to this reach summary data, other tables
(cross-linked to reaches, water quality stations, and dis-
chargers) are produced that summarize the data by pollut-
ant (see Figure 16). In this figure, each pollutant is cross-
linked to the reach where it was detected, and the source
of detection is also identified (i.e., water column, sediment,
fish tissue, NPDES permit limit, Form 2(c) submittal) or
192
-------
Step 4: Using the PCS/STORET INTERFACE for NPDES Permit Development—Analysis of
Receiving Stream Flow Data

Figure 13. Using the PCS-STORET INTERFACE to access and compute design flows for a specific reach.
Step 4: Using the PCS/STORET INTERFACE To Determine Other Facilities on a Reach for
Wasteload Allocation Purposes
Figure 14. Using the PCS-STORET INTERFACE to list all facilities that discharge to a specific reach.
193
-------
predicted to be in the discharger's effluent based on the
standard industrial classification (SIC) code. More de-
tailed information is also generated. For example, Fig-
ure 17 shows a detailed report for priority pollutants
detected in the water column (similar reports are gener-
ated for sediment and fish tissue). Each pollutant is
cross-linked to a reach and the specific monitoring sta-
tion where it was detected. Basic summary statistics are
also presented.

Figure 18 shows a detailed report for pollutants detected
in the NPDES permit limit. In this figure, each pollutant
is cross-linked to a reach and the specific NPDES dis-
charger containing a permit limit. In addition, each dis-
charger is identified as a major, minor, or POTW.

Figures 15 through 17 show the RPA output in the
foreground and coverages displayed by ARC/VIEW in
the background. Inspection of these figures shows that
the Bush River is a priority pollutant reach containing
seven industrial facilities, four of which discharge priority
pollutants (one pulp and paper mill, three textile facto-
ries). Further examination of the data shows that cad-
mium has been detected in the water column and is also
contained in the NPDES permit limit. It is also predicted
to be in the discharge effluent based on SIC code. In the
water column, cadmium was measured at 10 u,g/L at two
stations sampled in 1988.
Finally, there is a limit for cadmium in NPDES facility
SC0024490 (Newberry plant), a POTW on the Bush
River. The RPA output, linked to CIS, can be used as a
screening and targeting tool for identifying specific
reaches within watersheds where toxic priority pollut-
ants cause water quality degradation.

Summary and Conclusions

The proliferation of CIS workstations, the expansion of
the Internet, and the development of X-Window-based
graphics emulation software (X3270 and GDDMXD) has
afforded analysts the opportunity to use the powerful
analytical capabilities of CIS and the EPA mainframe
databases and tools together on one desktop. Thus, a
user performing a CIS watershed analysis can also
have immediate and complete access to national on-line
databases such as STORET and PCS by opening up a
"window" to the EPA mainframe. This allows detailed
queries to be performed that supplement the data al-
ready being analyzed at the local workstation. This ca-
pability allows users to easily visualize additional data
without having to spend effort in retrieval, downloading,
transforming, and reformatting to make it useful. By
enhancing existing mainframe programs to create out-
put in CIS format, the time spent importing data to the
CIS is reduced and more time can be spent on analysis.
An example of this capability is the RPA program.
Reach Pollutant Assessment: Priority Pollutant Detection in the Bush River
•: ] Dlgch&rgan! :n |p| basin dQ '*';':,: tr ^j
"J:* : : , «-if?::*;: '-' :.
,
\ TOMC Ctercicai F
ii i;,*li»
D BASED APPROACH TO PERMITTING
£ 2. SsJfWSV OF SUSERiCAllif SC=RHiS RTACKCS WTh PRICS11V FCUUTSfjT S*3JfKES

ftlilf WflltH rjUflUT" S'lfillOHS IN CATALOGim; UM1 JUMICa
S^q E -'t-gtil

Cat. Unit No. Miles PEKCH Han

CO C2) (3) C-0
001 7.' EflLADfl R
W2 "1,1 SflLABA h

6fN &8.S L «'8KAV

nns 51.s L MURR^v

013 -4.-S UWIM OH

Ot^ B.S WlftOA R

015 1Q.B irtLWm R

SIS t?.5 tf* n
7 1 PLASTIC MOLBIHC &

» V?£A« LLECTSIC

1 (IHtWlC, (.HtFKftLi
Figure 15. Using the RPA procedure to identify specific reaches with priority pollutants.
194
-------
Reach Pollutant Assessment: Cadmium in the Bush River
Figure 16. RPA summary report by pollutant.
Reach Pollutant Assessment: Bush River: Cadmium in the Water Column
emtftool -/liin/csh
POLLUTANT PARSM RFWH RFftCH HAMF
MflME CODE NUMBES
RPA5 DETAIL REPORT CO S
POLLUWTS DETtCTED II4
STOrtFT S'ATTOH TD WO. OF
103? 3Q5S50M53 ftit&'i S ZfSCfiflWj S-fiS8
1G2? 3050109030 LITTLE H 21SCSQWQ S-C34
1027 3D501D90S2 COROHACft
102? SQSQIMC1^ 5HAPA fi
102? ^osoiiBcie sysiit B
ibroii 321DS 3D50109059 5kl.MA P
ifcros 32-irtS SegO'lGSPS,! S'SDV R
ft tot 1Q34 3D501D9C18 3JSH R
H tDt 1034 3050109053 riCOOV B
lO.UUU
1G-OM
10. DUD
10, Mfl
in.nno
10.COD
1G.C03
Figure 17. RPA detail report: pollutants detected in the water column.
195
-------
Reach Pollutant Assessment: Permitted Industries for Cadmium—Bush River
cmdtool -/iln/csh
R0AS DtTAIl REPORT [4) 8V POLLLTAHT

POLLUTANTS INCLUDED T> NPOE3 PERMIT LIMIT
POLLUTANT MME
PflRAH REACH

COPE &UM8£ft
Acenaphthylene
3 FIBERS/COLUMBIA PtftXT HAIOB 5D1

&COUOJSi? ftLiltl) UttRS/L'OUJMlil^ HflHT NftJQR 001

btD303191 HiUIKt!^ S CQ/SmE'J MJLI. NflJOP U01

SCOOfi3^7 £UI£D fie£RS/COiU»9STft PiftUT i^J*< Qyi

SCQOC3557 PLLIED FIEER5/COLUMBIA PLANT MftJtiR DQ1

SCOOD3557 fttLIEtt rZBTRS/COlUieXA PIWT WJ08 KM

SC0003557 fttllEB FIBEK/COlUnBIft PLANT MAJOR 301

SCOOD355? ALLIED FIBERS/COLUMBIA PLANT KUOR D01

5COD^43S NS'iWE&m'/PBOPGSED PifeNT HT^OR POTW 001

sroc2oro2 LAU^EHS MATOR POTW im
REfiCh HftP - PRIOBiTV POUUTffNTS J
Figure 18. RPA detail report: pollutants included in the NPDES permit limit.
Two examples were presented as illustrations of how
CIS and the mainframe databases and tools can work
together.

In the first example, EPA's watershed-based approach
to permitting, a four-step approach, was outlined, show-
ing how a combination of local CIS functions and remote
mainframe databases and tools were used in each step
of the process. The end result was the targeting and
prioritizing of watersheds of concern, and a detailed look
at where and why water quality standards were not
being met.

In the second example, the RPA program along with CIS
was used to identify and map toxic priority pollutants and
cross-link them to reaches, media (water column, sedi-
ment, fish tissue), NPDES dischargers, and monitoring
stations. This analysis focused on what the toxic pollut-
ant problems are and where they occur.

Acknowledgments

The authors wish to thank Jack Clifford, AWPD, OWOW,
EPA, for providing data and ARC/INFO AMLs that were
used in the South Carolina watershed example. The
authors also extend their appreciation to Duane More
and Tom Lewis (both with Martin Marietta), who provided
support in setting up the X3270 and GDDMXD software.

References

1. Taylor, P.L., P. Evenhouse, L. Hoelman, T. DeWald, W.B. Samuels,
and O. Hansen. 1988. STORET—Water quality analysis system.
Presented at the State/EPA Data Management Conference,
Raleigh, NC (June).

2. U.S. EPA. 1992. Office of Water environmental and program
information systems compendium, information resources manage-
ment: Tools for making water program decisions. EPA/800/B-
92/001. April.

3. Samuels, W.B., PL. Taylor, P.B. Evenhouse, T.R. Bondelid, PC.
Eggers, and S.A. Hanson. 1991. The environmental display man-
ager: A tool for water quality data integration. Water Resources
Bull. 27(6):939-956.

4. U.S. EPA. 1993. Geographic Resources Information and Data Sys-
tem (GRIDS): User guide. Office of Information Resources Man-
agement, National CIS Program.

5. Samuels, W.B. 1990. Reach pollutant assessment user's guide.
McLean, VA: Science Applications International Corporation.

6. Washington State Department of Ecology (WDEC). 1993. Water-
shed approach to water quality management. F-WQ-93-029.
October.

7. State of South Carolina. 1992. South Carolina code of regulations,
Chapter 61, Regulation 68 and 69. Amended April 24, 1992.
196
-------
Wetlands Applications
-------
Wetlands Mapping and Assessment in Coastal North Carolina:
A CIS-Based Approach
Lori Sutter and James Wuenscher
Division of Coastal Management, State of North Carolina Department of Environment, Health,
and Natural Resources, Raleigh, North Carolina
Introduction

The coastal area of North Carolina covers 20 counties
and over 9,000 square miles of land area, about 20
percent of the state (see Figure 1). It also includes over
87 percent of the state's surface water. The North Caro-
lina Coastal Management Program (NC CMP) is re-
sponsible for managing this area to meet the goals set
forth in the Coastal Area Management Act (CAMA)
(North Carolina General Statute [NCGS] 113A, Article
7). These goals provide a broad mandate to protect the
overall environmental quality of the coastal area and to
guide growth and development in a manner "consistent
with the capability of the land and water for develop-
ment, use, or preservation based on ecological consid-
erations" (NCGS 113A-102(b)(2)).
Figure 1. HU and county boundaries in the North Carolina
coastal area.
Much of the North Carolina coastal area consists of
wetlands, which, in many areas, constitute nearly 50
percent of the landscape. These wetlands are of great
ecological importance, in part because they occupy so
much of the area and are significant components of
virtually all coastal ecosystems, and in part because of
their relationships to coastal water quality, estuarine
productivity, wildlife habitat, and the overall character of
the coastal area.

Historically, close to 50 percent of the original wetlands
of the coastal area have been drained and converted to
other land uses (1-3). Although agricultural conversion,
the largest historical contributor to wetlands loss, has
largely stopped, wetlands continue to disappear as they
are drained or filled for development. Conflicts between
economic development and wetlands protection con-
tinue to be a major concern, with many coastal commu-
nities considering wetlands protection to be a major
barrier to needed economic development.

Because wetlands are such a dominant part of the
coastal landscape and are vitally important to many
aspects of the area's ecology, their management and
protection is a major concern of the NC CMP. The State
Dredge and Fill Act (NCGS 113-229) and the CAMA
regulatory program stringently protect tidal wetlands, or
"coastal wetlands" as law and administrative rules call
them. Coastal wetlands are designated areas of environ-
mental concern (AECs), with the management objective
"to give highest priority to the protection and manage-
ment of coastal wetlands so as to safeguard and per-
petuate their biological, social, economic and aesthetic
values; and to coordinate and establish a management
system capable of conserving and utilizing coastal wet-
lands as a natural resource essential to the functioning
of the entire estuarine system" (15A NCAC 7H .0205).

North Carolina law does not, however, specifically pro-
tect nontidal freshwater wetlands. State protection of
199
-------
freshwater wetlands is limited to the regulatory authority
provided under federal laws for state agency review of
federal permits; in this case, §404 permits granted by
the U.S. Army Corps of Engineers. Under §401 of the
Federal Water Pollution Control Act (33 USC 1341), a
Water Quality Certification from the North Carolina Divi-
sion of Environmental Management (DEM) is required
for a 404 permit to discharge fill material into wetlands.
Section 307 of the federal Coastal Zone Management
Act (CZMA - 16 USC 1451 et seg.) also requires that
404 permits be consistent with the enforceable rules and
policies of the NC CMP. The standards for consistency
are the use standards for AECs and wetlands policies
stated in the applicable local land use plan. Other than
AECs, the NC CMP has no consistent policies regarding
wetlands. A few local land use plans include policies to
protect freshwater wetlands, but most do not.

Wetlands Conservation Plan

In 1991, the CZMA §309 Assessment of the NC CMP
revealed NC CMP's weakness in protecting nontidal
wetlands (4). The assessment demonstrated that both
opponents and proponents of wetlands protection con-
sidered the current system inadequate. Economic de-
velopment interests found the 404 regulatory program
to be unpredictable and inconsistent, often resulting in
the loss of needed economic growth in coastal counties.
Environmental interests felt that the program allowed
the continued loss of ecologically important wetlands. As
a result, the assessment identified wetlands manage-
ment and protection as one of the primary program
areas in need of enhancement.

The North Carolina Division of Coastal Management
(DCM) developed a 5-year strategy (5) for improving
wetlands protection and management in the coastal
area using funds provided under the Coastal Zone En-
hancement Grants Program established by 1990
amendments to §309 of the federal CZMA. The Office
of Ocean and Coastal Resources Management (OCRM)
in the National Oceanographic and Atmospheric Admini-
stration (NOAA), U.S. Department of Commerce admin-
isters the §309 program. Funds provided under this
program, particularly Project of Special Merit awards for
fiscal years 1992 and 1993, supported the work reported
in this paper. A grant from the U.S. Environmental Pro-
tection Agency (EPA) for a Wetlands Advance Identifica-
tion (ADID) project in Carteret County, North Carolina,
also funded this work.

The key element of DCM's strategy for improving wet-
lands protection is the development of a wetlands con-
servation plan for the North Carolina coastal area. The
plan has several components:

• Wetlands mapping inventory

• Functional assessment of wetlands
• Wetlands restoration

• Coordination with wetlands regulatory agencies

• Coastal area wetlands policies

• Local land use planning

The obvious first step in developing a wetlands conser-
vation plan is to describe the wetlands resource. An
extensive geographic information system (CIS) wet-
lands mapping program is helping to accomplish this
first step by producing a CIS coverage of wetlands by
wetland type for the entire coastal area. The CIS cover-
age allows generation of paper maps for areas within
any boundaries available in CIS format. This is the
subject of the first part of this report.

One weakness of the 404 program is that, for individual
permits, it attempts to apply the same rules and proce-
dures equally to all wetlands, regardless of the wetland
type and location in the landscape. This approach can
result in permits being granted for fill of wetlands of high
ecological significance or permits being denied to pro-
tect wetlands of little significance. Neither outcome is
desirable because the result may be the loss of either
vital wetland functions or beneficial economic activity.
This is an unsatisfactory way to manage wetland re-
sources in an area such as the North Carolina coast,
where:

• A high proportion of the land is wetlands.

• Many of the wetlands are vital to the area's environ-
mental quality.

• Economic stimulation is sorely needed.

To help overcome this weakness in the current wetland
regulatory framework, the Wetlands Conservation Plan
includes an assessment of the ecological significance of
all wetlands to determine which are the most important
in maintaining the environmental integrity of the area.
This will result in a designation of each wetland polygon
in the CIS coverage as being of high, medium, or low
functional significance in the watershed in which it ex-
ists. The procedure by which this occurs is the subject
of the second part of this report.

The remaining components of the Wetlands Conserva-
tion Plan comprise the means by which the results of the
wetland mapping and functional assessment steps will
be used to improve wetland protection and management.
Close coordination with other state and federal agencies
involved in wetlands protection and management has
been an important component of the entire effort.
Agency representatives have been involved in develop-
ing the methods used, and the agencies will receive
copies of the resulting maps for use in their own plan-
ning and decision-making. Policies for protection of wet-
lands of varying functional significance will be proposed
to the Coastal Resources Commission to serve as the
200
-------
basis for consistency review of 404 permit applications.
Wetland maps and functional assessment results will
also be provided to local governments for use in local
land use planning, and DCM will work with local govern-
ments to increase local involvement in the wetlands
regulatory structure.

While the wetland maps themselves are useful for land
use planning and helping to find suitable development
sites, simply knowing where the wetlands are located is
insufficient information for many purposes. Any area for
which a 404 permit application is in process has been
officially delineated as a wetland by the Corps of Engi-
neers. The value of wetland maps to the regulatory
review agencies at this stage is limited to determining
the relationship of the site to other wetlands in the area.
While, ideally, all wetlands should be avoided in plan-
ning development, avoiding wetlands completely in the
coastal area is difficult, and avoiding all wetlands in any
extensive development is virtually impossible.

The results of the functional assessment will provide
additional information about the ecological significance
of wetlands. This information will be valuable to wetland
regulatory review agencies in determining the impor-
tance to an area's environmental integrity of protecting
a particular site for which a permit to fill has been
requested. It will also enable development projects to be
planned so as to avoid, at all reasonable costs, the most
ecologically important wetlands. An accurate functional
assessment of wetland significance, then, is the most
valuable component of the Wetlands Conservation Plan.

Wetlands Mapping Inventory

An important, initial step in developing a comprehensive
plan for wetlands protection is to understand the extent
and location of wetlands in the coastal area. When
developing mapping methods, DCM quickly realized
that the more than 9,000-square-mile coastal area was
too large for any mapping effort in the field (see Figure
1). To complete this task in an accelerated timeframe,
DCM needed to use existing data compatible with CIS.
Reviewing the existing data revealed that most are not
applicable for one of two reasons: (1) available wetlands
data are based on older photography, and (2) more
recent data are not classified with the intent of wetlands
mapping. These data types, used independently, are
inappropriate for use in a coastal area wetlands conser-
vation plan. In addition, the classification schemes used
in the existing methods are too complex or not focused
on wetlands.

While several data sets were believed to be inappropri-
ate if used exclusively for wetlands mapping in coastal
North Carolina, each contained useful components.
DCM elected to combine three primary layers of data
and extract the most pertinent information from each
layer. DCM selected the National Wetlands Inventory
(NWI) because its primary purpose is to map wetlands.
Unfortunately, these maps were based on photography
from the early 1980s in coastal North Carolina, and
many changes have occurred in the landscape since
that time. NWI also omitted some managed wet pine
areas from its maps; DCM wished to include these areas
because they are important to the ecology of the North
Carolina coastal area. DCM also selected detailed soils
lines for use in its mapping efforts. While soils alone
should not be used to identify wetlands, soils can be
very useful in identifying marginal areas. Finally, DCM
also employed thematic mapper (TM) satellite imagery
in its methods. This data layer was not developed as a
wetlands inventory; however, the imagery is more recent
than the soils and NWIs. DCM desired to incorporate the
benefits of each of these data sources into its mapping
techniques.

The information provided by this mapping exercise will
be useful to county and municipal planners in helping
guide growth away from environmentally sensitive ar-
eas. For this reason, DCM elected to pursue mapping
on a county by county basis. In addition, a single county
allowed DCM to focus methodology development to a
limited geographic area to refine its methods. Carteret
County was selected as a methods development labo-
ratory because data were available for the area and
because Carteret has a large number of representative
wetlands.

Data Descriptions

The U.S. Fish & Wildlife Service produces the NWI for
all wetlands in the country. For the coastal North Caro-
lina area, these vector data were developed from
1:58,000-scale color infrared photography taken during
the winters of 1981, 1982, and 1983. Photointerpreters
delineated wetland polygons on clear stabilene mylar
taped over the photographs. After an initial scan of the
photographs to identify questions or problem signatures,
the photointerpreters reviewed areas in the field. They
performed approximately one-half to one full day of field
verification per quadrangle (quad) (6). Features were
compared with U.S. Geological Survey (USGS) topo-
graphic maps for consistency. Following completion of
the 'draft' paper maps, the Regional Coordinator re-
viewed the data. After approval as a final map, each
quad was digitized. Initially, the North Carolina Center
for Geographic Information and Analysis (CGIA) digit-
ized the coastal North Carolina NWI maps, and later, the
NWI Headquarters in St. Petersburg, Florida, who sub-
contracted the task, digitized them. Digital maps were
obtained initially from 1/4-inch tape transfer and later
from direct access to NWI via the Internet.

CGIA provided digital, detailed soil lines, which also are
vector data based on 1:24,000 quads. County soil sci-
entists delineated soil boundaries on aerial photographs
201
-------
based on slope, topography, vegetative cover, and other
characteristics. This process occurs in any soil survey.
After appropriate personnel approved the lines, a quali-
fied soil scientist recompiled them onto orthophoto
quads. CGIA scanned or manually digitized these lines.
The coverage incorporated databases describing soil
characteristics, which were then released for use.

The Landsat Thematic Mapper (TM) imagery was clas-
sified as part of the Albemarle-Pamlico Estuarine Study
(APES). To provide complete coverage for the southern-
most region of DCM's jurisdiction (Onslow, Render,
Brunswick, and New Hanover Counties), DCM con-
tracted with CGIA and the North Carolina State Univer-
sity (NCSU) Computer Graphics Center to have that
area processed identically to the APES region. These
data provide a raster-based coverage of approximately
30-meter pixel resolution. Some of the imagery was
taken at high tide, which precludes some near-water
wetlands from appearing in certain areas. Using
ERDAS, imagery specialists grouped similar spec-
tral signatures into one of 20 classes. DCM used these
data in two formats: filtered and unfiltered. The unfiltered
information was vectorized with the ARC/INFO GRID-
POLY command. To remove some of the background
noise in the coverage, it was filtered using ERDAS 'scan'
with a Majority filter of 5 by 5 pixels, then vectorized with
the ARC/INFO GRIDPOLY command.

Methods

Within each county, mapping is based on 1:24,000
USGS quads. After completion, each quad is assembled
into a countywide coverage, which eventually is assem-
bled into a coastal area coverage. The initial step in the
mapping process is to ensure completion of the base
layers described previously. Reviewing for errors at
early stages prevents confusion in correction later in the
process; therefore, the importance of the preliminary
techniques cannot be overemphasized. The NWI data
are first inspected to ensure complete coverage. If parts
of the quad are missing, the error is investigated and
corrected. Omissions may be areas of severe cloud
cover on the photography or areas neglected during the
digitization process. Next, the coverage is reviewed for
missing label points. Any omissions are corrected based
on the finalized version of the published NWI paper map.
Appropriate NWI staff are contacted for the necessary
information. At this time, labels are verified for typo-
graphical misentry. If not corrected, these errors could
lead to confusion later in the mapping process.

Once the label errors are detected and corrected, the
polygons are reviewed for completion. Verifying every
line in the areas of coastal North Carolina densely
populated with wetlands is impossible, but the lines are
reviewed for completeness. NWI staff again must pro-
vide necessary information for any omissions. When the
map is approved, technicians ensure projection of the
quad to the State Plane Coordinate System. If this has
not been completed, the ARC/INFO PROJECT com-
mand is employed.

The soils information is prepared in a similar manner to
the NWIs, with questions being directed to the county
soil scientist. Prior to the steps described previously,
soils must be verified for completeness. Because soils
are mapped by county boundaries and DCM maps by
quad, some files must be joined in quads that intersect
county boundaries. At this time, the quad must be
checked for differing abbreviations between counties.
Discrepancies are handled on a case-by-case basis.
When an abbreviation describes different soils in differ-
ent counties, a temporary abbreviation is created for one
of the counties. If a single soil is described by two
abbreviations across counties, both abbreviations are
incorporated into the classification scheme.

The Landsat data do not require additional verification.
Review of this layer is often helpful, however, to ensure
that the geographic boundaries match. Cases where
landforms do not appear to match require investigation
of the discrepancies. If the area is mis registered, this
layer might be omitted from the analyses. To date, no
area has been mapped without this imagery.

The hydrogeomorphology of a wetland is unique in de-
fining the wetland's function (7). Because these maps
serve as the base for additional wetland projects (as
described later in this report), an accurate determination
of this characteristic is essential. Prior to the overlay
procedure, technicians add a new item, hydrogeomor-
phic (HGM), to the NWI coverage. Because DCM con-
siders both vegetation and landscape position in its
classification (discussed later), riverine, headwater, and
depressional wetland polygons are assigned an HGM of
'r,' 'h,' or 'd,' respectively. The digital line graphs (DLGs)
of hydrography are essential in this step of the procedure.

All wetlands that are adjacent to streams or rivers are
considered in the riverine HGM class and are desig-
nated as riverine polygons. This class should include all
bottomland hardwood swamps and some swamp for-
ests. It rarely includes any of the interfluvial wetland
types. If it does, it is a small section of a large interfluvial
flatwood from which a small stream emerges. Only the
polygons adjacent to the stream are considered riverine.
Headwaters are defined as linear areas adjacent to
riverine areas that do not have a stream designated on
the hydrography data layer. Because these unique sys-
tems form the transition between flatwoods and riverine
wetlands, they are treated specially. Finally, polygons
that exist on interfluvial divides are designated as flats
or depressional wetlands. This class should not include
any wetlands along streams.
202
-------
The complete data coverages are overlaid to create a
new, integrated coverage that often approaches 100,000
polygons. Each polygon has many characteristics assigned
to it, including the Cowardin classification assigned by
the NWI, the soil series provided by the detailed soil
lines, the unfiltered land use/land cover code, the filtered
land use/land cover code from the Landsat TM imagery,
and the HGM classification assigned in the previous
step.

Based on these characteristics, each polygon is as-
signed to one of DCM's classes through an automated
ARC/INFO model using an arc macro language (AMI).
Personnel from the NWI and the North Carolina Depart-
ment of Environment, Health, and Natural Resources
Division of Soil and Water Resources have reviewed the
classification of the Cowardin types into DCM wetland
types. The classes that DCM currently recognizes are
upland, salt/brackish marsh, estuarine shrub scrub, es-
tuarine forest, maritime forest, pocosin, bottomland
hardwood, swamp forest, headwater swamp, hardwood
flatwoods, piney flatwoods, and managed pinelands.
DCM also classifies soils as hydric or nonhydric based
on List A of the U.S. Soil Conservation Service (SCS)
List of Hydric Soils.

The base of the map is the NWI polygon coverage.
Some NWI polygons are omitted from the DCM maps
because they are temporarily flooded, but on nonhydric
soils or because recent TM imagery indicates these
areas are currently bare ground. The managed pineland
wetland group on DCM maps includes areas that NWI
considers uplands, identified as pine monocultures on
the imagery, and that occur on hydric soil.
In addition, DCM also provides a modifier to some of
these polygons. DCM notes if NWI has determined that
the area has been drained or ditched. Areas designated
as wetlands at the time of the NWI photography that
currently appear as bare ground on the TM imagery are
designated as 'cleared' on the maps. Many of the
cleared areas would no longer be considered jurisdic-
tional wetlands. These modifiers are useful indicators of
the impacts wetlands sustain from human activities.

Initiation of an interactive session follows completion of
the automated procedure. This session considers land-
scape characteristics that are not easily described to
a computer model in correcting the classification. This
is especially important in distinguishing bottomland
hardwood swamps from hardwood flats. Both contain
deciduous, broad leaf species of trees and can be tem-
porarily flooded. The hydrology of these systems, how-
ever, is completely different. All bottomland hardwood
swamps, for example, must be adjacent to a river where
they receive seasonal floodwaters from the channel.
Conversely, hardwood flatwoods should be located on
interfluvial flats and not adjacent to any streams. Water
is not introduced into hardwood flatwoods via a channel;
rather, precipitation and ground water provide the water
for this system. Polygons that are adjacent to rivers or
estuaries but do not have a distinct channel designated
in the hydrography coverage are considered headwater
swamps.

During the course of methodology development, staff
members visited at least 371 sites in the field. As staff
members encountered new Cowardin classes, they
would verify that the polygons were being placed into
the correct DCM categories. If they determined that a
particular Cowardin class was systematically misidenti-
fied, they updated the algorithm for automation. While
this method does not provide for a usable accuracy
assessment, it allowed development of the most accu-
rate methods.

The accuracy of these data is unknown at this time. An
accuracy assessment of the data is anticipated in the
near future. This assessment will allow map users to
understand the strengths and limitations of the data. It
also will provide an overall summary of data error.

Functional Assessment of Wetlands

Certain initial considerations shaped the approach and
methods used in developing a wetlands functional as-
sessment procedure. The procedure needed to fit within
the context and objectives of the Wetlands Conservation
Plan for the North Carolina coastal area as described
above. This context, and the opportunities and limita-
tions it imposed, had considerable influence on the spe-
cific procedure developed.

Because we are dealing with a large geographic area
with many wetlands, we recognized from the outset that
we needed a method we could apply to large land areas
without site visits to each individual wetland. This ruled
out the many site-specific functional assessment meth-
ods that were applied in other contexts. Almost of ne-
cessity, a CIS-based approach was chosen. That meant
we would have to use information available in CIS for-
mat and make use of CIS analytical techniques. The
wetland mapping on which the functional assessment is
based was performed using CIS, so the basic digital
data were available.

The primary objective was to produce information about
the relative ecological importance of wetlands that would
be useful for planning and overall management of wet-
lands rather than to serve as the basis for regulatory
decisions. While we could not visit every wetland, the
goal was to predict the functional assessment value that
a detailed, site-specific method would determine. We
wanted to be able to predict in advance what the wetland
regulatory agencies would determine as a wetland's
significance so that the resulting maps would identify
those wetlands where a 404 permit would be difficult or
impossible to obtain. The resulting information would
203
-------
then be useful in determining where not to plan devel-
opment. This would benefit potential permit applicants
by preventing ill-advised plans that would be unlikely to
receive permits and simultaneously serve to protect the
most ecologically important wetlands. The result of the
procedure, then, is not a substitute for a site visit in
making regulatory decisions, but a predictor of what a
site visit would determine.

A primary consideration was that the procedure be
ecologically sound and scientifically valid, based on the
best information available about the functions of wet-
lands. It needed to be based on fundamental principles
of wetlands and landscape ecology rather than on arbi-
trary or subjective decisions.

Finally, the procedure was to be watershed-based. This
requirement was primarily because consideration of a
wetland's role in its watershed is the soundest basis for
determining its ecological significance, but also because
the other components of the Wetlands Conservation
Plan, including wetland mapping and restoration plan-
ning, are based on watershed units. The watersheds
being used are 5,000- to 50,000-acre hydrologic units
(HUs) delineated by the SCS as illustrated in Figure 1.
The North Carolina coastal area comprises 348 of these
HUs. Watershed units of any size, however, could be
used without changing the validity of the watershed-
based considerations used in the procedure.

These initial considerations result in a summary defini-
tion of the functional assessment procedure. It is a CIS-
based, landscape scale procedure for predicting the
relative ecological significance of wetlands throughout a
region using fundamental ecological principles to deter-
mine the functions of wetlands within their watersheds.

The functional assessment procedure is meant to be
used with CIS data for regional application. It is not a
field-oriented, site-specific method that involves visiting
individual wetlands and recording information. A CIS-
based procedure is the only practical approach for deal-
ing with a large geographic area with many wetlands in
a limited amount of time.
This CIS-based approach can make information on wet-
land functional significance available for broad regions
in advance of specific development plans. The informa-
tion is then available for planning to help avoid impacts
to the most ecologically important wetlands. In this
sense, the North Carolina procedure is unlike other
functional assessment techniques that are designed for
use in a regulatory context or that require field data for
each wetland.

Data Requirements

Because the procedure uses CIS analysis, it requires
digital information in CIS format. CIS data layers used
in the procedure include:
• Wetland boundaries and types (the topic of the first
section of this report).

• Soils maps.

• Land use/land cover.

• Hydrography.

• Watershed boundaries.

• Threatened and endangered species occurrences.

• Estuarine primary nursery areas.

• Water quality classifications.

In the North Carolina coastal area, these data layers
either already existed and were available from the CGIA
or were developed as part of the Wetlands Conservation
Plan. Because other projects funded most of the data
acquisition and digitization, developing the necessary
CIS databases was not a major cost.

The soils coverage consists of digitized, detailed county
soils maps produced by SCS and digitized by CGIA. The
soils coverage allows identification of the soil series
underlying a wetland, and the properties of the series
are used to determine soil capacity for facilitating the
wetland's performance of various functions.

The land use/land cover data layer was produced for the
APES from interpretation of satellite TM imagery (8). It
is used to determine land cover and uses surrounding
each wetland and in the watershed.

The basic hydrography coverage consists of 1:24,000-
scale USGS DLGs. Because the functional assessment
procedure uses stream order as an indicator of water-
shed position, stream order according to the Strahler
system was determined manually and added to the DIG
attribute files.

As described previously, the watersheds used in the
procedure are relatively small HUs delineated by SCS.
DCM contracted with CGIA to have these boundaries
digitized for the coastal area. During the digitization
process, the watershed boundaries were rectified to
USGS and DEM boundaries of larger subbasins to en-
sure that the HUs could be combined into larger water-
shed units.

A data layer produced by the North Carolina Natural
Heritage Program is used to identify threatened and
endangered species occurrences. The North Carolina
Division of Marine Fisheries maintains the coverage of
primary nursery areas, and the Division of Environ-
mental Management developed a map of water quality
classifications that was digitized by CGIA.

The ways in which these data layers are used to determine
values for various parameters in the functional assess-
ment procedure are described later in this report. The
CIS procedures have been automated using ARC/ INFO
204
-------
AMI on a Sun workstation. The AMI programs are
available from DCM to anyone planning to use the pro-
cedure elsewhere.

Because the assessment procedure was designed for
CIS analysis, the choice and expression of individual
parameters have been shaped to some extent by the
CIS data available and the capabilities and limitations of
ARC/INFO techniques and AMI automation. DCM was
fortunate to have a relatively large amount of CIS data
readily available. For use in other areas, the procedure
could be modified to use different CIS coverages. At
least the first five databases listed above, however, are
essential to its basic propositions.

Classification Considerations

The HGM classification system for wetlands (7) classi-
fies wetlands into categories based on landscape position
(geomorphic setting), water sources, and hydrodynam-
ics (direction of water flow and strength of water move-
ment). It is being increasingly used as the basis for
wetland classification and functional assessment sys-
tems. HGM classification focuses on the abiotic features
of wetlands rather than on the species composition of
wetland vegetation as do most traditional wetland clas-
sification schemes.

Several features of the HGM classification system make
it a useful starting point for an assessment of wetland
functions. Because the HGM system is based on geo-
morphic, physical, and chemical properties of wetlands,
it aggregates wetlands with similar functions into
classes. The HGM class of a wetland, in itself, indicates
much about the ecosystem functions of the wetland. The
HGM approach also forces consideration of factors ex-
ternal to the wetland site, such as water source. This
helps relate the wetland to the larger landscape of which
it is a part and puts consideration of the wetland's func-
tions in a landscape and watershed context.

Three HGM classes are used as the starting point for
the North Carolina functional assessment procedure. All
wetlands are first classified as one of the following:
• Riverine

• Headwater

• Depressional

Riverine wetlands are those in which hydrology is deter-
mined or heavily influenced by proximity to a perennial
stream of any size or order. Overbank flow from the
stream exerts considerable influence on their hydrology.
Headwater wetlands exist in the uppermost reaches of
local watersheds upstream of perennial streams. Head-
water systems may contain channels with intermittent
flow, but the sources of water entering them are precipi-
tation, overland runoff, and ground-water discharge
rather than overbank flow from a stream. Depressional
wetlands, including wet flats and pocosins, generally are
not in direct proximity to surface water. While they may
be either isolated from or hydrologically connected to
surface water, the hydrology of depressional wetlands is
determined by ground-water discharge, overland runoff,
and precipitation.

The functions of wetlands in these different HGM
classes differ significantly. Riverine wetlands regularly
receive overbank flow from flooding streams and, thus,
perform the functions of removing sediment and pollut-
ants that may be present in the stream water and pro-
viding temporary floodwater storage. Headwater and
depressional wetlands cannot perform these functions
because they do not receive overbank flow. Headwater
wetlands occur at landscape interfaces where ground
water and surface runoff coalesce to form streams.
Headwater wetlands provide a buffer between uplands
and stream flow so they can perform significant water
quality and hydrology functions. While depressional wet-
lands do not perform buffer functions, they often store
large amounts of precipitation or surface runoff waters
that otherwise would more rapidly enter streams. Wet-
lands in all HGM classes can perform important habitat
functions.

Because the wetlands in these different HGM classes
are functionally different, their functional significance is
assessed using different, though similar, procedures. If
the same procedure were used for all HGM classes,
depressional wetlands would always be considered of
lower functional significance simply because they are
not in a landscape position to perform some of the water
quality and hydrologic functions of riverine and headwa-
ter wetlands.

In addition to HGM classes, wetland types identified by
dominant vegetation are used at several points in the
functional assessment. This reflects a recognition that
the biologic properties of a wetland site considered to-
gether with its hydrogeomorphic properties can provide
a more detailed indication of its functions than either
taken alone. The HGM class of a wetland, as a broad
functional indicator, determines which assessment pro-
cedure to use. Within each HGM class and correspond-
ing assessment procedure, wetland type determines the
level or extent of specific parameters.

The wetland types used are those typical of the North
Carolina coastal area. They result from a clumping of
the Cowardin classes used on NWI maps into fewer
types with more intuitively obvious type names (e.g.,
swamp forest, pocosin), as described previously. These
wetland types are used in the wetland maps that form
the starting point for the functional assessment.

Wetland types are used in the procedure as indicators
of functional characteristics. Correlations between wet-
land type and wetland functions were determined from
205
-------
statistical analysis of field data from nearly 400 sites. At
each site, the presence or absence of a list of functional
indicators was recorded. Dr. Mark Brinson of East Caro-
lina University developed the functional indicators lists,
in part. Dr. Brinson served as primary scientific consult-
ant in developing the HGM classification system and the
field sampling methodology.

Wetland types differ in other areas, so their inclusion in
this procedure limits its use in its current form to the
southeastern coastal plain. Adaptation of the procedure
for use in other areas would require either extensive field
sampling as was performed in coastal North Carolina or
a more arbitrary clumping of wetland types based on
best professional judgment. Other methods of wetland
classification could be used, provided wetlands are clas-
sified in such a way that functional characteristics of the
wetland types are constant and can be determined by
field sampling, literature values, and/or professional
judgment. The procedure could be applied directly to
NWI polygons if these are the only wetland map base
available.

In addition to wetland type, several other parameters are
used as indicators of the existence or level of specific
wetland functions. These include both site-specific pa-
rameters, such as wetland size and soil characteristics,
and landscape considerations, such as watershed posi-
tion, water sources, land uses, and landscape patterns.
CIS analysis determines values for these parameters
based on the data layers discussed above. They could
be determined manually, but the process would be very
labor intensive.

Unlike assessment procedures that depend solely on
information that can be collected within a wetland, this
procedure relies heavily on factors external to the wet-
land site itself. Relationships between a wetland and the
landscape within which it exists are integral considera-
tions in determining wetland functional significance.
Characteristics of the landscape surrounding a wetland
are often more important determinants of its functional
significance than are the characteristics of the wetland
itself. Of the 39 parameters evaluated in the procedure,
21 are landscape characteristics, and 18 are internal
characteristics of the wetland itself.

While we believe this emphasis on a wetland's land-
scape context is a more ecologically sound approach to
functional assessment than site-specific methods, it re-
quires a great deal more information than could be
collected within the wetland itself. The procedure is
based on CIS data and analysis, not only to make it
suitable for regional application, but because CIS pro-
vides the most practical way to analyze the spatial rela-
tionships of landscape elements and their properties.
Structure of the Assessment Procedure

The assessment procedure uses a hierarchical structure
that rates individual parameters and successively com-
bines them to determine the wetland's overall functional
significance. The complete hierarchical structure is illus-
trated in Figure 2. It consists of four levels:

• Overall functional significance of the wetland.

• Specific functions and risk of wetland loss.

• Subfunctions.

• Parameters evaluated to determine the level and ex-
tent of functions.

The objective of functional assessment is to determine
an individual wetland's ecological significance in its wa-
tershed and the larger landscape. The highest hierarchi-
cal level, or end result of applying the procedure, then,
is the wetland's overall functional significance.

The second hierarchical level includes the four primary
factors that are considered in determining the wetland's
functional significance (see Figure 3). The overall eco-
logical significance of a wetland is determined by the
degree to which it performs, or has the capacity to
perform, specific functions. The broadest grouping of
wetland functions includes water quality functions, hy-
drologic functions, and habitat functions. The nature of
the landscape and the water characteristics of the wa-
tershed in which a wetland functions also determine
ecological significance to some extent. These factors
determine the potential risk to watershed and landscape
integrity if the wetland functions were lost. Including a
"risk factor" as a basic consideration in functional as-
sessment also provides a means of considering cumu-
lative impacts and the practicality of replacing lost
functions through mitigation in determining a wetland's
overall significance.

Each primary function of wetlands is actually a combi-
nation of separate, more specific subfunctions. Water
quality subfunctions include the removal of nonpoint
source pollutants from surface runoff and the removal of
suspended or dissolved pollutants from flooding
streams. Hydrology subfunctions include storage of pre-
cipitation and surface runoff, storage of floodwater from
streams, and shoreline stabilization. Habitat subfunc-
tions include providing habitat for both terrestrial species
and aquatic life. Several considerations that, while not
truly wetland functions, are called subfunctions for par-
allelism also determine risk factor. The subfunction lev-
els of the assessment procedure are illustrated in
Figures 4 through 7.

Properties of the wetland and its surrounding landscape
determine the extent to which a wetland performs these
different subfunctions. The assessment procedure re-
fers to these properties as "parameters." Parameters
206
-------
Overall Significance •

Functions/Risk III I

Subfunctions • • | I | I •••it

„ III IIIIIII Illll | I I III I "l Illll
Figure 2. Overall hierarchical structure of the functional assessment procedure.
Wetland's Overall
Ecological
Significance
Hydrology
Functions
Habitat
Functions
Risk
Factor
Figure 3. Assessment level two: Primary wetland functions and risk factor.

Water Quality
Functions

Nonpoint Source
Function
Floodwater
Cleansing Function
Figure 4. Water quality subfunctions.

make up the levels in the hierarchical structure that are
actually evaluated based on fundamental ecological
considerations. Parameter values, in turn, are combined
to produce ratings for the subfunctions. Future reports
will explain in detail all parameters evaluated in the
assessment procedure and document them for scientific
validity. This paper discusses only the parameters under
the nonpoint source removal subfunction of the water
quality function for illustration (see Figure 8).

The first parameter determining a wetland's significance
in removing nonpoint source pollutants from surface
runoff water is whether the water contains sediment,
nutrients, or toxic pollutants in significant quantities. This
is evaluated in the "proximity to sources" parameter
based on the land uses surrounding the wetland. If
agricultural fields or developed areas from which pollut-
ants are likely to enter surface runoff largely surround
the wetland, the wetland's potential for removing non-
point source pollutants is high. If, on the other hand,
natural vegetation from which runoff water is likely to be
largely unpolluted mostly surrounds the wetland, its po-
tential for removing significant pollutants is low.

Proximity to sources is an "opportunity" parameter. That
is, it determines whether a wetland has the opportunity
to remove pollutants from surface runoff by considering
how likely the runoff water is to be polluted. The other
parameters for this subfunction are "capacity" parame-
ters that measure the wetland's ability to perform the
function if the opportunity is present. Opportunity and
capacity parameters are treated differently in determining
a wetland's overall significance to prevent a wetland from
being rated lower simply because present opportunity
does not exist. This is discussed in more detail below.

The second parameter considered in determining a wet-
land's significance in nonpoint source removal is its
proximity to a surface water body. If runoff entering a
wetland would otherwise directly enter surface water,
the wetland's significance as a filter is greaterthan if the
wetland is far removed from surface water. In that case,
pollutants in runoff could either settle out or be removed
by other means before they enter surface water as
pollutants.
207
-------
Hydrology
Functions
Surface Runoff
Storage
Floodwater
Storage
Shoreline
Stabilization
Figure 5. Hydrology subfunctions.
Habitat
Functions
Terrestrial
Wildlife
Aquatic
Life
Figure 6. Habitat subfunctions.
Risk
Factor
Landscape
Character
Water
Characteristics
Replacement
Difficulty
Restoration
Potential
Figure 7. Risk factor subfunctions.
Nonpoint Source
Function
Proximity to
Sources
Proximity to
Water Body
Watershed
Position
Site
Conditions
Wetland
Type
Soil
Characteristics
Figure 8. Parameters evaluated under nonpoint source pollutant removal subfunction.
208
-------
The third parameter is the position of the wetland in its
watershed. Several studies have documented that
headwater wetlands are most effective in removing non-
point source pollutants (9-11). Thus, the higher in its
watershed a wetland is located, the higher is its signifi-
cance in nonpoint source removal.

Two subparameters, wetland type and soil charac-
teristics, determine the value of the fourth parameter,
site conditions. By virtue of their typical microtopogra-
phy, hydrology, and vegetative structure, some wetland
types more effectively retain and filter surface runoff
than do other types. Some soil series are more effective
than others in retaining and chemically transforming
pollutants. Each subparameter is rated, and their com-
bined values produce a rating for the site conditions
parameter.

A similar evaluation of specific parameters is performed
to derive significance ratings for other wetland subfunc-
tions. In all cases, CIS analysis determines parameter
values based on the data layers described above. Some
parameters, such as wetland type in the nonpoint source
illustration, are surrogates or indicators of other wetland
properties that actually determine the wetland's func-
tional capacity. The limitations of CIS data and tech-
niques necessitate the use of indicator parameters.

Evaluation Procedure

The objective of the assessment procedure is to deter-
mine an individual wetland's ecological significance in
the watershed in which it exists. Ecological significance
is divided into three broad classes (high, medium, and
low) rather than attempting to derive a specific numerical
"score." This is partly because of the procedure's initial
application in an EPA ADID project performed by DCM
in Carteret County, North Carolina. Standard ADID pro-
cedure is to classify wetlands into three groups:

• Areas generally unsuitable for the discharge of
dredged or fill material.

• Areas that require a project-by-project determination.

• Possible future disposal sites for dredged or fill material.

These groups correspond to the H, M, and L used in the
assessment procedure.

The approach of classifying wetlands into three broad
functional significance classes is also used, however,
because it is feasible with our current understanding of
wetland function. Attempting to assign a specific value
along a numeric continuum of functional significance
greatly exaggerates the precision with which we can
realistically apply current knowledge. The three signifi-
cance classes used in the assessment procedure pro-
vide the information necessary to meet the procedure's
objectives without going beyond the realm of reasonable
scientific validity.
As explained above, the basic evaluation is performed
at the parameter level. An H, M, or L value is assigned
to each parameter as it relates to the performance of the
wetland subfunction being considered. For example, if
the soils underlying a wetland have properties that are
highly conducive to the function being considered, the
soil characteristics parameter is rated H; if soil proper-
ties are less conducive to performing the function, the
parameter is rated M; and if soil properties are not at all
conducive to the function, the parameter is rated L. All
individual parameters under a given subfunction receive
similar ratings.

The individual parameter ratings are then combined to
give an H, M, or L rating for each subfunction. The
subfunction ratings are combined into a rating of the
wetland's significance in performing each of the primary
wetland functions. Finally, the ratings for primary func-
tions are combined into an overall rating of the wetland's
functional significance.

The process of successively combining ratings up the
structural hierarchy is the most complex aspect of the
assessment procedure. The combining, as well as the
evaluation of individual parameters, is based on funda-
mental ecological principles about how wetlands and
landscapes function. Because the ecological processes
themselves interact in complex ways, combining ratings
is much more complex than a simple summation of
individual ratings. Some parameters are normally more
important than others in determining the level at which
a wetland performs a specific function and, thus, must
be weighed more heavily in determining the combined
value. In some cases, different combinations of individ-
ual parameter ratings result in the same level of func-
tional significance. Each possible combination of
parameters must then be considered.

The automated version of the assessment procedure
maintains all individual parameter ratings and combina-
tions in a database. Because the combining process is
complex, the reason a wetland receives an overall H, M,
or L rating may not be intuitively obvious. The database
makes it possible to trace through the parameter, sub-
function, and primary function ratings that result in a
wetland's overall rating.

This database also allows consideration of specific wet-
land functions individually. For example, in a watershed
targeted for nonpoint source pollution reduction, one
management objective may be to give the highest level
of protection to wetlands most important in performing
this function. The database allows examination of each
wetland for its significance in nonpoint source removal
and production of a map of wetlands rated according to
their significance for this single function.

Individual function ratings in the database can also
be used to improve planning, impact assessment, and
209
-------
mitigation for development projects that affect wetlands.
If alternative sites are available, such as alternative
corridors for a highway, the alternative with the least
impact on the wetland function considered most impor-
tant in the watershed can be identified. Rather than
simply minimizing acres of wetland impact, the objective
would be to minimize impacts to the most important
wetland functions. Environmental assessment of wet-
land impacts can identify specific functions to be lost.
Mitigation can be improved by giving priority to sites with
the highest potential for performing the same functions.

Future reports will explain detailed procedures for evalu-
ating individual parameters and combining them into
functional ratings. This paper illustrates only the water
quality nonpoint source removal subfunction. The rating
system for this subfunction is summarized in Figure 9.

Four parameters are evaluated to determine the signifi-
cance of the nonpoint source removal subfunction.
Because (d), the site conditions parameter, has sub-
parameters below it, it is first evaluated using a relatively
simple procedure. If conditions typical of the wetland
type and characteristics of the underlying soil are both
highly conducive to removal of pollutants in runoff water
entering the wetland, the site conditions parameter is H.
If either the wetland type or the soil is not at all conducive
to pollutant removal and the other subparameter is no
more than somewhat conducive, the site conditions pa-
rameter is L. Any other combination results in an M.
Parameters

(a) Proximity to Sources

(b) Proximity to Surface Water

(d) Site Conditions

(1) Wetland Type

(2) Soil Characteristics

Evaluation Procedures

Site Conditions

H Both Parameters H

M Other combinations

L One parameter L and neither H

NPS Subfunction

H (a) & (b) H and (d) at least M or
(c) & (d) H and (b) at least M

M Other combinations

L Two of (b), (c), & (d) L
Figure 9. Parameters evaluated under nonpoint source pollut-
ant removal subfunction.
Following evaluation of all parameters, they are com-
bined to evaluate the significance of the wetland in
removing nonpoint source pollutants. Two combinations
result in the wetland being evaluated as highly signifi-
cant in performing this function. First, if the wetland is
adjacent to both a significant source of polluted runoff
(a = H) and a permanent surface water body into
which the runoff would flow if the wetland were not
there (b = H), and has site conditions that are at least
reasonably efficient in catching, holding, and removing
pollutants from the runoff (d at least M), it receives an
H. Alternatively, even if the wetland is not adjacent to a
pollutant source, it receives an H if it is in the headwaters
of the watershed (c = H), site conditions are highly
conducive to pollutant removal (d = H), and it is at least
close to an intermittent stream (b at least M).

On the other hand, if any two of parameters (b), (c), and
(d) are evaluated L, the significance of the wetland for
nonpoint source pollutant removal is Low. That is, the
wetland is evaluated as L for this function if any of the
following conditions exist:

• The wetland is not close to surface water (b = L) and
downstream in the watershed (c = L).

• The wetland is not close to surface water (b = L), and
its site conditions are poor for pollutant removal (d = L).

• The wetland is downstream in the watershed (c = L)
and has poor site conditions (d = L).

Any combination of parameter evaluations other than
those resulting in an H or L results in the wetland being
evaluated as of moderate significance for removing non-
point source pollutants. This example is typical of evalu-
ation procedures used for all subfunctions. More often
than not, the evaluation procedures are complex and
multifarious in their reasoning and application. Hope-
fully, though, they are scientifically valid based on cur-
rent knowledge of wetland ecology.

Opportunity and Capacity

The concepts of opportunity and capacity for a wetland
to perform a given function were briefly discussed
above. For a wetland to actually perform a function, it
must have both the opportunity and the capacity for the
function. In terms of the nonpoint source example, a
source of potentially polluted runoff must enter the wet-
land to provide an opportunity, and the wetland must
have the internal capacity to hold the runoff and remove
the pollutants before releasing the water. Factors exter-
nal to the wetland usually determine the opportunity to
perform a function, while properties of the wetland itself
along with its landscape position determine the capacity
to perform the function.

Because the assessment procedure is a landscape
scale procedure that evaluates the functions a wetland
210
-------
performs in relation to its surroundings, essentially every
subfunction includes opportunity parameters. A func-
tional assessment that is too heavily dependent on op-
portunity parameters, however, is static and rapidly
becomes invalid as land uses change. A wetland that is
bordered by natural forest today can be bordered by a
young pine plantation or a subdivision under construc-
tion by next year. The fact that a wetland does not have
the opportunity to perform certain functions today does
not mean that it will not have the opportunity in the
future. If an assessment of wetland significance is to
remain valid overtime in a landscape subject to change,
opportunity parameters alone cannot be determinative.

The evaluation procedure for the nonpoint source sub-
function explained above is an example of how the
assessment procedure handles this situation. The op-
portunity for a wetland to receive polluted runoff water
from surrounding lands (a = H) can result in an evalu-
ation of H for this subfunction if other properties are also
present, but it does not have to be present for a wetland
to be evaluated H. Other parameters (c and d = H, and
b at least M) that give a wetland a high capacity to
remove nonpoint source pollutants can also result in an
H. Conversely, lack of present opportunity (a = L) does
not result in an evaluation of low significance for this
function. At least two of the other parameters must be L
for the wetland to be evaluated as L.

These conventions hold throughout the procedure. A
present high opportunity to perform a function can result
in an evaluation of high significance for the function, but
high capacity can also result in an H evaluation even if
present opportunity is lacking. Lack of present opportu-
nity alone never results in an evaluation of low signifi-
cance for a function. High opportunity is treated
essentially as a "bonus" consideration that can result in
a higher evaluation for a wetland than its capacity alone
would indicate but that will never result in a lower evalu-
ation because of its absence.

Overriding Considerations

Several considerations are of such importance in the
North Carolina coastal area that their presence alone
will result in a wetland evaluation of high significance.
These parameters are evaluated first as either true or
false, and if one or more of them is true, the rest of the
evaluation procedure is not performed.

The first overriding consideration is whether the wetland
is a salt or brackish marsh meeting the definition of
"coastal wetland" as set forth in North Carolina statutes
(NCGS 113-229(n)(3)) and rule (NCAC 7H .0205(a)).
Coastal wetlands in North Carolina are designated by
law as highly significant. Consequently, the assessment
procedure evaluates them automatically as H and in-
cludes no considerations for differentiating among the
functional significance of these wetland types.
The second overriding consideration is whether the wet-
land is adjacent to an officially designated primary nurs-
ery area (PNA). All designated PNAs are included in
"areas of environmental concern" in the NC CMP and
are protected by a specific set of regulations. They are
areas where initial postlarval development of finfish and
crustaceans takes place and, thus, are critical to estu-
arine fish and shellfish populations. Wetlands adjacent
to PNAs are highly important in maintaining water qual-
ity and appropriate salinity gradients in these critical
areas and are automatically evaluated as of high func-
tional significance.

The third overriding consideration is whether the wet-
land contains threatened or endangered species. If a
known threatened or endangered plant or animal spe-
cies on either federal or state lists is present, the wetland
is evaluated as highly significant. The determination is
based on information obtained from the North Carolina
Natural Heritage Program.

The fourth overriding consideration is whether the wet-
land includes all or part of a critical natural area as
designated by the North Carolina Natural Heritage Pro-
gram. If so, the site is considered of high significance.
CIS data layers maintained by the Natural Heritage
Program also help make this determination.

Verification

Throughout the development and initial application of
the assessment procedure, we have checked and veri-
fied its validity. Parameter evaluations and combination
procedures are based on the best wetland science avail-
able in the scientific literature. The validity and accuracy
of the CIS databases used to apply the procedure
have been verified to the extent possible. Following sec-
tions of this report fully document any assumptions
made about wetland ecology, CIS data, or CIS analytical
techniques.

An advisory panel of wetland scientists familiar with the
wetlands of coastal North Carolina and representatives
of several state and federal wetland-related agencies
reviewed every step of the procedure's development.
While their review does not represent an endorsement
of the procedure or its results by the agencies or indi-
viduals included, it does indicate the level of peer review
the procedure has received.

During development of the procedure, field visits were
made to nearly 400 wetland sites to gather data on func-
tional indicators. On these same site visits, a field-based
functional assessment procedure, the Wetland Rating
System developed by the North Carolina Division
of Environmental Management, was applied. This pro-
vides the basis for a field verification of the assessment
procedure.
211
-------
Discussion

As we continue to understand more about the role of
wetlands in maintaining a healthy environment, the use-
fulness of wetlands locational data continues to grow in
importance. Spatial data can assist county planners in
guiding development away from environmentally sensi-
tive areas. Landowners now have the capability to look
at a map and realize very quickly that wetlands exist on
a given area of land. In addition, economic development
councils can use this information to plan development in
areas attractive to a particular industry. If a new busi-
ness or industry wishes to locate in an area positioned
such that the wetlands permitting process could be
avoided, maps showing lands void of wetlands could be
a significant tool to the economic development council.
The representation of these wetlands' ecological signifi-
cance dramatically increases the utility for these data.

While paper maps can be distributed to all interested
parties, digital data also are available to public agencies
who have CIS capabilities. In Carteret County, for exam-
ple, a publicly installed workstation will be made avail-
able with these data installed. The county government
will be able to view wetlands in the context of cadastral
boundaries that already are on CIS. Information about
sensitive resources made available prior to any devel-
opment will, hopefully, lead development away from
environmentally sensitive areas.

References
1. Hefner, J.M., and J.D. Brown. 1985. Wetland trends in the south-
eastern United States. Wetlands 4:1-12.
2. Dahl, I.E. 1990. Wetlands losses in the United States 1780s to
1980s. U.S. Department of the Interior, Fish and Wildlife Service,
Washington, DC.
3. DEM. 1991. Original extent, status, and trends of wetlands in
North Carolina. Report No. 91-01. North Carolina Department of
Environment, Health, and Natural Resources, Division of Envi-
ronmental Management, Raleigh, NC.

4. DCM. 1992. Final assessment of the North Carolina coastal man-
agement program. Report to the Office of Ocean and Coastal
Resource Management, NOAA, U.S. Department of Commerce,
performed under the Coastal Zone Enhancement Grants Pro-
gram (January 10). Section 309, CZMA, North Carolina Division
of Coastal Management, Raleigh, NC.

5. DCM. 1992. Final strategy for achieving enhancements to the
North Carolina coastal management program. Proposal to the
Office of Ocean and Coastal Resource Management, NOAA, U.S.
Department of Commerce, performed under the Coastal Zone En-
hancement Grants Program (March 25). Section 309, CZMA,
North Carolina Division of Coastal Management, Raleigh, NC.

6. Hefner, J.M., and K.K. Moorhead. 1991. Mapping pocosins and
associated wetlands in North Carolina. Wetlands 2(Special Is-
sue):377-389.

7. Brinson, M.M. 1993. A hydrogeomorphic classification for wet-
lands. U.S. Army Corps of Engineers Waterways Experiment Sta-
tion Report WRP-DE-4. Washington, DC.

8. Khorram, S., H. Cheshire, K. Siderelis, and Z. Nagy. 1992. Map-
ping and CIS development of land use/land cover categories for
the Albemarle-Pamlico drainage basin. Report No. 91-08-NC.
Raleigh, NC: North Carolina Department of Environment, Health,
and Natural Resources.

9. Leopold, L.B. 1974. Water: A primer. San Francisco, CA: WH.
Freeman and Co.

10. Whigham, D.F., C. Chitterling, and B. Palmer. 1988. Impacts of
freshwater wetlands on water quality: A landscape perspective.
Environ. Mgmt. 12(5):663-671.

11. Novitski, R.P. 1979. The hydrologic characteristics of Wisconsin
wetlands and their influence on floods, streamflow, and sediment.
In: Greeson, P.E., J.R. Clark, and J.E. Clark, eds. Wetland func-
tions and values: The state of our understanding. Lake Buena
Vista, FL: American Water Resources Association.
212
-------
Decision Support System for Multiobjective Riparian/Wetland Corridor Planning
Margaret A. Fast and Tina K. Rajala
Kansas Water Office, Topeka, Kansas
Kansas has numerous programs that affect riparian cor-
ridors and associated wetlands. These programs in-
clude planning, monitoring, assistance, research, and
regulatory activities. Although administration of these
programs often overlaps, integration of program objec-
tives into a holistic, multiobjective approach to resource
planning and management has been lacking. A large
amount of resource data was routinely collected and
compiled, but no effective way had been developed to
integrate these data into the decision-making process.

The Kansas Water Office (KWO) was awarded a grant
in September 1992 from the U.S. Environmental Protec-
tion Agency (EPA) to develop a geographic information
system (CIS) decision support system (DSS) that would
enable the state to augment its ability to manage ripar-
ian/wetland corridors. The project used CIS to differen-
tiate between reaches of a stream corridor to evaluate
their environmental sensitivity. The Neosho River basin,
one of 12 major hydrologic basins in Kansas, was used
as a pilot to demonstrate the feasibility of the concept.

The KWO will use the DSS to help target sensitive areas
in the Neosho basin for further planning activities. The
project will also benefit other state agencies in their
riparian/wetland corridor efforts. The implementation of
planning objectives may involve local units of govern-
ment and, ultimately, private landowners.

Major phases of the project included:

• A needs assessment study

• A feasibility analysis

• A system design

• Construction of the DSS for the Neosho River basin

• A final evaluation of the DSS capabilities

An interagency project advisory group (IPAG), consist-
ing of representatives from eight agencies directly or
indirectly involved in riparian and wetland protection
activities, was formed to assist in project design and
evaluation.

Major steps involved in designing the DSS included:

• Selection and CIS development of databases used
for riparian corridor evaluation.

• Creation of riparian corridor segments.

• Development of an analysis methodology to apply to
corridor segments.

• Evaluation of the DSS.

Databases Selected for Decision Support
System Development

Many types of data were reviewed for the DSS. Several
were not used due to the costs associated with geo-
graphically referencing the data, given the current data
format.

The databases listed in Table 1 are available in the DSS.

During the system design phase of the project, the IPAG
identified the need to develop a pilot study area for the
DSS. The IPAG had difficulty understanding how a DSS
would use geographically referenced data sets (cover-
ages). Before committing to a design for the develop-
ment of a basinwide system, the IPAG decided first to
develop a pilot study area, with a specific focus (appli-
cation), that could be on-line and demonstrated early.
This would allow time for further refinement of the scope
of work and identification of coverages to be developed
prior to basinwide development of the DSS. For the pilot
study application, the IPAG chose to assess the value
and vulnerability of the riparian areas in two 11-digit
hydrologic unit code (HUC11) watersheds to allow the
user to evaluate a corridor segment and compare be-
tween segments and to prioritize or target segments for
further planning activities.

As development of data layers progressed for the pi-
lot, the IPAG quickly determined that the DSS project
213
-------
Table 1. DSS Database List
DSS Name Data Description
Source
Boundary
Buffer
Channels
Con_ease
Contam
Corridor
County
Dams
Dwrapp
Gages
Geology
Huc11
HydMOOk
Kats
Landc
Lc_stats
MDS
NPS
Perenial
Pop
PPL
Publand
Roads
Sections
Streamev
Neosho River basin boundary
Riparian corridor
Stream channelization
Conservation easements
Water contamination
Riparian corridor
County boundaries
Dam structures
Water appropriations
United States Geological
Survey (USGS) stream gaging
stations
Surface geology
11 -digit hydrologic unit
boundaries
Hydrology
Kansas water quality action
targeting system
Land cover
Land cover statistics
Minimum desirable stream
flow monitoring gages
Nonpoint source pollution
Perennial hydrology
Population
Populated places
Public lands
Roads
Section corners
1981 stream evaluation
Soil Conservation Service (SCS) HUC11 drainage basins; 1 :100,000-scalea
Original buffer on mainstem Neosho and Cottonwood; 147 corridor segments split
on tributary confluences
Division of Water Resources (DWR) legal description of locations
Locations of important natural resources that could be purchased by the state from
willing landowners for conservation protection
Kansas Department of Health and Environment (KDHE) contamination locations3
Final riparian corridor; 63 corridor segments developed from HUC11 boundaries
Kansas Geological Survey (KGS) cartographic database; 1 :24,000 scale3
DWR legal descriptions of locations
DWR legal descriptions of locations3
USGS latitude-longitude descriptions; CIS cover developed by USGS
KGS 1 :500,000-scale3
SCS HUC11 drainage basins; 1 :100,000-scale3
USGS 1:100,000-scale digital hydrology3
KDHE target valuable and vulnerable scores by HUC11 drainage basin
1 :100,000-scale developed from satellite imagery by the Kansas Applied Remote
Sensing Program, University of Kansas3
Summary statistics on land cover by corridor segment
Subset of USGS gaging stations
Target watersheds identified in the Kansas Water Plan
Reselected perennial streams from 1:100,000 USGS digital hydrology
Urban land cover (from landc) with 1980 and 1990 Census population data
Geographic names information system (GNIS) entries for Kansas; GIS cover
developed by USGS
State and federally owned land digitized from 1:100,000-scale USGS quad maps
USGS 1:100,000-scale digital roads3
KGS cartographic database; 1 :24,000-scale3
U.S. Fish and Wildlife stream evaluation study; Kansas Department of Wildlife and
Parks (KDWP) provided data on paper maps
T_and_e Threatened and endangered
species
Temussel Threatened and endangered
species
Tigrcity City boundaries
Twp Townships
Watrfowl Water fowl locations

Wq_eff Water quality: effluent
Wq_grnd Water quality: ground water
Wqjake Water quality: lake
Wq_strm Water quality: stream
Stream locations of state and federal identified threatened and endangered
species; KDWP provided data on paper maps
Locations of state endangered floater mussels; KDWP provided data on paper maps

U.S. Census 1:100,000-scale TIGER line data; boundaries only, areas not named
(use with PPL)
KGS cartographic database; 1:24,000-scale3
KDWP locations and counts of annual waterfowl migration; data developed from
paper maps (Restrict public distribution of data per KDWP request.)
KDHE sampling sites; GIS cover developed by KDHE
KDHE sampling sites; GIS cover developed by KDHE
KDHE sampling sites; GIS cover developed by KDHE
KDHE sampling sites; GIS cover developed by KDHE
Data available at the Kansas Data Access and Support Center (DASC).
214
-------
parameters would have to be limited to the riparian
corridor along the mainstem of the Neosho and Cotton-
wood Rivers. The costs associated with developing ri-
parian corridor segments for all perennial waters in the
Neosho basin was fargreaterthan the available funding.

Creation of Riparian Corridor Segments

A buffer width of one-half mile (one-quarter mile from
each stream bank) for the mainstem of the Neosho and
Cottonwood Rivers was used to produce the riparian
corridor. If more time and funding had been available,
riparian corridors for all perennial streams in the Neosho
basin could have been developed. The development of
this second view of data, organized by the HUC11 wa-
tershed, would then have been useful for individual
watershed analysis because all perennial streams in the
watershed could be analyzed.

The intersection of the HUC11 basin boundaries seg-
mented the corridor. In several instances, small sliver
polygons were produced where the HUC11 boundary
paralleled the river within the 1/4-mile corridor. The sliver
polygons were dissolved into the majority HUC11. In
other words, this project assumed that the 1/4-mile cor-
ridor buffer was more accurate and useful than the
1:100,000-scale HUC11 boundary.

Many of the HUC11 boundaries that the Soil Conserva-
tion Service (SCS) developed actually follow the course
of the Kansas streams, ratherthan intersect them. When
this occurred along the Neosho and Cottonwood Rivers,
we found that the resulting opposing corridor segments
did not always balance with an equivalent length. Also,
some HUC11 boundaries would first follow the river,
then cross the river. This resulted in corridor segments
that encompass both sides of the river for a portion of
the segment and follow only one side of the river for
another portion of the segment. To address these situ-
ations, the KWO arbitrarily added intersections to create
equivalent left and right bank corridor segments and to
create corridor segments that encompassed either one
side of the river or both sides of the river.

Once the corridor segments were finalized and numbered,
the corridor segment identification number (corrseg-id)
was attached to the other CIS covers. This allows the
reselection of data for a given corridor segment, using
Boolean expressions in the DSS.

Development of an Analysis
Methodology: Land Use

The IPAG determined that one of the most significant
factors associated with the quality of the riparian corridor
is land cover. Land cover was analyzed for the riparian
corridor segments; the CIS cover lc_stats contains sum-
mary statistics for each corridor segment. The calcula-
tions discussed in the following paragraphs identify the
data found in the lc_stats cover. Due to the size of the
land cover data set in the Neosho River basin, the DSS
includes only the land cover within the riparian corridor.

One way of identifying corridor segments in need of
protection or remedial action is to determine the ratio of
the number of acres in the corridor segment that contain
the preferred riparian land cover types (grasses, woods,
and water) to the number of acres that contain the least
preferred types of land cover (crops and urban areas).
The corridor segments can then be ranked according to
that ratio.

Other calculations are useful:

• bad_pct: percentage of the corridor segment that con-
tains crop and urban land cover types.

• bad_tbad: percentage of all crop and urban land
cover for the entire riparian corridor that resides in
the corridor segment.

• 'type'_pct: percentage of the corridor segment that is
crop, grass, wood, water, and urban. Type' refers to
each of the five land cover types; lc_statsuses: a
separate value for each (e.g., crop_pct).

• 'type'_t'type': percentage of each type of land cover
for the entire riparian corridor that resides in the cor-
ridor segment (e.g., crop_tcrop).

• 'type'_acres: total acreage of each type of land cover
in the corridor segment (e.g., crop_acres).

• good_acres: total acreage of grass, wood, and water
in the corridor segment.

• bad_acres: total acreage of crop and urban in the
corridor segment.

Another significant benefit of the DSS is the ability to see
where the land cover types are in relation to the river.
As an example, the ability to identify corridor segments
that have crop land extending to the river on both banks
is useful because they are the segments most vulner-
able to bank erosion. Those segments can then be
targeted for further remedial activities planning.

Decision Support System Requirements

The DSS data sets were developed and analyzed using
ARC/INFO on a UNIX-based workstation. The final cov-
ers were then exported and transferred to a microcom-
puter for use in ARC/VIEW. Hardcopy prints are printed
to a Tektronix Phaser III color wax printer with 18 Mb of
RAM, running in Postscript mode.

The DSS data sets total 26 Mb. ARC/VIEW version 1
requires 8 Mb of RAM to load the program. To run the DSS
efficiently, a 486DX-66 with 16 Mb of RAM is preferred.
The DSS is slower on a 486DX-33 with 8 Mb of RAM. It
was not tested on any other PC configuration, so a con-
figuration in between the two may be satisfactory.
215
-------
Processing GIS Data

Reselecting the perennial streams in the Neosho basin
and further identifying the mainstem of the Neosho and
Cottonwood Rivers using United States Geological Sur-
vey (USGS) 1:100,000-scale hydrography can be time
consuming. Perhaps the River Reach III covers should
replace that data in the future.

Attaching census data to the urban land cover polygons,
as was done for the pop cover, is not recommended.
Use of the TIGER line files and cover would give a more
accurate distribution of the population. Because less
than 5 percent of the riparian corridor had urban land
cover, the KWO did not use the pop cover in its evalu-
ation. Several summary covers of TIGER and census
data will soon be available from DASC.

Clipping the other ARC/INFO covers to the Neosho
basin and attaching the corrseg-id, using the identity
command, was unremarkable.

Processing Non-GIS Digital Data

Channels and dams were in digital format but were not
in ARC/INFO format. The files were processed using the
LeoBase conversion software from the KGS, then gen-
erated into ARC/INFO covers. Some records were lost
in the conversion. The LeoBase program fails to convert,
or incorrectly converts, legal descriptions for sections
that do not have four section corners (e.g., northeastern
Kansas). The Division of Water Resources is in the
process of attaching latitude-longitude to the point loca-
tions. Processing these data should take only a few
hours at most.

Processing Nondigital Data

Several covers were developed on contract from paper
maps or legal descriptions. They were: conservation
easements (con_ease), public lands (publand), stream
evaluation (streamev), threatened and endangered spe-
cies (t_and_e and temussel), and waterfowl (watrfowl).
Most of the data for these covers were drafted on a
1:100,000-scale USGS quad map and digitized. The
stream evaluation data were developed using a
scanned paper map of the coded streams as a backdrop
for the 1:100,000-scale hydrography; the digital streams
were reselected and coded.

In summary, KWO's GIS personnel needed approxi-
mately 275 hours to develop the riparian corridor seg-
ments, process the land cover data and summary
statistics, export the covers, transfer and import the
covers for ARC/VIEW, and assist in the development
and presentation of the DSS demo. Contract personnel
spent approximately 183 hours developing GIS covers
for the DSS. This does not include the time spent iden-
tifying the perennial and mainstem hydrology in the
USGS 1:100,000-scale hydrology.
Final Evaluation of the Decision
Support System

In its final evaluation of the system, the IPAG determined
the system to be useful and an excellent start at consoli-
dating a variety of data that have application for riparian
corridor/wetland issues. Many IPAG members found
ways to use the DSS in their own programs. Additional
comments on the system evaluation are as follows:

• Concern about the lack of complete wetland data.
The land cover data available could not identify wet-
land areas.

• Need for more detailed woodland data. Again, the
resolution of the land cover data precluded detailed
identification of woodland areas. The Kansas Biologi-
cal Survey (KBS), the KWO, and EPA are now pur-
suing options to develop more detailed land cover
data, including wetlands and woodlands.

• The lack of information on the tributaries did not allow
full basin analysis, which would be desirable. This
issue is addressed in the "construction" discussion
above.

• Desirability of expanding the project with elevation
and temporal data.

• Lack of definition of the floodplain. Federal Emer-
gency Management Agency (FEMA) floodplain data
are not easily incorporated into a GIS. Other options,
including satellite imagery of the flood of 1993, will
be evaluated.

• Project development requires extensive communica-
tion between program people and GIS technicians.
This can be a daunting task due to the technical
vocabularies involved and the many other ongoing
activities of the participants.

• Consideration of the requirements for transferring the
project to other potential users. GIS applications gen-
erally use large databases. User microcomputers
may not have the CPU, RAM, and storage capacity
necessary for the DSS application and often have a
limited number of options for data transfer.

• Concern about costs and time associated with the
expansion of the DSS to other basins in the state.
This project was focused on one of the 12 major
hydrologic regions in Kansas. Funding options, pro-
ject scope, and system refinements based on the
physical characteristics of the other basins need to
be pursued.

The KWO learned that clearly defining a single DSS
application at the outset of the project is critical. The
KWO originally believed that the DSS could be devel-
oped with general descriptions of the broad range of
program applications, utilized by multiple agencies, that
could benefit from the DSS. Each participating agency
216
-------
could bring its programs and needs to the IPAG for areas for further planning activities, the IPAG became
discussion; the resulting DSS would then serve those more confident in its advisory role. Upon completion of
multiple programs and needs. Instead, the ambiguity of the project, the IPAG members could readily identify
the objective confused the IPAG. Once the IPAG chose how the DSS could be enhanced, modified, or directly
to focus on a single application, the assessment of used in their own programs.
riparian corridor value and vulnerability to target priority
217
-------
Design of GIS Analysis To Compare Wetland Impacts on Runoff in Upstream
Basins of the Mississippi and Volga Rivers
Tatiana B. Nawrocki
Natural Resources Research Institute, University of Minnesota, Duluth, Minnesota
Introduction

The attention given in hydrologic studies to wetlands
differs significantly between the United States and Rus-
sia at the present time. Fundamental theories and
mathematical models are developed in both countries to
describe hydrologic processes and impacts of water-
shed conditions on surface runoff. In the United States,
however, theoretical investigations are directly pointed
at wetlands and are supported by large-scale field stud-
ies and advanced technological capabilities to manage
spatially distributed information. Unlike in Russia, in the
United States, special scientific symposia are devoted
to wetland hydrology, where major tasks for hydraulic
and hydrologic research needs are formulated. Among
these tasks are the understanding and assessment of
relationships between various hydrologic modifications
and wetland functions, especially wetland flood convey-
ance and water quality protection functions (1). Water-
shed-scale comprehensive field studies of wetland
functions are underway, for example, at constructed
experimental wetlands in the Des Plaines River basin in
Illinois (2). A new long-term goal—strategic restoration
of wetlands and associated natural systems—has been
formulated (3).

The intensive efforts of many U.S. scientists yielded
numerous results and attracted more attention to the
complicated nature of wetlands processes. Wetlands
were evaluated as runoff retention basins, and it was
found that, in northwestern states, up to 12 inches of
water could be accumulated per wetland acre (4, 5).
Overtime, piecemeal loss and degradation of wetlands
in many areas of the United States have seriously de-
pleted wetland resources. Researchers also discovered
that adverse impacts from wetland degradation could
appear indirectly with little obvious spatial or temporal
connections to sources. As described by Johnston (6):

Cumulative impacts, the incremental effect of an
impact added to other past, present, and reasonably
foreseeable future impacts, has been an area of
increasing concern. . . . Impacts can accumulate
over time or over space and be direct or indirect. An
indirect impact occurs at a location remote from the
wetland it affects, such as the discharge of pollut-
ants into a river at a point upstream of a wetland
system.

The process of solving environmental problems related
to wetlands is increasingly complex. Analyzing diversi-
fied data over increasingly broad areas becomes essen-
tial for making competent decisions.

Comparing wetland hydrologic functions in headwaters
of the Mississippi River (United States) and the Volga
River (Russia) could provide additional information
about how alternative management strategies affect
runoff, peak flow, and water quality under changing
climates. A macro-scale "field experiment" in both of
these naturally similar areas is already underway. Wet-
land conservation as opposed to drainage is now the
prevailing policy in the upper Mississippi basin. In Rus-
sia, however, economic problems have prevented this
type of policy from becoming a priority. Instead, peat
mining, reservoir construction on lowlands, and drain-
age for farming and private gardening are common.

This project, which is being implemented at the Natural
Resources Research Institute (NRRI), University of Min-
nesota, Duluth, has the following goals:

• Developing a multilayered hierarchical base of geo-
graphic information system (GIS) data for headwater
watersheds of the Mississippi and Volga Rivers.

• Developing a comparative analysis of wetland im-
pacts on the hydrology of the rivers.

• Studying the relationships between natural and hu-
man-induced factors on wetland functions under cli-
mate change and variable strategies of wetland
conservation.
218
-------
• Defining criteria and thresholds for wetland system
stability with regard to flood risk and water quality.

• Outlining recommendations for wetland management
in the headwaters.

The methodology for comparative assessments in-
volves statistical analysis, hydrologic models, CIS, and
remote sensing. Representative watersheds will be
studied in more detail, and procedures for scaling infor-
mation from the local to the regional level will be
developed.

Input Data

In recent years, U.S. governmental and state agencies,
as well as a number of private companies, have ex-
pended considerable efforts to compile the available
data in CIS format for multidisciplinary analysis of wa-
tershed problems. Among the major sources of informa-
tion essential for studies of wetland hydrologic functions
are:

• The National Wetlands Inventory, conducted by the
U.S. Fish and Wildlife Service.

• Digital elevation maps (OEMs) developed by the U.S.
Geological Survey (USGS).

• Major and minor watershed boundaries, outlined for
Minnesota by the Minnesota Pollution Control Agency
(MPCA).

• The water quality sampling network from the U.S.
Environmental Protection Agency (EPA).

• The digital chart of the world (DCW), issued by the
Environmental Science Research Institute (ESRI) in
scale 1:1,000,000.

These and other sources, listed in Table 1, were used to
compile the map illustrations for this paper.

Almost no similar data in CIS form could be found for
the territory of the former USSR, however. Any specific
data (e.g., detailed maps, hydrology records, water
quality sampling data) are generally in paper files dis-
persed among many agencies and are hard to obtain.
The forms of information storage and means of its analy-
sis are out of date, and most maps exist in single or few
copies in paper files. In Russia, the time lag grows
between the dynamic changes in the environment and
the traditional pattern and inertia of management
structures.

The CIS situation in Russia developed some interesting
paradoxes. During the first few decades of space pro-
grams, certain state agencies accumulated an outstand-
ing bank of world image data. When economic
hardships hurt the previously privileged space industry,
numerous joint ventures with foreign companies were
created to distribute images on the world market for hard
currency. These data are hardly available for domestic
uses, however. The current domestic price for image
data is 20,000 to 25,000 rubles for a black-and-white
picture of an area 60 by 60 kilometers, or 60,000 to
70,000 rubles for the same image on a computer disk.
With the present level of funding for scientific research,
the price is too high. Security regulations still restrict
access to later data, showing land use changes.

Another paradox is scientists' attitude toward their data.
Abandoned by the state, agencies and institutes are
reluctant to share their specific data in multidisciplinary
projects. Data files are now a commodity. Accomplishing
an overlay and integration of special data coverages,
which is essential to any watershed CIS study, is almost
impossible.

The third paradox is the attitudes of local, regional, and
central authorities toward CIS. Many authorities are still
ignorant about the potential of this technology. Those
who are knowledgeable prefer not to promote CIS for
watershed-related tasks because it involves land use
analysis. With the onset of land privatization, the best
pieces of property (e.g., the waterfront lots adjacent to
drinking water reservoirs in the Moscow region) are
rapidly allocated to the most powerful landowners. Thus,
limiting access to this kind of information is deemed
safer.

Experts in Russia have not yet applied CIS to wetland
hydrology studies because CIS is still a very new and
mostly unfamiliar technology. This makes the current
study unique both for its results and for its application of
CIS methodology.

Closer review of data sources indicates that most of the
input data for the project is available, though dispersed
among many agencies. Table 1 is a preliminary list of
data and data sources.

Project Design

The project addresses the following questions:

• How do the extent and positioning of wetlands in the
headwaters of large rivers affect runoff and peak
flow?

• What are the spatial relationships between wetland
and other land uses regarding flood risk and water
quality under variable climate conditions?

• What is the role of wetlands for diffuse pollution pre-
vention and sediment deposition control under alter-
native management?

• What determines major criteria for wetland conserva-
tion in headwaters, ensuring environmentally sustain-
able development under multiobjective land and
water resource uses?
219
-------
Table 1. Data Sources for a GIS Study of Wetlands in the Basins of the Mississippi (United States) and the Volga (Russia)
Data
Level 1
Base maps
Stream network
Urban and rural areas
Wetlands, unclassified
Forests
Agricultural lands
Level 2
Watershed boundaries
Digital elevation maps
Digital orthophotos
Soils
Hydrologic records
Water quality records
Land uses
Wetlands, classified
United States
DCW3
DCW3
DCW3
NWIa
DCW3
LMDb'c

MPCAa
USGSa
USGS, LMICa
SCSb'c
USGSb
EPA/MPCAa'b
LMIC, LSAT3
NWIa
Russia
DCW3
DCW3
DCW3
DCW3
MGUb
MGUb

RWRCC
NA
CD, RPIb'c
RPIC
CHMb'c
CHM, RCP, RPIC
LSAT, CD, RPI3'C
RPIC
Data available in GIS ARC/INFO format.
bDatabases; needed conversion to ARC/INFO.
cData available in paper files; needed digitizing.
Key:

CD = Commercial distributors
CHM = Russian Committee on Hydrometeorology
DCW = ESRI digital chart of the world
EPA = U.S. Environmental Protection Agency
LSAT = Satellite image data
LMD = Published literature and map data
LMIC = Minnesota Land Management Information Center
MGU = Moscow State University
MPCA = Minnesota Pollution Control Agency
NA = Data not available
NWI = U.S. National Wetland Inventory
RCP = Russian State Committee on Natural Resources and Conservation
RPI = Miscellaneous planning agencies and research institutes
RWRC = Russian Water Resources Committee
SCS = U.S. Soil Conservation Service
USGS = U.S. Geological Survey
Tasks established to address these questions include:

• Developing a multilayered hierarchical base of GIS
data for headwater watersheds of the Mississippi and
Volga basins.

• Performing a comparative analysis and simulation of
wetland impacts on hydrology and water quality at
representative watersheds.

• Deriving the relationships between natural and hu-
man-induced factors and wetland functions under cli-
mate change with regard to variable strategies of
wetland conservation.

• Defining the criteria and thresholds for wetland sys-
tem stability with regard to flood risk and water quality
parameters.

• Outlining recommendations for land and water re-
sources management and wetland positioning in the
headwaters.
GIS is the essential tool for manipulating and integrating
the many types of spatial data on water resources, soils,
vegetation, land use, economics, and the environment.
GIS compiles many sources (e.g., maps, field notes,
remote sensing, statistical data) into a consistent, inter-
pretable database used for specific scientific goals and
development decisions. The user can run GIS ARC/INFO
software on workstation and PC platforms and apply the
hierarchical approach to GIS data management, devel-
oped earlier (7). At the task level, data resolution and
corresponding modeling tools vary.

Level 1 contains the basic reference information for
large regions (e.g., the Upper Volga and the Minnesota
portion of the Upper Mississippi River basins). It covers
an area of several hundred thousand square kilometers,
with a map scale approaching 1:1,000,000. Landsat
thematic map data and the DCW (8) are used as
sources of data at this level. Vogelmann et al. (9) dem-
onstrated the methodology for detection of freshwater
wetlands using remote sensing data based on maximum
220
-------
likelihood supervised classification. A graphic data file
on CIS focuses on basic physical characteristics such
as stream network, geology, soils, wetland classifica-
tion, and other major land uses. A complementary tabu-
lar database or attribute file contains information on
stream flow, water quality, pollution sources, and wet-
land impacts on material fluxes. At this level, the general
physiographic and statistical information is accumulated
and analyzed, territories are classified, and major prob-
lems and typical case study watersheds are defined.
This information is compiled from literature, cartographic
data in paper and digitized form, statistics, and space
image data.
In Level 2, the more detailed CIS analysis and scenario-
based modeling is implemented at the watershed scale
with a map scale of approximately 1:25,000. The water-
shed demonstration focuses on alternative approaches
to priority-setting in wetland management, climate im-
pact analysis, and resulting interactions with landform,
soils, biosphere, and runoff. The sources of data are
special, topographic maps and air photo interpretation.
Simulation studies of water balance and fluxes among
the various reservoirs are implemented at Level 2. De-
veloping procedures for scaling information from the
local to regional level is the important task at this level.
CIS assists in handling the input parameter library and
analyzing the output. CIS studies, involving area meas-
urements and distribution analysis, evaluate cumulative
impacts on runoff and its quality from the loss of wetland
area, caused by drainage or filling, under stationary and
changing climate.

Wetland functions are considered under two sets of
scenarios. Management scenarios compare different
wetland and farming allocations, conservation practices,
and agricultural chemical use. Climate scenarios as-
sume rainfall and temperature changes under global
warming. Scenario-based simulation is applied in the
analysis of watershed runoff, wetland moisture regime,
soil erosion, and water quality processes.

Methodology
CIS database structure is related to the selected meth-
odology. CIS serves as a linking tool for input-output
data analysis and transfer between models, used at
different levels and stages of studies.
Scientists in both the United States and Russia devel-
oped statistical methods to obtain quantitative relation-
ships between stream flow and wetland area in the river
basins. Johnston (6) summarized the U.S. findings:
Empirical equations for predicting streamflow, de-
veloped by U.S. Geological Survey in Wisconsin
and Minnesota, indicate that flood flow is propor-
tional to the negative exponent of wetlands and
lakes ratio on a watershed (10,11). This means that
relative flood flow is decreased greatly by having
some wetlands in a watershed, but a watershed with
a large proportion of wetlands does not reduce flood
flow much more than a watershed with an interme-
diate proportion of wetlands. For example, predicted
flood flow was 50 percent lower in Wisconsin water-
sheds with 5 percent lakes or wetlands than it was
in watersheds with no lakes or wetlands, but in-
creasing the proportion of lakes and wetlands to 40
percent decreased relative flood flow by only an
additional 30 percent (12).

Other estimates agree that wetland encroachment on a
watershed of less than 25 percent generally has a mini-
mum influence on peak flow (5, 13, 14).

Johnston and colleagues (15) applied these equations
to watersheds in central Minnesota. They found that a
watershed with 1.6 percent lakes and wetlands had a
flow per unit watershed area that was 10 times the flow
predicted for a watershed with 10 percent lakes and
wetlands, while watersheds with 10 to 50 percent lakes
and wetlands had about the same flood flow per unit
area.

Statistical analysis indicates that peak discharge in-
creases with decreasing wetland area within the drain-
age basin. The regression equation defines the
approximation for northwestern Minnesota (16):
QE.R A A 0.677 n N-0.506
AM=58.4AW (Ls)

LS=100(AL + AM)/AW+1

where:
QAM = arithmetic mean of the annual series, cubic
feet per second.
Aw = watershed area, square miles.
AL = lake area within the watershed, square miles.
AM = marsh area within the watershed, square miles.

A similar statistical approach was developed for peak
flow determination in Russia. Maximum flow discharge
from snow melt is calculated for the central European
zone as (17):

Qm=K0*hp*S1 *S2*S3/(A + 1)n

where:
Qm = flow discharge, cubic meters per second.
KQ = coefficient of flood concurrence, KO = 0.006
for plain river basins.
hp = calculated flood runoff for given probability,
millimeters.
A = drainage area, square kilometers.
n = coefficient, n = 0.17.
221
-------
B! = lake storage coefficient, if lake area is less
than 1 percent of A, then S1 = 1.
S2 = pond and reservoir storage coefficient, 82 =
0.9 with ponds and S2 = 1 without ponds.
S3 = combined wetland and forest storage
coefficient.
S3 = 1 - 0.8 Ig (0.05 Sf + 0.1 Sw + 1), where Sf
and Sw are forest and wetland area,
percentage to total drainage area.

Calculated flood runoff for a given probability, hp, is
determined based on average flood runoff h, millimeters,
coefficient of variation Cv, and tabulated parameter F, as
follows:

hp = (1 + F * Cv) h

h = K, hk

hk = 100 millimeters for Moscow region.
K, = land surface coefficient, Kt = 0.9 for plains
and sandy soils, K, = 1.1 for hills and clay
soils.

The studies mentioned above generally agree with an
assumption that the incremental loss of wetland area
would have a small effect on flood flow from watersheds
with 10 percent up to 40 to 50 percent wetlands, but a
large effect on flood flow from watersheds with less than
10 percent wetlands.

The existence of similar thresholds was found in relation
to wetlands abilities to intercept pollutants. As Johnston
stated (6),

The same 10 percent threshold was identified by
Oberts (18) for suspended solids, a measure of
water quality function. Stream-water draining water-
sheds having 10 to 20 percent wetlands had about
the same loading of suspended solids, so the con-
tribution of suspended solids was relatively constant
per unit area of watershed. However, the water-
sheds with less than 10 percent wetlands had load-
ing rates per unit area that were as much as 100
times greater than the loading rates from the water-
sheds with more than 10 percent wetlands.

CIS could be especially helpful in determining the im-
pacts on downstream water quality of the spatial posi-
tioning of wetlands within watersheds. Studies prove
that the location of wetlands can affect their cumulative
function with regard to water quality (6). In an earlier
work (15), Johnston developed an index of wetland
location and applied it to a landscape-level CIS study of
urban and rural stream watersheds in central Minne-
sota. The index is formulated as:
where:
PWP = relative wetland position.
j = stream order (19) of water quality sampling
station.
i = stream order of wetland.
Ai = area of Ah order wetlands.

Calculated values for the index ranged from 0 (i.e., all
wetlands were on streams of the same order as that of
the sampling station) to 2.6 (i.e., average wetland posi-
tion was 2.6 stream orders upstream of the sampling
station). Watersheds with wetlands located close to
sampling stations had significantly better water quality
(i.e., lower concentrations of inorganic suspended sol-
ids, fecal coliform, and nitrate; lower flow weighted con-
centrations of ammonium and total phosphorus) than
watersheds with wetlands located far from sampling
stations (6).
The review of methodological approaches, as shown
above, indicates that parameters describing wetland ex-
tent, positioning, and land surface characteristics are of
universal significance for any comprehensive water-
shed-scale wetland study.
In the current study, CIS is used at Level 1 to evaluate
wetland area per watershed and to develop input pa-
rameters for relative wetland position assessment. The
comparison and selection procedures for case study
watersheds in the Volga and Mississippi basins are
based on these values. The parameters, derived from
CIS, are as follows:
1. Total watershed area.
2. Lake, pond, and reservoir area.
3. Forest area.
4. Wetland area.
5. Ratio of wetland area to total watershed area.
6. Wetland area by subwatersheds of different order.
7. Relative wetland extent by subwatersheds of different
order.
8. Land surface coefficients.
Parameters listed in groups 1 through 4 are obtained
directly from CIS attribute tables as values of "area"
items for the respective land cover polygons. Parame-
ters 5 through 7 require calculations relating values of
area items for different polygon coverages. Land surface
coefficients (group 8) could be determined indirectly
based on basic soil, land cover, and topography data.
Most U.S. methodologies use hydrologic soil groups,
222
-------
based on soil permeability, rates of infiltration, and Soil
Conservation Service (SCS) runoff curve numbers (20).
Some Russian methodologies have adopted similar em-
pirical land surface coefficients. For example, for central
European Russia, this value varies from Kt = 0.9 for
plains and sandy soils, to Kt = 1.1 for hills and clay soils
(17).

Level 2 of analysis applies two hydrologic simulation
models:

• The Agricultural Watershed Runoff and Water Quality
Model (Agricultural Nonpoint Source Pollution Model
[AGNPS]), developed by the Agricultural Research
Service of the U.S. Department of Agriculture, con-
tains explicit procedures to evaluate the impacts of
management practices and landscape feature posi-
tioning on watershed runoff. AGNPS is a cell-based
runoff model that estimates water volume, peak flow,
eroded and delivered sediment, chemical oxygen de-
mand, and nutrient export from watersheds (20-22).

• The Forest Runoff Watershed Model (FRWM) com-
bines analytical and numerical methods for solving
hydro- and thermodynamics equations (23). This
model considers snow melt constituent in runoff in
more detail than does AGNPS. Hydrologic simulation
is based on physical process descriptions for snow
cover dynamics, freezing and thawing of soil, soil
moisture dynamics in frozen and thawed soils, inter-
ception of liquid and solid precipitations by vegeta-
tion, surface runoff, ground-water aquifers, and
channeled streams.

Both models use a similar set of watershed input data,
derived from CIS (e.g., elevations, slopes, channel
slopes, stream network configuration, soil texture, land
cover). The methodology, linking CIS with hydrologic
models, was already tested in the wetland study project
at the Voyageurs National Park in Minnesota. The
ARC/INFO GRID module was used to derive watershed
variables for input to AGNPS. CIS then presented and
interpreted the scenario-based results of the simulation
(24). The typical stages of such an analysis and inter-
pretation for a watershed-scale area are presented in
Figures 1 through 5.

Case Study Watersheds

The areas where wetland impacts on runoff are evalu-
ated are located in Minnesota (United States) and Mos-
cow and adjacent regions (Russia) (see Figures 6
through 15). They have mixed urban, rural, recreational,
and forest land uses. Both regions have a variety of
development pressures. The relative effects of different
alterations in watershed management are distinguished
and quantified. CIS provides metrics for comparative
assessments and analysis of related variables for both
areas.
Table 2 and Figures 7 through 14 present a general
overview of wetland extent in both areas. The case
study subwatersheds used for more detailed analysis
will include tributaries of the second and third order. At
this stage, several watersheds are considered for more
detailed analysis. The limitations imposed by data avail-
ability as well as by project resources could affect the
final selection. Table 2 serves, therefore, as a prelimi-
nary overview of several areas that could potentially be
adopted for more detailed studies.

CIS analysis shows that in the Upper Volga, wetlands
extent very much depends on allocation of populated
areas. The heavily urbanized Moscow metropolitan area
affects a large territory of many thousands of square
kilometers. The ratio of wetlands as a percentage of total
land area is one-tenth of that in the neighboring Tver
area, which has the same size but a smaller population
(see Figure 11). In areas of intensive agriculture (e.g.,
the Pronya basin located southeast of Moscow), almost
all wetlands were drained and have not existed for
several decades.
In Minnesota, despite the growing urbanization (e.g., the
Twin Cities area [7,330 square kilometers]), about half
of the presettlement wetlands still remain (25); wetlands
occupy 442 square kilometers, or over 6 percent of the
land area; and shallow lakes constitute an additional 114
square kilometers (1.56 percent). Some watersheds
within the Twin Cities metropolitan area have a high
wetland percentage, such as 18.9 percent in the Lam-
berts Creek watershed. Intensive studies with CIS ap-
plication of landscape feature functioning and wetland
impacts on stream flow and water quality demonstrated
an innovative approach and made detailed databases
available for this area (15).

Preliminary comparative analysis indicates that two
pairs of case study watersheds could be initially se-
lected for further studies in the Mississippi and Volga
basins:
• Upstream watersheds with wetlands area of 15 to 20
percent (Tver region in Russia and Cass and adja-
cent counties in Minnesota).

• Tributary watersheds downstream with wetlands area
of 1 to 2 percent (the Istra basin in Russia and sub-
watersheds of the Minnesota River basin, located in
Sibley, Scott, and adjacent counties in Minnesota).

Case study watersheds in the Mississippi and Volga
basins are situated on gently rolling plains in mixed
forest zones with southern portions extending into the
forest/steppe and prairies. The Quaternary sediments
are of glacial, glaciofluvial, lacustrine, and alluvial origin.
Wetlands have hydric soils with various degrees of gley
process development and/or peat accumulation, varied
by wetland type and soil moisture regimen (28). The
annual precipitation is 500 to 600 millimeters with similar
223
-------
Variable
Semiconstant
Constant

Climate

Resource Use
and
Control Practices

Soil and
Physiographic

Factors
t
Weather
Data
i
Hydrological

Runoff

Land Cover
and
Biota
J -

Water
Quality

Soil
Properties
_ _ t

Soil
Quality

—

Elevation
t
Biological
Wildlife
Habitat

Landscape Feature
Positioning Alternatives
and Recommendations
Improved Methodology for
Spatial Process Analysis
Input
Parameters
Processes
Analysis Based on
Models and CIS
Output Results
Demonstrated
on CIS
Figure 1. Conceptual framework of linking GIS and models for environmental management.
Hypsometric curve
With Ponds
40 -

20 ..
v
340
360 380
Value
400
Outlet
Elevations (meters)

I—I <350
era 350 to 360
m 360 to 370
m 370 to 380
&B >380
Beaver Ponds
Beaver Wetlands
Wooded Wetlands
Coniferous Forest
Deciduous Forest
0 500 1,000 1,500 2,000
Figure 2. Stream network configuration derived from GIS ele-
vation map.
Figure 3. Scenarios of land use.
224
-------
With Ponds
With Ponds
500 1,000 1,500 2,000
500 1,000 1,500 2,000
Figure 4. Scenario-related hydrologic curve numbers.

Table 2. Comparative Data on Wetland Extent in Minnesota and in the Upper Volga Basin (8, 25-27)
Figure 5. Patterns of sediment transfer between cells (%),
+ deposition, - erosion.
Total Area
Wetland Area
Region
United States
Minnesota
Beltrami Co.
Cass Co.
Hubbard Co.
Le Sueur Co.
Hennepin Co.
Sibley Co.
Wright Co.
Scott Co.
Lambert Creek
Russia
Tver region
Moscow region
Istra basin
Pronya basin
(Square Kilometers)

205,940.30
7,923.04
6,256.38
2,624.81
1 ,204.82
1 ,588.07
1 ,555.38
1 ,852.97
982.56
9.51

10,000.00
10,000.00
1 ,827.38
10,200.00
(Square Kilometers)

30,500.00
3,909.23
1,505.29
283.22
28.31
36.37
24.26
24.27
8.06
3.69

1,169.08
165.39
24.07
a
Percentage

14.80
49.34
24.06
10.79
2.35
2.29
1.56
1.31
0.82
18.90

16.90
1.70
1.32
a
Wetland area is insignificant and not identified by available maps.
225
-------
1 - Doibiza, 2 - Istra, 3 - Pronya
Figure 6. Location of study areas in the Volga basin.

Volga Basin, Russia
Land Features

• Urban Lands
H Peat Bogs
a Wetlands
£±J
M Reservoirs and Lakes
il
/J Rivers
71 Roads
Statistics %
Total Area 100.00
Urban 12.38
Water 1.58
Wetlands 1.66
Kilometers
10
20 30
40
Figure 7. Wetlands and urban lands in the Moscow region.
226
-------
Land Features
• Urban Lands
1 Peat Bogs
I Wetlands
&J
H Inundated
^
I] Reservoirs and Lakes
7] Rivers
?] Roads

Statistics %
Total Area 100.00
Urban 1.22
Water 4.64
Wetlands 16.93
Kilometers
10 20 30 40
Figure 8. Wetlands and urban lands in the Tver region.
Wetlands, 1954
1 Wetlands, 1980s
Urban Lands
Reservoirs and Lakes
30
40
Figure 9. Wetland decline since 1954, Moscow region.
227
-------
Volga Basin, Russia
£59
Land Features

Urban Lands
(Population, Thousands)
Peat Bogs

Wetlands

Reservoirs and Lakes

Rivers

Roads

£| Watershed

Elevation (meters)

Square
Kilometers %
Total Area 1,827.38 100.00
Urban 61.31 3.36
Water 40.43 2.21
Wetlands 24.07 1.32

Kilometers
10
20
30
40
Figure 10. Istra watershed in the Moscow region.

Wetlands Moscow Region, Russia
Populated Areas
t ] <10 EH2 20-29 m 40-49 ^ffl 75-99
10-19 [T-T] 30-39 50-74 100
Kilometers
50 100 150 200
Figure 11. Land cover, percentage of total in Moscow and Tver regions.
228
-------
Presettlement
Current
Wetland Drop, Percentage
and Location of Study Areas
0-10
11-20
21-30
31-40
41-50
51-60
61-70
>70
Kilometers
100 200 300 400
Figure 12. Wetlands in Minnesota, percentage of total area.
Mississippi Basin, Minnesota
Land Features
H Urban Lands

™j Forests

H Wetlands
3S55S
H Reservoirs and Lakes
2 Rivers |/\/| Roads

2| Watershed |/'v'| Counties
Statistics
Total Area
Urban
Water
Wetlands*
100.00
0.04
14.5
18-24
'Indicated by County, %
Kilometers
10
20
30
40
Figure 13. Wetlands and urban lands in Cass County area.
229
-------
Mississippi Basin, Minnesota
a.
Land Features
1 Urban Lands
] Forests
Wetlands
Reservoirs and Lakes
Rivers I/S/I Roads
Watershed |/ V' | Counties
Statistics %
Total Area 100.00
Urban 9.0
Water 4.5
Wetlands* 1-2
Kilometers
'I •••-'! I I
10 20 30 40
Figure 14. Wetlands and urban lands in Twin Cities area.
Mississippi Basin, Minnesota
Major Watershed I/ v I Clean Streams
/ \''| Minor Watersheds L-'Vl Impaired Quality
Urban
Lakes
Sampling Sites
Kilometers
i
10
20
30
40
Figure 15. Minnesota River watershed in Twin Cities area.
230
-------
seasonal distribution in both areas. Average runoff
ranges from 150 to 250 millimeters (7, 29).

Conclusion

The current status of the project indicates that most of
the input data is available, though dispersed among
many agencies. In both the United States and Russia,
research methodologies have been developed and ap-
plied to study landscape feature impacts on runoff quan-
tity and quality based on simulation and statistical
analysis. The comparative analysis of hydrologic and
diffuse pollution processes on watersheds in the Upper
Mississippi and Upper Volga basins will allow derivation
of metrics of wetland loss relative to impacts on runoff
and water quality.
The applications of CIS to watershed hydrology are
currently much more advanced in the United States than
in Russia. Initiatives emerging in the United States,
however, could considerably promote CIS use in Rus-
sia. Such promotion is beneficial for several reasons.
First, this kind of cooperative political activity is in full
compliance with the 1992 Freedom Support Act, ap-
proved by the U.S. Congress. Second, support of CIS
as a new information technology will create a favorable
infrastructure in many bilateral economic fields and busi-
nesses. Third, a better meshing of the CIS systems in
the two countries will lead to further international coop-
eration in responding to global changes.
Project implementation also helps meet the goal of pro-
viding a basis for sound environmental, technical, and
economic decision-making on the use of natural re-
sources. This knowledge is essential in developing prac-
tical guidelines for sustainable economic development
through applied research and technologies.

Acknowledgments

The Water Quality Division of MPCA provided valuable
assistance with CIS data for Minnesota. The National
Science Foundation (NSF/EAR-9404701) contributed
research support. Any opinions, findings, and conclu-
sions or recommendations expressed in this material
are those of the author and do not necessarily reflect the
views of the National Science Foundation.

References
1. Wetland management: Hydraulic and hydrologic research needs.
1987. In: Wetland hydrology: Proceedings of the National Wet-
land Symposium, Chicago, IL (September 16-18). Association of
State Wetland Managers, Inc. pp. 331-333.
2. Sather, J.H. 1992. Intensive studies of wetland functions: 1990-
1991 research summary of the Des Plaines River Wetland dem-
onstration project. Technical Paper No. 2. Wetland Research, Inc.
3. Hey, D.L. 1985. Wetlands: A strategic national resource. National
Wetlands Newsletter 7(1): 1-2.
4. Kloet, L. 1971. Effects of drainage on runoff and flooding within
the Pembina River basins, North Dakota and Manitoba. Unpub-
lished report by the Bureau of Sport, Fisheries, and Wildlife (De-
cember).

5. Simon, B.D., L.J. Stoerzer, and R.W. Watson. 1987. Evaluating
wetlands for flood storage. In: Wetland hydrology: Proceedings
of the National Wetland Symposium, Chicago, IL (September
16-18). Association of State Wetland Managers, Inc. pp. 104-112.

6. Johnston, C.A. 1994. Cumulative impacts to wetlands. Wetlands
14(1):49-55.

7. Higgins J.M., T.B. Nawrocki, and N.A. Nazarov. 1993. Hierarchi-
cal approach to integrated watershed management: Joint
TVA/Russian demonstration project. Proceedings of AWWA
CONSERV93 Conference, Sessions 4B-1 through 7C-3, Las
Vegas, NV. pp. 1,177-1,197.

8. Environmental Science Research Institute. 1993. Digital chart of
the world.

9. Vogelmann, J.E., F.R. Rubin, and D.G. Justice. 1991. Use of
Landsat thematic mapper data for fresh water wetlands detection
in the Merrimack River watershed, New Hampshire (unpub-
lished).

10. Conger, D.H. 1971. Estimating magnitude and frequency of
floods in Wisconsin. Open File Report. Madison, Wl: U.S. Geo-
logical Survey.

11. Jacques, J.E., and D.L. Lorenz. 1988. Techniques for estimating
the magnitude and frequency of floods in Minnesota. Water Re-
sources Investigation Report 87-4170. St. Paul, MN: U.S. Geo-
logical Survey.

12. Novitzki, R.P. 1979. Hydrologic characteristics of Wisconsin's
wetlands and their influence on floods, stream flow, and sedi-
ment. In: Greeson, P.E., J.R. Clark, and J.E. Clark, eds. Wetland
functions and values: The state of our understanding. Minneapo-
lis, MN: American Water Resources Association, pp. 377-388.

13. Larson, L.A. 1985. Wetlands and flooding: Assessing hydrologic
functions. Proceedings of the National Wetland Assessment Sym-
posium, Portland, ME (June 17-20). pp. 43-45.

14. Ogawa, H., and J.W Male. 1986. Simulating the flood mitigation
role of wetlands. J. Water Res. Planning and Mgmt. 112(1).

15. Johnston, C.A., N.E. Detenbeck, and G.J. Niemi. 1990. The cu-
mulative effect of wetlands on stream water quality and quantity:
A landscape approach. Biogeochemistry 10:105-141.

16. Moore, I.D., and C.L. Larson. 1979. Effects of drainage projects
on surface runoff from small depressional watersheds in the
North Central Region. WRRC Bulletin 99. Water Resources Re-
search Center, University of Minnesota.

17. Maslov, B.S., I.V. Minaev, and K.V. Guber. 1989. Reclamation
manual. Rosagropromizdat (in Russian).

18. Oberts, G.L. 1981. Impacts of wetlands on watershed water qual-
ity. In: Richardson, B., ed. Selected proceedings of the midwest
conference on wetland values and management. Navarre, MN:
Freshwater Society, pp. 213-226.

19. Horton, R.E. 1945. Erosion development of streams. Geol. Soc.
Amer. Bull. 56:281-283.

20. Chow, V. 1964. Handbook of applied hydrology. New York, NY:
McGraw-Hill.

21. Wishmeier W.H., and D.D. Smith. 1978. Predicting rainfall erosion
losses: A guide to conservation planning. Agriculture Handbook
No. 537. U.S. Department of Agriculture.
231
-------
22. Young, R.A., C.A. Onstad, D.D. Bosch, and W.P. Anderson. 1987.
AGNPS: Agricultural nonpoint-source pollution model. U.S. De-
partment of Agriculture Conservation Research Report 35.

23. Nazarov, N.A. 1988. Model formation of the flood hydrograph of
Northern Plain Rivers. Water Resour. 15(4):305-315.

24. Nawrocki, T., C. Johnston, and J. Sales. 1994. CIS and modeling
in ecological studies: Analysis of Beaver Pond impacts on runoff
and its quality. Voyageurs National Park, Minnesota, case study.
NRRI Technical Report 94/01 (February).

25. Anderson, J.P., and W.J. Craig. 1984. Growing energy crops on
Minnesota wetlands: The land use perspective. Minneapolis, MN:
Center for Urban and Regional Affairs, University of Minnesota.
pp. 1-88.
26. Mitsch, W.J., and J.G. Gosselink. 1986. Wetlands. New York, NY:
Van Nostrand Reinhold.

27. Brown, R.G. 1987. Effects of wetland channelization on storm
water runoff in Lamberts Creek, Ramsey County, Minnesota. In:
Proceedings of the National Wetland Symposium on Wetland
Hydrology, Chicago, IL (September 16-18). pp. 130-136.

28. SCS. 1975. Soil taxonomy: A basic system of soil classification
for making and interpreting soil surveys. Agriculture Handbook
No. 436. Soil Conservation Service, U.S. Department of Agriculture.

29. USGS. 1988. Water resources data: Minnesota, water year 1988.
Vol. 2, Upper Mississippi and Missouri River basins. U.S. Geo-
logical Survey Data MN-88-1.
232
-------
Water Quality Applications
-------
Vulnerability Assessment of Missouri Drinking Water to Chemical Contamination
Christopher J. Barnett, Steven J. Vance, and Christopher L. Fulcher
Center for Agricultural, Resource, and Environmental Systems, University of Missouri,
Columbus, Missouri
Introduction

In 1991, the Missouri Department of Natural Resources
(MDNR) implemented the Vulnerability Assessment of
Missouri Drinking Water to Chemical Contamination pro-
ject. MDNR's Public Drinking Water Program (PDWP)
contracted with the Center for Agricultural, Resource,
and Environmental Systems (CARES) to conduct this
assessment. They designed the project to determine
which, if any, public water supplies are threatened by
chemicals being tested under the Safe Drinking Water Act.

Under Phase II of the Safe Drinking Water Act, the
United States Environmental Protection Agency (EPA)
required that all public drinking water systems be rou-
tinely monitored for 79 contaminants beginning January
1,1993. If a selected chemical parameter is not detected
in an area that would affect a water supply (where
"detected" is defined as used, stored, manufactured,
disposed of, or transported regardless of amount), then
the water supply need not be tested for that chemical.
Instead, that system would be granted a use waiver,
meaning that the state would not test for that chemical.
EPA grants use waivers for 43 of the 79 contaminants.
Use waivers can result in considerable cost savings.

Because use waivers are granted based on the spatial
relationship between drinking water sources and con-
taminant sources, accurate positional data needed to be
collected for those items. A geographic information sys-
tem (CIS) was used to store and analyze this informa-
tion in a spatial context.

Water Sources

Water sources, as defined for this study, are the points
where water is drawn from a river, lake, or aquifer for
use in a public water supply. Our efforts focused primar-
ily on the development of the water source layers for the
CIS. These layers, containing wellheads, impoundment
intakes, and river intakes, were created in house or
obtained from state and federal agencies. MDNR
regional office personnel inspected these water source
layers in the spring of 1993. Since these personnel
routinely inspect Missouri public drinking water supplies,
their knowledge of these locations is exceptional.
The updated water source information was mapped
on 1:24,000-scale USGS topographic quadrangles at
the regional offices, then entered into the CIS. MDNR's
PDWP provided available attribute information, which
was associated with these layers. The layers offer the
most accurate and current information available. Only
the community (e.g., cities, subdivisions, mobile home
parks) and the nontransient, noncommunity (e.g.,
schools, large businesses) water supply systems were
considered for water source mapping. This study did not
consider private wells.

The information is stored in the CIS in the form of
geographic data sets or layers. The wellhead layer con-
tains 2,327 public wells and their attributes (e.g., well
depth, casing type). The majority of the wellheads are
located in the Ozarks and Southeast Lowlands. Natu-
rally poor ground-water quality prohibits a heavy reli-
ance on ground water for drinking water in other areas
of the state. The surface water impoundment layer con-
tains 105 points representing the intake locations for
systems that rely on lake water. Additionally, the drain-
age basin and lake area are mapped for these systems.
The majority of the systems that rely on lake water are
located in northern and western Missouri. The final layer
represents the systems that use river water. The major-
ity of the 50 intakes are located on the Mississippi and
Missouri Rivers and on the major streams in the Grand
and Osage River basins.

Contaminant Sources

Contaminant sources, as defined for this study, are the
points or areas where existing databases indicate the
presence of a chemical contaminant. Incorporation of
contaminant data into the CIS proved to be the most
difficult task. These data usually contained very precise
235
-------
information about what contaminants were found at a
site and who was responsible, but the quality of the
locational information was often poor.

Ninety-three state and federal databases were reviewed
for contaminant information before performing the final
use waiver analysis. The contaminant information was
broken into two separate types, contaminant sites and
pesticide dealerships. The contaminant sites were loca-
tions at which certain chemicals were known to exist.
The pesticide dealerships were dealerships licensed to
distribute restricted use pesticides. Information about
contaminant sites was extracted from the databases
and entered into Microsoft Excel, a spreadsheet pro-
gram. The small amount of data with coordinate (lati-
tude/longitude) or map information was readily
converted to the CIS. The majority of the contaminant
records, however, contained only address information,
often appearing as a rural route address or post office
box number.

While the water source locations were being verified,
personnel at the MDNR regional offices reviewed the
contaminant site records. The regional office personnel
were familiar with their respective territories and could
assist CARES personnel in locating the contaminant
sites. The Missouri Department of Agriculture pesticide
use investigators provided additional information about
the locations of contaminant sites. All contaminant
source information was also mapped on the 1:24,000-
scale USGS topographic quadrangles and transferred
to the CIS.

Of more than 2,800 contaminant sites found in these
databases, 88 percent were geographically located and
used in the study. At this time, the contaminant site layer
contains 2,493 points representing the information col-
lected on the 43 chemical contaminants required by
MDNR. Each point contains a seven-digit chemical code
indicating the chemical it represents and serving as a
link to the chemical contaminant files. The contaminant
sites tend to be concentrated more in urban areas than
rural areas. Even though this layer is being continually
updated, the basic distribution of contaminant sites re-
mains the same.

A second contaminant source layer represents Mis-
souri's licensed pesticide dealers. This information is
included to indicate potential contamination even though
specific chemicals at dealership locations are not
known. At this time, we have been able to locate 1,344
dealerships out of 1,650. Two types of dealerships are
included in the layer, active dealers and inactive dealers.
Of the active dealerships in 1991, 91 percent were found
and entered into the CIS. Of the inactive dealerships, 79
percent were located.
Spatial Analysis

The final parameters for the use waiver analysis were
developed from EPA and MDNR guidelines and account
for the capabilities of the CIS. These parameters were
designed to present a conservative list of the systems
that needed to be tested for the possible presence of
studied chemicals. Parameters forthe wellhead analysis
are as follows:

• A 1/4-, 1/2-, and 1-mile radius around each wellhead
was searched for contaminant sites and pesticide
dealerships (see Figure 1). Any contaminant sources
found within those radii were reported to PDWP.
(PDWP requested that the results of the three radius
analyses be reported, but the 1/2-mile radius was
used to determine the issue of the use waiver.)

• Any wellheads found within a contaminant area were
denied a use waiver for that contaminant.

• Each highway and railroad within 500 feet of a well-
head was recorded. This indicates the threat posed
by the transport of chemicals near wellheads.

• Additionally, the percentage of the county planted in
corn, soybeans, wheat, sorghum, tobacco, cotton,
and rice was listed for each well to indicate the threat
posed by agricultural chemical use within that county.

The parameters for the systems relying on lake water
are as follows:

• Any contaminant sources found within a surface
water impoundment drainage basin caused the asso-
ciated intake(s) to fail use waiver analysis for those
contaminants.
X = Contaminant
Figure 1. Use waiver search radius distances.
236
-------
• Any area of contamination overlapping a drainage
basin caused the associated intake to fail use waiver
analysis for that contaminant.

• Transportation corridors passing through a drainage
basin were noted to indicate the threat posed by
transport of chemicals within the basin.

• The percentage of the county planted in the seven
crops mentioned above was listed to indicate agricul-
tural chemical use within the drainage basin.

Many of the rivers that supply water to systems in Mis-
souri have their headwaters outside the state. To fully
evaluate the potential for contamination within those
drainage basins, we would have to collect data for large
areas outside of the state. For example, the Mississippi
and Missouri River drainage basins cover large portions
of the United States. Because collecting data for those
areas would be impractical, we have recommended to
MDNR that use waivers not be granted to river supplies.

The following provides details on how the analysis was
performed. The CIS searches around each wellhead for
each radius and notes which contaminant sites affect
which wellheads. If a contaminant falls within that ra-
dius, we recommend that the wellhead be monitored. In
this example, the well is affected by one contaminant
within the 1/4-mile radius, two within the 1/2-mile radius,
and four within the 1-mile radius.

Results

The results of the use waiver analysis indicate which
systems may be affected by the use of a chemical near
a water source. Several results show the substantial
savings realized from our analysis. For example, the
analysis showed that only five wells serving four public
drinking water systems were potentially affected by di-
oxin and should be monitored. By not testing the remain-
ing systems for dioxin, the state can realize a
considerable cost savings, as the test for dioxin is the
most expensive test to perform.

The final wellhead system analysis shows that the
1/2-mile buffer analysis affected a total of 447 well-
heads in 241 systems. That is, a chemical site or pesti-
cide dealership was found within 1/2 mile of 447 public
wellheads. A result form was generated for each of the
1,340 systems in the state listing each well or intake and
the potential threat posed by nearby contaminant
sources.

The cost of testing all wellhead systems for all 43 con-
taminants without issuing use waivers is more than $15
Table 1. Estimated Cost Savings for Public Drinking Water
Systems
Method
Estimated
Total Cost
Estimated
Mean Cost
per System
Estimated
Total Cost
Savings
No use waiver $15,533,100 $12,200 $0
With use waiver $1,813,900 $1,400 $13,719,200

million (see Table 1). According to our analysis, CARES
estimates that only $1.8 million need be spent to monitor
vulnerable wells. Therefore, the state can save more
than $13.5 million in monitoring costs.

Summary and Recommendations

To date, the investment the state made in the vulnerabil-
ity assessment project has provided many benefits. The
state saved several million dollars in testing costs and
developed several spatial and nonspatial databases that
will have many uses. In addition, the project established
a basic framework for future assessments, which EPA
requires on a regular basis.

The basic data required for use waiver analysis are the
locations of water sources and the locations of potential
contamination sources. CARES determined that the
available data did not contain the information necessary
to map these locations or that the data were of question-
able quality. Many layers required update and correc-
tion. Considerable effort was necessary to improve
existing locational information for both water source lay-
ers and chemical contaminant files. Local knowledge of
an area was heavily relied upon to determine accurate
locations, particularly contaminant sites. The vast ma-
jority of these sites contained only the address as the
geographic reference. An address is not a coordinate
system; it does not indicate a fixed location on a map.
Because the location of any chemical detection site is
of vital importance, state and federal agencies that col-
lect these data need to record more complete geo-
graphic information. Ideally, a global positioning system
could be employed to generate coordinates. Realisti-
cally, the recording of legal descriptions or directions
from an easily located point would substantially improve
the quality of the current databases.

In many cases, data resided in digital format; however,
due to regulations or lack of agency cooperation, they
could only be distributed in paper format. Reentering
data from paper format into digital format required con-
siderable time and expense. Interagency cooperation
should be emphasized to reduce unnecessary data entry.
237
-------
Reach File 3 Hydrologic Network and the Development of GIS Water Quality Tools
Stephen Bevington
Water Quality Section, Division of Environmental Management, North Carolina Department of
Environment, Health, and Natural Resources, Raleigh, North Carolina
Introduction

The application of geographic information system (GIS)
tools to water quality management is limited by the lack
of geographically referenced data describing the surface
water environment. Ongoing efforts at the local, state,
and federal level are producing a multitude of GIS data
coverages describing land use/cover and relevant water
quality data files. As these data coverages become
available, water quality managers will need to develop
new analysis techniques to take advantage of the vast
amount of geographically referenced data. A key step in
the development of analytical tools for water quality
management will be the development and maintenance
of a coverage describing the structure and hydrology of
surface waters.

Reach File 3 (RF3) is one potential source of surface
water maps and topology for the development of a
CIS-based water quality analysis tool. This paper de-
scribes a pilot project designed to examine the suitability
of RF3 as a network system for the collection, integra-
tion, and analysis of water quality data.

To be considered an appropriate water quality analysis
tool, RF3 should provide the following functions:

• Present a working environment that allows users to
explore geographic relationships between surface water
features, landmark features, and data coverages.

• Allow users to select specific stream segments, in-
cluding all points upstream and downstream of a
given point.

• Provide tools to assist users in partitioning water quality
databases into hydrologically meaningful subsets.

Reach File 3

RF3 is a hydrographic database of the surface waters
of the United States. The database contains 3 million
river reaches mapped at 1:100,000 scale. The source
for RF3 arcs were digital line graphs (DLGs).

Attribute data for RF3 arcs include the major-minor DIG
pairs, stream name, water-body type, stream order, and
a unique identifying reach number. The unique reach
numbers are structured in such a way as to provide a
logical hydrologic framework. Reach numbers can be
used to sort the database for all reaches in any specified
watershed or locate all upstream or downstream
reaches.

The U.S. Environmental Protection Agency (EPA) origi-
nally designed RF3 as a tabular data set. It evolved into
a GIS data coverage, and EPA and the U.S. Geological
Survey (USGS) will likely maintain it as a surface water
mapping standard. At present, RF3 as a GIS data layer
is not widely used for water quality applications.

RF3 Pilot Study: Upper Yadkin River Basin

The Upper Yadkin River basin (USGS h03040101) was
selected to test RF3 water quality applications (see
Figures 1 and 2). The Upper Yadkin was chosen be-
cause of the availability of water quality and stream flow
data layers in that area. Also, the Upper Yadkin RF3 file
contained arcs depicting lakes and double-line rivers as
well as simple stream networks. These two-dimensional
water features present interesting complications to net-
work routing and path-finding.
Figure 1. The Upper Yadkin River watershed, North Carolina
and Virginia.
238
-------
Figure 2. RF3 hydrography for the Upper Yadkin River basin.

Two forms of point source data were used in the study:
National Pollutant Discharge Elimination System
(NPDES) wastewater discharge points and USGS
gages. The NPDES coverage includes data on the per-
mit limits such as daily flow, dissolved oxygen, bio-
chemical oxygen demand (BOD), and ammonia. The
USGS gage coverage includes data on several flow
statistics for each USGS gage in the basin. Both data
layers contain information about the location of the site
and stream with which it is associated.

Coverages of counties and cities were also made avail-
able for geographic orientation.

Preparing the Network

The original RF3 file received from the USGS had sev-
eral topological issues that needed to be addressed
before RF3 could function as a stream network. First,
not all arcs were connected to each other (see Figure 3).
The ARC/INFO command TRACE was used to select all
connected arcs. This revealed three major blocks of
connected arcs and many isolated arcs. The three major
blocks were easily connected in ARCEDIT by extending
the main tributary links between the blocks. Processing
of the isolated arcs was not pursued for this study.
Complete processing of arcs for this RF3 basin would
not be difficult or time consuming, with the possible
exception of the many arcs surrounding the lake. A
functional network encompassing a high percentage of
the arcs was not difficult to achieve, however.

The second network issue concerned the direction of the
arcs. RF3 has all arcs oriented toward the top of the
watershed, with the exception of one side of double-line
streams. Arcs that make up double-line streams are
oriented up one side of the double-line section and down
the other (see Figure 3). Clearly, this complicates rout-
ing. To allow for accurate downstream routing, arcs on
the downward-facing side of the stream were flipped
using ARCEDIT. With all arcs in the network facing
upstream, most hydrologic routes can be traced. Given
the network system alone, upstream routing from dou-
ble-line streams does not function properly, ignoring all
tributaries on one side of the double-line stream.
Double-Line Stream Routing

Many possible solutions exist for the problems caused
by double-line streams. Some involve improving the
network (e.g., by adding center-line arcs down the mid-
dle of double-line streams). This would involve not only
adding arcs but establishing conductivity with all tribu-
taries. This option will involve significant topological
changes to RF3. To maintain compatibility with other
239
-------
VTH •' rf^^^^VW^
^mt-i^^'
V-Jv- -'•>•''•'•••(-." ' J
' " --XVS/'
Figure 3. Original conductivity of RF3 hydrography.

RF3 and DIG sources, this option should be considered
only as part of a major RF3 upgrade.

At the other end of the technological spectrum, one
could simply instruct users to watch for double-line
streams and select arcs from both sides of the river.
Users may have trouble with this option, however, if they
are not working at an appropriate scale to easily differ-
entiate between double- and single-line streams.

A third option is to program an arc macro language
(AMI) to check for double-line streams and run up-
stream traces from both sides of the stream. The diffi-
culty in this method is to find the appropriate starting
place on both banks. The algorithm developed to do this
goes as follows:

• Select stream segment and trace upstream. (Results
in incomplete trace.)

• Find the minimum segment and mile of selected dou-
ble-line streams.

• Unselect all double-line streams below minimum seg-
ment and mile.

• Add to selection all non-double-line streams.

• Trace from original point both upstream and down-
stream. (Results in completed upstream trace.)
Results and Conclusions

AMLs and menus were written that can perform up-
stream and downstream traces on the RF3 stream net-
work and select data points within 500 feet of the
stream. Lists of attributes can be returned to the screen.
This system is easy to use and can be used to quickly
identify general watersheds and water quality data
points. An AMI can be used to trace upstream from a
double-line stream given only one point on the stream
(see Figure 4). The success of these methods suggests
that two-dimensional surface water features can be suc-
cessfully integrated into RF3 water quality analyses.

This system could be further developed to support poly-
gon analysis using the ARC command BUFFER. Other
developments could include the procedures to write se-
lected attributes to files and increased flexibility for the
screen environment and outputs.

This pilot project demonstrates only a few of the poten-
tial applications of RF3 to water quality. Success in this
pilot project suggests that RF3 is a potentially valuable
water quality analysis tool. It may also be a valuable tool
for demonstrating the results of water quality analyses
to managers or the public.

Because RF3 will require some processing before network
algorithms can be run, it is important to plan for the inte-
gration of RF3 into other CIS tools and data coverages.
240
-------
Figure 4. Upstream and downstream traces of RF3 hydrography.

Ongoing efforts to update RF3 may address some of proceed in a way that is compatible with ongoing efforts
these problems. If RF3 is to be developed into a produc- to update RF3 and the development of new data
tive water quality management tool, it is important to sources.
241
-------
EPA's Reach Indexing Project—Using GIS To Improve Water Quality Assessment
Jack Clifford
Office of Wetlands, Oceans, and Watersheds, U.S. Environmental Protection Agency,
Washington, DC

William D. Wheaton and Ross J. Curry
Research Triangle Institute, Research Triangle Park, North Carolina
Abstract

The Waterbody System (WBS), which the U.S. Environ-
mental Protection Agency (EPA) originally developed to
support preparation of the report to Congress that Sec-
tion 305(b) of the Clean Water Act requires, is a poten-
tially significant source of information on the use support
status and the causes and sources of impairment of U.S.
waters. Demand is growing for geographically refer-
enced water quality assessment data for use in inter-
agency data integration, joint analysis of environmental
problems, establishing program priorities, and planning
and management of water quality on an ecosystem or
watershed basis.

Because location of the waterbody assessment units is
key to analyzing their spatial relationships, EPA has
particularly emphasized anchoring water bodies to the
River Reach File (RF3). The reach file provides a nation-
wide database of hydrologically linked stream reaches and
unique reach identifiers, based on the 1:100,000 U.S.
Geological Survey (USGS) hydrography layer.

EPA began the reach indexing project to give states an
incentive to link their water bodies to RF3 and to ensure
increased consistency in the approaches to reach index-
ing. After a successful 1992 pilot effort in South Carolina,
an expanded program began this year. Working with
Virginia, a route system data model was developed and
proved successful in conjunction with state use of PC
Reach File (PCRF), a PC program that relates water
bodies to the reach file. ARC/INFO provides an extensive
set of commands and tools for developing and analyzing
route systems and for using dynamic segmentation.

One important advantage of the route system is that it
avoids the necessity of breaking arcs; this is an impor-
tant consideration in using RF3 as the base coverage in
a geographic information system (GIS). Using dynamic
segmentation to organize, display, and analyze water
quality assessment information also simplifies use of the
existing waterbody system data. Because of the variabil-
ity in delineation of water bodies, however, other states
used a number of different approaches. Working with
these states has defined a range of issues that must be
addressed in developing a consistent set of locational
features for geospatial analysis.

Wider use of these data also depends upon increased
consistency in waterbody assessments within and be-
tween states. Several factors complicate the goal of
attaining this consistency in assessment data:

• The choice of beneficial use as the base for assess-
ment of water quality condition.

• The historical emphasis on providing flexible tools
to states.

• The lack of robust standards for assessment of water
quality condition.

This paper explores possible resolutions to the problem
of building a national database from data collected by
independent entities.

Section 305(b) of the Clean Water Act and
the Waterbody System

Background of Section 305(b)

Since 1975, Section 305(b) of the Federal Water Pollution
Act, commonly known as the Clean Water Act (CWA),
has required states to submit a report on the quality of
their waters to the U.S. Environmental Protection Agency
(EPA) administrator every 2 years. The administrator
must transmit these reports, along with an analysis of
them, to Congress.

State assessments are based on the extent to which the
waters meet state water quality standards as measured
242
-------
against the state's designated beneficial uses. For each
use, the state establishes a set of water quality criteria
or requirements that must be met if the use is to be
realized. The CWA provides the primary authority to
states to set their own standards but requires that all
state beneficial uses and their criteria comply with the
'fishable and swimmable' goals of the CWA.

Assessments and the Role of Guidelines

EPA issues guidelines to coordinate state assessments,
standardize assessment methods and terminology, and
encourage states to assess support of specific benefi-
cial uses (e.g., aquatic life support, drinking water sup-
ply, primary contact recreation, fish consumption). For
each use, EPA asks that the state categorize its assess-
ment of use support into five classes:

• Fully supporting: meets designated use criteria.

• Threatened: may not support uses in the future unless
action is taken.

• Partially supporting: fails to meet designated use cri-
teria at times.

• Not supporting: frequently fails to meet designated
use criteria.

• Not attainable: use support not achievable.

In the preferred assessment method, the state com-
pares monitoring data with numeric criteria for each
designated use. If monitoring data are not available,
however, the state may use qualitative information to
determine use support levels.

In cases of impaired use support (partially or not sup-
porting), the state lists the sources (e.g., municipal point
source, agriculture, combined sewer overflows) and
causes (e.g., nutrients, pesticides, metals) of the use
support problems. Not all impaired waters are charac-
terized. Determining specific sources and causes re-
quires data that frequently are not available.

States generally do not assess all of their waters each
biennium. Most states assess a subset of their total
waters every 2 years. A state's perception of its greatest
water quality problems frequently determines this sub-
set. To this extent, assessments are skewed toward
waters with the most pollution and may, if viewed as
representative of overall water quality, overstate pollu-
tion problems.

Assessment Data Characteristics

Each state determines use support for its own set of
beneficial uses. Despite EPA's encouragement to use
standardized use categories, the wide variation in state-
designated beneficial uses makes comparing state uses
an inherent problem. This affects the validity of aggre-
gation and use of data across state boundaries. Com-
parably categorizing waters into use support categories
also poses a problem; different states apply the qualita-
tive criteria for use support levels in very different ways.
Further limiting the utility of Section 305(b) data is that
data are aggregated at the state level and questions
about the use support status of individual streams can-
not be resolved without additional information. While
some states report on individual waters in their Section
305(b) reports, EPA's Waterbody System (WBS) is
the primary database for assessment information on
specific waters.

State monitoring and assessment activities are also
highly variable. States base assessments on monitoring
data or more subjective evaluation. The evaluation cate-
gory particularly differs among states.

Waterbody System

The WBS is a database and a set of analytical tools for
collecting, querying, and reporting on state 305(b) infor-
mation. It includes information on use support and the
causes and sources of impairment for water bodies,
identification and locational information, and a variety of
other program status information.

As pointed out earlier, although some states discuss the
status of specific waters in their 305(b) reports, many do
not. The WBS is generally much more specific than the
305(b) reports. It provides the basic assessment infor-
mation to track the status of individual waters in time
and, if georeferenced, to locate assessment information
in space. By allowing the integration of water quality
data with other related data, the WBS provides a frame-
work for improving assessments.

WBS has significant potential for management planning
and priority setting and can serve as the foundation for
watershed- and ecosystem-based analysis, planning,
and management. In this respect, it can play a vital role
in setting up watershed-based permitting of point
sources. The primary function of WBS is to define where
our water quality problems do and do not exist. WBS is
increasingly used to meet the identification requirement
for waters requiring a total maximum daily load (TMDL)
allocation. It can serve as the initial step in the detailed
allocation analysis included in the TMDL process. In
addition, WBS is an important component of EPA par-
ticipation in joint studies and analyses. For instance,
EPA is currently participating with the Soil Conservation
Service (SCS) in a joint project to identify waters that
are impaired due to agricultural nonpoint source (NPS)
pollution. WBS can also anchor efforts to provide im-
proved public access at the state and national levels to
information on the status of their waters.

It is important to recognize that use of WBS is voluntary.
Of the 54 states, territories, river basin commissions, and
Indian tribes that submitted 305(b) reports, approximately
243
-------
30 used the WBS in the 1992 cycle. While submissions
for the 1994 cycle are not complete, we anticipate about
the same level of participation. This represents about a
60-percent rate of participation in WBS, which may be
the limit for a voluntary system. This severely limits use
of WBS assessment data for regional and national level
analysis. If data at the national level are required, man-
datory data elements, formats, and standards may be
necessary.

EPA is currently attempting to achieve consistency
through agreement with other state and federal agen-
cies. The recent work of the Interagency Task Force on
Monitoring offers hope for eventual consensus on the
need for nationally consistent assessment data and mu-
tually agreed upon standards for collection, storage, and
transfer. The Spatial Data Transfer Standards already
govern spatial data, allowing movement of data between
dissimilar platforms. The Federal Geographic Data
Committee provides leadership in coalescing data inte-
gration at the federal level; it provides a model for gov-
ernment and private sector efforts. This level of
cooperation, however, has not always been present in
water assessment data management. Assuming that
national and regional assessment data are needed, if
consistency is elusive through cooperative efforts, regu-
lations may be necessary. Developing a national data-
base may not be feasible without a mutual commitment
by EPA and the states to using common assessment
standards.

WBS was originally developed as a dBASE program in
1987. It has undergone several revisions since then, and
the current Version 3.1 is written in Foxpro 2.0. The
WBS software provides standard data entry, edit, query,
and report generation functions. WBS has grown sub-
stantially in the years since its inception, primarily in
response to the expressed needs of WBS users and
EPA program offices. The program's memory require-
ments and the size of the program and data files, how-
ever, are of growing concern to state WBS users and
the WBS program manager. Because of the wide range
of WBS user capabilities and equipment, users must be
equipped to support an array of hardware from high
capacity Pentium computers to rudimentary 286 ma-
chines with 640 Kb of memory and small hard disks. This
range makes memory problems inevitable for some users.

While WBS contains over 208 fields, exclusive of those
in lookup tables, approximately 30 fields in four files
provide the core data needed to comply with 305(b)
requirements. These fields contain:

• Identification information for the water body.

• The date the assessment was completed.

• The status of use support for beneficial uses.
• The causes and sources of any use impairment in
the water body.

The uses WBS considers are both state-designated
uses and a set of nationally consistent uses (e.g., overall
use, aquatic life support, recreation) specified in the
305(b) guidelines. The other essential piece of informa-
tion is the geographic location of the water body, which
the remainder of this paper discusses in detail.

Significant differences exist in the analytical base as well
as in assessments. EPA provided little initial guidance
on defining water bodies; therefore, states vary widely
in their configurations of water bodies. Water bodies are
supposed to represent waters of relatively homogene-
ous water quality conditions, but state interpretation of
this guidance has resulted in major differences in water-
body definition.

Initially, many states developed linear water bodies, and
these were often very small. The large number of water
bodies delineated, however, created significant difficul-
ties in managing the assessment workload and were not
ideal in the context of the growing need for watershed
information. Some states, such as Ohio, developed their
own river mile systems.

As discussed below, some states indexed their water
bodies to earlier versions of the reach file, and therefore,
the density of the streams these water bodies include is
fairly sparse. Recently, many states have redefined
their water bodies on the basis of small watersheds
(SCS basins, either 11-digit or 14-digit hydrologic unit
codes [HUCs]).

Locating water bodies geographically is a necessary
prerequisite to assessing water quality on a watershed
or ecosystem basis. The WBS has always included
several locational fields, including county name and
FIPS, river basin, and ecoregion. These fields have not
been uniformly populated, however. One of the WBS
files includes fields for the River Reach File (RF3)
reaches included in the water body. While a few states
had indexed their water bodies to older versions of the
reach file (RF1 and RF2), however, no state had indexed
to RF3 until 1992.

In 1992, EPA initiated a demonstration of geographic
information system (CIS) technology in conjunction with
the South Carolina Department of Health and Environ-
mental Control. This project involved:

• Indexing South Carolina's water bodies to RF3.

• Developing a set of arc macro languages (AMLs) for
query and analysis.

• Producing coverages of water quality monitoring sta-
tions and discharge points.

• Using CIS tools in exploring ways to improve water
quality assessments.
244
-------
South Carolina has defined its water bodies as SCS
basins.

The results have been very encouraging. First, South
Carolina took the initial coverages and decided they
needed much more specificity in their use support de-
terminations and their mapping of the causes and
sources of impairment. As a result, they mapped these
features down to the reach level. Next, they decided that
they needed better locational information, so they used
global positioning satellite receivers to identify accurate
locations for discharges and monitoring stations. They
then used CIS query and analysis techniques to relate
their monitoring and discharge data to their water quality
criteria. South Carolina is using CIS to actively identify
water quality problems and improve their assessments.

In 1993, EPA worked cooperatively with several states
to index their water bodies to the reach file. Virginia, the
next state to be indexed, demonstrated the successful
use of PC Reach File (PCRF) software (described later
in this paper) for indexing water bodies to the reach file.
Ohio and Kansas also are essentially complete. Each of
these states required a somewhat different approach
than Virginia. The need for flexibility in dealing with
states on reach indexing issues is essential. Existing
waterbody delineations often represent considerable in-
vestment; therefore, EPA must provide the capability to
link the state's existing assessment data to the reach file
in order to encourage state buy-in.

Figure 1 shows the results of Ohio's indexing of a typical
cataloging unit (CD). Figure 2 reflects part of the output
of the Kansas work. We can link use support, cause, and
source data to each of these water bodies now. In the
future, we hope to map these attributes at a higher level
of resolution, down to the reach segment level. CIS has
proven to be a useful assessment tool. With higher reso-
lution, it should prove to be even more helpful in identify-
ing water quality problems, picking up data anomalies,
and assessing management actions, strategies, and poli-
cies. This entire process has taught us much and has
strengthened enthusiasm for place-based management.

The Reach Indexing Project—
Georeferencing the Waterbody System

Purpose and Overview

The reach indexing project is designed to locate water
bodies using RF3 as an electronic base map of hydrog-
raphy and to code RF3 reaches with the specific water-
body identifier (WBID). After linking water bodies to their
spatial representation, they can be queried and dis-
played with assessment data located in WBS files.

Reach indexing includes several steps. First, the state
must supply waterbody locations and WBIDs. The next
step entails developing a set of procedures for indexing.
Finally, the coded RF3 data must be produced.

Input data to the indexing process includes:

• A list of valid WBIDs. In most cases, the state has
already input these identification numbers to the WBS.

• Information about the location of each water body. Lo-
cational information may be found in marked-up
paper maps showing waterbody locations or electronic
files from WBS containing waterbody indexing
66022
66023
66024
66025
67001
66001
66002
66003
66004
66005
66006
66007
fifinnp
DDUUO
66009
66010
= 66011
— 66012
«*«** 66013
ooocxj 66014
= 66015
ca=t 66016
fifim 7
DDU 1 /
66018
= 66019
__-. 66020
Figure 1. State of Ohio water bodies in CU 04100008.
245
-------
Figure 2. State of Kansas water bodies in CU 11070202.

expressions (discussed later), orit may be embedded
in the WBID itself.

• A complete set of RF3 data for the state being indexed.

Depending on the type of information the state supplies,
procedures used to index water bodies can be almost
fully automated, semiautomated, or completely manual.

The final result of the indexing processes is a set of RF3
coverages that contain a WBID attribute. This product
allows querying and displaying of assessment data,
which is collected and stored by water body, in a CIS
environment.

The Reach File Database

The reach file is a hydrographic database of the surface
waters of the continental United States. Elements within
the database represent stream segments. The elements
were created for several purposes:

• To perform hydrologic routing for modeling programs.

• To identify upstream and downstream connectivity.

• To provide a method to uniquely identify any particu-
lar point associated with surface waters.

The unique reach identifier has succeeded in associat-
ing other EPA national databases, such as STORET, to
surface waters. Any point within these databases can be
associated with and identified by a specific location on
any surface water element, such as a reservoir, lake,
stream, wide river, or coastline.

There are three versions of the reach file. The first was
created in 1982 and contained 68,000 reaches. The
second version, released in 1988, doubled the size of
Version 1. The third version (RF3) includes over
3,000,000 individual reach components.

The base geography of RF3 is derived from U.S. Geo-
logical Survey (USGS) hydrographic data (1:100,000
scale) stored in digital line graph (DIG) format. Unlike
DIG data, which are partitioned by quad sheet bounda-
ries, RF3 data are partitioned by CU. A CU is a geo-
graphic area that represents part or all of a surface
drainage basin, a combination of drainage basins, or
a distinct hydrologic feature. The USGS uses CUs
for cataloging and indexing water-data acquisition
activities.

The continental United States comprises over 2,100
CUs. CUs are fairly small; for example, 45 units fall
partially or completely within the state of Virginia (see
Figure 3).

RF3 is a powerful data source used in hydrologic appli-
cations for many reasons, including the following:

• RF3 has spatial network connectivity that topological
upstream/downstream modeling tools use.
246
-------
Figure 3. Cataloging units in Virginia.

• RF3 has attributes that describe connectivity, which
offers the ability to accomplish upstream/downstream
navigation analytically (without topological networking).

• RF3 has a simple and consistent unique numbering
system for every stream reach in the United States.

• RF3 has built-in river mileage attributes that describe
upstream/downstream distances along river reaches.

Use ofRF3 in the Indexing Process

When importing Reach File data from EPA's mainframe
computer, an arc attribute table (AAT) is automatically
built for each RF3 coverage. The AAT contains the
standard AAT fields, plus the items found in Table 1.

The CD item stores the USGS CD number of this piece
of RF3. Every arc in the coverage has the same value
for CU.

The SEG item stores the number of the stream seg-
ment to which the particular arc is assigned. SEG num-
bers start at 1 and increase incrementally by 1 to 'N' for
each CU. A SEG could represent all the arcs of a
mainstream, the arcs of a tributary, or piece of a main-
stream or tributary. SEG numbers were defined in the
production of RF3.

Ml stores the marker index for each particular arc. The
Ml resembles a mile posting along a stream. In reality,
the Ml field does not truly measure mileage along the
RF3 stream network. It does, however, represent a
method of producing a unique identifier (in combination
Table 1. Fields Found in Arc Attribute Table

12070104-ID CU SEG Ml
UP
DOWN
1
2
3
4
5
6
7
1 20701 04
1 20701 04
1 20701 04
1 20701 04
1 20701 04
1 20701 04
1 20701 04
1
1
1
2
3
3
4
0.00
1.30
2.10
0.00
0.00
1.15
0.00
-1
-1
-1
-1
-1
-1
-1
0
0
0
0
0
0
0
with the CU number and the SEG number) for every
reach in the United States (see Figure 4).

Together CU, SEG, and Ml uniquely identify every arc
in RF3 nationwide. These three items are combined in
the redefined item called RF3RCHID. This provides a
powerful scheme for consistently identifying locations
along streams everywhere in the country.

Along with the AAT file, a second attribute file is auto-
matically created for RF3 coverages. This file is always
named COVER.DS3. The DS3 file stores a wealth of
information about arcs in the coverage. Some of the
important fields in the DS3 file contain:

• Upstream and downstream connectivity for navigat-
ing along reaches.

• Codes to describe the type of reach (e.g., stream,
lake boundary, wide river).

• DIG major and minor attributes.
247
-------
4.05
2.4
o
1.30
WBID
KS- KR-04-R001
KS-KR-04-W020
KS-KR-04-W030
1.0
o
WBID WBNAME

KS-KR-04-R0001 Mainstem

KS-KR-04-W020 Tributaries

KS-KR-04-W030 Lakes
Figure 4. RF3, SEG, and Ml data elements.

Waterbody Locations

Because states define water bodies, they provide the
only information on waterbody location. South Carolina
was indexed to RF3 in 1992, followed by Virginia. Vir-
ginia indexed its water bodies using the PCRF program
instead of indexing in a CIS environment with ARC/INFO.

PCRF is a PC-based system that indexes water bodies
and locates other assessment data from WBS. PCRF
stores the definitions of water bodies (including their
location) in a file that is linked to other WBS database
files that contain information about the assessment
status and quality of the waters.

A water body is a set of one or more hydrologic features,
such as streams, lakes, or shorelines, that have similar
hydrologic characteristics. Water bodies are the basic
units that states use to report water quality for CWA
305(b) requirements. Depending on the state's assess-
ment goals and resources, water bodies can be defined
in several ways, including (see Figure 5):

• All streams within a watershed

• All lakes and ponds within a watershed

• Sets of streams with similar water quality conditions

PCRF stores locational data for a water body with a
unique WBID. WBS uses this WBID as a common field
to relate the water body's definition and location to de-
scriptive data about the water body's assessment status
and quality. The two most important files used in PCRF
are the SCRF1 and SCRF2 files.

The SCRF1 file simply lists the unique water bodies by
state. Table 2 offers an example. The most relevant data
for reach indexing in this file are the WBID, WBNAME,
and WBTYPE, as defined by the state. The WBID, as
stated, is a unique identifier for each water body the
state has defined. The WBNAME stores a verbal de-
Figure 5. Potential definitions of water bodies.

scription of the water body. Finally, the WBTYPE de-
fines the type of water body; for example, R is for river,
L is for lake.

The SCRF2 file contains an explicit definition of each
water body. Because of the complexity involved in de-
fining water bodies, this file may include more than one
record for each water body. The SCRF2 file can be
considered a waterbody definition language because it
contains specific codes, attributes, and keys that can be
converted into specific reaches on the RF3 data, if read
properly (see Table 3). The WBBEGIN and WBEND
fields contain explicit CD, SEG, and Ml attributes to
define the location of the starting point and ending point
for the water body. The WBDIR field contains an attrib-
ute that describes whether to go upstream or down-
stream from the WBBEGIN to the WBEND. In addition,
a blank WBEND field denotes that the water body
should include all upstream or downstream reaches
(depending on the WBDIR) of the WBBEGIN reach.

Virginia used PCRF to create an SCRF2 file that con-
tains reach indexing expressions for all of their defined
water bodies. ARC/INFO macros were then written to
process this file and expand the expressions into the set
of specific arcs that compose each water body. The
macros will be described in more detail later.
Table 2. Example of SCRF1 File Data

WBID WBNAME WBTYPE
WBSIZE
KS-KR-04-R001
KS-KR-04-W020
KS-KR-04-W030
KS-KR-04-W040
KANSAS
RIVER
LOWER
WAKARUSA
RIVER
MUD CR
CAPTAIL CR
R
R
R
R
15.20
61.60
39.43
15.63
248
-------
Table 3. Example of SCRF2 File Data

WBID WBDIR WBBEGIN
WBEND
RFORGFLAG
KS-KR-04-R001
KS-KR-OR-W020
KS-KR-04-W030
KS-KR-04-W040
U
D
U
U
10270104001 0.00
10270104005 10.80
10270104059 12.05
10270104038 0.00
1027010400115.20
102701040050.00
102701040078.10

2
2
3
3
States that have not already generated indexing expres-
sions in PCRF must provide locations in some other
way. The most basic method is for the state to supply a
set of 1:100,000 USGS quad sheets that they have
marked up with locations of each water body. The
maps can be used in conjunction with a digitizer to
manually select the appropriate RF3 reaches and
code them with the WBID.

The state of Ohio created a CIS database of its river
reaches several years ago. The CIS coverage is rep-
resentational in nature. The stream reaches are 'stick-
figures' only. Generally, they fall along the paths of the
actual streams, but they are schematic in nature and do
not show the true shape of streams. The CIS layer,
however, contains the attributes of Ohio's stream reach
numbering system, which is used to identify water bod-
ies as well. Ohio's river reach coverage contains infor-
mation on the locations of water bodies and is being
manually conflated to transfer the WBIDs to RF3. The
conflation process will be covered later in this paper.

The state of Kansas had previously defined its water
bodies on RF2, the precursor to RF3. Kansas defined
some indexes by a set of RF3 SEG numbers in a CD
and some by the RF3 reaches in a small watershed
polygon within a CD. The locations were, in effect, de-
fined within the WBID itself.

Indexing Procedures

Procedures developed for performing waterbody index-
ing include automated, semiautomated, and manual
systems.

Automated Indexing Procedures

As stated, Virginia used PCRF to perform the indexing
operation. The state delivered an SCRF2 file containing
indexing expressions for all of its water bodies. AMI
programs were created to read the SCRF2 file and
select the reaches specified by each indexing expres-
sion. The selected sets of reaches were then coded with
the appropriate WBID. The macros were designed to run
on one RF3 CD at a time, so the operator specified runs
of up to 10 CDs at a time. The macros had to accom-
modate indexing expressions that included:

• Select reaches upstream of a specified location.
• Select reaches on a reach-by-reach basis.

• Select reaches within a given polygon area.

• Select shorelines of lakes or ponds given latitude and
longitude coordinates.

• Select reach downstream from a specified location.

Kansas water bodies were also indexed through an
automated process. Kansas supplied an ARC/INFO
coverage of small watershed polygons (sub-CD poly-
gons) containing a watershed identifier. The state's
WBID contained all other information necessary to de-
termine the RF3 CD and the set of reaches making up
each water body. An example of a Kansas WBID is
KS-KR-02-W030. This is explained by the following:

• KS refers to the state. All WBIDs in Kansas begin
with KS.

• The second component (in this case KR) is an ab-
breviation of the basin in which the water body falls.
KR indicates that this water body is in the Kansas-
Lower Republican River basin.

• The third component contains the last two digits of
the eight-digit CD number. Although basins comprise
several CDs, the last two digits of each CD in a basin
are unique; therefore, between the basin (e.g., KR)
designation and the last two digits of the CD (e.g.,
02), the complete eight-digit CD number in which the
water body falls is defined.

• The next letter (in this case W) denotes whether the
water body is defined by a watershed polygon (W),
an RF2 SEG (R), or a lake or pond shoreline (L).

• Finally, the WBID ends with the number of the poly-
gon (in this case 030) that contains the reaches for
the water body in the watershed coverage.

The completed macros could index the entire state in a
single run provided that all the WBIDs were contained
in single file.

In all cases, Kansas has indexed to RF2 reaches. Only
RF3 reaches originally created in RF2 production, there-
fore, are coded with a WBID.
249
-------
Manual Indexing Procedures

Because Ohio already has a coverage of river reach
codes, WBIDs from this coverage had to be transferred
to the RF3 reaches they represent. This entailed using a
manual conflation process. The operator displayed a CD
of RF3 along with the Ohio river reach system for the
same area. In a simple process of 'pointing and clicking,'
the operator first selected an Ohio river reach arc, then
the RF3 arcs that seemed to coincide. As each RF3 arc
was selected, it was coded with the WBID of the pre-
viously selected Ohio river reach arc.

Other states that have no means of describing water
bodies in electronic files may have to mark up paper maps
to show waterbody locations. These maps can then be
used in a manual process of selecting RF3 reach and
coding them with WBIDs either in ARC/INFO or in PCRF

Using the Route System Data Model To
Store Water Bodies

Because water bodies can be defined as noncontiguous
sets of arcs and portions of arcs, a robust linear data-
base model is necessary to model these entities.
ARC/INFO's route system data model seems well suited
for this application. The route system data model allows
one to group any set of arcs or portions of arcs into
routes. Each route is managed as a feature in itself.
Attributes of water bodies are stored in a route attribute
table (RAT) and relate to all the arcs defined as the water
body. Figure 6 helps illustrate the route system model.

Each route comprises one or more arcs or sections of
arcs. ARC/INFO manages the relationship between arcs
and routes in the section table (SEC). The structure of
the SEC, which is an INFO table, is defined in Table 4.
Tables reflects how the sections that make up the above
routes would appear.
cover.ratwbs
WBS#WBS-ID
1 1
WBID
KS-KR-04-R0001

KS-KR-04-W020

KS-KR-04-W030
Table 4. Definition of Structure of SEC INFO Table

ROUTELINK* The route upon which the section falls
ARCLINK* The arc upon which the section falls
F-MEAS The measurement value at the beginning of the
section
T-MEAS The measurement value at the end of the section
F-POS The percentage of the distance along the arc at
which the section begins
T-POS The percentage of the distance along the arc at
which the section ends
SEC# The internal identifier of the section
SEC-ID The user identifier of the section
Table 5. How Sections Appear in SEC INFO Table

ROUTE-
LINKS ARCLINK8 F-MEAS T-MEAS F-POS T-POS SEC# SEC-ID
1
1
1
2
3
3
3
1
2
3
4
5
6
7
0
1.30
2.10
0
0
1.15
0
1.30
2.10
4.05
1.20
1.15
2.4
2.5
0
0
0
0
0
0
0
100
100
100
100
100
100
100
1
2
3
4
5
6
7
1
2
3
4
5
6
7
Figure 6. The route attribute table containing waterbody data.
Representing Water Bodies as Routes

ARC/INFO offers several ways of grouping sets of arcs
into discrete routes. One can use ARCEDIT to select a
set of arcs to group them into a route, or ARCSECTION
or MEASUREROUTE in ARC to group arcs into routes.
The method described here uses the MEASUREROUTE
command. This method requires that the AAT or a re-
lated table has an attribute containing the identifier of
the route to which an arc should be assigned. In the
application the authors employed, they converted the
SCRF2 file into an INFO table containing, for each arc
in the coverage, the RF3RCHID of the arc and the WBID
to which the arc should be assigned. The WBID item is
used to group arcs into routes. One route exists for each
unique WBID. Table 6 illustrates the table used in the
MEASUREROUTE command method. This table is re-
lated to the AAT of the RF3 coverage by the RF3RCHID.

An RAT is automatically created for the coverage, which
now can be related to other WBS assessment files for
display and query. Figure 4 illustrates the RAT. The most
important characteristic of the file is that it has only one
record for each water body. This simplifies the display
and query of water bodies based on water quality data.

Using EVENTS for Subwaterbody Attributes

Water bodies, as states define them, often constitute a
gross aggregation of the water in an area. States often
have more specific data about particular stretches of
streams within a water body. A system is needed to
250
-------
Table 6. Table Used in MEASUREROUTE Command Method
To Group Arcs Into Routes
$RECNO
RF3RCHID
WBID
1
2
3
4
5
6
7
8
9
1 02701 04
1 02701 04
1 02701 04
10270104
10270104
10270104
10270104
10270104
10270104
1 0.00
1 1.30
1 2.10
2 0.00
3 0.00
3 1.15
40.00
5 0.00
6 0.00
KS-KR-04-R0001
KS-KR-04-R0001
KS-KR-04-R0001
KS-KR-04-W020
KS-KR-04-W020
KS-KR-04-W020
KS-KR-04-W020
KS-KR-04-W030
KS-KR-04-W030
4.05
2.4
o
1.0
1.30
WBID
KS- KR-04-R001
KS-KR-04-W020
KS-KR-04-W030
query and display data at the subwaterbody level.
ARC/INFO's dynamic segmentation tools and event ta-
bles are useful for this application. Once water bodies
have been defined and reporting methods have been set
up based on those water bodies, the task of redefining
them is cumbersome.

Event tables can help to keep these waterbody defini-
tions yet still offer the ability to store, manage, and track
data at the subwatershed level. Event tables are simple
INFO files that relate to route systems on coverages.
This concept and data structure can work in conjunction
with the predefined waterbody system. We have already
seen how a route system called WBS is created in RF3
to group arcs into waterbody routes. This works quite
well when displaying water bodies and querying their
attributes. A route system based on the WBID cannot,
however, act as an underlying base for subwaterbody
events because the measures used to create the WBS
route system are not unique for a particular route. For
example, in the route depicted in Figure 7, three loca-
tions are defined as being on WBID KS-KR-04-W020 and
having measure 1.0.

The mileage measurements along SEG, however, are
always unique (see Figure 8). To use EVENTS, therefore,
a second route system must be created based on the RF3
SEG attribute, which provides a unique code for each CD.

The ARCSECTION command, instead of the MEASURE-
ROUTE command, is used to create the SEG route
system. This is because the measurement items (Ml on
the AAT and UPMI on the DS3) already store the
summed measures along particular SEGs. Table 7 lists
the contents of the resulting RAT table.

Because the name of the route system is SEG, the
SEG# and SEG-ID are the names of the internal and
user IDs. The SEG item contains the actual SEG num-
ber in the RF3 coverage. Because the SEG numbers for
each RF3 CU coverage start at 1 and increase incre-
Figure 7. Measurements along SEGs.
O
Figure 8. Events located on RF3 data.

Table 7. Route Attribute Table

SEG-ID SEG#
SEG
mentally by 1, the SEG item looks much like the SEG-ID
and SEG#.

Event tables contain a key item, the WBID or SEG, to
relate them to the appropriate route system (see Figure 8).
They also contain locational information on where to lo-
cate the events on the route (either WBID to indicate the
water body on the WBS route or SEG to identify the
route in the SEG route system). Separate event tables
can then relate use support, causes, and sources as
linear events. FROM and TO store the starting and
251
-------
ending measures for each event. Using event tables
allows us to apply many useful cartographic effects
(e.g., hatching, offsets, text, strip maps). Events can be
queried both in INFO and graphically. Event data can
help in producing overlays of two or more event tables.
An event table can display use support information (see
Table 8). WBS users can update their event tables using
RF3 maps supplied by EPA without having proficiency
in ARC/INFO. ARC/VIEW2 is expected to support
events and route systems. This will give users powerful
tools for spatial query of assessment data. Developing
event tables would also display and query data on the
causes and sources of use impairment. These events
can be offset and displayed to show the areas of inter-
action. More permanently, preparing line-on-line over-
lays can show intersections and unions.
An alternative approach is to use an EVENT-ID as a
unique identifier for each event. The SEG field stores the
number of the route (SEG) upon which the event occurs.
FROM and TO store the beginning and end measures
along the route upon which the event occurs. WBID
contains the identifier of the water body upon which the
event occurs (see Table 9). An event can occur within
a single SEG, across two or more SEGs, within a single
water body, or across two or more water bodies.
Additional attribute tables can be created to store de-
scriptive attributes for each event. These tables would
resemble the SCRF5 and SCRF6 files except that in-
stead of using the WBID to relate to a water body, a field
called 'EVENT-ID' would link the use, cause, and source
data to a particular event (see Table 10).
Both approaches offer some advantages. In either case,
they allow us to map our water quality assessment data
and communicate it in a meaninaful and useful wav.
Table 8. Event Table That Reflects Use Support Information
USE
SEG FROM TO WBID USE SUPPORT
1 0.80 1.30 KS-KR-04-R0001
1 1.30 2.10 KS-KR-04-R0001
1 2.10 2.31 KS-KR-04-R0001
1 0.50 1.30 KS-KR-04-R0001
3 0.00 1.15 KS-KR-04-W030
4 0.00 2.5 KS-KR-04-W040
21 Fully
21 Partial
21 Not supported
40 Threatened
21 Fully
40 Not supported
Table 9. Using EVENT-ID as a Unique Event Identifier
EVENT-ID SEG FROM TO WBID
1 1 0.80 1.30
1 1 1.30 2.10
2 4 0.00 2.5
Table 10. Using EVENT-ID To Link Use,
Data to an Event
EVENT-ID ASCAUSE
1 900
1 -9
1 0500
2 1200
2 0900
KS-KR-04-R0001
KS-KR-04-R0001
KS-KR-04-W040
Cause, and Source
ASSOURC
1200
1100
1100
9000
8100
252
-------
Environmental Management Applications
-------
Ecological Land Units, GIS, and Remote Sensing:
Gap Analysis in the Central Appalachians
Ree Brannon, Charles B. Yuill, and Sue A. Perry
West Virginia University, Morgantown, West Virginia
Abstract

The gap analysis team in West Virginia is assessing the
state's natural communities as part of a nationwide,
comprehensive planning effort. Underrepresented or
unrepresented habitats represent gaps in the present
network of conservation lands and conservation activi-
ties. After identifying these gaps, we can assess
whether our current management direction will maintain
natural diversity and will prevent additional species from
being classified as threatened or endangered.

The relationship between vegetation and ecological
variables serves as the basis for classifying ecological
land units. To characterize the ecological land units,
many layers of physical data can be integrated in a
geographic information system (GIS). Satellite imagery
and videography map existing conditions over the state.
The existing vegetation is classified to reflect physiog-
nomic and floristic elements to correlate with vertebrate
and butterfly habitat requirements. This correlation of
vegetation and wildlife habitat creates mappable habitat
types. Analysis of these habitat types with land-
ownership data indicates where the species-rich areas
occur in the landscape and whether the most species-
rich areas are protected.

Introduction

To respond to the urgency of habitat loss and its effect
on species diversity, scientists must implement a meth-
odology for rapid assessment and documentation of
natural communities at a scale pertinent for regional
management activities.

Geographic information systems (GIS) and remote
sensing support the development of ecological land
classifications over large regions. CIS-based mapping
of ecological land classes allows users to combine and
display environmental variables for spatial modeling and
refinement of ecological land units (1).
The Gap Analysis Project is a comprehensive planning
effort for land conservation in the United States. The
objective of the Gap Analysis Project is to identify spe-
cies, species-rich areas, and vegetation types underrep-
resented or unrepresented in existing biodiversity
management areas. Unprotected communities are the
gaps in the conservation strategy. The Gap Analysis
Project is not merely identifying communities with the
largest number of species; its ultimate goal is to identify
clusters of habitats that link the greatest variety of
unique species.

Local areas with considerable diversity of habitat or
topography usually have richer faunas and floras (2).
Nature reserves, which incorporate a variety of habitats,
may be the best guarantee for long-term protection of
biodiversity. By protecting species-rich regions, we can
reduce the enormous financial and scientific resources
needed to recover species on the brink of extinction.

The West Virginia Gap Analysis Project began in 1991.
The objective is to map existing vegetation and to use
that as the foundation to model potential distribution of
vertebrate and butterfly species. High cost precludes
intensive field inventory and monitoring of wildlife.
Therefore, habitat modeling is critical to predict wildlife
species composition and potential ranges over the vari-
ous landscape types of West Virginia. Lastly, the vege-
tation map will provide a record of the existing habitat to
use in monitoring changes due to human activities and
natural disturbances.

Pilot Study Area

Distribution of wildlife and plant communities will be
modeled for the entire state. Initially, we will focus on a
smaller pilot study area. This region includes approxi-
mately 50,000 hectares in the central Appalachian
Mountains and spans several physiographic provinces
and vegetative communities. Generally, soils in the pilot
study area are of two kinds: acidic soils that develop a
255
-------
clay horizon from extensive leaching over time and
younger soils that are found on steep slopes and where
environmental conditions, such as cold climate, limit soil
development (3, 4).

The vegetation types include spruce-fir, oak-pine, high
elevation bogs, northern hardwoods, Appalachian oak,
mixed-mesophytic, open heath barrens, and grass
balds. The mixed-mesophytic, the most diverse in West
Virginia, lies primarily west of this area, but localized
stands do occur in the lower elevations. Cover types
within the Appalachian oak and mixed-mesophytic types
are not discrete and will be difficult to delineate.

The pilot study area includes a variety of land uses, such
as residential, commercial, industrial, mining, and agri-
culture. Portions of the Monongahela National Forest in
this area are the Fernow Experimental Forest, and the
Otter Creek and Laurel Forks Wilderness Areas.

Methods

The following discussion describes the methods formu-
lated and data compiled for the West Virginia Gap Analy-
sis Project.

Describe Ecological Land Units With Existing
Vegetation

Davis and Dozier (1) note that a landscape can be
partitioned by ecological variables, which contributes to
an ecological land classification. This process is applied
frequently to analysis and mapping of natural resources.
Davis and Dozier classified vegetation in California
based on the documented associations of vegetation
with terrain variables. They based this approach on the
assumptions that the arrangement of natural landscape
features is spatially ordered by an ecological interde-
pendence among terrain variables and that actual vege-
tation is a reliable indicator of these ecological
conditions. Similar documentation exists for the distribu-
tion of vegetation types in West Virginia, and the gap
analysis team is proceeding along a similar course.

West Virginia lies in two major provinces, the Eastern
Broadleaf Forest and the Central Appalachian Broadleaf
Forest-Coniferous Forest-Meadow Provinces (5). Within
these provinces are several broad vegetation types. The
gradient diagram in Figure 1 (6) illustrates the range of
these types. The vertical axis represents elevation in
feet. Three vegetation types emerge distinctly along the
elevation gradient. The horizontal axis is not quite as
explicit. This gradient spans moist, protected slopes to
dry, exposed ridgetops, and the vegetation types are
much less distinct. Drier oak and pine types occur al-
most exclusively on exposed ridgetops. The vegetation
types along the horizontal axis are the mixed mesophytic
forest association of the Allegheny and Cumberland
Mountains and can have 20 to 25 overstory and under-
story species per hectare in North America (7).

The distribution of the vegetation along gradients such
as elevation and soil moisture lends itself to a CIS
analysis. Physical data such as elevation and soil mois-
ture regime can be incorporated into a CIS. These lay-
ers of information can be manipulated graphically or
mathematically to model the spatial distribution of vege-
tation types or to provide useful ancillary data for clas-
sification of satellite imagery (see Figure 2). Much of this
data is digital and can be used to substantially reduce
the time required to develop a database. To standardize
output, members of the national gap team have
specified ARC/INFO as the software to generate final
products.

Classify Satellite Imagery To Create a Map of
Current Distribution

Remote sensing provides an effective means to classify
forests, and the Gap Analysis Project has successfully
used it in the western United States (1, 8, 9).

The Gap Analysis Project is using Landsat Thematic
Mapper imagery in all states to standardize the baseline
information. The hypothesis is that the spectral data
from the imagery is related to the distribution of the
ecological land units and land use across the landscape.
The data include all spectral bands, except thermal, for
the entire state. The West Virginia project is using two
seasons of data, spring and fall. Temporal changes,
which record phenologic variation in the deciduous spe-
cies, enhance classification accuracy. The spectral reso-
lution is 30-meter pixels. This is equivalent to
approximately 1/6 hectare (1/2 acre). Our final product
will be a series of 1:100,000 maps. The minimum map-
ping unit is 40 acres.

The mountainous terrain in the Appalachian Mountains
offers disadvantages and advantages for using remotely
sensed data. Irregular topography can cause inconsis-
tencies in the spectral data that diminish the classifica-
tion accuracy. Similar cover types may have different
spectral signatures; for instance, if one stand is in sun-
light and the other is shaded. Also, phenology can vary
due to microclimatic influences. Conversely, topo-
graphic features influence the distribution of vegetation
types, and ancillary data, such as digital elevation mod-
els (OEMs), enhance classification results of the im-
agery. The West Virginia gap analysis team selected the
following strategy for image classification.

1. Stratify the imagery using ecological units based on
a hierarchical scheme. Bauer et al. (10) found that
an initial stratification of physiographic regions was
necessary to reduce the effect of broadscale
environmental factors caused by changes over
latitude. Therefore, stratification enhanced the
256
-------
5,000

4,500

4,000

3,500

3,000

2,500

2,000

1,500

1,000

500
MESIC
Red Spruce
Forest ._. .,
Heath
Barren

Red Spruce -
Yellow Birch
Northern Hardwoods
Forest
Red
--------- , Oak
._--"' " ', ' Forest /
x' Hemlock I . ,'' /
and ,' ,-' /' '
Hemlock- ' -' / ,-'*
Cove / Hardwoods ,' / ,-'
Hardwoods ' ''
White /'
Oak /
1 / ' '
, / ; i
/ Red Oak - / chestnut/
/ / White Oak / oak ''
Forest / ,'
Scarlet/
,' \ ,' I Black Oak
Coves Flats Sheltered Open Slope
Canyons Draws/Ravines Slopes NE, E, S, W, NV

Grass
Bald

Pitch/
Shortleaf
Pine
Heath

Scrub
Pine

Post/
Blackjack
Oak
Ridges
V XER\C
Figure 1. Environmental gradients for vegetation.

efficiency of the training data. An interagency
committee, including ecologists from the Monongahela
National Forest, West Virginia Division of Forestry,
and geologists from the State Geologic Survey,
generated a draft map of physiographic regions.
They delineated sections based on geomorphology
and climate. Sections were divided into subsections:
those most typical of the section or those that are
transitional, or irregular, to the section. Figure 3 is a
draft map of these sections and subsections in West
Virginia.

2. Classify stratified imagery using the ancillary data.
High resolution imagery has not been used widely in
the eastern United States, where forests are not
homogeneous stands of relatively few tree species
as they are in the West. Researchers who classified
eastern forests from satellite imagery attempted to
find the most distinctive spectral band combinations
for discriminating cover types (10-13). One recent
technique (14) uses a nonparametric approach that
combines all spectral and informational categories to
classify imagery. We are testing a variety of methods
such as nonparametric processes, traditional
clustering techniques, and use of derived vegetation
indexes to find the most successful method.

3. Assess accuracy with random plots from
videography. Videography will be acquired in the
spring of 1995. Aerial transects, which extend the
length of the state, will be flown with approximately
30-kilometer spacing. By regulating flight altitudes,
the resolution per frame can be captured at 1
kilometer per frame. About 7,000 frames will be
collected, which make up a 3 percent sample of the
state. About 2 to 3 percent of the videography frames
will be field verified. With this strategy, we will test
the effectiveness of using videography, instead of
intensive field plots, to verify classification of satellite
imagery. Areas of special interest, which the
systematic transects may not capture, will require
separate flights. The bulk of the videography will
provide training data for supervised image
classification. The remaining frames will be used to
assess the accuracy of the classification.
;>, Streams
."I;;-- Ecological Subregions
";:>- Digital Elevation Models
i-* Soils
Agricultural, Tilled
High Elevation, Red
Spruce
Low Elevation, Coniferous
Forest (Hemlock)
Open Grassland, Deep
Soil (Grass Bald)
High-Density Urban
Landscape Unit
Classes
Figure 2. GIS and the development of ecological land units.
257
-------
Figure 3.
Legend

g] Pilot Study Area
Physiographic regions of West Virginia and pilot
study area.
In summary, 100 percent of the state will be classified
using the Thematic Mapper imagery. Aerial video-
graphy, covering approximately 3 percent of the
state, will help to verify the image classification, and
2 to 3 percent of the videography will be verified from
transects on the ground.

4. Determine sources of data for image classification.
Due to the increasing interest in CIS, digital data are
more readily available from a variety of sources,
such as the federal government, state agencies, and
private companies. Acquisition of available data sets
can substantially reduce the time and cost of
database development. Users must bear in mind,
however, that databases are developed with differing
objectives and techniques, so one must consider
scale and standards of production when deciding
which data sets are appropriate for project design.
The West Virginia gap team determined that the
following CIS coverages are important for image
classification.

The U.S. Geological Survey's (USGS's) graphic infor-
mation retrieval and analysis system (GIRAS) ear-
marked land use/land cover data. The classification was
done several years ago, and although these data are not
current, they provide excellent information concerning
urban and agricultural land use. Land-cover categories
represent Level II classifications from Anderson's (15)
system. The maps are produced at a 1:250,000 scale,
so they require few CIS operations to piece together a
regional coverage of land use.

The Southern Forest Experiment Station mapped U.S.
forestland using advanced very high resolution radiome-
ter (AVHRR) satellite imagery (16). This imagery is rela-
tively current (1991 to 1992), but the resolution is coarse
at 1 kilometer per pixel (100 hectares or 247 acres). The
classes are based on Forest Inventory and Analysis
plots established by the U.S. Department of Agriculture
(USDA) Forest Service and Kuchler's (17) potential
natural vegetation types. We are using the maps to
depict broad changes in forest type over a region, such
as the state of West Virginia. This coverage does not
show land use.

The eastern region of the Nature Conservancy has com-
pleted a draft of the classification of the terrestrial com-
munity alliances (18). The classification hierarchy is that
prescribed by the national gap team, and as such, re-
flects physiognomic and floristic characteristics neces-
sary for correlating vegetation structure and floristic
composition with vertebrate habitat requirements. The
descriptions include the range of alliances and charac-
teristic species of the overstory, understory, and herba-
ceous layer. This provides information on associated
species not detectable by image classification.

The National Wetlands Inventory data are available digi-
tally. Maps have been digitized at a scale of 1:24,000,
and the classification scheme is from Cowardin et al.
(19). Coverages come with attribute data for each poly-
gon, arc, or point as needed. A labor-intensive effort is
required to join the maps in a CIS for an area the size
of West Virginia, but the information will be invaluable
for masking water and forested wetlands on the satellite
imagery. The U.S. Fish and Wildlife Service includes
detailed instructions for converting the data to coverages.

Field data, much of it already digital, has been acquired
from many sources. Commercial timber companies pro-
vided data for timber stand composition and age groups.
The USDA Forest Service ecologist conducted transects
throughout the Monongahela National Forest to charac-
terize ecological land units. Forest inventory and analy-
sis plots are also available. We acquired these data to
verify videography classification.

The USGS has digital data of terrain elevations. The
West Virginia gap analysis team acquired 3 arc-second
OEMs as an additional band in the satellite imagery. We
will use these data to generate coverages of slope,
aspect, and elevation classes to further stratify the
physiographic regions of the state. This will increase the
accuracy of the classification.

The Soil Conservation Service created a statewide da-
tabase called STATSGO, produced at a scale of
1:250,000. For West Virginia, the map of the soil map-
ping units consists of approximately 450 polygons. Each
mapping unit is an aggregation of soil components that
occupy a certain percentage of the mapping unit area.
The database is extensive and includes information on
soil attributes such as soil taxonomy, soil chemistry, soil
structure, and interpretations for erodability and wildlife
habitat. For an attribute such as soil temperature, each
component has an individual value so that each
mapping unit may have several different values for soil
258
-------
temperature. Attribute information is difficult to query in
ARC/INFO, where there is a one-to-many relationship
between polygons and database entries (for instance,
each mapping unit, or polygon, has several soil compo-
nents). We found that exporting the attribute information
from ARC/INFO to another software package such as
Excel was easier. The values can be aggregated by
attribute and then imported into ARC/INFO to produce
individual coverages such as soil texture, soil depth, or
soil group. STATSGO data can provide useful informa-
tion for the physical variables that influence vegetation,
such as soil moisture and nutrient availability.

To review, the project researchers will first identify the
physical parameters that govern the distribution of
ecological land units and the existing vegetation in the
state. Then, the team will gather applicable ancillary
data of physical data in CIS to support the image clas-
sification. Once the imagery has been classified, the
wildlife models can be incorporated.

Integrate Terrestrial Vertebrate Models

Once classification is completed for the state, the gap
analysis team will integrate terrestrial vertebrate mod-
els. Concurrent with the image classification, the team
will develop a species profile for each vertebrate species
known to occur in the state. These profiles, when com-
pleted, will be condensed into rule-based models for
associated species that can be linked to the ecological
land units (see Figure 4). This step will create habitat
types. After integration, the team will generate maps that
display species richness of vertebrates for each habitat
type (see Figure 5). These maps link spatial data to the
species database. This enables users to identify areas
in the landscape that combine habitats with the greatest
number of unique species. When a coverage of land-
ownership is overlaid on this map, land managers or
conservation groups can take a proactive stance to seek
protection of critical habitats. Additional analyses that
Wildlife
Database
Landownership
Assess
Protection
Status
Figure 4. Informational flow chart for wildlife data.
users can perform are displays of the potential distribu-
tion of vertebrate groups, such as upland salamanders.
Another analysis would be to report the species that
occur in the fewest habitats and that would be most
vulnerable to landscape changes. Clearly, CIS provides
a powerful environment for quick and efficient retrieval
of spatial data for management decisions.

Summary

To summarize, the West Virginia gap analysis team is
assessing the natural communities in the state as a part
of the national comprehensive planning effort. We need
to conduct the assessment rapidly, compiling existing
information and integrating these data with CIS and
remotely sensed data. Ecological land units are classi-
fied according to the relationship between vegetation
and ecological variables. Satellite imagery is used to
map existing conditions over the state. The existing
vegetation is classified to reflect physiognomic and flor-
istic elements to correlate with vertebrate and butterfly
habitat requirements.

The gap analysis team is using many widely available
data sets such as OEMs, land use/land cover, wetlands
inventory, and soils data. While these can reduce the
time and cost of developing an ecological database,
they do present implications for project design and ac-
curacy. When the user combines maps of different
scales, accuracy is constrained by the map with the
smallest scale. Additionally, data sets may be con-
structed with objectives for an intended use that is not
compatible with project needs. The classification of
AVHRR data reflects forest types but not land use, so
another source may be required for these data.

The Gap Analysis Project is not a substitute for intensive
biological studies at a fine scale. It is merely a quick
assessment at a broad scale to provide information on
existing conditions. While accumulating data and mod-
eling potential wildlife distributions, we will identify
where inventory data may be lacking. Additional work
must be done to verify wildlife models and the classifi-
cation of vegetation, but this preliminary analysis will be
a valuable framework that will direct future studies of
biological diversity. Finally, this effort will provide a data
set that can be used to monitor changes to land cover
and land use.

Acknowledgments

This study is being funded by the Cooperative Research
Units Center, National Biological Survey, United States
Department of the Interior. This manuscript is published
as Scientific Article No. 2499 of the West Virginia Agri-
culture and Forestry Experiment Station.
259
-------
Legend
• Very High
n High
E3 Moderately High
m Moderate
H Moderately Low
IB Low
H Very Low
ea Comm. Timber Lands
H Public Lands
N
Figure 5. Relative species richness.

References

1. Davis, F.W., and J. Dozier. 1990. Information analysis of a spatial
database for ecological land classification. Photogramm. Eng.
and Remote Sensing 56:605-613.
2. Scott, J.M., B. Csuti, K. Smith, J.E. Estes, and S. Caicco. 1991.
Gap analysis of species richness and vegetation cover: an inte-
grated biodiversity conservation strategy. In: Kohm, K., ed. Bal-
ancing on the brink of extinction: the Endangered Species Act
and lessons for the future. Washington, DC: Island Press.
pp. 282-297.
3. Stephenson, S. 1993. Upland forests of West Virginia. Parsons,
WV: McClain Printing Co.
4. Buol, S.W, F.D. Hole, and R.J. McCracken. 1980. Soil genesis
and classification, 2nd ed. Ames, IA: Iowa State University Press.
5. Bailey, R.G. 1994. Ecoregions and subecoregions of the United
States (map). U.S. Department of Agriculture, Forest Service.
Revised February 23.
6. Fortney, R. 1994. Vegetation pattern of central Appalachian
Mountains. (Unpublished).
7. Barbour, M.G., J.H. Burk, and WD. Pitts. 1987. Terrestrial plant
ecology, 2nd ed. Menlo Park, CA: Benjamin/Cummings Publishing
Company.
8. Davis, F.W, D.M. Stoms, J.E. Estes, J. Scepah, and M. Scott.
1990. An information systems approach to the preservation of
biological diversity. Int. J. Geogr. Info. Syst. 4:55-78.
9. Scott, J.M., F. Davis, B. Csuti, R. Noss, B. Butterfield, C. Groves,
H. Anderson, S. Caicco, F. D'Erchia, T.C. Edwards, Jr., J. Ulliman,
and R.G. Wright. 1993. Gap analysis: A geographic approach to
protection of biological diversity. Wildl. Monogr. 123:1-41.
10. Bauer, M.E., T.E. Burk, A.R. Ek, PR. Coppin, S.D. Lime, T.A.
Walsh, O.K. Walters, W Befort, and D.F. Heinzen. 1994. Satellite
inventory of Minnesota forest resources. Photogramm. Eng. and
Remote Sensing 60:287-298.
0 10
Kilometers
11. Benson, A.S., and S.D. DeGloria. 1985. Interpretation of Land-
sat-4 thematic mapper and multispectral scanner data for forest
surveys. Photogramm. Eng. and Remote Sensing 51:1,281-1,289.

12. Hopkins, PR, A.L. Maclean, and T.M. Lillesand. 1988. Assess-
ment of thematic mapper imagery for forestry application under
lake states conditions. Photogramm. Eng. and Remote Sensing
54:61-68.
13. Schriever, J.R., and R.G. Congalton. 1993. Mapping forest cover-
types in New Hampshire using multitemporal Landsat Thematic
Mapper data. Papers presented at ASCM/ASPRS Annual Con-
vention and Exposition, New Orleans, LA (February), pp. 333-342.

14. Cetin, H., T.A. Warner, and D.W Levandowski. 1993. Data clas-
sification, visualization, and enhancement using n-dimensional
probability density functions (nPDF): AVIRIS, TIMS, TM, and geo-
physical applications. Photogramm. Eng. and Remote Sensing
59:1,755-1,764.
15. Anderson, J.R., E.E. Hardy, J.T Roach, and R.E. Witmer. 1976.
A land use and land cover classification system for use with
remote sensor data. U.S. Geological Survey Professional Paper
965. Washington, DC: U.S. Geological Survey.
16. Zhu, Z., and D.L. Evans. 1992. Mapping midsouth forest distri-
butions: AVHRR satellite data and CIS help meet RPA mandate.
J. Forestry 27-30.
17. Kuchler, A.W. 1964. Potential natural vegetation of the contermi-
nous United States (map and manual). American Geographical
Society Special Publication 36. New York, NY: American
Geographical Society.

18. Sneddon, L., M. Anderson, and K. Metzler. 1994. A classification
and description of terrestrial community alliances in the Nature
Conservancy's eastern region: First approximation. Washington,
DC: U.S. Department of the Interior Fish and Wildlife Service.
19. Cowardin, L.M., V Carter, F.C. Golte, and T.E. LaRoe. 1979.
Classification of wetlands and deepwater habitats of the United
States. Report No. FWS/OBS-79/31. Washington, DC: U.S.
Department of the Interior Fish and Wildlife Service.
260
-------
A GIS Strategy for Lake Management Issues
Michael F. Troge
University of Wisconsin, Stevens Point, Wisconsin
Abstract

Lake management plans are crucial to the sustained life
of a lake as it experiences pressures from human as well
as environmental activities. As proven in the past, geo-
graphic information systems (GIS) can meet the needs
of most if not all environmental entities. Applying GIS to
lakes and lake management, however, is a fairly new
concept because most previous work focused on the
terrestrial realm. Future studies must address problems
relating to dimension, but adopting certain methods (i.e.,
cross-sectional coverages) can help lake managers
plan for critical lake issues. By using sufficiently planned
coverages, lake quality data management coverages
can increase storage and/or analysis efficiency. After
evaluating certain management criteria, a lake manage-
ment plan can be derived and set up as a coverage.
These criteria can then correspond collectively to form
management zones within a lake. Each of these zones
has its own set of management goals to which all lake
users must strictly adhere.

Introduction

The importance of maintaining lake quality has long
concerned recreationalists and ecologists. The multifac-
eted interrelationships of the lake environment, how-
ever, usually make proper assessment and analysis of
lake quality information difficult. Over the last decade,
assessment has become easier due to the increased
use and acceptance of geographic information systems
(GIS). This computer-based tool has allowed successful
integration of water quality variables into a comprehen-
sible format.

One area of the environmental sciences that has neglected
GIS is lake management. This paper presents an alternative
method for using a traditional two-dimensional GIS for
viewing, querying, and displaying three-dimensional in-
formation—in this case, lake quality information and
lake management criteria. Lakes, unlike geologic enti-
ties, offer a three-dimensional realm that humans can
fully penetrate without a great amount of effort. Lakes
also contain a complete aquatic environment of physi-
cal, chemical, and biological entities that humans can
effectively observe and analyze. This paper does not dis-
cuss the issue of dimension; however, future studies, pri-
marily those relating to the creation of three-dimensional
GIS, should address this issue.

GIS allows incorporation of a multitude of environ-
mental variables (e.g., water chemistry, geologic
strata) into a synergism of the many coexisting vari-
ables of the lake environment. The ability of a GIS to
"capture, manipulate, process, and display spatial or
georeferenced data" is now well known and accepted
(1). Surprisingly though, GIS is rarely used for lake
management databases and associated water quality
analysis.

The few examples that exist include Schoolmaster's
Texas Water Development Board System (2), which
examined water use on a county basis, and RAISON
GIS (3), which is an expert computer system imple-
menting proper application of hydrologic principles to
a particular lake. Many other systems are simply data-
base collection storehouses of lake information, such as
the Galveston Bay National Estuary Program (4).

One notable example is the LAKEMAP program (5).
This extensive and comprehensive GIS spans the
entire United States covering approximately 800,000
lake sampling sites from the U.S. Environmental Pro-
tection Agency's (EPA's) STORET system. LAKEMAP
uses both a database management and mapping dis-
play system, allowing retrieval of information for spe-
cific sites or aggregation of regional areas. This GIS
is unique because it examined the creation of stand-
ards that could be used across the country in data-
base development and the presentation of that data.

Using GIS in a lake management study inspires many
questions because of the lack of existing research and
the absence of any true standards. For example, how
should one create a lake quality database for general
purpose management? Is visualizing the integration of
several variables within the lake ecology possible? Can
261
-------
one examine temporal changes in pH? Many technical
and logistical CIS questions therefore existed when the
Legend Lake study began.

Background of the Legend Lake Study

Legend Lake (see Figure 1) is located in Menominee
Reservation, which is in Menominee County in north-
eastern Wisconsin. Legend Lake is a 1,230-acre im-
poundment comprising eight natural drainage lakes that
a single stream once connected. In the late 1960s, a
plan was introduced to convert this area into an im-
poundment/recreational area, and construction soon be-
gan. The ecology and hydrology had not been seriously
evaluated since the development was finalized, hence
the Legend Lake project was designed in cooperation
with EPA, Wisconsin Department of Natural Resources
(WDNR), Menominee Reservation and County, Legend
Lake District, and the Legend Lake Property Owners
Association. This intensive study spanned the qualita-
tive and quantitative aspects of surface water, ground
water, sediment, and aquatic plants, as well as human
influences on the Legend Lake watershed and sur-
rounding areas.
Figure 1. Legend Lake with basin identifiers.

Initially, questions needed to be answered regarding
aquatic plants and sediment and their influences on lake
management strategies. Because the scope of the Leg-
end Lake study included subjects such as surface- and
ground-water chemistry, land use and development,
septic system impacts, and recreational stress, all these
factors needed to be considered in determining optimum
management strategies. In addition, the study ad-
dressed the question of GIS's ability to alleviate some
technical aspects of deriving and presenting a lake man-
agement plan. Given these questions, the goal was to
integrate collected data into a CIS database to create a
prototype standard for future lake studies, as well as to
present new techniques for visualization and analysis of
lake quality data.

CIS/Database Creation

Several techniques were used to best manage the data
for a three-dimensional system: cross-sectional cover-
ages (described later), data summary coverages, and
multidate coverages. The latter two coverage types are
traditional coverages that contain general lake informa-
tion excluding water column data. These techniques
work fairly well for data storage and visualization; how-
ever, they were not sufficient for determining the geo-
graphic areas within the lake that required intensive
management decisions as opposed to areas that
needed little attention.

During formulation of a new coverage design, the study
focused on the littoral zone, which is usually defined as
that area of the lake with a depth of 15 feet or less. Most
management concerns deal with the littoral zone be-
cause most recreational and ecological activities occur
in this zone. One of the most pressing issues in lake
management is aquatic plant growth. The fact that most
aquatic plant growth is confined to the littoral zone rein-
forced the decision to use the littoral zone as the primary
sink for potential management decisions.

To curtail the dimensional problem, depth was basically
ignored. This allowed for easier delineation of areas
within the lake. This, in turn, facilitated classifying areas
into management zones to implement varying degrees
of activity, ranging from casual to intensive efforts. Thus,
management recommendations for a particular zone
were the same at a depth of 2 feet as at a depth of 12
feet. This greatly reduced complication of the model and
centered effort on the areal extent of the lake. It also
facilitated visualization of management decisions by
professionals and lay people.

Problems can arise when combining depths for man-
agement considerations, as this technique did. For ex-
ample, a littoral zone that contains a gentle slope usually
does not receive the same attention as a littoral zone
with a very abrupt slope because the littoral zone with
the gentle slope contains more area. Thus, if a lake
manager recommends restricting boat traffic in a man-
agement zone (a section of the littoral zone) that has a
gentle slope to promote wildlife habitat, the amount of
area available for boaters would decrease significantly.
In this situation, primary activity would be most crucial
in shallow areas where wildlife or waterfowl predominate
rather than in more open water areas where boaters
predominate. Situations like these may require the crea-
tion of management subzones when setting up a lake
management CIS and database to increase efficiency
of lake area use and increase support by lake users.

Many criteria can affect the decisions made for a
management zone. For instance, if the lake manager
recognizes excessive, unhealthy weed growth in a par-
ticular zone, the lake manager may recommend exten-
sive weed harvesting to neutralize the situation. An
adjacent zone may have very little weed growth and
may not require weed harvesting. Criteria such as these
must be recognized before constructing the CIS. Table 1
lists some common criteria to consider. Generally, man-
agement criteria include anything that is influent on the
262
-------
Table 1. Criteria To Consider When Creating Lake
Management Plans
Environmental Variables
Artificial Variables
Aquatic plants

Sediment

Ground water

Surface water

Wildlife/waterfowl

Fisheries

Climate

Wind

Geology/geography

Adjacent natural land cover

Natural nutrient loading

Hydrologic characteristics
Adjacent land use

Septic systems

Development

Construction sites

Fuel leaks

Shoreland zoning

Population density

Primary lake use (recreational)

Nutrient loading

Visitor use
shoreline and lake itself and any influence the lake has
on the shoreline or adjacent shorelands.

A primary concern when accessing lake data from a
computer database is being sure to query the correct
lake. For example, searching a database for all data on
"Sand Lake" would be a legitimate action, except that
the database may include close to 200 lakes with the
name of Sand Lake. This is one of the main reasons why
the WDNR developed a system known as the Water
Mileage System (6). Based on logical criteria, each
water body (e.g., lakes, streams, sloughs) receives a
unique six- or seven-digit number called the waterbody
number. Thus, if the number 197900, assigned to Sand
Lake near Legend Lake, is the query subject, then the
output should include all data for this particular Sand
Lake. Because having a unique identifier for each spe-
cific entity in a CIS database is ideal, the waterbody
number was used, and all sampling performed on this
lake will be linked with this number.

Examples of Types of Coverage

Management Zone Coverages

Figure 2 and Table 2 together show how a potential lake
management CIS and plan might work. The lake man-
ager can easily manage and frequently update this sys-
tem if necessary, or the system can serve as a long-term
plan to consult for all decision-making. A plan of this sort
specifically emphasizes areas that need intensive man-
agement over areas that may need frequent monitoring.
It provides specific instructions for plan implementation,
leaving little guess-work forthe manager. This technique
is also visually informative to the lake user because the
user can easily discern areas of concern. This example
is hypothetical, but a plan is being formulated based on
the information collected during the Legend Lake study.
Cross-Sectional Coverages

Another technique currently included in the Legend
Lake study entails the z dimension. Cross-sectional
views of each lake basin, derived from 1992 lake con-
tour maps, provided a more detailed description of the
lake bottom. These cross sections were then digitized
and transformed into CIS coverages. For each lake
basin, 22 tests were conducted on the deepest part of
the lake at several different depth intervals along the
water column. These data provided valuable information
on the way various chemical and biological attributes
react to depth. Using CIS, a point could represent each
depth where data were collected. These points could
actually act as labels for polygons based on depth.

For example, if performing a series of analyses on a lake
(maximum depth of 10 feet) at 3-foot, 5-foot, and 8-foot
depth intervals, the labels on the cross-sectional cover-
age would be placed at these respective depths. Thus,
labels would be positioned at depths of 3, 5, and 8 feet.
Because these labels represent cross-sectional poly-
gons, the 3-foot depth label may represent a polygon
with boundaries at 0 feet and 4 feet. The 5-foot depth
label may represent a polygon with boundaries at 4 feet
and 6 feet, and the 8-foot depth label may represent a
polygon with boundaries at 6 feet and the lake bottom
(10 feet). Those who are familiar with lake ecology
understand that no clear-cut boundaries distinguish
where chemical values jump from one measurement to
another without a gradual transition. All users of a lake
quality CIS must be made aware of these types of
inaccuracies (see Figure 3).
Figure 2. Hypothetical management zones for a section of Legend Lake that correspond to the management plans in Table 2; black
areas indicate depths greater than 15 feet.
263
-------
Table 2. Hypothetical Management Plans for a Section of Legend Lake (see Figure 2)

Management Zone Management Plan (brief explanations)
I
IV

VII

VIII

XII

XIII

XIV

XV
High recreation area, high plant growth, frequent harvesting; frequently monitor water quality

Moderate recreation; manage for fish habitat

Open water, high recreation/possible fish habitat; consider subzoning

High-grade wildlife habitat; restrict human contact

Moderate recreation; manage for fish habitat

Open water, high recreation, increased shoreland development; frequently monitor water quality

Adjacent to high recreation area, possible fish habitat; manage for fish and aquatic habitat

Adjacent to high recreation area, shoreland development; monitor water quality

Increased development; frequently monitor water quality

aPrime wildlife/waterfowl habitat adjacent to high recreation area; restrict human presence (hot spot)

Open water, high recreation area; frequently monitor water quality

Open water, high recreation area, possible fisheries and wildlife habitat; consider subzoning

Excessive aquatic plant growth (species listed) choking out preferred species; potential wildlife/waterfowl
habitat, fisheries potential; continual harvesting; restrict human presence

High-grade slope with little plant growth, potential for increased sedimentation; monitor shoreland development
Critical area between good habitat and high recreation area; monitor extensively.
Figure 3 shows a cross section from one of the larger basins,
Basin F, in the Legend Lake system (see Figure 1). The
shades of gray represent ranges of temperature, with the
lightest being the coldest and the darkest being the
warmest. The thermocline can be located roughly in the
middle. Each colored section represents a depth range
where certain chemical attributes were collected. Using
ARC/VIEW, the user can choose these areas with a
pointer (mouse) and gain access to the database that
contains all the sample results for this depth range.

Conclusion

These ideas are still preliminary as the Legend Lake
study analysis concludes. Clear-cut discussions and
recommendations will become available at a later date,
although some observations can be made at this point.
First, future research should focus on creating and de-
veloping a three-dimensional CIS, not to be confused
with three-dimensional or cartographic models. Ideally,
lake management plans should consider depth. This
paper did not include depth because of the dimensional
factor. Depth was not compatible with ourtwo-dimensional
CIS, and the presentation quality was not sufficient to relay
our results in the form of management zones. We must
develop a three-dimensional CIS to address the problems
of the three-dimensional environment in which we live.

Lastly, maps are the best form for communicating this
information to professionals and the public. Maps can
also easily confuse people, however. Proper carto-
graphic techniques are a necessity (7). Significant effort
must be devoted to map creation to ensure a successful
plan and successful relationships between lake manag-
ers and lake users. CIS and map-making are closely
related. Both the planning stages and the database
development phase of the lake quality CIS should em-
phasize this point. At an early stage of the process,
management criteria should be determined, and all play-
ers or potential players must be included. A poorly
planned project can lead to a failed CIS.

Creativity may offer new ideas in map development. For
instance, animation (8) has some unique traits. Trend
analysis using animation may produce the best visual
results. Techniques such as these augment our methods
of communication, and some are very revolutionary.
Remember, however, that cartographic principles still
must apply.

Acknowledgments

I would like to thank Dr. Keith Rice of the University of
Wisconsin, Stevens Point, for his review efforts as well
as Dr. Byron Shaw and Steven Weber for their technical
support on lake management issues. Also, I would like
Figure 3. Cross section from one of the larger basins, Basin F,
in the Legend Lake system.
264
-------
to thank members of the University of Wisconsin,
Stevens Point, Geography/Geology Department for their
support and for the use of their equipment.

References

1. Fedra, K. 1993. Models, GIS, and expert systems: Integrated
water resources models. Proceedings of the HydroGIS '93: Appli-
cation of Geographic Information Systems in Hydrology and Water
Resources Conference, Vienna. International Association of Hy-
drological Sciences Publication No. 211.

2. Schoolmaster, F.A., and P.G. Marr. 1992. Geographic information
systems as a tool in water use data management. Water Re-
sources Bull. 28(2):331-336.

3. Lam, D.C.L., and D.A. Swayne. 1993. An expert system approach
of integrating hydrological database, models and CIS: Application
of the RAISON system. Proceedings of the HydroGIS '93: Appli-
cation of Geographic Information Systems in Hydrology and Water
Resources Conference, Vienna. International Association of Hy-
drological Sciences Publication No. 211.
4. Rifai, H.S., C.J. Newell, and P.B. Bedient. 1993. CIS enhances
water quality modeling. CIS World 6(8):52-55.

5. Samuels, WB. 1993. LAKEMAP: A 2-D and 3-D mapping system
for visualizing water quality data in lakes. Water Resources Bull.
29(6):917-922.

6. Fago, D. 1988. Retrieval and analysis system used in Wisconsin's
statewide fish distribution survey, 2nd ed. Report 148. Wisconsin
Department of Natural Resources Bureau of Research.

7. Ahner, A.L. 1993. Modern cartography plays essential role in GIS.
CIS World 6(10):48-50.

8. Loucks, D.P. 1993. Interactive multimedia, GIS, and water re-
sources simulation. Proceedings of the HydroGIS '93: Application
of Geographic Information Systems in Hydrology and Water Re-
sources Conference, Vienna. International Association of Hydro-
logical Sciences Publication No. 211.
265
-------
A Watershed-Oriented Database for Regional Cumulative Impact Assessment
and Land Use Planning
Steven J. Stichter
Division of Coastal Management, State of North Carolina Department of Environment, Health,
and Natural Resources, Raleigh, North Carolina
Introduction

In 1974, North Carolina passed the Coastal Area Man-
agement Act (CAMA) to guide growth and development
in the state's coastal zone. Today, the Division of Coastal
Management (DCM), under the direction of the gover-
nor-appointed Coastal Resources Council, implements
CAMA. DCM's jurisdiction covers the 20 counties that
border either the Atlantic Ocean or the Albemarle-Pamlico
estuary.

This coastal region comprises a diverse set of human,
animal, and plant communities. Abroad array of coastal
plain ecosystems occurs in this area, from the barrier
dunes and maritime forests of the outer banks to cedar
swamps and large pocosin complexes of interior areas.
This area includes some of the state's fastest growing
counties and some that are losing population. Urban
centers such as Wilmington do exist, but the region
remains primarily rural.

In recognition of the 20th anniversary of the passage of
CAMA, the governor designated 1994 as the "Year of
the Coast." Associated celebrations, panels, and studies
highlighted the unique features of the North Carolina
coast, successes of coastal management in the state,
and unresolved problems and concerns. Problems re-
main despite protection efforts by various agencies. For
instance:

• Fish landings have dropped dramatically of late.

• Shellfish Sanitation recently closed a set of shellfish
beds located in outstanding resource waters.

• Shellfish statistics show that the quality of the state's
most productive coastal waters continues to decline.

Because coastal North Carolina as a whole is growing
more rapidly than any other section of the state,
pressures on coastal resources can only continue to
increase.
Declining water quality and associated sensitive habi-
tats, resources, and animal populations have prompted
several state agencies to develop new approaches to
environmental protection that incorporate a broader,
natural systems perspective. The North Carolina Divi-
sion of Environmental Management is developing river
basin plans to guide point and nonpoint water pollution
control efforts. The DCM has begun work to assess and
manage the cumulative and secondary impacts of de-
velopment and other land-based activities by using
coastal watersheds as the basis for analysis. The goal
of this work is to expand the regulatory and planning
programs in order to better address cumulative impacts.
This paper describes the approach that DCM has devel-
oped for cumulative impacts management, with special
emphasis on the use of a geographic information system
(CIS). The project described here is scheduled for com-
pletion by the fall of 1996.

Cumulative Impacts Management

The concept of cumulative impacts management (1) is
not new to North Carolina's coastal program. CAMA
requires the consideration of cumulative impacts when
evaluating development permits within defined areas of
environmental concern. A permit must be denied if "the
proposed development would contribute to cumulative
effects that would be inconsistent with the guidelines.. .."
Cumulative effects are defined as "impacts attributable
to the collective effects of a number of projects and
include the effects of additional projects similar to the
requested permit in areas available for development in
the vicinity" (2). Despite this directive, few permitting
actions have been denied because of cumulative ef-
fects; the existence of limited impact data and a dearth
of viable analysis approaches have restricted applica-
tion of this rule.

Since the passage of the National Environmental Policy
Act (NEPA) in 1969, many attempts have been made to
266
-------
define and assess cumulative impacts. The Council on
Environmental Quality developed the most familiar defi-
nition in its guidelines for NEPA implementation. It de-
fines cumulative impact as:

... the impact on the environment which results
from the incremental impact of the action when
added to other past, present, and reasonably fore-
seeable future actions, regardless of what agency
or person undertakes such other actions. Cumula-
tive impacts can result from individually minor but
collectively significant actions taking place over a
period of time (3).

Although this definition focuses the discussion of cumu-
lative impacts, it provides little guidance on how to carry
out such an analysis. Selecting both an appropriate time
frame for the assessment (how far into the past and
future to carry the analysis) and appropriate boundaries
for the study (municipal or county boundaries, water-
sheds, ecoregions) are but two of the questions that
require answers to successfully investigate cumulative
impacts. Such decisions become even more complex
when incorporating the limits imposed by available data
and existing management structures.

Rigorous cumulative impact analysis is a difficult propo-
sition. It requires identification of all sources of degrada-
tion that affect a given resource. The next step involves
assigning relative significance to each of these sources
along with any impacts that result from additive or syn-
ergistic interaction between sources. Assessment of the
impacts of a pier on surrounding sea grasses, for in-
stance, must include not only impacts related to the
structure, such as shading and wave or current changes,
but also such ambient impacts as natural wave and wind
effects, upland runoff, and varying salinity.

All these investigations require the availability or collec-
tion of baseline environmental data at an appropriate
spatial and temporal scale. Quantifying all the sources
and causal pathways that affect a resource is extremely
complicated in all but the simplest of systems. Assigning
proof of significant impact is difficult unless the cause is
clear and direct.

Because of the difficulties associated with assigning
cause in cumulative impact analysis, especially in a
regional review, DCM has chosen a different approach.
It is focusing instead on locating areas at high risk to
cumulative impacts. Impacts management studies and
responses can then target the areas at greatest risk of
degradation. Changing the scale of analysis from the
site to the region requires applying some simplifying
assumptions. The first assumes that any existing re-
source degradation results from the cumulative impact
of all sources within the system boundaries. Locating
such areas is relatively straightforward because most
natural resource fields have developed measurements
and indicators for locating degraded resources. The
second assumption claims that a sufficiently intensive
concentration of activities within a limited area will result
in cumulative impacts on the affected system. Determin-
ing a threshold beyond which impacts cause degrada-
tion is much harder than locating already degraded
resources because the level of such a threshold de-
pends upon both the strength or spatial concentration of
the impacts and the sensitivity of the resource.

Working from these simplifying assumptions, DCM's
first step in assessing regional cumulative impacts is to
identify areas within coastal North Carolina that exhibit
symptoms of resource degradation, contain a concen-
tration of activities that affect resources, or contain a
concentration of sensitive resources. The use of catego-
ries of resources and impacts have helped to focus this
search. These eight cumulative impact, high-risk area
categories are:

• Impaired water quality

• High potential for water quality impairment

• Sensitive ground-water resources

• Impaired air quality (present or potential)

• Historical rapid growth

• Anticipated high growth

• High-value resources

• Productive and aesthetic resources

This set of categories is presently under public review. The
next step is to develop indicators of the presence of im-
pacts or resources appropriate to each of these categories.
These indicators, when applied to a database of information
about the study area, will help identify those locations at
high risk as defined by the eight categories.

The regional cumulative impacts assessment approach
that DCM developed is a hybrid of various assessment
techniques. The overall approach is grounded in the
theory and methods of site-specific cumulative impact
assessment. Determination of high-risk categories and
appropriate indicators and indexes is closely associated
with both relative risk assessment procedures and geo-
graphic targeting. By focusing on known causes and
effects of cumulative impacts on terrestrial and aquatic
natural resources instead of attempting to quantify all
impact pathways, available data and analysis techniques
can help assess relative risk of cumulative impacts.

A Watershed Database for Cumulative
Impact Assessment

High-risk categories and indicators of degradation or
sensitivity are useless without information on the location
of sensitive resources and impact sites. Consequently,
a comprehensive database of information about coastal
267
-------
North Carolina is central to cumulative impacts manage-
ment in this area. The form of any database determines
what types of questions to ask it; the selection of
boundaries has been central to this study.

County boundaries constitute the most typical reporting
unit in DCM operations. Counties determine the
boundaries of DCM's jurisdiction, and the great majority
of statistics used in planning and assessment are avail-
able primarily or solely by county. County size, hetero-
geneity, and the small number of counties available for
comparison, however, have made county boundaries
inappropriate for this project. Because the study focuses
on impacts on natural resources, clearly the most appro-
priate boundaries would relate more directly to those
resources.

Although using a single set of boundaries may not be
appropriate for assessing impacts on all resources,
management constraints limit the choice to one bound-
ary type. Because the primary resources of concern are
water based, watersheds were considered most appro-
priate. Surface waters receive the integrated effects of
activities within a watershed; such boundaries fit intui-
tively with the concept of cumulative impact assess-
ment. The number of water-related resources of concern
also supported this choice. This analysis used small
watersheds (5,000 to 50,000 acres) delineated in 1993
by the Soil Conservation Service for the entire state of
North Carolina.

The Population, Development, and Resources Informa-
tion System (PDRIS), which was designed for this pro-
ject, is a PC-based, watershed-oriented database that
contains the following information about the coastal
area:

• Natural resources

• Population and housing

• Agricultural activities

• Economic activities

• Development activities

Table 1 includes a list of database fields. The presence
and extent (or absence) of each of the features that this
database represents will be available for each coastal
watershed. The small watershed orientation of this study
is only possible because of the availability of CIS; the
volume and complexity of the watershed boundaries
preclude any other assessment tool. In fact, 348 of these
watersheds fall wholly or partially in the 20-county re-
gion. Figure 1 shows a map of these small watersheds.
This map indicates county boundaries and shorelines in
solid lines and the watershed boundaries in gray.
Data Needs and GIS Analysis

Over the past 5 years, North Carolina has actively col-
lected a large amount of natural resource and base map
information in GIS form. Research and funding associated
with the Albemarle/Pamlico Estuarine Study (APES), a
national estuary program study, spurred much of this
data development in the coastal area. The state main-
tains a central repository for geographic data at the
North Carolina Center for Geographic Information and
Analysis (CGIA). Table 2 lists the general types of infor-
mation available from the state database. The availabil-
ity of data in GIS form is but one criterion for selecting
a data set for use in this analysis. To be useful, the scale
and accuracy of the data must be appropriate to the
analysis.

Data Scale

The majority of data in the state's GIS database was
collected at a scale of 1:100,000. Broader use and
interest will probably urge the development of data lay-
ers at finer scales. A recently released layer of closed
shellfish waters, for instance, was created at 1:24,000
scale. This prompted an update of the associated shore-
line coverage to the same base scale. A handful of state
departments and divisions, including Coastal Manage-
ment, now use global positioning systems to collect
even more precise locational information. This scale
suits DCM's regional cumulative impacts scan, which is
based on summary values for entire watersheds. More
detailed intrawatershed planning and analysis would
require finer scale data. A scale of 1:24,000 delineated
the watershed boundaries in this project.

Mixing these 1:24,000 boundaries with the 1:100,000
data sets, however, can cause problems. For instance,
a number of watersheds were designated for the large
open water areas in Albemarle and Pamlico sounds.
Although these should comprise exclusively water, over-
lay analysis of these watershed boundaries on the Tl-
GER-derived census boundaries (1:100,000 scale)
resulted in the assignment of small population counts to
some of these watersheds. Individually locating and
correcting such discrepancies is necessary.

Database Accuracy

Data layer accuracy problems are difficult to identify and
assess. Because other agencies developed the majority
of data used in this project, these source agencies must
be relied upon for accuracy assessment of the source
data. CGIA, steward of the state GIS database, adheres
to National Map Accuracy Standards for all GIS data that
it maintains. CGIA delivers metadata reports with any
data; these reports include the source agency, collection
date, and scale for the information used to derive the
GIS layer. Descriptions of data lineage (collection and
processing procedures), completeness, and positional
268
-------
Table 1. Population, Development, and Resource Information System: Database Fields
Agriculture: Livestock and Poultry
Beef feedlots (< 300 head, > 300 head)
Dairy farms (< 70 head, > 70 head)
Hog farms (< 200 head, > 200 head)
Horse stables (< 200 head, > 200 head)
Poultry farms (< 15,000 birds, > 15,000 birds)
Agriculture: Farming
Land in farms (acres, % of HU)
Land with best mgmt. practices (acres, % of HU)
Land w/o best mgmt. practices (acres, % of HU)
Land in conservation tillage (acres, % of HU)
Land w/o conservation tillage (acres, % of HU)
Harvested cropland (acres, % of HU)
Hay crops (acres, % of HU)
Irrigated land (acres, % of HU)
Pasture land (acres, % of HU)
Row crops (acres, % of HU)
Primary
Estuarine waters (acres, % of HU)
Freshwater lakes
HU name
Receiving HU
Receiving water body
Primary water body
Secondary water body
Shoreline
Waterways w/vegetated buffers (miles, % of HU)
Population 1970
Population 1980
Population 1990
Population growth 1970 to 1980
Population growth 1980 to 1990
Counties
Total HU size
Land area (acres, % of HU)
Water area (acres, % of HU)
Stream length (miles)
Stream order (miles, % of stream length)
Development
Building permits—all residential
Building permits—amusement/recreation
Building permits—multifamily residential
Building permits—one-family residential
Building permits—hotels and motels
Building permits—retail
Building permits—industrial
Highway mileage:
Total (miles)
Primary (miles, % of total)
Secondary (miles, % of total)
Paved (miles, % of total)
Unpaved (miles, % of total)
Rail lines (miles)
Increase of primary & secondary roads (miles, %)
Increase of paved vs. unpaved roads (miles, %)
Economic
Ag-related business (number, employees, income)
Farms (number, employees, income)
Fisheries business (number, employees, income)
Forestry/wood-using business (number, employees, income)
Lodging establishments (number, employees, income)
Manufacturing establishments (number, employees, income)
Marinas (number, employees, income)
Mining establishments (number, employees, income)
Recreation business (number, employees, income)
Restaurants (number, employees, income)
Retail establishments (number, employees, income)
Ground Water
Ground-water contamination incidents
Ground-water class (acres, % of HU)
Ground-water contamination area (acres, % of HU)
Ground-water capacity use areas (acres, % of HU)
Land and Estuarine Resources
Anadromous fish streams (miles, % of streams)
Coastal reserve waters (acres, % of HU)
Coastal reserve lands (acres, % of HU)
Federal ownership:
National parks (acres, % of HU)
National forests (acres, % of HU)
Military reservations (acres, % of HU)
USFWS refuges (acres, % of HU)
Federal ownership—other (acres, % of HU)
State ownership:
Game lands (acres, % of HU)
State parks (acres, % of HU)
State forests (acres, % of HU)
State ownership—other (acres, % of HU)
Natural heritage inventory sites (count)
Primary nursery areas (acres, % of water area)
Private preservation (acres, % of HU)
Secondary nursery areas (acres, % of water area)
Threatened/endangered species habitat
Water supply watersheds (acres, % of HU)

Land Use
Total wetland area (acres, % of HU)
High-value wetlands (acres, % of HU)
Medium-value wetlands (acres, % of HU)
Low-value wetlands (acres, % of HU)
Predominant land cover

Population and Housing
Average seasonal population
Peak seasonal population
Units without indoor plumbing
Units with septic tanks
Units on central water systems
Units on central sewer
Units with wells

Permits
Air emission permits—PSD
Air emission permits—toxic
CAMA minor permits
CAMA general permits
CAMA major permits
CAMA exemptions
CWA Sect. 404/10 permits
Landfill permits—municipal
Landfill permits—industrial
Nondischarge permits
NPDES permits—industrial
NPDES permits—other
NPDES permits—POTW
Stormwater discharge permits
Sedimentation control plans
Septic tank permits

Shellfish
Shellfish waters (acres, % of HU)
Shellfish closures—permanent (acres, % of HU)
Shellfish closures—temporary (acres, % of HU)

Water Quality—Open Water
Class B waters (acres, % of water area)
Class C waters (acres, % of water area)
HQW waters (acres, % of water area)
NSW waters (acres, % of water area)
ORW waters (acres, % of water area)
Swamp waters (acres, % of water area)
SA waters (acres, % of water area)
SB waters (acres, % of water area)
SC waters (acres, % of water area)
WS-I waters (acres, % of water area)
WS-II waters (acres, % of water area)
WS-III waters (acres, % of water area)
269
-------
Table 1. Population, Development, and Resource Information System: Database Fields (Continued)
Water Quality—Streams
Class B streams (miles, % of streams)
Class C streams (miles, % of streams)
HQW streams (miles, % of streams)
NSW streams (miles, % of streams)
ORW streams (miles, % of streams)
Swamp water streams (miles, % of streams)
SA streams (miles, % of streams)
SB streams (miles, % of streams)
SC streams (miles, % of streams)
WS-I streams (miles, % of streams)
WS-II streams (miles, % of streams)
WS-III streams (miles, % of streams)
Key
HU = hydrologic unit
PSD = point source discharges
POTW = publicly owned treatment work
NPDES = National Pollutant Discharge Elimination System
HQW = high-quality waters
NSW = nutrient-sensitive waters
ORW = outstanding resource waters
SA = saltwater classification A
SB = saltwater classification B
SC = saltwater classification C
WS1 = water supply classification 1
WS2: water supply classification 2
WS3: water supply classification 3
Water Quality—Use Support
Algal blooms (count, extent/severity)
Fish kills (count, extent/severity)
Streams fully supporting (miles, % of streams)
Streams support threatened (miles, % of streams)
Streams partially supporting (miles, % of streams)
Streams nonsupporting (miles, % of streams)
Waters fully supporting (acres, % of water area)
Waters support threatened (acres, % of water area)
Water partially supporting (acres, % of water area)
Waters nonsupporting (acres, % of water area)
accuracy are not available from these standard metadata
reports, however.

DCM's cumulative impacts analysis also incorporates
information not available from the state CIS database.
Some of this information, such as business locations, is
available from private data providers. Other information,
especially agricultural statistics, does not presently exist
in CIS form. Non-GIS formats include county statistics,
voluntary compliance databases with self-reported co-
ordinates, and other tabular databases. Typically, little
quality control has been performed on any coordinate
information. When the data originate from other state
agencies, DCM is often the first user of the data outside
of the source agency.

CIS Analysis Procedures

This study involves no sophisticated CIS analysis pro-
cedures. CIS helps to generate summary statistics by
watershed for each of the database features. CIS draw-
ing and query operations allow analysis of database
accuracy. If the data are acceptable, the next step re-
quires overlaying the watershed boundaries on the fea-
ture and assigning the appropriate watershed codes to
all features that fall within the study area. Statistics can
then be generated on the number of points, length of
lines, acreage of polygons, or a total of any other nu-
meric field in the feature coverage. Finally, the resulting
summary file is converted to the format that PDRIS
requires. A macro has been developed to complete this
analysis. This macro generates a page-size reference
map, performs the overlay, generates the watershed
summary statistics, and converts the statistics to the
final PC format.

Extra steps are necessary to analyze any information
that does not already exist as a CIS coverage. Typically,
these are tabular summaries associated with a specific
boundary layer, such as county or U.S. Census statis-
tics. These cases entail overlaying the watershed
boundaries on the reporting unit boundaries; the data
are distributed to the watershed in direct proportion to
the percentage of the unit that falls into the watershed.

For instance, if a census tract falls 30 percent into
watershed A and 70 percent into watershed B, 30 per-
cent of the total tract population will be assigned to
watershed A and the remainder to B. After performing all
assignments, summary statistics are again generated
by watershed. This procedure assumes that the distri-
bution of the feature is even across each reporting unit.
Rarely is this a valid assumption, but when the units are
considerably smaller than the watersheds, as is the
case with census tracts and blocks, this assumption
introduces only limited errors. Watershed estimates
based solely on county statistics, however, can be
grossly inaccurate. When working with county informa-
tion, therefore, using covariate information that ties
more precisely to specific locations is necessary. Crop-
land location derived from the LANDSAT land cover
layer, for instance, can be used to better distribute
county-level agricultural statistics.

Database and Analysis Documentation

Data documentation is essential to this project. Given
the large number of fields in the final database and the
270
-------
Figure 1. Watersheds in the North Carolina coastal area.
correspondingly large number of data types and
sources, such documentation is key to understanding
the quality of the individual database components as
well as easing future database additions and updates.
Because the results of this cumulative impact analysis
exercise will be used to extend DCM's resource man-
agement efforts, documenting data sources and analy-
ses will be critical if any decisions made based on this
information are disputed.
A metadata database has been developed to document
PDRIS data sources and analysis procedures. For each
Table 2. A Sample of North Carolina GIS Database Contents
Type Examples
database entry, fields exist for a description and contact,
collection methodology, and geographic extent of the
source data. Data selection, overlay, and conversion
procedures are also documented, along with any as-
sumptions made in the analysis. In addition, recording
data source, analysis procedure, and the date facilitates
future database updates. Fields also record accuracy
assessments for positional and attribute accuracy, logi-
cal consistency, and completeness.
The restrictions listed above regarding source data ac-
curacy assessment, however, have limited their use.
Once an entry is made to the PDRIS, all project team
members receive metadata reports along with a refer-
ence map for a final review of completeness of the
source data, data selection, and analysis logic. Figure 2
shows an example of a blank metadata worksheet.
Status of the Cumulative Impacts
Assessment
DCM is presently gathering, verifying, and analyzing
information for entry into the PDRIS. Although each
source data layer was checked for accuracy before use,
the logical consistency of each of the database entries
relative to the other components also needs addressing.
One example of such database inconsistencies is the
watersheds that are covered entirely by water but also,
according to the database, support a resident popula-
tion. These inconsistencies could result from problems
related to scale, differing category definitions, data inac-
curacies, or errors in the GIS conversion or analysis at
DCM. Database precision is essential for an accurate
analysis and for general support of DCM's cumulative
Coverage
Natural resource
Base data
Ownership
Permits, waste sites

Cultural, population
Fishery nursery areas
Natural heritage sites
Detailed soils
Closed shellfish areas
Water quality use classes
Detailed wetlands maps
Hydrography (24K, 100K)
Roads/transportation
County and city boundaries
LANDSAT-derived land cover
Federal and state ownership
NPDES permit site
Landfills, hazardous waste, Superfund
TIGER boundaries, census information
Historic register sites, districts
Coastal North Carolina
Statewide
Varied
Coastal North Carolina
Statewide
Varied
Statewide
Statewide
Statewide
Coastal North Carolina
Statewide
Statewide
Statewide
Statewide
Statewide
271
-------
General Information:
Field
Description
Database
Definition
Source Data Description:
Contact
Data
Scale
Sample Method
Geographic Extent
Database Entry:
Procedures
Assumptions
Accuracy Assessment:
Rating
Logic Test
Error Comment
Value Range
Update Procedure:
Next Update
Procedure
Overall
Positional
Units
Attribute
Logical
Consistency
Completeness
Source
Figure 2. Population/Development database: Data dictionary

impacts approach. Because the watershed database pro-
duced for this project will be widely available, errors and
inconsistencies will undermine support for the rest of the
project. Careful documentation of data sources, limita-
tions, and analysis assumptions and procedures will pro-
vide useful support should problems or concerns arise.

Once database development is sufficiently complete
(the database encompasses much dynamic data and
could be constantly updated), indexes describing each
of the cumulative impact, high-risk areas must be final-
ized. Applying these indexes to the database will allow
identification of the watersheds at highest risk to cumu-
lative impacts. Discussions held concurrently with index
development will determine which management re-
sponses are appropriate to each high-risk category.
Possibilities include strengthened land use planning re-
quirements, new permit standards, or the designation of
a new type of environmental critical area.

Although the data-intensive approach that DCM has
chosen relies heavily on a CIS, the greatest challenges
in this project do not lie in the CIS analysis. Applying this
watershed-based analysis to existing political jurisdic-
tions will be a more difficult undertaking. A convincing
demonstration of the importance of including a natural
systems perspective into a development permitting sys-
tem, land use plan, or even economic development
strategy, will ultimately contribute more to environmental
protection in coastal North Carolina than any individual
regulation that emanates from this project.

Summary

Twenty years after the passage of CAMA, DCM has
developed a framework for a consistent approach to the
problem of cumulative impacts of development. The
approach and PDRIS database combine existing natural
resource management techniques to locate areas of the
coast at greatest risk of serious impairment from cumu-
lative impacts. The availability of natural resource data
at an acceptable scale (1:100,000) eases the develop-
ment of the database essential to this analysis. The
simultaneous development of a set of comprehensive
small watershed boundaries for the state, along with the
initial planning of this project, provided the final critical
component to DCM's approach.

Perhaps more importantly, both DCM and individual
local governments will have a large volume of informa-
tion on natural units, which will provide an important,
new perspective on the problems and prospects for local
governmental action.

This project will not solve all problems related to cumu-
lative impacts. The PDRIS will provide little support for
site-specific or within-watershed cumulative impacts
analysis; such an analysis at fine scales requires a much
more precise database. By providing a broader-scale
framework for this discussion, however, DCM's regional
cumulative impacts study will hopefully further discussion,
understanding, and management of cumulative and
secondary impacts on natural systems.

References
1. Wuenscher, J. 1994. Managing cumulative impacts in the North
Carolina coastal area. Report of the Strategic Plan for Improving
Coastal Management in North Carolina. North Carolina Division of
Coastal Management.
2. North Carolina General Statutes (NCGS) 113A-120.
3. 40 CFR §1508.7.
272
-------
A GIS Demonstration for Greenbelt Land Use Analysis
Joanna J. Becker
Environmental Planning Services, Santa Rosa, California
Purpose

The goal of this project was to demonstrate what analy-
ses could be undertaken with a GIS program without
substantial GIS training ortime input. The demonstration
attempted to show how planning staff and decision-makers
could easily and usefully employ GIS. It was not intended
as a complete study of all possible variables. Only avail-
able data were used. Diverse techniques were pre-
sented while keeping the content as simple and relevant
as possible. The project was designed as a demonstra-
tion using regional scale data and was combined with
another parcel-specific demonstration that showed
urban GIS applications.

The demonstration showed the following modeling
techniques:

• Buffer zones

• Combination of variables (overlays)

• Weighting of values

• Absolute value variables

• Reclassification of final values

Site Location

The San Luis Obispo watershed comprises an area of
approximately 84 square miles. The watershed drains
into the Pacific Ocean at Avila, California. The major
creek in the watershed, San Luis Obispo Creek, is a
perennial creek, but many of its tributaries have only
seasonal flow. Agriculture and grazing are the major
land uses in the watershed, although a significant num-
ber of areas have been developed. Growth of these
areas is moderate to limited but has a pronounced effect
on the watershed. The watershed also supports a large
amount of riparian and other natural vegetation. Figure 1
demonstrates the distribution of land cover/land use
within the watershed.
Watershed
Study Perimeter
City Greenbelt
Pacific Ocean
Figure 1. A generalized map of the San Luis Obispo area show-
ing the location of the two ARC/INFO GIS study areas:
(1) the parcel analysis in the Dalideo area and (2) the
regional analysis in the city greenbelt stippled area
surrounding the city. All boundaries are approximate
and are for schematic purposes only (1, 2).

Background

The City Council approved the open space element of
the San Luis Obispo General Plan in January 1994 and
identified a greenbelt area that extended from the Urban
Reserve Line approximately to the boundaries of the
San Luis Obispo Creek watershed. The intent of this
greenbelt area is to provide a buffer to the city and to
preserve the agricultural and natural resources of the
area.
273
-------
The data forthe watershed were already available through
the work of the Landscape Architecture Department of
California Polytechnic State University at San Luis
Obispo for a study of San Luis Obispo Creek (1), and
the city's greenbelt area lay approximately within those
boundaries (see Figure 1). Variables were selected that
could be extracted from the available data.

The data for the creek study were initially entered into
workstation ARC/INFO in a polygon format. They were
then transferred to MacGIS, a PC raster program, for
simplicity of use. The final product was then transferred
back to ARC/INFO as a grid format and viewed in a PC
version of ARC/VIEW using DOS files.

In interpreting the overlay of values, the assumption was
made that the occurrence of high values for the most
variables would result in the most suitable land for that
land use. This was presented as a range of values
derived from the total values divided into three approxi-
mately equal groups of high, medium, and low. In addi-
tion to providing a composite analysis, however, any one
of the data sets can be queried separately such that, for
example, slopes greater than 20 percent could be iden-
tified or two layers such as storie index and distance
from roads could be compared.

Criteria

Note that the ratings of high, medium, and low are based
on available data, and the rating of low implies no suit-
able use. In addition, these ratings do not imply that
categories rated low could not be used for a particular
land use but rather that other land uses might be more
appropriate. For example, open space use was rated
low for flatter slopes but only because this category
would likely be more suitable for agriculture.

The demonstration used an existing cell size of the data
on the MacGIS program of 75 meters per side, which is
assigned during the initial conversion process. There-
fore, buffer zones are in aggregates of 75 meters. This
size cell does not allow for minute analysis but reduces
the size of the files, which may become extensive in
raster format.

In presenting the final analysis, land contained within the
urban reserve limit line has been excluded.

Procedure

Initially, eight variables from the available data were
deemed suitable for this analysis:

• Slope

• Storie index (indicating soil fertility)

• Distance from major roads

• Distance from creeks
• Erosion hazard1

• Oak woodlands

• Land use compatibility

• Grasslands2
After selecting the six variables, the categories were
receded to conform to a rating of high, medium, or low.

Each land use was then evaluated separately for every
variable except for combining the variables of slope and
distance from creeks for rangeland analysis. In this case,
composite values were assigned to the two variables, then
receded to produce a high, medium, or low rating.

After obtaining maps for each of the variables according
to land use, the maps were compiled to indicate the
density of overlays for each land use category. In as-
sessing the suitability of land for the three land uses, the
values of all the variables, except land use, were aggre-
gated and a rating system developed. In addition, a
double weight was assigned to the storie index in evalu-
ating agriculture (because this is a primary index for
considering prime agricultural land). If less than 75 me-
ters, the distance from creeks also received added
weight in consideration of open space preservation (be-
cause this is likely to ensure the least erosion and
pollution to the waterways). The weighting then altered
the scores as follows:
Land Use

Agriculture
Rangeland
Open space
Attributes

5
4
5
Number of Values

6
4
6
After the values had been assigned for each ranking,
further receding established three categories of high,
medium, and low for each land use.

The land use buffer was added to this receded aggre-
gate map and resulted in an additional three values due
to the interaction of the buffer with each category. These
additional values were receded according to each land
use to produce a final map with three values.

The Urban Reserve Area was overlaid on the final map
to exclude urban areas.

Assumptions

In determining what properties would be most suitable for
each land use, the following assumptions were made.
1 After reviewing the material, erosion hazard was eliminated because
it was similar to the Soil Conservation Service (SCS) storie index
data while the identification of native grasslands fell only within the
area currently designated as open space land, so it was not included.
2 See above note.
274
-------
Open Space

This land is desirable to preserve as open space be-
cause of the existence of scenic or significant natural
resources. It could also be land that is inappropriate for
other uses due to the presence of such factors as steep
slopes or poor soils. A distinction is made in the final
map between land that is designated open space for
recreational uses, such as parks, and land preserved for
habitat or species protection. Open space adjacent to
an urban area would be rated high if public accessibility
was desirable but low if its purpose was resource pro-
tection or preservation. Separate maps based on two
types of proposed uses present the contrast in analysis.

The analysis of the variables thus was rated as follows:

A. Land with steep slopes and therefore less suited for
other purposes.

B. Land that has oak woodland vegetation resources.

C. Riparian land.

D. Low storie index indicating a lack of suitability for
agriculture or rangeland.

E. At least approximately one-eighth of a mile from a
major road to avoid negative impact on wildlife.

F. Either approximately one-quarter of a mile from ur-
ban areas if designated to protect resources or adja-
cent to urban areas if designated to serve as parks and
recreation.

Agriculture

This land use includes all forms of agricultural activity;
obviously, its suitability for specific crops and practices
would vary. The determination of suitability would need
to be made on a site-specific basis.

For the purpose of general agricultural suitability, the
highest land suitability for agriculture was a rating of the
variables as follows:

A. Land that does not have oak woodland.

B. Land that is not close to perennial creeks (to avoid
fertilizer/pesticide runoff contamination).

C. The flattest slopes.

D. The highest storie index.

E. Proximity to a major road (considered an advantage
for trucking and farm equipment access).

Rangeland

Some types of livestock can graze under most condi-
tions, but for purposes of this analysis, land more suited
for either open space or agricultural designation was
rated above that of rangeland. The major limitation to
suitability of land for rangeland was a combination of
steep slopes and proximity to creeks. A rating of medium
for the other variables was considered the most desir-
able for rangeland purposes.

Details of the Variables

Storie Index

In determining the most suitable uses according to soil
fertility, the SCS storie index rating was used, with a
modification of the categories to three to accommodate
the ratings of high, medium, and low that were used
throughout the analysis. Therefore, the first two SCS
categories of excellent and good were combined into
Category 1. Categories fair and poor were combined
into Category 2, and categories very poor, nonagricul-
tural, urban, and mines were combined to compose
Category 3.

Subsequently, receding was undertaken to prioritize
these categories according to land use:
Agriculture
Rangeland
Open space

Roads
High for Category 1
High for Category 2
High for Category 3
The five principal arterials of the watershed were used
in this analysis:

• Highway 1

• Highway 227

• Los Osos Valley Road

• U.S. 101

• Avila Valley/San Luis

A buffer on each side of the road was created, which
was then receded into the three categories of more than
75 meters, 75 meters to 150 meters, and greater than
150 meters. Each land use was then evaluated accord-
ing to these criteria, with agriculture deemed the most
suitable closest to the roads and open space the least
suitable.

Slope

The slope categories in the San Luis Obispo watershed
data set were divided into a number of classes, which were
receded into the most appropriate grouping of three
classes for the analysis of the three land uses. The existing
categories were not altered for the purpose of the demon-
stration, so they do not necessarily represent the most
ideal slopes for the particular uses. A separate category of
less than 10 percent slopes was provided for agriculture
because most agricultural practices require flat land.
275
-------
Slopes between 10 and 21.5 percent would be limited
to such activities as orchards or vineyards.
Agriculture:
Open space:
< 10 percent
10 to 21.5 percent
> 21.5 percent

< 10 percent
10 to 46 percent
> 46 percent
High (H)
Medium (M)
Low (L)

L
M
H
No slope analysis was undertaken for rangeland be-
cause this category was combined with that for streams
(see Streams section).

Urban Adjacency

Urban adjacency was treated as an overlay of the ag-
gregate map of the other variables because it is an
absolute value. That is, this variable has no ranking.
Land is either within or outside the buffer. The suitability
of each land use adjacent to urban areas was deter-
mined, then the aggregate map was adjusted according
to a comparison of the receded aggregate values with
the designated ranking of land use suitability.

The first step was to recede the existing data on land
uses (interpreted from 1989 aerial photography ob-
tained from the United States Department of Agriculture
[USDA], Agricultural Stabilization and Conservation
Service, Atascadero, California) into urban/commercial
areas and nonurban/noncommercial areas. A buffer
zone of approximately one-quarter of a mile was then
applied around the urban/commercial areas and an
analysis undertaken of suitability for the three land uses
to be located within this buffer.

In making the analysis, the following assumptions were
made:
Land Use
Ranking
Agriculture L

Rangeland M

Open space H

L
Conflict with dust, noise,
pesticides, and urban use
Fire hazard of open
grassland near buildings
Most suitable if used as
parks/recreation
Least suitable if
designated for
habitat/wildlife protection
Therefore, the analysis provided for two planning alter-
natives for open space, with the scenarios presented as
separate maps overlaid on the aggregate map for the
other rangeland variables.

In interpreting this map, the combined values were rated
according to the above criteria, with any values lower
than the desired ranking receiving a low value no matter
what the aggregate value had been. This produced the
following results:

• Agriculture: A rating of low for any land within the
buffer zone.

• Rangeland: A rating of medium for any land within
the buffer zone except that rated low for the aggre-
gate map.

• Open Space: A rating equivalent to the rating of the
aggregate map if the land was to be used for
parks/recreation, or a rating of low for any land within
the buffer zone if the land was to be used for habitat
protection.

Streams

The original data for streams (from 1:24,000-scale
USGS maps), which included both intermittent and per-
ennial creeks, were used for rangeland analysis. A buffer
of 150 meters was then applied to these streams, and
these data were combined with the slope analysis. This
combination was important in evaluating the erosion
hazard and resulting stream pollution caused by nitro-
gen waste and hoof disturbance. The complete stream
complex for the watershed area was therefore evaluated
using the following matrix:
Stream Buffer
< 75 meters
75 to 150 meters
> 300 meters
<21.5
percent

1/L
4/M
7/H
Slope

21.5 to 46
percent

2/M
5/H
8/H
>46
percent

3/L
6/M
9/M
The stream data were then modified for agricultural and
open space land use analysis to indicate only the per-
ennial creeks as defined by the California Department
of Fish and Game. Buffers were created for this as
follows:
Stream Buffer

< 150 meters
150 to 300 meters
> 300 meters

Oak Woodlands
Open Space Agriculture
H
M
L
L
M
H
The receding of oak woodland data for agriculture was
different than for the other two land uses because
the presence of oak woodlands is not conducive to
agriculture:
No oak woodlands
< 10 percent
> 10 percent
H
M
L
276
-------
The suitability of oak woodlands for open space and References
rangeland was ranked as follows: ^ Ha||ock B G L s Bowker WD Bremer and D N Long 19g4

Open Space Rangeland Nutrient objectives and best management practices for San Luis
—* * " Obispo Creek. San Luis Obispo, CA: Coastal Resources Institute,
California Polytechnic State University.
< 33 percent L M
2. Community Development Department of San Luis Obispo. 1994.
33 to 75 percent M H Open space element for the city of San Luis Obispo general plan.
> 75 percent - high H L san Luis Obispo, CA.
277
-------
GIS as a Tool for Predicting Urban Growth Patterns and Risks
From Accidental Release of Industrial Toxins
Samuel V. Noe
University of Cincinnati, Cincinnati, Ohio
Introduction

The catastrophic Bhopal incident demonstrated to the
world what could happen when industry and population
are geographically incompatible. Many believe that the
large urban population "should not have been there." A
recent publication, "New York Under a Cloud," presents
a frightening map of New York State that indicates po-
tential areas of serious population exposure to acciden-
tal releases of chemicals stored by area industries and
municipalities.

Conventional urban planning and administrative prac-
tices at the local level do not adequately provide for the
minimization of these risks. Local jurisdictions on the
fringes of metropolitan areas may be particularly ill-
equipped to respond and plan effectively. Their elected
officials, supported by minimal professional staffs and
unaware of specific potential risks, may be more inter-
ested in soliciting new industrial development along with
the tax base it brings. They therefore create industrial
zones without restricting facilities that may generate
hazardous substances and without recognizing the pos-
sibility that underground aquifers, which are current or
potential sources of drinking water, may underlie these
zones. Jurisdictions often permit facilities that could rou-
tinely or accidentally release toxic substances into the
air without due regard for prevailing wind patterns or
existing or projected urbanized areas that may be
affected.

Although the available data and methodology have
some gaps, much of the knowledge required to provide
adequate protection from these risks exists. We know
how to identify the hazardous substances that these
sites may produce or store and how to calculate the
types and levels of risks associated with them. We can
accurately map the locations of streams, underground
aquifers, and their catchment areas. Although with less
precision, we also can indicate the areas more likely to
receive the outfall of airborne and waterborne pollutants.
At the other end of the equation, we can construct
models for projecting patterns of urban growth. Those
responsible for planning, however, have not made the
connections between these techniques. Hazardous fa-
cilities sites are thus still permitted in areas that place
existing urban residents and their drinking water sup-
plies at risk, and new urban development grows in areas
polluted by existing hazardous substance sites.

Clearly, this situation displays a need forthe coordinated
application of scientific risk assessment techniques and
new approaches to regulating urban development.
Equally critical, however, is the need to give greater
attention to formulating appropriate public policy meas-
ures at the local and state levels for dealing with the
complex disputes that surround these issues.

Project Background

The project this paper describes addressed these
needs. It was undertaken by a team of faculty from the
University of Cincinnati's School of Planning, Depart-
ment of Environmental Health, and the College of Law.
The study team focused on the accidental release of
hazardous materials both into the air and into the
ground-water supply. The team's purpose was to de-
velop an integrated approach to scientific risk assess-
ment, environmental analysis, urban planning, and
policy analysis to address conflicts between:

• Expected patterns of suburban residential growth.

• The need to safeguard existing and new residential
areas, and their water supplies, from toxic chemical
pollution.

• The promotion of industrial development on the pe-
ripheries of urban regions, which often leads to the
proliferation of hazardous substances sites.

• The need for effective regulation of these sites in
complicated multijurisdictional environments.
278
-------
This project examined these issues within a 100-square-
mile area on the northern edge of metropolitan Cincin-
nati. The study area is not yet completely urbanized but
lies in the path of urbanization. It contains a significant
number of industrial or storage facilities that house sup-
plies of hazardous materials. A major aquifer serving as
a public water supply source passes under the area.
Approximately 17 local jurisdictions fall within the study
area: two counties, six townships, and nine municipali-
ties. The area encompasses an intricate mix of agricul-
tural, residential, and commercial land uses. In addition,
several major industrial concentrations, as well as a
number of jurisdictions, are aggressively soliciting new
industrial employment. Because of its proximity to most
major employment sites in southwest Ohio and to a
variety of large retail complexes, the area is experiencing
rapid residential development.

Projecting Areas of Future Development

The study team used PC ARC/INFO geographic infor-
mation systems (CIS) to project the locations of future
residential and industrial growth in the study area, to
show the locations of areas at various degrees of risk
from either airborne or waterborne industrial toxins, and
to reveal the potential areas of population exposure
resulting from the overlap of these areas.

Although this paper does not describe the models used
to project residential and industrial growth in the study
area, it does include the criteria used to make projections.
The criteria we used to project residential growth were:

• Travel times to major employment concentrations in
the region.

• Proximity to interstate highways, interchanges, and
main trunk sewers.

• Avoidance of areas composed of steep slopes and
flood plains.

• Land currently zoned for agriculture or housing.

The criteria for projecting industrial areas were:

• Relatively flat, not in a flood plain, and zoned industrial.

• Proximity to existing industrial development, main
trunk sewers, and interstate highways.

• Relative aggressiveness of local jurisdictions in at-
tracting industrial development.

Identifying Areas at Risk From
Airborne Releases

The study team determined the model project areas at
risk from airborne releases by using information avail-
able from the Ohio Environmental Protection Agency.
This information, which included the location, identity,
and quantity of hazardous materials recently stored in
the study area, was available as a result of reporting
requirements mandated by several federal statutes. The
study team assumed that a similar ratio reflecting sites
containing hazardous materials to the area of industri-
ally developed land would continue into the future.
Based on this assumption, the study team could ran-
domly project new potential release sources.

The algorithm and associated software used for calcu-
lating the plume size of aerial dispersion of hazardous
chemicals was Aerial Locations of Hazardous Atmos-
pheres (ALOHA). The National Oceanic and Atmos-
pheric Administration (NOAA) developed this system,
which is in wide use by government and industry for the
preparation of emergency contingency plans. This soft-
ware is available for use on any Macintosh computer, or
any IBM-compatible with an Intel 80286 (or better) CPU.
This software employs three classes of variables in
calculating the plume dispersion for a specific chemical:

• Chemical variables

• Meteorological options

• Source strength options

Chemical variables include both physical properties of
the chemical and parameters that define the human
health effects of the chemical. In the latter case, the two
variables are the threshold limit value (TLV) and the
immediately dangerous to life and health (IDLH) value.
The TLV is a measure of chronic toxicity of the chemical
in humans. It represents the maximum concentration of
the chemical in air to which a human can safely be
exposed for 8 hours per day on a daily basis. The IDLH
is a measure of acute toxicity of the chemical in humans.
The IDLH represents the maximum concentration to
which a human can be exposed for a short time and not
experience death or some other severe endpoint.

Meteorological options describe the ambient atmos-
pheric conditions into which the chemical disperses.
ALOHA has the capability of downloading real-time data
from NOAA satellites. This case, however, employed
average meteorological conditions for the study area
over the course of a year. The variables this study used
were atmospheric inversion height (or no inversion),
wind velocity, air temperature, ground roughness (rural
or urban), and stability class (a combined variable de-
scribing cloud cover and incoming solar radiation).

The source options quantify the amount of chemical
being released and describe how the chemical is re-
leased (instantaneous or continuous).

The ALOHA model provides a procedure forshowing the
IDLH and TLV risk zones from a single accidental re-
lease of a single chemical, given specified conditions of
atmospheric stability, wind direction, and air tempera-
ture. Obviously, climatic conditions change daily, so
areas surrounding a single industrial site experience
279
-------
different degrees of risk depending on the variability of
these conditions. Moreover, a single site that can poten-
tially release more than one chemical poses a higher
risk to surrounding areas. Finally, when release plumes
from two or more sites that are located relatively close
to each other overlap, risks also increase. To account
for these factors of climatic variation and overlapping
chemical release plumes, the study team constructed
the model described below.

In discussing these procedures, bear in mind that the
risk factors are relative. Sufficient data are not available
to estimate the absolute probability of an accidental
release of industrial toxins into the air. Consequently, no
absolute risk levels can be estimated.

1. We acquired NOAA climatic statistics for a full year
for the weather station nearest the study. NOAA
tabulates eight wind directions (N, NE, E, SE, S, SW,
W, and NW). We sorted the average daily wind
speeds and average daily temperatures according to
the daily prevailing wind direction. Thus, for each of
the eight wind directions, it was possible to determine
the number of days in the year that the prevailing
wind comes from that direction, as well as the
average daily wind speed and temperature.

2. We then input the temperature and wind speed data,
derived as explained above, into the ALOHA model
to prepare plots of the IDLH and TLV zones, or plumes,
for each of the eight wind directions. Individually, the
plumes emanated downwind from the source of the
release. In our study, IDLH plumes varied from 0.17
miles to greater than 10 miles. TLV plumes varied
from 0.52 miles to greater than 10 miles.

When the plumes for the eight different wind directions
were combined, the results were translated into a
pattern of wedges representing various plume lengths
in each of the eight different wind directions. These
risk levels vary according to the number of days per
year the wind blows in each direction from the source
of the release. Plumes blowing in different directions
vary in length according to average temperature and
wind velocities for the days the wind blew in each
direction. The numbers in the wedges represent risk
factors assigned as indicated in Table 1.

Table 1. Assigned Risk Factors
Frequency of Wind
Risk Factor
0-25 days/year
26-50 days/year
51 -75 days/year
76-100 days/year
1
2
3
4
3. When a single industrial location employed more
than one chemical, the IDLH and TLV risk patterns
for each were overlaid on one other. Where the
overlap occurred, we added the risk factors together.

4. Finally, when the risk patterns from two or more sites
overlapped, we added together the risk factors
assigned to overlapping areas. The overlap capabilities
of CIS allowed us to easily draw and combine the risk
patterns, superimposed on a map of the industrial
sites under consideration.

When we compared the eight existing combined IDLH
and TLV risk patterns with the risk patterns that appear
when six projected new sources of releases and pro-
jected areas of residential growth are added, the
changes are rather dramatic. Of course, we must note
that the projected sources of potential releases are hy-
pothetical and their locations selected at random. In
reality, precise locations of areas at greater risk cannot
be predicted with any degree of certainty. We can rea-
sonably deduce from the maps, however, that any sub-
stantial increase in industries storing or using toxic
chemicals that might be accidentally released into the
air can compound risk levels much more than might be
expected. The maps we produced indicated five risk
levels.

The study results also showed that an industry capable
of generating accidental releases of airborne toxins will
very likely place at risk residents not only of the same
community but those in neighboring jurisdictions as well.
This is particularly true on the fringes of metropolitan
areas where highly fragmented political boundaries ex-
ist. This fact complicates tremendously the ability of
each jurisdiction to protect its citizens and suggests a
need for a more comprehensive approach to regulation
than conventional land use zoning measures that each
locality administers.

Identifying Areas at Risk From Accidental
Releases Into Ground-Water Supplies

As the introduction of this paper indicates, local govern-
ment usually manages conventional land use planning,
while air quality is largely a state or federal responsibility.
Thus, decisions regarding the regulation of industrial
location may not account for the types of industrial
operations proposed, the possible use of hazardous
materials, and possible risks to local residents from
accidental release of toxins into the air. The same prob-
lem applies to the location of industries that have the
potential for accidental release of hazardous materials
into ground-water supplies. In our study, we outlined a
technique for predicting where local residents may be
placed at risk by drinking water from sources vulnerable
to contamination by industrial toxins.
280
-------
As stated earlier, the study area includes a major aqui-
fer. We noted industrial and related facilities that use and
store hazardous materials, and that are located over or
immediately adjacent to the area aquifers. We distin-
guished between sites where a previous spill had been
reported, those where a waste well is located, and all
other locations where a hazardous material is used or
stored. The results clearly indicated a significant poten-
tial for contamination.

To predict the number and locations of additional indus-
trial toxin sources that might appear over a 10-year
period following 1990 (the base year of the study), we
employed a procedure similar to that used in the air
pollution section of the study. We assumed that during
this period, the number of such sites would increase by
35 percent. The 35-percent increase represented an
arbitrarily selected figure approximately midway be-
tween estimates of 25- and 50-percent industry growth
in the study area. We used this increment to project
additional waste well sites and sites that would experi-
ence a spill sometime during the decade, as well as the
total number of new sites.

We projected the locations of the additional sites by
overlaying a 5,000-foot by 5,000-foot grid on the study
area and assigning the new sites to grid cells using a
random number generator. We considered only cells
lying completely or largely over the aquifers and also
falling within the area of projected industrial land use.
Each cell was assigned a relative contamination risk
factor based on the number of projected sites it con-
tained, with a multiplier of 3 applied to sites with a waste
well and a multiplier of 4 applied to sites assumed to
have had spills. We also assigned existing sites to grid
cells and scored them in the same manner.

To obtain more information, we used a simplified version
of DRASTIC maps prepared by the Ohio Department of
Natural Resources. DRASTIC is an acronym for Depth
to water, net Recharge, Aquifer media, Soil media, To-
pography, Impact of the vadose zone, and hydraulic
Conductivity of the aquifer. These factors contribute to
an index of the relative vulnerability of the aquifer to
pollution. Different shades on the map represented the
relative vulnerability of sections of the aquifers to
ground-water pollution. We assigned two points to aqui-
fers with a DRASTIC pollution potential index less than
180 and four points to aquifers with a higher index. This
allowed us to use the CIS to combine the DRASTIC
vulnerability map with the maps that showed risks of
pollution from the hazardous materials sites. The resul-
tant maps showed the existing and projected potential
risk of pollution in different areas of the aquifers.

The next step in the study related the above information
to public water companies that extract water from differ-
ent parts of the aquifers. Thus, we were able to associ-
ate risk levels with well locations as well as with the
areas served by water companies whose supplies may
be at risk. In this case, the projected risks of polluted
water supplies were identical to existing risk levels be-
cause no public water wells happened to be located in
areas where additional potential sources of pollution
were projected.

Of course, wide areas in the study area had no public
water supply. Residents in these areas may be at risk
depending on where they dig private wells. The maps
we generated showing areas at risk nonetheless provide
useful guides to potentially hazardous areas.

As in the construction of all such models, we needed to
make a number of assumptions, simplifications, and
value judgments. In this case, these included the pro-
jected number of new toxic sites and point scores as-
signed to hazardous materials sites and the various
areas in the DRASTIC maps. Also, for the sake of
simplicity, we projected no new well sites in preparing
one of the maps. This additional element should prob-
ably be included, assuming local water companies could
provide projected well locations. Use of a CIS, however,
makes it possible to explore the implications of adjust-
ment of any of these factors.

We must note two more significant omissions from the
model that the CIS cannot factor in. One was our inabil-
ity to identify from the data specific chemicals that each
sight might release and the relative effects of each. We
would have required more complicated techniques for
dealing with these variables and for projecting the travel
and dilution of plumes of contaminants in an aquifer. In
the interest of providing a simple, if relatively crude,
model capable of replication by a local planning agency
with a simple CIS, we elected not to propose use of
more sophisticated techniques.

Another obvious omission was the consideration of
water pumping and treatment measures that might miti-
gate risks of water contaminated by accidental release
of industrial toxins. Perhaps, with knowledge of the spe-
cific contaminants found in the water at any given time,
such mitigation might be possible. We elected not to
consider this factor for reasons of simplification but also
because this study aimed to provide planning agencies
the means to identify potential risks to local residents
and to prevent or minimize them through better land use
planning and regulatory measures.

Some Final Notes

As stated earlier, the purpose of the study was to pro-
pose a technique that planning agencies could use to
identify:

• The locations of existing and projected patterns of
residential and industrial development in a multijuris-
dictional suburban area.
281
-------
• The locations of existing and projected industrial,
storage, and disposal sites of hazardous materials.

• Residential areas that might be placed at risk by the
accidental release of these hazardous materials into
the air or into ground-water supplies.

• The relative levels of risks resulting from potential
exposure to more than one hazardous material at a
single site, or from multiple sites in the vicinity.

The technique we used in the study permitted projection
of relative risk levels. Projecting absolute risk levels is
impossible without data on the actual incidence of acci-
dental releases of toxins overtime in this or in similar
areas. A related question is whether it is possible to
meaningfully indicate the combined risk from airborne
releases of industrial toxins and drinking water contami-
nation to a particular residential area. The issue of
weighting relative risk factors is central here. Could we,
for example, weight the risk levels of exposure to air-
borne releases three—or possibly four—times higher
than water contamination risks? Or can we say that the
risks may be the same, but the danger from airborne
releases is three or four times greater?

Obviously, these would be futile exercises, especially
because the point scores in the separate mapping stud-
ies were arbitrarily assigned. The public should know,
however, which present or projected residential areas
carry some level of risk from both types of exposure.
Thus, after calculating point scores to derive relative risk
levels from waterborne pollutants, we multiplied the
scores by 4 to bring their maximum ranges into the same
order of magnitude. Otherwise, the effect of waterborne
pollutants would not be apparent. Consequently, we
created two maps to show the combined existing and
projected risks; therefore, we highlighted the combined
risks that residents face from both airborne and water-
borne hazardous materials.

The maps of airborne releases used in these combina-
tions showed TLV. Continuing exposure within the TLV
areas over an extended period can also have adverse
health effects. This study focused only on accidental
releases, however, and a single release is unlikely to
sustain continuous exposure. Of course, residential ar-
eas at risk from several sites might approach conditions
of sustained exposure. This situation is more analogous
to prolonged exposure to contaminated drinking water
supplies. We did not combine the maps of IDLH airborne
releases with the maps of areas at risk from ground-
water contamination because they are not analogous
conditions. Nonetheless, the IDLH risk maps in them-
selves reveal the conditions that local planning officials
should most seriously consider.

Replicating the procedures outlined in this study should
be technically and financially feasible for local planning
agencies. Armed with the results of such an investiga-
tion, their next step should be to establish the planning
and regulatory measures that would minimize both ex-
isting and projected levels of risk to area residents. The
attorney on our team has outlined a range of possible
measures, but detailing them would be the subject of
another paper.
282
-------
Integration of GIS and Hydrologic Models for Nutrient Management Planning
Clyde W. Fraisse, Kenneth L. Campbell, James W. Jones,
William G. Boggess, and Babak Negahban
Agricultural Engineering Department, University of Florida, Gainesville, Florida
Introduction

Recent evidence that agriculture in general, and animal
waste in particular, may be an important factor in surface
and ground-water quality degradation has increased the
interest in nutrient management research. The presence
of nitrogen and phosphorus in surface water bodies and
ground water is a significant water quality problem in
many parts of the world. Some forms of nitrogen and
phosphorus, such as nitrate N and soluble P, are readily
available to plants. If these forms are released into
surface waters, eutrophic conditions that severely impair
water quality may result. Advanced eutrophication (pH
variations, oxygen fluctuations or lack of it in lower
zones, organic substance accumulation) can cause
physical and chemical changes that may interfere with
recreational use and aesthetic appreciation of water. In
addition, possible taste and odor problems caused by
algae can make water less suitable or desirable for
water supply and human consumption (1).

Increases in nutrient loadings to water resources have
recently been observed in the southeastern United
States, where well-drained sandy soils with low nutrient
retention capacity and high water table conditions are
found in most coastal areas. Those increases were
associated statistically with nutrient sources such as
agricultural fertilizers and dense animal populations (2,
3). Repetitive occurrences of extensive blooms of blue-
green algae that threatened the overall health of Lake
Okeechobee, located in southern Florida, were attrib-
uted to an increase in nutrient loadings to the lake. The
South Florida Water Management District (SFWMD) re-
ported an increase of phosphorus concentrations in the
lake water from an annual average of 0.049 milligrams
per liter in 1973 and 1974 to a peak of 0.122 milligrams
per liter in 1988 (4).

Most water quality problems concerning phosphorus
result from transport with sediment in surface runoff into
receiving waters. Continuous high loadings from animal
waste on sandy soils with low retention capacity, however,
may contribute significant quantities of labile phospho-
rus to subsurface drainage. Ground-water aquifers may
also become polluted due to recharge of high loadings
of nitrogen. Drinking water with nitrate N concentrations
higherthan 10 milligrams per liter may lead to methemo-
globinemia in infants. Ground-water monitoring of the
Middle Suwannee River area in Florida has shown high
concentrations of nitrate nitrogen near intensive agricul-
tural operations. The U.S. Geological Survey has inten-
sively monitored dairy and poultry farms and has found
high nitrate levels below these operations compared with
nearby control wells (5).

Animal waste management has always been a part of
farming, but historically has been relatively easy due to
the buffering capacity of the land. In fact, land applica-
tion of animal waste at acceptable rates can provide
crops with an adequate level of nutrients, help reduce
soil erosion, and improve water holding capacity. As the
animal industry attempts to meet the food requirements
of a growing population, however, it applies new tech-
nologies that reduce the number of producers, but cre-
ate larger, more concentrated operations. That, in
addition to the decreasing amount of land available for
waste application, has increased the potential for water
quality degradation.

Successful planning of an animal waste management
system requires the ability to simulate the impact of
waste production, storage, treatment, and use on water
resources. It must address the overall nutrient manage-
ment for the operation, including other nutrient sources
such as supplemental fertilizer applications. Livestock
operations are highly variable in their physical facilities,
management systems, and the soil, drainage, and cli-
matic conditions that affect the risk of water pollution
from animal wastes (6). Linkage between geographic
information systems (GIS) and hydrologic models offers
an excellent way to represent spatial features of the-
fields being simulated and to improve results. In addition,
a GIS containing a relational database is an excellent way
283
-------
to store, retrieve, and format the spatial and tabular data
required to run a simulation model.

This paper examines some of the issues related to the
integration of hydrologic/water quality models and CIS
programs. In addition, the paper discusses the ap-
proaches used in the Lake Okeechobee Agricultural
Decision Support System (LOADSS), which was re-
cently developed to evaluate the effectiveness of differ-
ent phosphorus control practices (PCPs) in the Lake
Okeechobee basin. The paper also details a dairy
model, designed to simulate and evaluate the impacts
of alternative waste management policies for dairy op-
erations, that is currently under development.

Hydrologic Models and GIS

By using models, we can better understand or explain
natural phenomena and, under some conditions, make
predictions in a deterministic or probabilistic sense (7).

A hydrologic model is a mathematical representation of
the transport of water and its constituents on some part
of the land surface or subsurface environment. Hydro-
logic models can be used as planning tools for determin-
ing management practices that minimize nutrient
loadings from an agricultural activity to water resources.
The results obtained depend on an accurate repre-
sentation of the environment through which water flows
and of the spatial distribution of rainfall characteristics.
These models have successfully dealt with time, but
they are often spatially aggregated or lumped-parame-
ter models.

Recently, hydrologists have turned their attention to GIS
for assistance in studying the movement of water and
its constituents in the hydrologic cycle. GIS programs
are computer-based tools to capture, manipulate, proc-
ess, and display spatial or georeferenced data. They
contain geometry data (coordinates and topological in-
formation) and attribute data (i.e., information describing
the properties of geometrical objects such as points,
lines, and areas) (8). A GIS can represent the spatial
variation of a given field property by using a cell grid
structure in which the area is partitioned into regular grid
cells (raster GIS) or by using a set of points, lines, and
polygons (vector GIS).

A close connection obviously exists between GIS and
hydrologic models, and integrating them produces tre-
mendous benefits. Parameter determination is currently
one of the most active hydrology-related areas in GIS.
Parameters such as land surface slope, channel length,
land use, and soil properties of a watershed are being
extracted from both raster and vector GIS programs,
with a focus on raster-based systems. The spatial nature
of GIS also provides an ideal structure for modeling. A
GIS can be a substantial time saver that allows differ-
ent modeling approaches to be tried, sparing manual
encoding of parameters. Further, it can provide a tool for
examining the spatial information from various user-
defined perspectives (9). It enables the user to selec-
tively analyze the data pertinent to the situation and try
alternative approaches to analysis. GIS has been par-
ticularly successful in addressing environmental prob-
lems.

Approaches for Integrating GIS and Models

A significant amount of work has been done to integrate
raster and vector GIS with hydrologic/water quality mod-
els. Several strategies and approaches for the integra-
tion have been tried. Initial work tended to use simpler
models such as DRASTIC (10) and the Agricultural
Pollution Potential Index (11). In these cases, the mod-
els were implemented within the GIS themselves. These
studies attempted to develop CIS-based screening
methods to rank nonpoint pollution potential. The use of
more complex models requires that the GIS be used to
retrieve, and possibly format, the model data. The model
itself is implemented separately and communicates with
GIS via data files. Goodchild (12) refers to this mode as
"loose coupling," implying that the GIS and modeling
software are coupled sufficiently to allow the transfer of
data and perhaps also of results, in the reverse direc-
tion. Fedra (8) refers to this level of integration as "shal-
low coupling" (see Figure 1). Only the file formats and
the corresponding input and output routines, usually of
the model, must be adapted. Liao and Tim (13) describe
an application of this type, in which an interface was
developed to generate topographic data automatically
and simplify the data input process for the Agricultural
Nonpoint Source (AGNPS) Pollution Model (14), a water
quality model.
Shared Databases and Files
j

GIS

User Interface

MODEL

User Interface
Figure 1. Loose or shallow coupling through common files (8).
284
-------
Higher forms of connection use a common interface and
transparent file or information sharing and transfer be-
tween the respective components (see Figure 2). The
dairy model, currently under development, is an appli-
cation of this kind. It will link the Ground-Water Loading
Effects of Agricultural Management Systems (GLEAMS)
(15) model and CIS to evaluate potential leaching and
runoff of both nitrogen and phosphorus.

LOADSS is an extension of this type of application
because it includes an optimization module that enables
the system to select the best PCPs at the regional scale,
based on goals and constraints defined by the user.

Both applications use ARC/INFO's arc macro language
(AML), a high-level application language built into the
CIS. A subset of functions of a full-featured CIS, such
as creation of maps (including model output) and tabular
reports, as well as model-related analysis, are embed-
ded in the applications, giving the system great flexibility
and performance. Fedra (8) describes a deeper level of
integration that would merge the two previous ap-
proaches, such that the model becomes one of the
analytical functions of a CIS, or the CIS becomes yet
another option to generate and manipulate parameters,
input and state variables, and model output, and to
provide additional display options. In this case, software
components would share memory rather than files.

The choice between integrating a water quality model
with a raster or vector CIS depends on the importance
of spatial interactions in the process being studied and
the nature of the model itself. Some water quality mod-
els, such as GLEAMS, are field-scale models that pro-
vide edge-of-the-field values for surface runoff and
erosion as well as deep percolation of water and its
constituents. In this case, spatial interactions between
adjacent fields are ignored and a vector CIS can be
used to describe the system. Moreover, important fac-
tors in the simulation process, such as land use and
management practices, are normally field attributes and
thus, are better represented in a vector structure.

Other factors playing an important role in the hydrologic
process, such as field slope, aspect, and specific catch-
ment area, are hard to estimate in vector systems,
however. A raster-based CIS is better suited for handling
watershed models in which the routing process is impor-
tant and spatial interactions are considered. For those,
several algorithms for estimating important terrain attrib-
utes are often incorporated in commercially available
raster-based CIS programs.

LOADSS

LOADSS was developed to help address problems cre-
ated by phosphorus runoff into Lake Okeechobee. It was
designed to allow regional planners to alter land uses
and management practices in the Lake Okeechobee
Shared Databases and Files
GIS
MODEL
Common User Interface
Figure 2. Deep coupling in a common framework (18).

basin, then view the environmental and economic ef-
fects resulting from the changes. The Lake Okeechobee
basin coverage incorporates information about land
uses, soil associations, weather regions, management
practices, hydrologic features, and political boundaries
for approximately 1.5 million acres of land and consists
of close to 8,000 polygons.

The SFWMD, responsible for managing Lake Okeechobee,
has initiated numerous projects to develop effective con-
trol practices for reducing the level of phosphorus in
agricultural runoff as part of the Lake Okeechobee Sur-
face Water Improvement and Management (SWIM)
Plan. These projects, numbering more than 30, were
designed to develop information on the control and man-
agement of phosphorus within the lake basin and to
determine the costs and effectiveness of selected man-
agement options. Three types of control options are
being studied:

• Nonpoint source controls, such as pasture management.

• Point source controls, such as sewage treatment.

• Basin-scale controls, such as aquifer storage and
retrieval.

After completing most of these research efforts, the
need arose for a comprehensive management tool that
could integrate the results for all three classes of PCPs.
In response to these needs, design and implementation
of a decision support system was initiated with the fol-
lowing objectives (16):

• Organize spatial and nonspatial knowledge about
soils, weather, land use, hydrography of the lake ba-
sin, and PCPs under a GIS environment.
285
-------
• Develop and implement algorithms for modeling non-
point source, point source, and basin-scale PCPs.

• Develop and implement mechanisms for evaluating
the performance of the entire Lake Okeechobee basin
under different combinations of PCPs applied to the
basin.

• Design and develop a user interface that would fa-
cilitate use of the system by noncomputer experts.

The goal in developing LOADSS was to create an infor-
mation system that would integrate available information
to help regional planners make decisions. LOADSS can
generate reports and maps concerning regional land
attributes, call external hydrologic simulation models,
and display actual water quality and quantity sampling
station data. LOADSS is a collection of different
components:

• The regional scale CIS-based model used to develop
and manipulate regional plans for reducing phospho-
rus loading to Lake Okeechobee.

• The Interactive Dairy Model (IDM) used to develop
field-level management plans for dairies and run the
Field Hydrologic and Nutrient Transport Model
(FHANTM) simulation model for nutrient transport
modeling.

• An optimization module that enables the system to
select the best PCPs at the regional scale (currently
under development).

Although these components can run independently, they
are fully integrated in the LOADSS package and can
exchange information where necessary. A design sche-
matic of LOADSS is given in Figure 3.

Regional-Scale CIS-Based Model

LOADSS serves both as a decision support system for
regional planning and as a graphic user interface for
controlling the different components. One consideration
in the design of LOADSS was the size of the database
that was being manipulated. Because the land use da-
tabase consisted of nearly 8,000 polygons, running the
simulation models interactively would not be a feasible
option. Thus, the CREAMS-WT (17) runoff model was
prerun for different levels of inputs and management for
each land use, soil association, and weather region (18).

Depending on the land use and its relative importance
as a contributor of phosphorus to the lake, anywhere
from one (background levels of inputs to land uses like
barren land) to 25 (dairies, beef pastures) levels of
inputs were selected. Each set of inputs to a particular
land use was given a separate PCP identification code.
A CREAMS-WT simulation was performed for each
PCP, on each soil association and weather region. This
resulted in approximately 2,600 simulation runs. Annual
average results were computed for use in LOADSS.
CREAMS-WT provides an average annual estimate of
phosphorus runoff from each polygon. Phosphorus
assimilation along flow paths to Lake Okeechobee are
estimated as an exponential decay function of distance
traveled through canals and wetlands (4).

The imports, exports, and economics of each PCP are
based on a per production unit basis. Depending on the
type of polygon, the production unit can be acres (e.g.,
pastures, forests), number of cows (dairies), or millions
of gallons of effluent (waste treatment plants and sugar
mills). Developing a regional plan in LOADSS involves
assigning a PCP identification code to each one of the
polygons in the Lake Okeechobee basin. Accessing the
results of a regional plan involves multiplying the pro-
duction unit of each polygon by its appropriate database
import, export, or economic attribute and summing the
resulting values overall polygons in the Lake Okeechobee
basin. LOADSS runs in the ARC/INFO Version 6.1.1 CIS
software on SUN SPARC stations.

Interactive Dairy Model

Although the LOADSS level of detail is adequate for
regional planning, a more detailed model was necessary
to analyze individual dairies in the Lake Okeechobee
basin, as dairies were one of the large, concentrated
sources of phosphorus runoff into the lake. Thus, the
IDM was developed and incorporated into LOADSS.
IDM utilizes FHANTM to simulate phosphorus move-
ment in dairy fields. FHANTM is a modification of
DRAINMOD (19) with added functions to handle over-
land flow routing, dynamic seepage boundary, and sol-
uble phosphorus algorithms for P input, mass balance,
and transport (20).

Unlike in LOADSS, FHANTM is run interactively, as IDM
requires. Furthermore, in LOADSS, the user can only
select from a number of predefined PCPs, while in IDM,
the user has access to more than 100 input and man-
agement variables, all of which can take a range of
values. This allows for the development and evaluation
of detailed dairy management plans that otherwise
would be impossible at a regional scale. While LOADSS
only provides average annual results, IDM displays daily
time series simulation results. IDM uses the same as-
similation algorithm and can produce the same phos-
phorus budget maps and reports as LOADSS.

Optimization Module

A variety of factors must be considered in planning
nutrient management programs. Production and envi-
ronmental goals need to be balanced, and these goals
are often incompatible. Performing this exercise on a
regional scale, comprising many fields for which a variety
of land uses and management options can be assigned,
is a tremendously time-consuming, if not impossible,
286
-------
CIS Databases
LOADSS User Interface
CIS-BASED

USER

INTERFACE
SCREEN.
PRINTER OR
PLOTTER
MAPS
AND
REPORTS
PHOSPHORUS
& MATERIAL 8
BUDGETS
PHOSPHORUS,
MATERIAL &
ECONOMIC BUDGETS
Input Attribute
Databases
Spatial
Databases
DAIRY MANAGEMENT
PRACTICES
NONPOINT SOURCE
POP'S CREAMS-WT
INDIVIDUAL FIELD
MANAGEMENT
POINT SOURCE PCP'S
PROCESS ANALYSIS
FHANTM

SIMULATION MODEL
BASIN TREATMENTS
PROCESS ANALYSIS
Optimization Module
LOADSS
Models and
Analysis Tools
I DM and
Analysis Tools
Figure 3. LOADSS design schematic (16).
task. The optimization component of LOADSS, currently
under development, will determine the best combination
of agricultural, environmental, and regulatory practices
that protects and maintains the health of Lake
Okeechobee and also maintains the economic viability
of the region. The optimization process will provide an-
other method for modifying the PCPs assigned to indi-
vidual fields. Different optimization solution methods,
such as linear programming and integer linear program-
ming, will be available for solving the optimization prob-
lem that the user defines.

Dairy Simulation Model

The dairy model was expected to be fully functional by
the end of 1994. It is designed to be an additional tool
for answering questions related to the environmental
costs and impacts of dairy operations. A design sche-
matic of the dairy model is given in Figure 4. It differs
from the LOADSS/IDM model in the following aspects:

• It is designed to be generic so that any dairy repre-
sented by a coverage for which relevant data, such
as topography, soil characteristics, weather, and field
boundaries, are available can be simulated.

• The GLEAMS water quality model will be used for
simulating nutrient transport of nitrogen and phosphorus.

• The user will be able to assign a larger variety of
crops and crop management practices to the individual
fields, including crop rotation.

GLEAMS (15) is a field-scale water quality model that
includes hydrology, erosion/sediment yield, pesticide,
and nutrient transport submodels. GLEAMS was devel-
oped to use the management oriented CREAMS (21)
model and incorporate more descriptive pesticide
subroutines and more extensive treatment of the flow of
water and chemicals in the root zone layers. The water
is routed through computational soil layers to simulate
the percolation through the root zone, but the volume of
percolation in each layer is saved for later routing in the
pesticide component. A minimum of three and a maxi-
mum of 12 layers with variable thickness may be used.
Soil parameter values are provided by soil horizon, and
287
-------
GIS Databases
TOPOGRAPHY
DAIRY MODEL User Interface
Spatial
Databases
Input Attribute
Databases

, - ! . "•
DAIRY PLAN
»
NITROGEN * PHOSPHORUS
BALANCE
i
T

DAIRY MANAGEMENT
PRACTICES

L INDIVIDUAL FIELD
MANAGEMENT

CROP
MANAGEMENT
i

GLEAMS

SIMULATION MODEL
WEATHER
FILES
SOIL AND
CULTURAL FILES
Dairy Model
Analysis Tools
GLEAMS Simulation
Shell Controller
Figure 4. Dairy model design schematic.
the crop root zone may have up to five horizons. The
values for parameters, such as porosity, water retention
properties, and organic content, are automatically fitted
into the proper computational layers. Two options are
provided in the model to estimate potential evapotran-
spiration, the Priestly-Taylor method (22) and the Pen-
man-Monteith method (23). The nutrient component of
the model simulates land application of animal wastes
by creating appropriate nitrogen and phosphorus pools
for mineralization. It considers ammonia volatilization
from surface-applied animal waste by using a relation-
ship developed by Reddy et al. (24).

The graphic interface is designed to help the user plan
a balanced nutrient management program for the dairy
being simulated. First, total nutrient production and ac-
counting are estimated, based on information related to
the dairy management such as herd size, confinement
system, waste characterization, and handling. Figure 5
shows the general structure of the graphic interface and
a first version of the menu used to estimate the total
amount of nitrogen and phosphorus available for assign-
ment to the various fields. Nutrient losses during waste
storage and treatment vary widely depending on the
method of collection, storage, and treatment. Climate
can also have a great effect on the losses. Covering all
possible methods of storage and treatment is practically
impossible, especially in an application that is designed
to be generic and applied in any part of the country. A
simplification was made to overcome this problem: the
user must provide the percentage of original nitrogen
and phosphorus that is retained after waste storage and
treatment. The menus designed to enter information
related to the management of fields and crops are given
in Figure 6.

For each field, a sequence of crops can be defined in
the Field Management Table, and for each crop, the
sequence of practices or field operations is defined in
the Crop Management Table. Every time a waste appli-
cation operation is defined or a field is used as pasture
for a certain period, the corresponding amount of nutri-
288
-------
f^ii? AGRICULTURAL ENGINEERING DEPARTMENT- UNIVERSITY OF FLORIDA j
(Select a Dairy ^X0"0™1! Dairy tlanaeenent wJOlan Hanafer V^( Practice Henus »Xsi"ulation Results X Report Hanaeer ?Xprint "w 7X Utilities "XOuitJ 1
''»> FIELDS
(SELECT FIELD(S) J
(REHOVE FIELIKS)
(UNDO LflST SELECTION J
(i/IEM FIELD INFO )

ri& SIMULATION
(RUN SELECTED FIELDS )
(RUN (ILL FIELDS )

'' 3 MAP UTILITIES
HUP 1TFE:
I
MPP UIDO.S:

D ItOOIFY LEGEND
larSW
ECOMIH EODMOiml fccFRESH

' S' NUTRIENT'S' PRODUCTION TABLE
HERD SIZE « COIIFINEHEHT SYSTEM OfHRY : dairs.OS
Nunlier of RuHrage Munber of
cons : 100 ueitht 100
HH57E r.HimnnFRT?nTTnH a NUTRIENTS pRnnucnnH
Hitrneeii s/llll/.l.i.j) ; 0.4K l>l,o=|.l,.,r,,= (1 l,-j/HII/.l,..j) : 0.07
Hilrofien Phosphorus
Lbs/nil/ijcar lbs/!icor Lbs/flU/ucar Lbs/ucar
Confinen«nt 01 10071 13 1533
Ursine HI 10071 13 I!i33
TUMIL 1GO 2014B 2G 30CG
limiOLINO OF MIBTE PRODUCED UNDER COHFINEICHT
Percentage of oripinal nutrients retained after STDRHGE/TRERTMENT

NUTRiEiirs nr.r.nimTTiir, (pERCEHinnE)
nuailaltlc fnr flssignncnt TreatiMl/rtarkcted
Crajint; Land HpplicaLlon
HITROCEH 50 JO 22
PIIOCPIIORU!; 50 ' 33 17

(SBVE ma EXIT} (CIIHCEL)

' -a MESSAGES
5 : Drawing dairy nap |fZ|
i. IE : 1 la
Figure 5. The nutrient production table of the dairy model interface.
ents will decrease from the amount available for assign-
ment and the total available for future assignment will
be updated. Once the total amount of nutrients is as-
signed, the model can be run for the several fields in the
dairy and the results evaluated in terms of nutrient load-
ings to the edge of fields and ground water. Alternative
plans can be designed and saved for comparison and
selection of best management options. The best solu-
tions in terms of reducing nutrient loadings to surface
and ground water must also consider economic aspects.
A producer's decision about competing waste manage-
ment practices is ultimately economically motivated.
Thus, the system will eventually include a tool for eco-
nomic analysis of alternative management options.

Summary and Conclusions

The search for solutions to the many problems concern-
ing nutrient management that affect water resources
implies a continued demand for the development of
modeling systems that can be used to analyze, in a
holistic approach, the impact of alternative management
policies.

The development of LOADSS exemplifies how the inte-
gration of hydrologic models and a CIS can be used for
analyzing nutrient control practices at different scales.
The addition of optimization algorithms further enhances
the ability of policy- and decision-makers to analyze the
impact of alternative management practices and land
uses at the regional level.

The first part of LOADSS (Version 2.2) that includes the
CREAMS-WT regional-scale model and the IDM com-
ponents is fully functional and currently available at the
SFWMD. Preliminary results show that LOADSS be-
haves consistently with measured data at the lake basin
scale. Some of this, however, is due to offsetting errors
in model behavior at the subbasin scale, particularly in
subbasins that are adjacent to or very far from the lake.
Currently, projects are underway to further verify and
calibrate the model at the subbasin level to improve its
289
-------
Figure 6. Field and crop management tables of the dairy model interface.
performance at smaller scales (16). Initial results of the
optimization component are currently being evaluated.

The dairy model represents a different approach in inte-
grating water quality models and CIS in that it is de-
signed to be generic and focused mainly on the farm
level. It is primarily designed to help policy- and decision-
makers analyze the effects of alternative dairy waste
management practices on the farm level. The framework
can easily be adapted to handle different types of animal
wastes (such as beef cattle and poultry) and to simulate
the impact of other crop management practices such as
pesticide applications.

References

1. Novotny, V., and G. Chesters. 1981. Handbook of nonpoint pol-
lution: Sources and management. New York, NY: Van Nostrand
Reinhold Company.

2. Council on Environmental Quality (CEQ). 1989. Environmental
trends. Washington, DC: U.S. Government Printing Office.
3. U.S. Geological Survey (USGS). 1987. Analysis and interpreta-
tion of water quality trends in major U.S. rivers, 1974-81. Water
Supply Paper 2307. Washington, DC: U.S. Government Printing
Office.

4. SFWMD. 1989. Interim Surface Water Improvement and Man-
agement (SWIM) plan for Lake Okeechobee. West Palm Beach,
FL: South Florida Water Management District.

5. Reck, WR. 1994. GLEAMS modeling of BMPs to reduce nitrate
leaching in Middle Suwannee River area. In: Proceedings of Sec-
ond Conference on Environmentally Sound Agriculture, Orlando,
FL. American Society of Agricultural Engineers, pp. 361-367.

6. Fedkiw, J. 1992. Impacts of animal wastes on water quality: A
perspective from the USDA. In: Proceedings of the National
Workshop on Livestock, Poultry, and Aquaculture Waste Manage-
ment, Kansas City, Ml (1991). American Society of Agricultural
Engineers (ASAE) Pub. 03-92. pp. 52-62.

7. Woolhiser, D.A., and D.L. Brakensiek. 1982. Hydrologic system
synthesis. In: Haan, C.T., H.P. Johnson, and D.L. Brakensiek,
eds. Hydrologic modeling of small watersheds. ASAE Monograph
No. 5. St. Joseph, Ml: American Society of Agricultural Engineers.
pp. 3-16.
290
-------
8. Fedra, K. 1993. GIS and environmental modeling. In: Goodchild,
M., B. Parks, and L. Steyaert, eds. Environmental modeling with
CIS. New York, NY: Oxford University Press, pp. 35-50.

9. Tim, U.S., M. Milner, and J. Majure. 1992. Geographic information
systems/simulation model linkage: Processes, problems, and op-
portunities. ASAE Paper No. 92-3610. St. Joseph, Ml: American
Society of Agricultural Engineers.

10. Whittemore, D.O., J.W. Merchant, J. Whistler, C.E. McElwee, and
J.J. Woods. 1987. Ground-water protection planning using the
ERDAS geographic information system: Automation of DRASTIC
and time-related capture zones. In: Proceedings of the National
Water Well Association (NWWA FOCUS) Conference on Mid-
western Ground-Water Issues, Dublin, OH. pp. 359-374.

11. Petersen, G.W, J.M. Hamlett, G.M. Baumer, D.A. Miller, R.L. Day,
and J.M. Russo. 1991. Evaluation of agricultural nonpoint pollu-
tion potential in Pennsylvania using a geographic information
system. ER9105. University Park, PA: Environmental Resources
Research Institute.

12. Goodchild, M. 1993. The state of CIS for environmental problem-
solving. In: Goodchild, M., B. Parks, and L. Steyaert, eds. Envi-
ronmental modeling with GIS. New York, NY: Oxford University
Press, pp. 8-15.

13. Liao, H.H., and U.S. Tim. 1992. Integration of geographic infor-
mation system (GIS) and hydrologic/water quality modeling: An
interface. ASAE Paper No. 92-3612. St. Joseph, Ml: American
Society of Agricultural Engineers.

14. Young, R.A., C.A. Onstad, D.D. Bosch, and W.P. Anderson. 1989.
AGNPS: A nonpoint source pollution model for evaluation of ag-
ricultural watersheds. J. Soil and Water Conserv. 44(2):168-173.

15. Leonard, R.A., W.G. Knisel, and D.A. Still. 1987. GLEAMS:
Ground-water loading effects of agricultural management sys-
tems. Trans, of the Amer. Soc. of Agricul. Eng. (ASAE)
30(5):1,403-1,418.
16. Negahban, B., C. Fonyo, W Boggess, K. Campbell, J. Jones, G.
Kiker, E. Hamouda, E. Flaig, and H. Lai. 1993. A CIS-based
decision support system for regional environmental planning. In:
Proceedings of the Conference on Application of Advanced Infor-
mation Technologies: Effective Management of Natural Re-
sources, Spokane, WA. American Society of Agricultural
Engineers (ASAE). pp. 169-178.

17. Heatwole, C.D., K.L. Campbell, and A.B. Bottcher. 1987. Modified
CREAMS hydrology model for coastal plain flatwoods. Trans, of
the Amer. Soc. Agricul. Eng. (ASAE) 30(4):1,014-1,022.
18. Kiker, G.A., K.L. Campbell, and J. Zhang. 1992. CREAMS-WT linked
with GIS to simulate phosphorus loading. ASAE Paper No. 92-9016.
St. Joseph, Ml: American Society of Agricultural Engineers.
19. Skaggs, R.W 1980. DRAINMOD reference report. U.S. Depart-
ment of Agriculture, Soil Conservation Service. Fort Worth, TX:
South National Technical Center.
20. Tremwel, T.K., and K.L. Campbell. 1992. FHANTM, a modified
drainmod: Sensitivity and verification results. ASAE Paper No. 92-
2045. St. Joseph, Ml: American Society of Agricultural Engineers.

21. Knisel, W.G. 1980. CREAMS: A field-scale model for chemicals,
runoff, and erosion from agricultural management systems. Con-
serv. Res. Rep. No. 26. Washington, DC: U.S. Department of
Agriculture.
22. Priestly, C.H.B., and R.J. Taylor. 1972. On the assessment of
surface heat flux and evaporation using large-scale parameters.
Monthly Weather Rev. 100:81-92.
23. Jensen, M.E., R.D. Burman, and R.G. Allen, eds. 1990. Eva-
potranspiration and irrigation water requirements. Manuals and
reports on engineering practice. Amer. Soc. Civil Eng. No. 70.

24. Reddy, K.R., R. Khaleel, M.R. Overcash, and P.W. Westerman.
1979. A nonpoint source model for land areas receiving animal
waste: II. Ammonia volatilization. Trans, of the Amer. Soc. Agric.
Eng. (ASAE) 22(6): 1,398-1,405.
291
-------
Other GIS Applications
-------
Expedition of Water-Surface-Profile Computations Using GIS
Ralph J. Haefner, K. Scott Jackson, and James M. Sherwood
Water Resources Division, U.S. Geological Survey, Columbus, Ohio
Abstract
Water-surface profiles computed by use of a step-back-
water model such as Water Surface PROfile (WSPRO)
are frequently used in insurance studies, highway de-
sign, and development planning to delineate flood
boundaries. The WSPRO model requires input of hori-
zontal and vertical coordinate data that define cross-
sectional river-channel geometry. Cross-sectional and
other hydraulic data are manually coded into the WSPRO
model, a labor-intensive procedure. For each cross sec-
tion, output from the model assists in approximating the
flood boundaries and high-water elevations of floods with
specific recurrence intervals (for example, 100-year or
500-year). The flood-boundary locations along a series of
cross sections are connected to delineate the flood-prone
areas for the selected recurrence intervals.

To expedite the data collection and coding tasks re-
quired for modeling, the geographic information system
(GIS), ARC/INFO, was used to manipulate and process
digital data supplied in AutoCAD drawing interchange
file (DXF) format. The DXF files, which were derived
from aerial photographs, included 2-foot elevation data
along topographic contours with +0.5-foot resolution
and the outlines of stream channels. Cross-section
lines, located according to standard step-backwater cri-
teria, were digitized across the valleys. A three-dimen-
sional surface was generated from the 2-foot contours
by use of the GIS software, and the digitized section
lines were overlain on this surface. GIS calculated the
intersections of contour lines and cross-section lines,
which provided most of the required cross-sectional ge-
ometry data for input to the WSPRO model.

Most of the data collection and coding processes were
automated, significantly reducing labor costs and hu-
man error. In addition, maps at various scales can be
easily produced as needed after digitizing the flood-
prone areas from the WSPRO model into GIS.
Introduction and Problem Statement

Losses due to flood damage generally cost the Ameri-
can public hundreds of millions of dollars annually. In
1968, the National Flood Insurance Act established the
National Flood Insurance Program (NFIP) to help re-
duce the cost to the public and provide a framework to
help reduce future losses. The Federal Emergency Man-
agement Agency (FEMA) administers the NFIP. As listed
in Mrazik and Kinberg (1), the major objectives of the
NFIP are to:

• Make nationwide flood insurance available to all com-
munities subject to periodic flooding.

• Guide future development, where practical, away
from flood-prone areas.

• Encourage state and local governments to make ap-
propriate land use adjustments to restrict develop-
ment of land that is subject to flood damage.

• Establish a cooperative program involving the federal
government and the private insurance industry.

• Encourage lending institutions, as a matter of national
policy, to assist in furthering program objectives.

• Authorize the continuing studies of flood hazards.

Studies of flood-prone areas typically involve using
step-backwater computer algorithms (digital models) to
estimate river water-surface profile elevations and flood-
inundation patterns along the topography of the river
and its overbanks. FEMA recognizes the U.S. Geologi-
cal Survey's (USGS's) step-backwater model, Water
Surface PROfile (WSPRO), as a suitable computer
model for use in flood insurance studies (2, 3). Basic
data input for step-backwater models includes:

• Estimates of flood discharge and initial water-surface
elevations.

• Stream cross-sectional geometry.

• Roughness coefficients for cross sections.
295
-------
• Contracted opening geometry if bridges or culverts
are located along the study reach.

Obtaining meaningful model results typically requires
numerous stream cross sections referenced to a com-
mon elevation datum along a stream reach. The data-
collection efforts to obtain these cross-sectional data
require costly, labor-intensive fieldwork. Study efforts
along lengthy stream reaches may, however, involve the
generation of a contour map using aerial photogram-
metric mapping techniques. Processing the spatial data
may still require extensive labor to extract the cross-
sectional data needed for the WSPRO model.

The development of geographic information systems
(CIS) technology has greatly enhanced analyses of spa-
tial data such as topography. In an effort to improve the
quality of mapping and delineation of flood-prone areas
in Summit County, Ohio, the USGS developed a method
of using a CIS as a pre- and postprocessor of the input
and output data for the WSPRO model. This paper
describes the steps the USGS used to develop this
interface and discusses some difficulties encountered
during the process.

Approach

Several steps were taken that resulted in the delineation
of a flood-prone area in Summit County, Ohio. These
steps are shown in a flow chart (see Figure 1) and
described below.

Data were obtained for this study via aerial photography
during April 1990. These data include mappable fea-
tures at the given scale including topography at 2-foot
contour intervals, stream boundaries, roads, and build-
ings. The data are estimated to be vertically accurate to
+0.5 feet. The data were put into AutoCAD and were
prepared for delivery to the USGS on 3.5-inch floppy
disks in AutoCAD drawing interchange file (DXF) ASCII
format. ARC/INFO was used to convert the DXF file into
two separate data layers containing only the topography
and traces of stream banks within the study area.

A three-dimensional surface was generated from the
topographic data using the ARC/INFO software package
Triangulated Irregular Network (TIN). Cross-section
lines were digitized over the topography data layer. The
cross sections were placed according to standard step-
backwater criteria (4) and were generally:

• Perpendicular to stream flow

• At major breaks in streambed profiles

• At minimum and maximum cross-sectional areas

• At major changes in stream conveyance

• Spaced about one cross-section width apart
Obtain
Aerial
Photography
Put Data in DXF Format
Process DXF Data With CIS (ARC/INFO)
Establish Relation of Attribute Data to Spatial Data
Generate Three-Dimensional Surface
Overlay Cross Sections and
Calculate Intersections
Verify Elevations With Topographic Controls
Generate Input Files for Model With
Cross-Section Data
Use WSPRO Model To Compute Water-Surface
Elevations for Each Cross Section
Plot Model Output on Topography Data Layer and
Connect Endpoints of Cross Sections
Figure 1. Flowchart of data conversion and processing for use
in the Water Surface PRQfile.
The cross-section lines were then overlaid on the three-
dimensional surface of topography, and CIS calculated
the intersections of the contour lines and cross sections.
The locations and elevations of these intersections were
output as an ASCII file and slightly modified for input into
the WSPRO model.

These CIS data were used along with the aforemen-
tioned required data as input to the WSPRO model.
Input for the model included estimates of the 100-year
flood discharge (5), stream cross-sectional geometry
(supplied by this work), and estimates of roughness
coefficients for cross sections. The WSPRO model was then
run, providing output in the form of water-surface eleva-
tions at specific distances along section lines correspond-
ing to the simulated elevation of a 100-year flood.

Points corresponding to the flood elevations along the
cross-section lines were plotted on the topography data
296
-------
layer and were connected manually (to delineate flood
boundaries) by interpolating the elevations with respect
to adjacent contours. A polygon of the flood "surface"
was generated and drawn on a map (see Figure 2).

Results

The supplied topographic data were of sufficient quality
and resolution to substitute for field-surveyed eleva-
tions; however, field surveys to verify the elevations
along the cross sections would augment this quality
control process (see Figure 1). Typically, a crew of two
individuals may take up to 4 days to survey and reduce
the field data for the study area chosen for this study.
Because aerial photography is commonly substituted for
land surveying, the most significant effort and source of
error may come from manually extracting elevations and
distances along cross sections for input into the WSPRO
0 500 1,000 1,500 2,000 2,500 Feet
I i i i i I
100 200 300 400 500 Meters
Legend
100-Year Flood-Prone
Area

Topographic Contours
(Contour Labels Omitted
for Clarity; Contour
Interval 2 Feet)
Cross-Section Lines
Stream-Bank Trace
Intersection of 100-Year
Flood-Prone Area and
Cross-Section Line
Figure 2. Watershed showing delineation of 100-year flood-prone area.
297
-------
model. Initial development of the method to use CIS for
this analysis took approximately 1 week to refine; how-
ever, future analyses would probably only take one per-
son 1 day to perform. This represents a significant cost
savings. Additionally, reducing the amount of human-in-
duced error can substantially improve the reliability and
accuracy of the computer-generated flood-prone area
data.

Because topography, stream traces, and other features
are supplied in the DXF file, these data can easily be
brought into CIS. Maps can be made that show these
features in relation to the predicted flood-prone area.
Maps showing a variety of features can be produced at
any scale, with accuracy limited only by the accuracy of
the source scale. Additionally, CIS can calculate the
intersection of map features that may lie within the
flood-prone area, such as buildings that may contain
hazardous materials. CIS can also overlay land use
data layers within the flood-prone area to define areas
that should not be developed or that have already been
overdeveloped in accordance with the aforementioned
NFIP objectives.

FEMA now requests that future flood-study mapping be
completed using CIS format, a common goal that both
the USGS and FEMA are working toward. These data
are important to land planners, flood-plain regulators,
and insurance companies that rely on accurate esti-
mates of flood-prone areas. By increasing the accessi-
bility of the data by using CIS, we can substantially
improve our ability to analyze spatial data efficiently.

Problems Encountered

Problems using data supplied in DXF format in conjunc-
tion with CIS resulted primarily from the fact that the
DXF data were prepared for the purpose of making a
topographic map, not a CIS data layer. The contour lines
were segmented; that is, where ends of segments met,
they were not physically connected to form a topologi-
cally viable data layer. The data layer needed to be
edited because CIS requires topology for spatial-data
processing. Additionally, in areas where the topographic
gradient was particularly steep, contour lines were omit-
ted. In both cases, an attempt was made to allow CIS
to establish a physical connection of contour lines, but
subsequent manual interpolation was also required.
This may have introduced error into the data set. If future
work requires the use of DXF data, the request for data
should specifically state that all topographic contours be
continuous.
AutoCAD stores data differently from CIS, so a relation-
ship needed to be established between the data file
containing elevations and the data file associated with
the lines that make up the topography data layer. Sev-
eral lines from the DXF file did not have any data asso-
ciated with them, thus necessitating the addition of
contour elevation data by context with the adjacent con-
tours that did have data. This step may also have intro-
duced errors, but quality-control measures to verify the
topographic contours and contour elevations could help
to minimize these errors.

Output from the WSPRO model is in the form of a series
of points along cross sections that were connected by
manual interpolation. This step also may introduce some
error, but the same process must be performed when
not using CIS.

Conclusions

This report documents an example of how CIS can be
used to facilitate step-backwater modeling of flood-
prone areas. The results of the study show that signifi-
cant savings may be expected in the form of reduced
labor requirements. Furthermore, FEMA now requires
the use of CIS to conduct flood-study mapping, thus
providing a means to conduct additional spatial analy-
ses more efficiently. As aerial photography and CIS
technology improve, although additional sources of error
may arise, the overall accuracy, reliability, and repro-
ducibility of the model input and results should also
improve.

References

1. Mrazik, B.R., and H.A. Kinberg. 1991. National flood insurance
program: Twenty years of progress toward decreasing nationwide
flood losses. Water Supply Paper 2375. U.S. Geological Survey.

2. Shearman, J.O., W.H. Kirby, V.R. Schneider, and H.N. Flippo.
1986. Bridge waterways analysis model. Research Report
FHWA/RD-86/108. Federal Highway Administration.

3. Shearman, J.O. 1990. Users manual for WSPRO, a computer
model for water-surface profile computations. FHWA-IP-89-027.
Federal Highway Administration.

4. Davidian, J. 1984. Computation of water-surface profiles in open
channels: Techniques of water-resources investigations of the
United States Geological Survey. In: Applications of Hydraulics,
Vol. 3.

5. Koltun, G.F., and J.W Roberts. 1990. Techniques for estimating
flood-peak discharges of rural unregulated streams in Ohio. In-
vestigations Report 89-4126. U.S. Geological Survey, Water Re-
sources.
298
-------
Small Is Beautiful: GIS and Small Native American Reservations—
Approach, Problems, Pitfalls, and Advantages
Jeff Besougloff
Upper Sioux and Lower Sioux Indian Communities, Redwood Falls, Minnesota
Background

The Lower Sioux Indian Reservation
The Lower Sioux Indian Reservation covers 1,743 acres
in southwestern Minnesota bordering the Minnesota
River. The land base consists of several hundred acres
of prime, flat agricultural land, a large wetlands slough
complex, prairie pothole wetlands, bottom land wet-
lands, small lakes, and approximately 250 acres of tim-
ber and brush. The elevation ranges from Minnesota
River level to the adjacent bluffs several hundred feet
higher.
This rural reservation contains a moderate amount of
infrastructure, including paved and dirt roads, 12-acre
sewage lagoon serving a moderately sized casino, com-
munity water system composed of a tower and small
treatment plant for the 90 mostly single-family dwelling
homes, convenience store/gas station/gift shop, com-
munity center, small two-room schoolhouse, pottery works
with gift shop, warehouse, and church. The casino-
fueled economic boost to the community recently re-
sulted in improvements to infrastructure and plans for
additional projects.

Office of the Environment
The tribal government was formed under the Indian
Reorganization Act of 1934. The governing body is an
elected five-person tribal community council that admin-
isters several government departments and is responsi-
ble for all government activities.
Under a U.S. Environmental Protection Agency (EPA)
Region 5 multimedia grant, the Upper Sioux and Lower
Sioux Office of the Environment (OE) was formed in late
1992. This unique joint venture between two tribes and
EPA envisioned moving the tribal governments into com-
pliance with major federal environmental legislation. At
the present time, only the Lower Sioux are developing
a tribal geographic information system (GIS). Therefore,
this article is solely applicable to this community, al-
though adoption of an Upper Sioux Community GIS
would likely follow a similar lifeline.

Environmental Regulation in Indian Country

Reservations are subject to a bewildering array of envi-
ronmental regulations. Numerous meetings, publications,
projects, and court decisions are devoted to determining
what law does or does not apply on any particular
reservation. In very general terms, the following can be
stated: state environmental regulations do not apply,
federal regulations do apply, and tribal regulations may
apply. From a tribal environmental employee's point of
view, numerous environmental regulations (whether fed-
eral or tribal) do exist that apply to reservation activities
and land, and they require compliance.

The Problem and the Solution

The OE's responsibility is to bring the reservation into
compliance with the 14 major pieces of environmental
legislation administered through EPA and directly appli-
cable to tribes. The OE finds itself responsible for any
and all other applicable environmental regulations and
all other less-regulated environmental media. The OE
currently has a staff of one.

In addition to the responsibility of moving the tribes into
compliance with federal environmental regulations, the
OE also develops environmental infrastructure, insti-
tutes environmental programs, and performs grant writ-
ing. Lower Sioux programs currently include Clean
Water Act (CWA) and Safe Drinking Water Act (SDWA)
compliance, solid waste planning including develop-
ment and institution of a household recycling program,
wetlands regulations compliance, wetlands mapping
and restoration, National Environmental Policy Act
(NEPA) compliance and site assessments, basic hydro-
logical data gathering and mapping, radon testing and
mitigation, environmental education as necessary, SARA
299
-------
Title III compliance and planning, and a variety of related
tasks.

Contracts or grants are currently being administered
under several Bureau of Indian Affairs (BIA) programs,
U.S. Geological Survey (USGS) and U.S. Army Corps
of Engineers (COE) matching funds programs, two EPA
programs, one Federal Emergency Management
Agency (FEMA) program, one Administration of Native
Americans (ANA) program, one Great Lakes Intertribal
Council (GLITC) program, and a cooperative project
with the National Tribal Environmental Council (NTEC).

Needless to say, responsibilities of the OE are limited by
staff hours rather than need. As distressing as the res-
ervation's unaddressed environmental needs are,
equally distressing (prior to CIS development) was the
helter skelter manner in which the OE digested the data
and information flowing into the office. Because of its
broad responsibilities and the administrative problems
being encountered, the OE began to investigate devel-
oping a tribal CIS.

The Lower Sioux GIS

System Selection

The Lower Sioux GIS system is a networked PC station
through the Bureau of Indian Affairs Geographic Data
Service Center's (BIA GDSC's) two Sun MP690
SparcServers in Golden, Colorado. The GDSC is the
hub of BIA's GIS and remote sensing program, known
as the Indian Integrated Resources Information Pro-
gram (IIRIP). The purpose of the IIRIP is twofold: first,
make GIS and remote sensing technology available to
tribes and BIA personnel; second, transfer these tech-
nologies to tribal organizations.

Database development and management functions,
technical support, development of simplified user inter-
faces, remote sensing interpretation, and implementa-
tion of equipment directives are performed by
approximately 30 GDSC employees for approximately
230 GDSC users. User technical support is also avail-
able through BIA field offices, each of which has a
designated GIS coordinator. Simplified user interfaces
for specialized programs have been developed includ-
ing the Lightening Display System and the Land Title
Mapping System. Quality control is provided for non-
BIA-produced data that will be inputted.

The GDSC has standardized on the ARC/INFO family
of software produced by Environmental Systems Re-
search Institute (ESRI). GDSC has developed a number
of hardware/software configuration options depending
on tribal needs and financial resources and based upon
GDSC experience. The OE happily relies on this expe-
rience to avoid the familiar horror stories related to
equipment and software incompatibility.
Based upon GDSC configuration advice, the initial GIS
setup will be on the OE's existing Compaq PC using
Tektronix Terminal Emulation software (EM4105) and a
Multitech modem (MT932BA). The system can use the
OE's Hewlett Packard (HP) DeskJet 500, although a
significant upgrade, possibly to an HP Paint or HP Excel
Paint, is soon expected. Initial startup hardware and
software costs are minimal in this configuration. Costs
for the above equipment and introductory training are
less than $5,000.

GIS Users at Lower Sioux

Initial setup and data loading will be in the OE, and the
OE employee will receive introductory training on the
system. Because the OE is formed through a coopera-
tive agreement between two tribes, the Upper Sioux and
the Lower Sioux, the OE is centrally located between
the reservations. The system will probably be relocated
to the Lower Sioux Community Center within 1 year. A
tribal government employee will receive advanced GIS
training and be available for all tribal government depart-
ments and businesses.

Funding

In addition to tribal contributions, funding has come
through several sources and joint agreements with the
tribe and BIA, EPA, and ANA.

Training

The GDSC supplies no-cost training to tribes. The Geo-
graphic Data Service Center 1995 Training Catalog (no
federal document number available) offers eight formal
courses repeatedly throughout the year, a 5-week intern
program, and a cooperative student program. Courses
are held at the GDSC or by request at BIA field offices
and tribal locations.

The GDSC also produces the monthly The Service Cen-
ter Review (ISSN 1073-6190), a helpful compilation of
current issues, available resources, system bugs, and
other items of interest to GDSC users.

Data Collection and Input

Data collection can be divided into three categories:
aerial photography, portable global positioning system
(GPS) data, and ARC/INFO export files created under
contracted studies.

Aerial Photography

Surface features and topography will be obtained us-
ing aerial photography reduced to GIS format, then
downloaded to the GDSC. Coverages will consist of 62
categories of features on a scale of 1 inch = 100 feet
with 2-foot contour intervals.
300
-------
Global Positioning Systems

Use of a portable Trimble, Inc., GPS Pathfinder Pro XL
submeter GPS mapping system purchased with assis-
tance from an ANA grant will allow updating of surface
features and addition of nonsurface features as neces-
sary. It will also facilitate input of attribute data.

The GPS will also be used during field work by USGS
on the Lower Sioux hydrological mapping project to
obtain data that otherwise would not be put into
ARC/INFO export file for any reason (i.e., it might not be
directly related to the project at hand or outside the
agreed upon data to be converted to ARC/INFO export
file form but nevertheless is of importance to the OE).
The alternative is that this type of information never
makes it into the CIS and is lost.

ARC/INFO Export Files

Fortunately, most federal agencies that supply funding
to tribes for environmental work are well versed in CIS
applications and the need for CIS-ready data. The OE
now requires all information and mapping to be deliv-
ered as an ARC/INFO export file with data registered to
a real world coordinate system. Downloading of this
data to the GDSC mainframes allows for direct input of
the data. The OE has contemplated, but not acted on,
conversion of existing data for the CIS. This is an ex-
pensive and time consuming process that must be
weighed in comparison with recollecting the data. Ironi-
cally, the lack of reservation data therefore becomes a
benefit because time consuming and expensive data
conversion is unnecessary.

The Intertribal GIS Council

Information gathering, networking, and addressing
uniquely tribal problems were some of the accomplish-
ments at the first annual meeting of the Intertribal GIS
Council (IGC) held in June 1994. Vendors as well as BIA
regional office and GDSC representatives answered
questions and presented panels. This annual confer-
ence is likely to become a major benefit to the tribe as
it continually develops the GIS.

The Future

As the tribal government becomes more familiarwith the
GIS, its uses, and advantages, recognized governmen-
tal needs will likely drive the development of further
coverages. The OE also expects to access existing
governmental data of importance to the tribe in an effort
to expand the GIS database and is actively seeking
sources of such information.

Philosophical Caveat

Albert Einstein stated that, "The significant problems we
face cannot be solved at the same level of thinking we
were at when we created them." Some assume that GIS
is the next level of reasoning in the environmental pro-
fession because we can accomplish tasks more quickly,
more efficiently, with more variables accounted for, and
beyond what we could have hoped to accomplish prior
to GIS.

Essentially, what we have gained is speed and the
capacity to include additional data, which is not what
Einstein was referring to when he spoke of the next
level. Wisdom, in the sense of a higher level of
understanding, is the necessary ingredient to the
solution of current environmental problems; in other
words, movement beyond the paradigm that created
the problem. GIS may be the tool that pushes the envi-
ronmental professional to the next level of wisdom by
presenting the data and information in a manner that
allows the user to stand back and see more clearly on
a higher plane. But that level can be found only within
the environmental professional himself or herself and
not within GIS.
301
-------
A GIS-Based Approach to Characterizing Chemical Compounds in Soil and
Modeling of Remedial System Design
Leslie L. Chau, Charles R. Comstock, and R. Frank Keyser
ICF Kaiser Engineers, Inc., Oakland, California
Introduction: The Problem

The cost-effectiveness of implementing a computerized
geographic information system (CIS) for environmental
subsurface characterization should be based on long-
term remedial objectives. A CIS project was developed
to characterize soil contamination and to provide design
parameters for a soil vapor extraction remedial system,
as part of a $120-million remediation and "land sale"
project in California. The primary purposes of the CIS
were to efficiently combine and evaluate (model) dispa-
rate data sets, provide "new" and more useful informa-
tion to aid in short-term engineering decisions, and
support the development of long-term cleanup goals.

The project had a major change in scope early on, and
the schedule was expedited to allow for the develop-
ment of "land sale" options and for actual site redevel-
opment at the earliest opportunity. Characterization of
chemically affected soil would have been compromised
given the above circumstances without an ambitious
undertaking of concurrently developing and implement-
ing a CIS with three-dimensional (3-D) geostatistical
and predictive modeling capabilities.

The GIS Approach

Computer solutions included the use of a cross-platform
(DOS and UNIX) GIS to quickly and systematically in-
corporate spatial and chemical data sets and to provide
a distributed data processing and analysis environment
(see Figure 1). Networked, DOS-based relational data-
bases were used to compile and disseminate data for
the numerous investigatory and engineering tasks.
UNIX-based computer aided design (CAD) and model-
ing applications received data from databases, per-
formed quantitative analyses, and provided 3-D
computer graphics. Given the aggressive project sched-
ule, exclusive use of one platform would not be realistic
due mainly to the limited data modeling capacity and 3-D
graphics in DOS systems. On the other hand, the high
startup and operating costs of several UNIX worksta-
tions would render their exclusive use much less cost-
effective.

The hardware and software configurations were inte-
grated in a client/server Intergraph InterPro 6400 with
48 megabytes of memory. It is largely a 3-D CAD system
with add-on modules of geologic mapping and 3-D vis-
ual models capable of consolidating both environmental
and engineering parameters for analysis (see Figure 1).
Textual environmental and geologic data were extracted
by SQL queries from relational databases and were
transferred to mapping and modeling modules via
PC/TCP cross-platform linkage.

The GIS assisted in making short- and long-term deci-
sions regarding health-risk-based regulatory strategy
and engineering feasibility. Use of spatial statistical and
predictive models was part of a CIS-based decision-
making loop (see Figure 2). The process supported
concurrent activities in:

• Data collection: field program.

• Numerical models of remedial system configurations.

• Development of cleanup goals from health risk
assessments.

• Remedial design with CAD capability.

Site Background

In early 1993, the remedial investigation of the operable
unit for soil at a former aircraft manufacturing facility in
southern California was thought to be ready for remedial
alternatives feasibility study. ICF Kaiser Engineers, Inc.,
was awarded the contract to perform feasibility studies
on applicable soil cleanup technologies and to sub-
sequently design and manage the installation and early
operation of the selected technologies. After $700,000
was spent evaluating data collected by previous consult-
ants, it was decided that an additional $5 million worth
302
-------
DOS-Based
Token Ring Network

Site
Survey
Activities
Sampling
K
R*

RDBMS
(Paradox)

UNIX-Based
Data Visualization and Computer-Aided Engineering

-4
PC/TCP

Informix
RDBMS

Visualization
CADD
Laboratory
Analytical
Results
Health Risk
Assessment
Regulatory
Reporting/
Interaction
Modeling Module
3-D Kriging Spatial
Analysis
3-D Vapor Flow
Analysis
3-D Chemical Mass
Transport
Analysis
H
TCP/IP
High Impact 3-D
Solid Models
Volumetric
Analysis of
Static and
Dynamic Mass
Flow
Subsurface
Engineering
3-D Piping

Electrical

Treatment
System Housing
Accelerated Implementation To Establish Cle
Require Low Overhead Upfront Cost and Off-the
PC-Based
Data Visualization
Report Generation
anup Goals
Shelf Software

Regulatory and
Public
Interaction
Client
Deliverables

Extended Engineering Phase
Specialized CAD Software and Analytical Staff

Figure 1. Multiplatform GIS project.
Ongoing Site Investigation
Soil/Soil Vapor Sampling
Data Processing
1. Storage (RDBMS)
2. Dissemination
Modeling
1. Spatial Correlation
2. Fate and Transport
Conceptual
Treatment Alternatives
Health Risk
Assessment
Permitting
Environmental Planning
Site Redevelopment
Establishment of
Cleanup Goals
Final Remedial
Design
Figure 2. GIS-assisted decision tree.
of field activities were required to more definitively esti-
mate the volume of chemically affected soil and the
nature and extent of contamination at the facility. Be-
cause of the data gaps, the selection and design of
alternatives could not be addressed with a high degree
of certainty. Hence, computer assisted data processing
was crucial to speed up the feasibility study, accelerate
downstream work, and reduce the overall project sched-
ule to the minimum.

The site is environmentally complex, covering an area
of approximately 120 acres. As a result of nearly half a
century of aircraft production and development, soil be-
neath the facility is affected by fuel and heavy oil hydro-
carbons (TPH) commingled with volatile compounds,
mainly perchloroethylene (PCE) and trichloroethylene
(TCE) (see Figure 3). Ground water at 170 feet below
ground surface is affected by TCE and PCE, but it is not
part of the drinking water aquifer. The facility has been
demolished, and shallow contaminated soil has been
excavated and back-filled to an interim grade.

Methodology

Health-Risk-Based Cleanup Goals

Central to determining the volume and kinds of data to
be collected was the question of whether chemicals in
soil represented potentially unacceptable risks to human
303
-------
VAPOR DISTRIBUTION W
Figure 3. Aircraft manufacturing facility in California. Outline of demolished buildings located at the 120-acre site are shown as
surface features for reference. A geostatistical model of a 3-D kriged VOC soil vapor cloud in the subsurface was simulated
with Intergraph's MGVA. Views displayed are: 3-D isometric, vertical section of chemical isoplaths, and a nearly plan view.
Digital simulation also illustrates VOCs affecting ground water in a dispersive nature at a depth of nearly 170 feet bgs
(shown at bottom of isometric view).
health and to the environment, with the former being of
particular concern to construction workers onsite during
redevelopment.

Because site redevelopment was scheduled to begin in
the near term, data collection and CIS analysis concen-
trated on shallow depths (top 20 feet), with decreasing
sample density at greater depths. A health-risk-based
cleanup goal (HBCG) approach to collecting more data
was to establish cleanup goals for near-term remedia-
tion of the shallow soils as well as for long-term remedial
measures of contaminated soils at greater depths. Fur-
ther, various regulatory agencies had to approve the
estimated cleanup goals in a short time. Ongoing site
demolition and excavation schedules encouraged the
aggressive regulatory negotiations. The shallow cleanup
goals for volatile organic compounds (VOCs) and TPH
determined the volume of contaminated soil to be re-
moved. At greater depths, data gaps were minimized to
more definitively characterize the nature of TPH and
VOC contaminations and to facilitate the implementation
of long-term remedial objectives (i.e., in situ soil vapor
extraction).

In situ soil vapor and soil sampling composed the field
program, which provided data to map the subsurface
distribution of volatile organic compounds, including
TCE and PCE. Only in situ soil sampling was used for
characterizing TPH. The ratio of soil vapor to soil sam-
ples was 4:1. No previous soil vapor information was
available. ICF Kaiser has been refining the technique of
comparing results from paired soil vapor and soil sam-
ples in past and similar projects. Hydraulic probes were
used instead of drilling to acquire soil vapor samples at
shallow depths. This minimized waste and cost in the
field program significantly.

Risk Assessment and Spatial Analysis

Human health risk analyses were conducted for the
entire site, and risk factors were contoured and overlaid
304
-------
on maps of past usage and known soil contamination
areas. Before the risk modeling could proceed, chemical
and lithological data gathered in the past 7 years and
those acquired by ICF Kaiser populated the environ-
mental relational databases. Approximately 522 soil va-
por probes were located in 100-square-foot spacings
with additional probes in areas requiring better plume
definitions. The database contains approximately
15,000 xyz-records of soil and soil gas laboratory ana-
lytical results. This information in text and graphics form,
combined with site infrastructures and building outlines
with attributes of "past usage," were stored as map
layers, making up the CIS nucleus. Accuracy of site
maps was verified with aerial photographs when avail-
able. Data types combined for computerized evaluation
included known locations of contaminated soil, contami-
nated ground water, soil types, and site features. Com-
posite risk maps of the above data were analyzed for
data gaps at discrete depth intervals. This analysis was
performed while the field program was in progress
and hence gave guidance to optimize the locations of
additional data points and to minimize the number
of samples taken.

The MGLA/MGLM mapping module and the MSM ter-
rain modeling module tracked the earth excavation and
removal of contaminated soil. Excavation was largely
part of site demolition. It also expedited the removal of
TPH contaminated soils, however, because no other
short-term means of remediation are available for these
substances. Tracking of removed soils was essential
because concurrent field activities were occurring in site
demolition, data gathering, and risk modeling.

The CIS coordinated all three. Geologists and surveyors
provided terrain data from daily excavation activities,
which were transcribed into database formats. Maps
illustrated the locations of excavated soil and removed
chemicals in soil at various depths. Although TCE and
PCE were of foremost concern as health risks, all com-
pounds and some metals identified in soil were
screened for unacceptable risk. Terrain modeling (map-
ping) as part of health risk assessment may seem un-
usual, but results of estimated cleanup levels and
accurate locations of left-in-place contamination, mostly
soils at greater depths, were critical to the cost-effective-
ness and proper design of long-term remedial systems.

Characterization of Subsurface VOCs

In situ soil vapor extraction (SVE) of total volatile com-
pounds in dense nonaqueous, liquid, gaseous, and ad-
sorbed solid forms in the subsurface produced favorable
results that have been well documented in recent years.
ICF Kaiser proposed a very large-scale SVE system
(see Figure 4), perhaps the largest yet, as long-term
remedial technology for this former aircraft manufactur-
ing site. The primary design problem was speculating on
air flow capacity and operating time of the complex
system components. The SVE system comprises three
fundamental elements:

• Front-end, in situ subsurface vents (totaling 193
corings).

• Applied vacuum and air transport manifolds linking
the subsurface vents to the treatment compound (dis-
tance of one-quarter mile with over 100 manifolds).

• A multivessel activated carbon treatment system.

To size the pipes, carbon vessels, and vacuum required
to achieve a certain rate of VOC removal, the total mass
and nature of sorption had to be known. Due to the
schedule-driven nature of this project, the SVE design
accounted for the time needed to accomplish the
cleanup goals.

To estimate the extent and total mass of VOCs in the
subsurface, soil vapor data were input to a 3-D kriging
algorithm (1) to produce a concentration continuum
model (see Figure 3). This solid model of predicted total
VOC concentrations took the form of a uniformly spaced
3-D grid-block that completely encased the site. Cell
sizes ranged from 10 to 20 cubic feet, depending on the
model run, number of data clusters, density of data
points in areas of clustered data, and the standard
deviation of variances for estimated values in all cells.
The Fortran program estimated a concentration value
for each cell based on the nearest field sample(s).

The validity of such "block kriging" models can be judged
by the size of the variances, smoothness, and agree-
ment with nearby field data. Because volume is a known
quantity in kriging, the total mass can be calculated by
incorporating soil bulk density or porosity, both of which
were less than abundant for this investigation. Render-
ings of kriged results in 2-D plan view contour maps,
cross-sectional maps, and 3-D "vapor cloud" (see Figure
3) were included in client reports and used in regulatory
presentations and public forums.

Remedial Design Layout

Final Extension of a Fully Integrated CIS

With the total mass and extent of VOCs derived from
3-D kriged results, the applied vacuum at individual vent
heads and the cumulative pressure (negative) neces-
sary to extract and transport VOC vapors from the sub-
surface to the treatment system can be estimated. We
performed 3-D air flow analysis by use of finite differ-
ence fluid flow models and chemical transport models.
The Fortran codes used to approximate compressible
flow and chemical transport were AIR3D (2) and VT3D
(3), respectively. Air flow simulations focused on maxi-
mizing vacuums at the shallow depths down to 20 feet
to expedite remediation of contaminated soils that were
305
-------
Figure 4. A rendering by the Intergraph 3-D Plant Design System of an in situ soil vapor extraction and treatment system. The
cutaway section located near the upper left portion of the figure exposes some of the 193 subsurface extraction vents
bottoming at 120 feet (bgs). These vents are located in a cluster for long-term extraction of the VOC vapor cloud presented
in Figure 3. Vents are connected to a system of parallel airflow manifolds (right side of figure), which runs one-quarter of
a mile to the treatment compound (foreground of figure).
not removed during site demolition and excavation. The
lower depths were also included in each simulation.
Transient mass transport models incorporate flow fields,
given by flow models, and predicted cleanup times
based on established HBCG cleanup goals. As VOC
concentrations in an operating SVE system fall below
cleanup levels in the top 20 feet, thus minimizing human
risk, available vacuums thereafter will be diverted to
vents at lower depths to be part of long-term extraction
scenarios. Models suggested that cleanup for the top 20
feet can be accomplished within 1 year.

Numerical models prescribed vacuum levels at each
vent head, which is the aboveground segment of a
subsurface SVE vent. The 193 vents are connected to
a system of parallel manifolds (see Figure 4) that trans-
port vapor to the treatment system. With the vacuums
known at vent heads, the size of manifolds and capacity
of vacuum blowers can be determined and integrated
into the overall system design. With 3-D Plant Design
module as part of the Intergraph CAD/GIS, manifold
layouts and treatment compound can be modeled in 3-D
and easily checked for pipe routing interferences. The
final layout of the SVE system was overlaid onto contour
maps of total VOC concentrations to check on accuracy
and completeness of vent locations and manifold layouts.

Conclusion

Maximized Visual and Analytical Responses

One goal of this project was to expedite regulatory
negotiations and gain early acceptance of cleanup
goals. The computerized data processing and visualiza-
tion contributed generously to the rapid understanding
of modeling results by expert regulators and the lay
public. Likewise, the CIS facilitated the response to
regulatory comments. Positive comments first came
306
-------
from the client's in-house review of model results and
the high impact 3-D color rendering of kriged VOC dis-
tributions in the subsurface (see Figure 3).

Analytically, benefits were derived from the efficiency of
electronic data access and the ability to "predict" the
presence of contaminant in areas with sparse field data.
The process of kriging involves the linear interpolation
and extrapolation of existing data. The resultant con-
taminant distribution is a "conservative" model that pro-
vided the best fit with field data and validated
conceptualized subsurface conditions. Further, models
provided conservative estimates of mass and extent of
PCE and TCE contaminations. Kriging also provided
information on the uncertainty of the predicted chemical
distribution, which is extremely useful for regulatory dis-
cussion and system design. The efficiency of computer
models allowed investigators to perform numerous
model runs with varied boundary parameters, such as
cell size and search radii, in the kriging process.

Accurate mapping of excavated soil and the removal of
most TPH source areas provided the incentive to criti-
cally assess the feasibility of a no-action remedial sce-
nario for these substances at greater depths. With
removal of many TPH source areas, 1 -D finite difference
models (4) were used to assess the mobility of TPH in
NAPL and adsorbed residual phase. Specifically, mod-
els assessed the likelihood of largely residual-phase
TPH affecting ground water and migrating upward to
affect indoor air volumes via gaseous diffusion. Results
were extremely favorable; models predicted negligible
likelihood of TPH affecting ground water or indoor air
volumes. Combined with CIS graphic evidence of spe-
cific areas of excavated soil and the absence of TPH
sources, regulatory agencies accepted the model re-
sults, and the no-action remedial alternative for TPH
was approved.

References

1. Deutsch, C.V., and A.G. Journal. 1992. Geostatistical software
library and user's guide. New York, NY: Oxford University Press.

2. U.S. Department of the Interior Geological Survey. 1993. AIR3D:
An adaptation of the ground-water flow code MODFLOWto simu-
late three-dimensional air flow in the unsaturated zone. Books and
open file reports. Denver, CO.

3. Zheng, C. 1994. VT3D: Numerical model for VOC removal from
unsaturated soil (draft). Bethesda, MD: S.S. Papadopulos and
Associates, Inc.

4. Rosenbloom, J., P. Mock, P. Lawson, J. Brown, and H.J. Turin.
1993. Application of VLEACH to vadose zone transport of VOCs
at an Arizona Superfund site. Groundwater monitoring and reme-
diation (summer), pp. 159-169.
307
-------
Polygon Development Improvement Techniques for Hazardous Waste
Environmental Impact Analysis
David A. Padgett
Austin Peay State University, Clarksville, Tennessee
Introduction

Recently, concern has arisen regarding the effect Super-
fund sites have on surrounding communities and, spe-
cifically, the distribution of those impacts on target
populations. In designing geographic information sys-
tems (CIS) applications for analyzing potential impacts
of hazardous wastes or waste sites on adjacent neigh-
borhoods, many challenges may be encountered. CIS
database design requires addressing questions of time,
space, and scale.

The U.S. Environmental Protection Agency (EPA) and
other federal agencies have conducted studies that in-
dicate that certain sectors of the population may be
more vulnerable to exposure to toxics than others. To
date, federal departments have enlisted in several CIS-
based research projects that attempt to delineate "geo-
graphic hot spots" of toxic contamination. Such CIS
applications at hazardous waste sites have typically
used polygons to represent data from census tracts
and/or municipal boundaries. In most cases, however,
census tract and other boundaries do not necessarily
jibe with community and neighborhood boundaries; there-
fore, the polygons representing characteristic data for
target populations may not be consistent with the actual
status of those populations.

The objective of this paper is to demonstrate CIS meth-
ods for producing, to the greatest degree possible, so-
cioeconomically and culturally homogenous polygons
for impact analysis of specific sensitive populations
and/or communities. The paper presents case studies
of community/neighborhood characterization problems
encountered in developing polygons during previous
field investigations involving lead (Pb) contamination,
toxic release inventory (TRI) sites, and solid/hazardous
waste sites. The paper attempts to demonstrate effec-
tive solutions and suggestions for improving polygon
development, including CIS data manipulations and
software applications. In addition, the paper provides
geographic and groundtruthing field methods to sup-
port and enhance the accuracy of remotely obtained
information. Finally, the discussion includes commu-
nity and geographic hot spot analyses for potential
public health impacts.

Background

In 1992, EPA established the Environmental Equity
Workgroup. Its members included personnel from the
Offices of Toxic Substances and Civil Rights, as well as
Policy, Planning and Evaluation. The workgroup con-
ducted an extensive study on environmental equity is-
sues. Their report offered several recommendations for
improving federal agency efforts in protecting minority
and low-income populations and recognized a need for
more spatial and demographic data. The final report,
titled Environmental Equity: Reducing Risk for All Com-
munities (1), was released in February 1992 and con-
cluded that "there is (sic) limited data on environmental
health effects by race; there are differences by race and
income in potential and actual exposures to some pol-
lutants." In response to the above findings, the work-
group offered the following recommendations (1):

EPA should establish and maintain information
which provides an objective basis for assessment of
risks by income and race, commencing with devel-
oping a research and data collection plan.

It (EPA) should revise its risk assessment proce-
dures to ensure . . . better characterization of risk
across population, communities or geographic ar-
eas. In some cases it may be important to know
whether there are any population groups at dispro-
portionately high risk.

The Agency for Toxic Substances and Disease Registry
(ATSDR) formed a community health branch to specifi-
cally examine the potential health impact of hazardous
waste sites upon people living in surrounding communi-
ties. The new branch's personnel direct ATSDR's minor-
308
-------
ity health initiative, which focuses upon health threats to
minority populations, including those from environ-
mental contaminants.

In addition, EPA established the Office of Environmental
Equity. The office's mission includes analyzing environ-
mental impacts upon minority populations, providing
technical assistance to disadvantaged communities,
and establishing environmental initiatives at minority
academic institutions (MAIs). The office serves as a
clearinghouse of environmental data and information for
groups and individuals involved in environmental equity
activities.

In 1993, Representative John Lewis (Democrat-Georgia)
introduced the Environmental Justice Act to Congress. The
act requires EPA and the Department of Health and Hu-
man Services (DHHS) to establish the geographic units for
determining environmental high-impact areas (EHIAs),
which are the 100 geographic areas found to have the
highest volumes of toxic chemical releases.

GIS Applications

CIS could potentially help address the above data and
information needs of EPA. The Agency specifically ac-
knowledges this in other recommendations (1):

EPA could further develop its enforcement prioritiza-
tion policy to target high-risk populations. Under this
scheme, the most exposed and highly susceptible
populations in each region would be targeted for
enforcement actions. Geographic Information Sys-
tem technology could be used to identify high-risk
populations.

Several recent environmental studies have employed
computer applications and spatial data. Goldman (2) used
GIS in a major study that graphically displayed counties
having high percentages of African-Americans, hazardous
wastes, and diseases. Mohai and Bryant (3) applied a
linear regression model to show a positive correlation
between increasing proportions of minority populations
and decreasing distances from hazardous waste sites in
Detroit. Lavalle and Coyle (4) conducted an extensive
analysis of computer databases that hold hazardous
waste law enforcement information for the past 10 years.
They found inequity in enforcement and remedial actions
in white communities versus nonwhite communities.

EPA has enlisted GIS for community environmental im-
pact projects at Regions II and III. EPAs Office of Health
Research (OHR) is investigating methods for linking
demographic data with TRI information to evaluate the
relationship between levels of hazardous waste re-
leases and exposure risks in minority communities. EPA
has also developed the TRI "risk screening" process,
which employs TRI, U.S. Census data, and GIS to iden-
tify TRI releases that may pose significant risk to human
health or the environment (5).
Both EPA (6) and the North Carolina Department of Envi-
ronment, Health, and Natural Resources (7) have recently
completed CIS-based environmental investigations. The
EPA study involved GIS analyses of TRI chemical releases
in the southeastern United States. The report included
numerous CIS-produced maps that show locations where
TRI releases may be affecting densely populated areas
and sensitive ecosystems. The North Carolina study
applied GIS in searching for sources of lead-poisoning
in children. Findings indicated a positive spatial correla-
tion between high lead-contamination risk communities
and those having certain socioeconomic characteristics,
such as low income, above-average African-American
population percentages, and above-average percent-
ages of residents receiving public assistance.

The ATSDR recently implemented a study using GIS to
evaluate and analyze the demographic characteristics
of populations near National Priorities List (NPL) sites.
According to the ATSDR, "As a result of our pilot tests,
we have determined GIS to be the best methodology for
identifying potentially impacted minority populations" (8).

Limitations of GIS

While GIS may be a viable tool for investigating environ-
mental inequity, it is not an absolute solution. Issues
involving hazardous waste impact assessments tend to
be very complex without the added dimension of racism
or discrimination. Efforts to determine a causal relation-
ship between the presence of minority communities and
environmental hazards must consider the questions of
time, scale, and place. Unfortunately, in many instances,
GIS applications may be unable to adequately illustrate
these three pertinent issues resulting in skewed or alto-
gether incorrect conclusions.

With respect to scale, among the immediate concerns
when applying GIS is selecting appropriate sizes for
polygons. As indicated above, EPA is in the process of
determining the scale for EHIAs. A polygon may be a
county or a census tract. Figure 1 illustrates problems
EH3
B
a 0.
m m " "• H
o Q
90 Percent Nonminority
Figure 1. Problems of scale in GIS polygon design.
309
-------
of scale associated with polygon size selection. At the
county-size scale, a case of environmental inequity ap-
parently exists with the presence of a Superfund site in
the sample county because 80 percent of the county's
population belongs to a minority ethnic group. A closer
look, however, reveals that the population residing in the
immediate vicinity of the waste site is predominantly
nonminority.

Useful as CIS may be, its output in some cases may
display static conditions without considering human
movements over time. Figure 2 displays a common
situation associated with the "filtering" phenomenon, in
which a nonminority population moves out of an area
while increasing numbers of an ethnic minority group
moves into it. The figure shows that in 1950, a nonmi-
nority community surrounded a TRI site (i.e., an active
industrial site releasing toxic substances). By 1990,
the demographics of the neighborhood had changed
along with the status of the TRI site, which is now an
abandoned Superfund site. A CIS database probably
would contain only information on the community from
1990. Such an instance could suggest that some form
of environmental injustice exists given the presence of
the Superfund site within the minority community. Ac-
counting for the dynamics of time and human move-
ment, however, would show that the waste site preceded
the minority population and that, in actuality, the minority
community moved toward the site. This conflicts with the
prior notion that unsavory forces placed the site in the
minority community.

Figure 3, a schematic of polygons used in an investiga-
tion into sources of lead-poisoning in children, also dis-
plays the limitations of CIS with respect to time and
human dynamics, but at a lesser time interval. The study
area is divided into low- and high-risk areas for lead
contamination. The locations at which children with lead
poisoning were found, however, do not correspond with
the areal risk factors. In this case, the CIS is limited in
its ability to follow human movements on a daily basis.
For instance, a parent with a child who exhibits un-
healthy blood-lead concentrations may report the child's
home address as someplace within the low-risk poly-
gon. The child may attend school in the high-risk area,
however. The children's points of contact with lead may
not necessarily correspond with their home addresses,
resulting in an inaccurate graphic display.

With respect to place, CIS may be limited in its ability to
determine the specific borders of a socioeconomically
and demographically homogenous human population.
Tosta (9) and Coombes et al. (10) discuss the dilemmas
associated with neighborhood boundary delineation in
CIS applications. Figure 4 displays a schematic of a
census tract. The CIS database may list the tract's per
Figure 2. Example of changing community demographics with
time near a hazardous waste facility.
= Homes of Lead-Poisoned Children
Figure 3. Lack of correspondence between locations of lead-
poisoned children and high-contamination risk areas
due to daily dynamics of human movements.
Middle
Income
Community
Census Tract Schematic

Figure 4. Example of significant neighborhood-type variation
within a single polygon, possibly resulting in skewed
socioeconomic data.
310
-------
capita income as relatively low and may list the tract as
a low-income neighborhood. Further investigation may
find, however, that two very different socioeconomic
communities exist within the tract, one middle-class
and the other a public housing facility. Frequently,
middle-income, African-American communities have
low-income housing projects built adjacent to them. With
respect to the polygon in Figure 4, if an investigator
wanted to research health impacts of toxic wastes for
low-income households, using this polygon and others
like it would inaccurately depict communities within
them.

An additional problem in community definition is deter-
mining exactly what defines a minority community. The
most common indicator for discerning a minority com-
munity would be the existence of a clear majority of
some minority group as in Polygon A of Figure 5, or
where the minority group makes up more than 50 per-
cent of the population as in Polygon B. In some in-
stances, however, communities have received minority
status without the presence of the conditions in Poly-
gons A and B.

Previous investigations show a number of measures
used to identify minority communities and census tracts.
Greenpeace conducted a 1990 environmental justice
study that determined a community's status as minority
based upon the relationship between a community's
percentage of minority population and the selected mi-
nority group's national percentage (11).

Polygon C in Figure 5 depicts Greenpeace's minority
community definition. Taking the target ethnic group in
Polygon C as African-Americans and the hypothetical
extent of Polygon C as the United States (African-Ameri-
cans make up approximately 12 percent of the total U.S.
population), one may determine that Subpolygon C is a
minority polygon or community because its minority per-
centage is over twice that of the national percentage or
extent of the large population in Polygon C. The condi-
tion that Polygon C illustrates is also evident in a study
Subpolygon C

30 Percent Minority
Polygon A

80 Percent Minority
Polygon B

51 Percent Minority
Polygon C

12 Percent Minority
Figure 5. Problems in defining minority polygons.
by Mohai and Bryant (3). The authors claim that envi-
ronmental inequity exists in Detroit where they found
that on average, within a 1-mile radius of the city's
hazardous waste facilities, 48 percent of the population
is nonwhite.

Solutions With GIS and Supporting
Technological Methods

From the above discussion, GIS may appear very lim-
ited for use in environmental community impact investi-
gations, but GIS can actually be an extremely effective
tool if employed with appropriate supporting technology.

Preliminary Groundtruthing

To design GIS databases that reflect the true nature of
target groups and human dynamics, preliminary
groundtruthing may be necessary. In some cases, inves-
tigators make gross interpretations of suspected envi-
ronmental inequity without actually visiting the study
area. Without groundtruthing prior to final database de-
velopment, questions of time, scale, and human dynam-
ics may be left unanswered. The integrity of databases
produced this way and the antecedent conclusions
based upon them may fall into question. Thus, because
the nature of environmental and human health impact stud-
ies is complex, some on-the-ground work should precede
or at least accompany database construction efforts.

Cause-Effect Analyses

Epidemiological studies are increasingly employing
GIS. This use is important with respect to environmental
investigations because in many cases, proof of a corre-
lation between a waste site and community health prob-
lems may be necessary. Croner et al. (12) describe
statistics-supported GIS applications for linking "cancer
hot spots" with pollution sources. Without conclusive
evidence that waste sites and other environmental
hazards negatively affect health in socioeconomically
disadvantaged populations, claims of environmental in-
justice may be difficult if not impossible to prove.

In historical analyses of waste facility sitings, GIS may
be useful, along with the support of gravity models, in
investigating whether the sitings followed the prescribed
logic for such siting decisions. Noble (13) wrote that the
costliest aspect of waste facilities management is trans-
portation; therefore, siting decisions should favor loca-
tions in closest proximity to a selected community's
centroid of waste production. Using a GIS gravity model
with data for household wastes produced within a given
locality, the center of gravity of the volumes of wastes
produced could be located. Historical analyses of past
siting decisions may find that past siting decisions defied
logic. Instead of finding waste sites placed in environ-
mentally safe locations as close as possible to areas of
greatest refuse generation, analyses may find instead
that sites have been placed farther away in disadvantaged
311
-------
communities. Both the taxpaying and potentially af-
fected residents would pay for such unsavory siting
practices.

Conclusion

The potential for technological applications, including
CIS, in this arena is great. Increased involvement by
technologically trained environmental professionals is
imminent. Their future involvement, however, must fo-
cus on the scientific soundness of investigative meth-
ods, data integrity, and the equitable participation of
potentially affected citizens in any subsequent decision-
making processes.

References
1. U.S. EPA. 1992. Environmental equity: Reducing the risk for all
communities. EPA/230/DR-92/002. Washington, DC.
2. Goldman, B. 1991. The truth about where you live. New York,
NY: Times Books.
3. Mohai, P., and B. Bryant. 1992. Environmental racism: Reviewing
the evidence. In: Bryant, B., and P. Mohai, eds. Race and the
incidence of environmental hazards: A time for discourse. Boul-
der, CO: Westview Press.
4. Lavalle, M., and M. Coyle. 1992. Unequal protection: The racial
divide in environmental law. The Environmental Professional
153:S1-S12.
5. U.S. EPA. 1990. Toxics in the community: National and local
perspectives. EPA/560/4-90/017. Washington, DC.

6. Stockwell, J.R., J.W Sorensen, J.W. Eckert, Jr., and E.M. Car-
reras. 1993. The U.S. EPA geographic information system for
mapping environmental releases of toxic chemical release inven-
tory TRI chemicals. Risk Analysis 132:155-164.

7. North Carolina Department of Environment, Health, and Natural
Resources. 1993. Lead database evaluation and CIS modeling
project. PR-242592. Raleigh, NC: National Institute for Environ-
mental Health Sciences.

8. Harris, C.H., and R.C. Williams. 1992. Research directions: The
Public Health Service looks at hazards to minorities. EPA J.
181:40-41.

9. Tosta, N. 1993. Sensing space. Geo. Info. Sys. 39:26-32.

10. Coombes, M., S. Openshaw, C. Wong, and S. Raybould. 1993.
Community boundary definition: A GIS design specification. Re-
gional Studies 3:280-86.

11. Bullard, R.D. 1993. Environmental equity: Examining the evidence
of environmental racism. Land Use Forum (Winter), pp. 6-11.

12. Croner, C.M., L.W Pickle, D.R. Wolf, and A.A. White. 1992. A
GIS approach to hypothesis generation in epidemiology. In:
American Society for Photogrammetry and Remote Sens-
ing/American Congress on Surveying and Mapping/RT 92, Wash-
ington, DC (August). Technical papers, Vol. 3: GIS and
cartography, pp. 275-283.

13. Noble, G. 1992. Siting landfills and LULUs. Lancaster, PA: Tech-
nomic Publishing Company, Inc.
312
-------
Comparing Experiences in the British and U.S Virgin Islands in Implementing GIS
for Environmental Problem-Solving

Louis Potter
Office of the Chief Minister of the Government, British Virgin Islands

Bruce Potter
Island Resources Foundation, St. Thomas, U.S. Virgin Islands
The British and U.S. Virgin Islands:
Comparisons and Contrasts
British Virgin Islands

Three miles to the north and east of the U.S. Virgin
Islands (USVI) lie the British Virgin Islands (BVI), a
group of 36 islands and cays with a total area of 60
square miles. The four largest are Tortola (24 square
miles), Anegada (14 square miles), Virgin Gorda (8.5
square miles), and Jost Van Dyke (4.5 square miles).

Geologically, the BVI belong to the Greater Antilles, and
like the USVI and Puerto Rico, rise from the Virgin
Banks (or Puerto Rican Shelf). Rocks of the BVI, except
Anegada, consist of thick, steeply inclined, metamor-
phosed volcanic and sedimentary stratified series of
Cretaceous age, with dioritic and pegamitic intrusions.
Anegada is a 30-foot-high emergent coral limestone
platform, apparently from the Pleistocene age.

The BVI are a crown colony of the United Kingdom (UK)
with a total population in 1991 of 16,108. Most of the
population resides on Tortola (13,225 inhabitants). Be-
tween 1980 and 1991, population increased 46.6 per-
cent. The BVI have internal self-government with an
elected council headed by a chief minister. The UK
appoints a governor to represent the queen and to man-
age defense, internal security, external affairs, civil serv-
ice, and court administration.

The BVI economy is based mainly on tourism and serv-
icing international business. Sailing and diving are piv-
otal features in BVI tourism. The average number of
tourists per capita in the BVI is 221, compared with 119
for the USVI (1). The BVI 1990 per capita income was
recorded at $10,125. Prior to the 1970s, the BVI economy
was based on subsistence agriculture and remittances
from British Virgin Islanders who worked in the USVI.
U.S. Virgin Islands

The USVI are an unincorporated territory of the United
States, purchased from Denmark in 1917. The total popu-
lation in 1990 was about 101,000, divided among St.
Thomas (48,000), St. Croix (50,000), and St. John (3,500).
The tourism-dominated economy of the Virgin Islands gen-
erated a per capita Gross Territorial Product in 1990 of
approximately $11,000—the highest in the Caribbean (2).

Geographically, geologically, and topographically, St.
Thomas (28 square miles) and St. John (20 square
miles) are similar; they are both largely volcanic, have
deeply indented coastlines, and lie on the Puerto Rican
Shelf. St. Thomas and St. John are close to the BVI. St.
Croix is a relatively large (84 square miles) and mostly
limestone island that lies on its own submarine ridge,
which rises more than 4,000 feet from the bottom of the
Caribbean Sea. St. Thomas and St. John are about
5 miles apart, and St. Croix is 48 miles south of them.

During the height of tourism development (from the
late 1950s through the mid-1970s), the USVI experi-
enced average annual compound population growth
rates of over 6 percent, as well as a doubling in real
incomes. This unprecedented paroxysm of growth is
still being assimilated by a population that differs
greatly from the 30,000 people who lived in a predomi-
nantly agricultural USVI in 1950. In 1990, the USVI
received a daily average of 37 visitors per square
kilometer. This compares with a visitor load of 23
visitors per day per square kilometer in the BVI, which
is also a high-density tourist destination (1).

Background to GIS Implementation
Activities

British Virgin Islands

The idea of geographic information systems (GIS) ap-
plications in the BVI first arose with a presentation about
313
-------
a proposed project for St. Lucia, made by Dr. Jan Ver-
meiren of the Organization of American States at the
Caribbean Conference of Planners in Kingston, Ja-
maica, in 1984 (3). The Town and Country Planning
Department recognized that it could use GISs analytical
and display properties to make presentations to the chief
minister and the BVI Executive Council. This proposal
fell on fertile ground, given a relatively long-held tradition
of support for the Town and Country Planning Depart-
ment by the United Nations Development Programme
(UNDP) and the British Development Division, dating
back to the early 1970s.

Subsequent to this inspiration, the Town and Country
Planning Department requested budget authority to de-
velop a land use database. This database would include
buildings, property boundaries, and constructed and
natural features of importance. The Finance Department
hoped this project would help combat growing competi-
tion to the postal services by independent package de-
livery services. They renamed the project the National
Addressing System, and the legislature provided
$200,000 to provide a physical address for each prop-
erty in the territory.

The Town and Country Planning Department conducted
a pilot project, focusing on Road Town, the capital. The
pilot project demonstrated that the hard copy land own-
ership or cadastral maps that the Survey Department
was then using were inadequate for accurately account-
ing for properties, even in the BVI's most developed
urban areas. Therefore, the Town and Country Planning
Department expanded the scope of the National Ad-
dressing System project to identify options for increas-
ing the accuracy of property ownership records,
including maps.

Because the Town and Country Planning Department
had little experience with computerized land information
systems, departmental managers sought support from
the UNDP office in Barbados. The UNDP had previously
assisted with the department's development control ap-
plications database. This mode of operation, in which
the BVI purchase services provided through the UNDP,
which acts as a "vetting agent" for consultants and other
technical assistance, continues to this day. An expert
from the United Nations Community and Housing Serv-
ices (UNCHS) Nairobi office provided the first such con-
sultation by exploring how the BVI might implement a
CIS. Over the next few years, three other experts
provided input, which the Town and Country Planning
Department gradually integrated into a picture of how to
use a CIS within the technical and financial limitations
of a small island government.

In the meantime, external conditions were improving the
chances for the success of the BVI program. British
Virgin Islanders were receiving formal and informal train-
ing in computer applications in general, and specifically
in AutoCAD drafting systems, which increasing numbers
of local architects and engineers are using. In addition,
the power of rugged microcomputer systems that could
withstand the harsh operating conditions of the Virgin
Islands was also improving, and local dealers were
increasing their skills in support of such systems.

U.S. Virgin Islands

In 1988, a proposal to develop a locally supported CIS
was being discussed in detail in the USVI (4). This
project resulted in a formal application from the govern-
ment of the USVI for financial assistance from technical
assistance funds provided by the Office of Territorial
and Insular Affairs of the U.S. Department of the Inte-
rior. The grant was awarded in March 1991. The pro-
posed project combined existing information from
several USVI government agencies to produce CIS
overlays, as shown in Table 1 (5).

In addition, according to the grant application, (which
was written by the Virgin Islands government and may
not have represented U.S. Geological Survey's
[USGS's] intentions) the National Mapping Division
(NMD) of the USGS agreed to digitize the eight USGS
1:24,000-scale quad sheets ("quadrangles") that cover
the USVI, including the following categories:

• Roads and trails
- Power transmission lines

• Hydrography
- Stream networks
- Shorelines
- Wetlands
- Mangroves
- Reefs
Table 1. GIS Overlays

Agency
GIS Overlays
DPNR
WAPA
VITEMA and emergency services
DPW
Zoning
Flood plain
Subdivision

Water distribution
Aquifer profiles
Electrical distribution

Critical routes
Critical facilities

Sewer line network
Transportation
Flood plain
DPNR = Department of Planning and Natural Resources
WAPA = [Virgin Islands] Water and Power Authority
VITEMA = Virgin Islands Emergency Management Agency
DPW = Department of Public Works
314
-------
• Topographic contours

• Political boundaries

(These categories were subsequently adjusted based
on discussion between the USGS, WAPA, and govern-
ment agencies. Documentation for these new cover-
ages is not available, but the general idea holds,
although no VITEMAdata were used, and the USGS did
not digitize the WAPA power system.)

The underlying notion was that the whole would be
bigger than the parts—each organization would bring its
own corporate data and maps so all could share in the
digitized product.

Five objectives were identified for the project:

• Provide the USGS in St. Thomas with a complete
microcomputer (CIS) workstation.
• Develop a digitized
graphic maps.
database from USGS topo-
• Contract for the digitizing.

• Acquire digital data to load into CIS.

• Enter data not in digital format.

The proposal required a 50-percent local cost contribu-
tion to the project, including a matching suite of hard-
ware for data maintenance, backup, and analysis. The
application stated that itemized costs of $305,000 "will
be provided by the Virgin Islands Water and Power
Authority (WAPA) and the Office of Territorial and Inter-
national Affairs."

The project application is ambiguous about the nature
of the hardware and operating systems that the CIS
requires. A list of hardware for USGS use refers to a
"microcomputer workstation." Later references to "ESRI
ARC/INFO workstation software," however, indicate to
sophisticated users that these applications require UNIX
workstations and UNIX language operating systems. In
general, UNIX is not used or supported in the Virgin
Islands. Some Virgin Island officials associated with the
CIS project feel they were not fully informed about the
CIS operating environment and the long-term support
costs to which they were committing.

A frustration for USGS in the project stems from the
somewhat limited role the agency has had in providing
high-quality data conversion services (digitizing). One
USGS staff member explained that unfortunately the
agency's mandate only extends to providing data; the
agency "can't get involved in applications."

The initial project proposal also was unclear about owner-
ship of the digitized data. The proposal spoke in general
terms: "All available digital data and attributes will be com-
plete, accurate, and up-to-date at the end of the project,
and will be available for use and transfer to the Depart-
ment of Planning and Natural Resources (DPNR) (5)."

Given the conditions and environmental constraints dis-
cussed, a number of specific differences developed be-
tween the CIS implementation processes of the BVI and
the USVI.
Initial System Planning Activities
British Virgin Islands

Although the BVI had no formal systems plan, consult-
ants worked with the Town and Country Planning De-
partment on four different occasions to provide insight
into some aspect of CIS applications. Sometimes, the
benefits from the consultations were neither the type
nor the quality the department originally expected. In
general, however, each provided some additional per-
spective on the possible benefits and perils of imple-
menting a CIS.

U.S. Virgin Islands

The USVI apparently made few initial system planning
efforts, although the U.S. Department of the Interior and
USGS have wide experience in the territory. (An in-
formed source claims a grant was made, possible by
EPA, for a preceding $50,000 CIS project, but no men-
tion of this has been found in the materials available for
this article, either as a proposal, or in terms of specific
products.) Possibly, this very familiarity led to a series of
unexamined assumptions and diminished communica-
tions about the exact terms of the assistance and serv-
ices that the USGS exchanged with several agencies of
the Virgin Island government.

One indicator of the lack of system planning activities in
the Virgin Islands is a proposal that the Virgin Islands
Emergency Management Agency (VITEMA) circulated
fora "Geographical Information Systems: Technical Op-
erators Meeting." This proposal, from an agency that
has always been one of the most important participants
in CIS activities, called for a "technical working group"
to examine existing database management systems in
the territory to develop a planning strategy for imple-
menting CIS.1 This proposal was dated September
28, 1992, 10 days before the USGS announced a
demonstration of the completed products of Phase I of
the "comprehensive geographic information system be-
ing developed for the USVI (6)."
1 Ward, R.G. 1992. Geographical information systems: Technical op-
erators meeting. Memorandum to Cyrille Singleton. VITEMA, St.
Thomas, U.S. Virgin Islands (September).
315
-------
Software Selected and Rationale

British Virgin Islands

In part because of the extended timeframe for the BVI
planning and initiation of the CIS system, the Town and
Country Planning Department never committed to a
specific system configuration until the last stages of the
planning process.

This process of "creative procrastination" had three syn-
ergistic results:

• PC power increased (and prices decreased) to the
point where reasonably priced systems could perform
many of the compute- and data-intensive operations
demanded for graphics software mapping.

• ARC/INFO released the ARC/CAD version of its CIS
software, which worked on PCs within the well-known
AutoCAD drafting software. Architect and engineering
offices in the BVI already used computer-aided design
(CAD) software, so upgrading to include CIS func-
tionality was relatively easy.

• The fourth CIS consultant to work in the BVI was
experienced in implementing systems in the Carib-
bean and had special knowledge and access to early
versions of both ARC/CAD and Version 1.0 of
ARC/VIEW. These two tools, based on a last minute
proposal by International Development Advisory
Services (IDAS) of Miami, Florida, a private CIS sup-
port contractor, became the basis for the BVI CIS.

U.S. Virgin Islands

Workstation ARC/INFO 6.1 was selected as the basic
software for the USVI CIS project because it is the
USGS standard software. In the environment surround-
ing the USVI project, this seemed to be a sufficient
explanation, although there may have been other rea-
sons. Because of this decision, however, WAPAand the
DPNR lack any means of updating map or attribute data
files. Outside providers, such as the USGS's NMD, must
perform that service. The USGS has noted that the only
reason WAPA and DPNR lack these capabilities is be-
cause they (WAPA and DPNR) failed to provide the
matching suite of hardware and software specified in the
grant application.

The digitized water supply system offers an example of
the extra costs that such a condition creates. The USGS
built an ARC/INFO coverage by converting mapped data
from AutoCAD source files, which they then linked to
detailed attribute information about each element (e.g.,
pipe, valve, elbow) in the system. The USGS then used
ARC/INFO network software, purchased with project
funds, to build a network model that analyzes and dis-
plays the operation of the entire water distribution sys-
tem—but no software in the Virgin Islands can run the
network model.

Hardware Platforms

British Virgin Islands

The BVI CIS was originally installed on a Compaq 486-
50, with a 21-inch screen. The Town and Country Plan-
ning Department soon learned, however, that data input
would be more efficient if two or three smaller machines
split the work, with the Compaq available for analysis
and data quality checking. The department upgraded its
existing office computers to handle the data entry. Users
already feel the need for networked applications to
share data more quickly. Plans for CIS expansion to
other offices, such as the Electricity Corporation, in-
crease the pressure for an extended local area network.

U.S. Virgin Islands

The system that the USGS used to build the USVI CIS
database was a Data General UNIX workstation with
one large digitizing tablet and one pen plotter. No match-
ing or comparable hardware are installed anywhere in
the USVI, as the original project proposal had foreseen.
Observers tend to agree, however, that the failure to
provide a specific hardware configuration is less signifi-
cant than the lack of committed, senior, full-time techni-
cal staff. This staff is required to operate the level of CIS
facility that the USGS envisioned.

Base Map Priorities and Layers
Constructed

British Virgin Islands

Building a map database is proving to be a long process
for the Town and Country Planning Department. This is
complicated by the failure of a key digitizing contractor
in Texas to provide property lines in a format conducive
to constructing accurate property polygons. Operators
in the Town and Country Planning Department have
increased their data entry efficiencies, however, and
most properties on the most densely inhabited islands
have now been digitized.

Producing demonstration data displays accounts for a
significant part of the cost of developing databases for
the early phases of the BVI CIS implementation. These
demonstrations aim to illustrate possible new applica-
tion areas for other agencies and departments of the BVI
that are interested in cooperating and sharing costs of
additional system development. For example, the Elec-
tricity Corporation and the National Disaster Prepared-
ness Agency need to map emergency services.

Converting the data (i.e., digitizing) in house in the BVI
has produced costs and benefits. The costs revolve
316
-------
around the steep learning curve for data entry proce-
dures and the constant distractions of responding
quickly to "outsiders" who may be important long-term
supporters of the CIS. The benefits include increasing
staff skills and the ability to build constituencies for the
program by promptly responding to real needs.

Coverage priorities for the BVI CIS include a national
addressing system, completion of the territorial land
use mapping, and accurate cadastral mapping (which
has major environmental planning and management
implications).

U.S. Virgin Islands

The USGS produced 44 coverages for the Virgin Islands
from a variety of sources. Table 2 shows the major
coverages and scales, by island.

Table 2. Major Coverages and Scales for the USVI
St. Croix St. John St. Thomas
STC water
distribution
1 :2,400
STC roads
1 :2,400
STC building
footprints
1 :2,400
STC shorelines
1 :2,400
STC DIG
boundaries
1 :24,000
STC DIG
roads
1 :24,000
STC DIG
hydrography
1 :24,000
STC DIG
hypsography
1 :24,000
STJ water
distribution
1 :2,400
STJ roads
1 :2,400
STJ building
footprints
1 :2,400
STJ shorelines
1 :2,400
STJ DIG boundaries
1 :24,000

STC DIG roads
1 :24,000

STJ DIG
hydrography
1 :24,000
STJ DIG
hypsography
1 :24,000
STT water distribution
1:2,400

STT roads
1:2,400
STT building
footprints
1:2,400
STT shorelines
1:2,400
STT DIG boundaries
1 :24,000

STT DIG roads
1 :24,000

STT DIG
hydrography
1 :24,000
STT DIG
hypsography
1 :24,000
In addition, the following National Park Service data
were converted and added to the data set but were not
produced by the USGS:

• STJ NPS boundaries, 1:24,000

• STJ NPS roads, 1:24,000

• STJ NPS hydrography, 1:24,000

• STJ NPS hypsography, 1:24,000

• STJ NPS benthic communities, 1:24,000

• STJ NPS historical sites, 1:24,000

• STJ NPS vegetative cover, 1:24,000
Mapping for St. Thomas, St. John, and St. Croix identi-
fied a total of 10 coverages for each island. They are
based on information from the USGS ("quad sheets"
specifically demarking political boundaries, shorelines
and streams, topography, and roads) and higher preci-
sion WAPA mapping, which derives from 1986 aerial
photogrammetry, including left and right road bounda-
ries, building footprints, shorelines, and water supply
system data. The WAPA data are at 1:2,400 scale, an
order of magnitude more precise than the USGS base
map. St. John mapping consists of 10 added layers
based on data that the Virgin Islands National Park
(VINP) provided.

The original USVI project proposal referred to a two-phase
process of database development, shown below (5):
Phase

I. Base system
development
Agency extensions
(i.e., by USVI
agencies)
Tasks

Water distribution network
Power distribution network
Flood plain maps

Land use maps
Transportation networks
Emergency facility networks
Tax parcel/land value
The USGS announced that Phase I was completed in
October 1992 (6). Supposedly, the contents of these two
phases were subsequently adjusted to reflect a different
range of coverages, but the notion of a "Phase II" in
which local agencies would assume more operating
responsibilities was retained.

Ownership of, access to, and terms that govern the use
of this digital data are confused. The USGS says it is
unable to provide an authoritative catalog of the cover-
ages because "one has not been produced." WAPAsays
it has several diskettes of data in the safe but no equip-
ment to manipulate them. The VINP has learned that it
can use its own data as well as WAPA data converted
and attributed by the USGS, but the park does not
possess or use USGS digital line graph (DIG) data. To
personnel in USVI agencies, USGS statements have
clouded the question of access to the CIS information.
For example, one such statement announced that the
USGS Water Management Division cannot make the
digitized data available to Virgin Island government
agencies.

The DPNR apparently has no means of making direct
use of the digital data. First, DPNR has no hardware or
software that can use the data. Secondly, it has no
operators who can build the CIS systems to actually
apply the data to decision-making needs. The depart-
ment is said to be preparing a new CIS proposal for
training, hardware, and software for a new CIS system.
According to unconfirmed rumors, this system will be
based on MapGrafix, a Macintosh mapping system.
317
-------
In operational terms, the data seem to belong to WAPA,
which contributed major financial support and map re-
sources to the project. WAPA has been helpful in pro-
viding copies of the digital data to other groups and
agencies.

Environmental Problem-Solving in Local
Decision-Making

British Virgin Islands

In the BVI, the first priority of the CIS facilities is to
extend the National Addressing System and to improve
the property ownership system. This will both improve
postal services, as originally proposed, and provide bet-
ter information for important revenue and financial
analyses. Land use mapping and environmental impact
assessments are important second priorities for CIS
applications. Other features already developed for in-
terim studies and analyses include mapping of signifi-
cant coastal and wildlife features and environmentally
sensitive areas from the Anegada Development Plan
and mapping of important submarine habitats adjacent
to Virgin Gorda.

The BVI have concentrated on developing CIS applica-
tions to address strategically important issues in the
territory. Marine and coastal resources are vitally impor-
tant to the BVI economy. They embody historical and
cultural values, as well as maintain a high-quality envi-
ronment to support charter yacht-based tourism, which
is integral to the BVI economy. The Conservation and
Fisheries Department is working with the Town and
Country Planning Department to convert the country's
coastal atlas to digital form (see section on Principal
Users), as has been done on a demonstration basis for
the Anegada and Virgin Gorda mapping.

U.S. Virgin Islands

In the USVI, environmental decision-making generally
follows an adversarial, rather than a problem-solving
format. A combination of historical and cultural factors
have created the general assumption that the develop-
ment process creates winners and losers. In this envi-
ronment, information becomes an important tactical
weapon, making it difficult to gather support for activities
or programs that aim to make information more widely
accessible. Technology is more acceptable, and more
likely to receive leadership support, if it is justified on
technical, less "political" terms.

The USGS team made a presentation on the Virgin
Islands CIS in October 1992, after just completing the
digital coverages for Phase I of the CIS project. The
presentation emphasized that the CIS is intended to
provide decision-makers with easily accessible spatial
information (6).

Originally, the CIS was expected to benefit primarily the
territory's three coastal zone commissions in their as-
sessment of environmental effects of major develop-
ment proposals. The ground-water protection program
of the Division of Environmental Protection of the DPNR
is using CIS analyses produced by the Water Resources
Division of the USGS, employing the USVI CIS cover-
ages with added data (e.g., wells) that the Water Re-
sources Division is digitizing.

GIS Support Factors

According to the GIS support contractor for the BVI, a
successful GIS requires three key support elements:

• GIS policy leadership

• GIS technical leadership

• Competent outside expert assistance

The following summarizes the comparative experience
of the two programs for these three key implementation
support factors:
Support
Factors

GIS
policy
BVI

Town and
Country Planning
Department
director led
project from chief
minister's office

GIS BVI technician
technician trained in the
United States
USVI

No GIS manager
in government
Outside
support
UNDP and IDAS
No GIS specialist
in government
(draftsman at
WAPA)

USGS technical
support and U.S.
Department of
the Interior
financial support
Principal Users: Planned and Actual

British Virgin Islands

The BVI Department of Finance is the first user of data
products from the GIS, based on the initial funding for
the addressing system. This system is based on detailed
parcel maps of the BVI so that the effective base map
resolution of the BVI system is 1:2,500. This is consid-
erably finer than the 1:24,000 scale of the USVI maps.
318
-------
The Town and Country Planning Department, however,
is working to recruit other users to the system, including:

• Public Works.

• Water and Sewerage Department for a systemwide
map (which may eventually spin off as a separate
system, given this department's long-term interest in
engineering-quality facilities management information).

• Conservation and Fisheries.

• British Overseas Development Administration, which
funded a coastal atlas for the BVI (7). At the sugges-
tion of the Town and Country Planning Department,
this mapping was developed in ARC/INFO. A pro-
posal has been made to convert the coastal atlas to
digital form for natural resource management appli-
cations, with a demonstration already developed
showing the distribution of sensitive marine commu-
nities around Virgin Gorda.

In addition to these uses, the CIS group is starting to
experiment with the use of remote sensing products in
CIS production, which would encourage the use of CIS
for natural resource change detection.

The Town and Country Planning Department's CIS spe-
cialist, Mikey Farara, is being reassigned to provide
networking support (including CIS distribution over the
network) for several government agencies. Meanwhile,
the CIS operation is adding a cartographer to assist in
tailoring CIS products to users' needs.

As enthusiasm for the CIS has blossomed in the BVI,
managing for realistic expectations and stressing the
investment costs that participating agencies can expect
have become problems for the Town and Country Plan-
ning Department.

U.S. Virgin Islands

Complex evaluation issues face the USVI's three
coastal zone commissions (one on each island).
Therefore, land use planning in general and coastal
zone permitting specifically were assumed to be im-
portant first users of the CIS. The DNPR, however,
had no process to prepare the Division of Comprehen-
sive and Coastal Zone Planning to implement this
system. In addition, the scale of permitting decisions
may be too fine forthe CIS base map. (See discussion
of scale below.)

WAPA is not using the CIS data. One senior manager
characterized their experience with the CIS project as
"paying a lot of money for a diskette of data that we keep
locked in the safe."

The VINP (part of the U.S. National Park Service) and
Biosphere Reserve have purchased a PC-based CIS
system and employed an analyst to implement it for the
park and adjacent areas on St. John and the surround-
ing seas. With this system, they plan to enter the USGS-
developed data into the database. In addition, the Virgin
Islands Resource Management Cooperative (a collabo-
ration of research and resource management organiza-
tions) makes the VINP CIS data and analytical capabilities
available to members, including government members.

At this time, the only major Virgin Islands government
user of the CIS is the ground-water protection group of
the DPNR's Division of Environmental Protection. Be-
cause they lack equipment or software to manipulate the
CIS data already available, they use the Water Re-
sources Division of USGS as a CIS contractor. This
arrangement has two problems:

• High costs: Although the USGS "owns" the existing
digital data, and processing private contracts would
be complex, DPNR believes it could get similar serv-
ices at cheaper prices from other vendors.

• Inappropriate scale: Environmental management
processes in the Virgin Islands (and in most other
small island states) require knowledge of property
ownership, implying maximum map scales of
1:5,000 to 1:10,000. The USGS quad sheet scale
of 1:24,000 is too coarse for many management
purposes. Costs of remapping areas of concern at
the higher resolution are high, and the problems of
maintaining multiple map resolutions and sources
are not trivial.

What GIS Can Do

Joseph Berry has proposed seven basic categories of
"What GIS Can Do for You" (8). These applications can
be related to the GIS products and proposals forthe BVI
and USVI, with special attention given to natural re-
source and environmental issues. Table 3 shows what
coverages that have been or are being developed for
the two systems can do.

Table 3 illustrates two contrasting issues separating the
two jurisdictions. The USVI have the data available to
perform a number of relatively complex analytical proc-
esses, especially in St. John. They have no capability to
actually execute any such studies, however. The BVI,
on the other hand, have proposed and often developed
pilot or demonstration applications for several GIS uses
but still need to develop the data resources to support
these on a territorywide basis.

Lessons Learned

The comparative experience of these two very distinct
GIS programs reinforces three basic lessons of informa-
tion system design and implementation:

• Plan, don't assume: The prolonged, sometimes repe-
titious, planning process that evolved in the BVI
319
-------
Table 3. Coverage Capabilities
USVI
BVI
Questions:
Can you map it?
Where is what?

Where has it changed?

What relationships exist?
Where is it best?
What affects what?
What if...?
Analytical
Function
Mapping
Natural resource
management

Temporal

Spatial
Suitability
System
Simulation
Application
USGS and WAPA-based
coverages
DIG hydrography
DIG hypsography
Well inventory
STJ national park
coverages
DIG and WAPA
Boundaries and roads
1982 to 1989

STJ national park
coverages (limited
application)
STJ national park
coverages (limited
application)
None discussed
Status Application
Done Land use cadastral
Done Coastal atlas
Sensitive areas
Done Significant features
Population data
Done Land use
Done Sensitive areas

Done Land use updates population

Land use
Population data
Coastal atlas
Done Land use
Coastal atlas
Sensitive areas
Significant features
Done Land use
Coastal atlas
Sensitive areas
Significant features
Population data
Speculation, but no plans to
implement yet
Status
Done and
proposed
Proposed
and partial
Major
islands done

Proposed
Proposed
Partial
Proposed
Partial
Partial
Partial and
proposed

involved multiple consultants providing often conflict-
ing advice. This process served to educate policy-mak-
ers and managers in a much broader range of
possibilities and avoidable problems than were avail-
able to the USVI. A corollary to the need for careful
planning is the need to avoid making decisions or
commitments to specific systems before such deci-
sions are absolutely necessary. Especially in systems
involving high technology, premature decisions often
mean early obsolescence.

• Implement in phases with early demonstration
products: Some issues, such as cadastral mapping
and scale, are so subtle to inexperienced users that
they need practice in real-life situations. If the USGS
had spotted the scale problems at an early stage in
the data conversion process, the USGS may have
been able to provide a better solution. Some USVI
critics claim the "1:24,000—one size fits all" attitude
characterizes the federal approach.

• Identify critical success factors for each situation: In
some environments (e.g., USVI), CIS is most attrac-
tive for its ability to provide enhanced powers of
analysis. In others, such as the BVI, it is seen as a
data integration tool and as a way to better inform
political leadership and the public. To ensure suc-
cess, major CIS implementations also need to meet
the three major support requirements:
- A political/senior management "chief
- A technical "chief
- Competent outside technical assistance

Finally, implementers should recognize that they have a
stake in open information sharing. They should seek
ways to redefine the decision-making process as a
nonzero sum game: more information should benefit all
parties. Of course, such changed attitudes require fun-
damental value shifts that take a long time to achieve
and may have high short-term costs.

References

1. McElroy, J.L. 1991. The stages of tourist development in small
Caribbean and Pacific islands. In: Proceedings of the International
Symposium on the Island Economies: Policy Models for Planning
Development, Lesbos, Greece (November).
2. Bureau of Economic Research, Virgin Island Department of
Commerce. 1994. Economic indicators. St. Thomas, U.S. Virgin
Islands (July).
320
-------
3. Potter, L. 1984. Program notes from the Caribbean Conference of
Planners, Kingston, Jamaica.

4. Potter, B., K. Green, and M. Goodwin. 1988. Management of natu-
ral resource information for the Virgin Islands National Park and
Biosphere Reserve: Special biosphere reserve report. St. Thomas,
U.S. Virgin Islands: Island Resources Foundation.

5. Government of the U.S. Virgin Islands. 1991. Application for tech-
nical assistance funds: Virgin Islands GIS system. Office of Terri-
torial and International Affairs, U.S. Department of the Interior
(April).
6. Parks, J.E. 1992. Invitation to a demonstration of the Virgin Islands
GIS. U.S. Geological Survey, St. Thomas, U.S. Virgin Islands (Oc-
tober).

7. University of Manchester, British Overseas Development Admini-
stration. 1993. British Virgin Islands coastal atlas. Project was
collaboration between graduate students at Manchester University,
the British government, and the government of the British Virgin
Islands.

8. Berry, J.K. 1994. What GIS can do for you. GIS World. Boulder,
CO (May).
321
-------
Application ofGIS for Environmental Impact Analysis in a Traffic Relief Study
Bruce Stauffer
Advanced Technology Solutions, Inc., Lancaster, Pennsylvania

Xinhao Wang
University of Cincinnati, Cincinnati, Ohio
Abstract

This paper presents an application of a geographic infor-
mation system (CIS) in a traffic relief study. Traffic conges-
tion has severely affected the environmental quality and
the quality of life for residents in the study area. A team of
planners, environmental specialists, historians, landscape
architects, traffic engineers, and CIS professionals organ-
ized to solve the problem. The team has evaluated the
environmental and socioeconomic impacts of highway
alignments from the very first step through every major
decision for the duration of the project.

The CIS professionals have played a crucial role in
maintaining constant and active interactions among
members of the project team, federal and state agen-
cies, and the public. CIS has helped to develop a natural
and cultural resource inventory, identify contamination
sources, assess environmental constraints, and evalu-
ate proposed highway alignment alternatives. CIS pro-
vides an ideal atmosphere for professionals to analyze
data, apply models, and make the best decisions. The
high-quality map products that CIS creates enhance the
quality of public presentations and reports. The authors
feel that, as this project has progressed, more people
have realized the benefit of using CIS.

Introduction

A traffic relief study, as one type of transportation
project, aims to resolve traffic congestion problems
through a combined strategy of upgrading existing
infrastructure, building new infrastructure, and con-
trolling traffic demand using congestion management
strategies (CMS). This type of study proceeds through
at least the following steps:

• Problem identification

• Data collection
• Preliminary design

• Environmental impact analysis

• Final design

• Construction

The process heavily involves federal, state, and local
government agencies, as well as the public. The goal of
the project is to develop an environmentally sound so-
lution to the traffic congestion, which also happens to
promote economic development and improve quality of
life for people in the local area and the region. Environ-
mental, social, and economic issues must be equally
addressed from the very first step through the final
design. Federal and state regulations generally require
an environmental impact statement (EIS) when con-
structing new infrastructure or upgrading existing road
systems. Preparing an EIS is a requirement for such a
project and demands a significant commitment of time,
money, staff, and technical resources.

A geographic information system (CIS) has the ability to
process spatially referenced data for particular pur-
poses. Along with the development of computer hard-
ware and software, CIS has progressed from pure
geoprocessing, to management of geographic informa-
tion, to decision support (1). This paper presents the
application of CIS in an ongoing traffic relief study in
Marshalls Creek, Pennsylvania. The CIS function in this
study has had various purposes:

• Inventory data compilation

• Spatial data analysis

• Map production

• Traffic modeling support

• Public presentations
322
-------
The study shows that CIS can play an important and
innovative role in transportation studies.

Project Description

The study area is in the Pocono region, located in
northeastern Pennsylvania (see Figure 1). The Pocono
Mountains and Delaware Water Gap National Recrea-
tion Area possess a wealth of natural and cultural re-
sources. The area is famous for providing year-round
vacation activities. Attractions include fishing, canoeing,
and Whitewater rafting in the summer and downhill and
cross-country skiing, snowmobiling, snowboarding, and
ice fishing in the winter. The area includes quiet wood-
land trails past a rushing waterfall and scenic settings
for camping. In the fall, the area is ablaze with the
brilliant colors of foliage. Various scenic sights, recrea-
tional sites, and national historic sites make the area
ideal for attracting people to come for a day, a weekend,
or a longer vacation.

Although tourism brings people to the Pocono area and
promotes economic growth, it also brings a traffic con-
gestion problem to the community. In addition, the influx
of new home owners from New Jersey and New York
adds the problem of commuter traffic to the area. The
most troublesome section is in the vicinity of Marshalls
Creek where U.S. Route 209 intersects with Pennsylva-
nia (PA) Route 402. Two intersections are only about
500 feet apart. The traffic tieups can extend up to
3.5 miles on northbound Route 209, all the way back to
Interstate Highway I-80. Emergency response times on
U.S. Route 209 can be up to 20 minutes during peak
traffic. The heavy traffic volume results in traffic acci-
dents exceeding state averages on secondary roads as
motorists seek alternative routes to avoid congestion.
Through traffic traveling to and from New England using
U.S. Route 209 as a connector between I-80 and I-84
makes the problem even worse. The year-round outdoor
activities perpetuate the constant traffic problems that
have severely affected the quality of life for residents in
and around the Marshalls Creek area.

In response to the problems, the Pennsylvania Depart-
ment of Transportation (PennDOT) selected a project
team in February 1993 to conduct a traffic relief study in
the Marshalls Creek area. The project team consists of
individuals from seven firms and represents a wealth of
experience in the variety of disciplines necessary to
successfully complete this project. The team members
include land use and traffic planners, biologists, histori-
ans, traffic and environmental engineers, surveyors, and
CIS and global positioning system (GPS) professionals.

In addition to PennDOT, the funding agency, several
federal and state regulatory agencies periodically review
the development of the EIS to ensure that it meets
regulations. These agencies are the Federal Highway
Administration (FHWA), the U.S. Environmental Protec-
tion Agency (EPA), the U.S. Army Corps of Engineers,
the Pennsylvania Department of Environmental Re-
sources (DER), the Pennsylvania Historic Museum
Commission, and the Pennsylvania Fish and Game
Commission. Local planning commissions and citizen
representatives also actively participate in advisory ca-
pacities. A series of agency coordination meetings, pub-
lic meetings, and public information newsletters
coordinates the activities of all participants over the
course of the project.
Primary and Secondary Highways
Light Duty Roads
Phase I Project Boundary-
Streams, Ponds, and Reservoirs
5,000
Feet
10,000
Figure 1. Phase I project area.
323
-------
After collecting traffic data and performing traffic de-
mand modeling, the team realized that adopting strate-
gies to control traffic demand and upgrading existing
roads through widening and intersection improvements
would not suffice to meet demand projections for the
design year 2015. Consequently, the team has deter-
mined a new road is needed to alleviate congestion in
Marshalls Creek.

The study aims to identify alternatives to relieve traffic
congestion along U.S. Route 209, PA Route 402, and
Creek Road, and eliminate backups onto I-80 from U.S.
Route 209. The alternatives should also improve air
quality by reducing fuel consumption and vehicle emis-
sions and facilitate travel through Marshalls Creek for
local and through traffic. The improvements must com-
ply with federal and state regulations. The study team
must consider county and local government goals and
objectives so that the traffic capacity improvements will
be compatible with planned local development.

The project is being conducted in two phases. Phase I,
which is complete, was an investigation broad in scope.
It used inventory of secondary data to describe the
environmental characteristics of the area. A traffic de-
mand model identified the area for detailed study after
a preliminary analysis of a wide range of baseline data.
In Phase II, the team analyzes both primary and secon-
dary data and delineates alternative alignments or trans-
portation upgrading options that meet the need and
minimize impacts. Analysis of environmental and engi-
neering factors assists both in the determination of the
most practical alternative and in preparation of the final
EIS.

The nature of the study requires the analysis of a variety
of data at different scales by different professionals.
Through field investigation, the project team also con-
stantly updates and adds new data to the existing data-
base. The new data may be attribute data about some
geographic features or may be locational data. The
project team has found CIS to be an appropriate tool to
meet the challenge of better conducting the study.

The team has used CIS extensively in both Phase I and
Phase II studies. The two phases vary in data require-
ments, scales, and purposes of spatial analysis. With
the support of CIS, the team has been able to quickly
assemble data at adequate scales and present data in
formats that are familiar to different professionals. GIS's
data manipulation power distinguishes the different re-
quirements of the two phases and at the same time,
clearly depicts the linkage between the two phases. The
following sections describe CIS applications that have
helped facilitate the study and coordinate project team
members, public agencies, and citizens.
GIS Application

CIS contains powerful tools to process spatially refer-
enced data. These processes and their results are
meaningless, however, without a clearly defined objec-
tive. Many professionals point out the importance of
focusing GIS on practical problems. Using GIS is not an
end; it is a means to represent the real world in both
spatial and temporal dimensions. The benefits of using
GIS can be summarized in three aspects:

• GIS helps to portray characteristics of the earth and
monitor changes of the environment in space and
time (2).

• GIS helps us to more deeply understand the meaning
of spatial information and how that information can
more faithfully reflect the true nature of spatially dis-
tributed processes (3).

• GIS helps us to model alternatives of actions and
processes operating in the environment (2), to antici-
pate possible results of planning decisions (4), and
to make better decisions.

This project demonstrates the advantages of applying
GIS to solve practical problems from the above three
aspects. An EIS requires extensive data about natural
resources, land uses, infrastructure, and distribution of
many interrelated socioeconomic factors. The accuracy
and availability of required data depend on the scope of
a study and the size of the study area. Our study shows
that GIS, with its data retrieval, analysis, and reporting
abilities, significantly improves the analysis. GIS helps
to collect data at various scales, store data, and present
data in forms that allow the project team to carry out the
study in an innovative way.

Phase I Study

Phase I of the traffic relief study was completed in 1993.
The goal of Phase I was to acquire understanding of the
general features of the area and to use a traffic demand
model for delineating an area for detailed study. The
study area is approximately 52 square miles. To provide
data for the preliminary analysis and the traffic demand
modeling, the team developed baseline data inventory
with GIS. Data were primarily secondary data that came
from several different sources in different formats. For
example, the U.S. Census Bureau 1990 population data
were in TIGER format, the U.S. Fish and Wildlife Service
National Wildlife Inventory (NWI) files in digital line
graph (DIG) format, the U.S. Soil Conservation Service
Monroe County Soils in DIG format, and the U.S. EPA
Monroe County Natural Areas in ARC/INFO format. The
majority of data sources were at scales between 1:15,000
and 1:24,000. With GIS tools, the team integrated these
baseline data into a common presentation scale and pro-
jection. This process ensured an effective and comprehen-
sive spatial analysis in the study area.
324
-------
The team arranged and stored data in layers according
to themes. Examples of the data layers included:

• Road center lines.

• Twenty-foot elevation contours.

• Utility lines.

• Water features.

• Subdivision boundaries.

• 1990 Census population by Census Tracts and Blocks.

• Flood plains.

• Geological formations.

• Public facilities, including schools, churches, and
cemeteries.

• Political boundaries.

• Hazardous waste locations.

• Potential archaeological areas.

With these data layers, the team generated a series
of 17 thematic maps to describe the features of the
study areas. All maps were plotted on E-sized papers
(48 inches x 36 inches) with the same map layout. The
general reference map served as a base map for the
other themes. It included several data layers to provide
geographic references to the study area.

In addition to the base map, each individual theme map
showed only one theme at a time, such as soils, subdi-
visions, and wetlands. Some theme maps showed de-
rived data from the original data layers. For example, in
developing a slope theme map, the team first built a
three-dimensional surface from the 20-foot elevation
contours, then calculated slope in degrees and aggre-
gated areas based on a 10-degree interval. The slope
theme map showed the result from the data processing.
In addition, the project team created summary statistics
tables to help team members gain knowledge about the
study area. Table 1 is an example of the summary
statistics for land use categories.
Table 1. Phase I Statistical Summaries for Land Use
Categories
Land Use
Acres
Urban

Agricultural

Rangeland

Woodland

Water

Wetland

Transitional
10,628

1,183

20,957

1,394

1,428

422
During preparation of the Phase I inventory data and
summary statistics, traffic planners performed traffic de-
mand modeling to determine new road connections that
would provide a minimum acceptable level of service in
the year 2015. The modeling result was loaded into the
CIS and converted into the same format and projection
as other inventory data. Figure 2 displays the boundary
that the traffic demand model delineated and the actual
Phase II boundary. The two boundaries were not the
Primary and Secondary Highways
Light Duty Roads
Unimproved Roads
Trails

Phase II Project Boundary
Streams and Ponds

Minimum Acceptable Service Level
2,000
Feet
4,000
Figure 2. Phase II project area.
325
-------
same. The thick line enclosed a Phase II study area that
was delineated based on the traffic demand modeling
and the team's understanding of environmental and
other factors in the project area.

Phase II Study

The Phase II study is still ongoing. The area for the
Phase II study is much smaller than that for Phase I. It
is about 3.2 miles by 2.5 miles, or approximately
4 square miles.

The objective of the Phase II study is to conduct a detailed
analysis for delineating a full range of feasible highway
alignment alternatives. The alternatives must meet the
needs of relieving traffic congestion and minimizing its
impact on environmental and cultural resources.
Because the accuracy of the Phase I data was not
sufficient for the Phase II study, the project team has
collected data using different approaches to develop a
similar set of baseline data at a finer scale. The major
data source has been the photogrammetry data pro-
vided by PennDOT at a 1:2,400 scale. The data include:
• Road cartways
• Five-foot elevation contours

• Utility lines
• Water features
• Buildings footprints

• Bridges
The team directly digitized tax parcel boundaries from
Monroe County Tax Assessor's maps that range in
scales from 1:1,200 to 1:4,800. After digitizing each map
sheet separately, the team merged them together to
create a continuous parcel layer. The data layer has
been adjusted to fit with the PennDOT photogrammetry
data although the two data sets do not seem to match
exactly. In addition, digital orthophotographs at 5-foot
pixel resolution were also obtained for the project.

From these baseline data, the team has constructed
Phase II data layers in four different ways.
The first approach digitizes from compilations on project
base maps. For example, the team creates a land use
data layer from the digital orthophotographs and infrared
photography. The CIS group first plots the digital ortho-
photographs on a set of 1:2,400-scale map sheets.
Road cartway and water features are plotted on top of
the orthophotographs. Then land use specialists deline-
ate land use boundaries with fine color markers and
code land uses on maps according to the Anderson land
use classification. In the end, the CIS group digitizes the
land use boundaries from the compilations to create a
land use data layer. Similarly, the 100-year flood plain
data layer is delineated from compilations on project
base maps with 5-foot contours and digitized.

The second approach derives new data layers from
existing data. Buildings and structures are plotted at a
1:2,400 scale. Both historians and environmental engi-
neers use the plots in their field investigations. After
historians identify historic-eligible buildings on the plots,
the CIS group develops an attribute data file that links
the historic inventory data to the building geometry.
Similarly, field investigations identify buildings and struc-
tures associated with contamination sites. The system
stores types of contamination as building attributes. By
overlaying the building data layer with the tax parcel
data layer, the team can identify properties on which
historic buildings or contamination sites are located.

The third approach constructs data layers by refer-
encing Phase I data. For example, Phase II subdivision
boundaries are derived from the digitized tax parcel
boundaries by referring to the Phase I subdivision
boundaries. Phase I subdivisions were manually com-
piled at 1:24,000 using approximate location, which did
not align very well with the more accurate tax parcel
boundaries. Using a 1:7,200-scale plot that shows both
Phase I subdivision boundaries and the Phase II tax
parcel boundaries, planners can verify and indicate
properties associated with each subdivision. These
properties are dissolved to create new boundaries for
subdivisions that precisely fit with tax parcels. The same
approach is used to refine public parks and private
recreation areas.

The fourth approach obtains spatial data with GPS. The
GPS surveyors collect accurate locational data about
key features, such as boundaries of wetlands, site loca-
tions for hazardous waste, and locations of archaeologi-
cal field samples. GPS data also supplement existing
data, such as delineating footprints of new buildings to
update the PennDOT baseline data. The integration of
CIS and GPS provides the project team with accurate
and up-to-date data.

Phase II data layers have provided much richer informa-
tion for a detailed study of environmental features. They
are merged in many different combinations to show the
spatial distributions of different factors from different
perspectives. Table 2 lists some of the map themes
created for the Phase II study. All maps are plotted at a
1:7,200 scale.

In addition to using CIS as a data library and map
production tool, we use CIS to support decision-mak-
ing in two ways. First, the creation of a composite data
layer has revealed the impact of alternative align-
ments on several composite constraints. The compos-
ite data layer is an overlay of several inventory layers
and shows the various factors coincident at any loca-
tion, and the relative importance of these factors.
326
-------
Table 2. Selected Phase II Map Themes

Theme Description
General
reference

Parcels

Community
facilities

Flood plains

Land use

Subdivision

Slope

Wetlands

Historic
resources

Hazardous
wastes
Road networks, hydrographic features,
churches, municipal boundaries, and utilities

Tax parcel boundaries

Public parks and private recreation, cemeteries,
and public buildings

100-year flood plains

Current land uses

Approved subdivisions

Areas delineated with 5-degree slope intervals

Wetlands

Historic buildings and properties

Hazardous waste sites and related buildings
These constraints have been identified by citizens,
government agencies, and the project team. The major
composite constraints include wetlands, historic proper-
ties, steep slopes (slope greater than 15 degrees), pub-
lic parks and private recreation areas, 100-year flood
plains, potential archaeological areas, prime agricultural
land, subdivisions, and existing buildings and structures.

Two maps have been created from the composite con-
straints layer. One map shows the number of coincident
constraint layers that occurs in any one location (see
Figure 3). The other map shows the composite relative
importance of coincident features. Both maps present a
"sensitivity surface" view of the project area.
Secondly, the team uses CIS to perform interactive
summary statistics for each alternative alignment. The
project team analyzes the impact of the alignment alter-
natives on each individual constraint layer.

Two approaches define the impact areas for compari-
son. The first set impact areas are the areas enclosed
by the footprint that traffic engineers delineated for each
alternative. The second set impact areas are 300-foot
buffers on both sides of the alignment delineated by the
traffic engineers. The boundaries of the impact areas
overlay on constraint data layers. Figure 4 displays
wetlands crossed by alignment alternative ROW1B.

A set of summary statistics are calculated for each
alignment. In the end, we compare the statistics for each
alignment in a matrix (see Table 3). The matrix arranges
constraints as rows and alignment alternatives as col-
umns. The statistics include acres of selected features
within each impact area, such as wetlands or high-quality
watersheds, and total counts of features, such as his-
toric-eligible buildings. The summary statistics also in-
clude listings of building names for businesses or public
facilities within the impact areas.

The team has repeated the summary statistics several
times as alignments shift. This procedure ensures that
the final selected highway alignment minimizes environ-
mental impacts, best meets project needs, and is the
most cost-effective alignment to construct. The statisti-
cal matrix of impacts versus alignments is one of the
critical evaluation criteria for comparing alignments and
ultimately for selecting the final alignment.
Existing Roads

Pipeline

Phase II Project Boundary
One to Two Constraint Layers
Three to Four Constraint Layers
Five to Six Constraint Layers
o
1,000
Feet
2,000

Figure 3. Composite constraints.
327
-------
Table 3. Phase II Summary Statistics by Alternative

Constraints Alignment Alternatives

ROW1A ROW1B ROW2A ROW2B
ROWS
Wetlands
Acres
Count

2.76
13

3.27
14

3.25
17

1.8
15

13.64
17
High-Quality Watersheds
Acres 66.18 85.62 71.92 94.58 82.90
Hazardous Waste Parcels
Acres 99.50 116.77 98.25 120.68 215.38
Count 8 5 11 10 6
Parcels With Historic-Eligible Buildings
Acres 14.20 0.10 0.14 0.14 3.46
Count 31222
Historic-Eligible Buildings
Count 1011 1

Benefit of Using GIS

In recent years, many federal, state, and local agencies
have been actively acquiring and automating digital data
(5). These databases provide various types of informa-
tion at scales that are appropriate for a preliminary study
covering a large area.

A more detailed study, which usually covers a smaller
area, often requires more accurate data to describe the
spatial distribution of relevant factors. GIS is flexible,
allowing use of data at the scale and accuracy appropri-
ate to the study purpose. The team has found that this
feature helps improve the efficiency of the project with-
out sacrificing the accuracy. This study has required
two sets of scales ranging from 1:24,000 scale for
Phase I, which required projectwide socioeconomic
and environmental assessments, to 1:2,400 scale for
Phase II, which requires detailed analysis for design of
alternative alignments.

GIS has served as a digital database manager to as-
semble environmental, traffic, geographic, socioeco-
nomic, and other data into a centralized project
database. Data analyzed in this study originate from a
variety of sources, such as PennDOT, U.S. Geological
Survey (USGS), U.S. Census Bureau, Monroe County,
and field survey. They are in different formats, including
digital data in ARC/INFO, INTERGRAPH, and AutoCAD
formats, GPS data, digital images, paper maps, and
tabular data. Many of the public agencies and private
organizations involved in this project already have digital
data that GIS could easily use for this specific project.
This has helped to reduce the overall costs of data
collection and conversion.

This study demonstrates that GIS can support the infor-
mation needs of many disciplines within a common
framework and provide powerful, new tools for spatial
analysis. Aside from technological considerations, GIS
development initiates a higher-order systematization of
geographic thinking (3), which is crucial to the success
of a transportation project. In this project, GIS helps to
determine the total impacts of alternative alignments on
identified constraints. The team accomplishes this by
superimposing the alternative alignments on constraint
Existing Roads

Alternative ROW1B

Buffered Area Boundary

Pipeline

Phase II Project Boundary

Wetlands

Buffered Area of Alternative ROW1B
1,000
Feet
2,000
Figure 4. Wetlands crossed by an alternative alignment.
328
-------
data layers to determine the total amount of each con-
straint layer that each alignment encounters. This ap-
proach supports the analysis of multiple alignments
across the constraint surfaces for a variety of alternative
scenarios. The spatial analysis tools and statistical
function embedded in CIS prove to be very useful in
such study.

Digital data that the CIS stores are used to summarize
environmental, social, and economic data in many dif-
ferent ways. These functions include summing total
acreage, listing entities of special interest, and counting
numbers to provide useful baseline statistics for various
alignments. Within a day, CIS accomplished what would
have required several months of staff labor; CIS sum-
marized impacts of 11 alternative alignments on all con-
straint layers. In addition, CIS creates composite
constraints from individual data layers. A composite con-
straint data layer is created through a series of overlays
to illustrate geographic coincidence of inventory themes.

Conventionally, engineers in a project such as this first
delineate alternatives for alignments from the engineer-
ing perspective; they often consider factors such as
steep slopes and costs. Then, other professionals, such
as environmental specialists, historians, and planners,
evaluate the alternatives from each point of view. CIS
makes possible an early integration of environmental
and engineering activities, ongoing communication with
funding agencies and the public, and continual integra-
tion of a multidisciplinary team.

CIS helps to maintain high-quality data for the project.
It allows for error checking and quality control of multiple
data layers that would not be possible with conventional
mapping. The team always compares a new data layer
with other data to check for conflicts. Making check plots
allows for quick identification of errors and missing data.

For instance, in the process of assigning building-use
attributes to existing buildings, the CIS team first plotted
buildings on a map and created a table with building
identifiers. The field team then used the unique identifi-
ers shown on the plots when noting building names and
building uses in data collection tables. After relating the
data table with the building data layer, the team found
some buildings that did not have building use data.
Moreover, some buildings were assigned uses that were
out of range or seemed out of place, such as a residen-
tial building surrounded by several commercial build-
ings. The team highlighted the data for those buildings
and sent them to engineers for verification. Through this
data cleaning process, the team was able to obtain
complete building use information.

Data quality directly affects project quality. Without CIS,
this type of study often involves using and comparing
maps at different scales, which frequently introduces
serious errors. For each phase of this study, project
team members have used a fixed-base map scale for all
compilations. During data entry and data transformation,
the team has kept accurate registration between data
layers, ensuring data of the same resolution. In addition,
the team has used FREQUENCY, one of the tools that CIS
provides, to look for data values that are out of range as
well as missing data. CIS tools have also helped to
derive new relationships for features. For example, dis-
solving parcels has helped to create subdivision outlines,
or overlaying historic buildings with parcels has helped to
find the parcels on which they are located.

Planners, environmental specialists, historians, and
landscape architects on the project team are responsi-
ble for field data collection, verification, and if necessary,
compilation of field data into the standard project data-
base. Wherever possible, the team has used GPS to
eliminate the task of manual compilation and to improve
accuracy of locating data. A fundamental requirement in
applying the technology appropriately is to understand
its capabilities, requirements, and limitations. Because
several members manage inventory attributes, they
need to know how to maintain unique identifiers for
features so they can link up to the geometry. Because
AutoCAD data transfers occur routinely with engineers,
it is necessary to structure how the AutoCAD drawing
files can be organized, as well as how certain attributes
can be transferred by line color, layer name, or line
width. The CIS group coordinates closely with other
team specialists to identify quick, accurate, and cost-
effective methods of data collection, data analysis, and
presentation. In conjunction with the progress of the
project, specialists from different fields have become
familiar with the concept, requirements, and use of CIS.
They now feel comfortable discussing alternatives while
looking at results displayed on a computer screen.

CIS has created high-quality map products for public
presentations and reports. CIS has also been used in
several agency coordination meetings to display data
and alternative alignments. CIS has allowed for different
data layers to be displayed on the screen with specific
combinations of features at various scales. Public agen-
cies and citizens have been impressed by the clear and
friendly graphic response to their questions. They have
expressed interest in using the technology in future
projects.

Once the study is complete, digital data assembled in
this project will be an excellent resource for future pro-
jects in the study area. For these reasons, CIS provides
more cost-effective project support for gathering, man-
aging, and using data than that provided by paper and
mylar maps.

Summary

For this project, the importance of innovation based on
a solid scientific foundation cannot be overstated. In the
329
-------
current economic and regulatory climate, sound CIS
methods are emerging as the only convincing and cost-
effective means for locating, designing, and gaining
approval for major public and private infrastructure
projects.

The technology offers new and exciting tools for trans-
portation planning.

The methodology that this project team has used can be
successfully applied to other projects that require envi-
ronmental assessment. The team has found CIS to be
an extremely useful tool as users continue to learn its
capabilities and the multiple tools that it offers. The
regulatory agencies have repeatedly made favorable
comments on how CIS can offer interactive viewing in a
show-and-tell environment. This project is one of the first
EIS projects to use CIS in a PennDOT-funded project.
PennDOT appears convinced that CIS is an important
component for conducting EIS for highway studies.
More EIS projects probably will demand CIS services as
part of the project approach, in part, because many
federal and state regulatory agencies are increasingly
using CIS.

References

1. Fischer, M.M. 1994. From conventional to knowledge-based geo-
graphic information systems. Computer, Environment, and Urban
Systems 18(4):233-242.

2. Star, J., and J. Estes. 1990. Geographic information systems: An
introduction. Englewood Cliffs, NJ: Prentice Hall, Inc.

3. Bracken, I., and C. Webster. 1990. Information technology in ge-
ography and planning, including principles of CIS. London, Eng-
land: Routledge.

4. Burrough, P.A. 1986. Principles of geographic information system
for land resources assessment. Oxford, England: Clarendon
Press.

5. Hendrix, W.G., and D.J.A. Buckley. 1992. Use of a geographic
information system for selection of sites for land application of
sewage waste. J. Soil and Water Conserv. 47(3):271-275.
330
-------