NCEA-C-1698
                                   April 2006
    Classification of High Spatial
 Resolution,  Hyperspectral Remote
Sensing Imagery of the Little Miami
         River Watershed in
       Southwest Ohio, USA
        National Center for Environmental Assessment
           Office of Research and Development
          U.S. Environmental Protection Agency
              Cincinnati, OH 45268

-------
                                    NOTICE
      The U.S. Environmental Protection Agency through its Office of Research and
Development funded and managed the research described here under contract no.
68-C-02-060 to Forest One, Inc. and 3C-R337-NTSA to Hyperspectral Data
International,  Inc.  It has been subjected to the Agency's peer and administrative review
and has been approved for publication as an EPA document. Mention of trade names
or commercial products does not constitute endorsement or recommendation for use.
                                  ABSTRACT

      This report summarizes a collaborative project led by the U.S. Environmental
Protection Agency to create a high spatial resolution land use/land cover (LULC)
dataset for the entire Little Miami River watershed in southeastern Ohio, USA from
remotely sensed imagery. The LULC classification was derived from 82 flight lines  of
Compact Airborne Spectrographic lmager(CAS\) hyperspectral imagery acquired from
July 24 through August 9, 2002 via fixed wing aircraft.  Categories within this
classification include: water  (both lentic and lotic),  forest, corn, soybean, wheat,  dry
herbaceous vegetation, grass, urban barren, rural barren,  urban/built, and unclassified.
A hierarchical classification approach was used involving object image segmentation in
eCognition (Definiens Imaging GmbH., 2003), and spectral angle mapper (SAM) in  the
ENvironment for Visualizing Images (ENVI) (Research Systems Inc., 2003).  A final
classification was completed after an extensive Quality Assurance and Quality Control
(QA/QC) phase which included manual editing.  The final product includes classification
results at the original data spatial resolution of 4m x 4m.
Preferred citation:
Troyer, M.E., J. Heo and H. Ripley. 2006. Classification of High Spatial Resolution, Hyperspectral
Remote Sensing Imagery of the Little Miami River Watershed in Southwest Ohio, USA.  U.S.
Environmental Protection Agency, Office of Research and Development, National Center for
Environmental Assessment, Cincinnati, OH.

-------
                       TABLE OF CONTENTS
LIST OF TABLES	iv
LIST OF FIGURES	v
LIST OF ABBREVIATIONS	vi
AUTHORS, CONTRIBUTORS AND REVIEWERS	vii
ACKNOWLEDGEMENTS	ix
PREFACE	x

1.    PROJECT DESCRIPTION	1

2.    THE REMOTE SENSING EFFORT	3

     2.1.  INTRODUCTION	3
     2.2.  RADIOMETRIC  CORRECTION	3
     2.3.  GEOMETRIC CORRECTION	8
     2.4.  ATMOSPHERIC CORRECTION	8

3.    IMAGE CLASSIFICATION	9

     3.1.  INTRODUCTION	9
     3.2.  THE CLASSIFICATION PROCESS	9

          3.2.1. Quality Review of the CASI Data for Purposes of Classification	9
          3.2.2. Ground Truth	13
          3.2.3. Classification Method	14
          3.2.4. Consistency of Classification Across Flightlines	23

4.    POST-PROCESSING AND QA/QC	24

     4.1.  MANUAL EDITING	24
     4.2.  MAP GENERALIZATION	25
     4.3.  ACCURACY ASSESSMENT	25

5.    RESULTS AND DISCUSSION	29

6.    REFERENCES	31

APPENDIX A: FLIGHT LOGS  OF THE CASI DATA COLLECTION IN THE
LITTLE MIAMI RIVER WATERSHED	A-1
APPENDIX B: PROTOCOL FOR COLLECTING GROUND TRUTH LAND
COVERS FOR THE LITTLE MIAMI RIVER WATERSHED 2002	B-1
APPENDIX C: GENERAL RULES FOR OBTAINING TRAINING SET AND
SUPPLEMENTAL GROUND TRUTH DATA	C-1
APPENDIX D: METADATA	D-1

-------
                               LIST OF TABLES






No.                                  Title



1     Spectral Bands Remotely Sensed in the Little Miami River Project	5



2     Projection for the Little Miami River Land Use/Land Cover	22



3     Classification Error Matrix	26



4     Summary Statistics of the Accuracy Assessment	27
                                      IV

-------
                               LIST OF FIGURES
No.                                  Title

1     Orientation Map of the Little Miami River Watershed in Southwestern
      Ohio, USA	2

2     Remote Sensing Instrumentation Used in this Project	4

3     Flight Lines and Acquisition Dates of CASI  Data in the Little Miami
      River Watershed	7

4     An Unclassified Gap Between Flight Lines 13a and 13b, Acquired on
      July 25, 2002	10

5     Cross-track Illumination Artifacts from Flight Line 38 Acquired on
      July 24, 2002	12

6     Cross-track Illumination Artifacts from Flight Line 10 Acquired on
      July 25, 2002	12

7     Image Object Segmentation versus Pixel-Based Classification of a
      Forest Area in the Little Miami River Watershed	16

8     Flowchart of the Classification Method	18

9     Schematic of the Hierarchical Classification and Derived Classes	19

10    Legend for the Final 4m x 4m LULC Product	20

11    Overview of the 4m x 4m LULC Classification for the Little Miami
      River watershed	21

-------
                         LIST OF ABBREVIATIONS

ACORN    Atmospheric CORection Now
BIP        Band Interleaved-by-Pixel
CART      Classification and Regression Tree
CASI       Compact Airborne Spectrographic Imager
DN        Digital Number
ENVI       ENvironment for Visualizing Images
FGDC      Federal Geographic Data Committee
GMT       Greenwich Mean Time
GPS       Global Positioning System
HDI        Hyperspectral  Data International
IFOV       Instantaneous Field of View
LULC       Land Use/Land Cover
MLC       Maximum Likelihood Classification
NLCD      National Land  Cover Dataset
PPS       Pulse-Per-Second
QA/QC     Quality Assurance and Quality Control
RMS       Root-Mean-Square
SAM       Spectral Angle Mapper (classification in ENVI)
                                    VI

-------
                AUTHORS, CONTRIBUTORS AND REVIEWERS
AUTHORS
Michael E. Troyer, Ph.D.
National Center for Environmental Assessment
Office of Research and Development
U.S. Environmental Protection Agency
Cincinnati, OH 45268

Joon Heo, Ph.D.
Forest One, Inc.
Itasca, IL

Herbert Ripley, FRSPSoc
Hyperspectral Data International, Inc.
Halifax, Nova Scotia
CONTRIBUTORS

Christopher Bolton
Earth Satellite Corp.
Rockville, MD

Michael Diller
Earth Satellite Corp.
Rockville, MD

Christopher Jengo
Earth Satellite Corp.
Rockville, MD

William Jones
Hyperspectral Data International, Inc.
Halifax, Nova Scotia
Sitansu Pattnaik
Forest One, Inc.
Itasca, IL

Laura Roy
Hyperspectral Data International, Inc.
Halifax, Nova Scotia

Francois Smith
Earth Satellite Corp.
Rockville, MD

Michelle Warr
Hyperspectral Data International, Inc.
Halifax, Nova Scotia
INTERNAL PEER REVIEWERS

F. Bernard Daniel, Ph.D.
Senior Environmental Scientist
National Exposure Research Laboratory
Office of Research and Development
U.S. Environmental Protection Agency
                                     VII

-------
             AUTHORS, CONTRIBUTORS AND REVIEWERS cont.
NoelW. Kohl, M.P.H.
EPA CIS Working Group
Office of Information Services
U.S. Environmental Protection Agency
Region 5
Chicago, IL

Cheryl G. Itkin
EPA CIS Working Group
Information Management
National Center for Environmental Assessment
Office of Research and Development
U.S. Environmental Protection Agency
Washington,  DC

Tara A. Maddock, Ph.D.
Geographer
National Center for Environmental Assessment
Office of Research and Development
U.S. Environmental Protection Agency

Mary L. White, Ph.D.
EPA CIS Working Group
Water Quality Branch, Water Division
U.S. Environmental Protection Agency
Region 5
Chicago, IL
EXTERNAL PEER REVIEWERS

Nina Lam, Ph.D.
R.J. Russell Professor of Geography
Louisiana State University
Baton Rouge, LA

Victor Mesev, Ph.D.
Associate Professor of Geography
Florida State University
Tallahassee, FL

R. Douglas Ramsey, Ph.D.
Associate Professor and Director of the Remote Sensing and CIS Laboratory
Department of Forest, Range, and Wildlife Sciences
Utah State University
Logan, UT
                                    VIM

-------
              AUTHORS, CONTRIBUTORS AND REVIEWERS cont.


ACKNOWLEDGMENTS

      Many people provided their time and talents to ensure the success of this project.
Essential fieldwork and ground-truth data were provided by the following scientists from
the U.S. Environmental Protection Agency's research facilities in Cincinnati, Ohio: Drs.
F. Bernard Daniel, Michael B. Griffith, James M. Lazorchak,  Tara Maddock, Matthew A.
Morrison, Bruce W. Peirano, Joseph P. Schubauer-Berigan, Christopher Schultz and
William Shuster.  Peter Mueller and Drew Comley (Dr. Troyer's summer interns) also
assisted in these field efforts as participants of the "Research Apprenticeship Program"
sponsored by the University of Cincinnati and the U.S. EPA. Additional aerial imagery
used in this project was generously provided by Dr. Lawrence  Spencer from the Ohio
State University - Center for Mapping.  Early discussions on remote sensing,
classification and choosing the proper spectral bands and widths for this project were
substantively informed by conversations with Dr. Prasad S. Thenkabail of the Center for
Earth Observation at Yale University, and David J. Williams  of the U.S. EPA's
Environmental Services Division in Reston, Virginia. Appreciation also goes to  Bette
Zwayer (U.S. EPA National Center for Environmental Assessment), Lana Wood
(Intellitech Systems, Inc.) and Elizabeth Fryer (ECflex) who helped with preparing
tables, and formatting and editing the document into its final form.
                                       IX

-------
                                   PREFACE

      Geographic products such as the land use/land cover (LULC) dataset presented
herein enable a wide array of studies important for sustaining both society and nature.
For example, opportunities and risks associated with various human-ecological
interactions can be identified, as well as a greater elucidation of other spatial
relationships and processes affecting the Earth's surface and our environment.
      In 2001, an interdisciplinary group of scientists based at the U.S. Environmental
Protection Agency's research facilities in Cincinnati, Ohio formed a collaborative effort
to study the Little Miami River (LMR) and its watershed.  This work was performed in
support of the U.S. EPA's watershed (or geographic) approach for protecting
designated uses of America's rivers and streams. In particular, development of this
LULC dataset provides a basis for studying:  (1) social drivers of land use change, (2)
how land use change affects the hydrology, sediments and  nutrients of streams, and (3)
aquatic biological responses as a result of changes in the landscape. This work also
assisted in exploring and suggesting refinements to spatially-explicit criteria intended to
prevent habitat alteration, excess nutrients, suspended and bedded sediments,
pathogens, toxic chemicals, and other stressors affecting the Nation's waters.
      A primary goal of this project is to help enhance the use of geographic and
spatial analytic tools in risk assessments at U.S.  EPA, and to improve the scientific
basis for risk management decisions. This is important because environmental
problems are inherently spatial.  For example, many pollutants originate from multiple
non-point sources in the landscape and spread to other areas within a particular
watershed, "airshed," or across ecological and political boundaries. From a
technological point of view, applications of this product will represent an improvement
from more readily available data with coarser spatial and spectral resolution. The
hierarchical classification approach used incorporating both object-based pattern
recognition and spectral techniques may be  beneficial and transferable to other settings.
This dataset may also be useful to others outside the Agency; particularly, those
interested in studying anthropogenic and natural processes occurring at watershed or
smaller spatial scales.

-------
                          1. PROJECT DESCRIPTION

      The goal of this project was to create a higher spatial resolution land use/land
cover (LULC) dataset for the entire Little Miami River watershed in southeast Ohio for a
number of studies being conducted by the U.S. Environmental Protection Agency (U.S.
EPA). The  Little Miami River has a drainage area of 175.5 square miles and stretches
in a southwesterly direction for 105.5 miles originating from near South Charleston,
Ohio to its confluence with the Ohio River east of Cincinnati, Ohio (Figure 1). It is one
of the oldest river groups in the state and became Ohio's first  State and National Scenic
River (Sanders, 2002).  Prior to this project, existing LULC datasets based on Landsat
imagery had spatial resolutions of 30 to 60 meters.  Hyperspectral Data International
(HDI) collected 4 meter spatial resolution, hyperspectral imagery of the watershed from
July 24 through August 9, 2002.  Forest One Incorporated (Earth Satellite Corporation)
subsequently classified the resulting 82 flight-lines of Compact Airborne Spectrographic
Imager (CASI) data into the  11 classes of land cover type presented here.

-------
          Springfield-]
                                   South Charleston
                                  Xenia
                                  Wilmington
                                  Little Miami River
                         East Fork Little Miami River
                                                    Little Mia mi River
                                                      Watershed
                                                      Ohio. USA
                                                         A
                                                         60 Mies
                     FIGURE 1
Orientation Map of the Little Miami River Watershed
             in Southwestern Ohio, USA

-------
                      2. THE REMOTE SENSING EFFORT

2.1.   INTRODUCTION
      The remote sensing effort consisted of planning and implementing all the
logistics and details necessary for the collection of CAS I data from a fixed-wing aircraft.
The CASI instrument used in this project was a rack-mounted, fully programmable, high
resolution (12 bit resampled to 16 bit) pushbroom imaging spectrometer system with the
capability to collect data in 19 channels between 400-950 nm at intervals ranging from
as small as 2 nm in width to tens of nanometers.  This system was fully georeferenced
using data from an aircraft mounted gyroscope, an aircraft mounted Global Positioning
System (GPS), base station GPS data,  and Digital Elevation Model (DEM) data (see
Figure 2). As a result,  all recorded data for this project was corrected for variations in
aircraft altitude and attitude during flight. Spectral wavelengths and widths chosen for
this project evolved from conversations with Herb Ripley of HDI, Inc. (based upon his
many years of experience with this particular instrument), Dr. Prasad Thenkabail from
the Center for Earth Observation at Yale University (also see Thenkabail et al., 1994,
2000), and David Williams from the U.S. EPA's Environmental Services  Division in
Reston, Virginia. The resulting  19 spectral bands chosen were selected in the hope  of
best discerning both the urban and agricultural landscapes expected in this particular
watershed (Table 1). Flight line acquisition dates for this mission are shown in Figure 3.
A detailed flight log is also provided in Appendix A.
      The remote sensing effort also involved a series of pre-processing corrections
(radiometric, geometric, and atmospheric) required to make the CASI data suitable for
input and analysis within a geographic information system (CIS) as well  as the next
step, classification of this data into a land use/land cover product (Section 3).

2.2.   RADIOMETRIC CORRECTION
      Two files derived from data collected during the over-flights were  used  as a basis
for all image processing steps in this  project.  One was an ASCII file which contained
aircraft CASI pitch, roll, and GPS data.  The other was a raw image file in band
interleaved-by-pixel (BIP) format.  Remote  sensing instruments on the aircraft (i.e., the
imager, gyroscope, GPS, computer and hard drive storage,  etc.) were linked by a
Pulse-Per-Second (PPS) signal and time stamp based on Greenwich Mean Time
(GMT).

-------
                  FIGURE 2
Remote Sensing Instrumentation Used in this Project

-------
TABLE 1
Spectral Bands Remotely Sensed in the Little Miami River Project
Band
1
2
3
4
5
6
7
8
9
10
11
Spectral
Region
Blue
Blue
Green
Green
Green
Green-Red
Red
Red
Red
Red
Red-edge
Band
Center
Nm
449.6
490.4
520.2
550.2
574.6
600
619.8
659.6
674.8
691
700.5
Band Width
(+/-)
Nm
15.0
15.0
9.5
9.5
7.7
8.6
7.7
7.7
7.8
4.9
4.9
Band
Range
Nm
30.0
30.0
19.0
19.0
15.4
17.2
15.4
15.4
15.6
9.8
9.8
Comments

Crop to soil reflectance ratio
minima.
1 st order derivative. Positive
change in reflectance per unit
change in wavelength is
maximized. Pigment content.
Green band peak. Related to
total chlorophyll.
1 st order derivative. Negative
change in reflectance per unit
change in wavelength is
maximized. Pigment content.


Chlorophyll absorption pre-
maxima.
Chlorophyll absorption
maxima. Greatest crop-soil
contrast. Related to
chlorophyll a and b.

Chlorophyll absorption post-
maxima.

-------
TABLE 1 cont.
Band
12
13
14
15
16
17
18
19
Spectral
Region
Red-edge
Red-NIR
NIR
NIR
NIR
Shoulder
NIR Peakl
NIR Peak2
NIR-
Moisture
Sensitive
Band
Center
Nm
719.6
750.1
799.9
820.1
845.1
899.9
920.2
937.5
Band Width
(+/-)
Nm
6.8
10.7
10.7
9.7
9.8
10.7
9.8
7.9
Band
Range
Nm
13.6
21.4
21.4
19.4
19.6
21.4
19.6
15.8
Comments
1 st order derivative. Maximum
change in slope of reflectance
spectra per unit change in
wavelength. Vegetative stress.
Nitrogen status of plants.
Red edge/vegetative stress.

Atmospheric water
absorption/correction
Center of near-infrared (NIR)
shoulder. Broad or narrow
band may provide same result.
Related to total chlorophyll.
Crop growth, stress or
senescense. Useful for crop
moisture sensitive index.
Crop growth, stress or
senescense.
Atmospheric water
absorption/correction.

-------
                                Acquisition Dates
                                Dats_
                                    7/25.'2Q03
                                    7 1-31, '2003
                                    mama
                                    S 9 2003
               FIGURES


Flight Line Acquisition Dates of CASI Data
    in the Little Miami River Watershed
                    7

-------
      Radiometric correction was the first and most important step in the image
processing stream. In short, this correction calibrates pixel digital numbers to radiance
units. This is necessary because imagery from any given sensor may be influenced by
factors such  as changes in scene illumination, atmospheric conditions, viewing
geometry variations, and instrument response characteristics (Lillesand and Kiefer,
1994).  The CASI instrument used in this project normally undergoes at least one and
sometimes more system calibrations per year. The system manufacturer conducts this
calibration in the laboratory using an U.S. National  Institute of Standards and
Technology (NIST) radiance standard. Full frame uniformity and absolute data  at each
aperture stop are obtained to determine Radiant Sensitivity Coefficients (RSCs) for all
pixels in the CCD array. The RSC files are used to convert scene data into units of
spectral radiance.  The calibration procedures compensate for variations in optical
transmission and CCD responsivity. Signal contributions arising from electronic offset,
dark current, frame shift smear and scattered light are also removed. CASI image scan
lines were then correlated and interpolated with the internal navigation data records,
i.e., position  (X,  Y) and CASI gyro-based attitude (pitch and roll) measurements, to
create a single file suitable for geocorrection.

2.3.   GEOMETRIC CORRECTION
      Each flightline was  geometrically corrected and geographically registered using
the CASI manufacturer's software (ITRES Research Limited, 2006). This project
generated CASI data with a nominal accuracy of 3  pixels RMS or 4 meter RMS
accuracies which fell within the published accuracies of the CASI instrument. PCI
Geomatica geocorrection software was employed subsequent to the ITRES procedures
to further refine the geographic accuracy. Geometric correction also involved the use of
road vectors (Wessex Inc., 1997) to ensure accurate map registration.

2.4.   ATMOSPHERIC CORRECTION
      Each geocorrected flight line was then atmospherically corrected using ACORN
(Atmospheric CORection Now) software  (Analytical Imaging and Geophysics LLC.,
2002).  In brief, ACORN assesses, models, and compensates for the atmosphere to
convert input radiance spectra to apparent surface  reflectance. After atmospheric
correction, the spectral absorption features inherent to surface materials are revealed.
This software uses the MODTRAN4 algorithm for atmospheric correction of calibrated
hyperspectal and multispectral data in the 350 to 2500 nm spectral range.
                                       8

-------
                          3. IMAGE CLASSIFICATION

3.1.   INTRODUCTION
      The end product objective for this project was to develop a 4-meter spatial
resolution classification of the entire Little Miami River watershed, and to do so with an
accuracy of 80% or better for each class. Particular emphasis was placed on discerning
landscapes thought to potentially contribute nutrients to the Little Miami  River and its
tributaries (i.e., various chemical species of nitrogen or phosphorus). A variety of
classification methods were considered during the preparation of a quality assurance
project plan for this project.  The final classification approach chosen (a  hierarchical,
two-step scheme incorporating the strengths of both object-image segmentation and
pixel-based approaches) is detailed in the next section.

3.2.   THE CLASSIFICATION PROCESS
      Key steps for deriving the Little Miami River Watershed Land Use/Land Cover
Classification presented in this report included:
   •  Reviewing the quality of the CASI data for purposes of classification (Section
      3.2.1);
   •  Additional ground-truth work through the collection and interpretation of aerial
      images from 2002 and 2003 (Section 3.2.2);
   •  Generating classification results (Section 3.2.3); and
   •  Ensuring consistency of the classification across flightlines (Section 3.2.4).

3.2.1. Quality Review of the CASI Data for Purposes of Classification. Considering
the climate conditions during the period of collection,  the quality of the CASI data was
deemed sufficient for image classification purposes.  However, two artifacts present in
the data did require some additional work. The first type of artifact was the presence of
clouds and corresponding cloud shadow in some of the imagery.  Overall, this type of
artifact was minor except for a 342 acre gap between flight lines 13a and 13b which
remained unclassified (Figure 4).
      The second type of artifact was cross-track illumination effects. Cross-track
illumination artifacts routinely occur in airborne hyperspectral data and are a result of
sun-sensor-target-geometry, atmospheric conditions, differential path length (across the
instantaneous field of view or IFOV) and spectral band selection.  The CASI dataset of
the Little Miami River watershed is variably affected by cross-track illumination artifacts

-------
•1 U-v B*wd ttouk 1173.1 I 2

                                KBSflOO  "2M  16,800   22.4M
                                    FIGURE 4



        An Unclassified Gap Between Flight Lines 13a and 13b, July 25, 2002
                                        10

-------
with some flight lines more affected than others. The procedure used to evaluate the
data was to examine each flight line both visually and statistically.  Figures 5 and 6
provide a graphical representation of cross-track illumination. These figures show a
section of uncorrected CASI data for two different flight lines acquired on July 24 and
July 25 respectively.  A distinct brightening on the right side of the images is apparent
and visible  in the cross sections on the right side of the graphics.  These cross sections
were generated by averaging the Digital  Numbers (DNs) under  a graphic mask in each
band in the cross-track direction of the flightline. A polynomial curve has been fitted to
the averaged profile to demonstrate the increased DN values at the edges of the flight
line relative to the center. There is variability in the  cover type under the mask as can
be seen in the spectral  profiles, however, the polynomial curve represents an averaged
measure and is generally representative of overall trends.  The  elevated DNs on the left
of the flight line are not overly apparent in the image on the left.
       The  results of image classification using data with cross-track illumination effects
can be variable and is dependent on the spectral properties of the individual classes
relative to the scene. In general, the ability to extract any particular class from a dataset
is based on the presence of a unique statistical signature that represents a subset of the
full  scene variance.  In the case of multiple classes, the individual classes must also be
unique relative to each  other. Class confusion occurs when there is statistical overlap
between two  distinct classes. In images with cross-track illumination effects, the spectral
signature of ground cover types is different at the edges of the image than  at the center.
Consequently, training areas selected at the edges  of the image will have a different
spectral signature, or statistical representation, than at the center of the image.
       Two approaches were considered to  mitigate the effects  of these cross-track
illumination artifacts:  (1) correction of the individual flight lines to remove the cross-track
illumination effects, or (2) adapt to the cross-track illumination effects during the
classification  phase of the project. It was determined that the first approach was
beyond the scope and available budget of the project since it would require substantial
reprocessing  of the data including performing corrections on all  82 flightlines and a
reapplication  of geometric and atmospheric  corrections to each  flight line. As such the
second approach of adapting to the  cross-track illumination effects in the classification
procedure presented a  more economical approach for this project and may in fact  be
superior to  the first approach given the paucity and  nature of  correction algorithm tools
available in most image processing software.  Moreover, the second approach can
effectively deal with misclassification due to cloud and shadows. The chosen approach
                                        11

-------
                               FIGURES



Cross-Track Illumination Artifacts from Flight Line 38 Acquired on July 24, 2002
                               FIGURES



Cross-Track Illumination Artifacts from Flight Line 10 Acquired on July 25, 2002
                                  12

-------
basically amounts to increasing the number and strategic distribution of training areas in
the affected flight lines such that an accurate representation of each class is acquired.

3.2.2. Ground Truth. In addition to 390 ground truth points collected by U.S. EPA
personnel during the image acquisition in 2002, supplementary ground training samples
were collected using two different sources  of high resolution aerial images. The first
was the 2003 metropolitan Cincinnati digital orthophoto quads (Aerials Express Inc.,
2003) used to sample urban classes.  The second dataset consisted of un-rectified
2002 aerial images from the Center for Mapping at the Ohio State University used to
increase the number of sample locations in agricultural fields. Overall, 3613 and 351
reference  polygons were obtained from each data set,  respectively.  Reference
polygons from  aerial images were selected from homogeneous areas of 40m x 40m (or
0.4 acre) minimum. Some of this data was used for training the two supervised
classifications: image object segmentation and spectral angle mapper (SAM).  Other
portions of it were used for accuracy assessment following classification (Section 4.3).
      Obtaining adequate ground truth data was a key component of this project and a
critical aspect to acquiring ground truth data from secondary sources was in setting
business rules or guidelines which precisely specified the process to be followed. The
key guidelines followed included:
   1.  Location of sample site
      It is not  uncommon to collect information on the wrong location because of
      inadequate procedures. This project used orthophotos of high spatial accuracy
      (<2m) as base layers for the ground truthing process.
   2.  Data collection and entry error
      Data collection errors occur when measurements are done incorrectly and
      variables of the classification scheme are misidentified (i.e., crop type). Ground
      truth data given by EPA, particularly on specific variables such as crop type,  was
      the reference for the ground truthing process. Errors were monitored and
      removed from the data set when found.
   3.  Data collection consistency
      Training and the development of objective data collection procedures ensure
      data collection consistency. It is important to ensure that everyone identifying
      ground truth points follow the same  process. Appendix B contains the protocol
      used by EPA to collect ground truth data. Appendix C contains the supplemental
      ground truth data collection protocol for the project.
                                       13

-------
   4.  Date of reference data
      If change in land use/land class occurs between the date of imagery capture and
      the date of ground truth data, the results may be impacted.  Therefore, ground
      truth data were collected as close as possible to the date of the CASI  image
      acquisition. For example, EPA's ground truth data from the field coincided with
      the CASI flyover. Supplemental aerial images used as secondary sources of
      agricultural ground truth came also from  2002.  The 2002 LMR CASI dataset and
      the Aerials Express Inc. (2003) dataset (used as a secondary source of urban
      ground truth) are only a year or less apart, and  seasonal difference within the
      CASI dataset itself was also minimal (ranging only from late July through early
      August 2002).
   5.  Sampling design
      The choice and distribution of ground truth samples is an important aspect of
      accuracy assessment. The concept of randomness is a central issue  to ensure
      that the samples are chosen without bias and eventually the accuracy of the map
      is statistically sound.  For this project, the 82 flight lines were used as  an
      independent spatial framework to guarantee that the required minimum number
      of ground truth data is collected for each class,  and that they are statistically
      random samples.
   6.  Number of ground truth data
      In the remote sensing community, a general guideline or good "rule  of thumb" is
      to collect a minimum of 50 samples for each class. The objective was to collect
      samples for each class for accuracy assessment and training set samples for
      each class on every flight line. Overall, 4354 ground truth samples were
      obtained; 902 of these were used for the accuracy assessment.


3.2.3. Classification Method.  A number of classification algorithms including
Maximum Likelihood Classification (MLC), Spectral Angle Mapper (SAM), and
Classification And Regression Tree (CART) were tested on the CASI imagery collected
from the Little Miami River watershed.  However, no single algorithm alone proved to be
capable of successfully classifying the hyperspectral data. Some of the problems
encountered included the following:
   •  Sub-classes of forest (e.g., deciduous versus conifer) were not separable,
      especially  because the areas are difficult to find ground-truth for.
   •  Other vegetation, such as corn, grass, soybean, etc. gets classified  into forest,
      and vice versa.
   •  Pasture, soybeans, corn and  grass get confounded, particularly in urban areas.
   •  Roads are not discernible from urban/built and barren land covers.
   •  Rock and stone quarries were not discernible either.
                                      14

-------
      Nevertheless, a solution was found to resolve the above problems and to obtain
as many classes as possible under the study constraint of 80% accuracy per individual
class. The solution was to use a hierarchical approach using two different classification
algorithms: "Image Object Segmentation" and "Spectral Angle Mapper."
      Image object segmentation  is an innovative method which uses
homogenous image objects as processing units at a given scale for classification,
instead of pixels.  One motivation for the image object approach is to make use
of powerful generalizations of the image to generate level-1  strata of different
contexts.  With the creation of these strata (urban, rural and water), further
independent classification (e.g., pixel-based or other algorithms) of each stratum
can be applied and then the results combined later to improve the accuracy of
the final classified dataset.
      Figure 7 shows an example result of image object segmentation versus pixel-
based classification of the same forest area in the Little Miami watershed.  Forest areas
typically contain areas of dark shadows, bright canopy tops, and medium bright
illumination under the tree canopy, which pixel-based approaches without level-1 strata
classify into all different classes, for example: forest (dark green), corn (gold), and
soybean (light green).  Unreasonable classes, such as corn and soybean crops in the
midst of forest land will be removed if a hierarchical approach  is used. Also, image
object segmentation could  also be  used without any negative effect on delineating
riparian features such as a thin forest strip along a river.
      On the other hand, the Spectral Angle Mapper (SAM) is a whole pixel, spectral
method for supervised classification developed specifically for hyperspectral data. It is
based on entirely different principles than common multispectral classification
algorithms such as the probability-based Maximum Likelihood Classifier.  SAM is
thought of as a similarity classifier and can be used on multispectral data too, often with
improved results in comparing the  spectral properties of materials. In brief, it is a
physically-based spectral classification that uses an n-dimensional angle to match the
spectra of imaged (remotely sensed) pixels to reference spectra.  The algorithm
determines the spectral similarity between image and reference spectra by calculating
the spectral angle between them, treating them as unit vectors in spectral space with
dimensionality equal to the number of bands (in this particular case, 19 bands or
dimensions). Since only the angle between the vectors matters (the smaller the angle,
the better the match) and not the vector's length, brightness variations such as
topographic and cross-track illumination effects are well accounted for. In other words
SAM's advantage over common or more "traditional" classifiers is its relative
                                       15

-------
                      FIGURE/

Image Object Segmentation versus Pixel-Based Classification
     of a Forest Area in the Little Miami River Watershed
                          16

-------
insensitivity to illumination and albedo effects inherent with remotely sensed imagery
(Research Systems Inc., 2002; Kruse et al., 1993).
      In order to combine the power of both approaches, object image segmentation in
eCognition (Definiens Imaging GmbH., 2003) was applied as a "Level 1" classification of
water, urban, and rural features which required consideration of large-scale factors, as
well as area-based parameters such as adjacency, texture and shape. Next, Spectral
Angle Mapper in ENVI (Research Systems Inc., 2003) was applied as a "Level-2"
classifier to discern the urban and rural areas into more specific classes (barren, built,
grass, corn, soypbeans, wheat, etc.). Wire diagrams of the methodology used are
shown in Figures 8 and 9. Throughout the process, image object segmentation created
a spatially exclusive mask of urban and rural regions and then each region was filled
with the classification result from SAM. As a consequence, the final classification result
remained a uni-scale product with a spatial resolution of 4 meters throughout the entire
watershed.
      Training samples were chosen along all flight lines and selected in order to
capture  intra-class variation. The same classification rules were applied to each
flightline. After the first round of classification, results were fine-tuned by adding training
sets to accurately define inter-class boundaries.  The classification results for each
individual flight line were assessed for accuracy and accepted if they did not show any
overall discrepancies with respect to the aerial images.  Classification results were also
re-examined after joining the classified flight lines together in a mosaic of the
watershed.
      Any problem areas noted during this stage were addressed during the QA/QC
phase (Section 3.2.4 and Section 4).
      Overall, 11 land cover classes were mapped. A twelfth class included pixels that
could not be classified, for example, due to cloud cover.  The 11  land cover classes
consisted of:
      Lentic: Open water associated with still water systems, such as lakes,
      reservoirs,  potholes, and stockponds.  Such bodies typically do not have a
      defined channel or associated floodplain.
      Lotic:  Open water associated with running water systems, such as rivers or
      streams. Such waterways typically have a defined channel and an associated
      floodplain.
      Forest: Contains either or both  deciduous and coniferous trees in any degree of
      mixture.  Single stemmed, woody vegetation with canopy spanning greater than
      4 meters and tree canopy accounting for 25-100% of the cover
      Corn: Area under cultivation of food and fiber, where corn  is the primary crop.
                                       17

-------
Flight line-
   1
                                                                  Flight line
                                                                    -82
Ground
 Truth
                                      - Special care was placed on
                                      riparian features here
                                      - Water was categorized into
                                      lotic and lentic
                                      - Removal of cross track
                                       lumination effect
                          FIGURES
         Flowchart of the Classification  Method
                               18

-------
    I Level 2 Classification
    i  ENVI Spectral Angle
         Mapper
                                Special care was (
                                placed on riparian i
                              i    features    '
                              L
                                    FIGURE 9

Schematic of the Hierarchical Classification and Derived Classes (shadowed)
                                        19

-------
      Soybean: Area under cultivation of food and fiber, where soybean is the primary
      crop.

      Wheat: Area under cultivation of food and fiber, where wheat is the primary crop.

      Dry Herbaceous: Dominated by dry and/or less vigorous herbaceous vegetation;
      herbaceous vegetation accounts for more than 25% of the ground cover.  This
      class mainly includes naturally occurring  and unmanaged herbaceous vegetation,
      and dried out, unhealthy, or stressed croplands.  Dry herbaceous vegetation
      prevailed in croplands, as well as, "Other Agriculture" lands (fallow, hay, pasture,
      or natural grassland prairies or fields), due to drought in the Summer of 2002.
      Dry herbaceous vegetation had little chlorophyll content and very similar spectral
      signatures without regard to vegetative species.

      Grass: Dominated by cultivated grasses planted in developed settings for
      recreation,  erosion control, or aesthetic purposes. Examples include parks,
      lawns,  golf courses,  airport grasses, and  industrial site grasses.

      Urban Barren: Composed of bare soil, rock, sand, silt, gravel, or other earthen
      material with little (less than 25%) or no vegetation within urban areas.
      Examples include exposed soil in urban areas and construction sites.

      Rural Barren: Composed of bare soil, rock, sand, silt, gravel, or other earthen
      material with little (less than 25%) or no vegetation in rural areas. Typically
      fallow fields are included in this class too.

      Urban/Built: Areas covered by structures and impervious surfaces in urban,
      suburban, and rural  areas. Typically buildings, parking lots, and paved roads.

      Unclassified: This class includes areas of image gaps among flight-lines and
      cloud cover where land cover classification was not feasible.

      Figure 10 shows the class code, name and corresponding color schemes for all
classification results.  Figure 11 provides an overview of the LULC classification.
                   QassCode   I    CbssNaroe               Color
                                   FIGURE 10
                    Legend for the Final 4m x 4m LULC Product
                                       20

-------
                                 FIGURE 11



Overview of the 4m x 4m LULC Classification for the Little Miami River Watershed
                                     21

-------
      Final products were delivered in the projection and data format provided in
Table 2.
TABLE 2
Projection for the Little Miami River Land Use/Land Cover
Projection:
1st Standard Latitude:
2nd Standard Latitude:
Latitude of Projection's Origin:
Central (Meridian) Longitude
Origin:
Datum (Ellipsoid)/Spheroid
Units:
Orientation:
Pixel Size:
Precision for mosaicked flight
lines:
Data Format:
Naming Convention
Albers Conic Equal-Area
29 degrees, 30 minutes, 00 seconds
45 degrees, 30 minutes, 00 seconds
23 degrees, 00 minutes, 00 seconds
-96 degrees, 00 minutes, 00 seconds
NAD83/GRS80
Meters
North up
4 meters
+ 3 pixels
Erdas Imagine 32 bit signed integer .img file format
Oriainal/unsmoothed LULC product:
little_miami_river_watershed_4m_before_clumping_
signed32bit.img
Smoothed LULC product:
Iittle_miami_river_watershed_4m_signed32bit.img
                                       22

-------
3.2.4. Consistency of Classification Across Flightlines. Since there were 82
independent flight lines of imagery, it was important to manage consistency of
classification across the flight lines.  In order to do this, the classification team classified
and conducted an extensive accuracy assessment on 8 selected flight lines (10% of
total number of images). As the selected flight lines were classified to an acceptable
level of accuracy, they were used to aid in ensuring consistency across all the
neighboring images.  When an initial classification was completed for any given flight
line,  it was compared to all of its neighbors and any distinct classification differences
along the boundaries or overlap between flight lines were addressed. This process
helped to mitigate categorical edge-matching errors when the 82 individual classified
flight lines were finally stitched together into the watershed mosaic. Discontinuities
along image boundaries were removed using polynomial based rubber-sheeting using
ground control points and corresponding cut-lines.  The final mosaicked product has
three pixel RMS errors on 4 meter spatial resolution data.
                                       23

-------
                      4.  POST-PROCESSING AND QA/QC


      Image post-processing and QA/QC following the Level 1 and 2 classifications
included manual editing (Section 4.1) and map generalization to create a second LULC
product (Section 4.2). Follow-up steps used to ensure the accuracy and proper
interpretation of the end product included:
   •  The completion of an accuracy assessment including an error matrix and
      computations of overall, producer, and user accuracy (Section 4.3); and

   •  Generating metadata for the final product compatible with the Federal
      Geographic Data Committee (FGDC) standard (Appendix D).


4.1.   MANUAL EDITING
      Manual editing was used as a final QA/QC step.  Final image edits included:
      1)     Differentiation of water bodies into lotic and lentic. The discrimination of
            lotic versus lentic waters was completed using the U.S. EPA's National
            Hydrography Dataset or "NHD."
      2)     Misclassifications due to clouds, shadows, or haze.  Hyperspectral images
            were systematically examined along with the classification results to
            identify any misclassified pixels due to the presence of clouds, shadows,
            or haze. These pixels were then manually included in a mask to convert
            them to the "Unclassified" class.
      3)     Removal of boundary effects. Cross-track illumination affected the SAM
            classification results along the boundary of each flight line. As such,
            manual editing was necessary to correctly assign the classes. The other
            task was to remove any inconsistent classifications in overlapping flight
            lines due to the temporal gap in image acquisition. Atmospheric
            conditions varied day-by-day, and a 16-day gap in image acquisition
            throughout the watershed is long enough to expect some changes in
            vegetation condition.  The mosaicking phase was prolonged due to cross-
            track illumination effects and the temporal gap in image acquisition. All
            the image boundaries had to be manually inspected and the correct class
            in the overlap area determined. The percentage of "dry herbaceous" may
            have increased in the southern region due to the later dates of imagery
            acquisition. Wheat class was scarce in the southern region as the fields
            had been harvested, or had senesced. Small portions of some soybean
            and corn fields had also senesced and, in such cases, were not assigned
            to the correct class.  Such fields were manually interpreted and assigned
            to the correct class.
      4)     Other discernible misclassifications.  Dark shadow areas and  black
            asphalt pavement were easily confused with water bodies.  In such cases,
            these areas were manually interpreted and assigned to the correct class.
                                      24

-------
4.2.   MAP GENERALIZATION
      A "clump-sieve-and-fill" technique was used to eliminate single pixels or
groupings of pixels that were smaller than the minimum target mapping unit (e.g.,
random pixels of "Forest" denoting scattered trees in an otherwise homogeneous 40-
acre plot of "Corn"). As a result, a second LULC mosaic product was produced which
eliminated the "salt and pepper" effect common in classifications of smaller pixel or
"finer" spatial resolution imagery.  Although smoothed images are generally more
"pleasing" to the eye, U.S. EPA was concerned about any smoothing process which
might potentially wipe out important small features, or thin, linear features such as
riparian forest.  On the other hand, available literature also suggested that whereas
unsmoothed or  "salt and pepper" classification results may be more realistic (e.g., cases
where random pixels of "Forest" or scattered trees really do exist within a "Corn" field),
smoothed classifications may be more meaningful in terms of deriving land use
statistics important for interpreting  ecological processes across the landscape (e.g.,
Burnett and Blaschke, 2003; Dorren et al., 2003).  As a result, two LULC products were
created for the Little Miami River watershed, one smoothed and one not.  For the
smoothed product, the minimum mapping unit of the classification result is about 0.04 of
an acre as represented by 10 pixel clusters (at 4m x 4m spatial resolution), or linear
chains of minimally four contiguous pixels in any direction.  The original, unsmoothed
product remains at a 4m x 4m spatial resolution.

4.3.   ACCURACY ASSESSMENT
      The accuracy assessment was based on whether the majority of classed pixels
in a 3 x 3 pixel window, centered on a ground truth site, agreed or not. Thus,  if five or
more pixels were classified as corn, and ground truth indicated corn, then the majority
criterion was satisfied and "corn class" would be considered correct for that site.  A
standard error matrix was used in reporting classification accuracies (Table 3). This
matrix reports the number of pixels assigned to a particular category in a classification
relative to the number of pixels assigned to a particular category in a reference
classification. In this case, the classified data,  represented by rows in Table 3 and
Table 4, are the land cover classifications derived for this project, and the reference
data are represented by the columns in the tables. A total of  902 independent ground
truth sites were  used for the accuracy assessment, including  primary data (i.e., data
collected  by U.S. EPA scientists in the field at the time of the  overflights), and
secondary data from 2002 and 2003 aerial images of the watershed as explained in
Section 3.2.2.
                                       25

-------
TABLE 3
Classification Error Matrix
(units are the number of reference points)

Classified
Lentic
Lotic
Forest
Corn
Soybean
Wheat
Dry Herbaceous
Grass
Urban Barren
Rural Barren
Urban/Built
Column Total
Reference (Known Cover Types)
Lentic
85
3
0
0
0
0
2
1
0
6
1
98
Lotic
1
17
2
0
0
0
0
0
0
1
3
24
Forest
0
0
95
1
0
0
3
1
0
0
0
100
Corn
0
0
0
106
3
0
9
0
0
2
0
120
Soybean
0
0
0
3
107
0
12
0
0
0
0
122
Wheat
0
0
0
0
2
18
8
0
0
10
0
38
Dry
Herbaceous
0
0
3
1
5
0
82
8
0
1
0
100
Grass
0
0
4
0
1
0
16
75
1
3
0
100
Urban
Barren
0
0
0
0
0
0
0
8
57
0
4
69
Rural
Barren
0
0
2
0
0
0
8
0
1
20
0
31
Urban/
Built
0
0
0
0
0
0
0
0
4
1
95
100
Row
Total
86
20
106
111
118
18
140
93
63
44
103
902
26

-------
TABLE 4
Summary Statistics of the Accuracy Assessment
(Overall Accuracy of Classification = 83.92%)
(KHAT statistic = 0.82)
Class Name
Lentic
Lotic
Forest
Corn
Soy
Wheat
Dry Herb
Grass
Urban Barren
Rural Barren
Urban/Built
Reference
Totals
98
24
100
120
122
38
100
100
69
31
100
Classified
Totals
86
20
106
111
118
18
140
93
63
44
103
Number
Correct
85
17
95
106
107
18
82
75
57
20
95
Producers
Accuracy
86.73%
70.83%
95.00%
88.33%
87.70%
47.37%
82.00%
75.00%
82.61%
64.52%
95.00%
Users
Accuracy
98.84%
85.00%
89.62%
95.50%
90.68%
100.00%
58.57%
80.65%
90.48%
45.45%
92.23%
27

-------
      Three major descriptive measures were used for accuracy assessment, namely:
"overall accuracy," "producer's accuracy," and "user's accuracy."  Overall accuracy is
computed by dividing the total number of correctly classified pixels (the sum of the
elements along the major diagonal in the error matrix) by the total number of reference
points. Producer's accuracy indicates how well reference pixels of the given cover type
are classified.  It is computed by dividing the number of correctly classified pixels in
each category (on the major diagonal of the error matrix) by the number of reference
points used for the class (the column total). User's accuracy is a measure of
commission error and indicates the probability that a pixel classified into a given
category actually represents that category on the ground. It is computed by dividing the
number of correctly classified pixels in each category (on the major diagonal of the error
matrix) by the total number of pixels classified into that category (row total) (Lillesand
and Kiefer, 1994).
      Tables 3 and 4 present the quantitative summary of the accuracy assessment for
the classification result using 902 reference points. The overall classification accuracy
was 83.92%, which is above the project's target of 80% accuracy.  However, the
producer's and user's accuracy of some classes fell short of the target of 80%. The
KHAT statistic, a comparative statistic to use  in comparing this classification product
with others, was 0.82. In other words, the classification result here was 82% better than
one resulting from chance.  Overall, the strengths of this classification relative to other
existing LULC datasets of this watershed, such as the National Land Cover Dataset or
NLCD (Vogelmann et al., 2001), and the State of Ohio Land Cover (Ohio DNR, 1994)
are:
    •  Higher spatial resolution (e.g., 4m x 4m rather than 30m x 30m pixel resolution of
      previous existing classifications of this watershed),
    •  Higher spectral resolution (the classification is derived from more spectral bands
      of information than provided by Landsat platforms),
    •  Ground-truth  data used as a basis for this classification includes primary sources
      of information taken in the field at the same time the remote sensing data was
      collected,
    •  The discernment of corn from soybeans (the two major agricultural crops found
      within this watershed), and
    •  Better spatial and updated temporal discernment of urban built (existing and new
      infrastructure) versus urban barren (newly developing areas).
                                       28

-------
                         5. RESULTS AND DISCUSSION


      A basic understanding of the physical (i.e., geological and anthropogenic)
processes at work in the Little Miami River watershed will help the user of this LULC
dataset interpret some of the resulting land use patterns shown in Figure 11.  For
example, one should note that the northern half of the watershed is relatively flat and
composed of "Till Plains" with soils that developed from the loamy, limy deposits of the
Wisconsinan glaciation roughly 18,000 years ago. These soils normally have better
natural drainage and fertility than those of the southern half of the watershed (or "Drift
Plain"). The southern half of the watershed has more deeply-leached, acidic, pre-
Wisconsinan till and thin loess,  as well as,  very poorly-drained soils with fragipans
(clays). The southern half of the watershed also exhibits relatively modest relief, but
with dissected areas and somewhat more complex topography than the north (Omernik,
1987; Woods et al., 1998).  As such, the northern and southern parts of the watershed
may be expected to have different types and proportions of certain land uses or land
covers based on the differing soils and micro-climates found in these two distinct
"ecoregions" (the "Till Plain" and the "Drift Plain," refer to Figure 1).
      Spatial patterns separating western and eastern portions of the watershed exist
too. Perhaps most notable is the western urban/exurban corridor stretching from
Cincinnati (in the south) to Dayton and Xenia (in the north) and beyond, encompassing
portions of Hamilton, Warren, and Montgomery Counties. These growing urban
landscapes run  parallel to and already straddle much of the mainstem of the Little Miami
River which can be observed as a nearly contiguous linear band of riparian forest
running up along the western part of the image. The eastern half of the watershed
tends to be more agricultural in character, particularly in the north.  But this appears to
wane  in the east-central part of the image near the city of Wilmington (a crossroads or
pole for the primary economic sector in this region, as well as, a major air transportation
hub), and in the south as well, particularly along the East Fork of the Little Miami River
in Clermont County where urban development and human population continue to rapidly
grow.
      Other major patterns in the final classification include concentrations of "dry
herbaceous" land cover in the western part of the watershed near the urban/rural fringes
of Warren, Montgomery and Green Counties, as well as along, or at the source of,
many headwater streams to the east (i.e., dry,  thin vegetative areas buffering perennial
or low-flow streams from adjacent croplands).  Recall that "dry herbaceous" was defined
                                       29

-------
in this project as a variety of "Other Agricultural" components including hay, pasture,
fallow, dried out crops, and natural herbaceous vegetation. As such, concentrations of
dry herbaceous land cover in the west are likely non-irrigated lands, areas practicing
water conservation, or perhaps areas left fallow due to failed crops or in anticipation of
near-term development.  The thin lines of dry herbaceous cover bordering perennial or
low-flow streams in the east was likely spectrally distinct, or separable from corn and
soybeans based on aerial photography, yet still dry from drought and "managed" only  in
the sense that it was not mowed or turned over by farmers.
      The ratio of corn to soybeans is  lower in the southern part of the watershed. This
is expected to a certain extent since soybeans are well known to be common and well
adapted to spring soil wetness in the southern half of the Little Miami River watershed
(Woods et al., 1998).  This observation may well represent the reality of crop planting
patterns in 2002, and/or failed corn crops susceptible to the early flood then  drought
extremes experienced that year (and thus, classifications of "dry herbaceous" rather
than "corn").  The planting of corn was delayed at least three weeks in many areas of
Ohio in 2002 due to heavy spring rains and flooded fields that year. However,
subsequent to seeding, drought conditions ensued and lingered in 2002.
      The summer drought of 2002 likely affected other classes in this LULC as well.
For example, lotic or running waters were rarely detected because many drainage ways,
or headwater or perennial streams during the period of late July to early August were
already dry or experiencing low flows.  Un-watered grasses in otherwise managed
grassy areas might also have been classified as "dry herbaceous."
      Nevertheless, the resulting LULC is an important dataset for a variety of
environmental and geographic studies within the Little Miami River watershed. Even
given the predominance of the "dry herbaceous" class, it remains meaningful in terms  of
studying several urban and agricultural patterns or gradients, as well as, anthropogenic
and natural processes within the watershed. Many hydrological, ecological,  and
geological applications may be possible too, along with the ability to assist in urban
planning, study urban sprawl, and contribute to measurements and evaluations of
zoning, congestion, pollution and human health.
                                       30

-------
                              6. REFERENCES
Aerials Express Inc. 2003. Cincinnati Metropolitan Aerial Photography. Tempe, AZ.

Analytical Imaging and Geophysics LLC. 2002. ACORN Quick Start Guide.  ACORN
Version 4.10, March, 2002 Edition. Boulder, CO.

Burnett, C. and T. Blaschke.  2003. A multi-scale segmentation/object relationship
modelling methodology for landscape analysis.  Ecol. Model.  168:233-249.

Definiens Imaging GmbH.  2003.  eCognition 3.0. Munich, Germany.

Dorren, L.K.A. , B. Maier and A.C. Seijmonsbergen. 2003.  Improved Landsat-based
forest mapping in steep mountainous terrain using object-based classification. Forest
Ecol. Manag.  183:31-46

ITRES Research Limited. 2006. ITRES: Airborne Hyperspectral  Remote Sensing
Systems & Solutions.  Calgary, Alberta, Canada. Available at http://www.itres.com.

Kruse, F.A., A.B. Lefkoff, J.B. Boardman, K.B. Heidebrecht, AT. Shapiro, P.J. Barloon,
and A.F.H. Goetz.  1993.  The Spectral Image Processing System (SIPS) - Interactive
Visualization and Analysis of Imaging spectrometer Data. Remote Sens. Environ.
44:145-163.

Lillesand, T.M. and R.W. Kiefer.  1994. Remote Sensing and Image Interpretation, 3rd
ed.  John Wiley and Sons, Inc., New York, NY.

Ohio DNR (Ohio Department of Natural Resources).  1994. State of Ohio Land Cover.
Division of Real Estate and Land Management.  Columbus, OH.

Omernik,  J.M. 1987.  Ecoregions of the conterminous United States.  Map
(1:7,500,000). Ann. Assoc. Am. Geograph.  77(1):118-125.

Research Systems Inc. 2002. Spectral Analysis with ENVI. Boulder, CO: The
Professional Services  Group, Research Systems, Inc., A Kodak Company.

Research Systems Inc. 2003. ENVI 3.6 (the Environment for Visualizing Images).
Boulder, CO.  Available at http://www.ResearchSystems.com.

Sanders,  R.E., Ed. 2002. A guide to Ohio streams. Columbus, Ohio: Streams
Committee,  Ohio Chapter of the American Fisheries Society.  In partnership with the
Ohio Environmental Education Fund, Ohio Department of Natural Resources, and the
Ohio Environmental Protection Agency.
                                     31

-------
Thenkabail, P.S., A.D. Ward, J.G. Lyon and C.J. Merry.  1994.  Thematic Mapper
vegetation indices for determining soybean and corn crop parameters.  Photogramm.
Eng. Rem. S. 60:437-442.

Thenkabail, P.S., R.B. Smith and E. De Pauw. 2000. Hyperspectral vegetation indices
for determining agricultural crop characteristics.  Remote Sens. Environ. 71:158-182.

Vogelmann, J.E., S.M. Howard, L. Yang, C.R. Larson, B.K. Wylie and N. Van Driel.
2001. Completion of the 1990s national land cover data set for the conterminous United
States from Landsat thematic mapper data and ancillary data sources.  Photogramm.
Eng. Rem. S. 67:650-662.

Wessex, Inc.  1997.  "First St."  software. U.S. Streets CDROM, Release 4.0. Winetka,
IL. Available at www.wessex.com.

Woods, A.J., J.M. Omernik, C.S. Brockman, T.D. Gerber, W.D. Hosteterand S.H.
Azevedo.  1998.  Ecoregions of Indiana and Ohio. Interior-Geological Survey, Reston,
VA.  Accessed May 19, 2005 at ftp://ftp.epa.gov/wed/ecoregions/oh  in/ohin front.pdf.
                                      32

-------
              APPENDIX A

FLIGHT LOGS OF THE CASI DATA COLLECTION
  IN THE LITTLE MIAMI RIVER WATERSHED
          June 24-August 8, 2002
                 A-1

-------
DATE
July 24, 2002















July 25, 2002

LINE

1
2
3
4
5
6
7

25
26
27
36
37
38
39

8
FILE
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
1
2
START
TIME
12:55:23
13:10:49
13:18:57
13:27:51
13:36:01
13:46:01
13:57:05
14:07:00
14:13:49
14:24:00
14:38:15
14:52:45
15:15:20
15:32:45
15:50:10
16:01:47
12:31:28
12:48:10
STOP
TIME
12:56:30
13:15:50
13:24:26
13:33:01
13:42:40
13:53:45
14:04:00
14:13:20
14:19:30
14:34:54
14:49:20
15:04:12
15:29:40
15:47:11
16:00:10
16:13:10
12:32:40
12:58:02
/STOP
5.6
5.6
5.6
5.6
5.6
4
4
4
4
4
4
4
5.6
5.6
5.6
5.6
4
4
INT
TIME
30
30
30
30
30
30
30
30
30
30
30
30
30
30
30
30
30
30
ALTITUDE
(FEET)
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
HEADING
VISIBILITY
(MILES)
SPEED
(MPH)
Dark Data
E
W
E
W
E
W
E
10
10
10
10
10
10
10
124
120
127
121
128
121
128
Dark Data
W
E
W
E
W
E
W
10
10
10
8
8
8
8
127
123
127
124
127
127
125
Dark Data
W
5
118
A-2

-------
DATE















July 31, 2002



LINE
9
10
11
12
13
13
14
15
16
17
18
19
20
21



22
23a
FILE
3
4
5
6
7
9
10
11
12
13
14
15
16
17
18
1

3
4
START
TIME
13:00:45
13:13:15
13:27:20
13:39:55
13:54:20
14:00:10
14:08:40
14:24:10
14:39:05
14:56:42
15:12:03
15:28:30
15:43:00
15:58:20
16:07:12
12:38:19
12:39:50
12:51:23
13:05:30
STOP
TIME
13:09:10
13:24:45
13:37:10
13:51:25
14:00:00
14:05:00
14:20:30
14:35:45
14:52:35
15:09:00
15:25:40
15:40:00
15:55:50
16:06:34
16:08:15
12:39:20
12:40:50
13:02:00
13:19:00
/STOP
4
4
4
4
4
4
4
4
4
4
4
5.6
5.6
5.6
5.6
4.0
5.6
4.0
4.0
INT
TIME
30
30
30
30
30
30
30
30
30
30
30
30
30
30
30


31
31
ALTITUDE
(FEET)
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500


10,500
10,500
HEADING
E
W
E
W
E
E
W
E
W
E
W
E
W
E
VISIBILITY
(MILES)
5
5
5
5
5
5
5
5
5
5
5
5
5
5
SPEED
(MPH)
130
119
130
118
132
116
132
116
132
116
132
116
132
116
Dark Data


W
E


8-10
8-10


133
105
A-3

-------
DATE















August 9, 2002




LINE
28
29
30
31
32
33
34
35
24
27
26
25
23b
21
20
57
56
55
54
53
FILE
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
2
3
4
5
6
START
TIME
13:34:30
13:47:01
14:03:10
14:15:10
14:30:14
14:41:45
14:56:25
15:10:29
15:31:15
15:46:48
15:51:56
15:54:45
16:02:17
16:05:10
16:09:35
12:58:56
13:17:48
13:38:26
13:56:41
14:18:34
STOP
TIME
13:44:30
14:00:05
14:12:55
14:28:15
14:39:45
14:54:30
15:05:30
15:25:38
15:41:30
15:50:38
15:53:32
16:00:25
16:03:33
16:08:20
16:13:00
13:15:57
13:36:24
13:54:55
14:14:40
14:34:59
/STOP
4.0
4.0
4.0
4.0
4.0
4.0
4.0
5.6
5.6
5.6
5.6
5.6
5.6
5.6

4
4
4
4
4
INT
TIME
31
31
31
31
31
31
31
32
32
32
32
32
32
32

31
31
31
31
31
ALTITUDE
(FEET)
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500

10,500
10,500
10,500
10,500
10,500
HEADING
W
E
W
E
W
E
W
E
W
E
W
E
E
W
E
W
E
W
E
W
VISIBILITY
(MILES)
8-10
8-10













>10
10
10
10
10
SPEED
(MPH)
130
105
130
105
133
105
142
105
140
105
140
105
105
140
105
130
112
127
115
130
A-4

-------
DATE







August 1 , 2002











LINE
52
51
50b
39b
38b
37b
36b


40
41
42
43a
43b
44
45
46
47
48
FILE
7
8
9
10
11
12
13
1
2
3
4
5
6
7
8
9
10
11
12
START
TIME
14:36:32
14:56:00
15:13:35
15:25:14
15:34:57
15:41:07
15:47:25
12:46:47
12:48:37
13:10:30
13:20:05
13:51:29
14:09:20
14:17:46
14:39:20
14:56:25
15:16:45
15:33:25
15:54:07
STOP
TIME
14:54:00
15:12:01
15:17:45
15:33:25
15:39:51
15:45:48
15:50:38
12:48:00
12:49:55
13:26:40
13:48:55
14:06:10
14:14:50
14:32:35
14:54:10
15:14:50
15:30:55
15:51:20
16:08:30
/STOP
4
5.6
5.6
5.6
5.6
5.6
5.6
5.6
40
40
40
40
40
40
40
5.6
5.6
5.6
5.6
INT
TIME
31
31
31
31
31
31
31
32
32
32
32
32
32
32
32
32
32
32
32
ALTITUDE
(FEET)
10,500
10,500
10,500
10,500
10,500
10,500
10,500


10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
HEADING
E
W
E
E
W
E
W


W
E
W
E
E
W
E
W
E
W
VISIBILITY
(MILES)
10
10
10
10
10
10
10


8-10
8-10
8-10
8-10
8-10
8-10
8-10
8
8
8
SPEED
(MPH)
120
135
115
115
135
116
135


136
114
138
110
110
137
108
138
110
137
A-5

-------
DATE


August 3, 2002


August 4, 2002




Augusts, 2002
August 7, 2002







LINE
49

7
13
23
1
2
3
4
5
50a
68a
74
75
76
77
78
79
80
FILE
13
14
1
2
3
1
2
3
14
19
1
1
2
3
4
5
6
7
8
START
TIME
16:10:35
16:36:10
13:19:12
13:27:26
13:34:56
12:59:12
13:01:56
13:07:29
13:09:21
13:11:02
13:06:02
13:16:21
13:27:02
13:35:43
13:45:11
13:53:28
14:02:20
14:10:46
14:17:53
STOP
TIME
16:29:20
16:37:10
13:23:28
13:29:35
13:37:23
13:00:28
13:04:55
13:09:18
13:10:29
13:12:31
13:24:15
13:23:34
13:33:38
13:42:24
13:51:42
13:59:52
14:08:09
14:16:18
14:22:07
/STOP
5.6
5.6
4.0
4.0
4.0
4
4
4
4
4
4
4
4
4
4
4
4
4
4
INT
TIME
32
32
31
31
31
31
31
31
31
31
31
31
31
31
31
31
31
31
31
ALTITUDE
(FEET)
10,500
5000
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
HEADING
E

W
W
E
W
W
E
E
E
W
W
E
W
E
W
E
W
E
VISIBILITY
(MILES)
8

3-4
3-4
3-4
<3
<3
<3
<3
<3
6
>10
>10
>10
>10
>10
>10
>10
>10
SPEED
(MPH)
110

112
110
130
117
117
133
133
133
130
120
120
120
120
120
120
120
120
A-6

-------
DATE








August 8, 2002










LINE
81
82
73
72
71
70
62-2
61-2
60
61-1a
62-1
63
64
65
66
67
69
68b
61-1b
FILE
9
10
11
12
13
14
15
16
1
2
3
4
5
6
7
8
9
10
11
START
TIME
14:23:41
14:30:21
14:37:37
14:47:04
14:56:29
15:07:05
15:21:57
15:25:31
13:09:19
13:30:20
13:49:53
14:06:54
14:23:12
14:39:09
14:54:03
15:09:00
15:22:45
15:37:23
15:48:19
STOP
TIME
14:27:42
14:33:08
14:44:19
14:54:24
15:04:30
15:15:44
15:23:00
15:27:26
13:27:22
13:46:49
14:04:36
14:21:54
14:37:35
14:52:30
15:07:00
15:21:00
15:34:56
15:44:25
15:55:02
/STOP
4
4
4
5.6
5.6
5.6
5.6
5.6
4
4
4
4
4
4
5.6
5.6
5.6
5.6
5.6
INT
TIME
31
31
31
31
31
31
31
31
31
31
31
31
31
31
31
31
31
31
31
ALTITUDE
(FEET)
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
10,500
HEADING
W
E
W
E
W
E
E
W
W
E
W
E
W
E
W
E
W
E
E
VISIBILITY
(MILES)
>10
>10
>10
>10
>10
>10
>10
>10
10
10
10
10
10
10
10
10
10
10
10
SPEED
(MPH)
120
120
120
120
120
120
120
120
120
120
120
120
120
120
120
120
120
120
125
A-7

-------
DATE


LINE
59
58
FILE
12
13
START
TIME
15:58:43
16:18:12
STOP
TIME
16:16:05
16:35:44
/STOP
5.6
5.6
INT
TIME
31
31
ALTITUDE
(FEET)
10,500
10,500
HEADING
W
E
VISIBILITY
(MILES)
10
10
SPEED
(MPH)
125
122
A-8

-------
                                 APPENDIX B
        Protocol for Collecting Representative Ground Truth Land Covers
                    for the Little Miami River Watershed 2002
B.1.   INTRODUCTION
      The purpose of this protocol is to obtain example landcovers within the Little
Miami River Watershed to assist in the classification of remotely-sensed imagery during
the period of July 1 to July 30, 2002.

B.2.   EQUIPMENT NEEDED
      1.  Field sheets (see Attachment A)
      2.  Highway maps
      3.  Map of assigned random lat/long starting points
      4.  GPS (calibrated by GPS Coordinator beforehand)
      5.  Compass
      6.  Digital or 35mm camera
      7.  Range finder (optional)
      8.  Adequate disk space on digital camera or several rolls of 35mm film
      9.  New and/or fully charged batteries for GPS unit and camera
      10. Copies of official letter for private landowners
      11. Pencils
      12. Tape measure

B.3.   PROCEDURE
      On the day before going out to the field, all GPS units and their respective
manuals (if available) need to be temporarily given to the GPS coordinator. The GPS
coordinator will standardize all the units with respect to the  datum, spheroid, and the
lat/long format to be used, and return them  to the field crew the following day.
      Using highway maps and GPS, proceed to first assigned random starting point
(see Attachment B).
      If a landcover of interest (see Section B.4 below) exists at the starting point, fill
out your first record here.
      From this point, ascertain how to travel and look for additional land covers within
a mile radius of the starting point.  This can be done by traveling in four different
directions emanating from this point (e.g., N, S, E, W) or as best you can throughout this
area.
                                      B-1

-------
      While traveling throughout the area (i.e., within a mile radius of each starting
point), locate, stop and fill out a record for as many unique landcovers as possible.  Use
only one (1) field sheet per landcover found.  Once a particular landcover is
recorded around a random starting point do not provide repeated records of this type of
landcover unless it differs significantly  (e.g., an agricultural crop at different stages of
growth, or till or no-till, etc.) or unless you're beginning at a new random  starting point
within the watershed.
      For each type of landcover found, ideally move to a point where you are
surrounded by the  landcover for 100m  on all sides (approximately the length of a
football field).  If the GPS signal is poor (e.g., while located in a forest with  dense
canopy or an urban setting with tall buildings) or you are not able to gain access to a
private site, attempt to take the coordinates of a point just outside of the  landcover area
and using a compass and range finder (if available) note on the field sheet where these
GPS coordinates are with respect to the landcover being sampled (e.g.,  northeast
corner of corn field).  If a GPS reading  would ideally be taken on private  property,
receive permission first.
      Once in position:
      1) set the GPS unit in  a stationary position and allow the GPS to  acquire a 3D fix
         (i.e., on at least 4 satellites) for at least 5 minutes before recording lat/long
         coordinates,
      2) record each satellite # with a solid black bar fix (e.g., on the Garmin units), or
         otherwise the signal strength of each satellite being used by the  GPS unit.
      3) take GPS measurements at  this single  point for a minimum of  three minutes,
         recording all latitude and longitude coordinates displayed by the  GPS unit for
         up to twelve unique sets  of coordinates.
      4) take a picture or digital image of the landcover sampled, and using a
         compass, note on the field sheet the view and orientation of the picture or
         image (e.g., "corn plants  6 inches high taken from  sample point in field
         looking NW," or "Geist reservoir taken from SE shore looking NE").
      5) record how to cross-reference the camera images with the field sheets (note:
         if you run out of camera film  or file space for a digital camera, please provide
         as detailed a description  of the landcover and its adjacent surrounding area
         as possible).
      6) fill out the remaining field sheet in as much detail as possible including
         information on any notable landmarks nearby and  any characteristics about
         the landcover surface which  might assist in classifying the remotely-sensed
         imagery  later (e.g.,  is the terrain flat or hilly? What is the name/type, color,
         height, or texture of the vegetation or other surface sampled?).
                                       B-2

-------
B.4.   LAND COVER CLASSES OF INTEREST AND THEIR DEFINITIONS (Modified
      from the NLCD Land Cover Classification System Key - Rev. July 20, 1999)
B.4.1. Land Cover Classification System Key.
Water
Developed
      Concrete
      Asphalt
      Other construction material
      Low Intensity Residential
      High  Intensity Residential
      Commercial/Industrial/Transportation
Barren
      Bare  Rock/Sand/Clay
      Quarries/Strip Mines/Gravel Pits
      Transitional
Forested Upland
      Deciduous Forest
      Evergreen Forest
      Mixed Forest
Shrubland
Non-natural Woody
      Orchards/Vineyards/Other
Herbaceous Upland
      Grasslands/Herbaceous
Herbaceous Planted/Cultivated
      Pasture/Hay
      Row  Crops
            Corn
            Soybeans
      Small Grains
            Wheat
      Fallow
      Urban/Recreational Grasses
Wetlands
      Woody Wetlands
      Emergent Herbaceous Wetlands
                                    B-3

-------
B.4.2.  Land Cover Class Definitions.
Water. All areas of open water; typically 25% or greater cover of water (per pixel).
Developed. Generally defined as areas of intensive human use with a high
concentration (30% or higher) of constructed materials (e.g., asphalt, concrete,
buildings,  glass and metal structures, etc.).  If possible, look for and take coordinates in
the center of large areas (100m by 100m) which are uniformly asphalt or concrete, or
composed of other constructed materials. In addition, record points in the center of the
following general areas too:
      Low Intensity Residential - Includes areas with a mixture of constructed materials
      and vegetation.  Construction materials account for 30-80% of area. Vegetation
      may account for 20-70% of area.  These areas most commonly include single-
      family housing units. Population densities will be lower than in high intensity
      residential areas.
      High Intensity Residential - Includes highly developed areas where people reside
      in high numbers.  Examples include apartment complexes and row houses.
      Vegetation accounts for less than 20% of the cover.  Constructed materials
      account for 80-100% of the cover.
      Commercial/Industrial/Transportation - Includes infrastructure (e.g., roads,
      railroads, etc.) and all highly developed areas (e.g., central business district,
      shopping or strip mall, warehouses, etc.) not classified as High  Intensity
      Residential.
Barren area. Areas characterized by bare rock, gravel, sand, silt, clay, or other earthen
material with little or no "green" vegetation present regardless of its inherent ability to
support life.  Vegetation, if present, is more widely spaced and scrubby than that in the
"green" vegetated categories; lichen cover may be extensive.
      Bare Rock/Sand/Clay - perennially barren areas of bedrock, desert pavement,
      scarps, talus, slides, volcanic material, glacial debris, beaches,  and other
      accumulations of earthen material.
      Quarries/Strip Mines/Gravel Pits - Areas of extractive mining activities with
      significant surface expression.
      Transitional - Areas of sparse vegetative cover (less than 25% of cover) that are
      dynamically changing from  one land cover to another, often because of land use
      activities.  Examples include forest clearcuts, a transition phase between forest
      and agricultural land, temporary clearing of vegetation, new construction sites,
      and changes due to natural causes (e.g., fire, flood, etc.).
Forested  Upland.  Areas characterized by tree cover (natural or semi-natural woody
vegetation, generally greater than 6 meters tall); tree canopy accounts for 25-100% of
the cover.
                                       B-4

-------
      Deciduous Forest - Areas dominated by trees where 75% of more of the tree
      species shed foliage simultaneously in response to seasonal change.
      Evergreen Forest - Areas dominated by trees where 75% of more of the tree
      species maintain their leaves year-round.  Canopy is never without green foliage.
      Mixed Forest - Areas dominated by trees where neither deciduous or evergreen
      species represent more than 75% of the cover present.
Shrubland. Areas characterized by natural or semi-natural woody vegetation with
aerial stems, generally less than 6 meters tall, with individuals or clumps not touching to
interlocking.  Both evergreen and deciduous species of true shrubs, young trees, and
trees or shrubs that are small or stunted because of environmental conditions are
included.
      Shrubland - Areas dominated by shrubs; shrub canopy accounts for 25-100% of
      the cover. Shrub cover is generally greater than 25% when tree cover is less
      than 25%.  Shrub cover may be less than  25% in cases when the cover of other
      life forms (e.g., herbaceous or tree) is less than 25% and shrubs cover exceeds
      the cover of the other life forms.
Non-natural Woody. Areas dominated by non-natural woody vegetation; non-natural
woody vegetative canopy  accounts for 25-100%  of the cover.  The non-natural woody
classification is subject to  the availability of sufficient ancillary data to differentiate non-
natural woody vegetation from natural woody vegetation.
      Orchards/Vineyards/Other -  Orchards, vineyards, groves, nurseries, ornamental
      horticultural areas or other areas planted or maintained for the production of
      fruits, nuts, berries, or ornamentals.
Herbaceous Upland.  Upland areas characterized by natural or semi-natural
herbaceous vegetation; herbaceous vegetation accounts for 75-100% of the cover.
      Grasslands/Herbaceous - Areas dominated by upland grasses and forbs. In rare
      cases,  herbaceous cover is less than 25%, but exceeds the combined cover of
      woody species present. These  areas are  not subject to intensive management,
      but they are often utilized for grazing.
Planted/Cultivated. Areas characterized by herbaceous vegetation that has been
planted or is intensively managed for the production of food, feed, or iber; or is
maintained in developed settings for specific purposes.  Herbaceous vegetation
accounts for 75-100% of the cover.
      Pasture/Hay - Areas of grasses, legumes, or grass-legume mixtures planted for
      livestock grazing or the production of seed or hay crops. Visual clues  to these
      areas include grazing areas  used by cows or horses,  and fields of alfalfa hay in
      different stages of cutting.
                                      B-5

-------
      Row Crops - Areas used for the production of crops, such as corn, soybeans,
      vegetables, and tobacco, or cotton. In the LMR, we are most likely to find corn
      and soybeans with some vegetables and tobacco.
      Small Grains - Areas used for the production of graminoid crops such as wheat,
      barley, oats, or rice.  In the LMR, we are most likely to find wheat.  Oats may be
      a possibility, but this crop is more common in northerly to northeastern counties
      of Ohio.
      Fallow - Areas used for the production of crops that are temporarily barren or
      with sparse vegetative cover as a result of being tilled in a management practice
      that incorporates prescribed alternation between cropping and tillage.
      Urban/Recreational Grasses - Vegetation (primarily grasses) planted in
      developed settings for recreation, erosion control, or aesthetic purposes.
      Examples include parks, lawns, golf courses, airport grasses, and industrial site
      grasses.
Wetlands.  Areas where the soil and substrate is periodically saturated with or covered
with water as defined by Cowardin,  et al.
      Woody Wetlands - Areas where forest or shrubland vegetation accounts for
      25-100% of the cover and the soil or substrate is periodically saturated with or
      covered with water.
      Emergent Herbaceous Wetlands - Areas  where perennial herbaceous vegetation
      accounts for 75-100% of the cover and the soil or substrate is periodically
      saturated with or covered with water.
                                      B-6

-------
         Attachment A: Remote Sensing Classification Field Data Sheet
                       Little Miami River Watershed 2002
Today's Date:
                              Investigator(s):
Random Starting Point
                                  Landcover#
Picture#
3D fix? Yes / No   Satellite #s:

Displayed Accuracy?	

Latitude:
                                 Begin/End Time

                                      Longitude:
Landcover Type (circle one):
Cement
Asphalt
Other construction material	
Low-density residential
High-density residential
Commercial/Industrial/Transportation
Other urban or built-up land:	
Corn
Soybeans
Wheat
Pasture/Hay (horse? cow? alfalfa hay?	)
Fallow (i.e., croplands temporarily out of production)
Grass (residential, golf course, park, airport, erosion
  control, industrial, developed, or managed site)
Grasslands (primarily upland, natural or semi-natural,
  or not intensively managed)
Orchard, grove, vineyard, nursery, or ornamental
  horticultural area
Other crop or agricultural land:	
                                                Open Water
                                                Woody Wetland
                                                Emergent Herbaceous Wetland
                                                Barren areas:
                                                BareRock/Sand/Clay
                                                Quarry/Strip Mine/Gravel Pit
                                                Transitional (clearcuts,
                                                  clearings, burnt or flood area)
                                                Deciduous Forest
                                                Coniferous Forest
                                                Mixed Forest
                                                Shrubland
                                      B-7

-------
                                                      Attachment B
                            North
  Radius from center (GPS location): -100 meters

Detailed description of landcover type. For example, predominate species
present and growth stage or height of vegetation or other structures? Dry or wet
conditions? Dust or puddles present? Flat or hilly topography? Adjacent
landcovers, roads or landmarks?
                               B-8

-------
                                APPENDIX C
            GENERAL RULES USED FOR OBTAINING TRAINING SET
                 AND SUPPLEMENTAL GROUND TRUTH DATA
      The following guide sets forth the rules to follow while creating polygons of land
areas that are samples of a particular land cover/land use class.  These sample areas
should be as homogenous in pattern and color as possible; in short, the "best" or
"cleanest" examples that can be found of that particular land cover/land use class.


General Rules:
   1.  Polygons should be no smaller than 5x5 pixels or 20m x 20 m, preferably larger.
      Exceptions would be narrow linear features such as roads and riparian forest.
      See the Key below for "Roads".

   2.  Scale should be approximately 1:5000.  Urban areas will likely require a larger
      scale.

   3.  Do not create a polygon that intersects or crosses a flight-line.

   4.  There should be at minimum a 10m border between the edge of the polygon and
      the outside edge of the land use being sampled.  In the case of roads, the border
      should be at least 5 meters. The purpose of the border is to leave ample room
      for error.

   5.  Choose only areas that are uniform in color and pattern for a particular class.
                                     C-1

-------
                                           APPENDIX D
                                            METADATA
        Metadata ("data about the data") created for this project was passed through the

USGS's metadata-parser software to insure that the metadata files were error free.

Metadata included in this appendix includes information about the image sources,

dates, datums, projections, resampling algorithms, processing steps, file records,

accuracy assessment, and other pertinent information associated with this geographic

product.  This metadata complies with Federal Executive Order 12906.



Sample Metadata of the 4-meter Classification of the Little Miami River Watershed

Identificationjnformation:
 Citation:
  Citationjnformation:
   Originator: U.S. Environmental Protection Agency (EPA)
   Publication_Date: 2006
   Title: Little Miami River Watershed Hyperspectral Classification Dataset
   Geospatial_Data_Presentation_Form: Map
   Publicationjnformation:
    Publication_Place: EPA, Cincinnati, OH
    Publisher: EPA, Cincinnati, OH
   Online_Linkage: http://www.epa.gov
   Larger_Work_Citation:
    Citationjnformation:
     Originator:
      This is a land cover classification based upon Compact Airborne
      Spectrographic Imager (CASI) data for the Little Miami River
      watershed, an EPA project.
     Publication_Date: 2006
     Title:
      Classification of High Spatial Resolution, Hyperspectral Remote
      Sensing Imagery of the Little Miami River Watershed in Southwest
      Ohio, USA
     Publicationjnformation:
      PublicationJ^lace: EPA, Cincinnati, OH
      Publisher: EPA
     Other_CitationJDetails:
      This classification is based on 82 CASI flight lines acquired on
      7/24/2002, 7/25/2002, 7/31/2002, 8/1/2002, 8/7/2002, 8/8/2002/,
      8/9/2002.

      Troyer, M.E., J. Heo and H. Ripley. 2006. Classification of High Spatial
      Resolution, Hyperspectral Remote Sensing Imagery of the Little Miami
      River Watershed in Southwest Ohio, USA. U.S. Environmental
      Protection Agency, Office of Research and Development, National Center
      for Environmental Assessment, Cincinnati, OH.
Description:
  Abstract:
   This is a final classification. This data set is the classification of the
   Little Miami River Watershed. This data set consists of 82 mosaicked
   flight lines that were analyzed according to the protocols as set forth
   in a Quality Assurance Project Plan (QAPP) to determine land use/land
   cover. It was created using CASI hyperspectral data by applying the object
   segmentation approach in eCognition (version 3.0) and the Spectral Angle
   Mapper (SAM) approach in ENVI (version 4.0). Then data were manually
   edited and mosaicked. This layer is also available in selected
   sub-watersheds.
  Purpose:
   To classify existing hyperspectral imagery and produce several land
                                                 D-1

-------
   use/land cover products suitable for generating landscape metrics and
   analyses by the EPA.
 Time_Period_of_Content:
  Time_Period_lnformation:
   Range_of_Dates/Times:
    Beginning_Date: 20020724
    Ending_Date: 20020809
  Currentness_Reference: Dates of the CASI collection
 Status:
  Progress: Complete
  Maintenance_and_Update_Frequency: None
 Spatial_Domain:
  Bounding_Coordinates:
   West_Bounding_Coordinate: -84.507357
   East_Bounding_Coordinate: -83.547827
   North_Bounding_Coordinate: 39.991621
   South_Bounding_Coordinate: 38.874112
  Keywords:
  Theme:
   Theme_Keyword_Thesaurus:
   Theme_Keyword: Land Cover Analysis
   Theme_Keyword: Hyperspectral
  Place:
   Place_Keyword_Thesaurus:
   Place_Keyword: Little Miami River Watershed
   Place_Keyword: Ohio
 Access_Constraints: None
 Use_Constraints:
  Data set is not for use in litigation. While efforts have been
  made to ensure that these data are accurate and reliable within
  the state of the art, EPA, cannot assume liability for any
  damages, or misrepresentations, caused by any inaccuracies in the
  data, or as a result of the data to be used on a particular
  system. EPA makes no warranty, expressed or implied, nor does
  the fact of distribution constitute such a warranty.
 Native_Data_Set_Environment: Erdas Imagine signed 32bit integer (.img)
Data_Quality_lnformation:
 Attribute_Accuracy:
  Attribute_Accuracy_Report:
   According to accuracy assessment performed by Forest One,
   the overall accuracy is 83.92% and 81.99% Kappa.
   Each  class accuracy is as follows: (Producers Accuracy/Users Accuracy)

   1: Lotic = 86.73%/98.84%
   2: Lentic = 70.83%/85.00%
   3: Forest = 95.00%/89.62%
   4: Corn = 88.33%/95.50%
   5: Soybean = 87.70%/90.68%
   6: Wheat = 47.37%/100.00%
   7: Dry Herbaceous = 82.00%/58.57%
   8: Grass = 75.00%/80.65%
   9: Urban Barren = 82.61 %/90.48%
   10: Rural Barren = 64.52%/45.45%
   11: Urban/Built = 95.00%/92.23%

   The validation points were both assembled from EPA provided ground truth
   and interpreted in the lab using color aerial photographs.
   There were 902 points used for accuracy assessment total.
   Field collected validation points were collected concurrently
   (or nearly so) with the imagery in July and August of 2002. The field data
   and 2002 unrectified color aerial images were used for ground truthing of
   agricultural classes. 2003 high-resolution color aerial orthophotos were
   used for ground truthing of the  rest of the classes: lotic, lentic,
   forest, grass, urban barren, rural barren, and urban/built.
 Logical_Consistency_Report:
  Tests for logical consistency indicate that all row and column
  positions in the selected latitude/longitude window contain data.
  Conversion and integration with  vector files indicates that all
  positions are consistent with earth coordinates covering the same
  area. Attribute files are logically consistent.
                                                        D-2

-------
Completeness_Report:
 Data exists for all classes.
 All pixels (other than background) have been classified.
Lineage:
 Source_lnformation:
  Source_Citation:
   Citationjnformation:
    Originator: Hyperspectral Data International, Inc.
    Publication_Date: 20030428
    Title:
      High Resolution Remote Sensing of the Little Miami River
      Watershed in Southwest Ohio, USA
    Publicationjnformation:
      Publication_Place: NA
      Publisher: NA
    Online_Linkage: NA
  Type_of_Source_Media: DVD+R and CD-ROM
  Source_Time_Period_of_Content:
   Time_Period_lnformation:
    Range_of_Dates/Times:
      Beginning_Date: 20020724
      Ending_Date: 20020809
   Source_Currentness_Reference: Unknown
  Source_Citation_Abbreviation: NA
  Source_Contribution: NA
 Process_Step:
  Process_Description:
   This dataset was created by Forest One (Joon Heo - Principal
   Investigator, and Sitansu Pattnaik), Earth Satellite Corp.
   (Francois Smith, Christopher Jengo, Christopher Bolton, and
   Michael Diller),and Hyperspectral Data  International, Inc.
   (Herbert  Ripley, William Jones, Laura Roy, and Michelle Warr) under
   EPA purchase orders 3c-R337-NTSA, and 1C-R328-NALX. Project
   Officer: Dr. Michael E. Troyer. The study area is the Little
   Miami River Watershed in Southwest Ohio.

   Summary:
   This section outlines the classification procedure for the
   Little Miami River Watershed. Atmospheric correction and
   rectification were applied to the raw hyperspectral imagery.
   EPA field data, rectified aerial photography (Orthophotos),
   and unrectified aerial images were used for guidance
   in  training point selection. The hyperspectral data were then
   segmented into three Level 1 classes (Urban, Rural, Water) in
   eCognition. The training sets were used as inputs into Level
   2 classifications within the Urban and Rural Level 1 classes.
   Level 2 classification was performed with the Spectral
   Angle Mapper (SAM) method within ENVI. The resulting clusters
   of data were combined into categories related to the
   classification scheme. QA/QC was performed often resulting
   in  a re-combination of clusters to better represent the final
   classes.  Some categories were lumped to correspond to the final
   classification scheme. Data were classified by flight line.
   Then  the data were manually edited and mosaicked. Then a final
   QA/QC was completed.

   Pre-Processing steps:
   The data received for this project were in a state ready
   for classification. Hyperspectral Data International, Inc
   performed atmospheric correction via Atmosphere  Correction
   Now (ACORN) software at the time of data acquisition.
   Initial  data examination by Forest One and Earth Satellite
   Corp. determined that cross-track illumination variations
   on the boundary could be handled by manual editing after
   classifications. Classification was performed on data
   geo-rectified using nearest neighbor resampling.

   Field-Collected Data:
   The EPA made available field data collected by EPA
   personnel during the time of image data acquisition in 2002.
                                                       D-3

-------
 The 390 points were used as a guide in selecting training
 sets for classification.

 Classification:
 Classification for this project was performed in a hierarchical
 manner.  First, a Level 1 classification was performed.
 Training sets guided by EPA groundtruthing and high-resolution
 color aerial photography were selected for the Level 1 classes
 (Urban, Rural, Water). eCognition was used to perform the
 segmentation. QC work on the segmented image was performed
 to reduce the occurrence of obviously misclassified pixels. For
 instance, dark rooftops are often misclassified as water.  The
 next step was to perform Level 2 classification for the Urban
 and Rural Level 1 classes. Training sets were chosen for the
 Level 2 classification. For Urban,  these included Water, Grass,
 Urban Barren, Urban/built, and Forest.  For Rural, these included
 Water,  Corn, Soybean, Dry Herbaceous, Rural Barren, Urban/Built,
 and Forest.
 Water and Forest were included in Level 2 classification in case
 any pixels remained improperly classified from Level 1. The Level
 2 classification was performed using the Spectral Angle Mapper
 algorithm in ENVI. The Level 1 and  Level 2 classes were then
 combined, and QA/QC was performed to locate any misclassified pixels.
 Also, water was differentiated to lotic and  lentic at this point.
 The final classes for this project are:
 -99:Background
 1: Lotic
 2: Lentic
 3: Forest
 4: Corn
 5: Soybean
 6: Wheat
 7: Dry Herbaceous
 8: Grass
 9: Urban Barren
 10: Rural Barren
 11: Urban/Built
 99: Unclassified

 Ancillary Datasets:
 Non-CASI datasets used are 2003 high-resolution color aerial
 orthophotos from Aerials Express, acquired in July, 2003, and 2002
 unrectified color aerial images from the Center for Mapping at the
 Ohio State University, The acquisition dates of 2002 images were
 between Aug. 30, 2002 and Sept.  13, 2002. EPA provided ground
 truth, 35 shape files of sub-basins as well as transportation and
 hydrographic vectors for the Little  Miami River Watershed.

 Post-Processing Steps:
 The 82 flight lines were classified  separately and then mosaicked
 to produce the land use/land cover for the entire watershed at 4m
 spatial  resolution.  The 4m dataset was aggregated to produce an
 additional 30m spatial resolution land use/land cover product too.
 Data products at 4m and 30m spatial resolution were also  created  for
 35 selected sub-basins. Data were later subset by sub-watershed.
Process_Date: 20040215
Process_Contact:
 Conta conformation:
  Contact_Person_Primary:
   Contact_Person: Dr. Michael E. Troyer
   ContactjDrganization: U.S. Environmental Protection Agency
  Contact_Address:
   Address_Type: mailing and physical address
   Address: 26 West Martin Luther King Drive
   City: Cincinnati
   State_or_Province: OH
   Postal_Code: 45268
   Country: USA
  Contact_Voice_Telephone: 513-569-7399
  Contact_Electronic_Mail_Address:  troyer.michael@epa.gov
                                                     D-4

-------
Spatial_Reference_lnformation:
 Horizontal_Coordinate_System_Definition:
  Planar:
   Map_Projection:
    Map_Projection_Name: Albers Conical Equal Area
    Albers_Conical_Equal_Area:
     Standard_Parallel: 29.5
     Standard_Parallel: 45.5
     Longitude_of_Central_Meridian: 96 West
     Latitude_of_Projection_Origin: 23 North
     False_Easting: 0.00000
     False_Northing: 0.00000
   Planar_Coordinate_lnformation:
    Planar_Coordinate_Encoding_Method: Row and column
    Coordinate_Representation:
     Abscissa_Resolution: 4 meters
     Ordinate_Resolution: 4 meters
    Planar_Distance_Units: Meters
  Geodetic_Model:
   Horizontal_Datum_Name: North American Datum 1983
   Ellipsoid_Name: GRS80
   Semi-major_Axis: 6378137.0
   Denominator_of_Flattening_Ratio: 298.257
Entity_and_Attribute_lnformation:
 Detailed_Description:
  Entity_Type:
   Entity_Type_Label: Little Miami River Watershed, Southwest Ohio, USA
   Entity _Type_Definition:
    Little Miami River Watershed as delineated by EPA
    and extent of imagery
   Entity_Type_Definition_Source: EPA
  Attribute:
   Attribute_Label: Land Cover Classification
   Attribute_Definition: Land Cover Classification as determined by EPA
   Attribute_Definition_Source: EPA
   Attribute_Domain_Values:

    Enumerated_Domain:
     Enumerated_Domain_Value: -99 Background
     Enumerated_Domain_Value_Definition:
      This class contains no data due to data voids.
     Enumerated_Domain_Value_Definition_Source: EPA

    Enumerated_Domain:
     Enumerated_Domain_Value: 1  Water - Lotic
     Enumerated_Domain_Value_Definition:
      Open water associated with running water system, such as a river
      or stream.  Such waterways typically have a defined channel
      and an associated floodplain.
     Enumerated_Domain_Value_Definition_Source: EPA

    Enumerated_Domain:
     Enumerated_Domain_Value: 2 Water - Lentic
     Enumerated_Domain_Value_Definition:
      Open water associated with still water system, such as lakes,
      reservoirs, potholes, and stock ponds.  Such bodies typically
      do not have a defined channel or associated floodplain.
     Enumerated_Domain_Value_Definition_Source: EPA
    Enumerated_Domain:
     Enumerated_Domain_Value: 3 Forest
     Enumerated_Domain_Value_Definition:
      Contains either or both  deciduous and coniferous trees in any degree
      of mixture, single stemmed, woody vegetation with canopy spanning
      greater than 4m and tree canopy accounting for 25-100 percent of
      the cover.
     Enumerated_Domain_Value_Definition_Source: EPA

    Enumerated_Domain:
     Enumerated_Domain_Value: 4 Corn
     Enumerated Domain  Value Definition:
                                                      D-5

-------
  Area under cultivation of food and fiber,
  where corn is the primary crop.
 Enumerated_Domain_Value_Definition_Source: EPA

Enumerated_Domain:
 Enumerated_Domain_Value: 5 Soybean
 Enumerated_Domain_Value_Definition:
  Area under cultivation of food and fiber where soybeans are the
  primary crop.
 Enumerated_Domain_Value_Definition_Source: EPA

Enumerated_Domain:
 Enumerated_Domain_Value: 6 Wheat
 Enumerated_Domain_Value_Definition:
  Area under cultivation of food and fiber where wheat is the primary
  crop.
 Enumerated_Domain_Value_Definition_Source: EPA

Enumerated_Domain:
 Enumerated_Domain_Value: 7 Dry Herbaceous
 Enumerated_Domain_Value_Definition:
  Dominated by dry and/or less vigorous herbaceous types of
  vegetation; herbaceous vegetation accounts for no less than 25%
  of the cover. This class mainly includes naturally occurring and
  unmanaged herbaceous vegetation, and dried out, unhealthy,  or
  stressed crop. Dry Herbaceous vegetation prevailed in crop fields
  as well as natural fields, due to a high degree of drought in the
  summer of 2002. These dry herbaceous types of vegetations  had little
  chlorophyll content and very similar spectral signatures.  Not
  enough variation was present in the spectral signatures to further
  classify this class into different vegetative species.
 Enumerated_Domain_Value_Definition_Source: EPA

Enumerated_Domain:
 Enumerated_Domain_Value: 8 Grass
 Enumerated_Domain_Value_Definition:
  Dominated by cultivated grasses planted in developed settings for
  recreation, erosion control, or aesthetic purposes. Examples include
  well-watered parks, lawns,  golf courses, airport grasses, and
  industrial site grasses.
 Enumerated_Domain_Value_Definition_Source: EPA

Enumerated_Domain:
 Enumerated_Domain_Value: 9 Urban Barren
 Enumerated_Domain_Value_Definition:
  Composed of bare soil, rock, sand, silt, gravel, or other earthen
  material with little (less than 25%) or no vegetation within  urban
  areas. Examples in this class include exposed soil in urban areas
  and construction sites.
 Enumerated_Domain_Value_Definition_Source: EPA

Enumerated_Domain:
 Enumerated_Domain_Value: 10 Rural Barren
 Enumerated_Domain_Value_Definition:
  Composed of bare soil, rock, sand, silt, gravel, or other earthen
  material with little (less than 25%) or no vegetation in rural areas.
 Enumerated_Domain_Value_Definition_Source: EPA

Enumerated_Domain:
 Enumerated_Domain_Value: 11 Urban/Built
 Enumerated_Domain_Value_Definition:
  Areas covered by structures and impervious surfaces in urban,
  suburban, and rural areas.  Buildings, parking lots, and
  paved roads typically fall into this class.

 Enumerated_Domain_Value_Definition_Source: EPA

Enumerated_Domain:
 Enumerated Domain Value: 99 Unclassified
                                                  D-6

-------
     Enumerated_Domain_Value_Definition:
      This class includes areas of image gaps among flight lines and
      cloud cover where land cover classification is impossible.
     Enumerated_Domain_Value_Definition_Source: EPA
Distributionjnformation:
 Distributor:
  Contactjnformation:
   Contact_Organization_Primary:
    Contact_Organization:
     National Center for Environmental Assessment, Office of Research
     and Development, U.S. Environmental Protection Agency
    Contact_Person: Dr. Michael E. Troyer
   Contact_Address:
    Address_Type: mailing and physical address
    Address: 26 West Martin Luther King Drive.
    City: Cincinnati
    State_or_Province: OH
    Postal_Code: 45268
    Country: USA
   Contact_Voice_Telephone: 513-569-7399
   Contact_Electronic_Mail_Address: troyer.michael@epa.gov
 Resource_Description: Little Miami River Watershed Hyperspectral Classification
 Distribution_Liabiiity: NA
 Standard_Order_Process:
  Digital_Form:
   Digital_Transfer_lnformation:
    Format_Name: Erdas Imagine signed 32bit integer (.img)
   Digital_Transfer_Option:
    Offline_Option:
     Offline_Media: DVD or CD-ROM
     Recording_Format: ISO 9660
     Compatibilityjnformation:
      ISO 9660 format allows the media to be read
      by most computer operating systems.
  Fees:  NA
Metadata_Reference_lnformation:
 Metadata_Date: 20040217
 Metadata_Review_Date: 20060330
 Metadata_Contact:
  Contactjnformation:
   Contact_Organization_Primary:
    Contact_Organization:
     National Center for Environmental Assessment, Office of Research
     and Development, U.S. Environmental Protection Agency
    Contact_Person: Dr. Michael E. Troyer
   Contact_Address:
    Address_Type: mailing and physical address
    Address: 26 West Martin Luther King Drive
    City: Cincinnati
    State_or_Province: OH
    Postal_Code: 45268
    Country: USA
   Contact_Voice_Telephone: 513-569-7399
   Contact_Electronic_Mail_Address: troyer.michael@epa.gov
 Metadata_Standard_Name:
  FGDC (Federal Geographic Data Committee)
  CSDGM (Content Standard for Digital Geospatial Metadata)
 Metadata Standard Version: FGDC-STD-001-1998
                                                       D-7

-------