4>EPA
United States
Environmental Protection
Agency
Final Report
Impact of Urbanization on the
Hydrology of the Pocono Creek
Watershed: A Model Study
-------
EPA/600/R-07/006
September 2006
Impact of Urbanization on the Hydrology
of Pocono Creek Watershed: A Model
Study
By
Mohamed M. Hantush
U.S. Environmental Protection Agency
Cincinnati, Ohio 45268
Latif Kalin
Oak Ridge Institute for Science and Education (ORISE)
U.S. Environmental Protection Agency
Cincinnati, Ohio 45268
National Risk Management Research Laboratory
Office of Research and Development
U.S. Environmental Protection Agency
Cincinnati, Ohio 45268
-------
Notice
The U.S. Environmental Protection Agency through its Office of Research and
Development funded the research described here. It has been subjected to the Agency's peer
and administrative review and has been approved for publication as an EPA document.
This research was supported in part by an appointment to the Post Doctoral Research
Program at the National Risk Management Research Laboratory, administered by the Oak
Ridge Institute for Science and Education through Interagency Agreement No. DW89939836
between the U.S. Department of Energy and the U.S. Environmental Protection Agency.
Mention of trade names or commercial products does not constitute endorsement or
recommendation for use.
All research projects making conclusions or recommendations based on environmentally
related measurements and funded by the Environmental Protection Agency are required to
comply with the Agency Quality Assurance Program. This report has been subjected to
QA/QC review. The report presented a mathematical framework for modeling hydrology of a
watershed and did not involve collection and analysis of environmental measurements.
-------
Foreword
The U.S. Environmental Protection Agency (EPA) is charged by Congress with protecting
the Nation's land, air, and water resources. Under a mandate of national environmental laws,
the Agency strives to formulate and implement actions leading to a compatible balance
between human activities and the ability of natural systems to support and nurture life. To
meet this mandate, EPA's research program is providing data and technical support for
solving environmental problems today and building a science knowledge base necessary to
manage our ecological resources wisely, understand how pollutants affect our health, and
prevent or reduce environmental risks in the future.
The National Risk Management Research Laboratory (NRMRL) is the Agency's center for
investigation of technological and management approaches for preventing and reducing risks
from pollution that threaten human health and the environment. The focus of the Laboratory's
research program is on methods and their cost-effectiveness for prevention and control of
pollution to air, land, water, and subsurface resources; protection of water quality in public
water systems; remediation of contaminated sites, sediments and ground water; prevention
and control of indoor air pollution; and restoration of ecosystems. NRMRL collaborates with
both public and private sector partners to foster technologies that reduce the cost of
compliance and to anticipate emerging problems. NRMRL's research provides solutions to
environmental problems by: developing and promoting technologies that protect and improve
the environment; advancing scientific and engineering information to support regulatory and
policy decisions; and providing the technical support and information transfer to ensure
implementation of environmental regulations and strategies at the national, state, and
community levels.
This publication has been produced as part of the Laboratory's strategic long-term research
plan. It is published and made available by EPA's Office of Research and Development to
assist the user community and to link researchers with their clients.
Sally Gutierrez, Director
National Risk Management Research Laboratory
11
-------
Abstract
The Pocono Creek watershed located in Monroe County, PA, is threatened by high
population growth and urbanization. Of concern specifically is the potential impact of future
developments in the watershed on the reduction of base flow and the consequent risk of
degradation of wild brown trout habitats in Pocono Creek. Anticipated increase in
imperviousness, on the other hand, is expected to elevate flood risk and the associated
environmental damage. A watershed hydrology based modeling study was initiated by the
U.S. EPA in collaboration with the U.S. Geological Survey and the Pennsylvania Fish and
Boat Commission to assist Monroe County in planning for sustainable future development in
the Pocono Creek watershed.
The Soil and Water Assessment Tool (SWAT) is selected to model the impact of
projected future build out in the Pocono Creek watershed on the hydrologic response thereof.
The model is successfully calibrated and validated for two sources of precipitation data,
raingauge and Next Generation Weather Radar_(NEXRAD) hourly precipitation data. The
results clearly show that NEXRAD is an effective and economic alternative source of spatio-
temporal precipitation, and that future modeling studies in ungauged watersheds may benefit
from the use of NEXRAD rainfall data.
Ensemble model forecast is constructed using time series analysis and Monte Carlo (MC)
simulations to evaluate model predictive uncertainty. The MC simulations over a 20-year
long period yielded an ensemble of rating curves of which the median and 95% confidence
band of daily streamflows are plotted. These plots allow for the construction of the 95%
confidence band for design flows corresponding to any given recurrence or return period.
SWAT simulated daily streamflow rates in the range 2 to 11 (mVs) show the least
uncertainty. Computed daily streamflow rates below 2 m3/s have the greatest uncertainty,
whereas for flows higher than 11 m3/s uncertainty is moderate.
MC simulations over a 20-year period predict that, on the average, daily base flow would
be reduced by 31% based on the projected build out in the watershed. However, predicted
change in the average of daily streamflow (averaged over the entire simulation period)
appears to be negligible. The computed low-flow index, 7Q10, is expected to decline by
11%, and the monthly median daily flow is expected to be reduced by 10% on the average.
The monthly peak of simulated daily flows and annual maximum daily flow on the average
are predicted to increase by 21% and 19%, respectively. Watershed-averaged groundwater
recharge is predicted to decline by 31% due to the projected land use changes. The median of
the MC simulated flow duration curves shows that in general the likelihood that the
watershed will experience high and low streamflows will increase with the projected
urbanization.
An index methodology is developed to rank seven subwatersheds composing the modeled
portion of the Pocono Creek watershed based on their relative impact on watershed response
to anticipated land developments. The first index, a, signifies the absolute impact of a
particular catchment area on the watershed response. The second index, /?, is a normalized
in
-------
by the percentage area of the sub-catchment, and therefore describes the impact per area of
land use changes. With a few exceptions, a and |3 indices produce similar rankings among
the 7 catchment areas for 7Q10, monthly median of daily flow, and annual maximum daily
flow. These ranking results may be related to groundwater recharge, area, topographic
features, and proximity to the streamflow gauge station. The very downstream catchment
area 7 ranked first in terms of impact on annual maximum daily flows, and second in terms
of impact on 7Q10 and monthly median daily flows. Catchment area 4 associated with the
highest groundwater recharge was ranked first and second for impact on 7Q10 based on a
and |3 indices, respectively. Areas characterized by steep topography and intense wetlands
ranked low, some times the lowest, with respect to impact on the three design flows.
The results of this model study point toward significant changes in low as well as high
flow regimes, should the Pocono Creek watershed experience land use changes consistent
with the projected build out in the watershed. Management measures may be taken in the
future to minimize the predicted changes in the watershed hydrology.
Keywords: watershed, radar, rain gauges, hydrologic modeling, distributed parameter,
SWAT, critical source area, land use change, uncertainty, forecast, Monte Carlo,
time series analysis, Pocono Creek, urbanization, base flow
IV
-------
Table of Contents
Notice i
Foreword ii
Abstract iii
Figures vii
Tables ix
Acknowledgments x
1 Introduction 1
1.1 Pocono Creek Watershed 1
1.2 Objectives 2
1.3 Report Organization 3
1.4 Summary 4
2 Model Selection 5
2.1 Background 5
2.2 Summary of Hydrologic Component of SWAT Model 6
2.3 Summary 9
3 Watershed Model 10
3.1 Study Area 10
3.2 Base flow Separation 11
3.3 Next Generation Weather Radar (NEXRAD) 12
3.4 Model Calibration 12
3.5 Model Validation 18
3.6 Conclusions 20
4 Model Prediction Uncertainty 22
4.1 Introduction 22
4.2 Structural and Model Parameters Uncertainty 23
4.2.1 Prediction Error 23
4.2.2 Nonparametric Probability Distribution 28
4.2.3 Nonparametric Random Generation 29
4.3 Model Forecast Evaluation 30
4.4 Uncertainty Due to Precipitation Variability 33
4.5 Monte Carlo Simulations 34
4.6 Conclusions 37
5 Impact of Land Use Changes 39
5.1 Introduction 39
5.2 Present Land Use Map and Future Build Out 39
5.3 Monte Carlo Simulation 41
5.4 Predicted Changes in Streamflow 41
5.5 Predicted Changes to Flow Frequency and Duration 45
5.6 Conclusions 45
v
-------
6 Critical Source Areas 47
6.1 Introduction 47
6.2 Methodology 47
6.3 Results 48
6.4 Conclusions 51
7 Summary and Conclusions 52
8 References 54
VI
-------
Figures
Figure 1. Pocono Creek Watershed: Location, topography, USGS stream gauge,
climate stations and NEXRAD cell centroids (adapted from Kalin and
Hantush, 2006a) 2
Figure 2. Pocono Creek watershed: a) Land use map; b) Soil map (STATSGO) 12
Figure 3. Calibration procedure in the SWAT model (adapted from Santhi et al.,
2001) 14
Figure 4. Comparison of NEXRAD estimated precipitation with measurements
from local gauge stations. Values shown are in mm 15
Figure 5. Measured stream flow (SF) and estimated surface runoff (SR) plotted
against SWAT simulated counterparts for the calibration (7/1/2002-
5/31/2004) period. NEXRAD estimates are used as precipitation data
source. Top two panels show monthly results, and the bottom panel
shows daily results 17
Figure 6. Measured streamflow (SF) and estimated surface runoff (SR) plotted
against SWAT simulated counterparts for the validation period
(6/1/2004-4/30/2005). NEXRAD estimates are used as precipitation data
source. Top and bottom panels denote monthly and daily results,
respectively 18
Figure 7. Measured daily streamflows plotted against SWAT simulated
counterparts for the post-validation period (5/1/2005-9/30/2005).
NEXRAD estimates are used as precipitation data source 20
Figure 8. Time series of model prediction errors (Observed - SWAT simulated) in
the calibration and validation time period (7/1/2002-4/30/2005): (a)
daily simulations, (b) 3-day moving average 24
Figure 9. Autocorrelation (ACF) and partial autocorrelation (PACF) functions of:
(a) prediction error sh (b) first difference of residual error Vst = st - st.i,
(c) residuals (wr) for ARIMA(1,1,1) x (0,0,1)3. The lag is in units of
days 26
Figure 10. Frequency histogram, Gaussian PDF, and the nonparametric model fit to
the computed series of residuals w^. Both the histogram of residuals and
normal distribution has zero mean and standard deviationcr^ 29
Figure 11. SWAT model ensemble forecast (median and 95% confidence band) and
measured streamflows during the validation period (12/16/2004-
4/30/2005). A total of 9 measurements lie outside the 95% confidence
band (6.7 %). The simulated and measured values are 3-day averages 32
Figure 12. SWAT model forecast (median and 95% confidence band) and measured
streamflows during the post-validation period (5/1/2005-9/30/2005). The
simulated and measured values are 3-day averages 32
Figure 13. SWAT model 3-day average ensemble forecast (median and 95%
confidence band) and measured daily streamflows during the validation
period (12/16/2004-4/30/2005). A total of 15 measurements fall outside
the 95% confidence band (11 %) 33
vn
-------
Figure 14. Median and 95% confidence band for the MC simulated duration curve.
The flow duration curves are generated from 500 replicas of 20 years
daily streamflows, ot 37
Figure 15. Coefficient of variation computed from ensemble of flow duration
curves versus probability of exceedance in the left panel P(X > x), and
cumulative probability P(X
-------
Tables
Table 1. Data used in the model construct and their sources 11
Table 2. Mass Balance Error (MBE), coefficient of determination, R2, and Nash-
Sutcliffe efficiency, ENS for Stream Flow (SF), Base flow (BF) and Surface
Runoff (SR) during the calibration period with NEXRAD as precipitation
data source. Values in parenthesis indicate values when rain gauges are used
as precipitation data source 17
Table 3. Mass Balance Error (MBE), coefficient of determination, R2, and Nash-
Sutcliffe efficiency, ENS for Stream Flow (SF), Base flow (BF) and Surface
Runoff (SR) during the validation period, g and n represent gauge and
NEXRAD input precipitations, respectively 19
Table 4. Values of residual variance, Final prediction error (FPF), Akaike's
information criterion (AIC), and Bayesian information criterion (BIC) for
various models applied to model prediction error data, et 27
Table 5. Summary statistics of ensemble streamflow forecast based on most recent
land use (Figure 2a). Values in the table are obtained from 500 time series of
synthesized daily streamflows 35
Table 6. Summary statistics of computed streamflow characteristics at the USGS
gauge station. The results are derived from 50 MC simulations each 20
years long 42
Table 7. Computed indices a and /? for the 7 management areas for 7Q10, monthly
median of daily flows and annual maximum daily flows 49
IX
-------
Acknowledgments
The U.S. Environmental Protection Agency, through its Office of Research and
Development, funded the research described here through in-house efforts and in part by an
appointment to the Postgraduate Research Program at the National Risk Management
Research Laboratory administered by the Oak Ridge Institute for Science and Education
through an interagency agreement between the U.S. Department of Energy and the U.S.
Environmental Protection Agency. The report benefited from the constructive review
comments of Dennis Lai and Rao S. Govindaraju.
x
-------
1 Introduction
1.1 Pocono Creek Watershed
Pocono Creek watershed is a 46.5 square miles (120 km2) basin located in Monroe County in
Eastern Pennsylvania near the New Jersey border (Figure 1). The watershed drains into one
of the main tributaries of Delaware River and has very good water and biological quality, and
is designated as Special Protection Waters by the State and the Delaware River Basin
Commission (DRBC). Wild brown trout population, tourism and significant outdoor
recreation are major economic drivers of the area. Monroe County has the second fastest
growing population in the state of Pennsylvania. The county is threatened by high population
growth because of its attractive, pristine natural resources, and its proximity to the New York
City and Philadelphia metropolitan regions. Since 1980 the population of Monroe County has
nearly doubled and is projected to grow an additional 60% by 2020. Potential impacts,
among others, include a degradation and loss of the forested and agricultural lands and
deterioration of the local quality of life. Specifically, the concern is that the projected growth
and land use changes along with the accompanying increased groundwater withdrawals in the
watershed could well exceed sustainable levels, depleting groundwater and streamflows, and
resulting in the loss of the Creek's wild brown trout. Anticipated increase of imperviousness
due to urbanization in the watershed and projected increase in the demand for ground water
due to the anticipated population growth threaten the sustainability of current base flows.
Further, reduced infiltration and increased runoff rates have the potential to elevate peak
flows and increase flood hazards during large storm events.
The Delaware River Basin Commission in collaboration with the U.S. Environmental
Protection Agency (USEPA), Broadhead Watershed Association, Monroe County
Conservation District, Monroe County Planning Commission, U.S. Geological Survey
(USGS), and other stakeholders formed a consortium to study the potential future impacts of
the projected growth and land use changes on the sustainability of the natural resources in the
Pocono Creek watershed. The goals are to manage flows and growth in the Pocono Creek
watershed such that natural resources are sustainable. The goals will be partly achieved
through three integrated model studies by the USEPA, the U.S. Geological Survey (USGS),
and the PA Fish and Boat Commission (PA F&B). These model studies will evaluate the
effects of growth and land use change on ground water, streamflow, and the habitat of
Pocono Creek. This report documents the development of the Pocono Creek Watershed
model by the USEPA using the Soil and Water Assessment Tool (SWAT), its application to
assess the impact of projected urbanization on streamflow characteristics, and identification
of critical areas within the watershed having major contributions to changes in the
streamflow. The results of this model study will be linked to a USGS groundwater flow
model (MODFLOW) and the Pennsylvania Instream Flow Model (PIFM).
Distributed watershed models are utilized to better understand the role of hydrological
processes that govern surface and subsurface water movement. They provide tools for
environmental decision making and water resources planning and management. They are not
-------
only helpful in making model predictions of future flow conditions, but also in assessment of
hydrologic impacts of management measures scenarios, land cover and climate changes.
(961.614) B
B (961.615)
* Mount Pocono
climate station
(962,614)
(961,613
4 Kilometers (965,612)°
Streams
NEXRADcell centroicls
Elevation (meters)
| 1 183-219
219 - 254
254 - 290
290 - 326
326 - 362
362 - 397
| | 397 - 433
||433-469
| | 469 - 504
| | 504 - 540
^B 540 - 576
^B 576 -612
• 612 - 647
(966,614)
Q
*
Stroudsburg
climate station
Stroudsburg
USGS
stream gauge
Figure 1. Pocono Creek Watershed: Location, topography, USGS stream gauge, climate
stations and NEXRAD cell centroids (adapted from Kalin and Hantush, 2006a).
1.2 Objectives
The goal of the current model study and those by the USGS and the PA F&B, is to provide
the necessary science so that Monroe County and stakeholders can make informed decisions
that will assist them in developing and implementing sustainable water resources
management strategies without compromising the integrity of the watershed's habitat. To
-------
accomplish this overall goal, a watershed model is developed by the U.S. EPA to achieve the
following objectives:
1- Calibrate and validate a watershed model for Pocono Creek Watershed and examine
model performance with Next Generation Weather Radar (NEXRAD) data against that
based on surface raingauge precipitation data: The distributed hydrologic Soil Water
Assessment Tool (SWAT) will be calibrated and validated. Radar generated precipitation
estimates such as from NEXRAD products have found increasing usage in the hydrologic
community lately as an alternative source to gauge data. NEXRAD data can provide
information about the spatial distribution of precipitation patterns. If proven successful,
NEXRAD data can be a cost-effective alternative to the rather more costly and sparse
raingauge data.
2- Predict the impact of projected land use changes on annual average recharge distribution:
Spatial distribution of annual groundwater recharge rates will be computed pre- and after
urbanization build out for use by a USGS MODFLOW model to simulate the impact of
projected increase in groundwater withdrawals on base-flow reductions.
3- Predict the impact of projected land use changes on monthly median daily flows:
Monthly median daily flows will be computed pre and post urbanization build out for use
by the PIFM model which will be used to establish a relationship between flow
reductions and potential habitat degradation in the Pocono Creek watershed.
4- Evaluate model predictive uncertainty: It is a common practice to calibrate and validate
hydrologic and water quality models, but their forecasting abilities are rarely rigorously
evaluated. In this modeling effort, the extra step of evaluating model error propagation
will be conducted using time series analysis and Monte Carlo (MC) type simulations.
5- Compute the effect of urbanization on streamflow characteristics: It is anticipated that
projected population growth and urbanization in Pocono Creek will be accompanied by
an increase in impervious fraction of the watershed. The impact of land-use changes on
low, high, monthly average, and median flows will be particularly investigated, along
with 95% confidence band of the computed changes in flow characteristics.
6- Identify critical areas in the watershed: Once potential changes in flow characteristics are
predicted by the watershed model due to projected land-use changes, the inner
catchments contributing most of the changes will be identified using an index-based
methodology.
1.3 Report Organization
This report is organized as follows. In Section 2 the rationale for the selection of the SWAT
model to investigate the hydrology of Pocono Creek is established. A summary of model
representation of the most important components of the hydrologic cycle and related
parameters is presented. This summary is intended to define key model parameters and
provide insights into the process of model calibration. Section 3 describes the study area,
-------
model data and sources, and model calibration and validation using raingauge and NEXRAD
precipitation data. The viability of NEXRAD technology as an alternative to raingauge
networks as a source of spatio-temporal precipitation is established in the section. Error
propagation and model forecast capability is investigated in Section 4 using time series
analysis and Latin Hypercube Monte Carlo (MC) simulations. Section 5 explores potential
impacts of projected build out in the watershed on changes in key streamflow characteristics
using MC simulations. The impact of land use changes on (low, medium, and high) flow
frequency and duration is also investigated. Section 6 applies an index approach to identify
and rank subwatersheds or areas within subwatersheds that may contribute mostly to
predicted changes in streamflow characteristics. Each section ends with conclusions. The
final summary and conclusions of this model study appear in Section 7. References are
included in Section 8.
1.4 Summary
Pocono Creek watershed is threatened by high population growth and urbanization. Potential
impacts include degradation and loss of the forested and agricultural lands and deterioration
of the local quality of life. The condition of the wild brown trout habitat has been identified
as an indicator of the health of the watershed. Of specific concern is the potential impact of
projected population growth and land use change on the reduction of base flow and the
impact this may have on the degradation of wild brown trout habitats in Pocono Creek.
Projected increase in the imperviousness in the watershed, on the other hand, is expected to
increase flood frequency and reduce the recurrence interval of high flows.
Upon request by the DRBC and the EPA Region III, the National Risk Management
Research Laboratory of the USEPA has been tasked to investigate through a watershed
model study the impact of projected land use changes on potential alterations in the
hydrology of Pocono Creek. Constrained by the existing budget and available resources, it
was concluded that the SWAT model is a suitable and effective tool to conduct this modeling
study.
The objectives of the watershed model study are primarily three-fold. First, to calibrate
and validate a watershed model for Pocono Creek and compute the spatial groundwater
recharge distribution for use by a USGS groundwater flow model (MODFLOW) being
developed to assess the impact of land-use changes on base flow. Secondly, to compute
monthly median daily flows for use by the PIFM model that will be used to establish a
relationship between flow reductions and potential habitat degradation in Pocono Creek.
Thirdly, to investigate the impact of land-use changes on flow duration characteristics and
identify critical areas in the watershed. An important component of this study is investigating
NEXRAD as an alternative source of precipitation data, and the error propagation analysis
required to examine model forecast quality.
-------
2 Model Selection
2.1 Background
The distributed hydrologic Soil Water Assessment Tool (SWAT) (Neitsch et al., 2002a/b)
was chosen to fulfill the project objectives. The SWAT model was originally developed to
quantify the impact of land management practices in large, complex watersheds with varying
soils, land use, and management conditions over a long period of time, on the order of years.
It is developed and maintained by the US Department of Agriculture (USDA) scientists and
is freely available from . Although SWAT is mostly based
on empirical equations and simplified mass balance relationships, it is a widely-used model
and numerous applications can be found in the peer reviewed literature. For instance, as of
February 2006, the SWAT model web site cited 211 peer reviewed publications in the form
of journal papers or book chapters (). Borah (2002) reviewed eleven continuous-simulation and single-event
watershed scale models including SWAT. The study provides a better understanding of the
mathematical bases of the models. Kalin and Hantush (2003) reviewed key features and
capabilities of widely cited watershed scale hydrologic and water quality models, and
identified SWAT as one of the most suitable models for applications related to watershed
management. Robustness of SWAT for simulating watershed responses has also been
demonstrated in comparative studies by Saleh and Du (2002) and Van Liew et al. (2003).
Arnold and Fohrer (2005) provided a list of SWAT applications in the USA and worldwide.
The SWAT model development, operation, limitations, and assumptions were discussed
by Arnold et al. (1998). Srinivasan et al. (1998) reviewed the applications of the SWAT
model in streamflow prediction, sediment and nutrients transport, and effects of management
practices on water quality. Arnold and Allen (1996) evaluated the performance of different
hydrologic components of the SWAT model for three watersheds in Illinois (100-250 km2).
Comparing the model outputs to measured data, the calibrated model reasonably simulated
runoff, groundwater, and other components of hydrologic cycle for the study watersheds.
Most of the simulated average monthly outputs were within 5% of the historical data and
nearly all of them were within 25%. The coefficient of determination (R2) was used to assess
the correlation between the observed and simulated average monthly variables. Also, the
interaction among various components of hydrologic budgets was recognized to be realistic.
SWAT was utilized in a study by Arnold et al. (2000) to compare the performance of two
baseflow and groundwater recharge models. The first model was the water balance
components of the SWAT model. A combination of a digital hydrograph separation tool and
a modified hydrograph recession curve displacement technique composed the second model.
The results of the two models were in general agreement in the Upper Mississippi river basin.
A detailed procedure for calibration of SWAT was laid out by Santhi et al. (2001). Jha et al.
(2003) found curve number (CN) as the most sensitive parameter in streamflow prediction
with SWAT. Muleta and Nicklow (2005) applied SWAT coupled with automated calibration
to estimate daily flow and sediment yield in a 133 km2 Southern Illinois watershed. Eckhardt
and Arnold (2001) developed a version of SWAT having global optimization algorithm
(SWAT-G) to model daily flow in an 81 km2 watershed in Germany. Fohrer et al. (2002)
used SWAT-G in conjunction with two other GIS based agricultural economy and ecology
-------
models in a mountainous 60 km2 watershed in Germany to analyze the effect of land use
changes. Sophocleaous and Perkins (2000) integrated SWAT with MODFLOW and applied
the integrated modeling system to three different Kansas watersheds. Tripathi et al. (2004)
showed on a 92.5 km2 Indian watershed that SWAT can successfully simulate average annual
and monthly flow and sediment yield even if weather input is obtained through SWAT's
weather generator. One may also refer to Jayakrishnan et al. (2005) on SWAT applications to
water resources management. In conclusion, SWAT performance has been extensively
validated for streamflow, and sediment and nutrients yield predictions for different regions of
United States and outside.
2.2 Summary of Hydrologic Component of SWAT Model
SWAT is a distributed, deterministic process-based hydrologic model (Neitsch et al.,
2002a/b). The AVSWAT (Di Luzio et al., 2002) graphical user interface (GUI) which runs
under ArcView GIS is used to preprocess model data, run the SWAT model, and post
process model outputs. SWAT uses readily available inputs and has the capability of routing
runoff and chemicals through streams and reservoirs, adding flows and input measured data
from point sources, and is capable of simulating long periods for computing the effect of
management changes. Major components of the model include weather, surface runoff,
return flow, percolation, evapotranspiration (ET), transmission losses, pond & reservoir
storage, crop growth & irrigation, groundwater flow, reach routing, nutrient & pesticide
loading, and water transfer.
Input data needed to run the SWAT model includes soil, land use, weather, rainfall,
management conditions, stream network, and watershed configuration data. AVSWAT has
the capability of extracting most of these model parameters from readily available GIS maps
such as digital elevation models (DEM), land use maps, STATSGO soil maps, etc. Below is
a short summary of the SWAT model from the model manual and theoretical documentation
version 2000 (Neitsch et al., 2002a; 2002b).
SWAT partitions the watershed into subunits including subbasins, reach/main channel
segments, impoundments on main channel network, and point sources to set up a watershed.
Subbasins are divided into hydrologic response units (HRUs) which are portions of subbasins
with unique land use/management/soil attributes. AVSWAT enables extraction of input
parameters easily. It uses Digital Elevation Models (DEM) as input to extract the channel
network and delineate the watershed and subwatersheds, the resolution of which depends on
the user provided threshold area which is required to initiate a first order channel. The
threshold area can be chosen in such a way that the resultant channel network resembles the
one provided in topographic maps. The user needs to provide two threshold values to create
HRUs, one for land use and one for soil. Land uses that cover a percentage of the subbasin
area less than the threshold level are considered minor and thus eliminated. After the
elimination process, the areas of the remaining land uses are reapportioned so that 100% of
the land area in the subbasin is modeled. The soil threshold is applied next in a similar
fashion to eliminate minor soil types that occupy negligible portions of the HRUs.
In SWAT, the land phase of the hydrologic cycle is based on the water balance equation:
-------
SWt = SW0+^ (Rday,, - <2OTr/, - EaJ - Wseepj - <9W ) (2.1)
;=1
where SWt is the final soil water content (mm), SWo is the initial soil water content (mm), t is
the time (days), Rday,i is the amount of precipitation on day /' (mm), Qsurf,i is the amount of
surface runoff on day /' (mm), Ea,i is the amount of evapotranspiration on day / (mm), wseep,i is
the amount of percolation and bypass flow exiting the soil profile bottom on day /' (mm), and
Qgw,i is the amount of return flow on day / (mm).
Snowmelt is included with rainfall in the calculation of runoff and percolation; it is
controlled by the air and snow pack temperature, the melting rate, and the areal coverage of
snow. The mass balance for the snow pack is given by:
SNOt = SNOo + Rday - Esub - SNOmit (2.2)
where SNOt is the water content of the snow pack at the end of a day (mm), SNOo is the
initial water content of snow pack (mm), Rday is the amount of precipitation on a given day
(mm), Esub is the amount of sublimation on a given day (mm), and SNOmit is the amount of
snowmelt on a given day (mm). This equation assumes that the water released from
snowmelt is evenly distributed over the 24 hours of the day.
SWAT uses the SCS curve number method (USDA Soil Conservation Service, 1972) or
Green & Ampt infiltration method (Green and Ampt, 1911) to compute surface runoff
volume for each HRU. The former option is utilized in this study. The SCS runoff equation is
an empirical model that was developed to provide a consistent basis for estimating the
amounts of runoff under varying land use, soil types, and antecedent moisture conditions
(Rallison and Miller, 1981). The SCS curve number equation is:
(Rda-0.2S)2
day
where, S is the retention parameter (mm). In this equation the initial abstraction, which
includes surface storage, interception and infiltration prior to runoff, is approximated as 0.25".
The retention parameter is defined as:
CN
(2.4)
^ }
in which C7Vis the curve number for the day, which is a function of the soil's permeability,
land use and antecedent soil water conditions.
Evapotranspiration (ET) is the primary mechanism by which the water is removed from a
watershed. It includes all processes by which water at the earth's surface is converted to
water vapor: evaporation from the plant canopy, transpiration, sublimation and evaporation
from the soil. SWAT calculates actual ET from potential evapotranspiration (PET). The latter
is estimated by three methods in SWAT: the Penman-Monteith method, the Priestly-Taylor
-------
method, and the Hargreaves method. The Penman-Monteith method requires solar radiation,
air temperature, relative humidity and wind speed. The Priestly-Taylor method requires solar
radiation, air temperature and relative humidity. The Hargreaves method requires air
temperature only. Penman-Monteith is used in this study.
Once surface runoff is calculated, the amount of surface runoff released to the main
channel is computed from
—surlag
l-e 'conc
Qch,i ~ \Qsurf,i + Qstor,i-l)'
(2.5)
where Qch,t is the amount of surface runoff discharged to the main channel on day /' (mm),
Qsurf.t is the amount of surface runoff generated in the subbasin on day /' (mm), Qst0r,i-i is the
surface runoff stored or lagged from day /'-I (mm), surlag is the surface runoff lag
coefficient, and tconc is the time of concentration for the subbasin (hrs). The last expression
within the bracket on the right hand side of Equation (2.5) represents the fraction of total
available water allowed to enter the reach on a given day. Remaining water becomes
available water for the next day (Qstor.i).
The movement of water through the channel network of the watershed to the outlet is
routed in main channel and reservoirs. Flow is routed through the channel using a variable
storage coefficient method developed by Williams (1969) or the Muskingum routing method,
the latter of which is employed in this study. Transmission losses, which reduce runoff
volume as the flood wave travels downstream, are also accounted for by the model. SWAT is
a continuous time model, i.e., a long-term yield model. The model is not designed to
simulate detailed, single-event flood routing.
The water balance for the shallow aquifer is:
«&/,,/ = «&/,,/-! + Wrchrg,l ~ Qgw,i ~ Wrevap,, ~ H 'deep,, ~ Wpump,Sh,, (2-6)
in which aqsh,t is the amount of water stored in the shallow aquifer on day / (mm), aqsh,t-i is
the amount of water stored in the shallow aquifer on day /'-I (mm), wrchrg,i is the amount of
recharge entering the aquifer on day / (mm), Qgw>i is the groundwater discharge, or base flow,
into the main channel on day /' (mm), wrevap>i is the amount of water moving into the soil zone
in response to water deficiencies on day /' (mm), Wdeep,i is the amount of water percolating
from the shallow aquifer into the deep aquifer on day /' (mm), and wpump:Sh,i is the amount of
water removed from the shallow aquifer by pumping on day /' (mm). The recharge to the
aquifer on a given day is calculated:
Wrchrg,i = - e~sW WSeep,, + C~S" • Wrchrg,,-l (2-7)
where Sgw is the delay time or drainage time of the overlying geologic formations (days),
p,i is the total amount of water exiting the bottom of the soil profile on day /' (mm), and
i is the amount of recharge entering the aquifer on day /-I (mm).
-------
Base flow is computed by SWAT using this equation:
\+ \
(2.8)
where Qgw,i-i is the groundwater flow into the main channel on day /'-I (mm), agw is the base
flow recession constant, and At is the time step (1 day).
2.3 Summary
The rationale for the selection of SWAT was laid down in this section and its track record of
applications to watershed management was discussed. It should be noted that while it is not
the policy of the U.S. EPA to promote the use of a particular model, the unavailability of a
detailed survey of Pocono Creek channels' geometry and characteristics and lack of
information about bathymetric and hydraulic characteristics of impoundments, ponds, and
wetlands in the watershed precluded the use of more physically based, complex watershed
models that are currently available to the public. It will be evident throughout the analyses
hereafter that the selection of the SWAT model was indeed an appropriate decision.
SWAT is a distributed, process-based watershed model, but with significant number of
empirical relationships. It is one of the most suitable models for assessing the impact of
management practices and land disturbances on watershed responses, and has a solid track
record of applications.
-------
3 Watershed Model
3.1 Study Area
The 120 km2 Pocono Creek watershed is located between the latitudes 40°59'N-41°06'N and
longitudes 75°14'W-75°26'W in Monroe County, Eastern Pennsylvania near the New Jersey
state border (Figure 1), located within the Delaware River Basin. Pocono Creek's 26 km long
watershed valley drains from the Pocono Plateau in its headwaters eventually into the
Brodhead Creek, a tributary to the Delaware River. The model is constructed for the area
upstream from the USGS streamflow gauge station that is located about 6.4 km upstream
from the mouth near the city of Stroudsburg, PA (Figure 1) and drains an area about 98.87
km2.
Table 1 summarizes the data used in this study along with their sources and formats.
Precipitation, temperature, humidity and wind speed data were obtained from two National
Weather Service (NWS) climate stations; Mount Pocono to the North and Stroudsburg to the
East. As can be seen from Figure 1, both stations are outside the watershed boundary. To
study the potential effects of this on model performance, NEXRAD data were also utilized
during the study as an alternative precipitation data source. Specifically, the XMRG products
produced by The Middle Atlantic River Forecast Center (MARFC
) were used. XMRG precipitation files are generated in a
specific file format after analyses from both gauges and radar with some manual quality
control and are available at approximately 4 km cell resolutions. The small squares with dots
inside in Figure 1 represent the locations of the centroids of the NEXRAD precipitation cells.
The XMRG files for the MARFC region can be downloaded from
. Climate data from 1960 to
2004 indicate that, on average, annual precipitation is 1237 mm, and varies from a minimum
of 76 mm in February to a maximum of 125 mm in September. The temperature typically
varies from a minimum of-11 °C in January to a maximum of 26 °C in July.
The current land cover (Figure 2a) is dominantly forest (89%). Pasture constitutes about
3.5% and minor agricultural activities less than 0.2%. Residential, commercial and
transportation areas comprise about 5.8% of the watershed including the commercially
developed Route 611 corridor, Big Pocono State Park, Camelback Ski Area, the Nature
Conservancy's Tannersville Cranberry Bog, and state gamelands. Silt loam is the major soil
type in the watershed covering about 84.9%. Two other soil types in the watershed are sandy
loam (11.4%) and loam (3.7%). The elevation in the watershed changes from 183 m at the
outlet to 648 m near the Camel Back Ski Area. The topography in the watershed generally
has an average slope of 11% and ranges from 4% to 23%. A 30 m resolution digital elevation
model (DEM) is used in extraction of the stream network and delineation of the watershed
and sub watersheds. The watershed is divided into 29 subbasins. The STATSGO soil database
was used to acquire the soil-related model parameters (Figure 2b). Hydrologic response units
(FtRU) are generated by using 10% and 0% thresholds for land use and soil maps that
resulted in a total of 129 FIRUs.
10
-------
Table 1. Data Used in the Model Construct and Their Sources
Data
Source
Additional Info
Soil
Land Use
Elevation
Climate
Radar
Stream
flow
SWAT build in
DRBC
DRBC provided, also available at
http://www.pasda.psu.edu/
NOAA National Data Center
http://nndc.noaa.gov
NWS Hydrologic Data Systems
Group:
http://dipper.nws.noaa.gov/hdsb/data
/nexrad/nexrad. html
http://waterdata.usgs.gov/pa/nwis/uv
?dd_cd=02&format=pre&period=16
&site no=01441495
USDA STATSGO soil database
National Land Cover Data (NLCD)
30m resolution DEM
Stations: Mount Pocono, 4F08'N /
75°23'W; Stroudsburg, 4F01'N/
75°11'W; hourly (daily for Stroudsburg)
precipitation, temperature, wind speed,
relative humidity
NEXRAD, Multisensor Precipitation
Estimator (MPE) Data in XMRG format.
Hourly precipitation at approximately 4
km resolution.
USGS 01441495 Pocono Creek ab
Wigwam Run near Stroudsburg, PA, Lat
40°59'27", long 75°15'20", data collected
every 15 min.
3.2 Base flow Separation
Stream flow is usually partitioned into two parts: the fast and the slow response components,
the latter of which is due to the base flow contribution. Any other contribution to stream flow
by various mechanisms can be deemed as the fast response component. SWAT computes the
base flow and surface runoff components of the stream flow separately. Although some
parameters play an interactive role such as the CN, some parameters only affect one
individual component of the flow. For instance, Manning's roughness only affects surface
runoff, whereas base flow recession constant only affects base flow. Hence, to better
calibrate model parameters it is necessary to partition the observed stream flow hydrograph
into base flow and surface runoff components, both of which then become estimated
quantities rather than observed.
We estimated base flow using the method described in Arnold et al. (1995) and Arnold
and Allen (1999). This is a recursive digital filter technique which was originally used in
signal processing and filtering. The filter can be passed over the stream flow data several
times with each pass resulting in less base flow. Various other methods of base flow
separation are also available and each can give significantly different base-flow estimates
which clearly affect not only model calibrated parameters but also model performance.
11
-------
(a)
Open Water
Perennial Ice/Snow
Low Intensity Residential
High Intensity Residential
Commercial/Industrial
Bare Rock/Sand/Clay
Quarries/Strip Mines
| | Transitional
Deciduous Forest
Evergreen Forest
Mixed Forest
Shrubland
| | Orchards/Vineyards
Grasslands/Herbaceous
Pasture/Hay
Row Crops
Small Grains
Fallow
Urban/Recreational Grasses
Woody Wetlands
Emergent Herbaceous Wetlands
(b)
Statsgo soils
PA024
PA025
PA026
PA027
PA030
PA031
PA037
Figure 2. Pocono Creek watershed: a) Land use map; b) Soil map (STATSGO).
3.3 Next Generation Weather Radar (NEXRAD)
Both the Mount Pocono and Stroudsburg climate stations are located outside the watershed
boundary (Figure 1) and provide point measurements. This raises the question whether the
data from these two gauge stations are representative of spatial rainfall pattern over the
Pocono Creek watershed which is partly covered with mountains. It is well known that
precipitation may change significantly in mountainous areas due to orographic effects.
Precipitation is usually higher in upper elevations. To be able to realistically answer this
question several raingauges inside the watershed boundary are needed so that spatial
distribution of the precipitation pattern can be attained. The Next Generation Weather Radar
(NEXRAD) precipitation data offer an opportunity to overlay spatially distributed
precipitation over the entire watershed. NEXRAD also provides a finer temporal resolution
(hourly) compared to daily values from the climate stations that are available. In any case,
hourly precipitation is not required for this study and daily precipitation suffices for the
current model application. Because the ultimate goal is modeling base flow and stream flow
in the Pocono Creek watershed, gauge driven model performance is compared to NEXRAD
driven ones. This sheds light on whether or not consideration of the distributed nature of
precipitation improves the model performance, and whether NEXRAD can be relied on as an
alternative to the surface raingauge measurements. For further details on processing the
NEXRAD data for use in a watershed model, the reader may refer to Kalin and Hantush
(2006a).
3.4 Model Calibration
Models are only simplified representations of natural processes. Even fully physically-based
models cannot avoid simplifications. In addition, parametric uncertainty and measurement
errors make model calibration inevitable in most modeling exercises. A split data set
approach is implemented in this study to calibrate and validate the SWAT model. The period
12
-------
from 7/1/02 to 5/31/04 of the daily flow data, aggregated into monthly flows, is used for
calibration and the remaining data from 6/1/04 to 4/30/05 is used for validation. Three to five
years of data are typically required in calibration, although along with a good set of data and
proper objective function, a single year of data has been shown to be adequate (Sorooshian et
al., 1983). The statistical measures of Mass Balance Error (MBE), coefficient of
determination (R2) and Nash-Sutcliffe (1970) efficiency (ENS) are used as indicators of model
performance that are defined as
N N
YO -Vo, . — -
/_^z^sim,i /_^z^oos,i f^. — r)
i=\ ;=1 _ xi sim z--o
JL n
Vn ^obs
(3.1)
R2=^
l2
(3.2)
Z^ (OSim,i
i=\
(o o\
3.3)
•th
where Qsim,i and Q0bs,i are simulated and observed or estimated flows at i observation,
respectively, N is the number of observations. Similarly, Osim and Oobs are the average
simulated and observed flows over the simulation period. The coefficient of determination
describes the proportion of the total variances in the observed data that can be explained by
the model and ranges from 0 to 1, whereas Nash-Sutcliffe efficiency is a measure of how
well the plot of observed versus predicted values fit the 1:1 line, and can vary from -GO to 1.
A negative ENS indicates that model predictions are not better than the average of observed
data.
The calibration process is adapted from Santhi et al. (2001) and is summarized in Figure
3. The threshold values set for MBE, R2 and ENS in the figure are at monthly time scales.
Curve numbers (C7V) of each soil/land use combination are calibrated first to meet the criteria
set for surface runoff (SR). At the next stage the GW REVAP coefficient which is a limiting
factor for the maximum amount of water that can be removed from the aquifer to the
overlying unsaturated zone due to moisture deficit, threshold depth of water in shallow
aquifer required for base flow to occur (GWQMN), delay time for aquifer recharge
(GW_DELAY), and soil evaporation compensation factor (ESCO) are calibrated to meet the
stream flow (SF) and base flow (BF) criteria. Once the model is calibrated using measured
stream flows for the calibration period, the model is validated using the data for the above
stated validation period and MBE, R2 and ENS are checked against the threshold values shown
in Figure 3. The threshold values set for MBE, R2 and ENS statistical measures must be
13
-------
satisfied for a successfully calibrated model. Before calibrating the model parameters, an
automated sensitivity analysis was performed with a version of SWAT called AVSWATX
that is based on Latin Hypercube (LH) and One factor At a Time (OAT) sampling. The
sensitivity analysis revealed that CN is the most sensitive model parameter (Kalin and
Hantush, 2006a).
Separate SR and BF for
measured daily flow
If avrg. of sim. SR is within ± 15%
of avrg. measured SR and
R2>0.6, EN_S>0.5
YES
If avrg. of sim. SF is within ± 15%
of avrg. measured SF and
R2>0.6, EN_S>0.5
NO
Adjust GW_REVAP,
GWQMN, ESCO,
GW DELAY
YES
If avrg. of sim. BF is within
± 15% of average measured BF
NO
Adjust GW_REVAP,
GWQMN, ESCO,
GW DELAY
YES
Calibration complete
•SR: Surface Runoff
•SF: Stream Flow
•BF: Base Flow
•R2: Coefficient of determination
[0,1]
•EN.S: Coefficient of efficiency or
Nash-Sutcliffe Efficiency [-°°,1]
•CN: Curve Number
•GW_REVAP: Groundwater
revap* coefficient
•GWQMN: Threshold depth of
water in shallow aquifer required
for return flow to occur
•ESCO: Soil Evaporation
compensation factor
•GW_DELAY: Delay time for
aquifer recharge (days)
* movement of water from
unconfined aquifer to overlying
unsaturated layers
Figure 3. Calibration procedure in the SWAT model (adapted from Santhi et al., 2001).
Figure 4 compares the daily and monthly areally averaged NEXRAD estimated
precipitation to measurements at the gauges. All the values in the figure are in mm. Mount
Pocono station falls in between two NEXRAD cells (Figure 1): (961,614) and (961,615)
where the numbers in parentheses represent the X and Y coordinates of the centroids of
NEXRAD cells in the FtRAP coordinate system. Therefore, we compared Mount Pocono
point precipitation measurements to NEXRAD estimates at both cells as well as to their
arithmetic averages. The Stroudsburg station falls inside the grid (966,614) as seen in Figure
1. Estimated hourly precipitations were
14
-------
150
CC
I
400
5 300 -
SJ-
= 200 -
0)
100 -
Monthly-Scale
(1/1/02-4/30/05)
y = 1.03x + 8.96
R2 = 0.87
100 200 300
Mt. Pocono Gauge
150
400
150 -,
Daily Scale
120 -\ (1/1/02-4/30/05)
CD
82-
90 -
60
30
y=1.00x+0.25
R2 = 0.91
150
30 60 90 120 150
Mt. Pocono Gauge
30 60 90 120
Mt. Pocono Gauge
400
in
5 300
5
S-
1 20°
o
CL
2 100
I
Monthly-Scale
(1/1/02-4/30/05)
y=1.01x + 5.21
R2 = 0.94
100 200 300
Mt. Pocono Gauge
350
400
150
30 60 90 120
Mt. Pocono Gauge
150
Q_ '
<
or .
400
300 -
in"
^- 200 -
5
2L
100 -
0
Monthly-Scale
(1/1/02-4/30/05)
y=1.02x+7.09
R2 = 0.92
0 100 200 300 400
Mt. Pocono Gauge
^300-
s- 25° -
CD
a 200 -
8 150 -
< 100-
1 50-
0 -
Monthly-Scale
(1/1/02-2/28/05)
y=0.95x + 1.87
R2 = 0.90
30 60 90 120
Stroudsburg Gauge
150
50 100 150 200 250 300 350
Stroudsburg Gauge
Figure 4. Comparison of NEXRAD estimated precipitation with measurements from local
gauge stations. Values shown are in mm.
aggregated in these cells to obtain the daily and monthly precipitation estimates. We
computed daily values from 7:00 AM to 7:00 AM at the cell (966,614) to be consistent with
the reported daily precipitation data at the Stroudsburg gauge station. Figure 4 reveals that
(961,615) represents Mount Pocono relatively better than both (961,614) and the average of
the two, as it has a higher R2, the slope of its regression equation is closer to 1 and has a
smaller intercept. Further, (961,615) overestimates precipitation by only 5.7% compared to
10.8% overestimation of (961,614) from 1/1/2002 to 4/30/2005. Over the time period from
1/1/2002 to 2/28/2005, NEXRAD underestimates precipitation in the Stroudsburg station by
10.0%. In this watershed, comparisons with gauge measurements indicate that NEXRAD
technology provides an alternative source of precipitation data. Note that NEXRAD
estimates are areal average precipitations of an approximately 4x4 km2 grid in contrast to
point measurements (-100 cm2) of gauge stations. Differences between the two are partly
attributed to variations of point measurements from areal average estimates of precipitations,
and partly due to measurement errors associated with the instrument itself.
15
-------
SWAT is calibrated both with rain gauge and NEXRAD as the precipitation data source.
The simulations are performed starting from 7/1/1970, i.e., with a 32 years of warm up
period to minimize the effect of initial conditions. For more details on calibration using
raingauge data, the readers are referred to Kalin and Hantush (2006a). The surface runoff lag
coefficient (surlag) had to be adjusted to improve the daily simulation performances. The CN
values calibrated based on raingauge data are reduced in all HRUs by 1.5 (for this specific
site) when NEXRAD was the data source. The volume of precipitation over the study
watershed estimated with NEXRAD is greater than that estimated from raingauges, and thus
this adjustment in CN had to be made. Experimentation with other parameters revealed no
further improvement in the model performance. Figure 5 compares SWAT simulations to
observed SF and estimated SR at the monthly time scale and SF at the daily time scale only
for NEXRAD. Rain gauge simulations not shown on the figure as hydrographs are very
similar to NEXRAD generated hydrographs.
in
I 6
ce
0)
o
>2 o
i_ z
w
• Observed
- SWAT - nexrad
CD r- 00 O5 O
12
_(0 g
?
w
- Observed
-SWAT- nexrad
16
-------
40
-Observed
-SWAT-Nexrad
55
Figure 5. Measured stream flow (SF) and estimated surface runoff (SR) plotted against
SWAT simulated counterparts for the calibration (7/1/2002-5/31/2004) period.
NEXRAD estimates are used as precipitation data source. Top two panels show
monthly results, and the bottom panel shows daily results.
As Figure 5 shows, overall the model performs well at both time scales. These results are
comparable to gauge driven simulations (Kalin and Hantush, 2006a). The NEXRAD driven
model performance statistics MBE, R2 and ENS are summarized in Table 2 and are well within
the calibration threshold values shown in Figure 3. From the results obtained, although with a
relatively smaller calibrated CN, it is reasonable to conclude that NEXRAD estimated
precipitation data is a good alternative to gauge measured precipitation data. The benefit of
using distributed rainfall data was negligible at both time scales for the particular application.
On the other hand, noticeable improvement is expected for stream flow estimates at the
interior subwatershed outlets with the use of distributed rainfall data over the available
raingauge data. In the next section, an 11 month period split sample data set is used to
validate the calibrated model.
Table 2. Mass Balance Error (MBE), Coefficient of Determination, R2, and Nash-Sutcliffe
Efficiency, ENS for Stream Flow (SF), Base flow (BF) and Surface Runoff (SR) During the
Calibration Period with NEXRAD as Precipitation Data Source. Values in Parenthesis
Indicate Values when Rain Gauges are Used as Precipitation Data Source
Stream
MBE
Monthly -3.8
(-3.8)
Daily
Flow (SF)
R ENS
0.85 0.84
(0.85) (0.83)
0.74 0.73
(0.74) (0.74)
Base Flow (BF)
MBE
% R ENS
-4.5 0.31 0.05
(-4.0) (0.30) (0.08)
Surface Runoff (SR)
MBE
% R ENS
-3.2 0.79 0.79
(-3.6) (0.77) (0.77)
17
-------
3.5 Model Validation
The period from 6/1/2004 to 4/30/2005 is chosen as the validation period. Simulations are
performed again starting from 7/1/1970 to curtail the effect of initial conditions. Figure 6
compares observed values to model simulations conducted with NEXRAD data. In the figure
the top two panels compare monthly SF and SR and the bottom figure is for daily SF results.
From visual inspection, one can conclude that model simulations match well with observed
SF and estimated SR.
12
b 6
ce
0)
o
- Observed
-SWAT- nexrad
15
JO
CO
_§
1
LL
E
ro
0)
12 -
9-
6-
3
• Observed
-SWAT-nexrad
SSSSSSS8888
Figure 6. Measured streamflow (SF) and estimated surface runoff (SR) plotted against
SWAT simulated counterparts for the validation period (6/1/2004-4/30/2005).
NEXRAD estimates are used as precipitation data source. Top and bottom panels
denote monthly and daily results, respectively.
Table 3 summarizes the model efficiencies at the monthly and daily time scales, respectively,
validated with NEXRAD and raingauge data. In the table "g" denotes gauge input and "n"
indicates NEXRAD input precipitation. In all cases, the model calibration criteria are
satisfied. As can be seen, MBE in SR decreased in absolute value from 15.1% to 7.9% with
the NEXRAD input simulations. In SF and BF, MBE dropped in absolute value to 5.6% from
13.5% and to 1.2% from 10.6%, respectively, all underestimations. Overall NEXRAD input
18
-------
monthly simulations outperformed raingauge based simulations. Daily statistics showed
mixed results.
Table 3. Mass Balance Error (MBE), Coefficient of Determination, R , and Nash-Sutcliffe
Efficiency, ENS for Stream Flow (SF), Base flow (BF) and Surface Runoff (SR)
During the Validation Period, g and n Represent Gauge and NEXRAD Input
Precipitations, Respectively
Precip.
source
3 §
o
S n
g
'ci
C) n
Stream Flow
MBE
/O/ \ r>2
\/0) /v
-13.5 0.81
-5.6 0.89
0.70
0.66
(SF)
ENS
0.66
0.75
0.64
0.62
Base Flow (BF)
MBE
% R ENS
-10.6 0.13 -0.26
-1.2 0.06 -0.40
Surface Runoff (SR)
MBE
O/ 1)2 77
/o K &NS
-15.1 0.83 0.73
-7.9 0.84 0.77
It is interesting to note that while at the monthly time scale NEXRAD had better
performance statistics, at the daily time scale gauge driven simulations had slightly better
statistics. Several reasons may have contributed to this. First, calibration efforts were more
focused to monthly time scale as part of the project goal. Secondly, NEXRAD calibration
was built on the calibrated parameters with gauge driven data, therefore there is a small bias.
An additional set of 5 months streamflow data in the period 5/1/2005 to 9/30/2005, which
was not available during model validation, was downloaded from the source and was utilized
to further evaluate the model performance. The model was run with NEXRAD data and
corresponding calibrated parameters during this post-validation period, and the results are
plotted in Figure 7. Interestingly, expanding the validation period 6/1/2004 - 4/30/2005 to
9/30/2005 results in slightly improved results; the MBE in absolute value decreased from
2.9% to 0.8%, daily R1
0.64.
increased to 0.68 from 0.66, and daily ENS increased from 0.62 to
19
-------
Figure 7. Measured daily streamflows plotted against SWAT simulated counterparts for the
post-validation period (5/1/2005-9/30/2005). NEXRAD estimates are used as
precipitation data source.
3.6 Conclusions
In this section the hydrology in the Pocono Creek watershed was modeled using a distributed
parameter watershed model, and in particular the potential for using NEXRAD as an
alternative source of precipitation data to raingauge stations was explored. The SWAT model
was calibrated and validated for the Pocono Creek watershed.
The model was first calibrated using precipitation data from two gauge stations located
outside the watershed boundary. Model performance was evaluated with computed model
efficiency statistics, i.e., mass balance error (MBE), coefficient of determination (R2), and
Nash-Sutcliffe Efficiency (ENS). Simulation results were promising at the monthly and daily
time scales.
As an alternative data source, the use of radar generated precipitation data (NEXRAD)
was appraised. NEXRAD data obtained from the Middle Atlantic River Forecast Center
(MARFC) provided hourly precipitation estimates over approximately 4x4 km2 grids.
Measured precipitation values at the two gauge stations were in close agreement with the
NEXRAD estimated values.
The SWAT model was fed with the spatially distributed NEXRAD precipitation data and
recalibrated by reducing the average curve numbers (C7V) by a value of 1.5 that were
obtained from model calibration when gauge precipitation data was used as input. The
streamflow and surface runoff hydrographs were in close agreement with measured
hydrographs. Efficiency statistics MBE, R2 and ENS were close to ones obtained with gauge
driven simulations. The model was then run for an additional 11 month period for validation
purposes both with gauge and NEXRAD data. NEXRAD generated smaller MBE. The values
of R2 and ENs were higher at the monthly time scale with NEXRAD, whereas at the daily
time scale gauge driven simulations resulted in improved measures of fit. An additional 5
months set of streamflow data was used to further evaluate the performance of the already
validated model.
20
-------
This section showed that spatially distributed precipitation data obtained through radar
reflectivity measurements provide a viable alternative to raingauge measurements. However,
estimation of precipitation with the help of radar still needs improvements. At present, data
from raingauges will continue to be relied upon to correct NEXRAD hourly digital
precipitation for mean field bias. Future refinement of this technology may provide a cost-
effective alternative source of precipitation data, and may reduce the need for the costlier
raingauges for large scale watershed applications.
21
-------
4 Model Prediction Uncertainty
4.1 Introduction
It is a common practice that the sequence of sensitivity analysis —» calibration —» validation
is followed during applications of distributed hydrologic/water quality models, yet rigorous
attempts are rarely made to assess model predictive uncertainty. Once the model is calibrated
with one set of data and validated with another set, it is used for predicting the impact of
management practices, land-use changes, and/or long-term climate changes, however, with
much less regard to uncertainty bands of predictions. The process of examining model
forecast reliability (post-validation) is an important consideration in development of
watershed management plans. Modeling uncertainty should be rigorously addressed in
development and application of models, especially when the model outcome might have
implications on policy, watershed planning and management, and when stakeholders are
affected by the decisions contingent upon model-supported analyses (NRC, 2001).
Implications of model uncertainty should be factored in the decision making process.
The analysis of uncertainty associated with utility of simulation models appears mostly in
the scientific, research literature (e.g., Spear and Hornberger, 1980; Beven and Binely, 1992;
Spear et al., 1994; van der Perk and Bierkens, 1997; Saltelli et al., 2000; Hossain et al., 2004;
Carpenter and Georgakakos, 2004; and Pebesma et al., 2005). While various approaches exist
for estimating distributed watershed/water quality model prediction uncertainty, careful
examination of the scientific literature reveals three dominant approaches. Amongst the three
approaches, the Bayesian approach is perhaps the most poular. Examples include Bayesian
Monte Carlo (e.g., Dilks et al., 1992; and van der Perk and Bierkens, 1997) and Generalized
Likelihood Uncertainty Estimation (GLUE) (Beven and Binely, 1992; Freer et al., 1996;
Beven and Freer, 2001; Hossain et al., 2004; and Hossain and Anagnostou, 2005)
methodologies. A second approach relies directly on Monte Carlo (MC) method to obtain an
ensemble of model outputs by independently sampling model parameters from prior
distributions based on ranges (i.e., with minima and maxima) derived from literature, or
based on new information gained from experience and model calibration (e.g., Binley et al.,
1991; Carpenter and Georgakakos, 2004; and Hantush and Kalin, 2005). A third approach
can be discerned in which the model noise or residual error is accounted for explicitly (e.g.,
Sorooshian and Dracup, 1980; and Pebesma et al., 2005). In this approach, model predictions
and observations are given as time series data, and attempts are made to fit a deterministic or
statistical relationship to the residual time series. Among these three approaches, the third
one has been given the least attention. Although GLUE methodology has gained a wider
acceptance, the selection of a suitable relationship for computing relative likelihoods
associated with the ensemble simulations is a matter of choice, and therefore subjective in
that sense. Both the GLUE methodology and the third approach implicitly account for the
four major sources of uncertainty: model structural uncertainty, parametric uncertainty,
measurement uncertainty, and rainfall variability.
In this section, we fit a time series model to the residuals (observed - predicted) of daily
SWAT model output. The focus on model performance at the daily time scale is because
22
-------
predictions of monthly median daily flows are required inputs to the PIFM model for wild
brown trout habitats. The time series model is combined with Monte Carlo-type simulation
(Latin Hypercube) to estimate uncertainty bounds for model predictions. Using a new set of
streamflow data (i.e., different from calibration and validation data), SWAT model forecast
performance is evaluated by comparison of the 90% confidence interval or (uncertainty band)
with observed values. One of the advantages of the time series approach, which is described
in the following section, is that the commonly adopted Gaussian and statistical independence
assumptions for model errors are relaxed by brute force application of time series analysis
and using a nonparametric probabilistic approach. Further, MC-type simulation is conducted
on rather a simple time series model for the residual errors as opposed to the computationally
demanding watershed model.
4.2 Structural and Model Parameters Uncertainty
4.2.1 Prediction Error
Figure 8(a) shows the time series of model prediction errors (observed-simulated) based on
simulations during the calibration and validation period (7/1/2002-4/30/2005). The error is
defined by this equation
st=0t~Pt (
where &t is model output noise or prediction error; ot is measured streamflow; andpt is model
computed streamflow; and t is time index in days. The st accounts for model structural
uncertainty, parametric uncertainty, measurement uncertainty, and errors in the rainfall input.
Model structural uncertainty is usually associated with imperfect knowledge, and generally
referred to as epistemic uncertainty. A close look at the daily errors time series (Figure 8a)
reveals systematic errors shown as large positive or negative spikes that are not random in
nature. In many instances a large (-) error is immediately followed by a big (+) error. One
reason might be timing errors in recording either the precipitation or the streamflow.
Another, probably the more reasonable, explanation is the inability of the SWAT model to
use sub daily rainfall data. When a big rainfall event happens at the very end of a day, the
actual watershed response will be about 1 day delayed compared to what SWAT actually
simulates with daily rainfall data. This will clearly result in overprediction (negative error)
and underprediction (positive error) on two successive days, respectively. Hence, to
minimize this type of error to some extent, we decided to use three days (3-day) moving
average of the errors in constructing our model. While one may argue as to why daily
simulations are of interest, especially when SWAT is best suited for long-term simulations
(monthly and annual time steps), there are two compelling reasons for this consideration.
First, monthly median daily flows are a required input to the PIFM habitat model. Secondly,
there is not sufficient monthly streamflow data to conduct a meaningful time series analysis
to model errors. Therefore, in addition to smoothing the effect of the large errors during
significant events, a 3-day averaging of streamflows is intended to provide reasonable
estimates of daily flows for the wild brown trout habitat model. Further, sufficient data of 3-
day moving averages will be available to construct a time series model of residual errors et.
23
-------
Henceforth, st refers to model residual errors based on 3-day averages of ot andpt, and thus
deemed (i.e., st) a surrogate to (rather than exact) daily model prediction errors.
(a)
i/r 20
CO
I 10
0 -
co
T3
CD
"o
T3
£
CD
CD
O -20
CM
O
CM
O
CO
O
CO
o
CO
o
CD
CD
CD
5)
CM
CM
CM
O
in
o
CD
CM
in
CD
CD
CO
0
QJ O
o *=
II
9- CD
T3 >
CD CO
e
CD
(b)
20
10 -
0 -
O)
£ -10 -
-20
CM
O
CM
O
CO
CO
o
CO
o
CO
o
CNi
CO
o
O
CD
O
55
o
CNi
CM
i^
CM
CD
in
o
3
CM
in
o
o
CO
Figure 8. Time series of model prediction errors (Observed - SWAT simulated) in the
calibration and validation time period (7/1/2002-4/30/2005): (a) daily simulations,
(b) 3-day moving average.
Figure 8(b) shows the difference between 3-day average of observed minus predicted
times series of stream flows during the period (7/1/2002-4/30/2005). The errors are not
totally eliminated but their magnitudes are significantly reduced. The Minitab® statistical
computer package is used to identify a time series model and construct reasonable forecasts
for the future of st. In general, a multiplicative seasonal autoregressive integrated moving
average model denoted by AREVIA (p,d,q) x (PJ),Q)s is fitted to the series of interest, which
is the model prediction error st in our case. The parameters/*, d, and q are, respectively, the
24
-------
orders of the autoregressive, difference, and moving average operators; P, D, and Q are,
respectively, the orders of the seasonal autoregressive, seasonal moving average, and
seasonal difference operators; and S is the seasonal period. Autocorrelation function (ACF)
and partial autocorrelation function (PACF) are utilized and the procedure outlined by
Shumway (1988), which derive its basis from Box and Jenkins (1970), is followed to identify
the best model. The objective of the model identification process is to produce identically
distributed, independent residuals wt, with a minimum variance aw2 arising from fitting some
ARIMA (p,d,q) x (PJ),Q)s to the et time series.
The upper two panels in Figure 9 (i.e., Figure 9(a)) show the ACF and PACF of model
errors, et. It is obvious that the series display nonstationarity accentuated by the slowly
decaying ACF as a function of lag and the large positive value of PACF at lag 1. The ACF of
the first difference in Figure 9(b) contains a fairly strong negative peak (-0.44) at lag-3 and
zero thereafter, indicating that a seasonal moving average with S = 3 (days) and Q =1 might
be appropriate. The decreasing peaks of the PACF at multiples of 3 are due to the seasonal
moving average component. Therefore, ARIMA (0,1,0) x (0,0,1)3 appears to be, at the
moment, a suitable selection among, perhaps, other competing models. Table 4 lists several
ARIMA models with computed variance of residuals, cr^,, and various goodness-of-fit
measures: final prediction error (FPF), Akaike's information criterion (AIC), and Bayesian
information criterion (BIC). It is clear that the most appropriate model choice is ARIMA
(1,1,1) x (0,0,1)3, which has the smallest values of cP, FPF, AIC, and BIC. The model was
fitted to the error time series depicted in Figure 8(b) (recall the 3 -day averages of the actual
model errors), and the equation for determining the residuals wt is
->B)Vst = l-&B3(l-0 B)wt (4.2)
where Vst = st - st-i is the first difference operator; Bst = st-i is the backward shift operator;
& 9, and 0 are the fitted parameters. Equation (4.2) can be expanded to yield the following
representation for st:
et =l + ^-i -st-2 +wt -0w,_, -0w,_3 +00w,_4 (4-3)
25
-------
(a)
Autocorrelation Function for (Qbs-Sirn)
(with 5% significance limits for the autocorrelations)
Partial Autocorrelation Function for (Obs-Sim)
(with 5% significance limits for the partial autocorrelations)
1.0-
0.8-
0.6-
g 0.4-
o 0.2-
0
8 -n?-
3
< -0.4-
-0.6-
-0.8-
-1.0-
jjjhl1r
-- ^'JJlUJi1.'
1,0-
0,8-
c 0.6-
1 0.4-
t 0.2-
o
< -0.2-
~m
33 -0,4-
a -0.6-
-0.8-
-1.0-
|__i._4H__r_,.______l__r_. ___________________ --t--T-v
15 10 15 20 25 30 35 40 45 50 55 60 65 70 75 15 10 15 20 25 30 35 40 45 50 55 60 65 70 75
Lag Lag
(b)
Autocorrelation Function for 1st difference (Obs-Sim) Partial Autocorrelation Function for 1st difference (Obs-Sim)
(with 5% significance limits for the autocorrelations) (with 5% significance limits for the partial autocorrelations)
1.0-
0.8-
0.6-
§ 0.4-
0 0,2-
b 0,0-
a -0.2 -
< -0.4-
-0,6-
-0.8-
-1.0-
1,0
0,8
c 0.6
'iii 0.4
0
t 0.2
0 0.0
< -0.2
•S -0,4
S -0,6
-0.8
-1.0
1 '
15 10 15 20 25 30 35 40 45 50 55 60 65 70 75 15 10 15 20 25 30 35 40 45 50 55 60 65 70 75
Lag Lag
(c)
ACF of Residuals for (Obs-Sim) - (l,l,l)x(0,0,l)3 PACF of Residuals for (Obs-Sim) - (l,l,l)x(QAl)3
(with 5% significance limits for the autocorrelations) (with 5% significance limits for the partial autocorrelations)
0,2
.9
1 0-0
I
-0,2
r; if. ,. . 1 ... ,i , ........ r.i.
• ' |l | ' M "If "i| [' >'!'!' Ii"
r
i 0.2-
•J3
B
E
<
:"."il".""":.h,".":,"":"t""""T""".n":;.";""t"rr".
• 1 L • M •• l|. .-|| M II. II"
2
£ -0.2-
16 12 18 24 30 36 42 48 54 60 66 72
Lag
16 12 18 24 30 36 42 48 54 60 66 72
Lag
Figure 9. Autocorrelation (ACF) and partial autocorrelation (PACF) functions of: (a)
prediction error st, (b) first difference of residual error Vst = st - st.i, (c) residuals
(wt) for ARIMA( 1,1,1) x (0,0,1)3. The lag is in units of days.
26
-------
where
-------
to the past by preserving and reproducing the statistical characteristics of observed model
errors.
4.2.2 Nonparametric Probability Distribution
Nonparametric methods are applied when parametric probability distributions fail to describe
the stochastic sequence under consideration. Lall (1995) reviewed applications of
nonparametric function estimation in hydrology. A probability density function of arbitrary
shape can be locally approximated by a nonparametric model. This is particularly important
when commonly used probability distributions (e.g., normal, log-normal, exponential, etc.)
poorly fit the frequency of the observed stochastic series. However, despite their successful
applications to hydrology, nonparametric approaches have the limitation that they do not
extrapolate beyond the range of the record, since the sequence of future wtis hypothesized to
have a similar nonparametric functional form of the fitted probability density function (PDF).
In parametric methods, the density function is estimated by assuming that data are drawn
from a known parametric family of distributions. The methods of moments, maximum
likelihood estimation, or any other methods are commonly used to estimate the parameters of
the chosen PDF. In nonparametric approaches, a kernel function is often used to generalize
the density function estimation. Given a set of n observations w\, w2,..., wn, a mathematical
expression of a univariate kernel probability density estimator is ((e.g., Kim and Valdes,
2005)
(44)
where x is a random variable which stands for vc; K is a kernel function; n is number of
observations; and h is a bandwidth that controls the variance of the kernel function. Kim and
Valdes (2005) provide a list of kernel functions typically used in hydrology, the most widely
used one being the Gaussian:
(4.5)
An optimal estimate for the bandwidth for a Gaussian kernel is provided by Kim and Valdes
(2005) citing Silverman (1986),
(4.6)
Where cris the standard deviation of the observed record, which in this case is equal to the
standard deviation of observed residuals,
-------
Note that the residual term wt in the error stochastic model (4.3) is not directly observed in
the data sample sj, 82, ..., sn. This value can be approximated by assuming that w2 = w\ = w0
= w.i = 0 and then computing from (4.3)
(4.7)
for k = 3, 4, ..., n. Recall that the sequence si, S2, ..., sn, which is used to construct the time
series model (4.3), is computed from Eq. (4.1) using SWAT model simulations and
corresponding streamflow measurements during the calibration and validation period.
Although the residuals wt have zero-mean and are independent, they do not follow any of
the known parametric distributions. Figure 10 shows a relatively poor fit to the frequency
histogram by Gaussian PDF assuming wt~ N(0, cP). The nonparametric fit (4.4) provides a
remarkably improved local fit to the observed histogram.
1.5
JL
f(x)
N(x)
0.5
-3
-2
-1
0
birij,x
1
Figure 10. Frequency histogram, Gaussian PDF, and the nonparametric model fit to the
computed series of residuals Wk. Both the histogram of residuals and normal
distribution has zero mean and standard deviationov,.
In the following section we now combine nonparametric random generation and Latin
Hypercube Sampling (LHS) with the stochastic model (4.3) to synthesize the ensemble of
model error time series wt.
4.2.3 Nonparametric Random Generation
Random observations may be generated from probability distributions by making use of the
fact that the cumulative probability function for any continuous variate is uniformly
distributed over the interval [0, 1] (e.g., Haan, 2002). Thus, for any random variable X with
), the variate FX(X)
29
-------
X
Fx(x)=\fx(u)du (4.8)
-00
is uniformly distributed over [0, 1]. A procedure for generating a random value x fmmfx(x)
starts with generating a random number,/? from the uniform distribution in the interval [0, 1],
then setting R = FX(X), and finally solving for x using the inverse relationship
x = Fxl(R) (4.9)
Unfortunately, an explicit solution to (4.9) is not always possible because FX(X) may not
be a simple function of x. The procedure for solving this equation is achieved in two steps.
First, FX(X) is obtained by substituting the right-hand-side of (4.4) forfx(x) in (4.8) and
commuting the order of summation with the integration operator. The term-wise integration
can be carried either numerically or analytically in terms of resulting error functions.
Secondly, for each randomly generated R, Eq. (4.9) is solved using any of the root searching
techniques. The simple bisection method is used to obtain the root.
A large sequence of independent, identically distributed model residuals wt are randomly
generated using the aforementioned procedure. Latin Hypercube Sampling (LHS) is
employed as an efficient and effective alternative to conventional Monte Carlo sampling
(MCS). The LHS (McKay et al. 1979) divides the CDF (i.e., the F*(x) function) into
segments of equal width from each of which a random variate R is generated. This way the
whole CDF is covered, but with smaller number of replications than by MCS. LHS is more
efficient than MCS at both ends of the CDF. The randomly generated sequence of wt is fed
into Eq. (4.3) to synthesize ensemble time series of st. As indicated at the outset, the
computed st values, in fact, estimate the 3-day average as opposed to daily model prediction
errors.
The generated ensemble of model errors together with the effect of precipitation
uncertainty were used to construct an ensemble of flow duration curves based on a present
land use map (Figure 2a). However, before doing so, model forecast performance was first
examined using the stochastic error model developed above and available streamflow data.
4.3 Model Forecast Evaluation
Figure 7 constitutes the first step in evaluating the model forecast performance during the
post-validation period 5/1/2005-9/30/2005. The simulated streamflows and measured
counterparts are daily and not 3-day averages. The relatively good fit indicated by extending
the validation period further corroborates the calibrated model, and suggests that no further
calibration is warranted. In general, the simulations somewhat overestimated measured
streamflows during high flow periods and underestimated measured values during low flow
periods.
30
-------
To examine model forecast quality, the period from 7/1/2002 to 4/30/2005 (1035 days) is
divided into two parts. The first 900 days (7/1/2002-12/16/2004) are used to construct the
error model and the remaining 135 days from (12/16/2004-4/30/2005) are used for
validation. The synthesized st values are subsequently added to SWAT model simulations
conducted during the same time period to obtain an ensemble of model forecast (500 time
series of streamflows),
ot=Pt+st (4.10)
where the A symbol denotes an estimate. ot is therefore considered as a surrogate to daily
forecast of streamflows. Negative values of ot computed from (4.10) are replaced with zero
flow rates.
Thus, for each t (i.e., day), there are 500 values of computed streamflows,^, available to
construct uncertainty band (confidence interval). Figure 11 compares measured streamflow
with ot. About 7% of observed values fall outside the 95% confidence band. Overall, the
median of simulated 3-day average streamflows compared fairly well with the measured
counterparts. The confidence band tends to be narrower for high flows and wider for low
flows. This indicates that the forecast is more reliable during storm events and least reliable
during smaller events and near base-flow conditions. To further examine model forecast
during low flow conditions, the forecast evaluation period is extended to 9/30/2005. No
significant storm events were recorded during the time period 5/1/2005-9/30/2005. While the
simulated median daily flows generally compare well with measured counterparts, the 95%
confidence band remains relatively wide (Figure 12). It is not clear if this is the result of
inadequate error model fit or poor SWAT model performance during low flow periods. In
general, watershed hydrologic models tend to perform poorer and have larger prediction
uncertainty during low flow events (e.g., Hantush and Kalin, 2005). This phenomenon is
largely attributed to the inadequacy of the models to capture the true intricate nature of
runoff-sub surf ace flow interactions for small events. It will be shown later that simulated low
flows generally have relatively high uncertainty.
-------
60
50
40
0)
D)
| 30
ro
20
10
0
12/16/2004
1/5/2005
1/25/2005
2/14/2005 3/6/2005 3/26/2005 4/15/2005
observed
• (SWAT+error) - median
--•(SWAT+error)-2.5%
•(SWAT+error)-97.5%
Figure 11. SWAT model ensemble forecast (median and 95% confidence band) and
measured streamflows during the validation period (12/16/2004-4/30/2005). A
total of 9 measurements lie outside the 95% confidence band (6.7 %). The
simulated and measured values are 3-day averages.
10
0)
D)
%
&
m
0
5/1/2005
* ' VV ' V i •'' '1JVV V './
5/21/2005 6/10/2005 6/30/2005 7/20/2005 8/9/2005 8/29/2005 9/18/2005
observed
-median
ARMA-normal 2.5%
•ARMA-normal 97.5%
Figure 12. SWAT model forecast (median and 95% confidence band) and measured
streamflows during the post-validation period (5/1/2005-9/30/2005). The
simulated and measured values are 3-day averages.
Since a rationale was set to use the simulated 3-day average streamflows as surrogates to
the daily flows, and for completeness, the ensemble forecast (median and 95% confidence
32
-------
band) and measured daily flows are plotted in Figure 13. The median of 3-day average of the
simulated streamflows appear to have grossly underestimated some of the measured peak
flows. On the other hand, low flows were simulated fairly well.
0 Ll
12/16/2004
1/5/2005
1/25/2005
2/14/2005
3/6/2005 3/26/2005 4/15/2005
observed
-(SWAT+error) - median
•(SWAT+error)-2.5%
•(SWAT+error)-97.5%
Figure 13. SWAT model 3 -day average ensemble forecast (median and 95% confidence
band) and measured daily streamflows during the validation period (12/16/2004-
4/30/2005). A total of 15 measurements fall outside the 95% confidence band (11
In the following subsection we present a methodology for computing SWAT model
response to uncertainty associated with precipitation.
4.4 Uncertainty Due to Precipitation Variability
Precipitation is the central driver of most hydrological processes. Model results are highly
sensitive to precipitation, and uncertainty in precipitation input may affect model output
variability. Other sources of uncertainties, such as the combined effect of parametric
uncertainty, model structural errors and measurement errors, are accounted for by the
stochastic model (4.3). Precipitation temporal variability is addressed here.
For a rational analysis of the potential future impacts of the projected land use alterations
on the hydrologic cycle of the Pocono Creek Watershed, it is essential to generate
meteorological data for future conditions mimicking past conditions. Generated data such as
precipitation, minimum and maximum air temperature, relative humidity, solar radiation and
wind speed need to preserve statistics of past observations. Admittedly, the supposition of
preserving the statistics of the past meteorological data ignores any global or local climate
change effects. Thus, in this analysis, it is assumed that the anticipated changes in the
33
-------
hydrologic regime are solely as a result of the projected modifications in the land use and
land cover. SWAT has a built-in weather generator WXGEN (Sharpley and Williams, 1990)
for atmospheric data generation. WXGEN relies on a first-order Markov-chain model to
define wet and dry periods. For detailed information on weather data generation
methodology, interested readers may refer to Kalin and Hantush (2006b). At the onset of this
study, Mount Pocono station had daily atmospheric data from 10/1/1999 to 6/30/2005, which
means that only 5 to 6 years of data was available to compute monthly statistics. At first
glance, this duration may sound inappropriate for computing climate data statistics as the
SWAT manual recommends the use of 20 years or longer data. Kalin and Hantush (2006b)
investigated the adequacy of relying on a shorter duration data by comparison with rain data
statistics pertinent to the Stroudsburg station, and showed that there are no significant
differences in the statistics of the Stroudsburg station whether the past 5 years or 20 years of
data is used. Therefore, for this specific application, it may be concluded that 5 years
precipitation data reasonably reflects the statistics of a 20 years record for Pocono Creek.
4.5 Monte Carlo Simulations
To take into account the precipitation uncertainty, WXGEN is used to generate 500 sets of
20-year long records of daily precipitation assumed to represent precipitation from 1/1/2005
to 12/31/2024 based on averaged historical rainfall statistics at the Mount Pocono and
Stroudsburg stations. A warm-up period from 1/1/1975 to 12/31/2004 of 30 years is used to
eliminate the effect of uncertain initial conditions. Each of the synthesized time series of
daily precipitation is fed into SWAT with the current land use map (Figure 2(a)) to obtain a
20-year long time series of daily streamflows. For each MC simulation, 3-day moving
average is computed from the simulated daily streamflows. The MC simulation is repeated
for a total number of 500, 20-year long time series of daily streamflows, from which the
ensemble of 3-day averaged flows are computed.
Assuming that future simulations errors preserve the statistical characteristics of the
historical record and follow the same stochastic model obtained in Eq. (4.3), an ensemble
forecast of streamflows (median and 95% confidence band) for the period 1/1/2005 to
12/31/2024 is constructed by generating an ensemble of 20-year long sequences of st which
then are added to the ensemble of MC simulations. In this manner, all sources of errors are
accounted for in the forecast. The sequence of independent, identically distributed wt are
randomly generated according to the procedure outlined in section 4.2.3, then used in
conjunction with Eq. (4.7) to synthesize an ensemble of 500 20-year long sequences of daily
(actually 3-day average) st values. The MC ensemble of 20-year long sequences of daily
streamflows generated above are then corrupted by the randomly generated errors (st) to
obtain the ensemble forecast of streamflows at the USGS gauge station according to Eq.
(4.10). It is worthwhile to note that the streamflows computed according to (4.10) constitute
what is expected to be observed or measured rather than actual flow rates.
Summary statistics of some of the model outcomes for the current land use scenario are
presented in Table 5. Mean, standard deviation (std), coefficient of variation (CV), median,
and 2.5th and 97.5th percentiles (95% confidence interval) of the 500 realizations for each
34
-------
flow statistics are given in the table to explicate uncertainty involved in the model outcomes
due to precipitation as well as structural, parametric, and observation errors.
Table 5. Summary Statistics of Ensemble Streamflow Forecast Based on Most
Recent land use (Figure 2a). Values in the Table are Obtained From
500 Time Series of Synthesized Daily Streamflows
Average Daily Average Monthly Average Monthly Average Annual
Flow (m3/s) Median Daily Maximum Daily Maximum Daily
Flow (mVs) Flow (m3/s) Flow (mVs)
(1) (2) (3J (4)
mean 2.42 2.10 7.12 17.39
std 0.203 0.182 0.411 1.93
CV 0.084 0.087 0.058 0.111
median 2.42 2.10 7.13 17.43
95% C.I [2.02,2.80] [1.75,2.45] [6.35,7.91] [13.91,21.82]
NOTE: Daily flow here refers to 3 day moving average flow.
In the table, summary statistics related to high flow conditions as well as long term
averages are given. The term "Average" in each column title denotes arithmetic average over
the 20-year simulation period. For example, for each realization (i.e., time series) out of 500,
daily flows are averaged over the 20-year simulation record to yield "average daily flow".
Thus, there are 500 such "average daily flow" values from which the mean, median, CV, and
95% confidence limits are computed. Average monthly-median of simulated daily
Streamflow in column (2) represents the arithmetic average over the simulation period of the
median daily stream flows for each month and as indicated above is a required input to the
PIFM wild brown trout habitat model. In column (3) the monthly maximum daily
streamflows averaged over the simulation period are given. Column (4) shows the annual
maximum daily streamflows averaged over the simulation period. Note that the summary
statistics in Table 5 assume that the watershed remains undisturbed in the next 20 years,
which obviously is contrary to what is anticipated. As such, they are hypothetical and are
intended solely to provide insights into the effect of uncertainties on the interpretation of
model predictions. Nevertheless, assuming that the land use/landscape in the watershed did
not undergo significant changes in the past twenty years, the tabulated results may provide
insights into the current flow conditions in Pocono Creek Watershed. The following remarks
can be made from Table 5:
35
-------
i. The average daily flow, average monthly median of daily flows, and average monthly
maximum daily flow have small CV values and relatively narrower 95% confidence
bands. Given all sources of uncertainty, these flow measures show small variability and
appear to be reliable measures of flow characteristics.
ii. The average annual maximum daily flow shows a relatively greater uncertainty, as the
larger CV and wider 95% confidence band indicate. The performance of flood control
measures designed based on annual maximum daily flows should factor in the
computed uncertainty.
iii. The low uncertainty associated with median monthly daily flow is rather encouraging
given that it is an important parameter for the wild brown trout habitat (PIFM) model.
The ensemble characteristics of the simulated flow duration curves are depicted in Figure
14. The figure plots the median of the flow duration curves and the lower (2.5th percentile)
and upper (97.5th percentile) limits of the 95% confidence interval. Each of the 500
synthesized o^time series yielded one duration curve. A flow duration curve is a plot of the
flow rate (say, x) versus probability of exceedance, P(X> x). The return period, T, defined as
the average recurrence interval between events equaling or exceeding a specified flow
magnitude, x, is the reciprocal of P(X > x): T = II P(X> x). High flows are associated with
small probability of exceedance, whereas low flows are associated with large probability of
exceedance. The ensemble of flow duration curves in Figure 14 can be interpreted as follows.
From the figure it can bee seen that for P(X > x) = 0.002, daily streamflow at the gauge
station is between 16 and 25 mVs with 95% confidence. Since the corresponding return
period is T = 1/0.002 = 500 days, this means that 500 day return period flow is between 16
and 25 m3/s with 95% confidence.
One may be lured into believing that uncertainty increases with increasing daily
streamflow as the 95% confidence band gets wider with decreasing probability of
exceedance, however, careful inspection of Figure 15 reveals that low flows have greater CV
values and, thus, show much higher uncertainty than medium range and high flows. It is
interesting to note that predicted streamflow rates ranging from about 2 to 11 (m3/s) daily
flow rates, corresponding to return periods of 2.5 and 50 days, respectively, have the smallest
CV values and, therefore, lowest uncertainty. Higher flows with a recurrence period longer
than 50 days have relatively higher uncertainty but much smaller than that associated with
low flows, with return periods smaller than 2 days. Of course, these results are specific to
Pocono Creek watershed.
36
-------
median
95% C.I- upper limit
95% C.I-lower limit
0.001
0.01 0.1
Probability of exceedance
Figure 14. Median and 95% confidence band for the MC simulated duration curve. The flow
duration curves are generated from 500 replicas of 20 years daily streamflows,
c n A
0 U'4
'ra
-2 03
"o
"c
m n o
Coeffici
o c
-^ K
n
O.C
^,
01
"^
^
^
-
t~
0
'"^~~1
01
—
-
I
~\~\~
!
0.1
—
-
r
~[~i
^
1
Probability of exceedance
0.01 0.1
1-(Probability of exceedance)
Figure 15. Coefficient of variation computed from ensemble of flow duration curves versus
probability of exceedance in the left panel P(X > x), and cumulative probability
P(X< x) in the right panel.
4.6 Conclusions
In this section model prediction capability was investigated using time series analysis and
Monte Carlo-type Latin Hypercube simulations. Structural errors, parametric uncertainty,
measurement errors, and rainfall variability together contribute to the SWAT model
37
-------
predictive uncertainty. To minimize the effect of systematic SWAT prediction errors during
significant events, 3-day averaged streamflows were considered as surrogates to daily flows.
A stochastic model, which lumps the first three sources of errors, was developed by fitting
ARIMA (1,1,1) x (0,0,1)3 time series model to 3-day averaged observed daily model errors
during the calibration and verification period. A split-sample approach was implemented to
construct and validate the error model. The stochastic error model allowed for constructing a
forecast band (the median and 95% confidence band) for the SWAT model simulations. It
was shown that most of the observed streamflows during the validation period (12/16/2004-
4/30/2005) were within the 95% confidence band. The constructed model forecast was
further evaluated using an additional set of measured streamflows during the period
5/1/2005-9/30/2005. Although the results showed consistency, the 95% confidence band was
relatively wide due to the relatively low-flow period.
The effect of temporal precipitation variability was also accounted for using historical
precipitation data recorded at the Mount Pocono and Stroudsburg stations. Synthesized
precipitation rates by the SWAT built-in weather generator WXGEN along with MC
simulation of the SWAT model and generated model errors all together were used to
construct an ensemble forecast of daily streamflows for the next twenty years. It was shown
that given all sources of model uncertainty, the average daily flow, average monthly median
of daily flows, and average monthly maximum daily flow could be reliably predicted. The
relatively low uncertainty associated with monthly median of daily flows indicate that this
flow measure may be used reliably as an input to the brown trout habitat model (PIFM).
Averaged over the simulated 20-years period, annual maximum daily flow showed relatively
greater variability. The ensemble of daily flow duration curves was summarized by the
median and 95% confidence band. The ensemble duration curve graph allows for the
estimation of the daily flow range with 95% confidence for a specified design recurrence
(return) period. SWAT simulated daily streamflow rates ranging from about 2 to 11 (m3/s)
showed least uncertainty. Computed daily streamflow rates below 2 m3/s had the greatest
uncertainty, whereas for higher than 11 m3/s, uncertainty was moderate. The higher
uncertainty associated with low flows was not surprising as watershed models tend to
perform relatively poorer during small events.
38
-------
5 Impact of Land Use Changes
5.1 Introduction
Over the past 25 years, the population of the United States has grown over 30% (USDC
Census Bureau, 2005). Naturally, such an ample growth in population leads to substantial
increase in urbanized areas and results in a degradation and loss of forested and agricultural
lands. Hydrologically, urbanization is accompanied by increased imperviousness in the
landscape. Increase in impervious areas as well as reduction in soil permeability results in
reduced infiltration rate, which means that more precipitation becomes surface runoff and
less water is recharged to ground water. When increased surface runoff is combined with the
effect of reduced surface roughness - a consequence of which is a shorter travel time - it is
inevitable to observe more frequent and more intense local flood events. Such alterations in
the flow regimes of streams and channels may lead to changes in channel morphology in the
form of channel widening and deepening. On the other hand, groundwater recharge reduction
results in the drop of groundwater levels and reduction of base flow to stream flow.
Groundwater depletion can be further amplified due to an increase in groundwater
withdrawal that accompanies population and economic growth. Consequently, stream flow is
further depleted as groundwater levels are increasingly lowered by increased pumping. This
could have undesirable ecological consequences not only due to limited available water, but
also because of increase in pollutant concentrations and limited capabilities of the streams to
dilute any toxic spills.
As indicated earlier, Monroe County, where the Pocono Creek is located, has the second
fastest growing population in the state of Pennsylvania. By the year 2020, a
60% increase in Monroe County population is forecasted, and it is further projected that more
than 70% of the Pocono Creek watershed will turn into commercial and residential areas,
currently standing around 6% (Figure 16). The impact of population growth and urbanization
on potential alterations in surface and groundwater regimes will be assessed through the
modeling framework that has been developed.
5.2 Present Land Use Map and Future Build Out
A simulation period of 20 years, from 1/1/2005 to 12/31/2024, is employed to study the
impact of changes in the land use on the hydrology of the watershed. Two land use scenarios
are considered. First scenario (LU2000) assumes that land use pattern of the year 2000, as
depicted in the top panel of Figure 16, is preserved during the simulation period. In the
second scenario (LU2020) it is assumed that the build out would occur in 2020 or after
(bottom panel in Figure 16). In both scenarios, land use pattern is assumed to remain the
same throughout the 20 year simulation period. In other words, land use pattern is assumed
time invariant during the course of the simulations. Ideally, it is desirable to let the land use
pattern change gradually during the course of the simulations. However, this is not an easy
task due to model limitations as this will require dynamic updating of land use related model
parameters with time. Recall that the watershed model was calibrated and validated based on
the land use pattern LU2000. The model setup for simulation of LU2000 is extended to
39
-------
perform the model simulations of LU2020.
watershed
Open Water
Perennial Ice/Snow
Low Intensity Residential
High Intensity Residential
Commercial/Industrial
Bare Rock/Sand/Clay
Quarries/Strip Mines
Transitional
Deciduous Forest
Evergreen Forest
Mixed Forest
Shrubland
Orchards/Vineyards
Grasslands/Herbaceous
Pasture/Hay
Row Crops
Small Grains
Fallow
Urban/Recreational Grasses
Woody Wetlands
Emergent Herbaceous Wetlands
LAHD USE
Residential-High Density
Residential-Medium Density
Residential-Low Density
Conmer c ial/Transportat ion
Pasture
Water
Wetlands
Forest
Agricultural Land-Row Crops
TOTAL
2000
0.05
3.53
2 .25
3 .52
43
80
1
3
85.23
0. 19
2020
0.77
8.00
22 .84
0.27
1.43
3.80
18.70
100.00 100.00
Figure 16. Distribution of land use pattern in the Pocono Creek watershed in year 2000 (top)
and the projected land use pattern for the year 2020 (bottom).
In simulating the effect of the two land use scenarios on streamflow characteristics all
sources of model output error are neglected except rainfall variability; i.e., only SWAT
model output bt is considered, with the ensemble of rainfall rates generated for the period
from 1/1/2005 to 12/31/2024. This is done primarily for two reasons. First, the error
stochastic model (4.3) applies to LU2000, and it is therefore not conclusive if a similar error
structure would apply to model simulations based on LU2020. Secondly, it is the relative
40
-------
changes in the streamflow characteristics due to projected land use changes that are of
concern rather than absolute predictions. Computed relative changes tend to have
significantly lower uncertainty, consequently, they are much more reliable than absolute
predictions of daily streamflows.
5.3 Monte Carlo Simulation
To take into account the precipitation uncertainly, we generated 50 sets of distinct daily
precipitation records of 20 years length, each assumed to represent precipitation from
1/1/2005 to 12/31/2024. Measured precipitation data from 1/1/1975 to 12/31/2004 is inserted
at the beginning of each record to obtain 50 precipitation input data files, each of which
contain 50 years of daily precipitation. For each scenario, model simulations are performed
for each of the 50 precipitation data files. A total of 2x50x50=5000 years of model
simulations are performed at the daily time scale. Again, the first 30 years (1/1/1975 to
12/31/2004) of each realization is ignored for model warm-up purposes. Only the last 20
years of each realization are retained for further analysis. The reason for using the same
measured precipitation data during the 30 year warm-up periods of all realizations is to
minimize uncertainty relevant to initial conditions. Experimentation revealed that 50
realizations were adequate, and that more realizations had a negligible effect on the model
outcome.
5.4 Predicted Changes in Streamflow
The MC simulations yielded 50 time series for each of the SWAT Model outputs that
include, among others, stream flow, base flow, and groundwater recharge. Summary statistics
of some of the model outcomes for the two scenarios, LU2000 and LU2020, are presented in
Table 6 along with relative changes that might be expected when the land use pattern in the
Pocono Creek Watershed changes from the one given in year 2000 to the one projected past
year 2020. The statistics in the far left column and associated results in the table correspond
to the variations of model output. Mean, standard deviation (std), coefficient of variation
(CV), median, and 5th and 95th percentiles (90% confidence interval) for each design flow are
computed from the resulting MC simulation and are given in the table to explicate
uncertainty involved in the model outcomes due to precipitation. In the table summary
statistics related to low and high flow conditions as well as long term averages are given. The
first two columns given in the table are essentially base flow (BF) and stream flow (SF)
averaged over the 20-year simulation period. The lowest computed flow occurring once
every 10 years averaged over a 7-consecutive-day period (7Q10) listed in column (3) is
widely used as a low-flow index in the United States (Smakhtin, 2001). This hydrologically-
based design flow parameter is also used to protect against chronic effects by requiring that
water quality criteria must be met at all times except during the 7Q10. Average monthly-
median of simulated daily SF in column (4) represents the arithmetic average over the
simulation period of the median daily stream flows for each month and as indicated above is
a required input to the PIFM wild brown trout habitat model. In column (5) and (6) the
monthly and annual maximum daily stream flows averaged over the simulation period are
given, respectively.
41
-------
Table 6. Summary Statistics of Computed Streamflow Characteristics at the USGS Gauge
Station. The Results are Derived From 50 MC Simulations Each 20 Years Long
mean
std
CV
median
[5, 95]%
LU2000
LU2020
% change
LU2000
LU2020
LU2000
LU2020
LU2000
LU2020
% change
LU2000
LU2020
% change
Average BF
(m3/s)
(1)
1.314
0.911
-30.6%
0.071
0.056
0.054
0.062
1.318
0.909
-31.0%
[1.205,1.421]
[0.828,0.993]
Average SF
(m3/s)
(2)
2.452
2.426
-1.1%
0.137
0.140
0.056
0.058
2.457
2.431
-1.1%
[2.232,2.682]
[2.200,2.664]
7Q10
(m3/s)
(3)
0.284
0.252
-11.1%
0.054
0.055
0.191
0.218
0.282
0.242
-14.4%
[0.197,0.364]
[0.167,0.326]
Average
Monthly
Median
Daily SF
(m3/s)
(4)
2.033
1.829
-10.1%
0.108
0.104
0.053
0.057
2.027
1.826
-9.9%
[1.852,2.207]
[1.650,2.002]
Average
Monthly
Maximum
Daily SF
(m3/s)
(5)
6.11
7.40
21.1%
0.450
0.526
0.074
0.071
6.17
7.47
21.0%
[5.35,6.71]
[6.52,8.17]
[21.8,21.7]%
Average
Annual
Maximum
Daily Flow
(m3/s)
(6)
17.47
20.86
19.4%
2.28
2.67
0.131
0.128
17.40
20.81
19.6%
[13.59,22.31]
[16.26,26.55]
[19.6,19.0]%
mean: Arithmetic average of 50 realizations
std: Standard deviation of 50 realizations
CV: Coefficient of variation (std/mean)
median: 50th percentile of 50 realizations
[5,95]%: 5th and 95th percentiles of the 50 realizations
BF: Base flow contribution to stream flow
SF: Stream flow
7Q10: Seven days average low flow with a 10 year
return period.
LU2000: Simulations performed using land use
coverage for year 2000
LU2020: Simulations performed using land use
projections for 2020
% change: (LU2020-LU2000)/LU2000
Table 1 reveals the following observations and associated conclusions:
i. The mean value of 50 realizations for the average of simulated BF is expected to decline
by 31%. This is simply due to reduced infiltration and consequently less recharge to the
ground water. This reduction can be further aggravated if the effect of increased
groundwater withdrawal due to higher water demand associated with population growth
is imposed. To explore the effect of potential increase for groundwater demand, a more
detailed groundwater flow model (MODFLOW) is being developed for the region by the
USGS. In the simulations, groundwater pumpage is ignored. Precipitation uncertainty
does not seem to impart much uncertainty on BF as CV is less than 0.1 for both
42
-------
scenarios. The median, 5th and 95th percentiles are also expected to decline by the same
rate as the mean.
ii. The average of simulated daily SF is relatively unaffected by urbanization. This can be
explained on the basis that reduction in BF is offset by the increase in surface runoff. Yet,
a negligible reduction (1.1%) in SF is observed. Mass balance of soil water dictates that
average evapotranspiration (ET) is not impacted much by land use changes. Indeed, close
examination of ET outputs from the model confirms this. Although forested areas have
high transpiration (T), impervious surfaces have higher solar reflectivity, and thus higher
evaporation (E). The reduction in (T) seems to be compensated by the increased (E).
Similar to BF, uncertainty in precipitation has insignificant impact on SF. It should be
noted that since the average of simulated daily SF is obtained by averaging simulated
daily streamflows over the entire simulation record, it therefore does not represent actual
daily flow conditions.
iii. On average, the computed 7Q10 is expected to decrease by almost 11%. This reduction
can be attributed to the reduction in BF as low flows in perennial streams, such as Pocono
Creek, are driven by BF which is predicted to decline by 31% on average over the study
watershed. With 90% confidence, 7Q10 can be expected to be within the interval [0.197,
0.364] m3/s with the current land use conditions. With the projected land use conditions
the 90% confidence interval for 7Q10 is [0.167, 0.326] m3/s. The CV is estimated to
increase by 14% due to changes in land use leading us to conclude that the combined
effect of precipitation uncertainty and projected land use alterations are likely to cause
higher uncertainty in 7Q10.
iv. Mean, median, 5th and 95th percentiles of average (i.e., averaged over the simulation
period) monthly median of simulated daily SF as shown in column (4) are all reduced by
about 10%. It is also clear from the table that the levels of uncertainties in both scenarios
do not differ considerably, as they are only marginal. From a hydrologist's perspective, it
is hard to reach a conclusion on how much risk this reduction poses to the brown trout
population in Pocono Creek. Simulations based on the PIFM habitat model and the
computed median flows will provide answers.
v. As expected, average monthly maximum of simulated daily SF increases as summarized
in column (5). The expected increase is around 21% for mean, median, and 5th and 95th
percentiles of the 50 realizations. Reasons for increase in peak flow magnitude as well as
peak flow frequencies as a result of urbanization are well known: reduced infiltration rate
and shorter travel times. The levels of uncertainties are about the same for both scenarios
and can be deemed negligible with CV values of 0.07.
vi. Similarly, the increase in the average of annual maximum daily flow is anticipated with
LU2020 projections. The mean and the median of average of annual maximum daily flow
are predicted to increase by about 19%. The uncertainty associated with this design flow
is the second highest after 7Q10, with a CV greater than 0.1. The 90% confidence band is
estimated to increase from [13.59, 22.31] to [16.26, 26.55] in units of mVs.
In Figure 17, spatial distributions of the 20-year simulation average of annual
groundwater recharge estimates [mm] at the subbasin scale are shown for the two scenarios.
These are only model estimates, as real measurements of groundwater recharge are not
43
-------
available. The figure delivers some important insights. There is a significant spatial variation
of groundwater recharge in the watershed. Lower portions of the watershed (associated with
lower elevations) have higher recharge rates than upper portions. When recharge estimates of
the scenarios are compared, it is seen that with few exceptions almost all sub watersheds
experience reduction of groundwater recharge. Another important observation is related to
the extent of spatial variation of recharge in the two scenarios. In LU2000, the computed
average recharge rates vary from 155 mm/year to 576 mm/year. In LU2020 the computed
range of groundwater recharge is from 131 mm/year to 432 mm/year. This indicates that the
projected land use change is expected to slightly reduce the spatial variation of groundwater
recharge over the Pocono Creek watershed.
LU2000
Annual Recharge (mm)
| [100-150
| [150-200
| 200 - 250
j 250 - 300
| 300 - 350
] 350 - 400
] 400 - 450
I 450 - 500
]500 - 550
I 550 - 600
Figure 17. Annual groundwater recharge distributions in the Pocono Creek watershed for the
two land use scenarios.
It should be noted that the results in Table 6 and Figure 17 assume that the build out in
LU2020 occurs by 2020. The results would still characterize the watershed response even if
the build out in LU2020 is presumed to occur after 2020. This may be true provided that
pattern and variability of future precipitation remains unaltered.
The predicted annual groundwater recharge distributions depicted in Figure 17 were
tabulated and passed on to USGS (Malvern, PA) to calibrate a quasi-three dimensional
groundwater flow model (MODFLOW) for the watershed. The computed monthly median of
daily flows were tabulated and passed on to the PA F&B Commission for use as input data to
the brown trout habitat model (PIFM). Annual groundwater recharge averaged over the
simulation record and spatially over the watershed was predicted to decline by 31%, from
424 to 292 mm/yr based on the LU2020 projections. The coefficient of variation (CV) of the
computed annual groundwater recharge did not exceed 0.08, indicating fairly reliable model
estimates thereof.
44
-------
5.5 Predicted Changes to Flow Frequency and Duration
The medians of daily duration curves based on LU2000 and LU2020 are plotted in Figure 18.
The probability of exceeding high flows and the risk for flood hazard are predicted to
increase based on LU2020. On the other hand, flow exceeded at least 90% of the time (< 0.8
m3/s) decreases in moving from LU2000 to LU2020, as depicted by the inner panel (Figure
18). Equivalently, this means that the chance for the flow to be less than or equal to a given
low flow threshold increases. Therefore, base flows are likely to decrease with the projected
land use changes. This is, of course, excluding the effect of the anticipated increase in
groundwater withdrawals which will further decline base flows. Also note from Figure 18
that median flow due to LU2020 is slightly smaller than the one due to LU2000.
30
25
I/T 20
r)
E
I 15
90% X95% 100%
001
0.01 0.1
Probability of exceedance
Figure 18. Median of the ensemble of the MC-simulated duration curves, bold thick line
corresponds to simulations with LU2020, thin line corresponds to simulations
with LU2000. The flow duration curves are generated from 50 replicas of 20
years of daily streamflows, ot.
5.6 Conclusions
The Pocono Creek watershed is threatened by high population growth anticipated over the
next two to four decades. Potential effects of population growth and urbanization have
heightened the need for implementing sustainable water resource management strategies in
the watershed. In this section, potential hydrological changes in Pocono Creek due to
anticipated build out in the watershed were investigated using the calibrated SWAT model
and Monte Carlo simulations. Simulations accounted for anticipated rainfall variability over
a 20 year period, with the current land use pattern and projected build out for the year past
2020, named LU2000 and LU2020, respectively. Simulation results revealed that on the
average, daily base flow is expected to be reduced by 31%. However, this is not expected to
45
-------
cause a noteworthy reduction in average daily streamflow (averaged over the entire
simulation period) as SWAT predicted that reduction in base flow would be balanced by
increased surface runoff. The computed low-flow index, 7Q10, is expected to decline by 11%
due to anticipated base-flow reduction. A metric for the sustainability of fish habitat, the
computed monthly median daily flow is expected to decline by 10% on the average. The
monthly peak of simulated daily flows and annual maximum daily flow are expected to
increase by 21% and 19%, respectively. The computed 7Q10 and average annual maximum
daily flow showed relatively higher uncertainty than the other flow characteristics. Projected
build out in the watershed is estimated to cause a significant decline in the annual
groundwater recharge rates. The decline in the watershed-averaged annual groundwater
recharge is predicted to be 31% based on LU2020 projections. The spatial variability of
groundwater recharge appears to diminish with the projected urbanization in the watershed.
In general, the likelihood that the watershed will experience high and low streamflows will
increase with the projected urbanization, as indicated by the median of the MC simulated
flow duration curves.
46
-------
6 Critical Source Areas
6.1 Introduction
It is now evident from model predictions that the projected land use changes in the Pocono
Creek watershed have the potential of increasing average annual maximum and average
maximum monthly flows, and reducing 7Q10 and average median monthly daily flows at the
watershed outlet. Informed management decisions may benefit from the identification of
portions of the watershed that have the highest contribution to the reduction/increase in the
quantity of interest. In a sense, preserving the land use of a particular area in the watershed
can be considered as a best management practice (BMP). From this point of view, the
problem can also be posed as identifying the locations of those BMPs that minimize the
predicted changes. Of course, socioeconomic and policy matters may interfere with and
preclude the implementation of a particular BMP, but knowing beforehand (i.e., before land
development) critical areas in the watershed provides science-based guidance to the planning
process.
There are limited applications in literature looking at the aspect of identifying or
apportioning areas within a watershed based on their relative contributions to flow at the
outlet. Saghafian and Khosroshahi (2005) address the same problem by focusing on flood
source areas and they also emphasize lack of applications of this type. The approach adopted
here, to be addressed shortly, has some similarities, however it differs from their work in two
ways: i) in addition to high flows, the focus is extended to low and median flow
characteristics (7Q10, monthly median daily flows), and ii) the focus here is not on
individual event hydrographs, but rather on continuous flow time series.
6.2 Methodology
Let us assume that a given watershed is divided into k number of subareas, which will be
referred as "elements" henceforth. There are two land use conditions, current (c) and future
(f). The model run at the daily time scale with the future land use scenario for a given
duration generates the flow time series Qf at the watershed outlet. Suppose that all the
elements have the future land use/cover with the exception of element y retaining its present
status. The generated flow time series at the outlet with this setup will be denoted as Qf'J .
For both flow time series, Qf and Qf'J , the flow characteristics of interest, say F is
computed and designated as Ff and FfJ respectively. The following two indexes are
defined to assess the relative impact of element y' on the flow characteristic F:
i i
Ff AjlA*
where Aj and Aw indicate areas of element y' and the whole watershed, respectively. The first
index, a/, signifies the absolute impact of element y on F. The second index, /^, which is
basically normalization of q with the percentage area of element y, measures the impact of
land use changes in the element on F assuming these very changes occur over the entire
47
-------
watershed. In other words, /?is suitable for assessing the impact of land use changes per unit
area. By computing a/ and $ for j = l,..,k one can rank the areas in the watershed from most
critical to least. It is important to note that the order of ranking could be different not only
depending on the index used but also depending on the flow characteristic of interest, F.
6.3 Results
As part of the modeling requirement, the Pocono Creek watershed was divided into 29
catchments, numbered from 1 to 29 with catchment 29 being the most downstream (Figure
19a). For management purposes this discretization might be too detailed. Hence, as an
alternative we divided the watershed into 7 larger areas which are combinations of the 29
subbasins (Figure 19b). It should be noted that the larger areas correspond to the
management areas in the pilot study (2001), with areas 4,5, and 6 in Figure 19(b) together
falling within the management area 3 of the pilot study (Figure 20). Area 7 in Figure 19(b)
overlaps with management areas 2 and 3 of the pilot study (Figure 20).
(a)
Figure 19. Watershed subdivisions used in determination of critical areas, (a) finer, (b)
coarser.
48
-------
Figure 20. Management areas in Pocono Creek Watershed according to Pocono Pilot Study.
Table 7 summarizes the computed indices, a and (3 with the coarser spatial scale for
7Q10, monthly median of daily flows and annual maximum daily flows, which divides the
watershed into 7 drainage areas, j=l,..,7. Also given in the table are the rankings of each area
for each index and flow characteristics. Figure 21 depicts these rankings on a gray scale
spectrum for visual purposes. One immediate observation is that the rankings based on a and
/? are quite consistent for monthly median of daily flows and annual maximum daily flow.
Areas having the highest and lowest impact on these two flow characteristics do not change
with the type of index. Management area 4 has the highest impact on the reduction of 7Q10
when a index is used, whereas management area 5 is the biggest contributor when /? index
issued.
Table 7. Computed Indices a and /? for the 7 Management Areas for 7Q10, Monthly
Median of Daily Flows and Annual Maximum Daily Flows
rank
1
2
3
4
5
6
7
7Q10
j
4
7
2
5
1
6
3
a\
8.58%
7.15%
6.18%
4.95%
3.60%
1 .95%
1 .47%
I
5
4
7
2
6
3
1
i.
0.54
0.51
0.41
0.35
0.22
0.17
0.17
monthly median of daily flow
I
4
7
2
1
5
6
3
a\
2.55%
2.51 %
2.03%
1 .97%
0.85%
0.73%
0.59%
I
4
7
2
5
1
6
3
&
0.15
0.14
0.12
0.09
0.09
0.08
0.07
annual maximum daily flow
i
7
4
1
6
2
3
5
a\
-5.48%
-3.88%
-2.47%
-1 .96%
-1 .69%
-0.56%
-0.50%
I
7
4
6
1
2
3
5
i
-0.32
-0.23
-0.22
-0.11
-0.10
-0.07
-0.05
49
-------
Figure 21. Ranking of the 7 management areas based on a (a, b, c) and /?(d, e, f) indices for
7Q10, monthly median of daily flows, and annual maximum daily flow,
respectively.
Based on a, area 4 (Figure 19(b)) is ranked first in terms of impact on 7Q10 and monthly
median of daily flows, whereas areas 6 and 3 ranked the lowest, 6th and 7th, respectively.
Comparison of Figure 21 with Figure 17 reveals that area 4, through which Bisbing Run and
Bulgers Run flow (Figure 20), contributes the highest annual groundwater recharge among
the seven areas depicted in Figure 19(a). Since base flow is directly related to groundwater
recharge, it is therefore expected that area 4 would have the highest impact on 7Q10. The
predicted significant reduction in the annual groundwater recharge in area 4 due to projected
land use changes (Figure 17) means a significantly greater fraction of precipitation would be
available for runoff. This may explain area 4 being ranked first in terms of impact of land use
changes on monthly median of daily flows, and ranked second for annual maximum daily
flows.
Area 7 is ranked second based on a for its impact on 7Q10 and monthly median of daily
flows, and first for annual maximum daily flows. Close proximity to the USGS streamflow
gauge station (i.e., being downstream most), relatively high groundwater recharge rates, and
the relatively larger area may be contributing to this ranking.
Significant portion of area 6 is occupied by wetlands (about 50%), which are maintained
in both LU2000 and LU2020 as flow through features with zero infiltration. It is assumed
that wetlands are preserved during land development. Therefore, groundwater recharge in
area 6 is slightly reduced, thus, leading to least impact on 7Q10 and lowest ranking based on
a. Area 3, where the Camel Back Ski Area is located and Coolmoor Creek flows (Figure 20),
is characterized by steep topography that is conducive to high runoff and low infiltration
50
-------
regardless of the landscape features. This explains the lowest and second lowest ranking of
area 3 with a.
The ranking based on /? index is useful in assessing impacts when only portions of each
of the 7 areas are planned to be developed. The most notable is area 5 which is ranked first
for 7Q10 using /? index. With the exception of a small, steep northern portion, the area is
topographically characterized by relatively low steepness and long slope length (Figure 1),
and contributes significant groundwater recharge (Figure 17). Developments planned
according to LU2020 may have the highest impact on 7Q10 if they occur within area 5.
6.4 Conclusions
The Pocono Creek watershed was divided into seven catchment areas and an index
methodology was presented to rank the catchments' areas based on their impact on key
streamflow characteristics due to anticipated land disturbances. The catchments' areas
conformed to the six management areas in the pilot study (2001). Two indices were
proposed. The first index, a, signifies the absolute impact of a particular catchment area on
the watershed response. The second index, /?, is a normalized by the percentage area of the
catchment, and therefore describes the impact of land use changes per unit area.
Using these indices, catchments' areas were ranked based on their potential to impact
changes in the streamflow characteristics due to projected build out in the watershed. With a
few exceptions, a and |3 indices produced similar rankings among the 7 catchment areas for
7Q10, monthly median of daily flow, and annual maximum daily flow. Groundwater
recharge, area, topographic features, and proximity to the streamflow gauge station may have
contributed to the ranking results. The most downstream catchment, area 7, ranked first in
terms of impact on annual maximum daily flows, and second in terms of impact on 7Q10 and
monthly median daily flows. Catchment area 4, associated with the highest groundwater
recharge, was ranked first and second based on a and |3 indices, respectively, with regard to
impact on 7Q10. In general, areas characterized by steep topography and significant wetlands
ranked low, some times the lowest, with respect to impact on changes in the three design
flows.
51
-------
7 Summary and Conclusions
The Pocono Creek watershed, which is located in Monroe County, PA, is threatened by high
population growth and urbanization. Of concern specifically is the potential impact of future
developments in the watershed on the reduction of base flow and the consequent risk for
degradation of wild brown trout habitats in Pocono Creek. Anticipated increase in
imperviousness, on the other hand, is expected to elevate the risk for floods and the
associated environmental damage. A watershed hydrologic modeling study was initiated by
the U.S. EPA in collaboration with the U.S. Geological Survey and the Pennsylvania Fish
and Boat Commission to assist Monroe County in planning for sustainable future
developments in the Pocono Creek watershed.
Good application track-record of the SWAT model and its suitability to address
watershed management problems led to the selection of the model to achieve the objective of
quantifying the impact of anticipated land use changes on the hydrologic response of the
Pocono Creek watershed. The model was successfully calibrated and validated for two
sources of precipitation data, raingauge measured and radar estimated (NEXRAD). Two
raingauge stations were available outside the watershed, but in close proximity to the
perimeter. Simulated daily and monthly streamflows at the outlet compared fairly well to the
observed values, for both sources of the rainfall data. The results reinforced the notion that
NEXRAD is an effective source of spatio-temporal precipitation data and a viable alternative
to the very costly installation, operation, and maintenance of a raingauge network. Future
modeling studies in ungauged watersheds can be conducted with NEXRAD rainfall data.
Stochastic error propagation analysis was conducted using time series analysis and MC
simulations to evaluate model predictive uncertainty. Model errors accounted for model
structure uncertainty, parametric uncertainty, measurement errors, and rainfall variability. It
was shown that the calibrated model was consistent in its forecast capability. MC simulations
over a 20-year long period yielded an ensemble of rating curves of which the median and
95% confidence band of daily streamflows were plotted. These plots allow for the
construction of the 95% confidence band for design flows corresponding to any given
recurrence or return period. SWAT simulated daily streamflow rates in the range 2 to 11
(m3/s) showed least uncertainty. Computed daily streamflow rates below 2 m3/s had the
greatest uncertainty, whereas for higher than 11 m3/s uncertainty was moderate.
MC simulation over a 20-year period showed that the average daily base flow is expected
to be reduced by 31% should the projected build out in the watershed occur. However, this is
not believed to cause a noteworthy reduction in average daily streamflow as the model
predicted that reduction in base flow would be balanced by increased surface runoff. The
computed low flow index, 7Q10 is expected to decline by 11%, and the monthly median
daily flow is expected to be reduced by 10% on the average. Monthly peak of simulated daily
flows and annual maximum daily flow were predicted to increase by 21% and 19% on the
average, respectively. Groundwater recharge rate averaged over the watershed was predicted
to decline by 31% due to the projected land use changes. The median of the MC simulated
flow duration curves showed that, in general, the likelihood that the watershed will
experience high and low streamflows will increase with the projected urbanization.
52
-------
An index methodology was developed to rank seven subwatersheds - composing the
modeled area of the Pocono Creek watershed - based on their relative impact on the
watershed response to land developments. The first index, a, signifies the absolute impact of
a particular catchment area on the watershed response. The second index, /?, is a normalized
by the percentage area of the catchment, and therefore describes the impact of land use
changes per unit area. With a few exceptions, a and |3 indices produced similar rankings
among the 7 catchment areas for 7Q10, monthly median of daily flow, and annual maximum
daily flow. Groundwater recharge, area, topographic features, and proximity to the
streamflow gauge station may have contributed to the ranking results. The most downstream
catchment, area 7, ranked first in terms of impact on annual maximum daily flows, and
second in terms of impact on 7Q10 and monthly median daily flows. Catchment area 4
associated with the highest groundwater recharge was ranked first and second for impact on
7Q10 based on a and |3 indices, respectively. Areas characterized by steep topography (area
3) and significant wetlands (area 6) ranked low, some times the lowest, with respect to
impact on the three design flows.
This model study predicted that low and high flows may, respectively, decrease and
increase significantly as a result of urbanization. Land use changes in that part of the
watershed where Bisbing Run and Bulgers Run flow through were predicted to have the
highest impact on the reduction in low flows due to anticipated reduction in the groundwater
recharge rates. The most downstream of the channel network, immediately upstream of the
USGS gauge station, ranked first in terms of impact on monthly median of daily flow and
maximum annual daily flow. Land disturbances in the topographically steep subwatershed, in
which Coolmoor Creek flows, and the area that contains wetlands, through which Cranberry
Creek flows, were predicted to generally have the least impact on the watershed response.
53
-------
8 References
Arnold, J.G., and N. Fohrer. SWAT2000: current capabilities and research opportunities in
applied watershed modeling. Hydrol. Process. 19: 563-572 (2005)
Arnold, J.G., Muttiah, R.S., Srinivasan, R., Allen, P.M. Regional estimation of base flow and
groundwater recharge in the Upper Mississippi river basin. J. Hydrol. 227(1): 21-40
(2000).
Arnold, J.G., and P.M. Allen. Automated methods for estimating base flow and ground water
recharge from streamflow records. J. Am. Water. Resour. As. 35(2): 411-424 (1999).
Arnold, J.G., Srinivasan, R., Muttiah, R.S., Williams, J.R. Large area hydrologic modeling
and assessment part I: Model development. Journal of the American Water Resources
Association 34(1): 73-89 (1998).
Arnold, J.G. and P.M. Allen. Estimating hydrologic budgets for three Illinois watersheds. J.
Hydrol. 176(1): 57-77 (1996).
Arnold, J.G., Allen, P.M., Muttiah, R., and Bernhardt, G. Automated base flow separation
and recession analysis techniques. Ground Water 33(6): 1010-1018 (1995).
Beven, K. and A. Binley. The future of distributed models: Model calibration and uncertainty
prediction. Hydrol. Process. 6: 279-298 (1992).
Beven, K. and J. Freer. Equifinality, data assimilation, and uncertainty estimation in
mechanistic modeling of complex environmental system using the GLUE methodology. J.
Hydrol. 249: 11-29(2001).
Binley A.M., KJ. Beven, A. Calver, and L.G. Watts. Changing Responses in Hydrology:
Assessing the uncertainty in physically based model predictions. Water Resour. Res. 27(6):
1253-1261 (1991).
Borah, D.K. "Watershed scale non-point source pollution models: mathematical bases." In
Proceedings of the 2002 ASAE Annual International Meeting/ClGR World Congress.,
Chicago, IL, 2002. Paper no. 022091.
Box, G.E.P., and G.M. Jenkins. "Time Series Analysis, Forecasting, and Control" San
Francisco: HoldenDay, 1970.
Carpenter, T.M. and K.P. Georgakakos. Impacts of parametric and radar rainfall uncertainty
on the ensemble streamflow simulations of a distributed hydrologic model. J. Hydrol. 298:
202-221 (2004).
Di Luzio, M., Srinivasan, R., Arnold, J.G., and Neitsch, S.L. ArcView interface for
SWAT2000, User's Guide, TWRI report TR-193. Texas Water Resources Institute, College
Station, Texas, 2002.
Dilks, D.W., R.P. Canale, and P.G. Meier. Development of Bayesian Monte Carlo techniques
for water quality model uncertainty. Ecolog. Model. 62: 149-162 (1992).
Eckhardt, K., and J.G. Arnold. Automated calibration of a distributed catchment model. J.
Hydrol. 251:103-109 (2001).
54
-------
Fohrer, N., Moller, D., and Steiner, N. An interdisciplinary modeling approach to evaluate
the effects of land use change. Phys. Chem. Earth 27:655-662 (2002).
Freer J. and K. Beven. Bayesian estimation of uncertainty in runoff prediction and the value
of data: An application of the GLUE approach. Water Resour. Res. 32(7): 2161-2173
(1996).
Green, W.H., and G.A. Ampt. Studies on soil physics, 1. The flow of air and water through
soils. J. Agr. Sci. 4:11-24 (1911).
Haan, C.T. "Statistical Methods in Hydrology." Second Edition, Iowa State Press, A
Blackwell Publishing Company. 496 pp. 2002.
Hantush, M.M., and L. Kalin. Uncertainty and sensitivity analysis of runoff and sediment
yield in a small agricultural watershed with KINEROS2. Hydro!. Sci. J 50(6): 1151-1171
(2005)
Hossain, F., E.N. Anagnostou, T. Dinku, and M. Borga. Hydrological model sensitivity to
parameter and radar rainfall estimation uncertainty. Hydrol. Process. 18: 3277-3291
(2004).
Hossain, F. and E. N. Anagnostou. Assessment of a probabilistic scheme for flood prediction.
J. Hydrolog. Engrng. 10(2): 141-150 (2005)
Jha, M., Gassman, P.W., Secchi, S., Gu, R., Arnold, J.G., 2003. "Hydrologic simulation of
the Maquoketa River watershed with SWAT." In Proceedings of the AWRA 2003 Spring
Specialty Conference, Kansas City, MO, 2003.
Jayakrishnan, R., Srinivasan, R., Santhi, C., and Arnold, J.G. Advances in the application of
the SWAT model for water resources management. Hydrol. Process. 19:749-762 (2005).
Kalin, L., and M.M. Hantush. Hydrologic Modeling of the Pocono Creek Watershed with
NEXRAD and Rain Gauge Data. J. Hydrolog. Engrng. In press (2006a).
Kalin, L., M.M. Hantush. "Effect of Urbanization on Sustainability of Water Resources in
the Pocono Creek Watershed." In V.P. Singh and Y.J. Xu (ed.) Coastal Hydrology and
Processes, by American Institute of Hydrology, Water Resources Publications, 2006b, 59-
70.
Kalin, L., M.M. Hantush. Evaluation of sediment transport models and comparative
application of two watershed models, EPA/600/R-03/139. Cincinnati, OH: U.S.
Environmental Protection Agency, 2003,
http://www.epa.gov/ORD/NRMRL/Pubs/600r03139/600r03139.pdf
Kim, T-W and J.B. Valdes. Synthetic generation of hydrologic time series based on
nonparametric random generation. J. Hydrol. Engrng. 10(5): 395-404 (2005).
McKay, M.D., Beckman, R.J., and Conover, W. J. A Comparison of Three Methods for
Selecting Values of Input Variables in the Analysis of Output from a Computer Code.
Technometrics 21(2):239-245 (1979).
Lall, U. Recent advances in nonparametric function estimation: Hydrologic applications. Rev.
Geophys. 33(S): 1093-1102 (1995).
55
-------
Muleta, M.K., and J.W. Nicklow. Sensitivity and uncertainty analysis coupled with automatic
calibration for a distributed watershed model. J. Hydrol. 306:127-145 (2005).
Nash, I.E., and Sutcliffe, J.V. River flow forecasting through conceptual models. Part 1: A
discussion of principles. J. Hydrol. 10:282-290 (1970).
Neitsch, S.L., Arnold, J.G., Kiniry, J.R., Williams, J.R., and King, K.W. Soil and Water
assessment Tool Theoretical Documentation, version 2000. TWRI report TR-191, Texas
Water Resources Institute, College Station, Texas, 2002a.
Neitsch, S.L., Arnold, J.G., Kiniry, J.R., Williams, J.R., and King, K.W. Soil and Water
Assessment Tool User's Manual, version 2000. TWRI report TR-192, Texas Water
Resources Institute, College Station, Texas, 2002b.
NRC. "Assessing the TMDL approach to water quality management'' Committee to Access
the Scientific Basis of the Total Maximum Daily Load Approach to Water Pollution
Reduction, Water Science and Technology Board, Division on Earth and Life Studies,
National Research Council, Washington, D.C, 2001.
Pebesma, E.J., P. Switzer, and K. League. Error analysis for the evaluation of model
performance: rainfall-runoff event time series data. Hydrol. Process. 19: 1529-1548 (2005).
Pocono Creek Pilot Study Draft Technical Report. Geology, geomorphology, geohydrology,
and surface water hydrology of the Pocono Creek Basin,
(Jan. 30, 2006). Delaware River Basin
Commission (DRBC), 2001.
Rallison, R.E., and Miller, N. "Past, present and future SCS runoff procedure." In Rainfall
runoff relationship. V.P. Singh, eds., Water Resources Publication, Littleton, Colo., 1981,
353-364.
Saghafian, B. and Khosroshahi, M. Unit response approach for priority determination of
flood source areas. J. Hydro. Engrng. 10(4): 270-277 (2005).
Sal eh, A. and B. Du. "Application of SWAT and HSPF within BASINS program for the
Upper North Bosque River watershed." In Proceedings of the 2002 ASAE Annual
International Meeting, 2002.
Saltelli, A., K. Chan, and E.M. Scott. 2000. "Sensitivity Analysis" John Wiley & Sons, Ltd,
2000, 475 pp.
Santhi, C., Arnold, J.G, Williams, J.R., Dugas, W.A., Srinivasan, R., and Hauck, L.M.
Validation of the SWAT model on a large river basin with point and nonpoint sources. J.
Am. Water. Resour. As. 37(5): 1169-1188 (2001).
Sharply, A.N., and J.R. Williams. Eds. EPIC-Erosion Productivity Impact Calculator, 1.
Model Documentation, U.S. Department of Agriculture, Agricultural Research Service,
Tech. Bull. 1768, 1990.
Shumway, R.H. "Applied Statistical Time Series Analysis" Prentice Hall, 1988, 379 pp.
Silverman, B.W. "Density Estimation for Statistics and Data Analysis" Chapman and Hall,
New York, 1986.
Smakhtin, V.U. Low Flow Hydrology: A Review. J. Hydrol. 240:147-186 (2001).
56
-------
Sophocleous, M., and S.M. Perkins. Methodology and application of combined watershed
and ground-water models in Kansas. JHydrol. 236:185-201 (2000).
Sorooshian, S. and J.A. Dracup. Stochastic parameter estimation procedures for hydrologic
rainfall-runoff models: Correlated and Heteroscedastic error cases. Water Resour. Res.
16(2): 430-442 (1980).
Sorooshian, S., Gupta, V.K., and Fulton, J.L. Evaluation of maximum likelihood parameter
estimation techniques for conceptual rainfall-runoff models: Influence of calibration data
variability and length on model credibility. Water Resour. Res. 19(1):251-259 (1983).
Spear, R.C. and G.M. Hornberger. Eutrophication in peel inlet-II. Identification of critical
uncertainties via generalized sensitivity analysis. Water Res. 14: 43-49 (1980).
Spear, R.C., T.M. Grieb, and N. Shang. Parameter uncertainty and interaction in complex
environmental models. Water Resour. Res. 30(11): 3159-3169 (1994).
Srinivasan, R., Ramanarayanan, T.S., Arnold, J.G., Bednarz, S.T., Large area hydrologic
modeling and assessment part II: model application. Journal of the American Water
Resources Association 34(1): 91-101 (1998).
Tripathi, M.P., Panda, R.K., Raghuwanshi, N.S., and Singh, R. Hydrological modeling of a
small watershed using generated rainfall in the soil and water assessment tool model.
Hydrol. Process. 18:1811-1821 (2004).
US Department of Commerce, Census Bureau, Statistical Abstract of the United States, 2001.
URL: http://www.census.gov/prod/2005pubs/06statab/pop.pdf, 2005.
USDA Soil Conservation Service. "National Engineering Handbook Section 4 Hydrology."
Chapters 4-10, 1972.
Van der Perk, M. and M.F.P. Bierkens. The identifiability of parameters in a water quality
model of the Biebrza, Poland. J. Hydrol. 200: 307-322 (1997).
Van Liew, M.W., Arnold, J.G., Gardbercht, J.D. Hydrologic simulation on agricultural
watersheds: choosing between two models. Trans. ASAE 46(6): 1539-1551 (2003).
Williams, J.R. Flood routing with variable travel time or variable storage coefficients. Trans.
ASAE 12(1): 100-103 (1969).
57
------- |