ProUCL Version 4.0 User Guide


£%	United States
Environmental Protection
m\ Agency
ProUCL Version 4.0
User Guide
RESEARCH AND DEVELOPMENT

-------
US EPA
&TSP.	Headquarters and Chemical Libraries
/\RctflV£-	EPA West Bldg Room 3340
c.p/0.	Mailcode 3404T	epa/6oo/r-o7/o38
/nn	1301 Constitution Ave NW	APni2oo7
($00'	... . . .	_	_ _		www.epa.gov
p	Washington DC 20004
^	202-566-0556
ProUCL Version 4.0
User Guide
Prepared for
Brian Schumacher
U.S. Environmental Protection Agency
Office of Research and Development
National Exposure Research Laboratory
Environmental Sciences Division
ZZ	Technology Support Center
^	Characterization and Monitoring Branch
rr	944 E. Harmon Ave.
o	Las Vegas, NV 89119
<5"
o

Prepared by
Anita Singh, Ph.D.'
Robert Maichle1
Ashok K. Singh, Ph.D.2
Sanghee E. Lee1
'Lockheed Martin Environmental Services
1050 E. Flamingo Road, Suite N240
Las Vegas, NV 89119
department of Hotel Management
University of Nevada, Las Vegas
Las Vegas, NV 89154
pository Material
Permanent Collection
Notice: Although this work was reviewed by EPA and approved for publication, it may not necessarily reflect official
Agency policy. Mention of trade names and commercial products does not constitute endorsement or
recommendation for use.
U.S. Environmental Protection Agency
Office of Research and Development
Washington, DC 20460
129cmb07

-------
Notice
The United States Environmental Protection Agency (EPA) through its Office of Research and
Development (ORD) funded and managed the research described here. It has been peer reviewed by the
EPA and approved for publication. Mention of trade names and commercial products does not constitute
endorsement or recommendation by the EPA for use.
ProUCL software was developed by Lockheed Martin under a contract with the EPA and is made
available through the EPA Technical Support Center in Las Vegas, Nevada. Use of any portion of
ProUCL that does not comply with the ProUCL User Guide is not recommended.
ProUCL contains embedded licensed software. Any modification of the ProUCL source code may violate
the embedded licensed software agreements and is expressly forbidden.
ProUCL software provided by the EPA was scanned with McAfee VirusScan and is certified free of
viruses.
With respect to ProUCL distributed software and documentation, neither the EPA nor any of their
employees, assumes any legal liability or responsibility for the accuracy, completeness, or usefulness of
any information, apparatus, product, or process disclosed. Furthermore, software and documentation are
supplied "as-is" without guarantee or warranty, expressed or implied, including without limitation, any
warranty of merchantability or fitness for a specific purpose.
111

-------
Executive Summary
Statistical inference, including both estimation and hypotheses testing approaches, is routinely used to:
1.	Estimate environmental parameters of interest, such as exposure point concentration
(EPC) terms, not-to-exceed values, and background level threshold values (BTVs) for
contaminants of potential concern (COPC),
2.	Identify areas of concern (AOC) at a contaminated site,
3.	Compare contaminant concentrations found at two or more AOCs of a contaminated site,
4.	Compare contaminant concentrations found at an AOC with background or reference
area contaminant concentrations, and
5.	Compare site concentrations with a cleanup standard to verify the attainment of cleanup
standards.
Several exposure and risk management and cleanup decisions in support of United States Environmental
Protection Agency (EPA) projects are often made based upon the mean concentrations of the COPCs. A
95% upper confidence limit (UCL95) of the unknown population (e.g., an AOC) arithmetic mean (AM),
fi\, can be used to:
•	Estimate the EPC term of the AOC under investigation,
•	Determine the attainment of cleanup standards,
•	Compare site mean concentrations with reference area mean concentrations, and
•	Estimate background level mean contaminant concentrations. The background mean
contaminant concentration level may be used to compare the mean of an area of concern.
It should be noted that it is not appropriate to compare individual point-by-point site
observations with the background mean concentration level.
It is important to compute a reliable and stable UCL95 of the population mean using the available data.
The UCL95 should approximately provide the 95% coverage for the unknown population mean, Based
upon the available background data, it is equally important to compute reliable and stable upper
percentiles, upper prediction limits (UPLs), or upper tolerance limits (UTLs). These upper limits based
upon background (or reference) data are used as estimates of BTVs, compliance limits (CL), or not-to-
exceed values. These upper limits are often used in site (point-by-point) versus background comparison
evaluations.
Environmental scientists often encounter trace level concentrations of COPCs when evaluating sample
analytical results. Those low level analytical results cannot be measured accurately and, therefore, are
typically reported as less than one or more detection limit (DL) values (also called nondetects). However,
practitioners need to obtain reliable estimates of the population mean, ///, and the population standard
deviation, oh and upper limits including the UCL of the population mass or mean, the UPL, and the UTL
based upon data sets with nondetect (ND) observations. Additionally, they may have to use hypotheses
testing approaches to verify the attainment of cleanup standards, and compare site and background
concentrations of COPCs as mentioned above.
Background evaluation studies, BTVs, and not-to-exceed values should be estimated based upon
defensible background data sets. The estimated BTVs or not-to-exceed values are then used to identify the
COPCs, to identify the site AOCs or hot spots, and to compare the contaminant concentrations at a site
V

-------
with background concentrations. The use of appropriate statistical methods and limits for site versus
background comparisons is based upon the following factors:
1. Objective of the study,
2. Environmental medium (e.g., soil, groundwater, sediment, air) of concern,
3. Quantity and quality of the available data,
4. Estimation of a not-to-exceed value or of a mean contaminant concentration,
5. Pre-established or unknown cleanup standards and BTVs, and
6. Sampling distributions (parametric or nonparametric) of the concentration data sets
collected from the site and background areas under investigation.
In background versus site comparison evaluations, the environmental population parameters of interest
may include:
• Preliminary remediation goals (PRGs),
• Soil screening levels (SSLs),
• RBC standards,
• BTVs, not-to-exceed values, and
• Compliance limit, maximum concentration limit (MCL), or alternative concentration
limit (ACL), frequently used in groundwater applications.
When the environmental parameters listed above are not known or pre-established, appropriate upper
statistical limits are used to estimate those parameters. The UPL, UTL, and upper percentiles are used to
estimate the BTVs and not-to-exceed values. Depending upon the site data availability, point-by-point site
observations are compared with the estimated (or pre-established) BTVs and not-to-exceed values. If
enough site and background data are available, two-sample hypotheses testing approaches are used to
compare site concentrations with background concentrations levels. These statistical methods can also be
used to compare contaminant concentrations of two site AOCs, surface and subsurface contaminant
concentrations, or upgradient versus monitoring well contaminant concentrations.
The ProUCL Version 4.0 (ProUCL 4.0) is an upgrade of ProUCL Version 3.0 (EPA, 2004). ProUCL 4.0
contains statistical methods to address various environmental issues for both full data sets without
nondetects and for data sets with NDs (also known as left-censored data sets).
ProUCL 4.0 contains:
1. Rigorous parametric and nonparametric (including bootstrap methods) statistical methods
(instead of simple ad hoc or substitution methods) that can be used on full data sets
without nondetects and on data sets with below detection limit (BDL) or ND
observations.
2. State-of-the-art parametric and nonparametric UCL, UPL, and UTL computation
methods. These methods can be used on full-uncensored data sets without nondetects and
also on data sets with BDL observations. Some of the methods (e.g., Kaplan-Meier
method, ROS methods) are applicable on left-censored data sets having multiple
detection limits. The UCL and other upper limit computation methods cover a wide range
of skewed data sets with and without the BDLs.
vi

-------
3. Single sample (e.g., Student's t-test, sign test, Proportion test, Wilcoxon Singed Rank
test) and two-sample (Student's t-test, Wilcoxon-Mann-Whitney test, Gehan test, quantile
test) parametric and nonparametric hypotheses testing approaches for data sets with and
without ND observations. These hypothesis testing approaches can be used to: verify the
attainment of cleanup standards, perform site versus background comparisons, and
compare two or more AOCs, monitoring wells (MWs).
4. The single sample hypotheses testing approaches are used to compare site mean, site
median, site proportion, or a site percentile (e.g., 95th) to a compliance limit (action level,
regularity limit). The hypotheses testing approaches can handle both full-uncensored data
sets without nondctects, and left-censored data sets with nondetects. Simple two-sample
hypotheses testing methods to compare two populations are available in ProUCL 4.0,
such as two-sample t-tests, Wilcoxon-Mann-Whitney (WMW) Rank Sum test, quantile
test, Gchan's test, and dispersion test. Variations of hypothesis testing methods (e.g.,
Levene's method to compare dispersions, generalized WRS test) are easily available in
most commercial and freely available software packages (e.g., MINITAB, R).
5. ProUCL 4.0 also includes graphical methods (e.g., box plots, multiple Q-Q plots,
histogram) to compare two or more populations. ProUCL 4.0 can also be used to display
a box plot of one population (e.g., site data) with compliance limits or upper limits (e.g.,
UPL) of other population (background area) superimposed on the same graph. This kind
of graph provides a useful visual comparison of site data with a compliance limit or
BTVs. Graphical displays of a data set (e.g., Q-Q plot) should be used to gain insight
knowledge contained in a data set that may not otherwise be clear by looking at simple
test statistics such as t-test, Dixon test statistic, or Shapiro-Wilk (S-W) test statistic.
6. ProUCL 4.0 can process multiple contaminants (variables) simultaneously and has the
capability of processing data by groups. A valid group column should be included in the
data file.
7. ProUCL 4.0 provides GOF test for data sets with nondetects. The user can create
additional columns to store extrapolated (estimated) values for nondetects based upon
normal ROS, gamma ROS, and lognormal ROS (robust ROS) methods.
ProUCL 4.0 retains all of the capabilities of ProUCL 3.0, including goodness-of-fit (GOF) tests for a
normal, lognormal, and a gamma distribution and computation of UCLs based upon full data sets without
nondetects. Graphical displays and GOF tests for data sets with BDL observations have also been
included in ProUCL 4.0. It is re-emphasized that the computation of appropriate UCLs, UPLs, and other
limits is based upon the assumption that the data set under study represents a single a single population.
This means that the data set used to compute the limits should represent a single statistical population. For
example, a background data set should represent a defensible background data set free of outlying
observations. ProUCL 4.0 includes simple and commonly used classical outlier identification procedures,
such as the Dixon test and the Rosner test. These procedures are included as an aid to identify outliers.
These simple classical outlier tests often suffer from masking effects in the presence of multiple outliers.
Description and use of robust and resistant outlier procedures is beyond the scope of ProUCL 4.0.
It is suggested that the classical outlier procedures should always be accompanied by graphical displays
including box plots and Q-Q plots. The use of a Q-Q plot is useful to identify multiple or mixture samples
that might be present in a data set. However, the decision regarding the proper disposition of outliers (e.g.,

-------
to include or not to include outliers in statistical analyses; or to collect additional verification samples)
should be made by members of the project team and experts familiar with site and background conditions.
Guidance on the disposition of outliers and their accommodation in a data set by using a transformation
(e.g., lognormal distribution) is discussed in Chapter 1 of this User Guide.
ProUCL 4.0 has improved graphical methods, which may be used to compare the concentrations of two or
more populations such as:
1. Site versus background populations,
2. Surface versus subsurface concentrations,
3. Concentrations of two or more AOCs, and
4. Identification of mixture samples and/or potential outliers
These graphical methods include multiple quantile-quantile (Q-Q) plots, side-by-side box plots, and
histograms. Whenever possible, it is desirable to supplement statistical results with useful visual displays
of data sets. There is no substitute for graphical displays of a data set. For example, in addition to
providing information about the data distribution, a normal Q-Q plot can also help identify outliers and
multiple populations that may be present in a data set. On a Q-Q plot, observations well separated from
the majority of the data may represent potential outliers, and jumps and breaks of significant magnitude
may suggest the presence of observations from multiple populations in the data set. It is suggested that
analytical outlier tests (e.g., Rosner test) and goodness-of-fit (G.O.F.) tests (e.g., SW test) should always
be supplemented with the graphical displays such as Q-Q plot and box plot.
ProUCL 4.0 serves as a companion software package for Calculating Upper Confidence Limits for
Exposure Point Concentrations at Hazardous Waste Sites (EPA, 2002a) and Guidance for Comparing
Background and Chemical Concentrations in Soil for CERCLA Sites (EPA, 2002b). ProUCL 4.0 is also
useful to verify the attainment of cleanup standards (EPA, 1989). ProUCL 4.0 can also be used
to perform two-sample hypotheses tests and to compute various upper limits often needed in
groundwater monitoring applications (EPA, 1992 and EPA, 2004).

-------
Acronyms and Abbreviations
% NDs
Percentage of Nondetect observations
ACL
A-D, AD
AM
AOC
alternative concentration limit
Anderson-Darling test
arithmetic mean
area(s) of concern
BC
BCA
BDL
BTV
BW
Box-Cox-type transformation
bias-correctcd accelerated bootstrap method
below detection limit
background threshold value
Black and White (for printing)
CERCLA
CL
CLT
CMLE
COPC
cv
Comprehensive Environmental Response, Compensation, and
Liability Act
compliance limit
central limit theorem
Cohen's maximum likelihood estimate
contaminant(s) of potential concern
Coefficient of Variation
DL
DL/2 (t)
DL/2 Estimates
DQO
detection limit
UCL based upon DL/2 method using Student's t-distribution
cutoff value
estimates based upon data set with nondetects replaced by half
of the respective detection limits
data quality objective
EA
EDF
EM
EPA
EPC
exposure area
empirical distribution function
expectation maximization
Environmental Protection Agency
exposure point concentration
ix

-------
FP-ROS (Land)	UCL based upon fully parametric ROS method using Land's H-
statistic
Gamma ROS (Approx.) UCL based upon Gamma ROS method using the gamma
Gamma ROS (BCA)
GOF, G.O.F.
approximate-UCL method
UCL based upon Gamma ROS method using the bias-corrcctcd
accelerated bootstrap method
goodness-of-fit
H-UCL
UCL based upon Land's H-statistic
ID
IQR
identification code
interquartile range
K
KM (%'
KM (Chebyshev)
KM (t)
KM (z)
K-M, KM
K-S, KS
Next K, Other K, Future K
UCL based upon Kaplan-Meier estimates using the percentile
bootstrap method
UCL based upon Kaplan-Meier estimates using the Chebyshev
inequality
UCL based upon Kaplan-Meier estimates using the Student's t-
distribution cutoff value
UCL based upon Kaplan-Meier estimates using standard normal
distribution cutoff value
Kaplan-Meier
Kolmogorov-Smirnov
LN
Log-ROS Estimates
lognormal distribution
estimates based upon data set with extrapolated nondetect values
obtained using robust ROS method
MAD
Maximum
MCL
Mean
Median
Minimum
MLE
MLE (t)
Median Absolute Deviation
Maximum value
maximum concentration limit
classical average value
Median value
Minimum value
maximum likelihood estimate
UCL based upon maximum likelihood estimates using Student's
t-distribution cutoff value

-------
MLE (Tiku)	UCL based upon maximum likelihood estimates using the
Tiku's method
Multi Q-Q	multiple quantilc-quantile plot
MVUE	minimum variance unbiased estimate
ND	nondetect or nondetects
NERL	National Exposure Research Laboratory
NumNDs	Number of Nondetects
NumObs	Number of Observations
ORD	Office of Research and Development
PRG	preliminary remediation goals
Q-Q	quantile-quantilc
RBC	risk-based cleanup
RCRA	Resource Conservation and Recovery Act
ROS	regression on order statistics
RU	remediation unit
S	substantial difference
SD, Sd, sd	standard deviation
SSL	soil screening levels
S-W, SW	Shapiro-Wilk
UCL	upper confidence limit
UCL95, 95% UCL	95
-------
Table of Contents
Executive Summary	v
Acronyms and Abbreviations	ix
Introduction	1
The Need for ProUCL Software	1
ProUCL 4.0 Capabilities	2
ProUCL Applications	4
ProUCL Methods	5
Background versus Site Comparison Evaluations	7
Graphical Capabilities	11
Technical Guide	12
Minimum Hardware Requirements	12
Software Requi rements	12
Installation Instructions	13
Getting Started	13
Chapter 1 Guidance on the Use of Statistical Methods and Associated Minimum
Sample Size Requirements	15
1.1	Background Data Sets	16
1.2	Site Data Sets	17
1.3	Discrete Samples or Composite Samples?	18
1.4	Upper Limits and Their Use		18
1.5	Point-by-Point Comparison of Site Observations with BTVs, Compliance Limits,
and Other Threshold Values	21
1.6	Hypothesis Testing Approaches and Their Use	23
1.6.1	Single Sample Hypotheses - BTVs and Not-to-Exceed Values are Known
(Pre-established)	23
1.6.2	Two-sample Hypotheses - When BTVs and Not-to-Exceed
Values arc Unknown	24
1.7	Minimum Sample Size Requirements	25
1.7.1	Minimum Sample Size for Estimation and Point-by-Point Site
Observation Comparisons	26
1.7.2	Minimum Sample Size Requirements for Hypothesis Testing Approaches	26
1.8	Sample Sizes for Bootstrap Methods	27
1.9	Statistical Analyses by a Group ID	27
1.10	Use of Maximum Detected Value as Estimates of Upper Limits	27
1.10.1	Use of Maximum Detected Value to Estimate BTVs and
Not-to-Exceed Values	28
1.10.2	Use of Maximum Detected Value to Estimate EPC Terms	28
1.10.3	Samples with Nondetect Observations	29
1.10.4	Avoid the Use of DL/2 Method to Compute UCL95	29
1.10.5	Samples with Low Frequency of Detection	30
1.11	Other Applications of Methods in ProUCL 4.0	30
1.11.1	Identification of COPCs	31
1.11.2	Identification of Non-Compliance Monitoring Wells	31
1.11.3	Verification of the Attainment of Cleanup Standards, Cs	31
-------
32
33
33
33
34
34
35
36
36
37
38
39
39
41
41
42
43
43
45
47
47
49
53
55
56
58
60
60
61
65
67
67
71
74
75
75
78
80
.81
.84
.86
87
1.11.4 Using BTVs (Upper Limits) to identify Hot Spots	
Entering and Manipulating Data	
Creating a New Data Set.	
Opening an Existing Data Set	
Input File Format	
Number Precision	
Entering and Changing a Header Name	
Saving Files	
Editing	
Handling Nondetect Observations	
Caution	
Summary Statistics for Data Sets with Nondetect Observations....
Handling Missing Values	
User Graphic Display Modification		
2.12.1	Graphics Tool Bar	
2.12.2	Drop-Down Menu Graphics Tools	
Select Variables Screen	
Select Variables Screen	;	
3.1.1 Graphs by Groups	
Summary Statistics	
Summary Statistics with Full Data Sets	
Summary Statistics with NDs	
Estimating Nondetects Using ROS Methods	
Graphical Methods (Graph)	
Box Plot	
Histogram	
Multi-QQ'.	
6.3.1	Multi-QQ (Full)	
6.3.2	Multi-QQ (with NDs)	
Simple Classical Outlier Tests 	
Outlier Test for Full Data Set	
Outlier Test for Data Set with NDs	
Goodness-of-Fit (G.O.F.) Tests	
ROS Estimated (Est.) NDs - Saving Extrapolated NDs	
Goodness-of-Fit Tests with Full Data Sets	
8.2.1	GOF Tests for Normal and Lognormal Distribution	
8.2.2	GOF Tests for Gamma Distribution	
Goodness-of-Fit Tests Excluding NDs	
8.3.1	Normal and Lognormal Options	
8.3.2	Gamma Distribution Option	
Goodness-of-Fit Tests with Log-ROS Estimates	
8.4.1 Normal or Lognormal Distribution (Log-ROS Estimates)
-------
8.4.2 Gamma Distribution (Log-ROS Estimates)	90
8.5	Goodness-of-Fit Tests with DL/2 Estimates	92
8.5.1	Normal or Lognormal Distribution (DL/2 Estimates)	93
8.5.2	Gamma Distribution (DL/2 Estimates)	96
8.6	Goodness-of-Fit Tests Statistics	98
Chapter 9 Single Sample and Two-Sample Hypotheses Testing Approaches	101
9.1	Single Sample Hypotheses Tests	101
9.1.1	Single Sample Hypothesis Testing for Full Data without Nondetects	102
9.1.1.1	Single Sample t-Test	102
9.1.1.2	Single Sample Proportion Test	104
9.1.1.3	Single Sample Sign Test	106
9.1.1.4	Single Sample Wilcoxon Signed Rank (WSR) Test	108
9.1.2	Single Sample Hypothesis Testing for Data Sets with Nondetects	Ill
9.1.2.1	Single Proportion Test on Data Sets with NDs	111
9.1.2.2	Single Sample Sign Test with NDs	113
9.1.2.3	Single Sample Wilcoxon Signed Rank Test with NDs	115
9.2	Two-Sample Hypotheses Testing Approaches	118
9.2.1	Two-Sample Hypothesis Tests for Full Data	119
9.2.1.1	Two-Sample t-Test without NDs	121
9.2.1.2	Two-Sample Wilcoxon-Mann-Whitney (WMW) Test without NDs 123
9.2.1.3	Two-Sample Quantile Test for Full Data without NDs	126
9.2.2	Two-Sample Hypothesis Testing for Data Sets without Nondetects	129
9.2.2.1	Two-Sample Wilcoxon-Mann-Whitney Test with Nondetects	129
9.2.2.2	Two-Sample Gehan Test for Data Sets with Nondetects	133
9.2.2.3	Two-Sample Quantile Test for Data Sets with Nondetects	135
Chapter 10 Background Statistics	139
10.1	Background Statistics for Full Data Sets without Nondetects	140
10.1.1	Normal or Lognormal Distribution	140
10.1.2	Gamma Distribution	144
10.1.3	Nonparametric Methods	146
10.1.4	All Statistics Option	148
10.2	Background Statistics with NDs	150
10.2.1	Normal or Lognormal Distribution	150
10.2.2	Gamma Distribution	155
10.2.3	Nonparametric Methods (with NDs)	157
10.2.4	All Statistics Option	159
Chapter 11 Computing Upper Confidence Limits (UCLs) of Mean	163
11.1	UCLs for Full Data Sets	164
11.1.1	Normal Distribution (Full Data Sets without NDs)	164
11.1.2	Gamma, Lognormal, Nonparametric, All Statistics Option
(Full Data without NDs)	167
11.2	UCL for Data Sets with NDs	172
Chapter 12 Windows	183
Chapter 13 Help	185
xv
-------
Chapter 14 Handling the Output Screens and Graphs	187
Copying Graphs	187
Printing Graphs	188
Printing Non-graphical Outputs	189
Saving Output Screens as Excel Files	190
Chapter 15 Recommendations to Compute a 95% UCL (Estimate of EPC Term)
of the Population Mean, /jt, Using Symmetric and Positively Skewed
Full Data Set without any Nondetects	193
15.1	Normally or Approximately Normally Distributed Data Sets	193
15.2	Gamma Distributed Skewed Data Sets	194
15.3	Lognormally Distributed Skewed Data Sets	195
15.4	Data Sets without a Discernable Skewed Distribution - Nonparametric Methods for
Skewed Data Sets	197
15.5	Should the Maximum Observed Concentration be Used as an Estimate of
the EPC Term?	200
Chapter 16	Recommendations to Compute a 95% UCL of the Population Mean,
Using Data Sets with Nondetects with Multiple Detection Limits	203
16.1	General Recommendations and Suggestions	203
16.2	Recommended UCL95 Methods for Normal (Approximate Normal) Distribution	205
16.3	Recommended UCL95 Methods for Gamma Distribution	206
16.4	Recommended UCL95 Methods for Lognormal Distribution	206
16.5	Recommended Nonparametric UCL Methods	207
Glossary	211
References	213
About the CD	217
xvi
-------
Introduction
The Need for ProLICL Software
Statistical inferences about the sampled populations and their parameters arc made based upon defensible
and representative data sets of appropriate sizes collected from the populations under investigation.
Statistical inference, including both estimation and hypotheses testing approaches, is routinely used to:
1.	Estimate environmental parameters of interest such as exposure point concentration
(EPC) terms, not-to-exceed values, and background level threshold values (BTVs) for
contaminants of potential concern (COPC),
2.	Identify areas of concern (AOC) at a contaminated site,
3.	Compare contaminant concentrations found at two or more AOCs of a contaminated site,
4.	Compare contaminant concentrations found at an AOC with background or reference
area contaminant concentrations,
5.	Compare site concentrations with a cleanup standard to verify the attainment of cleanup
standards.
Statistical inference about the sampled populations and their parameters are made based upon defensible
and representative data sets of appropriate sizes collected from the populations under investigation.
Environmental data sets originated from the Superfund and RCRA sites often consist of observations
below one or more detection limits (DLs). In order to address the statistical issues arising in: exposure and
risk assessment applications; background versus site comparison and evaluation studies; and various other
environmental applications, several graphical, parametric, and nonparametric statistical methods for data
sets with nondetects and without nondetects have been incorporated in ProUCL 4.0.
Exposure and risk management and cleanup decisions in support of United States Environmental
Protection Agency (EPA) projects are often made based upon the mean concentrations of the COPCs. A
95% upper confidence limit (UCL95) of the unknown population (e.g., an AOC) arithmetic mean (AM),
//i, can be used to:
•	Estimate the EPC term of the AOC under investigation,
•	Determine the attainment of cleanup standards,
•	Compare site mean concentrations with reference area mean concentrations, and
•	Estimate background level mean contaminant concentrations. The background mean contaminant
concentration level may be used to compare the mean of an AOC. It should be noted that it is not
appropriate to compare individual point-by-point site observations with the background mean
concentration level.
It is important to compute a reliable and stable UCL95 of the population mean using the available data.
The UCL95 should approximately provide the 95% coverage for the unknown population mean, pit. Based
upon the available background data, it is equally important to compute reliable and stable upper
percentiles, upper prediction limits (UPLs), or upper tolerance limits (UTLs). These upper limits based
upon background (or reference) data are used as estimates of BTVs, compliance limits (CL), or not-to-
exceed values. These upper limits are often used in site (point-by-point) versus background comparison
evaluations.
Environmental scientists often encounter trace level concentrations of COPCs when evaluating sample
1
-------
analytical results. Those low level analytical results cannot be measured accurately, and therefore are
typically reported as less than one or more detection limit (DL) values (also called nondetects). However,
practitioners often need to obtain reliable estimates of the population mean, jit, the population standard
deviation, ah and upper limits, including the upper confidence limit (UCL) of the population mass or
mean, the UPL, and the UTL based upon data sets with nondetect (ND) observations. Hypotheses testing
approaches are often used to verify the attainment of cleanup standards, and compare site and background
concentrations of COPCs.
Background evaluation studies, BTVs, and not-to-exceed values should be estimated based upon
defensible background data sets. The estimated BTVs or not-to-exceed values are then used to identify the
COPCs, to identify the site AOCs or hot spots, and to compare the contaminant concentrations at a site
with background concentrations. The use of appropriate statistical methods and limits for site versus
background comparisons is based upon the following factors:
1.	Objective of the study,
2.	Environmental medium (e.g., soil, groundwater, sediment, air) of concern,
3.	Quantity and quality of the available data,
4.	Estimation of a not-to-exceed value or of a mean contaminant concentration,
5.	Pre-established or unknown cleanup standards and BTVs, and
6.	Sampling distributions (parametric or nonparametric) of the concentration data sets collected
from the site and background areas under investigation.
In background versus site comparison evaluations, the environmental population parameters of interest
may include:
•	Preliminary remediation goals (PRGs),
•	Soil screening levels (SSLs),
•	Risk-based cleanup (RBC) standards,
•	BTVs, not-to-exceed values, and
•	Compliance limit, maximum concentration limit (MCL), or alternative concentration limit (ACL),
frequently used in groundwater applications.
When the environmental parameters listed above are not known or have not been pre-established,
appropriate upper statistical limits are used to estimate the parameters. The UPL, UTL, and upper
percentiles are used to estimate the BTVs and not-to-exceed values. Depending upon the site data
availability, point-by-point site observations are compared with the estimated (or pre-established) BTVs
and not-to-exceed values. If enough site and background data are available, two-sample hypotheses
testing approaches are used to compare site concentrations with background concentrations levels. These
statistical methods can also be used to compare contaminant concentrations of two site AOCs, surface and
subsurface contaminant concentrations, or upgradient versus monitoring well contaminant concentrations.
ProllCL 4.0 Capabilities
ProUCL Version 4.0 (ProUCL 4.0) is an upgrade of ProUCL Version 3.0 (EPA, 2004). ProUCL 4.0
contains statistical methods to address various environmental issues for both full data sets without
nondetects and for data sets with NDs (also known as left-censored data sets).
ProUCL 4.0 contains:
2
-------
1.	Rigorous parametric and nonparamctric (including bootstrap methods) statistical methods
(instead of simple ad hoc or substitution methods) that can be used on full data sets
without nondetects and on data sets with below detection limit (BDL) or nondetect (ND)
observations.
2.	State-of-the-art parametric and nonparamctric UCL, UPL, and UTL computation
methods. These methods can be used on full-uncensored data sets without nondetects and
also on data sets with BDL observations. Some of the methods (e.g., Kaplan-Meier
method, ROS methods) are applicable on left-censored data sets having multiple
detection limits. The UCL and other upper limit computation methods cover a wide range
of skewed data sets with and without the BDLs.
3.	Single sample (e.g., Student's t-test, sign test, proportion test, Wilcoxon Singed Rank
test) and two-sample (Student's t-test, Wilcoxon-Mann-Whitney test, Gehan test, quantile
test) parametric and nonparametric hypotheses testing approaches for data sets with and
without ND observations. These hypothesis testing approaches can be used to: verify the
attainment of cleanup standards, perform site versus background comparisons, and
compare two or more AOCs, monitoring wells (MWs).
4.	The single sample hypotheses testing approaches are used to compare site mean, site
median, site proportion, or a site percentile (e.g., 95th) to a compliance limit (action level,
regularity limit). The hypotheses testing approaches can handle both full-uncensored data
sets without nondetects, and lcft-ccnsored data sets with nondetects. Simple two-sample
hypotheses testing methods to compare two populations are available in ProUCL 4.0,
such as two-sample t-tests, Wilcoxon-Mann-Whitney (WMW) Rank Sum test, quantile
test, Gehan's test, and dispersion test. Variations of hypothesis testing methods (e.g.,
Levcne's method to compare dispersions, generalized WRS test) are easily available in
most commercial and freely available software packages (e.g., MINITAB, R).
5.	ProUCL 4.0 includes graphical methods (e.g., box plots, multiple Q-Q plots, histogram)
to compare two or more populations. Additionally, ProUCL 4.0 can also be used to
display a box plot of one population (e.g., site data) with compliance limits or upper
limits (e.g., UPL) of other population (background area) superimposed on the same
graph. This kind of graph provides a useful visual comparison of site data with a
compliance limit or BTVs. Graphical displays of a data set (e.g., Q-Q plot) should be
used to gain insight knowledge contained in a data set that may not otherwise be clear by
looking at simple test statistics such as t-test, Dixon test statistic, or Shapiro-Wilk (S-W)
test statistic.
6.	ProUCL 4.0,can process multiple contaminants (variables) simultaneously and has the
capability of processing data by groups. A valid group column should be included in the
data file. '
7.	ProUCL 4.0 provides a GOF test for data sets with nondetects. The user can create
additional columns to store extrapolated (estimated) values for nondetects based upon
normal ROS, gamma ROS, and lognormal ROS (robust ROS) methods.
3
-------
ProllCL Applications
The methods incorporated in ProllCL 4.0 can be used on data sets with and without BDL and ND
observations. Methods and recommendations as incorporated in ProUCL 4.0 are based upon the results
and findings of the extensive simulation studies as summarized in Singh and Singh (2003), and Singh,
Maichle, and Lee (EPA, 2006). It is anticipated that ProUCL 4.0 will serve as a companion software
package for the following EPA documents:
•	Calculating Upper Confidence Limits for Exposure Point Concentrations at Hazardous
Waste Sites (EPA, 2002a), and
•	The revised Guidance for Comparing Background and Chemical Concentrations in Soil
for CERCLA Sites (EPA, 2002b).
Methods included in ProUCL 4.0 can be used in various other environmental applications including the
verification of cleanup standards (EPA, 1989), and computation of upper limits needed in groundwater
monitoring applications (EPA, 1992 and EPA, 2004).
In 2002, EPA issued guidance for calculating the UCLs of the unknown population means for
contaminant concentrations at hazardous waste sites. The ProUCL 3.0 software package (EPA, 2004) has
served as a companion software package for the EPA (2002a) guidance document for calculating UCLs of
mean contaminant concentrations at hazardous waste sites. ProUCL 3.0 has several parametric and
nonparametric statistical methods that can be used to compute appropriate UCLs based upon full-
uncensored data sets without any ND observations. ProUCL 4.0 retains the capabilities of ProUCL 3.0,
including goodness-of-fit (GOF) and the UCL computation methods for data sets without any BDL
observations. However, ProUCL 4.0 has the additional capability to perform GOF tests and computing
UCLs and other upper limits based upon data sets with BDL observations.
ProUCL 4.0 defines log-transform (log) as the natural logarithm (In) to the base e. ProUCL 4.0 also
computes the maximum likelihood estimates (MLEs^ and the minimum variance unbiased estimates
(MVUEs) of unknown population parameters of normal, lognormal, and gamma distributions. This, of
course, depends upon the underlying data distribution. ProUCL 4.0 computes the (1 - a)100% UCLs of
the unknown population mean,//|, using 5 parametric and 10 nonparametric methods. It should be pointed
out that ProUCL 4.0 computes the simple summary statistics for detected raw and log-transformed data
for full data sets without NDs, as well as for data sets with BDL observations. It is noted that estimates of
mean and sd for data sets with NDs based upon rigorous statistical methods (e.g., MLE, ROS, K-M
methods) are note provided in the summary statistics. Those estimates and the associated upper limits for
data sets with NDs are provided under the menu options: Background and UCL.
It is emphasized that throughout this User Guide, and in the ProUCL 4.0 software, it is assumed that one
is dealing with a single population. If multiple populations (e.g., background and site data mixed together)
are present, it is recommended to first separate them out (e.g., using appropriate statistical population
partitioning techniques), and then compute appropriate respective 95% UCLs separately for each of the
identified populations. Outliers, if any, should be identified and thoroughly investigated. ProUCL 4.0
provides two commonly used simple classical outlier identification procedures: 1) the Dixon test and 2)
the Rosner test. Outliers distort most parametric statistics (e.g., mean, UCLs, upper prediction limits
(UPLs), test statistics) of interest. Moreover, it should be noted that even though outliers might have
minimal influence on hypotheses testing statistics based upon ranks (e.g., WMW test), outliers do distort
those nonparametric statistics (including bootstrap methods), which are based upon higher order statistics
such as UPLs and UTLs. Decisions about the disposition (exclusion or inclusion) of outliers in a data set
4
-------
used to estimate the EPC terms or BTVs should be made by all parties involved (e.g., project team, EPA,
local agency, potentially responsible party, etc.) in the decision making process.
The presence of outlying observations also distorts statistics based upon bootstrap re-samples. The use of
higher order values (quantiles) of the distorted statistics for the computation of the UCLs or UPLs based
upon bootstrap t and Hall's bootstrap methods may yield unstable and erratic UCL values. This is
especially true for the upper limits providing higher confidence coefficients such as 95%, 97.5%, or 99%.
Similar behavior of the bootstrap t UCL is observed for data sets having BDL observations. Therefore, the
bootstrap t and Hall's bootstrap methods should be used with caution. It is suggested that the user should
examine various other UCL results and determine if the UCLs based upon the bootstrap t and Hall's
bootstrap methods represent reasonable and reliable UCL values of practical merit. If the results based
upon these two bootstrap methods are much higher than the rest of methods, then this could be an
indication of erratic behavior of those bootstrap UCL values, perhaps distorted by outlying observations.
In case these two bootstrap methods yield erratic and inflated UCLs, the UCL of the mean should be
computed using the adjusted or the approximate gamma UCL computation method for highly skewed
gamma distributed data sets of small sizes. Alternatively, one may use a 97.5% or 99% Chebyshcv UCL
to estimate the mean of a highly skewed population. It should be noted that typically, a Chebyshev UCL
may yield conservative and higher values of the UCLs than other methods available in ProUCL 4.0 This
is especially true when data are moderately skewed and sample size is large. In such cases, when the
sample size is large, one may want to use a 95% Chebyshev UCL or a Chebyshev UCL with lower
confidence coefficient such as 92.5% or 90% as estimate of the population mean.
ProUCL Methods
ProUCL 4.0 provides 15 UCL computation methods for full data sets without any BDL observations; 5
are parametric and 10 are nonparametric methods. The nonparametric methods do not depend upon any
assumptions about the data distributions. The five parametric UCL computation methods are:
1.	Student's t-UCL,
2.	Approximate gamma UCL using chi-square approximation,
3.	Adjusted gamma UCL (adjusted for level significance),
4.	Land's H-UCL, and
5.	Chebyshev inequality-based UCL (using MVUEs of parameters of a lognormal
distribution).
The 10 nonparametric methods are:
1.	The central limit theorem (CLT)-based UCL,
2.	Modified-t statistic (adjusted for skewness)-based UCL,
3.	Adjusted-CLT (adjusted for skewness)-based UCL,
4.	Chebyshev inequality-based UCL (using sample mean and sample standard deviation),
5.	Jackknife method-based UCL,
6.	UCL based upon standard bootstrap,
7.	UCL based upon percentile bootstrap,
8.	UCL based upon bias-corrected accelerated (BCA) bootstrap,
9.	UCL based upon bootstrap t, and
10.	UCL based upon Hall's bootstrap.
5
-------
Environmental scientists often encounter trace level concentrations of COPCs when evaluating sample
analytical results. Those low level analytical results cannot be measured accurately, and therefore are
typically reported as less than one or more DL values. However, the practitioners need to obtain reliable
estimates of the population mean, ///, and the population standard deviation, a/, and upper limits including
the UCL of the population mass (measure of central tendency) or mean, UPL, and UTL. Several methods
are available and cited in the environmental literature (Helsel (2005), Singh and Nocerino (2002), Millard
and Ncerchal (2001)) that can be used to estimate the population mean and variance. However, till to date,
no specific recommendations are available for the use of appropriate methods that can be used to compute
upper limits (e.g., UCLs, UPLs) based upon data sets with BDL observations. Singh, Maichle, and Lee
(EPA, 2006) extensively studied the performance of several parametric and nonparametric UCL
computation methods for data sets with BDL observations. Based upon their results and findings, several
methods to compute upper limits (UCLs, UPLs, and UTLs) needed to estimate the EPC terms and BTVs
have been incorporated in ProUCL 4.0.
In 2002, EPA issued another Guidance for Comparing Background and Chemical Concentrations in Soil
for CERCLA Sites (EPA, 2002b). This EPA (2002b) background guidance document is currently being
revised to include statistical methods that can be used to estimate the BTVs and not-to-exceed values
based upon data sets with and without the BDL observations. In background evaluation studies, BTVs,
compliance limits, or not-to-exceed values often need to be estimated based upon defensible background
data sets. The estimated BTVs or not-to-exceed values are then used for screening the COPCs, to identify
the site AOCs or hot spots, and also to determine if the site concentrations (perhaps after a remediation
activity) are comparable to background concentrations, or are approaching the background level
concentrations. Individual point-by-point site observations (composite samples preferred) are sometimes
compared with those not-to-cxceed values or BTVs. It should be pointed out that in practice, it is
preferred to use hypotheses testing approaches to compare site versus background concentrations
provided enough (e.g., at least 8-10 detected observations from each of the two populations) site and
background data are available. Chapter 1 provides practical guidance on the minimum sample size
requirements to estimate and use the BTVs, single and two-sample hypotheses testing approaches to
perform background evaluations and background versus site comparisons. Chapter 1 also briefly
discusses the differences in the definitions and uses of the various upper limits as incorporated in ProUCL
4.0. Detailed discussion of the various methods to estimate the BTVs and other not-to-exceed values for
full-uncensored data sets (Chapter 5) without any nondetect values and for left-censored data sets
(Chapter 6) with nondetect values are given in the revised background guidance document.
ProUCL 4.0 includes statistical methods to compute UCLs of the mean, upper limits to estimate the
BTVs, other not-to-exceed values, and compliance limits based upon data sets with one or more detection
limits. The use of appropriate statistical methods and limits for exposure and risk assessment, and site
versus background comparisons, is based upon several factors:
1.	Objective of the study;
2.	Environmental medium (e.g., soil, groundwater, sediment, air) of concern;
3.	Quantity and quality of the available data;
4.	Estimation of a not-to-exceed value or of a mean contaminant concentration;
5.	Pre-established or unknown cleanup standards and BTVs; and
6.	Sampling distributions (parametric or nonparametric) of the concentration data sets
collected from the site and background areas under investigation.
In background versus site comparison studies, the population parameters of interest are typically
represented by upper threshold limits (e.g., upper percentiles, upper confidence limits of an upper
6
-------
percentile, upper prediction limit) of the background data distribution. It should be noted that the upper
threshold values arc estimated and represented by upper percentiles and other values from the upper tail
of the background data distribution. These background upper threshold values do not represent measures
of central tendency such as the mean, the median, or their upper confidence limits. These environmental
parameters may include:
•	Preliminary remediation goals (PRGs), Compliance Limits,
•	Soil screening levels (SSLs),
•	Risk-based cleanup (RJBC) standards,
•	BTVs, compliance limits, or not-to-exceed values, and
•	Maximum concentration limit (MCL) or alternative concentration limit (ACL) used in
Groundwater applications.
When the environmental parameters listed above are not known or pre-established, appropriate upper
statistical limits are used to estimate those parameters. The UPL, UTL, and upper percentiles are typically
used to estimate the BTVs,^not-to-exceed values, and other parameters listed above. Depending upon the
availability of site data, point-by-point site observations are compared with the estimated (or pre-
established) BTVs and not-to-exceed values. If enough site and background data arc available, two-
sample hypotheses testing approaches (preferred method to compare two populations) are used to
compare site concentrations with background concentrations levels. The hypotheses testing methods can
also be used to compare contaminant concentrations of two site AOCs, surface and subsurface
contaminant concentrations, or upgradient versus monitoring well contaminant concentrations.
Background versus Site Comparison Evaluations
The following statistical limits have been incorporated in ProUCL 4.0 to assist in background versus site
comparison evaluations:
Parametric Limits for Full-Uncensored Data Sets without Nondetect Observations
•	UPL for a single observation (Normal, Lognormal) not belonging to the original data set
•	UPL for next k (k is user specified) or k future observations (Normal, Lognormal)
•	UTL, an upper confidence limit of a percentile (Normal, Lognormal)
•	Upper percentiles (Normal, Lognormal, and Gamma)
Nonparametric Limits for Full-Uncensored Data Sets without Nondetect Observations
Nonparametric limits are typically based upon order statistics of a data set such as a background or a
reference data set. Depending upon the size of the data set, higher order statistics (maximum, second
largest, third largest, and so on) are used as these upper limits (e.g., UPLs, UTLs). The details of these
methods with sample size requirements can be found in Chapter 5 of the revised Guidance for Comparing
Background and Chemical Concentrations in Soil for CERCLA Sites (EPA, 2002b). It should be, noted
that the following statistics might get distorted by the presence of outliers (if any) in the data set under
study.
•	UPL for a single observation not belonging to the original data set
•	UTL, an upper confidence limit of a percentile
•	Upper percentiles
7
-------
•	Upper limit based upon interquartile range (IQR)
•	Upper limits based upon bootstrap methods
For data sets with BDL observations, the following parametric and nonparametric methods to compute
the upper limits were studied and evaluated by Singh, Maichle, and Lee (EPA, 2006) via Monte Carlo
Simulation Experiments. Depending upon the performances of those methods, only some of the methods
have been incorporated in ProUCL 4.0. Methods (e.g., Delta method, DL method, uniform (0, DL)
generation method) not included in ProUCL 4.0 do not perform well in comparison with other methods.
Note: When the percentage ofNDs in a data set is high (e.g., > 40%-50%>), especially when multiple
detection limits might be present, it is hard to reliably perform GOF tests (to determine data distribution)
on those data sets with many NDs. The uncertainty associated with those GOF tests will be high,
especially when the data sets are of small sizes (< 10-20). It should also be noted that the parametric
MLE methods (e.g., for normal and lognormal distributions) often yield unstable estimates of mean and
sd. This is especially true when the number of nondetects exceeds 40%-50%. In such situations, it is
preferable to use nonparametric (e.g., KM method) methods to compute statistics of interest such as
UCLs, UPLs, and UTLs. Nonparametric methods do not require any distributional assumptions about the
data sets under investigation. Singh, Maichle, and Lee (EPA, 2006) also concluded that the performance
of the KM estimation method is better (in terms of coverage probabilities) than various other parametric
estimation (e.g., MLE, EM, ROS) methods.
Parametric Methods to Compute Upper Limits for Data Sets with Nondetect Observations
•	Simple substitution (proxy) methods (0, DL/2, DL)
•	MLE method, often known as Cohen's MLE method - single detection limit
•	Restricted MLE method - single detection limit - not in ProUCL 4.0
•	Expectation maximization (EM) method - single detection limit - not in ProUCL 4.0
•	EPA Delta log method - single detection limit - not in ProUCL 4.0
•	Regression method on detected data and using slope and intercept of the OLS regression
line as estimates of standard deviation, sd, and mean (not a recommended method)
•	Robust ROS (regression on order statistics) on log-transformed data - nondetects
extrapolated (estimated) using robust ROS; mean, sd, UCLs, and other statistics
computed using the detected and extrapolated data in original scale - multiple detection
limits
•	Normal ROS - nondetects extrapolated (estimated) using normal distribution, mean, sd,
UCLs, and other statistics computed using the detected and extrapolated data - multiple
detection limits.
•	It is noted that the estimated NDs often become negative and even larger than the
detection limits (not a recommended method)
•	Gamma ROS - nondetects extrapolated (estimated) using gamma distribution, mean, sd,
UCLs, and other statistics computed using the detected and extrapolated data - multiple
detection limits
Nonparametric Methods to Compute Upper Limits for Data Sets with Nondetect Observations
•	Bootstrap Methods
o Percentile Bootstrap on robust ROS
o Percentile Bootstrap
8
-------
o BCA Bootstrap
o Bootstrap t
•	Jackknife Method
o Jackknife on robust ROS
•	Kaplan-Meier (KM) Method
o Bootstrap (percentile, BCA) using KM estimates
o Jackknife using KM estimates
o Chebyshev Method using KM estimates
•	Winsorization Method
For uncensored full data sets without any NDs, the performance (in terms of coverage for the mean) of
the various UCL computation methods was evaluated by Singh and Singh (2003). The performance of the
parametric and nonparametric UCL methods based upon data sets with nondetect observations was
studied by Singh, Maichle, and Lee (EPA, 2006). Several of the methods listed above have been
incorporated in ProUCL 4.0 to compute the estimates of EPC terms (95% UCL), and of BTVs (UPLs,
UTLs, upper percentiles). Methods that did not perform well (e.g., poor coverage or unrealistically large
values, infeasible and biased estimates) are not included in ProUCL 4.0. Methods not incorporated in
ProUCL 4.0 arc: EPA Delta Log method, Restricted MLE method, and EM method, substitution method
(0, and DL), and Regression method.
Note: It should be noted that for data sets with NDs, the DL/2 substitution method has been incorporated
in ProUCL 4.0 only for historical reasons and also for its current default use. It is well known that the
DL/2 method (with NDs replaced by DL/2) does not perform well (e.g., Singh, Maichle, and Lee (EPA,
2006)) even when the percentage of NDs is only 5%-10%. It is strongly suggested to avoid the use of
DL/2 methodfor estimation and hypothesis testing approaches used in various environmental
applications. Also, when the % of NDs becomes high (e.g., > 40%-50%), it is suggested to avoid the use
ofparametric MLE methods. For data sets with high percentage of NDs (e.g., > 40%), the distributional
assumptions needed to use parametric methods are hard to verify; and those parametric MLE methods
may yield unstable results.
It should also be noted that even though the lognormal distribution and some statistics based upon
lognormal assumption (e.g., Robust ROS, DL/2 method) are available in ProUCL 4.0, ProUCL 4.0 does
not compute MLEs of mean and sd based upon a lognormal distribution. The main reason is that the
estimates need to be computed in the original scale via back-transformation (Shaarawi, 1989, and Singh,
Maichle, and Lee (EPA, 2006)). Those back-transformed estimates often suffer from an unknown amount
of significant bias. Hence, it is also suggested to avoid the use of a lognormal distribution to compute
MLEs of mean and sd, and associated upper limits, especially UCLs based upon those MLEs obtained
using a lognormal distribution.
ProUCL 4.0 recommends the use of an appropriate UCL to estimate the EPC terms. It is desirable that the
user consults with the project team and experts familiar with the site before using those recommendations.
Furthermore, there does not seem to be a general agreement about the use of an upper limit (e.g., UPL,
percentile, or UTL) to estimate not-to-exceed values or BTVs to be used for screening of the COPCs and
in site versus background comparison studies. ProUCL 4.0 can compute both parametric and
nonparametric upper percentiles, UPLs, and UTLs for uncensored and censored data sets. However, no
specific recommendations have been made regarding the use of UPLs, UTLs, or upper percentiles to
9
-------
estimate the BTVs, compliance limits, and other related background or reference parameters. However,
the developers of ProUCL 4.0 prefer the use of UPLs or upper percentiles to estimate the background
population parameters (e.g., BTVs, not-to-exceed values) that may be needed to perform point-by-point
site versus background comparisons.
The standard bootstrap and the percentile bootstrap UCL computation methods do not perform well (do
not provide adequate coverage to population mean) for skewed data sets. For skewed distributions, the
bootstrap t and Hall's bootstrap (meant to adjust for skewness) methods do perform better (in terms of
coverage for the population mean) than the other bootstrap methods. However, it has been noted (e.g.,
Efron and Tibshirani (1993), and Singh, Singh, and Iaci (2002b)) that these two bootstrap methods
sometimes yield erratic and inflated UCL values (orders of magnitude higher than the other UCLs). This
may occur when outliers are present in a data set. Similar behavior of the bootstrap t UCL is observed
based upon data sets with NDs. Therefore, whenever applicable, ProUCL 4.0 provides cautionary
statements regarding the use of bootstrap methods.
ProUCL 4.0 provides several state-of-the-art parametric and nonparametric UCL, UPL, and UTL
computation methods that can be used on uncensored data sets (full data sets) and on data sets with BDL
observations. Some of the methods (e.g., Kaplan-Meier method, ROS methods) incorporated in ProUCL
4.0 are applicable on left-censored data sets having multiple detection limits. The UCLs and other upper
limits computation methods in ProUCL 4.0 cover a wide range of skewed data distributions with and
without the BDLs arising from the environmental applications.
ProUCL 4.0 also has parametric and nonparametric single and two-sample hypotheses testing approaches
required to: compare site location (e.g., mean, median) to a specified cleanup standard; perform site
versus background comparisons; or compare of two or more AOCs. These hypotheses testing methods
can handle both full (uncensored data sets without NDs) and left-censored (with nondetects) data sets.
Specifically, two-sample tests such as trtest, Wilcoxon Mann-Whitney (WMW) Rank Sum test, quantile
test, and Gehan's test are available in ProUCL 4.0 to compare concentrations of two populations.
Single sample parametric (Student's t-test) and nonparametric (sign test, Wilcoxon Signed Rank (WSR)
test, tests for proportions and percentiles) hypotheses testing approaches are also available in ProUCL 4.0.
The single sample hypotheses tests are useful when the environmental parameters such as the clean
standard, action level, or compliance limits are known, and the objective is to compare site concentrations
with those known threshold values. Specifically, a t-test (or a sign test) may be used to verify the
attainment of cleanup levels at an AOC after a remediation activity; and a test for proportion may be used
to verify if the proportion of exceedances of an action level (or a compliance limit) by sample
concentrations collected from the AOC (or a MW) exceeds a certain specified proportion (e.g., 1%, 5%,
10%). As mentioned before, ProUCL 4.0 can perform these hypotheses on data sets with and without
nondetect observations.
Note: It should be noted that as cited in the literature, some of the hypotheses testing approaches (e.g.,
nonparametric two-sample WMW) deal with the single detection limit scenario. If multiple detection
limits are present, all NDs below the largest detection limit need to be considered as NDs (Gilbert, 1987,
and Helsel, 2005). This in turn may reduce the power and increase uncertainty associated with test. As
mentioned before, it is always desirable to supplement the test statistics and test conclusions with
graphical displays such as the multiple Q-Q plots and side-by-side box plots. ProUCL 4.0 can graph box
plots and Q-Q plots for data sets with nondetect observations. Gehan test as available in ProUCL 4.0
should be used in case multiple detection limits are present. ProUCL 4.0 can draw Q-Q plots and box ¦
plots for data sets with and without nondetect observations.
10
-------
It should be pointed out that when using two-sample hypotheses approaches (WMW test, Gehan test, and
quantile test) on data sets with NDs, both samples and variables (e.g., site-As, Back-As) should be
specified as having nondetects. This means, a ND column (0 = ND, and I = detect) should be provided
for each variable (here D_site-As, and D_Back-As) to be used in this comparison. If a variable (e.g., site-
As) does not have any nondetects, still a column with label D_site-As should be included in the data set
with all entries = 1 (detected values).
Moreover, in single sample hypotheses tests (e.g., sign test, proportion test) used to compare site
mean/median concentration level with a cleanup standard, Cs or compliance limit (e.g., proportion test),
all NDs (if any) should lie below the cleanup standard, Cs
The differences between these tests should be noted and understood. Specifically, a t-test or a Wilcoxon
Signed Rank (WSR) test are used to compare the measures of location and central tendencies (e.g., mean,
median) of a site area (e.g., AOC) to a cleanup standard, Cs, or action level also representing a measure of
central tendency (e.g., mean, median); whereas, a proportion test compares if the proportion of site
observations from an AOC exceeding a compliance limit (CL) exceeds a specified proportion, P0 (e.g.,
5%, 10%). The percentile test compares a specified percentile (e.g., 95th) of the site data to a pre-
specified upper threshold (e.g., reporting limit, action level). All of these tests have been incorporated in
ProUCL 4.0. Most of the single sample and two-sample hypotheses tests also report associated p-values.
For some of the hypotheses tests (e.g., WMW test, WSR test, proportion test), large sample approximate
p-values are computed using continuity correction factors.
Graphical Capabilities
ProUCL 4.0 has useful exploratory graphical methods that may be used to visually compare the
concentrations of:
1.	A site area of concern (AOC) with an action level. This can be done using a box plot of
site data with action level superimposed on that graph,
2.	Two or more populations, including site versus background populations, surface versus
subsurface concentrations, and
3.	Two or more AOCs.
The graphical methods include double and multiple quantile-quantile (Q-Q) plots, side-by-side box plots,
and histograms. Whenever possible, it is desirable to supplement statistical test results and statistics with
visual graphical displays of data sets. There is no substitute for graphical displays of a data set as the
visual displays often provide useful information about a data set, which cannot be revealed by simple test
statistics such as t-test, SW test, Rosner test, WMW test. For example, in addition to providing
information about the data distribution, a normal Q-Q plot can also help identify outliers and multiple
populations that might be present in a data set. This kind of information cannot be revealed by simple test
statistics such as a Shapiro-Wilk (SW) test or Rosner's outlier test statistic. Specifically, the SW test may
lead to the conclusion that a mixture data set (representing two or more populations) can be modeled by a
normal (or lognormal) distribution, whereas the occurrence of obvious breaks and jumps in the associated
Q-Q plot may suggest the presence of multiple populations in the mixture data set. It is suggested that the
user should use exploratory tools to gain necessary insight into a data set and the underlying assumptions
(e.g., distributional, single population) that may not be revealed by simple test statistics.
11
-------
Note: On a Q-Q plot, observations well separatedfrom the majority of the data may represent potential
outliers, and obvious jumps and breaks of significant magnitude may suggest the presence of observations
from multiple populations in the data set.
The analyses of data categorized by a group ID variable such as: 1) Surface vs. Subsurface;
2) A0C1 vs. AOC2; 3) Site vs. Background; and 4) Upgradicnt vs. Downgradient monitoring wells are
quite common in many environmental applications. ProUCL 4.0 offers this option for data sets with and
without nondetects. The Group Option provides a powerful tool to perform various statistical tests and
methods (including graphical displays) separately for each of the group (samples from different
populations) that may be present in a data set. For an example, the same data set may consist of samples
from the various groups or populations representing site, background, two or more AOCs, surface,
subsurface, monitoring wells. The graphical displays (e.g., box plots, Q-Q plots) and statistics
(computations of background statistics, UCLs, hypotheses testing approaches) of interest can be
computed separately for each group by using this option.
Technical Guide
In addition to this User Guide, a Technical document also accompanies ProUCL 4.0, providing useful
technical details of the graphical and statistical methods as incorporated in ProUCL 4.0. Most of the
mathematical algorithms and formulas (with references) used in the development of ProUCL 4.0 arc
summarized in the Technical Guide.
Minimum Hardware Requirements
•	Intel Pentium 1.0 GHz
•	57 MB of hard drive space
•	512 MB of memory (RAM)
•	CD-ROM drive
•	Windows XP operating system
•	Minimum graphics display of 800 by 600 pixels
Software Requirements
ProUCL 4.0 has been developed in the Microsoft .NET Framework using the C# programming language.
As such, to properly run ProUCL 4.0, the computer using the program must have the .NET Framework
pre-installed. The downloadable .NET files can be found at one of the following two Web sites:
•	http://msdn2.microsoft.com/en-us/netframework/aa731542.aspx
Note: Download .Net version 1.1
•	http://www.microsoft.com/downloads/detaiIs.aspx?FamilvId=:262D25E3-F589-4842-
8157-034D1E7CF3 A3&displavlang=en
The first Web site lists all of the downloadable .NET Framework files, while the second Web site
provides information about the specific file (s) needed to run ProUCL 4.0. Download times are estimated
at 57 minutes for a dialup connection (56K), and 13 minutes on a DSL/Cable connection (256K).
12
-------
Installation Instructions
•	Open Windows Explorer and create a new directory called ProUCL 4.0.
•	Download (save) the ProUCL 4.0 zipped files (setup.zip) to the ProUCL 4.0 directory.
•	Use Winzip (or equivalent, e.g., EasyZip) to extract all of the files in the ProUCL
directory.
•	Using Windows Explorer, right-click on the ProUCL 4.0 main directory and make sure
that the read-only attribute is off.
•	Using Windows Explorer, create a shortcut (optional) by right-clicking on the unzipped
file, ProUCL.exe (application), in the ProUCL directory and drag the shortcut to the
desktop (optional: rename to ProUCL 4.0).
•	Using Windows Explorer, start ProUCL 4.0 by left double-clicking on the unzipped file,
ProUCL 4.0.exe (application), in the ProUCL 4.0 directory, or by left double-clicking on
the ProUCL 4.0 shortcut, or use the RUN command from the Start Menu to locate and
run ProUCL.exe.
•	Try to open an example file in the ProUCL 4.0 sub-directory, Data. If the file does not
open, be sure that the read-only attribute is off (right-click on the Data sub-directory).
•	If the computer does not have .NET Framework 1.1 installed (cither a pre-2002 Windows
operating system or a late version of Windows XP), then it will be necessary for the end
user to download it from Microsoft. A Google search for "NET Framework 1.1" will
yield several download locations. (Also available upon download of ProUCL 4.0 for your
convenience.)
•	To uninstall the program, use Windows Explorer to locate and delete the ProUCL folder.
Caution: If you have previous versions of the ProUCL, which were installed on your computer, you
should remove or rename the directory in which earlier ProUCL versions are currently located.
Getting Started
The functionality and the use of the methods and options available in ProUCL 4.0 have been illustrated
using Screen Shots of output screen generated by ProUCL 4.0. ProUCL 4.0 uses a pull-down menu
structure, similar to a typical Windows program.
The screen below appears when the program is executed.
13
-------

0 IM Cblfel

The screen consists of three main window panels:
•	The MAIN WINDOW displays data sheets and outputs from the procedure used.
•	The NAVIGATION PANEL displays the name of data sets and all generated outputs,
o 1 At present, the navigation panel can hold at most 20 outputs. In order to see more
files (data files or generated output files), one can click on Widow Option.
• The LOG PANEL displays transactions in green, warnings in orange, and errors in red.
For an example, when one attempts to run a procedure meant for censored data sets on a
full-uncensorcd data set, ProUCL 4.0 will print out a warning message in orange in this
panel.
o Should both panels be unnecessary, you can click Jill or choose Configure ~
Panel ON/OFF.
The use of this option will give extra space to see and print out the statistics of interest. For an example,
one may want to turn off these panels when multiple variables (e.g., multiple Q-Q plots) are analyzed and
GOF statistics and other statistics may need to be captured for all of the variables.
14
-------
Chapter 1
Guidance on the Use of Statistical Methods and Associated
Minimum Sample Size Requirements
This chapter briefly describes the differences between the various statistical limits (e.g., UCLs, UPLs,
UTLs) often used to estimate the environmental parameters of interest including exposure point
concentration (EPC) terms and background threshold values (BTVs). Suggestions are provided about the
minimum sample size requirements needed to use statistical inferential methods to estimate the
environmental parameters: EPC terms, BTVs and not-to-exceed values, and to compare site data with
background data or with some pre-established reference limits (e.g., preliminary remediation goals
(PRGs), action levels, compliance limits). It is noted that several EPA guidance documents (e.g., EPA
1997, 2002a, 2006) discuss in details about data quality objectives (DQOs) and sample size
determinations based upon those DQOs needed for the various statistical methods used in environmental
applications.
Also, appropriate sample collection methods (e.g., instruments, sample weights, discrete or composite,
analytical methods) depend upon the medium (e.g., soil, sediment, water) under consideration. For an
example, Gerlach and Nocerino (EPA, 2003) describe optimal soil sample (based upon Gy theory)
collection methods. Therefore, the topics of sample size determination based upon DQOs, data validation,
and appropriate sample collection methods for the various environmental media are not considered in
ProUCL 4.0 and its associated Technical Guide. It is assumed that data sets to be used in ProUCL are of
good quality, and whenever possible have been obtained using the guidance provided in various EPA
(2003, 2006) documents. It is the users' responsibility to assure that adequate amount of data have been
collected, and the collected data are of good quality.
Note: In ProUCL 4.0 and its associated guidance documents, emphasis is given on the practical
applicability and appropriate use of statistical methods needed to address statistical issues arising in risk
management, background versus site evaluation studies, and various other environmental applications.
Specifically, guidance on minimum sample size requirements as provided in this chapter is useful when
data have already been collected, or it is not possible (e.g., due to resource limitations) to collect the
number of samples obtained using DQO processes as described in EPA (2006).
Decisions based upon statistics obtained using data sets of small sizes (e.g., 4 to 6 detected observations)
cannot be considered reliable enough to make a remediation decision that affects human health and the
environment. For an example, a background data set of size 4 to 6 is not large enough to characterize
background population, to compute BTV values, or to perform background versus site comparisons. In
order to perform reliable and meaningful statistical inference (estimation and hypothesis testing), one
should determine the sample sizes that need to be collected from the populations under investigation
using appropriate DQO processes and decision error rates (EPA, 2006). However, in some cases, it may
not be possible (e.g., resource constraints) to collect the same number of samples recommended by the
DQO process. In order to address such cases, minimum sample size requirements for background and site
data sets are described.
The use of an appropriate statistical method depends upon the environmental parameter(s) being
estimated or compared. The measures of central tendency (e.g., means, medians, or their upper confidence
limits (UCLs)) are often used to compare site mean concentrations (e.g., after remediation activity) with a
15
-------
cleanup standard, Cs, representing some central tendency measure of a reference area or other known
threshold representing a measure of central tendency. The upper threshold values, such as the compliance
limits (e.g., alternative concentration limit (ACL), maximum concentration limit (MCL)), or not-to-
excccd values, are used when individual point-by-point observations are compared with those not-to-
excccd values or other compliance limit. It should be noted that depending upon whether the
environmental parameters (e.g., BTVs, not-to-exceed value, EPC term, cleanup standards) arc known or
unknown, different statistical methods with different data requirements are needed to compare site
concentrations with pre-established (known) or estimated (unknown) cleanup standards and BTVs.
ProUCL 4.0 has been developed to address issues arising in exposure assessment, risk assessment, and
background versus site comparison applications. Several upper limits, and single- and two-sample
hypotheses testing approaches, for both full uncensored and left-censored data sets, are available in
ProUCL 4.0. The details of the statistical and graphical methods included in ProUCL 4.0 can be found in
the ProUCL Technical Guidance. In order to make sure that the methods in ProUCL 4.0 are properly
used, this chapter provides guidance on:
1.	analysis of site and background areas and data sets,
2.	collection of discrete or composite samples,
3.	appropriate use of the various upper limits,
4.	guidance regarding minimum sample sizes,
5.	point-by-point comparison of site observations with BTVs,
6.	use of hypotheses testing approaches,
7.	using small data sample sets,
8.	use of maximum detected value, and
9.	discussion of ProUCL usage for special cases.
1.1 Background Data Sets
The project team familiar with the site should identify and chose a background area. Depending upon the
site activities and the pollutants, the background area can be site-specific or a general reference area. An
appropriate random sample of independent observations should be collected from the background area. A
defensible background data set should represent a "single" background population (e.g., representing
pristine site conditions before any of the industrial site activities) free of contaminating observations such
as outliers. In a background data set, outliers may represent potentially contaminated observations from
impacted site areas under study or possibly from other polluted sitc(s). This scenario is common when
background samples are obtained from the various onsite areas (e.g., large federal facilities). Outlying
observations should not be included in the estimation (or hypotheses testing procedures) of the BTVs.
The presence of outliers in the background data set will yield distorted estimates of the BTVs and
hypothesis testing statistics. The proper disposition of outliers to include or not include them in the data
set should be decided by the project team.
Decisions based upon distorted statistics can be incorrect, misleading, and expensive. It should be noted
that the objective is to compute background statistics based upon the majority of the data set representing
the dominant background population, and not to accommodate a few low probability outliers that may
also be present in the background data set. A couple of simple classical outlier tests (Dixon and Rosncr
tests) are available in ProUCL 4.0. Since these classical tests suffer from masking effects (e.g., extreme
outliers may mask the occurrence of other intermediate outliers), it is suggested that these classical outlier
tests should always be supplemented with graphical displays such as a box plot or a Q-Q plot. The use of
16
-------
robust and resistant outlier identification procedures (Singh and Nocerino, 1995, Rousseeuw and Leroy,
1987) is recommended when multiple outliers may be present in a data set. Those methods are beyond the
scope of ProUCL 4.0.
An appropriate background data set of a reasonable size (preferably computed using DQO processes) is
needed to characterize a background area including computation of upper limits (e.g., estimates of BTVs,
not-to-exceed values) based upon background data sets and also to compare site and background data sets
using hypotheses testing approaches. As mentioned before, a small background data set of size 4 to 6 is
not large enough to compute BTVs or to perform background versus site comparisons. At the minimum, a
background sample should have at least 8 to 10 (more observations are preferable) detected observations
to estimate BTVs or to use hypotheses testing approaches.
1.2 Site Data Sets
A defensible data set from a site population (e.g., AOC, EA, RU, group of monitoring wells) should be
representative of the site area under investigation. Depending upon the site areas under investigation,
different soil depths and soil types may be considered as representing different statistical populations. In
such cases, background-vcrsus-site comparisons may have to be conducted separately for each of those
site sub-populations (e.g., surface and sub-surface layers of an AOC, clay and sandy site areas). These
issues, such as comparing depths and soil types, should also be considered in a planning and sampling
design before starting to collect samples from the various site areas under investigation. Specifically, the
availability of an adequate amount of representative site data is required from each of those site sub-
populations defined by sample depths, soil types, and the various other characteristics. For detailed
guidance on soil sample collections, the reader is referred to Gerlach and Nocerino (EPA (2003)).
The site data collection requirements depend upon the objective of the study. Specifically, in background-
versus-sitc comparisons, site data are needed to perform:
•	Individual point-by-point site observation comparisons with pre-established or estimated
BTVs, PRGs, cleanup standards, and not-to-exceed-values. Typically, this approach is
used when only a small number (e.g., < 4 to 6) of detected site observations (preferably
based upon composite samples) are available which need to be compared with BTVs and
not-to-exceed values.
•	Single sample hypotheses tests to compare site data with pre-established cleanup
standards, Cs (e.g., representing a measure of central tendency); or with BTVs and not-to-
exceed values (used for tests for proportions and percentiles). The hypotheses testing
approaches are used when enough site data are available. Specifically, when at least 8 to
10 detected (more are desirable) site observations are available, it is preferable to use
hypotheses testing approaches to compare site observations with specified threshold
values. The use of hypotheses testing approaches can control the two types (Type 1 and
Type 2) of error rates more efficiently than the point-by-point individual observation
comparisons. This is especially true as the number of point-by-point comparisons
increases. This issue is illustrated by the following table summarizing the probabilities of
exceedances (false positive error rate) of the background threshold value (e.g., 95th
percentile) by site observations, even when the site and background populations have
comparable distributions. The probabilities of these chance exceedances increase as the
sample size increases.
17
-------
Sample Size
Probability of Exceedance
2
5
10
12
64
8
0.05
0.10
0.23
0.34
0.40
0.46
0.96
• Two-sample hypotheses testing to compare site data distribution with background data
distribution to determine if the site concentrations arc comparable to background
concentrations. Adequate amount of data need to be made available from the site as well
as the background populations. It is preferable to collect at least 8 to 10 detected
observations from each of the population under comparison.
1.3 Discrete Samples or Composite Samples?
In a data set (background or site), collected samples should be either all discrete or all composite. In
general, both discrete and composite site samples may be used for individual point-by-point site
comparisons with a threshold value, and for single and two-sample hypotheses testing applications.
•	If possible, the use of composite site samples is preferred when comparing individual
point-by-point site observations from an area (e.g., area of concern (AOC), remediation
unit (RU), exposure area (EA)) with pre-established or estimated BTV, compliance limit
(CL), or other not-to-exceed value. This comparison approach is useful when few (< 4 to
6) detected site observations arc compared with a pre-established or estimated BTV or
other not-to-exceed threshold.
•	When using a single sample hypothesis testing approach, site data can be obtained by
collecting all discrete or all composite samples. The hypothesis testing approach is used
when many (e.g., exceeding 8 to 10) detected site observations are available. Details of
the single sample hypothesis approaches are widely available in EPA documents (1989,
1997, and 2006). Selected single sample hypotheses testing procedures are available in
ProUCL 4.0.
•	If a two-sample hypotheses testing approach is used to perform site versus background
comparisons, then samples from both of the populations should be either all discrete or
all composite samples. The two-sample hypothesis testing approach is used when many
(e.g., exceeding 8 to 10) site, as well as background, observations are available. For better
and more accurate results with higher statistical power, the availability of more
observations (e.g., exceeding 10-15) from each of the two populations is desirable,
perhaps based upon an appropriate DQO process, as described in an EPA guidance
document (2006).
1.4 Upper Limits and Their Use
The appropriate computation and use of statistical limits depend upon their applications and the
parameters (e.g., EPC term, not-to-exceed value) they are supposed to be estimating. Depending upon the
objective of the study, a pre-specified cleanup standard, Cs, or a risk-based cleanup (RBC) can be viewed
18
-------
as to represent: 1) as average contaminant concentration; or 2) a not-to-exceed upper threshold value.
These two threshold values, an average value, fin, and a not-to-exceed value, A0, represent two
significantly different parameters, and different statistical methods and limits are used to compare the site
data with these two different parameters or threshold values. Statistical limits, such as an upper
confidence limit (UCL) of the population mean, an upper prediction limit (UPL) for an independently
obtained "single" observation, or independently obtained k observations (also called future k
observations, next k observations, or k different observations), upper percentiles, and upper tolerance
limits (UTLs), are often used to estimate the environmental parameters, including the EPC terms,
compliance limits (e.g., ACL, MLC), BTVs, and other not-to-exceed values. Here, UTL95%-95%
represents a 95% confidence limit of the 95th percentile of the distribution of the contaminant under study.
It is important to understand and note the differences between the uses and numerical values of these
statistical limits so that they can be properly used. Specifically, the differences between UCLs and UPLs
(or upper percentiles), and UCLs and UTLs should be clearly understood and acknowledged. A UCL with
a 95% confidence limit (UCL95) of the mean represents an estimate of the population mean (measure of
the central tendency of a data distribution), whereas a UPL95, a UTL95%-95%, and an upper 95th
percentile represent estimates of a threshold value in the upper tail of the data distribution. Therefore, a
UCL95 should represent a smaller number than an upper percentile or an upper prediction limit. Also,
since a UTL 95%-95% represents a 95% UCL of the upper 95lh percentile, a UTL should be > the
corresponding UPL95 and the 95th upper percentile. Typically, it is expected that the numerical values of
these limits should follow the order given as follows:
Sample Mean < UCL95 of Mean < Upper 95'h Percentile < UPL95 of a Single Observation < UTL95%-
95%
It should also be pointed out that as the sample size increases, a UCL95 of the mean approaches
(converges to) the population mean, and a UPL95 approaches the 95th percentile. The differences among
the various upper limits are further illustrated in Example 1-1 below. It should be noted that, in some
cases, these limits might not follow the natural order described above. This is especially true when the
upper limits are computed based upon a lognormal distribution (Singh, Singh, and Engelhardt, 1997). It is
well known that a lognormal distribution-based H-UCL95 (Land's UCL95) often yields unstable and
impractically large UCL values. An H-UCL95 often becomes larger than UPL95 and even larger than a
UTL 95%-95%. This is especially true when dealing with skewed data sets of smaller sizes. Moreover, it
should also be noted that in some cases, a H-UCL95 becomes smaller than the sample mean, especially
when the data are mildly skewed to moderately skewed and the sample size is large (e.g., > 50, 100).
Example 1-1: Consider a simple site-specific background data set associated with a Superfund site. The
data set (given in Appendix 5 of the revised Guidance for Comparing Background and Chemical
Concentrations in Soil for CERCLA Sites (EPA, 2002b)) has several inorganic contaminants of potential
concern, including aluminum, arsenic, chromium, iron, and lead. It is noted that iron concentrations
follow a normal distribution. Upper limits for the iron data set are summarized in Table 1 -1. It is noted
that the upper limits do follow the order as described above.
Tabic 1-1. Computation of Upper Limits for Iron (Normally Distributed)
Mean
Median
Min
Max
UCL95
UPL95 for a
Single
Observation
UPL95 for 4
Observations
UTL95/95
95% Upper
Percentile
9618
9615
3060
18700
11478
18145
21618
21149
17534
19
-------
A 95% UCL (UCL95) of the mean is the most commonly used limit in environmental applications. For an
example, a 95% UCL of mean is used as an estimate of the EPC. A UCL95 should not be used to estimate
a background threshold value (a value in the upper tail of the background data distribution) to be
compared with individual site observations. There are many instances in background evaluations and
background versus site comparison studies, when it is not appropriate to use a 95% UCL. Specifically,
when point-by-point site observations arc to be compared with a BTV, then that BTV should be estimated
(or represented) by a limit from the upper tail of the reference set (background) data distribution.
A brief discussion about the differences between the applications and uses of the various statistical limits
is provided below. This will assist a typical user in determining which upper limit (e.g., UCL95 or
UPL95) to use to estimate the parameter of interest (e.g., EPC or BTV).
•	A UCL represents an average value that should be compared with a threshold value also
representing an average value (pre-established or estimated), such as a mean cleanup
standard, Cs For an example, a site 95% UCL exceeding a cleanup value, Cs, may lead to
the conclusion that the cleanup level, Cs, has not been attained by the site area under
investigation. It should be noted that UCLs of means are typically computed based upon
the site data set.
•	When site averages (and not individual site observations) are compared with a threshold
value (pre-determined or estimated), such as a PRG or a RBC, or with some other
cleanup standard, Cs, then that threshold should represent an average value, and not a not-
to-exceed threshold value for individual observation comparisons.
•	A UCL represents a "collective" measure of central tendency, and it is not appropriate to
compare individual site observations with a UCL. Depending upon data availability,
single or two-sample hypotheses testing approaches are used to compare site averages:
with a specified or pre-established cleanup standard (single sample hypothesis), or with
the background population averages (two-sample hypothesis).
•	A UPL, an upper percentile, or an UTL represents an upper limit to be used for point-by-
point individual site observation comparisons. UPLs and UTLs are computed based upon
background data sets, and individual site observations are compared with those limits. A
site observation for a contaminant exceeding a background UTL or UPL may lead to the
conclusion that the contaminant is a contaminant of potential concern (COPC) to be
included in further risk evaluation and risk management studies.
•	When individual point-by-point site observations are compared with a threshold value
(pre-determined or estimated) of a background population or some other threshold and
compliance limit value, such as a PRG, MLC, or ACL, then that threshold value should
represent a not-to-cxceed value. Such BTVs or not-to-excced values are often estimated
by a 95% UPL, UTL 95%-95%, or by an upper percentile. ProUCL 4.0 can be used to
compute any of these upper limits based upon uncensored data sets as well as data sets
with nondetect values.
•	As the sample size increases, a UCL approaches the sample mean, and a UPL95
approaches the corresponding 95th upper percentile.
20
-------
•	It is pointed out that the developers of ProUCL 4.0 prefer the use of a 95% UPL (UPL95)
as an estimate of BTV or a not-to-exceed value. As mentioned before, the option of
comparing individual site observations with a BTV (specified or estimated) should be
used when few (< 4 to 6) detected site observations (preferably composite values) are to
be compared with a BTV.
•	When enough (e.g., > 8 to 10) detected site observations are available, it is preferred to
use hypotheses testing approaches. Specifically, single sample hypotheses testing
(comparing site to a specified threshold) approaches should be used to perform site
versus a known threshold comparison; and two-sample hypotheses testing (provided
enough background data are also available) approaches should be used to perform site
versus background comparison. Several parametric and nonparametric single and two-
sample hypotheses testing approaches are available in ProUCL 4.0.
It is re-emphasized that only averages should be compared with the averages or UCLs, and individual site
observations should be compared with UPLs, upper percentiles, or UTLs. For an example, the comparison
of a 95% UCL of one population (e.g., site) with a 90% or 95% upper percentile of another population
(e.g., background) cannot be considered fair and reasonable as these limits (e.g., UCL and UPL) estimate
and represent different parameters. It is hard to justify comparing a UCL of one population with a UPL of
the other population. Conclusions (e.g., site dirty or site clean) derived by comparing UCLs and UPLs, or
UCLs and upper percentiles as suggested in Wyoming DEQ, Fact Sheet #24 (2005), cannot be considered
fair and reliable. Specifically, the decision error rates associated with such comparisons can be
significantly different from the specified (e.g., Type I error = 0.1, Type II error = 0.1) decision errors.
1.5 Point-by-Point Comparison of Site Observations with BTVs,
Compliance Limits, and Other Threshold Values
Point-by-point observation comparison method is used when a small number (e.g., 4 to 6 locations) of
detected site observations are compared with pre-established or estimated BTVs, screening levels, or
preliminary remediation goals (PRGs). In this case, individual point-by-point site observations (preferably
based upon composite samples from various site locations) are compared with estimated or pre-
established background (e.g., USGS values) values, PRGs, or some other not-to-exceed value. Typically,
a single exceedance of the BTV, PRG, or of a not-to-exceed value by a site (or from a monitoring well)
observation may be considered as an indication of contamination at the site area under investigation. The
conclusion of an exceedance by a site value is some times confirmed by re-sampling (taking a few more
collocated samples) that site location (or a monitoring well) exhibiting contaminant concentration in
excess of the BTV or PRG. If all collocated (or collected during the same time period) sample
observations collected from the same site location (or well) exceed the PRG (or MLC) or a not-to-exceed
value, then it may be concluded that the location (well) requires further investigation (e.g., continuing
treatment and monitoring) and cleanup.
When BTV contaminant concentrations are not known or pre-established, one has to collect, obtain, or
extract a data set of an appropriate size that can be considered as representative of the site related
background. Statistical upper limits are computed using the data set thus obtained, which are used as
estimates of BTVs and not-to-exceed values. It should be noted that in order to compute reasonably
reliable and accurate estimates of BTVs and not-to-exceed values based upon a background (or reference)
data set, enough background observations (minimum of 8 to 10) should be collected, perhaps using an
appropriate DQO process as described in EPA (2006). Typically, background samples arc collected from
a comparable general reference area or site-specific areas that are known to be free of contamination due
21
-------
to any of the site related activities. Several statistical limits can be used to estimate the BTVs based upon
a defensible data set of an adequate size. A detailed description of the computation and estimation of
BTVs is given in Chapter 5 (for uncensored data sets) and in Chapter 6 for data sets with nondetects of
the revised background guidance document. Once again, the use of this point-by-point comparison
method is recommended when not many (e.g., < 4 to 6) site observations are to be compared with
estimated BTVs or PRGs. An exceedance of the estimated BTV by a site value may be considered as an
indication of the existing or continuing contamination at the site.
Note: When BTVs are not known, it is suggested that at least 8 to 10 (more are preferable) detected
representative background observations be made available to compute reasonably reliable estimates of
BTVs and other not-to-exceed values.
The point-by-point comparison method is also useful when quick turnaround comparisons are required.
Specifically, when the decisions have to be made in real time by a sampling or screening crew, or when
few detected site samples are available, then individual point-by-point site concentrations arc compared
either with pre-established PRGs, cleanup goals and standards, or with estimated BTVs and not-to-exceed
values. The crew can use these comparisons to make the following informative decisions:
1.	Screen and identify the COPCs,
2.	Identify the polluted site AOCs,
3.	Continue or stop remediation or excavation at a site AOC or a RU, or
4.	Move the cleanup apparatus and crew to the next AOC or RU.
During the screening phase, an exceedance of a compliance limit, action level, a BTV, or a PRG by site
values for a contaminant may declare that contaminant as a COPC. Those COPCs are then included in
future site remediation and risk management studies. During the remediation phase, an exceedance of the
threshold value such as a compliance limit (CL) or a BTV by sample values collected from a site area (or
a monitoring well (MW)) may declare that site area as a polluted AOC, or a hot spot requiring further
sampling and cleanup. This comparison method can also be used to verify if the site concentrations (e.g.,
from the base or side walls of an excavated site area) arc approaching or meeting PRG, BTV, or a cleanup
standard after excavation has been conducted at that site area.
If a larger number of detected samples (e.g., greater than 8 to 10) are available from the site locations
representing the site area under investigation (e.g., RU, AOC, EA), then the use of hypotheses testing
approaches (both single sample and two-sample) is preferred. The use of a hypothesis testing approach
will control the error rates more tightly and efficiently than the individual point-by-point site observations
versus BTV comparisons, especially when many site observations are compared with a BTV or a not-to-
exceed value.
Note: In background versus site comparison evaluations, scientists usually prefer the use of hypotheses
testing approaches to point-by-point site observation comparisons with BTVs or not-to-exceed values.
Hypotheses testing approaches require the availability of larger data sets from the populations under
investigation. Both single sample (used when BTVs, not-to-exceed values, compliance limits, or cleanup
standards are known and pre-established) and two-sample (used when BTVs and compliance limits are
unknown) hypotheses testing approaches are available in ProUCL 4.0.
22
-------
1.6 Hypothesis Testing Approaches and Their Use
Both single sample and two-sample hypotheses testing approaches are used to make cleanup decisions at
polluted sites, and also to compare contaminant concentrations of two (e.g., site versus background) or
more (several monitoring wells (MWs)) populations. The uses of hypotheses testing approaches in those
environmental applications are described as follows.
1.6.1 Single Sample Hypotheses - BTVs and Not-to-Exceed Values are Known
(Pre-established)
When pre-established BTVs and not-to-cxceed values are used, such as the USGS background values
(Shacklette and Boerngen (1984)), thresholds obtained from similar sites, or pre-established not-to-exceed
values, PRGs, or RBCs, there is no need to extract, establish, or collect a background or reference data
set. When the BTVs and cleanup standards are known, one-sample hypotheses are used to compare site
data (provided enough site data are available) with known and pre-established threshold values. It is
suggested that the project team determine (e.g., using DQO) or decide (depending upon resources) about
the number of site observations that should be collected and compared with the "pre-established"
standards before coming to a conclusion about the status (clean or polluted) of the site area (e.g., RU,
AOC) under investigation. When the number of available detected site samples is less than 4 to 6, one
might perform point-by-point site observation comparisons with a BTV; and when enough detected site
observations (> 8 to 10, more are preferable) are available, it is desirable to use single sample hypothesis
testing approaches.
Depending upon the parameter (e.g., the average value, /.10, or a not-to-cxceed value, A0), represented by
the known threshold value, one can use single sample hypothesis tests for population mean (t-tcst, sign
test) or single sample tests for proportions and percentiles. The details of the single sample hypotheses
testing approaches can be found in EPA (2006) and the Technical Guide for ProUCL 4.0. Several single
sample tests listed as follows are available in ProUCL 4.0.
One-Sample t-Test: This test is used to compare the site mean, //, with a specified cleanup standard, Cs,
where the cleanup standard, CSt represents an average threshold value, pn- The Student's t-test (or a UCL
of mean) is often used (assuming normality of site data or when site sample size is large such as larger
than 30, 50) to determine the attainment of cleanup levels at a polluted site after some remediation
activities.
One-Sample Sign Test or Wilcoxon Signed Rank (WSR) Test: These tests are nonparametric tests and can
also handle nondetect observations provided all nondetects (e.g., associated detection limits) fall below
the specified threshold value. C5. These tests are used to compare the site location (e.g., median, mean)
with a specified cleanup standard, Cs> representing a similar location measure.
One-Sample Proportion Test or Percentile Test: When a specified cleanup standard, An, such as a
PRG or a BTV represents an upper threshold value of a contaminant concentration distribution
(e.g., not-to-cxceed value, compliance limit) rather than the mean threshold value, p0> of the
contaminant concentration distribution, then a test for proportion or a test for percentile (or
equivalently a UTL 95%-95%) can be used to compare site proportion or site percentile with the
specified threshold or action level, A0 This test can also handle ND observations provided all
NDs are below the compliance limit.
23
-------
In order to obtain reasonably reliable estimates and test statistics, an adequate amount of representative
site data (8 to 10 detected observations) is needed to perform the hypotheses tests. As mentioned before,
in case only a few (e.g., < 4 to 6) detected site observations arc available, then point-by-point site
concentrations may be compared with the specified action level, A0.
1.6.2 Two-Sample Hypotheses - When BTVs and Not-to-Exceed Values are Unknown
When BTVs, not-to-excccd values, and other cleanup standards are not available, then site data arc
compared directly with the background data. In such cases, a two-sample hypothesis testing approach can
be used to perform site versus background comparisons. Note that this approach can be used to compare
concentrations of any two populations including two different site areas or two different monitoring wells
(MWs). In order to use and perform a two-sample hypothesis testing approach, enough data should be
available (collected) from each of the two populations under investigation. Site and background data
requirements (e.g., based upon DQOs) to perform two-sample hypothesis test approaches are described in
EPA (1989b, 2006), Breckenridge and Crockett (1995), and the VSP (2005) software package. While
collecting site and background data, for better representation of populations under investigation, one may
also want to account for the size of the background area (and site area for site samples) into sample size
determination. That is, a larger number (> 10 to 15) of representative background (or site) samples should
be collected from larger background (or site) areas. As mentioned before, every effort should be made to
collect as many samples as determined using DQO processes as described in EPA documents (2006).
The two-sample (or more) hypotheses approaches are used when the site parameters (e.g., mean, shape,
distribution) are being compared with the background parameters (e.g., mean, shape, distribution). The
two-sample hypotheses testing approach is also used when the cleanup standards or screening levels are
not known a priori, and they need to be estimated based upon a data set from a background or reference
population. Specifically, two-sample hypotheses testing approaches are used to compare 1) the average
contaminant concentrations of two or more populations such as the background population and the
potentially contaminated site areas, or 2) the proportions of site and background observations exceeding a
pre-established compliance limit, A0. In order to derive reliable conclusions with higher statistical power
based upon hypothesis testing approaches, enough data (e.g., minimum of 8 to 10 samples) should be
available from all of the populations under investigation. It is also desirable to supplement statistical
methods with graphical displays, such as the double Q-Q plots, or side-by-side multiple box plots, as
available in ProUCL 4.0. Several parametric and nonparametric two-sample hypotheses testing
approaches, including Student's t-test, the Wilcoxon-Mann-Whitney (WMW) test, Gehan's test, and
quantile test are included in ProUCL 4.0. Details of those methods are described in the ProUCL 4.0
Technical Guide. It should be noted that the WMW, Gehan, and quantile tests are also available for data
sets with NDs. Gehan's test is specifically meant to be used on data sets with multiple detection limits. It
is also suggested that for best and reliable conclusions, both the WMW and quantile tests should be used
on the same data set. The details of these two tests with examples are given in EPA (1994, 2006).
The samples collected from the two (or more) populations should all be of the same type obtained using
similar analytical methods and apparatus. In other words, the collected site and background samples
should be all discrete or all composite (obtained using the same design and pattern), and be collected from
the same medium (soil) at similar depths (e.g., all surface samples or all subsurface samples) and time
(e.g., during the same quarter in groundwater applications) using comparable (preferably same) analytical
methods. Good sample collection methods and sampling strategies are given in EPA (1996, 2003)
guidance documents.
-------
1.7 Minimum Sample Size Requirements
Due to resource limitations, it may not be possible (nor needed) to sample the entire population (e.g.,
background area, site area, areas of concern, exposure areas) under study. Statistics is used to draw
inference(s) about the populations (clean, dirty) and their known or unknown parameters (e.g.,
comparability of population means, not-to-exceed values, upper percentiles, and spreads) based upon
much smaller data sets (samples) collected from those populations under study. In order to determine and
establish BTVs, not-to-exceed values, or site-specific screening levels, defensible data set(s) of
appropriate sizc(s) needs to be collected from background areas (e.g., site-specific, general reference or
pristine area, or historical data). The project team and site experts should decide what represents a site
population and what represents a background population. The project team should determine the
population size and boundaries based upon all current and future objectives for the data collection. The
size and area of the population (e.g., a remediation unit, area of concern, or an exposure unit) may be
determined based upon the potential land use, and other exposure and risk management objectives and
decisions. Moreover, appropriate effort should be made to properly collect soil samples (e.g., methods
based upon Gy sampling theory), as described in Gcrlach and Noccrino (2003).
Using the collected site and background data sets, statistical methods supplemented with graphical
displays are used to perform site versus background comparisons. The test results and statistics obtained
by performing such site versus background comparisons are used to determine if the site and background
level contaminant concentration are comparable; or if the site concentrations exceed the background
threshold concentration level; or if an adequate amount of cleanup and remediation approaching the BTV
or a cleanup level have been performed at polluted areas (e.g., AOC, RU) of the site under study.
In order to perform statistical inference (estimation and hypothesis testing), one needs to determine the
sample sizes that need to be collected from the populations (e.g., site and background) under investigation
using appropriate DQO processes (EPA 2006). However, in some cases, it may not be possible to collect
the same number of samples as determined by using a DQO process. For example, the data might have
already been collected (often is the case in practice) without using a DQO process, or due to resource
constraints, it may not be possible to collect as many samples as determined by using a DQO-based
sample size formula. It is observed that, in practice, the project team and the decision makers may not
collect enough background samples, perhaps due to various resource constraints. However, every effort
should be made to collect at least 8 to 10 (more arc desirable) background observations before using
methods as incorporated in ProUCL 4.0. The minimum sample size recommendations as described here
are useful when resources are limited (as often is the case), and it may not be possible to collect as many
background and site (e.g., AOC, EU) samples as computed using DQOs and the sample size
determination formulae given in the EPA (2006). Some minimum sample size requirements arc also given
in Fact Sheet #24, prepared by Wyoming Department of Environmental Quality (June 2005).
As mentioned before, the topics of DQO processes and the sample size determination are described in
detail in the EPA (2006) guidance document. Therefore, the sample size determination formulae based
upon DQO processes are not included in ProUCL 4.0 and its Technical Guide. It should be noted that
DQO-based sample size determination routines are available in DataQUEST (EPA, 1997) and VSP
(2005) software packages. Guidance and suggestions arc provided on the minimum number of
background and site samples needed to be able to use statistical methods for the computation of upper
limits, and to perform single sample tests, two-sample tests such as t-test, and the Wilcoxon-Mann-
Whitney (WMW) test. The minimum sample size recommendations (requirements) as described here arc
25
-------
made so that reasonably reliable estimates of EPC terms and BTVs, and defensible values of test statistics
for single or two-sample hypotheses tests (e.g., t-test, WMW test), can be computed.
1.7.1	Minimum Sample Size for Estimation and Point-by-Point Site Observation Comparisons
•	Point-by-point observation comparison method is used when a small number (e.g., 4 to 6
locations) of detected site observations are compared with pre-established or estimated
BTVs, screening levels, or PRGs. In this case, individual point-by-point site observations
(preferably based upon composite samples from various site locations) are compared with
estimated or pre-established background (e.g., USGS values) values, PRGs, or some
other not-to-exceed value.
•	When BTV contaminant concentrations are not known or pre-established, one has to
collect, obtain, or extract a data set of an appropriate size that can be considered as
representative of the site related background. Statistical upper limits are computed using
the data set thus obtained; which are used as estimates of BTVs and not-to-exceed values.
It should be noted that in order to compute reasonably reliable and accurate estimates of
BTVs and not-to-exceed values based upon a background (or reference) data set, enough
background observations (minimum of 8 to 10) should be collected perhaps using an
appropriate DQO process as described in EPA (2006). Typically, background samples are
collected from a comparable general reference area or a site-specific area.
•	When enough (e.g., > 8 to 10) detected site observations are available, it is preferred to
use hypotheses testing approaches. Specifically, single sample hypotheses testing
(comparing site to a specified threshold) approaches should be used to perform site
versus a known threshold comparison and two-sample hypotheses testing (provided
enough background data are also available) approaches should be used to perform site
versus background comparison.
1.7.2	Minimum Sample Size Requirements for Hypothesis Testing Approaches
Statistical methods (as in ProUCL 4.0) used to estimate EPC terms, BTVs, PRGs, or to compare the site
contaminant concentration data distribution with the background data distribution can be computed based
upon small site and background data sets (e.g., of sizes 3, 4, 5, or 6). However, those statistics cannot be
considered representative and reliable enough to make important cleanup and remediation decisions. It is
recommended not to use those statistics to draw cleanup and remediation decisions potentially impacting
the human health and the environment. It is suggested that the estimation and hypothesis testing methods
as incorporated in ProUCL 4.0 may not be used on background data sets with fewer than 8 to 10 detected
observations. Also, when using hypotheses testing approaches, it is suggested that the site and
background data be obtained using an appropriate DQO process as described in EPA (2006). In case that
is not possible, it is suggested that the project team at least collect 8 to 10 observations from each of the
populations (e.g., site area, MWs, background area) under investigation.
Site versus background comparisons and computation of the BTVs depend upon many factors, some of
which cannot be controlled. These factors include the site conditions, lack of historical information, site
medium, lack of adequate resources, measurement and analytical errors, and accessibility of the site areas.
Therefore, whenever possible, it is desirable to use more than one statistical method to perform site versus
background comparison. The use of statistical methods should always be supplemented with appropriate
graphical displays.
26
-------
1.8 Sample Sizes for Bootstrap Methods
Several parametric and nonparamctric (including bootstrap methods) UCL, UPL, and other limits
computation methods for both full-uncensored data sets (without nondetects) and left-censored data sets
with nondetects are available in ProUCL 4.0. It should be noted that bootstrap resampling methods are
useful when not too few (e.g., < 10-15) and not too many (e.g., > 500-1000) detected observations are
available. For bootstrap methods (e.g., percentile method, BCA bootstrap method, bootstrap t method), a
large number (e.g., 1000, 2000) of bootstrap rcsamplcs (with replacement) are drawn from the same data
set. Therefore, in order to obtain bootstrap resamples with some distinct values (so that statistics can be
computed from each rcsample), it is suggested that a bootstrap method should not be used when dealing
with small data sets of sizes less than 10-15. Also, it is not required to bootstrap a large data set of size
greater than 500 or 1000; that is when a data set of a large size (e.g., > 1000) is available, there is no need
to obtain bootstrap resamples to compute statistics of interest (e.g., UCLs). One can simply use a
statistical method on the original large data set. Moreover, bootstrapping a large data set of size greater
than 500 or 1000 will be time consuming.
1.9	Statistical Analyses by a Group ID
The analyses of data categorized by a group ID variable such as: l) Surface vs. Subsurface;
2) AOC1 vs. AOC2; 3) Site vs. Background; and 4) Upgradient vs Downgradient monitoring wells are
quite common in many environmental applications. ProUCL 4.0 offers this option for data sets with and
without nondetects. The Group Option provides a powerful tool to perform various statistical tests and
methods (including graphical displays) separately for each of the group (samples from different
populations) that may be present in a data set. For an example, the same data set may consist of samples
from the various groups or populations representing site, background, two or more AOCs, surface,
subsurface, monitoring wells. The graphical displays (e.g., box plots, Q-Q plots) and statistics
(computations of background statistics, UCLs, hypotheses testing approaches) of interest can be
computed separately for each group by using this option.
It should be pointed out that it is the users' responsibility to provide adequate amount of detected data to
perform the group operations. For an example, if the user desires to produce a graphical Q-Q plot (using
only detected data) with regression lines displayed, then there should be at least two detected points (to
compute slope, intercept, sd) in the data set. Similarly if the graphs are desired for each of the group
specified by the group ID variable, there should be at least two detected observations in each group
specified by the group variable. ProUCL 4.0 generates a warning message (in orange color) in the lower
panel of the ProUCL 4.0 screen. Specifically, the user should make sure that a variable with nondetects
and categorized by a group variable should have enough detected data in each group to perform the
various methods (e.g., GOF tests, Q-Q plots with regression lines) as incorporated in ProUCL 4.0.
1.10	Use of Maximum Detected Value as Estimates of Upper Limits
Some practitioners tend to use the maximum detected value as an estimate of the EPC term. This is
especially true when the sample size is small such as < 5, or when a UCL95 exceeds the maximum
detected values (EPA, 1992b). Also, many times in practice, the BTVs and not-to-exceed values are
estimated by the maximum detected value. This section discusses the appropriateness of using the
maximum detected value as estimates of the EPC term, BTVs, or other nor-to-exceed values.
27
-------
1.10.1	Use of Maximum Detected Value to Estimate BTVs and Not-to-Exceed Values
It is noted that BTVs and not-to-exceed values represent upper threshold values in the upper tail of a data
distribution; therefore, depending upon the data distribution and sample size, the BTVs and other not-to-
excecd values may be estimated by the maximum detected value. As described earlier, upper limits, such
as UPLs, UTLs, and upper percentiles, are used to estimate the BTVs and not-to-exceed values. It is noted
that a nonparametric UPL or UTL is often estimated by higher order statistics such as the maximum value
or the second largest value (EPA 1992a, RCRA Guidance Addendum). The use of higher order statistics
to estimate the UTLs depends upon the sample size. For an example: 1) 59 to 92 samples, a
nonparametric UTL95%-95 is given by the maximum detected value; 2) 93 to 123 samples, a
nonparametric UTL95%-95 is given by the second largest maximum detected value; and 3) 124 to 152
samples, a UTL95%-95 is given by the third largest detected value in the sample.
Note: Therefore, when a data set does not follow a discernable distribution, the maximum observed value
(or other high order statistics) may be used as an estimate of BTV or a not-to-exceed value, provided the
maximum value does not represent an outlier or a contaminating, observation perhaps representing a hot
location.
1.10.2	Use of Maximum Detected Value to Estimate EPC Terms
Some practitioners tend to use the maximum detected value as an estimate of the EPC term. This is
especially true when the sample size is small such as < 5, or when a UCL95 exceeds the maximum
detected values (EPA, 1992b). Specifically, a RAGS document (EPA, 1992) suggests the use of the
maximum detected value as a default value to estimate the EPC term when a 95% UCL (e.g., the H-UCL)
exceeded the maximum value. ProUCL 4.0 can compute a 95% UCL of mean using several methods
based upon normal, Gamma, lognormal, and non-discernable distributions. In past (e.g., EPA, 1992b),
only two methods were used to estimate the EPC term based upon: 1) Student's t-statistic and a normal
distribution, and 2) Land's H-statistic (1975) and a lognormal model. The use of H-statistic often yields
unstable and impractically large UCL95 of the mean (Singh, Singh, and Iaci, 2002). For skewed data sets
of smaller sizes (e.g., < 30, < 50), H-UCL often exceeds the maximum detected value. This is especially
true when extreme high outliers may be present in the data set. Since the use of a lognormal distribution
has been quite common (e.g., suggested as a default model in a RAGS document (EPA, 1992)), the
exceedance of the maximum detected value by H-UCL95 is frequent for many skewed data sets of
smaller sizes (e.g., < 30, < 50). It is also be noted that for highly skewed data sets, the sample mean
indeed can even exceed the upper 90%, 95%, etc., percentiles, and consequently, a 95% UCL of mean can
exceed the maximum observed value of a data set.
All of these occurrences result in the possibility of using the maximum detected value as an estimate of
the EPC term. It should be pointed out that in some cases, the maximum observed value actually might
represent a highly polluted outlying observation. Obviously, it is not desirable to use a highly polluted
value as an estimate of average exposure (EPC term) for an exposure area. This is especially true when
one is dealing with lognormally distributed data sets of small sizes. As mentioned before, for such highly
skewed data sets that cannot be modeled by a gamma distribution, a 95% UCL of the mean should be
computed using an appropriate distribution-free nonparametric method.
It should be pointed out that the EPC term represents the average exposure contracted by an individual
over an exposure area (EA) during a long period of time; therefore, the EPC term should be estimated by
using an average value (such as an appropriate 95% UCL of the mean) and not by the maximum observed
concentration. One needs to compute an average exposure and not the maximum exposure. It is unlikely
28
-------
that an individual will visit the location (e.g., in an EA) of the maximum detected value all of the time.
One can argue that the use of this practice results in a conservative (higher) estimate of the EPC term. The
objective is to compute an accurate estimate of the EPC term. Several other methods (instead of H-UCL)
as described in EPA (2002), and included in ProUCL 4.0 (EPA 2007), are available to estimate the EPC
terms. It is unlikely (but possible with outliers) that the UCLs based upon those methods will exceed the
maximum detected value, unless some outliers arc present in the data set. ProUCL 4.0 displays a warning
message when the recommended 95% UCL (e.g., Hall's or bootstrap t UCL with outliers) of the mean
exceeds the observed maximum concentration. When a 95% UCL does exceed the maximum observed
value, ProUCL4.0 recommends the use of an alternative UCL computation method based upon the
Chebyshev inequality. The detailed recommendations (as functions of sample size and skewness) for the
use of those UCLs are summarized in ProUCL 3.0 User Guide (EPA, 2004).
Singh and Singh (2003) studied the performance of the max test (using the maximum observed value as
an estimate of the EPC term) via Monte Carlo simulation experiments. They noted that for skewed data
sets of small sizes (e.g., < 10-20), the max test does not provide the specified 95% coverage to the
population mean, and for larger data sets, it overestimates the EPC term, which may require unnecessary
further remediation. The use of the maximum value as an estimate of the EPC term also ignores most
(except for maximum value) of the information contained in the data set. With the availability of so many
UCL computation methods (15 of them), the developers of ProUCL 4.0 do not recommend using the
maximum observed value as an estimate of the EPC term representing an average exposure by an
individual over an EA. Also, for the distributions considered, the maximum value is not a sufficient
statistic for the unknown population mean.
Note: It is recommended that the maximum observed value NOT be used as an estimate of the EPC term
representing average exposure contracted by an individual over an EA. For the sake of interested users,
ProUCL displays a warning message when the recommended 95% UCL (e.g., Hall's bootstrap UCL etc.)
of the mean exceeds the observed maximum concentration. For such scenarios (when a 95% UCL does
exceed the maximum observed value), an alternative 95% UCL computation method is recommended by
ProUCL 4.0.
1.10.3	Samples with Nondetect Observations
Nondetect observations (or less than obvious values) are inevitable in most environmental data sets.
Singh, Maichlc, and Lee (EPA, 2006) studied the performances (in terms of coverages) of the various
UCL95 computation methods including the simple substitution methods (such as the DL/2 and DL
methods) for data sets with nondetect observations. They concluded that the UCLs obtained using the
substitution methods, including the replacement of nondetects by respective DL/2, do not perform well
even when the percentage of nondetect observations is low, such as 5%-10%. They recommended
avoiding the use of substitution methods to compute UCL95 based upon data sets with nondetect
observations.
1.10.4	Avoid the Use of DL/2 Method to Compute UCL95
Based upon the results of the report by Singh, Maichle, and Lec (EPA, 2006), it is strongly recommended
to avoid the use of the DL/2 method to perform GOF test, and to compute the summary statistics and
various other limits (e.g., UCL, UPL) often used to estimate the EPC terms and BTVs. Until recently, the
DL/2 method has been the most commonly used method to compute the various statistics of interest for
data sets with BDL observations. The main reason of its common use has been the lack of the availability
of other defensible methods and associated programs that can be used to estimate the various
29
-------
environmental parameters of interest. Today, several other methods (e.g., KM method, bootstrap
methods) with better performances are available that can be used to compute the various upper limits of
interest. Some of those parametric and nonparametric methods are available in ProUCL 4.0. Even though
the DL/2 method (to compute UCLs, UPLs, andfor goodness-of-fit test) has also been incorporated in
ProUCL 4.0, its use is not recommended due to its poor performance. The DL/2 method is included in
ProUCL 4.0 only for historical reasons as it had been the most commonly used and recommended method
until recently (EPA, 2006). Some of the reviewers of ProUCL 4.0 suggested and requested the inclusion
of DL/2 method in ProUCL for comparison purposes.
Note: The DL/2 method has been incorporated in ProUCL 4.0 for historical reasons only. NERL-EPA,
Las Vegas strongly recommends avoiding the use of DL/2 method even when the percentage (%) of NDs
is as low as 5%-10%. There are other methods available in ProUCL 4.0 that should be used to compute
the various summary statistics and upper limits based upon data sets with multiple detection limits.
1.10.5 Samples with Low Frequency of Detection
When all of the sampled data values are reported as nondetects, the EPC term should also be reported as a
nondetect value, perhaps by the maximum reporting limit (RL) or maximum RL/2. Statistics (e.g.,
UCL95) computed based upon only a few detected values (e.g., < 4 to 6) cannot be considered reliable
enough to estimate the EPC terms having potential impact on the human heath and the environment.
When the number of detected data is small, it is preferable to use simple ad hoc methods rather than using
statistical methods to compute the EPC terms and other upper limits. Specifically, it is suggested that in
cases when the detection frequency is low (e.g., < 4%-5%) and the number of detected observations is
low, the project team and the decision makers together should make a decision on site-specific basis on
how to estimate the average exposure (EPC term) for the contaminant and area under consideration. For
such data sets with low detection frequencies, other measures such as the median or mode represent better
estimates (with lesser uncertainty) of the population measure of central tendency.
Additionally, it is also suggested that when most (e.g., > %95) of the observations for a contaminant lie
below the detection limit(s) or reporting limits (RLs), the sample median or the sample mode (rather than
the sample average which cannot be computed accurately) may be used as an estimate the EPC term. Note
that when the majority of the data are nondetects, the median and the mode will also be a nondetect. The
uncertainty associated with such estimates will be high. It is noted that the statistical properties, such as
the bias, accuracy, and precision of such estimates, would remain unknown. In order to be able to
compute defensible estimates, it is always desirable to collect more samples.
Note: In case the number of available detected samples is small (< 5), it is suggested that the project
team decide about the estimation of the EPC term on site-specific basis. For such small data sets with
very few detected values (< 5), the final decision ("policy decision ") on how to estimate the EPC term
should be determined by the project team and decision makers.
1.11 Other Applications of Methods in ProUCL 4.0
In addition to performing background versus site comparisons for CERCLA and RCRA sites, and
estimating the EPC terms in exposure and risk evaluation studies, the statistical methods as incorporated
in ProUCL 4.0 can be used to address other issues dealing with environmental investigations that are
conducted at Superfund or RCRA sites.
30
-------
1.11.1 Identification of COPCs
Risk assessors and RPMs often use screening levels or BTVs to identify the COPCs during the screening
phase of a cleanup project to be conducted at a contaminated site. The screening for the COPCs is
performed prior to any characterization and remediation activities that may have to be conducted at the
site under investigation. This comparison is performed to screen out those contaminants that may be
present in the site medium of interest at low levels (e.g., at or below the background levels or pre-
established screening levels) and may not pose any threat and concern to human health and the
environment. Those contaminants may be eliminated from all future site investigations, and risk
assessment and risk management studies.
In order to identify the COPCs, point-by-point site observations (preferably composite samples) are
compared with pre-established screening levels, SSL, or estimated BTVs. This is especially true when the
comparisons of site concentrations with screening levels or BTVs are conducted in real time by the
sampling or cleanup crew right there in the site field. The project team should decide about the type of
site samples (discrete or composite) and the number of detected site observations (not more than 4 to 6)
that should be collected and compared with the screening levels or the BTVs. In case BTVs, screening
levels, or not-to-exceed values are not known, the availability of a defensible background or reference
data set of reasonable size (e.g., > 8 to'10, more are preferable) is required to obtain reliable estimates of
BTVs and screening levels. When a reasonable number of detected site observations are available, it is
preferable to use hypotheses testing approaches. The contaminants with concentrations exceeding the
respective screening values or BTVs may be considered as COPCs, whereas contaminants with
concentrations (in all collected samples) lower than the screening value, PRG, or an estimated BTV may
be omitted from all future evaluations including the risk assessment and risk management investigations.
1.11.2	Identification of Non-Compliance Monitoring Wells
In monitoring well (MW) compliance assessment applications, individual (often discrete) contaminant
concentrations from a MW are compared with pre-established ACL, MCL, or an estimated compliance
limit (CL) based upon a group of upgradient wells representing the background population. An
exceedance of the MCL or the BTV by a MW concentration may be considered as an indication of
contamination in that MW. In such individual concentration comparisons, the presence of contamination
(determined by an exceedance) may have to be confirmed by re-sampling from that MW. If
concentrations of contaminants in both the original sample and the re-sample(s) exceed the MCL or BTV,
then that MW may require closer scrutiny, perhaps triggering the remediation remedies as determined by
the project team. If the concentration data from a MW for about 4 to 5 continuous quarters (or another
designated time period determined by the project team) are below the MCL or BTV level, then that MW
may be considered as complying with (achieving) the pre-established or estimated standards. Statistical
methods as described in Chapters 5 and 6 of the revised background guidance document (EPA, 2002b)
can be used to estimate the not-to-exceed values or BTVs based upon background or upgradient wells in
case the ACLs or MCLs are not pre-determined.
1.11.3	Verification of the Attainment of Cleanup Standards, Cs
Hypothesis testing approaches may be used to verify the attainment of the cleanup standard, Cs, at
polluted site areas of concern after conducting remediation and cleanup at the site AOC (EPA, 2006). In
order to properly address this scenario, a site data set of adequate size (minimum of 8 to 10 detected site
observations) needs to be made available from the remediated or excavated areas of the site under
investigation. The sample size should also account for the size of the remediated site area; meaning that
31
-------
larger site areas should be sampled more (with more observations) to obtain a representative sample of the
site under investigation.
Typically, the null hypothesis of interest is H0: Site Mean, |is>=Cs versus the alternative hypothesis, Hi:
Site Mean, < Cs, where the cleanup standard, Cs, is known a priori. The sample size needed to perform
such single sample hypotheses tests can be obtained using the DQO process-based sample size formula as
given in the EPA (2006) documents. In any case, in order to use this test, a minimum of 8 to 10 detected
site samples should be collected. The details of statistical methods used to perform single sample
hypothesis as described above can be found in EPA (2006).
1.11.4 Using BTVs (Upper Limits) to Identify Hot Spots
The use of upper limits (e.g., UTLs) to identify hot spot(s) has also been mentioned in the Guidance for
Comparing Background and Chemical Concentrations in Soil for CERCLA Sites (EPA, 2002b). Point-by-
point site observations (preferably using composite samples representing a site location) are compared
with a pre-established or estimated BTV. Exceedances of the BTV by site observations may be
considered as representing locations with elevated concentrations (hot spots). Chapters 5 and 6 of the
revised background guidance document (EPA, 2002b) describe several methods to estimate the BTVs
based upon full data sets without nondetects and left-censored data sets with nondetect observations.
The rest of the chapters of this User Guide illustrate the use of the various procedures as incorporated in
ProUCL 4.0. Those methods are useful to analyze environmental data sets with and without the nondetect
observations. It is noted that ProUCL 4.0 is the first software package equipped with single sample and
two-sample hypotheses testing approaches that can be used on data sets with nondetect observations.
32
-------
Chapter 2
Entering and Manipulating Data
2.1	Creating a New Data Set
By simply executing the ProUCL 4.0, a new worksheet is generated and displayed for the user to enter
data.
To create a new worksheet: click M or choose File ~ New
2.2	Opening an Existing Data Set
ick-^
If your data sets are stored in the ProUCL data format (*.wst), then click -ill or choose File ~ Open
• If your data sets are stored in the Microsoft Excel format (*.xls), then choose File ~
Other Files... ~Import Excel... OR File ~ Load Excel Data
ProUCL 4.0.- (Worksheet.wst|
I Eta: Confijut Su.tjpmy Situsiics ROS Est ffDs Grephj Ou'.Hftr Tests Goodness-of Fit Kycc;htssTesjnt 5ack4roind UCL Vv'noovY Heto
D.
—j Ootr,
LcecExrtlDftra
u
nkE
Close

Spve is.


Possible Error Messages:
o When you import an Excel file, make sure that you have an empty worksheet. If there
is no empty worksheet, then you must create a new worksheet before importing an
Excel file. Otherwise, there will be an error message in the Log Panel: "[Error]
Worksheet must be empty."
o First open a new worksheet and then import the Excel file,
o Make sure that the file you trying to import is not currently open. Otherwise, there
will be the following warning message in the Log panel:
o "[Information] Unable to open C:\***.xls." Check the validity of this file.
33
-------
2.3 Input File Format
•	Note that ProUCL 4.0 does not require that data in each column must end with a nonzero
value ("ProUCL 3.0 requires this). Therefore, all zero values (in the beginning, middle, or
end of data columns) are treated as valid zero values as part of the data set.
•	The program can read Excel files. The user can perform typical Cut, Paste, and Copy
operations.
•	The first row in all input data files consist of alphanumeric (strings of numbers and
characters) names representing the header row. Those header names may represent
meaningful variable names such as Arsenic, Chromium, Lead, Group-ID, and so on.
o The Group-ID column has the labels for the groups (e.g., Background, AOC1,
AOC2, 1, 2, 3, a, b, c, Site 1, Site 2, and so on) that might be present in the data set.
The alphanumeric strings (e.g., Surface, Sub-surface) can be used to label the various
groups.
o The data file can have multiple variables (columns) with unequal number of
observations.
o Except for the header row and columns representing the group labels, only numerical
values should appear in all other columns,
o All alphanumeric strings and characters (e.g., blank, other characters, and strings),
and all other values (that do not meet the requirements above) in the data file are
treated as missing values,
o Also, a large value denoted by 1E31 (= 1 x 1031) can be used to represent missing data
values. All entries with this value are ignored from the computations. These values
are counted under number of missing values.
2.4 Number Precision
• You may turn "Full Precision" on or off by choosing Configure ~ Full Precision
On/OFF
•	By leaving "Full Precision" turned off, ProUCL will display numerical values using an
appropriate (default) decimal digit option. However, by turning "Full Precision" off, all
decimal values will be rounded to the nearest thousandths place.
•	"Full Precision" on option is specifically useful when one is dealing with data sets
consisting of small numerical values (e.g., < 1) resulting in small values of the various
estimates and test statistics. These values may become so small with several leading zeros
34
-------
(e.g., 0.00007332) after the decimal. In such situations, on may want to use "Full
Precision" option to see nonzero values after the decimal.
Note: For the purpose of this User Guide, unless noted otherwise, all examples have been described
using the "Full Precision " off option. This option prints out results up to 3 significant digits after the
decimal.
2.5 Entering and Changing a Header Name
1. Highlight the column whose header name (variable name) you want to change by clicking either
the column number or the header as shown below.

0
1
2

Arsenic


1
4.5


2
5.G


3
4.3


4
5.4


5
9.2


c "Header Name"



0
1
2
Header Name
Change the Header Name.
4. Click the OK button to get the following output with the changed variable name.
35
-------

0
1
2

Arsenic Site 1


1
4 5


2
5.6


3
4.3


4
5.4


5
92


2.6 Saving Files
Edit Configure
New
Open ...
Load Excel Data
Other Files ... ~
Close
Save
Save As...
Print
Print Preview
Exit
i	 1
•	Save option allows the user to save the active window.
•	Save As option allows the user to save the active window. This option follows typical
Windows standards, and saves the active window to a file in Excel 95 (or higher) format.
All modified/edited data files, and output screens (excluding graphical displays)
generated by the software can be saved as Excel 95 (or higher) spreadsheet.
2.7 Editing
Click on the Edit menu item to reveal the following drop-down options.
00 File ESI Configure Summary Stacsacs ROSEst. NDs Graphs Outfisr tests Goodness-of-Fit h.yoothesisTesting Bsckgrouna UCL Window Help
0|«3]bshs!!E^B
	1	\ Copy Ctrl-rC
Mavigal p££te ctrl+V
1
0 i 1 ! ^
3 | 4 | 5 I 6 | 7 I 8 I 9 I
The following Edit drop-down menu options are available:
•	Cut option: similar to a standard Windows Edit option, such as in Excel. It performs
standard edit functions on selected highlighted data (similar to a buffer).
•	Copy option: similar to a standard Windows Edit option, such as in Excel. It performs
36
-------
typical edit functions on selected highlighted data (similar to a buffer).
•	Paste option: similar to a standard Windows Edit option, such as in Excel. It performs
typical edit functions of pasting the selected (highlighted) data to the designated
spreadsheet cells or area.
•	It should be noted that the Edit option could also be used to Copy Graphs. This topic
(copying and pasting graphs) is illustrated in detail in Chapter 13.
2.8 Handling Nondetect Observations
•	ProUCL 4.0 can handle data sets with single and multiple detection limits.
•	For a variable with nondetect observations (e.g., arsenic), the detected values, and the
numerical values of the associated detection limits (for less than values) are entered in the
appropriate column associated with that variable.
•	Specifically, the data for variables with nondetect values are provided in two columns.
One column consists of the detected numerical values with less than (< DL,) values
entered as the corresponding detection limits (or reporting limits), and the second column
represents their detection status consisting of only 0 (for less than values) and 1 (for
detected values) values. The name of the corresponding variable representing the
detection status should start with d_, or D_ (not case sensitive) and the variable name.
The detection status column with variable name starting with a D_ (or a d_) should have
only two values: 0 for nondetect values, and 1 for detected observations.
•	For an example, the header name, D_Arsenic is used for the variable, Arsenic having
nondetect observations. The variable D_Arsenic contains a 1 if the corresponding Arsenic
value represents a detected entry, and contains a 0 if the corresponding entry for variable,
Arsenic, represents a nondetect. An example data set illustrating these points is given as
follows.
37
-------
n
D: Vexamp Ic. wst;
Arsenic
1
D Arsenic
Mercury
D_Mercury
Vanadium
Zinc
Group
10
11
12
13
14
15
16
17
18
19
20
21
4 51
22
23
24
l5T
°i
1;
0 071
0 07:
' 0111
0	2;
0 B1J
0.121
0 041
0 06
" 0 93"
0 125
o"ie
0 21
0 29
0~44|
oi2j
0 055
0 055
" 0 2T
" 0 67
01
08
" 0 26
"~097
~Cf05
o_26T
V.


r -
16 4j
16 8r
. . L
172.
19 4'
15	3"
30 8
23	4"
138 "
189;"
1725!
'172!
- J_
163
17.il
10.3^
151
24	3] ~
18:
16	9:
12T
		i._
89	3|Surface
90	7[Surface
95 5] Surface
113| Surface
266i Surface
80 9j Surface
80 4 ^Surface
89 2 Surface
182
804
91 9
Surface
Surface
Subsurface
1121 Subsurface
172; Subsurface!
-. J
99, Subsurface
90 7^Subsurface
66 3'Subsurface
j .
75jSubsurface
185) Subsurface
184
684
Subsurface
Subsurface
Caution
•	Care should be taken to avoid any misrepresentation of detected and nondetected values.
Specifically, it is advised not to have any missing values (blanks, characters) in the
D_column (detection status column). If a missing value is located in the D_ column (and
not in the associated variable column), the corresponding value in the variable column is
treated as a nondetcct, even if this might not have been the intention of the user.
•	It is mandatory that the user makes sure that only a 1 or a 0 are entered in the detection
status D_column. If a value other than a 0 or a 1 is entered in the D_ column (the
detection column), results may become unreliable, as the software defaults to any number
other than 0 or 1 as a nondetect value.
•	When computing statistics for full data sets without any nondetect values, the user should
select only those variables (from the list of available variables) that contain no nondetect
observations. Specifically, nondetect values found in a column chosen for the summary
statistics (full-uncensored data set) will be treated as a detected value; whatever value
(e.g., detection limit) is entered in that column will be used to compute summary
statistics for a full-uncensored data set without any nondetect values.
•	Two-Sample Hypotheses: It should be noted that at present, when using two-sample
hypotheses approaches (WMW test, Gehan test, and quantile test) on data sets with NDs,
-------
both samples or variables (e.g., site-As, Back-As) should be specified as having
nondctects, even though one of the variables may not have any ND observations. This
means, a ND column (with 0 = ND, and 1 = detect) should be provided for each variable
(here D_site-As, and D_Back-As) to be used in this comparison. If a variable (e.g., site-
As) does not have any nondetects, still a column with label D_site-As should be included
in the data set with all entries = 1 (detected values).
•	The following sample (not from a Superfund site) data set given on the next page
illustrates points related to this option and issues listed above. The data set considered
contains some nondctect measurements for Arsenic and Mercury. It should also be noted
that the Mercury concentrations are used to illustrate the points related to nondetect
observations only. Arsenic and Zinc concentrations are used to illustrate the use of the
group variable, Group (Surface, Subsurface).
•	If for mercury, one computes summary statistics (assuming no nondetect values) using
"Full" data set option, then all nondetect values (with "0" entries in D_Mercury column)
will be treated as detected values, and summary statistics will be computed accordingly.
2.10	Summary Statistics for Data Sets with Nondetect Observations
•	In order to compute simple summary statistics or to compute other statistics of interest
(e.g., background statistics, GOF test, UCLs, outliers) for variables with nondctect
values, one should choose the nondetect option, "With NDs" from the various available
menu options such as Outliers, Background Statistics, UCLs, Goodness-of-Fit test, Q-Q
plot, Box Plot.
•	It should be noted that "summary" statistics for a data set with nondetect observations
represent (at least in ProUCL 4.0) simple summary statistics based upon the data set
without using nondctect observations. All other parametric and nonparametric statistics
and estimates of population mean, variance, percentiles (e.g., MLEs, KM, and ROS
estimates) for variables with nondetect observations are given in other menu options such
as background statistics and UCL. The simple "Summary Statistics/With NDs" option
only provides simple statistics (e.g., % NDs, max ND, Min ND, and Mean of detected
values) based upon detected values. These statistics (e.g., sd of log-transformed detected
values) may help a user to determine the degree of skewness (e.g., mild, moderate, high)
of the data set consisting of detected values. These statistics may also help the user to
choose the most appropriate method (e.g., KM (BCA) UCL or KM (t) UCL) to compute
UCLs, UPLs, and other limits.
•	As mentioned before, various other statistics and estimates of interest (e.g., mean, sd,
UCLs, UTLs, MLEs, and KM estimates) for data sets with nondctect observations arc
computed in other the menu options (UCLs, Outliers, Background Statistics, GOF tests)
available in ProUCL 4.0.
2.11	Handling Missing Values
•	ProUCL 4.0 can handle missing values within a data set.
39
-------
All blanks, alphanumeric strings (except for group variables), or the specific large
number value 1 e31 are considered as missing values.
•	A group variable (representing two or more groups, populations, AOCs, MWs) can have
alphanumeric values (e.g., MW1, MW2,...).
•	ProUCL 4.0 ignores all missing values in all mathematical operations it performs.
Missing values are therefore not treated as being part of a data set.
•	Number of Valid Samples or Number of Valid Observations represents the Total Number
of Observations less the Number of Missing Values. If there are missing values, then
number of valid samples = total number of observations.
Valid Samples = Total Number of Observations - Missing Values.
•	It is important to note, however, that if a missing value not meant (e.g., a blank, or le31)
to represent a group category is present in a "Group" variable, ProUCL 4.0 will treat that
blank value (or le31 value) as a new group. Any variable that corresponds to this missing
value will be treated as part of a new group and not with any existing groups. It is
therefore very important to check the consistency and validity of all data sets before
performing complex mathematical operations.
•	ProUCL 4.0 prints out the number of missing values (if any) associated with each
variable in the data sheet. This information is provided in several output sheets (e.g.,
summary statistics, background statistics, UCLs) generated by ProUCL 4.0.
For further clarification of labeling of missing values, the following example illustrates the terminology
used for the number of valid samples, number of unique and distinct samples on the various output sheets
generated by ProUCL 4.0.
Example: The following example illustrates the notion of Valid Samples, Unique or Distinct Samples,
and Missing Values. The data set also has nondetect values. ProUCL 4.0 computes these numbers and
prints them on the UCLs and background statistics output.
X
D
2
f
4
1
2.3
1
1.2
0
w34
0
1.0E+031
0

0
anm
0
34
1
23
1
0.5
0
0.5
0
2.3
1
2.3
1
40
-------
2.3	1
34	1
73	1
Valid Samples: Represents the total number of observations (censored and uncensored) excluding the
missing values. If a data set has no missing value, then the total number of data points equals number of
valid samples.
Missing Values: All values not representing a real numerical number are considered as missing values.
Specifically, all alphanumeric values including blanks are considered as missing values. Also unrealistic
big numbers such as 1.0e31 are also considered as missing values and are considered as not valid
observations.
Unique or Distinct Samples: The number of unique samples or number of distinct samples represents all
unique (or distinct) detected values. Number of unique or distinct values is computed for detected values
only. This number is especially useful when using bootstrap methods. As well known, it is not desirable
and advisable to use bootstrap methods, when the number of unique samples is less than 4-5.
2.12 User Graphic Display Modification
Advanced users are provided two sets of tools to modify graphics displays. A graphics tool bar is
available above the graphics display and the user can right-click on the desired object within the graphics
display, and a drop-down menu will appear. The user can select an item from the drop-down menu list by
clicking on that item. This will allow the user to make desired modifications as available for the selected
menu item. An illustration is given as follows.
2.12.1 Graphics Tool Bar
The user can change fonts, font sizes, vertical and horizontal axis's, select new colors for the various
features and text. All these actions are generally used to modify the appearance of the graphic display.
The user is cautioned that these tools can be unforgiving and may put the user in a situation where the
user cannot go back to the original display. Users are on their own in exploring the robustness of these
41
-------
tools. Therefore, less experienced users may want to stay away from using these drop-down menu graphic
tools.
2.12.2 Drop-Down Menu Graphics Tools
These tools allow the user to move the mouse to a specific graphic item like an Axis Label or a display
feature. The user then right-clicks their mouse and a drop-down menu will appear. This menu presents the
user with available options for that particular control or graphic object. There is less of chance of making
an unrecoverable error but that risk is always present. As a cautionary note, the user can always delete the
graphics window and redraw the graphical displays by repeating their operations from the datasheet and
menu options available in ProUCL 4.0. An example of a drop-down menu obtained by right-clicking the
mouse on the background area of the graphics display is given as follows.

Histo_Group.gst
. (~ x
bJi £§
A - E T,i
~

• 0 -s- i sa jg 3? m &
% u
Histograms for Arsenic, NROS_Arsenic
o
Toolbar
Data Editor
Legend Box
Gallery	~
Color	~
Edit title
Point Labels
Font...
Properties...
Statistical Studies
Arsenic	; 0
NROS_Arsenic | 1
42
-------
Chapter 3
Select Variables Screen
Select Variables Screen
•	Variables need to be selected to perform statistical analyses.
•	When the user clicks on a drop-down menu for a statistical procedure, the following
window will appear.
Select Variables
| Variables



Selected

Name
1 ID
I Count |

Name | ID | Count |
Arsenic
0
20 I


Mercury
2
30
» 1


Vanadium
4
20


Zinc
5
20



Group
6
20
i





« 1





i
Group by variable
1 ^1

OK j Cancel j
•	The Options button is available in certain menus. The use of this option leads to a
different pop-up window.
•	Multiple variables can be processed simultaneously in ProUCL 4.0. Note that this option
was not available in ProUCL 3.0. ProUCL 4.0 can generate graphs, compute UCLs, and
background statistics simultaneously for all selected variables.
•	Moreover, if the user wants to perform statistical analysis on a variable (e.g.,
contaminant) by a Group variable, click the arrow below the Group by variable to get a
drop-down list of available variables to select an appropriate group variable. For an
example, a group variable (e.g., Site Area) can have alphanumeric values such as AOC1,
43
-------
A0C2, A0C3, and Background. Thus in this example, the group variable name, Site
Area, takes four values such as AOC1, AOC2, AOC3, and Background.
•	The Group variable is particularly useful when data from two or more samples need to be
compared.
•	Any variable can be a group variable. However, for meaningful results, only a variable,
that really represents a group variable (categories) should be selected as a group variable.
•	The number of observations in the group variable and the number observations in the
selected variables (to be used in a statistical procedure) should be the same. In the
example below, the variable "Mercury" is not selected because the number of
observations for Mercury is 30; in other words mercury values have not been grouped.
The group variable and each of the selected variables have 20 data values.
Select'Variables -
| Variables
Selected


Name | ID | Count |
Name | ID
I Count |

Mercury 2 30
Aisenic 0
20

Group 6 20 i
Vanadium 4
20

» 1
Zinc 5
20

« 1




Group by variable
1 zl



Aisenic (Count = 20)


_
Mercury (Count = 30)



Vanadium (Count - 20



Zinc [ Count = 20 )



[Group (Count = 201
Cancel |


U&.
• It is recommended not to assign any missing value such as a "Blank" for the group
variable. If there is a missing value (represented by blanks, strings or 1E31) for a group
variable, ProUCL 4.0 will treat those missing values as a new group. As such, data values
corresponding to the missing Group will be assigned to a new group.
Caution: Once again, care should be taken to avoid misrepresentation and improper use of group
variables. It is recommended not to assign any missing value for the group variable.
44
-------
More on Group Option
•	The Group Option provides a powerful tool to perform various statistical tests and
methods (including graphical displays) separately for each of the group (samples from
different populations) that may be present in a data set. For an example, the same data set
may consist of samples from the various groups (populations). The graphical displays
(e.g., box plots, Q-Q plots) and statistics of interest can be computed separately for each
group by using this option.
•	In order to use this option, at least one group variable (with alphanumeric values) should
be included in the data set. The various values of the group variable represent different
group categories that may be present in the data set. This can be seen in the example.wst
data set used earlier in Chapter 2.
•	At this time, the number of values (representing group membership) in a Group variable
should equal to the number of values in the variable (e.g., Arsenic) of interest that needs
to be partitioned into various groups (e.g., monitoring wells).
•	Typically, the data for the various groups (categorized by the group variable) represent
data from the various site areas (e.g., background, AOC1, AOC2,...), or from monitoring
wells (e.g., MW1, MW2, ...).
3.1.1 Graphs by Groups
•	Individual or multiple graphs (Q-Q plots, box plots, and histograms) can be displayed on
a graph by selecting the "Graphs by Groups" option.
•	Individual graph for each group (specified by the selected group variable) is produced by
selecting the "Individual Graph" option.
•	Multiple graphs (e.g., side-by-sidc box plots, multiple Q-Q plots on the same graph) are
produced by selecting the "Group Graph" option for a variable categorized by a group
variable. Using this "Group Graph" option, multiple graphs can be displayed for all sub-
groups included in the Group variable. This option is useful when data to be compared
are given in the same column and are classified by the group variable.
•	Multiple graphs (e.g., side-by-sidc box plots, multiple Q-Q plots) for selected variables
are produced by selecting the "Group Graph" option. Using the "Group Graph" option,
multiple graphs can be displayed for all selected variables. This option is useful when
data (e.g., lead) to be compared are given in different columns, perhaps representing
different populations.
Note: It should be noted that it is the users' responsibility to provide adequate amount of detected data to
perform the group operations. For an example, if the user desires to produce a graphical Q-Q plot (using
only detected data) with regression lines displayed, then there should be at least two detected points (to
compute slope, intercept, sd) in the data set. Similarly if the graphs are desired for each of the group
specified by the group ID variable, there should be at least two detected observations in each group
specified by the group variable. ProUCL 4.0 generates a warning message (in orange color) in the lower
panel of the ProUCL 4.0 screen. Specifically, the user should make sure that a variable with nondetects
45
-------
and categorized by a group variable should have enough detected data in each group to perform the
various methods (e.g., GOF tests, Q-Qplots with regression lines) as incorporated in ProUCL 4.0.
As mentioned before, the analyses of data categorized by a group ID variable such as:
1) Surface vs. Subsurface; 2) AOC1 vs. AOC2; 3) Site vs. Background; and4) Upgradient vs.
Downgradient monitoring wells are quite common in many environmental applications.
The usefulness of the group option is illustrated throughout the User Guide using various methods as
incorporated in ProUCL 4.0.
46
-------
Chapter 4
Summary Statistics
This option is used to compute general summary statistics for all variables in the data file. Summary
statistics can be generated for full data sets without nondetcct observations, and for data sets with
nondetect observations. Two Menu options: Full and With NDs are available.
•	Full - This option computes summary statistics for all variables in a data set without any
nondetect values.
•	With NDs — This option computes simple summary statistics for all variables in a data set
that have nondetect (ND) observations. For variables with ND observations, only simple
summary statistics are computed based upon detected observations only.
o For this option, no attempt is made to compute estimates of population parameters
(e.g., mean, sd, SE) using parametric (e.g., MLE) or nonparametric (e.g., KM,
bootstrap) estimation methods. Those statistics are generated in other estimation
modules (e.g., Background and UCL) of ProUCL 4.0.
Each menu option (Full and With NDs) has two sub-menu options:
•	Raw Statistics
•	Log-Transformed
•	In ProUCL, log-transformation means natural logarithm (In)
•	When computing summary statistics for raw data, a message will be displayed for each
variable that contains non-numeric values.
•	The Summary Statistics option computes log-transformed data only if all of the data
values for the selected variable(s) are positive real numbers. A message will be displayed
if non-numeric characters, zero, or negative values are found in the column
corresponding to the selected variable.
4.1 Summary Statistics with Full Data Sets
1. Click Summary Statistics ~ Full
ProUG. 4.0^ [Worksheet.wst]
Summary Statistics
~S File Edit Configure !
ROSEst. NDs C-raohs Outlier Tests Goodnepothefis7e
-------
3. The Select Variables Screen (sec Chapter 3) will appear.
• Select one or more variables from the Select Variables screen.
If summary statistics arc to be computed by a Group variable, then select a group variable
by clicking the arrow below the Group by variable button. This will result in drop-down
list of available variables, and select the proper group variable.
Click on the OK button to continue or on the Cancel button to cancel the Summary
Statistic option.
Raw Statistics
Full Raw Stats.ost
From File: D'\example.mi
Sumraaiy Statistics foi Raw FuD D ata S ets
Variable
Aisenic (subsurface).
Aisenc (surface]
Vanadium (subsurface);
Vanadium (surlacejj
Zinc (siiisurfacejl
Zinc (surface);
NumObs i Minimum 1 Maximum
10
TCP
To""
10
IF
W
45
T3
To 3
13 8 ~
66 3
"ecu"
imuml Me^
:~q ~ ' r'r
Mean
556"
"V
6185
68
3 2
"24T" T 16T
""W ~j 19 53
Tffii"" 114 4
""266 rTf6'r
MJ
Median Variance
5 45 [
~5725T
"1685" T
0 449
25
SD
"""0 67"
MAD/0.67? Skewness' Kurtosis I
4	
13 86
1.581
3~723"
0 667 | 0 344 f -0 047
T075~1 """0969 ] 0295"
'""" 0 495
1 26
17 23
"95 45"
90
3372
2271
5 807
4766
3680
60 66
2 669
"S72T"
1 46
"0 745"
2111
2 092
"07896
-1 288
TC65"
CV_ j
0121 1
0256T
""0 227"!"
"0297"""
"0"41G~i"
052"j"
~ I
Log-Transformed Statistics
SI f ull_LqgJit.ats.QSt'







.01
©

1 I 1 1 IT








!
Z3
j Fiom File: D:\examp!e.mt
- -
Summary Statistics for Log-TransformedFuflData Sets
	
	
	 -
' i
	i

Variable NumObs
Minimum
, Maximum
- Mean , Median j Variance
1 sd"
MAD/U B/T Skewness
\ Kuitosis !
cv i

Aisenic (subsurface); 10
1 504 I
1 917
""" 1 709 j 1 "695 """"00144"
" 012
0123
0 0692 - 0 1 4 i
00702

Arsenic (surface) 10
1 459" j
j 2 219
~1 795 j 1 745 I 0 0591"
0 243 |
"0176
0 527
""-0153" |
" 0135 "I

Vanadium (subsurface); 10 j
" " 2 332
j " "319"
2 774 ]" " 2"824"| 0 0538
0 232
1 00735
""¦0 394""
1.351 j
"00836!

Vanadium (surface) 1G
2 625 |
[ 3 428'"
2 938 ; 2 846 | 0 0699
I ""0 264"
0157
rfgT"
0504"""
~a6S"
i_
Zinc (subsurface), 10
r~Ti94"j
5 22" :
4 666"i 4 558' j CIST
j 0399
0"425"
j 0452"
—-1 394 ";
0 08S5
L
Zmc (surface) 10 !
"4387"
5 583 :
~~""4"673 "j 4"5 j" CM62"
j 0 403
| OTG 3
j 1 734™
2"iw;
0 0862-
1 1
liil


I ; i

.

i
ir1
4. The resulting Summary Statistics screen as shown above can be saved as an Excel file. Click
Save from the file menu.
48
-------
5. On the output screen, the following summary statistics are displayed for each selected variable in
the data file.
NumObs = Number of Observations
Minimum = Minimum value
Maximum = Maximum value
Mean = Sample average value
Median = Median value
Variance = Classical sample variance
SD = Classical sample standard deviation
MAD = Median absolute deviation
MAD/0.675 = Robust estimate of variability, population standard deviation, a
Skewness = Skewness statistic
Kurtosis = Kurtosis statistic
CV = Coefficient of Variation
The details of these summary statistics arc described in an EPA (2006) guidance document and also in the
ProUCL Technical Guide (A. Singh and A.K. Singh (EPA, 2007).
4.2 Summary Statistics with NDs
1. Click Summary Statistics ~ With NDs
ProUCL 4.01 ptforkSheet.wst]
~g rJe Ecu Configure
&\<&\ elBlmNMFu°
Navigation Panel
Summary Statistics |
WithNfe ~.
SOS Est. NDs C-riphs Oudier itsts Goocintss-of-Fit Hypothesis Testing Background UCL VVinrfo'.< Help
Name
KcW^CSUSOCS
PH
3 I *
5
6
7 | 8
9
Log-Transformed
	1 1
j |
|


|

1 1
0 WorkSheet.vvst
2.-	Select either Log-Transformed or Raw Statistics option.
3.	The Select Variables Screen (Chapter 3) will appear.
•	Select variable(s) from the list of variables.
•	Only those variables that have nondetect values will be shown.
•	If summary statistics are to be computed by a Group variable, then select a group variable
by clicking the arrow below the Group by variable button. This will result in a drop-
down list of available variables; then select the proper group variable.
•	Click on the OK button to continue or on the Cancel button to cancel the summary
statistics operations.
Note: It should be noted that in ProUCL 4.0, "Summary Statistics "for a data set with nondetect
observations represent simple summary statistics based upon the data set without using nondetect
observations. All other parametric and nonparametric statistics and estimates of population mean,
49
-------
variance, percentiles (e.g., MLEs, KM, and ROS estimates) for variables with nondetect observations are
given in other estimation menu options such as background statistics and UCL. The simple "Summary
Statistics/With NDs " option only provides simple statistics (e.g., % NDs, max ND, Min ND, Mean of
detected values) based upon detected values. These statistics (e.g., sd of log-transformed detected values)
may help a user to determine the degree of skewness (e.g., mild, moderate, high) of the data set consisting
of detected values. These statistics may also help the user to choose the most appropriate method (e.g.,
KM (BCA) UCL or KM (t) UCL) to compute UCLs, UPLs, and other limits.
Raw Statistics - Data Set with NDs
Fiom File: D:\example. wst
Summary Statistics fra Raw Data Sets with NDs
RanStatistics using Detected Obsavdbons
Variable
NumObs , NumNDs • X NDs : Maximum, Minimum ,
AiseneJ
Metctiry |
17
~25~
[15 OIK
T1S675T

5
"0 04"
92
Mean
""B7T2ET
099
0.312
Median
58
018
SD
"TiT
0 315
MAD/0.67! Skewness
0 S3 ]
CV
J
0.183
1 783 ! 0188 i
T202' T01
4
Log-Transformed Statistics - Data Set with NDs
H WNDJj>&J5tats.ost
~1©
I

I




|


A
From File: DAexampl&wsl


























Summary S tatistics (01 Log-transformed Data Sets wih NDs






















Statistics using Detected Log-transformed Observations


Variable
NumObs
NumNDs
X NDs
Maximum
Minimum
Mean
Median
SD
MAD/0 67! Skewness
CV

Arsenic
17
3
15 00*
1 609
2219
1 798
1 758
0168
1458
0.0935
0106

Mercury
25
5
16 672
¦3 219
¦0 0101
•1 66
¦1 715
1032
0 238
¦0622
1 4

"T"
I
l



:


liU !
I


l !
ir
•	The Summary Statistics screen shown above can be saved as an Excel file. Click the save
from the file menu.
•	On the results screen, the following summary statistics are displayed for each selected
variable from the data file.
50
-------
Num Obs = Number of Observations
NumNDs = Number of Nondetects
% NDs = Percentage of Nondetect observations
Minimum = Minimum value
Maximum = Maximum value
Mean = Sample average value
Median = Median value
SD = Classical sample standard deviation
MAD = Median Absolute Deviation
MAD/0.675 = Robust estimate of variability (standard deviation)
CV = Coefficient of Variation
-------
Chapter 5
Estimating Nondetects Using ROS Methods
Regression on order statistics (ROS) can be used to extrapolate nondetcct observations using a normal,
lognormal, or gamma model. ProUCL 4.0 has three ROS estimation methods that can be used to estimate
or extrapolate nondetect observations. The use of this option generates additional columns consisting of
all extrapolated nondetects and detected observations. These columns are appended to the existing open
spreadsheet. The user should save the updated file if they want to use the generated data for their other
application(s).
1.
Click ROS Est. NDs ~ Gamma ROS
ProUCL 4.0 - [O:\Narain\ProU(CL-Qata\Diit3\Oahu.w5t]
File Edit Configure Summary Statistics	Graphs Outlier Tests Goodness-of-Fit Hypothesis Testing Background UCL Wndow Help
Normal ROS \
salel elBlml ml
10
2. The Select Variables Screen (Chapter 3) will appear.
• Select one or more variable(s) from the Select Variables screen.
Select Variables
Variables
Selected
Accrue
Group by Variable
I	3
OK | Cancel |
• Click on the OK button to continue or on the Cancel button to cancel the option.
53
-------
Output Screen for ROS Est. NDs (Gamma) Option

0
1
2

Arsenic
D_Arsenic
GROS_Arsenic
1
1
0
1 0315131074651300]
2
1
o
1 199323^30983740 j
3
1.7

.... _....
4
1
1 33761048919618,'
5
T - °
i.46028213348334j
6
2} 0
0.957865781528449;
7
3.2
1
3 2!
8
2
0
1 086003591750451
9
2
0
1.15098252772635]
10
2.8
1
2 8!
11
2
0 1.28363461914326]
12
2
0
1 368290930400230!
13
2
0
1.447218922^9559:
14
2
0
1.52178645574469'
15
2
0
1.59289577776005!
16
0.7
1
07)
17
0.9
1
0.9j
18
0.5
1
0.5;
19
0.5

¦ 0 5!
20
0.9
0| 1 226870494768350
21
05
<
0 5;
22
0.7
1
0.7
23
0.6! 1
i
06
24
1 5j 1
1 5
Note: Columns with similar naming convention are generated for each selected variable and distribution
using this ROS option.
54
-------
Chapter 6
Graphical Methods (Graph)
Three commonly used graphical displays are available under the Graphs option:
o Box Plot
o Histogram
o Multi-QQ
•	The box plots and multiple Q-Q plots can be used for Full data sets without nondetects
and also for data sets with nondctect values.
•	Three options are available to draw Q-Q plots with nondetect (ND) observations.
Specifically, Q-Q plots are displayed only for detected values, or with NDs replaced by V%
DL values, or with NDs replaced by the respective detection limits. The statistics
displayed on a Q-Q plot (mean, sd, slope, intercept) are computed according to the
method used. The NDs are displayed using the smaller font and in red color.
•	ProUCL 4.0 can display box plots for data sets with NDs. This kind of graph may not be
very useful when many NDs may be present in a data set.
o A few choices are available to construct box plots for data sets with NDs. For an
example, some texts (e.g., Helsel) display box plots only for the detected
observations. Specifically, all nondetects below the largest detection limit (DL), and
portion of the box plot (if any) below the largest DL are not shown on the box plot. A
horizontal line is displayed at the largest detection limit level,
o ProUCL 4.0 constructs a box plot using all detected and nondetect (using DL values)
values. ProUCL 4.0 shows the full box plot. However, a horizontal line is displayed
at the largest detection limit.
•	When multiple variables are selected, one can choose to: 1) produce a multiple graphs on
the same display by choosing the Graph by group variable option, or 2) produce separate
graphs for each selected variable.
•	The Graph by group variable option produces side-by-side box plots, or multiple Q-Q
plots, or histograms for the groups of the selected variable representing samples obtained
from multiple populations (groups). These multiple graphs arc particularly useful to
perform two (background versus site) or more sample visual comparisons.
o Additionally, Box Plot has an optional feature, which can be used to draw lines at
statistical (e.g., upper limits of background data set) limits computed from one
population on the box plot obtained using the data from another population (a site
area of concern). This type of box plot represents a useful visual comparison of site
data with background threshold values (background upper limits),
o Up to four (4) statistics can be added (drawn) on a box plot. If the user inputs a value
in the value column, the check box in that row will get activated. For example, the
55
-------
user may want to draw horizontal lines at 80th percentile, 90th percentile, 95th
percentile, or a 95% UPL) on a box plot.
6.1 Box Plot
l. Click Graphs ~ Box Plot
i
?| ProUCL 4.0 - [Worksheet.wstj
.00 File edit Configure Summary Statistics ROSEst. NDs

<&\ BlBlm
Navigation Panel

Name
0 WorkSheet.wst
Graphsv
Box Plot
Outlier Tests Goodness-of-Fit Hypothesis Testing Background UCL Window Help
J Full {w/b NDs)
histogram | With NDs
Mula-QQ ~TTr"
The Select Variables Screen (Chapter 3) will appear.
•	Select one or more variable(s) from the Select Variables screen.
•	If graphs have to be produced by using a Group variable, then select a group variable by
clicking the arrow below the Group by variable button. This will result in a drop-down
list of available variables. The user should select an appropriate variable representing a
group variable.
•	When the Group by variable button is clicked, the following window is shown.
-Graph by Groups	
<• Individual Graphs
Group Graphs
Label
Value
1-	r	|
2.	r	r
3.	r	r
4.	r	r
-Graphical Display Options—
<* Color Gradient
For Export (6W Printers
OK
Cancel
56
-------
•	The default option for Graph by Groups is Individual Graphs. This option will
produce one graph for each selected variable. If you want to put all the selected variables
into a single graph, then select the Group Graphs option. This Group Graphs option is
used when multiple graphs categorized by a Group variable have to be produced on the
same graph.
•	The default option for Graphical Display Options is Color Gradient. If you want to use
and import graphs in black and white into a document or report, then check the radio
button next to For Export (BW Printers).
•	Click on the OK button to continue or on the Cancel button to cancel the Box Plot (or
other selected graphical) option.
Box Plot Output Screen (Single Graph)
Selected options: Label (Background UPL), Value (103.85), Individual Graphs, and Color Gradient.
57
-------
Box Plot Output Screen (Group Graphs)
Selected options: Group Graphs and Color Gradient.
t»oo

Box Plots for X X (2), X <3)





»»»






Wan



» 00
s
Q n oo
| €OOD
s
SIB



SOD
2MB

mmm

"?on ¦

|

000
*<1>
*ca

6.2 Histogram
1. Click Graphs ~ Histogram
P? ProUCL 4.0 [Worksheet.wst]
¦S =Je Ed:t Configure Sumnary Statist** ROS Est. NDs
J Outlier Tests Goodness-of-Fit Hypothesis Testing Background
LCL Window Help
o o aim n
Box Plot ~
I	
Navigation Panel |

0
1
Multi-QQ ~
I 3
4 5 6 7
8
9
flame

I





:J Worksheet vvst
id
I





2. The Select Variables Screen (Chapter 3) will appear.
•	Select one or more variable(s) from the Select Variables screen.
•	If graphs have to be produced by using a Group variable, then select a group variable by
clicking the arrow below the Group by variable button. This will result in a drop-down
list of available variables. The user should select and click on an appropriate variable
representing a group variable.
•	When that option button is clicked, the following window will be shown.
58
-------
3? Graphs (Histograms)
a
Graph by Groups
C Individual Graphs
• Group Graphs
Graphical Display Options
(* Color Gradient
For Export (BW Printers)
OK
Cancel
The default selection for Graph by Groups is Individual Graphs. This option produces
a histogram (or other graphs) separately for each selected variable. If multiple graphs or
graphs by groups are desired, then check the radio button next to Group Graphs.
The default option for Graphical Display Options is Color Gradient. If you want to use
and import graphs in black and white into a document or report, then check the radio
button next to For Export (BW Printers).
Click on the OK button to continue or on the Cancel button to cancel the Histogram (or
other selected graphical) option.
Histogram Output Screen
Selected options: Group Graphs and Color Gradient.
59
-------
6.3	Multi-QQ
6.3.1 Multi-QQ (Full)
1.	Click Graphs ~ Multi-QQ
2.	Multi-QQ can be obtained for data sets with (With NDs) and without NDs (Full).
• When that option button is clicked, the following window will be shown.
3. Select either Full or With NDs.
4. The Select Variables Screen (Chapter 3) will appear.
•	Select one or more variable(s) from the Select Variables screen.
•	If graphs have to be produced by using a Group variable, then select a group variable by
clicking the arrow below the Group by variable button. This will result in a drop-down
list of available variables. The user should select and click on an appropriate variable
representing a group variable.
•	When the Group by variable option button is clicked, the following window will appear.
Wultij QQ> Options [_ ~ X
Display Regression Lines
(* Do Not Display
C Display Regression Lines

Graphical Display Options
(• Color Gradient
C" For Export [BW Printers) i


OK Cancel

J
60
-------
The default option for Display Regression Lines is Do Not Display. If you want to see
regression lines on graphs, then check the radio button next to Display Regression
Lines.
•	The default option for Graphical Display Options is Color Gradient. If you want to see
the graphs in black and white, then check the radio button next to For Export (BW
Printers).
•	Click on the OK button to continue or on the Cancel button to cancel the selected Multi-
QQ option.
Note: For Multi-QQ plot option, for both "Full" as well as for data sets 'With NDs, " the values along
the horizontal axis represent quantiles of a standardized normal distribution (Normal distribution with
mean 0 and standard deviation I). Quantiles for other distributions (e.g., Gamma distribution) are used
when using Goodness-of-Fit (GOF) test option.
Output Screen for Multi-QQ (Full)
Selected options: Group Graph, Do Not Display Regression Lines, and Color Gradient.

Multiple Q-Q Plots
Arvrnk (subaurlkce)

for Arsenic (subsurface), Arsenic {surface}
N-10
Mwn -5.5800
9.90



Sd-08703
9.60

Slope-0 7023
930

rtacepl • 5 5800
900

ConeWon,R - 09651

Arsenic («t«1 M«)
870

N-10
840

Mbwi'6.1850


Sd-15811
810

Siope-15830
«! 780

Hercetf - 6 1850
O

CorreMon, R - 0 9474
'€ 750

£ 7.20
m


O 690
o

,
¦g 660


| 630
9

O 6 00
* a

S.70


5.40
• ' 4 *

5.10
*

480


450 4


430


3S0
-1 0 1
Theoretical Quantilei (Standard Normal)

M Af wnc (5Uwur(«c«)
|
full
5
6
7
8
-— ~	
9
Name

	1
With NDs !¦





iSWorisSheet wst
1 1
|	|




61
-------
2.	Select With NDs option by clicking on it.
3.	The Select Variables Screen (Chapter 3) will appear.
•	Select one or more variablc(s) from the Select Variables screen.
•	If graphs have to be produced by using a Group variable, then select a group variable by
clicking the arrow below the Group by variable button. This will result in a drop-down
list of available variables. The user should select and click on an appropriate variable
representing a group variable.
•	When the Group by variable option button is clicked, the following screen appears.
50
Multi-QQ 0 ptions
Qi®:
"Display Non-Detects 	
Do not Display Non-Detects
(* Display Non-Detect Values
C Display 1/2 Non-Detect Values
¦ D isplay R egression Lines 	
(• Do Not Display
C Display Regiession Lines
-Graphical Display Options —
(* Color Gradient
C For Export (BW Printers]
OK
Cancel

•	The default option for Display Regression Lines is Do Not Display. If you want to see
regression lines, then check the radio button next to Display Regression Lines.
•	The default option for Display Nondetects is Display Nondetect Values.
o Do not Display Nondetects: Selection of this option excludes the NDs detects and
plots only detected values on the associated Q-Q plot. The statistics are computed
using only detected data,
o Display Nondetect Values: Selection of this option treats detection limits as detected
values and plots those detection limits and detected values on the Q-Q plot. The
statistics arc computed accordingly.
62
-------
o Display % Nondetect Values: Selection of this option replaces the detection limits
with their half values, and plots half detection limits and detected values on the Q-Q
plot. The statistics are computed accordingly.
•	The default option for Graphical Display Options is Color Gradient. If you want to see
the graphs in black and white, then check the radio button next to For Export (BW
Printers).
•	Click on the OK button to continue or on the Cancel button to cancel the Multi-QQ
option.
Output Screen for Multi-QQ (without NDs)
Options: Do Not Display Regression Lines, Do not display Nondetects, and Color Gradient.
««
s*

Q-Q Plot with without NDs for Mercury
•

«m>-jo
s t w. : —.«
tkmtm it Ml ¦ A
•Nm.OjnM
a ?®:





* >•*
i
| ese





§
-a
O

•



a*





eae

,



or?
• *
, . - - "



-J

J e «
Theoretical Quanta®* {Standa/tJ Normal)

I







63
-------
Output Screen for Multi-QQ (with NDs)
Options: Do not Display Regression Line, Display Nondetcct Values, and Color Gradient.

Q-Q Plot with NDs for Mercury



Merciay
050
060

*


Mwn - 0.3055
Sd- 0-2937
Stop#-02717
Hercept - 0 3055
Correlation, R • 0.9001
070
§ 060
33
m
2
N 050
¦Q
o
TJ
£ 0 40
-5
o
O30
-
M



020





010
ODD
a - * ' *




-2
¦ Mercixy
-1 0
Theoretical Quantilea (Standard Normal)
1

2

Note: The legend size of nondetect values is smaller than that of the detected values and is shown in red.
The legend size is made smaller for BW printers.
Output Screen for Multi-QQ (with NDs)
Selected options: Do not Display Regression Lines, Display Vi Nondetect Values, and Color Gradient.
1J30
Q-Q Plot with NDs at 1/2 for Mercury
N - 30
OK)
•
Sd-02962
Slope -02656
080
A
Corre«on,R. 06721
0.70


|~
t
• 050
a
O
•o
£ 0 40
12
i

0.30
* * *

020


0.10
- - - * * M '

OjOO


•2
-1 0 1
Theoretical Quantilea (Standard Normal)
2
¦ Mercwy


Note: The legend size of nondetect values is smaller than that of the detected values and is shown in red.
64
-------
Chapter 7
Simple Classical Outlier Tests
Outliers are inevitable in data sets originating from environmental applications. There are many graphical
(Q-Q plots, Box plots), classical (Dixon, Rosner, Welch), and robust methods (biweight, Huber, PROP)
available to identify outliers. It is well known that the classical outlier tests (e.g., Dixon test, Rosen test,
EPA, 2006) suffer from masking (e.g., extreme outliers may mask intermediate outliers) effects. The use
of robust outlier identification procedures is recommended to identify multiple outliers, especially when
dealing with multivariate (having multiple contaminants) data sets. However, those preferred and more
effective robust outlier identification methods are beyond the scope of ProUCL 4.0. Several robust outlier
identification methods (e.g., based upon biweight, Huber, and PROP influence functions) are available in
the Scout software package (EPA, 1999).
The two simple classical outlier tests (often cited in environmental literature): Dixon and Rosner tests are
available in ProUCL 4.0. These tests can be used on data sets with and without nondctect observations.
These tests also require the assumption of normality of the data set without the outliers. It should be noted
that in environmental applications, one of the objectives is to identify high outlying observations that
might be present in the right tail of a data distribution as those observations often represent contaminated
locations of a polluted site. Therefore, for data sets with nondetccts, two options are available in ProUCL
4.0 to deal with data sets with outliers. These options are: 1) exclude nondetccts and 2) replace NDs by
DL/2 values. These options are used only to identify outliers and not to compute any estimates and limits
used in decision-making process.
It is suggested that these two classical outlier identification procedures be supplemented with graphical
displays such as Q-Q plots, Box and Whisker plot (called box plot), and 1QR (= upper quartile, Q3 -
lower quartile, Ql). These graphical displays are available in ProUCL 4.0. Box plots with whiskers are
often used to identify outliers (e.g., EPA, 2006). Typically, a box plot gives a good indication of extreme
(outliers) observations that may present in a data set. The statistics (lower quartile, median, upper quartile,
and IQR) used in the construction of a box plot do not get distorted by outliers. On a box plot,
observations beyond the two whiskers may be considered as candidates for potential outliers.
Q-Q plots are also quite useful to identify outliers in a data set. For an example, on a normal Q-Q plot,
observations that are well separated from the bulk (central part) of the data typically represent potential
outliers needing further investigation. Also, significant and obvious jumps and breaks in a Q-Q plot (for
any distribution) are indications of the presence of more than one population. Data sets exhibiting such
behavior of Q-Q plots should be partitioned out in component sub-populations before estimating an EPC
Term or a background threshold value (BTV). It is strongly recommended that both graphical and formal
outlier identification tests should be used on the same data set to identify potential outliers that may be
present in a data set under study. More details about the construction of graphical displays and outliers
test can be found in the Technical Guide for ProUCL 4.0.
Dixon's Test (Extreme Value Test)
• This test is used to identify statistical outliers when the sample size is less than or equal
to 25.
65
-------
•	This test can be used to identify outliers or extreme values in both the left tail (Case 1)
and the right tail (Case 2) of a data distribution. In environmental data sets, extremes
found in the right tail may represent potentially contaminated site areas needing further
investigation or remediation. The extremes in the left tail may represent ND values.
•	This test assumes that the data without the suspected outlier are normally distributed;
therefore, it is necessary to perform a test for normality on the data without the suspected
outlier before applying this test.
•	This test may suffer from masking in the presence of multiple outliers. This means that if
more than one outlier is suspected, this test may fail to identify all of the outliers.
Therefore, if you decide to use the Dixon's test for multiple outliers, apply the test to the
least extreme value first. Alternatively, use more effective robust outlier identification
procedures. Those outlier identification procedures will be available in Scout (EPA,
1999) software.
Rosner's Test
•	This test can be used to identify and detect up to 10 outliers in data sets of sizes 25 and
higher.
•	This test also assumes that the data are normally distributed; therefore, it is necessary to
perform a test for normality before applying this test.
Depending upon the selected variable(s) and the number of observations associated with them, either the
Dixon's test or the Rosner's test will be performed.
NOTE: Throughout this User Guide, and in ProUCL 4.0, it is assumed that the user is dealing with a
single population. If multiple populations are present in a data set, it is recommended to separate them
out using appropriate population partitioning methods and techniques. Appropriate tests and statistics
(e.g., goodness-of-fit tests, 95% UCLs, 95% UPLs) should be computed separately for each of the
identified populations. Also, outliers if any should be identified and thoroughly investigated. The presence
of outliers distorts all statistics including the all of the upper limits (UCLs, UPLs, upper percentiles). The
use of distorted statistics and limits may lead to incorrect conclusions having potential adverse effects on
the human health and the environment. Decisions about the disposition of outliers: inclusion or exclusion
in the data set to be used to compute the UCLs, UPLs, and other statistics should be made by all parties
involved. Statistical methods supplemented with graphical displays (e.g., Q-Q plot and box plots) can
only help identify statistical outliers that may be present in a data set. The project team and experts
familiar with the site should interpret and assign physical meaning and significance to those identified
outliers. The entire project team should be involved in taking decisions about the appropriate disposition
(include or not include) of outliers.
66
-------
7.1 Outlier Test for Full Data Set
1. Click Outlier Tests ~ Full ~Compute
Q WorfcSheet.wst
2. The Select Variables Screen (Chapter 3) will appear.
•	Select one or more variablc(s) from the Select Variables screen.
•	If graphs have to be produced by using a Group variable, then select a group variable by
clicking the arrow below the Group by variable button. This will result in a drop-down
list of available variables. The user should select and click on an appropriate variable
representing a group variable.
•	If at least one of the selected variables has 25 or more observations, then click the option
button for the Rosner Test.
m
Select Number Outliers (Rosner Test).
Number of Outliers for Rosner Test |1
Applicable to Rosnsr's test (N ;= 25) Only
Dixon's test (N < 25) only testa for one outlier.
OK
Cancel
A
•	The default option for the number of suspected outliers is 1. In order to use this test, the
user has to obtain an initial guess about the number of outliers that may be present in the
data set. This can be done by using graphical displays such as a Q-Q plot. On this
graphical Q-Q plot, higher observations that are well separated from the rest of the data
may be considered as potential or suspected outliers.
•	Click on the OK button to continue or on the Cancel button to cancel the Outlier Tests.
7.2 Outlier Test for Data Set with NDs
Typically, in environmental applications, one is interested in identifying high outliers (perhaps
representing contaminated parts of a site area, hot spots) that might be present in the right tail of the data
67
-------
distribution. Therefore, one may want to use the same outlier identification procedures (e.g., Dixon test,
Rosner test) that are used on full-uncensored data sets (without any NDs). The processes to perform such
tests using ProUCL 4.0 arc described as follows.
1. Click Outlier Tests ~ With NDs ~ Exclude NDs
~9 File Edit Configure Summary Statistics ROSEst. NDs Graphs Goodness-of-Fit Hypothesis Testing Background UCL Window Help
CI 
-------
Output Screen for Rosner's Outlier Test
There are many observations in this data set that may represent potential outliers. The potential outliers
can also be seen in the graphical displays associated with this data set.
• Selected Options: Number of Suspected Outliers for the Rosner Test = 4
. Outliei T ests for Selected Variables
User Selected Options!
From File ;D \Narain\ProUCl. 4 OVData\Aroclot 1254 mt
Full Precision (0FF
Test for Suspected Outliers with Dixon test 11
Test for Suspected Outliers fot Rosner test i 2
Rosner* Outliei Test for Aroclor1254
Number of data 44
Number of suspected outliers 2
Mean)
1 15311
j	Potential]
sdi	outlier l
3316 59|	19000 OOj
I	'
Test! Critical!
Critical]
value value (5£); value (1£)j
~527f 308*
2 112565; 199860 830000; 359!
For 5% significance level there are 2 Potential Outliers
Therefore, Potential Statistical Outliers are
19000 0078300 00"'
For 12 Significance Level there are 2 Potential Outliers
Therefore, Potential Statistical Outliers are
1^00 0'a8300 00"
3 071
3 43j
3 411
	L__
Box plot of the Aroclorl254 Data Set
Box Plot for Aroclorl 254
r.ooo :<•
1EOOX
mVjO-x,
2 1X£0X"
o
Q
U&ji-uti Hait-DeifciVa'tevr
A/oclor 12£-s
-------
Q-Q Plot of the Aroclorl254 Data Set
Q-Q Plot with NDs for Aroclor1254
20000. OC
18000.00
16000.00
w 14000.00
c
^ 12000 30
>

-g 6000.00
o
4000.00
2000 00
0.00
Arodor1254
Totel Number of 0a« • 63
Number of Non-Detecti» 9
Nwrbef of Oetects • 44
Mean -1271 7646
Sa - 3105 6855
Slope- 2117 6299
Intercept • 1271.7646
Conelaton. R« 0.6698
i M » t MftMM

Theoretical Quantiles (Standard Normal)
70
-------
Chapter 8
Goodness-of-Fit (G.O.F.) Tests
Several goodness-of-fit (G.O.F.) tests for full data sets (without nondetccts) and for data sets with NDs
are available in ProUCL 4.0. Details of those tests are described in the ProUCL 4.0 Technical Guide. In
this User Guide, those tests and available options have been illustrated using screen shots generated by
ProUCL 4.0.
Two choices are available for Goodncss-of-Fit menu: Full and With NDs.
• Full
This option is used to analyze full data sets without any nondetect observations.
Throughout this User Guide and in ProUCL 4.0, "Full" represents data sets without
nondetect observations.
This option tests for normal, gamma, or lognormal distribution of the variable(s)
selected using the Select Variables option.
G.O.F. Statistics: This option is available for both full data sets and for data sets with
NDs. This option simply generates output log of GOF test statistics and derived
conclusions about the data distributions of all selected variables. This option is also
available for variables categorized by a group variable.
ProUCL 4.0 - [Worksheet.wst]
Eg File Edit Configure Summary Statistics ROSEst. M>s Graphs Outfier Teste
e>l«al elBlml.Bl
Navigation Panel
Name
0 WorkSheet.wst
Arsenic
J	I
1
D Arsentc
1
Goodness-of-Fit I
With NDs ~
Hypothesis Testing Background UCL Window Help
Gamma
Lognormal
G.O.F. Statistic
With NDs
o Analyzes data sets that have both nondetectcd and detected values,
o Six sub-menu items listed and shown below arc available for this option.
1.	Exclude NDs: tests for normal, gamma, or lognormal distribution of the selected
variable(s) using only the detected values.
2.	ROS Estimates: tests for normal, gamma, or lognormal distribution of the selected
variable(s) using the detected values and the extrapolated values for the nondetects.
o Three ROS methods for normal, lognormal, and gamma distributions are
available. This option is used to estimate or extrapolate the NDs based upon the
specified distribution.
o By using the menu item ROS Est. NDs, ProUCL 4.0 actually generates additional
column(s) of ROS estimated NDs based upon the selected distribution. This
71
-------
option should be used for variables with NDs. This is further illustrated by a
screen shot given in the following
3.	DL/2 Estimates: tests for normal, gamma, or lognormal distribution of the selected
variable(s) using the detected values and the ND values replaced by their respective
DL/2 values. This option is included for historical reasons and also for curious users.
Note: The use of fabricated data obtained using DL/2 (DL or 0 values) values is not
a recommended method. At best, the user may use the fabricated data (e.g., DL/2
option) for exploratory reasons. It is suggested that these substitution methods should
not be used for estimation and for hypotheses testing approaches.
4.	G.O.F. Statistics: As for the full data sets, this option simply generates output log of
GOF test statistics and other relevant statistics for data sets with nondetects. The
conclusions about the data distributions for all selected variables are also listed on the
generated output file (Excel-type spreadsheet). This option is also available for
variables categorized by a group variable.
f?JproUCL 4.0. - [WorkSheet.wstj
nO Fde Edit Configure Summary Statistics ROS Est. NDs Graphs Outlier Tests
Goodness-of-Fit
Hypothesis Testing Background UCL Window Help
-------
o The program computes the intercept, slope, and the correlation coefficient for the
linear pattern displayed by the Q-Q plot. A high value of the correlation coefficient
(e.g., > 0.95) is an indication of a good fit for that distribution. This high correlation
should exhibit a definite linear pattern in the Q-Q plot. Specifically, when data are
sparse and correlation is high, the use of correlation statistic to determine data
distribution is not desirable. Note that these statistics are displayed on the Q-Q plot,
o On a Q-Q plot, observations that are well separated from the bulk (central part) of the
data typically represent potential outliers needing further investigation,
o Significant and obvious jumps and breaks in a Q-Q plot (for any distribution) are
indications of the presence of more than one population. Data sets exhibiting such
behavior of Q-Q plots should be partitioned out in component sub-populations before
estimating an EPC term or a background threshold value (BTV). It is strongly
recommended that both graphical and formal goodness-of-fit tests should be used on
the same data set to determine the distribution of the data set under study.
•	Normality or Lognormality Tests: In addition to informal graphical normal and
lognormal Q-Q plots, a formal Goodness-of-Fit (GOF) test is also available to test the
normality or lognormality of the data set.
o Lilliefors Test: a test typically used for samples of size larger than 50 (> 50). When
the sample size is greater than 50, the program defaults to the Lilliefors test.
However, the Lilliefors test (generalized Kolmogorov Smirnov test) is available for
samples of all sizes. There is no applicable upper limit for sample size for the
Lilliefors test.
o Shapiro and Wilk (SW) Test: a test used for samples of size smaller than or equal to
50 (<= 50). In ProUCL 4.0, the SW test is available only for samples of size 50 or
less. It should be noted that the critical values for SW test are now available for
sample of sizes up to 2000 (Royston, 1982). These values are not as yet available in
ProUCL 4.0. This extension of SW test will be available in Scout (EPA, 1999)
software package.
o It should be noted that sometimes these two tests might lead to different conclusions.
Therefore, the user should exercise caution interpreting the results. Specifically, the
user should the pattern exhibited by the associated Q-Q plot.
•	GOF test for Gamma Distribution: In addition to the graphical gamma Q-Q plot, two
formal empirical distribution function (EDF) procedures are also available to test the
gamma distribution of a data set. These tests are the Anderson-Darling test and the
Kolmogorov-Smirnov test.
o It is noted that these two tests might lead to different conclusions. Therefore, the user
should exercise caution interpreting the results,
o These two tests may be used for samples of sizes in the range of 4-2500. Also, for
these two tests, the value of the shape parameter, k (k hat) should lie in the interval
[0.01, 100.0], Consult the ProUCL 4.0 Technical Guide (A. Singh and A.K. Singh
(EPA, 2007)) for a detailed description of gamma distribution and its parameters,
including k. Extrapolation beyond these sample sizes and values of k is not
recommended.
73
-------
•	ProUCL computes the relevant test statistic and the associated critical value, and prints
them on the associated Q-Q plot. On this Q-Q plot, the program informs the user if the
data are gamma, normally, or lognormally distributed.
•	Even though, the G.O.F. Statistics option prints out all GOF test statistics for all selected
variables, it is suggested that the user should look at the graphical Q-Q plot displays to
gain extra insight (e.g., outliers, multiple population) into the data set.
Note: It is highly recommended not to skip the use of a graphical Q-Q plot to determine the data
distribution as a Q-Q plot also provides a useful information about the presence of multiple populations
or outliers.
8.1 ROS Estimated (Est.) NDs - Saving Extrapolated NDs
•	As mentioned before, for a variable with NDs, ProUCL 4.0 can generate additional
column(s) consisting of detected data and the estimated (extrapolated) values of NDs
using the ROS method assuming a normal, lognormal, or a gamma distribution.
•	The user may want to use the resulting full data set (detected and estimated NDs) thus
obtained to compute the statistics of interest such as a bootstrap BCA UCL95 or a gamma
95% upper percentile.
•	This option of saving estimated NDs is provided only for experienced users and
researchers. It is expected that the user knows and understands the theory behind these
methods. Therefore, it is suggested that this option be used with care. For an example,
often, the use of a ROS method yields infeasible (e.g., negative, exceeding the DLs)
estimates of NDs, and therefore, the associated estimates of EPC terms and of BTVs may
be biased and not reliable. This is especially true when the data set contains potential
outlier(s).

Q0 File Edit Configure Summary Statistics ROS Est. NDs Graphs Outlier Tests Goodness-of-Fit

@ WorkSheet.wst
Navigation Panel

Name
11
10
Arsenic
1
f
1.7
1
D Arsenic
NROS Arsenic
0 01233SS71293B579
0^3125541541462!
r7f
1
"'i
2
"12
2
2
Jl
28
2
0j0.909911763702151;
"Oj "T 24633976124866j'
0 f03§iB72i390S9666r
' 0^j2^0645T203256f
0 0.5422965289*7428
2.S
010 773119303054192!
74
-------
8.2 Goodness-of-Fit Tests with Full Data Sets
1. Click Goodncss-of-Fit ~ Full
2. Select the distribution to be tested: Normal, Lognormal, or Gamma
•	To test your variable for normality, click on Normal from the drop-down menu list.
•	To test variable for lognormality, click on Lognormal from the drop-down menu list.
•	To test your variable for gamma distribution, click on Gamma from the drop-down menu
list.
8.2.1 GOF Tests for Normal and Lognormal Distribution
1. Click Goodncss-of-Fit ~ Full ~ Normal or Lognormal
Q Worksheet.wst
2. The Select Variables Screen (Chapter 3) will appear.
• Select one or more variablc(s) from the Select Variables screen.
If graphs have to be produced by using a Group variable, then select a group variable by
clicking the arrow below the Group by variable button. This will result in a drop-down
list of available variables. The user should select and click on an appropriate variable
representing a group variable.
When the option button is clicked, the following window will be shown.
75
-------
Goodness-of-fit (Normal, Lognormal)
m
r Select Confidence Level ¦
C SO '/„
95%
C 99 %
"Method	
<• Shapiro Wilk
C Lilhefors
¦ Display Regression Lines 		
Do Not Display
(* Display Regression Lines
-Graphs by Group	
<• Individual Graphs
C Group Graphs
-Graphical Display Options	
(• Color Gradient
f For Export (BW Printers)
	I
OK	Cancel
o The default option for the Confidence Level is 95%.
o The default GOF Method is Shapiro Wilk. If the sample size is greater than 50, the
program automatically uses the Lilliefors test,
o The default method for Display Regression Lines is Do Not Display. If you want to
see regression lines on a Q-Q plot, then check the radio button next to Display
Regression Lines.
o The default option for Graphs by Group is Individual Graphs. If you want to see
the plots for all selected variables on a single graph, then check the radio button next
to Group Graphs.
Note: This option for Graphs by Group is specifically provided when the user wants to display multiple
graphs for a variable by a group variable (e.g., site AOCI, site AOC2, background). This kind of display
represents a useful visual comparison of the values of a variable (e.g., concentrations of COPC-Arsenic)
collected from two or more groups (e.g., upgradient wells, monitoring wells, residential wells).
o The default option for Graphical Display Options is Color Gradient. If you want
to see the graphs in black and white to be included in reports for later use, then check
the radio button next to For Export (BW Printers).
76
-------
• Click the OK button to continue or the Cancel button to cancel the Goodness-of-Fit tests.
Output Screen for Normal Distribution (Full)
Selected Options: Shapiro Wilk, Display Regression Line, and For Export (BVV Printers).
Normal Q-Q Plot for Arsenic
Arsenic
N - 20
Mean - 5.8725
Sd- 1.2247
Slope • 1.1798
Intercept - 5.8725
Correlation, R - 0.9281
Shapiro Wilk Test
Test Value - 0.8F.8
Critical Val(0.05) - 0.905
Data not Normal
0	1
Theoretical Quantiles (Standard Normal)
Arsenic
Output Screen for Lognormal Distribution (Full)
Selected options: Shapiro Wilk, Display Regression Lines, and Color Cradicnt.
2.30
250
2.10
2.00
a
C

Lognormal Q-Q Plot for Arsenic
*
Araenlc
N - 20
Mean -1 7S19
Sd-01917
Slope-01917
rtercept-1.7519
Correiation, R - 0.9636
Shapro-W* Test
Test Statistic - 0932
Crlical Value(0 05) - 0905
Data appear Lognormal
»
•
n
o
-a t ao
•




T3
O
1 70
1 60
1 so

M
d


A




140




-2

-10 1
Theoretical Quantiles (Standard Normal)

2
» Arsenic




77
-------
8.2.2 GOF Tests for Gamma Distribution
1. Click Goodness-of-Fit ~ Full ~ Gamma
ng File Edit Configure Summary Statistics ROSEst. NDs Graphs Outlier Tests
| Hypothesis Testing Background UCl Window Help
— — ^1^1 JG. uj,tk Mr* »
Norma!

Navigation Panel | 1

0
1
2
3 '	4
Lognormal
G.O.F. Statistics
7
8 | 9
Name
Arsenic
D_Arsenic



|
®WorkSheet.wst
1
!" "T
		L J
_ :	.	[ __ __ j 	!
2. The Select Variables Screen (described in Chapter 3) will appear.
•	Select one or more variable(s) from the Select Variables screen.
•	If graphs have to be produced by using a Group variable, then select a group variable by
clicking the arrow below the Group by variable button. This will result in a drop-down
list of available variables. The user should select and click on an appropriate variable
representing a group variable.
•	When the option button is clicked, the following window will be shown.
78
-------
Goodness-of-Fit (Gamma)
m
"Select Confidence Level -
C 90'/. •
f 95
r 99 %
r Method	
(• Anderson Darling
C Kolrwogorov Srnirnov
"Display Regression Lines	
Do Not Display
(* Display Regression Lines
Graph by Groups	
(* Individual Graphs
Group Graphs
-Graphical Display Options	
(• Color Gradient
C For Export (BW Printers)
OK
Cancel
o The default option for the Confidence Level is 95%.
o The default GOF method is Anderson Darling.
o The default option for Display Regression Lines is Do Not Display. If you want to
see regression lines on the Gamma Q-Q plot, then check the radio button next to
Display Regression Lines,
o The default option for Graph by Croups is Individual Craphs. If you want to see
the graphs for all the selected variables into a single graph, then check the radio
button next to Group Graphs,
o The default option for Graphical Display Options is Color Gradient. If you want
to see the graphs in black and white, check the radio button next to For Export (BW
Printers).
•	Click the OK button to continue or the Cancel button to cancel the option.
•	Click the OK button to continue or the Cancel button to cancel the Goodness-of-Fit tests.
79
-------
Output Screen for Gamma Distribution (Full)
Selected options: Anderson Darling, Display Regression Lines, Individual Graphs, and Color Gradient.

Gamma Q-Q Plot for Mercury


Mercury
1.00


N • 30




Mean ¦ 0 3055




k a® - 11722
0.90
4


SJcpe-10468




Intercept - -0.0111




Correlation, R - 05713
0.80
j


Anoerson-Daring Test




Test SJabstic - 0.730




Critical VAe(0OS)- 0.789
070



Data appear Gamma Detr touted





C 060
*



a




I




« 0 50
* .



o




TJ
* '



£ 0 40




"O




o




0.»
M




a



020
_ £
•v




. M M



0.10





it




** *



000







V> v>


Theoretical Quantilea of Gamma Distribution




Mercury



8.3 Goodness-of-Fit Tests Excluding NDs
1. Click Goodness-of-Fit ~ With NDs ~ Exclude NDs
1 ProUCL 4.0 - [Worksheet.wst]
¦2 Re Edit Configure Summary Statistics ROSEst. NDs Graphs Qutiier Tests

Navigation Panel
Name
& Worksheet wst
0 __1	 2
Arsenic D_Arsenie
1 1 Z3 0
T~| i 0
3 1 1.7	1
a	1 n
Goodness-of-Fit
With NDs ~! Exclude NDs
Hypothesis Testing Background UCL Window Help
Full ~ !

Log-ROS Estimates ~
Dl/2 Estimates	~
G.Q.F. Statistics
2. Select distribution to be tested: Normal, Gamma, or Lognormal.
•	To test for normality, click on Normal from the drop-down menu list.
•	To test for lognormality, click on Lognormal from the drop-down menu list.
•	To test for gamma distribution, click on Gamma from the drop-down menu list.
Normal

Gamma
9
Lognormal

r





i


80
-------
8.3.1 Normal and Lognormal Options
1. Click Goodncss-of-Fit ~ With NDs ~Excluded NDs ~ Normal or Lognormal
pM elBlml m
*X.
Navigation Panel
® Worksheet.wst
Name
9
2. The Select Variables Screen (Chapter 3) will appear.
•	Select one or more variable(s) from the Select Variables screen.
•	If graphs have to be produced by using a Group variable, then select a group variable by
clicking the arrow below the Group by variable button. This will result in a drop-down
list of available variables. The user should select and click on an appropriate variable
representing a group variable.
•	When the option button (Normal or Lognormal) is clicked, the following window will be
shown.
81
-------
Goodness-of-Fit (Normal, Log normal)
¦Select Confidence Level"
r 90 y.
G 95 '/„
r 59 V,
-Method"
<• Shapiro V'/i Ik
<"* Lilliefors
"Display Regression Lines	
Do Not Display
(* Display Regression Lines
"Graphs by Group	
<• Individual Graphs
<** GroupGraphs
"Graphical Display Options	
(* Color Gradient
For Export (BW Printers)
OK
Cancel
m
o The default option for the Confidence Level is 95%.
o The default GOF Method is Shapiro Wilk. If the sample size is greater than 50, the
program defaults to Lilliefors test,
o The default for Display Regression Lines is Do Not Display. If you want to see
regression lines on the associated Q-Q plot, check the radio button next to Display
Regression Lines.
o The default option for Graphs by Group is Individual Graphs. If you want to see
the plots for all selected variables on a single graph, check the radio button next to
Group Graphs.
Note: This option for Graphs by Group is specifically useful when the user wants to display multiple
graphs for a variable by a group variable (e.g., site AOC1, Site AOC2, background). This kind of display
represents a useful visual comparison of the values of a variable (e.g., concentrations of COPC-Arsenic)
collected from two or more groups (e.g., upgradient wells, monitoring wells, and residential wells).
82
-------
o The default option for Graphical Display Option is Color Gradient. If you want to
see the graphs in black and white, check the radio button next to For Export (B\V
Printers).
•	Click the OK button to continue or the Cancel button to cancel the option.
•	Click the OK button to continue or the Cancel button to cancel the Goodness-of-Fit tests.
Output Screen for Normal Distribution (Exclude NDs)
Selected options: Shapiro Wilk, Display Regression Lines, Group Graphs, and For Export (BW Printers).
Normal Q-Q Plots (Statistics using Detected Data)
forArsenlc (subsurface), Arsenic (surface)
Arsenic (subsurface)
Total Number of Data - 10
Numbei treated as ND - 1
Dl. - 4.5
N - 9
Percent NDs - 10%
Mean-5.6778
Sd-0.5911
Slope » 0.1431
Intercept» 5.6778
Correlation, R - 0.2265
Shapiro Wilk Test
Test Statistic - 0.927
Critical Value(0 .05) - 0.829
Data appear Normal
Arsenic (surface)
Total Number o( Data - 10
Number treated as ND - 2
DL - 4.5
N • 8
Percent NDs- 20%
Mean -6.6313
Sd ¦ 1.4400
Slope-0.1952
Intercept - 6.6313
Correlation, R - 0.1262
Shapiro Wilk Test
Test Statistic - 0.807
Critical Value(D.0S) - 0.818
Data not Normal
Theoretical Quantiles (Standard Normal)
-J- Arsenic (subsurface)	-o Arsenic (surface)
83
-------
Output Result for Lognormal Distribution (Exclude NDs)
Selected options: Shapiro Wilk, Display Regression Lines, Group Graphs, and Color Gradient.


Lognormal Q-Q Plots (Statistics using Detected Data)
for Arsenic (subsurface), Arsenic (surface)
Total Nurator oi Ml ¦ ID
Nuntoer treafed as NC • 1
DC • 1.50*0774
N-9
PercertNDs- 10%
Mean-1 7319
Sd-01019
Sk*»-01059
interrupt ¦ 1 7319
C err Motion. H • 0 9736
Test Slateoc - 0 938
ODem VaKO-OS) ¦ 0829
Data appear Lognonral
TcUi ferfcer of Ma • 10
ourfeet trealec a; W> • 2
DL • 1 5040774
N-B
Per cart NDs • 20%
Mean -1£731
Sd • 0 2016
SKpa . 01997
intercept ¦ 1 8731
CorreMnn, R • 09211
Srxpko-Wlk Test
Test Statistic » 0.835
Crfecal Va<0 05) • 0818
DHa appear Lognormal
-» Arsenic (subsurface)
Theoretical Quantiles (Standard Normal)
- Arsenic {surface)

8.3.2 Gamma Distribution Option
1. Click Goodness-of-Fit ~ With NDs ~Excluded NDs ~ Gamma
P ProUCL 4.0 [WorkSheet.wstJ
¦2 File Edit Configure Summary Statistics ROS Est. NDs Graphs Outlier Tests
Hypothesis Testing Background UCL Window Help
Full
~X
| X'" { 1—1 1 L_~l J UAJ | |
Navigation Panel

0 1 2
	¦
3 -
With NDs ~
Name

.Arsenic ¦ D_Arsenic


:> Worksheet wst
1
L H 0


2
T 0



3
1.7 1


Q Normal
Normal-ROS Estimates ~
Gamma-R OS Estimates ~ Lognormal
Log-ROS Estimates ~
DL/2 Estimates	~
G.O.F. Statistics
2. The Select Variables Screen (Chapter 3) will appear.
•	Select one or more variable(s) from the Select Variables screen.
•	If graphs have to be produced by using a Group variable, then select a group variable by
clicking the arrow below the Group by variable button. This will result in a drop-down
list of available variables. The user should select and click on an appropriate variable
representing a group variable.
When the option button (Gamma) is clicked, the following window is shown.
84
-------
Goodness-of-Fit (Ga ninia)
-Select Confidence Level	
r 50 %
95%
C 9B%
-Method	
<• Anderson Darling
C Kolnvogorov Smirnov
-Display Regression Lines	
C Do Not Display
(* Display Regression Lines
-Graph by Groups	
(* Individual Graphs
C Group Graphs
-Graphical Display Options	
(* Color Gradient
C For Export (BW Printers)
o The default option for the Confidence Level is 95%.
o The default GOF test Method is Anderson Darling,
o The default method for Display Regression Lines is Do Not Display. If you
want to see regression lines on the normal Q-Q plot, cheek the radio button next
to Display Regression Lines,
o The default option for Graph by Groups is Individual Graphs. If you want to
display all selected variables on a single graph, check the radio button next to
Group Graphs.
o The default option for Graphical Display Options is Color Gradient. If you
want to see the graphs in black and white, check the radio button next to For
Export (BW Printers).
Click the OK button to continue or the Cancel button to canc2l the option.
Click the OK button to continue or the Cancel button to cancel the Goodness-of-Fit tests.
OK
Cancel
85
-------
Output Screen for Gamma Distribution (Exclude NDs)
Selected options: Anderson Darling, Do Not Display, Individual Graphs, and For Export (BYV Printers).
Gamma Q-Q Plot for Mercury with NDs
Statistics using Detected Data
Mercury
Total Number of Data - 30
Number Heated a* NO - 5
Dl - 0.5
N - 25
Percent NDs - 17%
Mean - 0.3124
kstar- 1.0533
Slope - 1.0294
Intercept - 0.0048
Correlation, R - 0.9590
Anderson-Darling Test
Test Statistic - 0.861
Critical Value(0.05) = 0.770
Data not Gamma Distributed
Theoretical Quantiles of Gamma Distribution
--J- Mercury
8.4 Goodness-of-Fit Tests with Log-ROS Estimates
1. Click Goodness-of-Fit ~ With NDs ~ Log-ROS Estimates
P3 ProUCL 4.0 - [WorkSheet.wst]
¦is-' Rle Edit Configure Summary Statistics ROS Est, NDs Graphs Outlier Tests
Goodness-of-Fit
Hypothesis Testing Background UCL Window Help
Full
M
Navigation Panel

0 1
2

With NDs ~
Name
Arsenic D Arsenic



J Worksheet wst
1
| l| 0


2
1 0


3
1.7 1

Exdude NDs	~
Normal-ROS Estimates ~
Gamma-RQS Estimates ~
Log-ROS Estimates
DL/2 Estimates
G.O.F. Statistics
Q Normal
~ Gamma
Lognormal
2.
Select the distribution to be tested: Normal, Lognormal, or Gamma
•	To test your variable for normality, click on Normal from the drop-down menu list.
•	To test a variable for gamma distribution, click Gamma from drop-down menu list.
•	To test your variable for lognormality, click on Lognormal from drop-down menu.
86
-------
8.4.1 Normal or Lognormal Distribution (Log-ROS Estimates)
1. Click Goodness-of-Fit ~ With NDs ~ Log-ROS Estimates ~ Normal, Lognormal
ProUCL 4.0 - [WorkSheet.wst]
n5, File Edit Configure Summary Statistics ROS Est. 1'fDs Graphs Outiier Tests

©WorkSheet.wst
Navigation Panel

Name
0
Arsenic
1
D_Arsemc
0;
oi
1'
Goodness-of-Rt
FuD
Hypothesis Testing Background UCl Window Help
~I
Exdude NDs	~
Normal-ftOS Estimates ,~
Gamma-ROS Estimates ~
Log-ROS Estimates ~
DL/2 Estimates
G.O.F. Staostics
Normal
Gamma
Lognormal
2. The Select Variables Screen (Chapter 3) will appear.
•	Select one or more variable(s) from the Select Variables screen.
•	If graphs have to be produced by using a Group variable, then select a group variable by
clicking the arrow below the Group by variable button. This will result in a drop-down
list of available variables. The user should select and click on an appropriate variable
representing a group variable.
•	When the option button (Normal or Lognormal) is clicked, the following window will be
shown.
87
-------
Goodness-ofFit(Mormal, Lognormal)
"Select Confidence Level-
C 9Q'/„
& 95 */.
r 99%
r Method ¦
(* Shapiro Wi Ik
C" Lilliefors
¦Display Regression Lines	
Do Not Display
(* Display Regression Lines
-Graphs by Group 			
<•" Individual Graphs
r Group Graphs
-Graphical Display Options	
(• Color Gradient
C For Export (BW Printers)
OK
Cancel
m
The default option for the Confidence Level is 95%.
The default GOF test Method is Shapiro Wilk. If the sample size is greater than 50,
the program defaults to use the Lilliefors test.
The default method for Display Regression Lines is Do Not Display. If you want to
see regression lines on the normal Q-Q plot, check the radio button next to Display
Regression Lines.
The default option for Graphs by Group is Individual Graphs. If you want to
display all selected variables into a single graph, check the radio button next to
Group Graphs.
The default option for Graphical Display Options is Color Gradient. If you want
to see the graphs in black and white, check the radio button next to For Export (BW
Printers).
-------
•	Click the OK button to continue or the Cancel button to cancel the option.
•	Click the OK button to continue or the Cancel button to cancel the Goodness-of-Fit tests.
Output Screen for Normal Distribution (Log-ROS Estimates)
Selected options: Shapiro Wilk, Display Regression Lines, Group Graphs, and For Export (B\V Printers),



Normal Q-Q Plots using Robust ROS Method
Arsenic

10.00


for Arsenic, Mercury
N- 20
Mean -5.8307
Sd - 1.2799





9.00


J
J
Slop* - 1.2517
Intercept - 5.8307

8.00



Correlation, R » 0.9421
Shapiro Wilk Test

7.00



Test Value - 0.895



J 3
Critical Val(0.05) - 0.905
t/>




Data not Normal
o
w
>
6.00


^ j j *
Mleicury
N- 30
cu
(/>
n
5.00



Mean - 0.2767




Sd - 0.2984
O
¦a
4.00
J


Slope - 0.2643
a>




Intercept - 0.2767
¦o
i.
O
3.00
2.00
1.00


a o O °..
Correlation, R - 0.8618
Shapiio Wilk Test
Test Value - 0.733
Critical Val(0.05) = 0.927
Data not Noimal

0.00
o
O Q.
n „ r, w u „ 1, ,| , 		 - -p " " "



—. -



-2	-10	1
Theoretical Quantiles (Standard Normal)
-j- Arsenic	o- Mercury
Note: The legend size of nondetect values is smaller than that of the detected values.
89
-------
Output Screen for Lognormal Distribution (Log-ROS Estimates)
Selected options: Shapiro Wilk, Display Regression Lines, Group Graphs, and Color Gradient.
Lognormal Q-Q Plot for Group
Statistics using Robust ROS Method
'*£19
£
"2170
o
Arsenic (subsurface)
N" 10
Mew -1 7068
Sd-0.1246
Slope-0.1308
Irtercept -1 7068
Correlation, R - 0 9866
Shaptro-W* Test
Test Statistic - 0.981
Critical Value(0 05) - 0.842
Data appear Lognormal
Arsenie (surface)
N * 10
Mean -1 7781
Sd -02678
Slope - 0.2758
intercept -1.7781
Correlation, R « 0 9682
Shapro-V\* Test
Test Statistic ¦ 0 930
Critical Value(0 05) - 0 842
Data appear Lognormal
-a- Arsenic (subsurface)
Theoretical Quantiles of Gamma Distribution
¦* Arsenic (surface)
Note: The legend size of nondetect values is smaller than that of the detected values and is shown in red.
8.4.2 Gamma Distribution (Log-ROS Estimates)
1. Click Goodness-of-Fit ~ With NDs ~ Log-ROS Estimates ~ Gamma
PP ProllCL 4.0 - [Worksheet.wstj
¦2 File Edit Configure Summary Statistics ROS Est. NDs Graphs Outiier Tests |

UO. Window Help
®\
-------
When the option button (Gamma) is clicked, the following window will be shown.
Goodness-of-Rt (Gamma)
Select Confidence Level
T 90%
^ 95 'A
f 99 %
-Graph by Groups	
(* Individual Graphs
C Group Graphs
-Graphical Display Options	
(* Color Gradient
For Export (GW Printers)
"Method	
(* Anderson Darling
C Kolnwgorcf.' Smirrvov
-Display Regression Lines	
Do Not Display
(* Display Regression Lines
OK
Cancel
m
o The default option for the Confidence Level is 95%.
o The default GOF test Method is Anderson Darling.
o The default method for Display Regression Lines is Do Not Display. If you want to
see regression lines on the normal Q-Q plot, check the radio button next to Display
Regression Lines.
o The default option for Graph by Groups is Individual Graphs. If you want to put
all of the selected variables into a single graph, check the radio button next to Group
Graphs.
o The default option for Graphical Display Options is Color Gradient. If you want
to sec the graphs in black and white, check the radio button next to For Export (BW
Printers).
• Click the OK button to continue or the Cancel button to cancel the Goodness-of-Fit tests.
91
-------
Output Screen for Gamma Distribution (Log-ROS Estimates)
Selected options: Anderson Darling, Display Regression Lines, Individual Graphs, and Color Gradient.
0.70
C 0.60
0
1
2
« 050
Gamma Q-Q Plot for Mercury
Statistics using Robust ROS Method
Mercury
N-30
Mean • 0.2767
k star - 10653
Slope -11071
lnte>ceft> -00262
Correlation, R ¦ 0.9579
Anderson-Daring Test
Test SlaBstfc -1.425
Crtcal VaUB(OOS) • 0 772
Date not Carnne Distributed
0X1058 - 0X1400
00165
0.0326
00326
00492
0 0607
00785
0	0785
00974
0.1107
1	1000
0C550
00550
CC600
CC645
C.C700
0X700
0X867
0X3922

6^	C?	C?	6^
Theoretical Quantiles of Gamma Distribution
Note: The legend size of nondetect values is smaller than that of detected values and is shown in red.
8.5 Goodness-of-Fit Tests with DL/2 Estimates
1. Click Goodness-of-Fit ~ With NDs ~ DL/2 Estimates
f? ProUCL 4,0 - [WorkSheet.wst]
«S File Edit Configure Summary Statistics ROS Est. IsDs Graphs Outlier Tests
Goodness-of-Fit
Hypothesis Testing Background UCl Windov. Help
cJl^lPlalml in
Navigation Panel :
Fu#
~
Name
v> WorkSheet.wst
0	1
Arsenic D_Arsenic
il	0
''1 w' """i!
1 o
ExdudeNDs	~
Nomnal-ROS Estimates	~
Gamma-ROS Estimates	~
Log-ROS Estimates	~
DL/2 Estimates
G.O.F. Statistics
2. Select the distribution to be tested: Normal, Gamma, or Lognormal
•	To test the variable for normality, click on Normal from the drop-down menu list.
•	To test the variable for lognormality, click on Lognormal from the drop-down menu list.
92
-------
• To test your variable for gamma distribution, click on Gamma from the drop-down menu
list.
8.5.1 Normal or Lognormal Distribution (DL/2 Estimates)
1. Click Goodness-of-Fit ~ With NDs ~ DL/2 Estimates ~ Normal or Lognormal
ProUCL 4.0 - fWo'rk$heet.wst]
oQ File Edit Configure Summary Statistics ROSEst. NDs Graphs Outiier Tests
¦Goodness-of-Fit
Hypothesis Testing Background UCL Wmdow Help


Normal-ftOS Estimates ~

S
9
Gamma-ROS Estimates ~



log-ROS Estimates ~

DL/2 Estimates
G.O.F. Statistics
Q Normal
TTT"
Gamma
Lognormal
2. The Select Variables Screen (Chapter 3) will appear.
•	Select one or more variable(s) from the Select Variables screen.
•	If graphs have to be produced by using a Group variable, then select a group variable by
clicking the arrow below the Group by variable button. This will result in a drop-down
list of available variables. The user should select and click on an appropriate variable
representing a group variable.
•	When Normal or Lognormal button is clicked, following window is displayed.
93
-------
Goodrtess-of-Fit (Normal, tognormaI)
"Select Confidence Level
C 90 %
f 95 '/~
r 99 V.
r Method ¦
(* Shapiro Wilk
C Lilliefors
"Display Regression Lines	
Do Not Display
(* Display Regression Lines
"Graphs by Group	
(* Individual Graphs
C Group Graphs
'Graphical Display Options	
<• Color Gradient
C For Export (BW Printers)
OK
m
Cancel
The default option for the Confldence Level is 95%.
The default Method is Shapiro Wilk. If the sample size is greater than 50, the
program defaults to the Lilliefors test.
The default method for Display Regression Lines is Do Not Display. If you want to
see regression lines on the normal Q-Q plot, check the radio button next to Display
Regression Lines.
The default option for Graphs by Group is Individual Graphs. If you want to put
all of the selected variables into a single graph, check the radio button next to Group
Graphs.
The default option for Graphical Display Options is Color Gradient. If you want
to see the graphs in black and white, check the radio button next to For Export (BVV
Printers)
-------
•	Click the OK button to continue or the Cancel button to cancel the option.
•	Click the OK button to continue or the Cancel button to cancel the Goodness-of-Fit tests.
Output Screen for Normal Distribution (DL/2 Estimates)
Selected options: Shapiro Wilk, Display Regression Lines, Group Graphs, and Color Gradient.
£
• en)
Normal Q-Q Plot for Group
Statistics using DL/2 Method
Arsenic (subsurface)
N-10
Mean = 5.3350
Sd-1.2188
Slope-1.1396
Intercept» 5.3350
Correlation, R - 0.8792
Shapiro-m Statistic
Test Value - 0 803
Crtical Vai(0 05) • 0.842
Data not Normal
Arsenic (surface)
N-10
Mean -5.7450
Sd-2 2592
Slope • 2.2938
Mercef* ¦ 5 7450
Correlation, R - 0 9547
Sftapro-W* Statistic
Test Value - 0 909
Crtical Vai(0.05)-0842
Data appear Normal
* Ajserac (subs i»f ace)
Theoretical Quantiles (Standard Normal)
Arsenic (surface)
Note: The legend size of nondetect values is smaller than that of the detected values and is shown in red.
95
-------
Output Screen for Lognormal Distribution (DL/2 Estimates)
Selected options: Shapiro Wilk, Display Regression Lines, Individual Graphs, and For Export (B\V Printers).
Lognormal Q-Q Plot for Mercury
Statistics using DL/2 Method
Mercury
N-30
Mean - -1.7413
Sd - 0.9845
Slope - 0.9865
Intercept - -1.7413
Correlation. R - 0.9748
Shapiro Wilk Test
Test Statistic - 0.931
Critical Value(0.05) = 0.927
Data appear Lognormal
1	0	1
Theoretical Quantiles of Gamma Distribution
-=)- Mercury
Note: The legend size of nondetect values is smaller than that of the detected values. The color is not
shown on this graph as this graph is for BW printers.
8.5.2 Gamma Distribution (DL/2 Estimates)
1. Click Goodness-of-Fit ~ With NDs ~DL/2 Estimates ~ Gamma
W ProUCL 4.0 [Worksheet.wst]
File Edit Configure Summary Statistics ROS Est. NDs Graphs Outlier Tests
Goodness-of-Fit
Hypothesis Testing Background UCL Window Help
o\<&\ plslml El
Navigation Panel
Name
0 Worksheet wst
- - ... - ¦¦¦¦--•J 1
With NDs ~
Exdude NDs ~
Normal-ROS Estimates ~
*— - - 		

0
1
2
3
—
	,	

8
9

Arsenic
P_Aj.ser.iC



Gamma-ROS Estimates ~
log-ROS Estimates ~



I 1 I
! 1i
0





a
i
0


DL/manates
B
Normal
< 3
17
1
G.O.F. Statistics
Gamma
4
1
0



Lognormal
2. The Select Variables Screen (Chapter 3) will appear.
•	Select one or more variable(s) from the Select Variables screen.
•	If graphs have to be produced by using a Group variable, then select a group variable by
clicking the arrow below the Group by variable button. This will result in a drop-down
96
-------
list of available variables. The user should select and click on an appropriate variable
representing a group variable.
When the Gamma option button is clicked, the following window will be shown.
Goodness-of-Fit (Ga mma)
pSelect Confidence Level
C 90 %
95 %
r 99 %
¦Method	
Anderson Darling
C Kolrriogorov Srnirnov
"Display Regression Lines	
C Do Not Display
(* Display Regression Lines
[—Graph by Groups	
f* Individual Graphs
C Group Graphs
^Graphical Display Options	
(* Color Gradient
C For Export (BW Printers)
OK
Cancel
o The default option for the Confidence Level is 95%.
o The default Method is Anderson Darling.
o The default method for Display Regression Lines is Do Not Display. If you want to
see regression lines on the normal Q-Q plot, check the radio button next to Display
Regression Lines.
o The default option for Graph by Groups is Individual Graphs. If you want to put
all of the selected variables into a single graph, check the radio button next to Group
Graphs.
97
-------
o The default option for Graphical Display Options is Color Gradient. If you want
to see the graphs in black and white, check the radio button next to For Export (BW
Printers).
• Click the OK button to continue or the Cancel button to cancel the Goodness-of-Fit tests.
Output Screen for Gamma Distribution (DL/2 Estimates)
Selected options: Anderson Darling, Display Regression Lines, Individual Graphs, and Color Gradient.
Gamma Q-Q Plot for Mercury
Statistics using DL/2 Substitution Method
N - 30
Mean • 0 2829
k star -10875
Slope -10897
Intercept - -0.0220
Conefabon.R-0 9617
Anderson-Oaring Tesl
Test SMb&c ¦ 1 126
Critical Vahae(0 OS) • 0 771
Data not Gamma Distributed
C 0.60
O
S
£
• O.SO
n
O
x>
£ 0 40
/

Theoretical Quantiles of Gamma Diatribution
Note: The legend size of nondetect values is smaller than that of the detected values and is shown in red.
8.6 Goodness-of-Fit Tests Statistics
1. Click Goodncss-of-Fit ~ With NDs ~ G.O.F. Statistics
1? ProUCL 4.0 - [WorkSheet.wst]
¦2 File Edit Configure Summary Statistics ROS Est. NDs Graphs Outlier Tests
Goodness-of-fit
I Hypothesis Testing Background
UCl Window Help
ol«a! ib|b m ml
mzmm
K 	
Exdude NDs ~
Normal-ROS Estimates ~
Gamma-ROS Estimates ~
Log-ROS Estimates ~
DL/2 Estimates ~

Navigation Panel j

0
1 2
3



S
9
1 Name
Arsenic
D_Arseric





Ol WorkSheet.wst
1
~"Tj
	0





0 GOF_Stats_wND ost
2
1
0



3
1.7
1




2. The Select Variables Screen (Chapter 3) will appear.
• Select one or more variable(s) from the Select Variables screen.
98
-------
• When the option button is clicked, the following window will be shown.
1 SI Select Confidence Level
rSelect Confidence Level
r 90%
f 95%
r 99 %
OK
Cancel

•	The default confidence level is 95%.
•	Click the OK button to continue or the Cancel button to cancel the option.
Sample Output Screen for G.O.F. Test Statistics
1 1 ' 1 1 1 1 , 1 1 1
[Goodness
of-Fit fftat SlaiistKs far Date Sets with MorvDrtncte


User Selected Options







From File Worksheet v
¦st






Full Precision OFF







CofTfiaence Coefficient 0 55
—






Ajseoac

—
-



- - i

MumObs
Nutr. Miss
Mum Vslis
Oettcts
NDs
V. NDs

Arsenic tea
24
0
2J
11
13
54 17'/.

Statistics (Non-Detects Only)
Number
Minimum
Maximum
Mean
Median
50


05
2
1603
2
0517
'
Slstisucs {Detects Onty)

05
32
1 235
07
0S65 t

Statistics (All NDs trejtea as DLsalue)'
24
* " 0 5"
32
1 438
i 25
0 761'

Statistics (Ail r^Os treated as DL/2 value)
24
045
32
1 002'
0 95
'

Statistics iN:rmsl ROS Essrr.alaa Osta)
24
-00995
32
0 597
0 737
~077S 1

Stsiistics (Gamma ROS Estimates Osta)'
24
"05
32
1 263
1 213
0652 !
!
Statistic* ^ognorrrjl ROS Esamaietf Data),

0349
22
097?
07
0713
1
i
KHat
KSlar
Theta Hat
Log Mean
Log S:dv
Log C V ,

Stajstics [Delects Only) |
2 257
2 002'
0 6-43
-0 0255
0 £94
27 26 ;
+
Statistics {NDs - DL) j
3 532
'312*
" 0406
0215
0574
"2«9 :

Clmix.^t (Uflf . ni "A 1
iiLl

?C*7
n '
.nic"
nw
.1781

-------
Output Screen for the G.O.F. Test Statistics - (continued)
I ! I 1 ! I 1 I ! 1
Normal Distribution Teat Resiib !

Test .-slue
Cnt (0 95), Conclusion with AlphatQ 05) |
Shapuo-Wilks (Detect Only)
0 777
0 85 j Data Not "Normal
Ulhcfcrs (Detects Only)
0 273
0 267 Data Mot Normal 1
Shapir>Wifks (NDs = DL) ]
ois
0 516 Data Nol Normal
blliefors [NDs = DL)
0217
0.1 31 Data Hot Normal
Shapiro-Mlks (MDs = DL*2)
0 701
0&16 'Data Not Normal
Lilliefors (NDs <> OLQ),
0 325
0131 Data Not Normal
Shspiro-Wilks (Normal ROS Estimates)
~ "OSes
0916 ' Data Not Normal
Ulhefors (Normal ROS Estimates)
017
0131 fData Appear Normal
Gamma Distribution Test Resdts
	 	
Testvalue
CnM095) Conclusion with Alpha(0C5)
Anderson-Darling (Detects Onty)
" o:sT
0738 ; " i
KcJmogorov-Srrjrnov (Detects Or.ty")'
"o
0258 'Data appear Approximate Gamma Distribution i
Andersen-Darling (NDs ¦ DL),
Q~58~"
075 'j " ¦ " "!
Kolmogorov-Smirnov (NDs » DL)
02K
0179 Data Not Gamma Distributed
Anderson Darting (NDs ¦ Dl/2),
1 492
o"75i ! ;
Kolmogorw-Snumcv (NDs ¦ DL/2) J
0261 ~
0 179 Data Not Gamma Distributed
jifldtrsoivDarling (Gamma ROS'Esb mates)'
0"52i
0 747"! ,
Kcimogorov-Smirnov (Gamma ROS Est)1
0 127
0179 jDataAppear Gamma Distributed



100
-------
Chapter 9
Single Sample and Two-Sample Hypotheses
Testing Approaches
This chapter illustrates single sample and two-sample parametric and nonparametric hypotheses testing
approaches as incorporated in ProUCL 4.0. ProUCL 4.0 can perform these hypotheses tests on data sets
with and without nondetect observations. It should be pointed out that, when one wants to use two-sample
hypotheses tests on data sets with NDs, ProUCL 4.0 assumes that samples from both of the groups have
nondetect observations. All this means that, a ND column (with 0 or 1 entries only) needs to be provided
for the variable in each of the two groups. This has to be done even if one of the groups has all detected
entries; in this case the associated ND column will have all entries equal to "1This will allow the user
to compare two groups (e.g., arsenic in background vs. site samples) with one of the groups having some
NDs and the other group having all detected data.
9.1 Single Sample Hypotheses Tests
In many environmental applications, single sample hypotheses tests are used to compare site data
(provided enough site data arc available) with prc-specified cleanup standards or compliance limits.
ProUCL 4.0 contains single sample parametric and nonparametric tests including Student's t-test, sign
test, Wilcoxon Signed Rank (WSR) test, and test for proportion. The single sample hypotheses tests arc
useful when the environmental parameters such as the clean standard, action level, or compliance limits
(CLs) are known, and the objective is to compare site concentrations with those known threshold values.
Specifically, a t-test (or a sign test) may be used to verify the attainment of cleanup levels at an AOC after
remediation activity; and a test for proportion may be used to verify if the proportion of exceedances of an
action level (or a compliance limit) by sample concentrations collected from an AOC (or a MW) exceeds
a certain specified proportion (e.g., 1%, 5%, 10%).
ProUCL 4.0 can perform these hypotheses on data sets with and without nondetect observations.
However, it should be noted that for single sample hypotheses tests (e.g., sign test, proportion test) used
to compare site mean/median concentration level with a cleanup standard, Cs,or a compliance limit (e.g.,
proportion test), all NDs (if any) should lie below the cleanup standard, Cs For proper use of these
hypotheses testing approaches, the differences bctween.these tests should be noted and understood.
Specifically, a t-test or a WSR test are used to compare the measures of location and central tendencies
(e.g., mean, median) of a site area (e.g., AOC) to a cleanup standard, Cs, or action level also representing
a measure of central tendency (e.g., mean, median); whereas, a proportion test compares if the proportion
of site observations from an AOC exceeding a compliance limit (CL) exceeds a specified proportion, P0
(e.g., 5%, 10%). ProUCL 4.0 has useful graphical methods that may be used to visually compare the
concentrations of a site area of concern (AOC) with an action level. This can be done using a box plot of
site data with action level superimposed on that graph. The details of the various single sample
hypotheses testing approaches can be found in EPA guidance documents (1989, 2006). A brief discussion
of these methods is also given in the ProUCL 4.0 Technical Guide.
101
-------
9.1.1 Single Sample Hypothesis Testing for Full Data without Nondetects
1. Click Hypothesis Testing^ Single Sample
F?jProUCL 4,0 - [Worksheet, wst]
oy F3e Edit Configure Summary Statistics ROSEst. NDs Graphs Outlier Tests Goodness-of-Fit
Background UCL Window Help
PlelBlBlmlPl
Navigation Panel
Name
O WorkSheet.wst
1 Sample-Prop
4.19.
hypothesis Testing.
Single Sample ~ » Full (w/o NDs) >
Two Sample ~ ! WithhCs
t-Test
Proportion
Sign test
Wilcoxon Signed Rank
2. Select Full (w/o NDs) - This option is used for full data sets without nondetects.
•	To perform a t-test, click on t-Tcst from the drop-down menu as shown above.
•	To perform a proportion test, click on Proportion from the drop-down menu.
•	To run a sign test, click on Sign test from the drop-down menu.
•	To run a Wilcoxon Signed Rank test, click on Wilcoxon Signed Rank from the drop-
down menu.
9.1.1.1 Single Sample t-Test
1. Click Hypothesis Testing ~ Single Sample ~ Full (w/o NDs) ~ t-Test
Igj-ProUCL 4.0 - [Worksheet.wst]
~y Ht Edit Configure Summary Statistics ROSEst. fiDs Graphs Outlier Tests Goodness-of-Fit
I Background UCL Wridow Help
g|«al elBlml m\
Hypothesis Testing I
Single Sample ~ I Fii (w/o NDs) ~ | t-Test
Navigation Panel
Name
Proportion
Sign test
Wilcoxon Signed Rank
OWorkSheet.wst
2. The Select Variables Screen (see page 130) will appear. -
•	Select variable (variables) from the Select Variables screen.
•	When the Options button is clicked, the following window will be shown.
102
-------
m
Single Sample t Test Options
Confidence Level
Substantial Difference. S
(Used v/ith Test Form 2)
Compliance Limit
"Select Null Hypothesis Form	
0.95
(* Mean <= Compliance Limit {Form 1)
C Mean :== Compliance Limit (Form 2)
C Mean >= Compliance Limit + S (Form 2)
C Mean = Compliance Limit (2 Sided Alternative)
OK
Cancel
o Specify the Confidence Level; default is 0.95.
o Specify meaningful values for Substantial Difference, S and the Compliance Limit.
The default choice for S is "0."
o Select the form of Null Hypothesis; default is Mean <= Compliance Limit (Form 1).
o Click on OK button to continue or on Cancel button to cancel the test.
103
-------
Output for Single Sample t-Tcst (Full Data without NDs)
1 Sample-1
Single Sample t-Test
Raw Statistics
Number of Valid Samples j 9
Number of Distinct Samples] 9
Minimum]
82.39

Maximum!
1132

Mean'
99.38

Mediam
1035" "

SD;
10 41

SE of Mean|
3468

HO: Site Mean = 100
	
		
Test Value!
"~o;i78"~

two Sided Critical Value (0 05) j
"~2 3D6~
r""""
P-Value!
i



Conclusion with Alpha = 0.05


Do Not Reject HO. Conclude Mean = 100


P-Value > Al pha (0.06)
9.1.1.2 Single Sample Proportion Test
1. Click Hypothesis Testing ~ Single Sample ~ Full (w/oNDs) ~ Proportion
Vc\ ProUCL 4.0'.- [y/orkSheet.wst]
uQ File Edit Configure Summary Statistics ROSEst. NDs Graphs Outlier Tests Goodness-of-Fit
E)|«S| BlBlEDl-Bl
Navigation Panel I
Name
OWorkSheet.wst
ISample-Prop
" Vft'
5 3085"
Hypothesis Testing
-Single Sample ~
Background UCL Window Help
Full (tv/o NDs) »
Two Sample ~ j With NDs
M*'	b—'T
t-Test
Sign test
Wilcoxon Signed Rank
~r~
2. The Select Variables Screen will appear.
•	Select variable (variables) from the Select Variables screen.
•	When the Options button is clicked, the following window will be shown.
104
-------
H Single Sample Proportion Test Options [x]
Confidence Level
Proportion
Action/Compliance Limit
0.95
0.3
-Select Null Hypothesis Forrrr
f* P <= Porportion (Form 1)
<"* P .?= Proportion (Form 2)
f P = Proportion (2 Side Alternatived)
OK
Cancel
A
Specify the Confidence level; default is 0.95.
Specify the Proportion level and a meaningful Action/Compliance Limit.
Select the form of Null Hypothesis; default is P <= Proportion (Form 1).
Click on OK button to continue or on Cancel button to cancel the test.
-------
Output for Single Sample Proportion Test (Full Data without NDs)
One-Sample Proportion Test
Raw Statistics
Number of Valid Samples j	S5
Number of Distinct Samples j	S3
Minimum!	0.598
Maximum!	7 676
Meanj	5183^
Medianj	5.564
" ~	"	1588
SEofNfeanj	0172
Number of Exceedances j	27
Sample Proportion of Exceedances j	0 318
HO: Site Proportion <=0.3 (Forml)
Large Sample z-Test Value!	0.237 ^
Critical Value (0.05)!	1645 1
P-Vaiuej	(H06 |
Conclusion with Alpha =0.05
Do Not Reject HO. Conclude Site Proportion <= 0.3
P-Value > Alpha (0.05)
9.1.1.3 Single Sample Sign Test
1. Click Hypothesis Testing ~ Single Sample ~ Full (w/o NDs) ~ Sign test
bJ3 File Edt Configure Summary Statistics ROSEsl.NDs Graphs Outlier Tests Goodness-of-Fit JRnSjJSWsSS
Background UCl Wndow Help
rikM Blnlml nl T ii-'if-'irv-MT'FMfl t-Test

Navigation Panel |

0 | '
2

1 I

11
3 4 | -j ' |

j i \
i »]
J wacoxon Siffned Rank
Name |
i Sign Test j
Arsenic
D_Arcenic |
! 1

I

0 WorkSheet.wst
1 !
L..,!!-i		 -!	>	-i- 	i-- -i
2. The Select Variables Screen will appear.
•	Select variable (variables) from the Select Variables screen.
•	When the Options button is clicked, the following window will be shown.
106
-------
Sample Sign; Test Options
Confidence Level
Substantial Difference. S
(Used with Test Form 2]
0.95

Action/Compliance Limit | 0~
Select Null Hypothesis Form	
(* Median <= Compliance Limit (Form 1)
<*" Median >= Compliance Limit (Form 2)
f Median >= Compliance Limit + S (Form 2)
Median = Compliance Limit (2 Sidsd Alternative)
OK
Cancel
A
Specify the Confidence Level; default choice is 0.95.
Specify meaningful values for Substantial Difference, S and Action/Compliance
Limit.
Select the form of Null Hypothesis; default is Median <= Compliance Limit (Form
1).
Click on OK button to continue or on Cancel button to cancel the test.
107
-------
Output for Single Sample Proportion Test (Full Data without NDs)






- - -
Single Sample Sign Test
-


Raw Statistics



Number of Valid Samples
""10"
...

Number of Distinct Samples
~10~ '




Minimum
750




Maximum
Tiei




Mean
9257
'



Median
338




SD
136.7




SEof Mean
~ ~43 24~



Number 4bove Limit
3



Number Equal Limit
0



Number Below Limit
7
	
HO: Site Median >=1000 (Form 2)
		
	 	
	
.. 		
	
test Value
3
I

Lcwer Critical V2lue (0.05) j 1
|
	


P-Value
0.172
I
j.	
Conclusion with Alpha = 0.05
	
	
	
Do Not Reject HO. Conclude Median >= 1000

P-Value> Alpha (0.05)



9.1.1.4 Single Sample Wilcoxon Signed Rank (WSR) Test
1. Click Hypothesis Testing ~ Single Sample ~ Full (w/o NDs) ~ Wilcoxon Signed Rank
n9 File Edit Configure Summary Statis&cs ROSEst. Mis Graphs Outfcer Tests Goodrtess-of-Fit
Sackground UQ_ Window Help
ei|«6l olBlml'nl i
SrgleSample ~
Fid (w/o NDs) ~
t-Test
Proportion
Sign test
11
Navigation Pane! |

Oil
2
3
4 |
r
-5	1
|	B—
"¦¦•¦¦¦¦'¦"-I	B
Name I

; VJSR1 I
V/SR2 j

Arsenic
| D_Ar3enic'

I
.. Wilcoxon Signed Rank- -
0 Worksheet v
-------
SI {Single Sample'Wilcoxon Signed Rank Test Options ; Xj j
Confidence Level | 0^55
Substantial Difference. S
(Used with Test Form 2)
-fl
Action/Compliance Limit ) 0
"Select Null Hypothesis Form	
(* Mean/Median <= Compliance Limit (Form 1)
C Mean/Median Compliance Limit (Form 2]
C Mean/Median >-= Compliance Limit + S (Form 2)
C Mean/Median = Compliance Limit (2 Sided Alternative)
Specify the Confidence Level; default is 0.95.
Specify meaningful values for Substantial Difference, S, and Action/Compliance
Limit.
Select the form of Null Hypothesis; default is Mean/Median <= Compliance Limit
(Form 1).
Click on OK button to continue or on Cancel button to cancel the test.
OK
Cancel
109
-------
Output Tor Single Sample Proportion Test (Full Data without NDs)
Single Sample Wilcoxon Signed Rank Test

Raw Statistics
Number of Valid Samples
10

Number of Distinct Samples
10

Minimum
750

Maximum
11S1

Mean
925 7

Median
S88

SD
135 7

SE of Mean
43 24

Number Above Limit
3

Number Equal Limit
0

Number Below Limit j 7

T-plus 11.5

T-minus 43.5


HO: Site Median <= 1000 (Form 1)

Test Value
11.5

Critical Value {0 05)
45

P-Valuej0 947


Conclusion with Alpha = 0.05
Do Not Reject HO. Conclude Mean/Median <= 1000
P-Value > Alpha (0.05)
110
-------
9.1.2 Single Sample Hypothesis Testing for Data Sets with Nondetects
Most of the one-sample tests such as the Proportion test and the Sign test on data sets with nondetect
values assume that all nondetect observations lie below the compliance limit (CL) or an action level, A0.
The single sample tests cannot be performed if ND observations exceed the CL or action levels.
I. Click on Hypothesis Testing^ Single Sample
§)proUCL 4.0 - [Worksheet.wst]
oS File Edit Configure Summary Statistics ROSEst. NDs Graphs Outlier Tests Goodness-of-Rt
ei|«al BlBlml ml
Navigation Panel
Name
3 I '
Hypothesis Testing
Single Sample ~
Two Sample ~
-5	,	b	
eackgiourtd UCl Window Help
fuH(w/qnd5) > [
Proportion
Sign test
Wilcoxon Signed Rank
0 WorkSheet.wst
2. Select the With NDs option
•	To perform a proportion test, click on Proportion from the drop-down menu.
•	To perform a sign test, click on Sign test from the drop-down menu.
•	To perform a Wilcoxon Signed Rank test, click on Wilcoxon Signed Rank from the drop-
down menu list.
9.1.2.1 Single Proportion Test on Data Sets with NDs
1. Click Hypothesis Testing ~ Single Sample ~ With NDs ~ Proportion
2. The Select Variables screen will appear.
•	Select variable (variables) from the Select Variables screen.
•	When the Options button is clicked, the following window will be shown.
Ill
-------
le Sawpte Proportion.Test Options |X[
Confidence Level
Proportion
Action/Compliance Limit
0.95
03
"Select Null Hypothesis Form"
(* P <= Porportion (Form 1)
C P i= Proportion (Form 2)
P = Proportion (2 Side Alternative^)
OK
Cancel

Specify the Confidence Level; default is 0.95.
Specify meaningful values for Proportion and the Action/Compliance Limit.
Select the form of Null Hypothesis; default is P <= Proportion (Form 1).
Click on OK button to continue or on Cancel button to cancel the test.
-------
Output for Single Sample Proportion Test (with NDs)
Arsenic
Single Sample Proportion Test
Raw Statistics
Number of Valid Samples]	24	!
Number of Distinct Samples]	10	j
Number of Non-Detect Data;	13	i
Number ofbetected Data j	11	I
Percent Non- Detects;	17%	j
Minimum Non-detect- 0 9
			. 	_ i
Maximum Ncn-detect 2 i
Minimum Detected j 0.5 j
Maximum Detected, 3.2 '
Mean of Detected Data! 1.236 :
_ __ 	 ! \
Median of Detected Data j	0 7 j
SD of Detected Data j	0 9S5 |
Number of Exceedsnces	2 j
Sample Proportion of Exceedancesj	0.0333 j
Some Non-Detect Values Exceed
The User Selected Actkxi^Ctxi^iianoe Limit
Unable to do Proportion Test with such pdidiigttas
9.1.2.2 Single Sample Sign Test with NDs
1. Click Hypothesis Testing ~ Single Sample ~ With NDs ~ Sign test
SJjProUCL 4.0 - [Worksheet.wstj
1








~5 FSc Edit Configure S-trnmary Sifitstjcs ROS Est. NDs Graphs Oirtiter Tests Goodness-of-Fit
Hypothesis Testing |
Backgrotrtd
UQ. Window Help
ol«al BlBlml nl
Navigation Panel j





Single Sample ~ j
FuS (vv/o NDs) ~
1

	.	„



Tw Sample ~ j
: .VYithMDs

Proportion '


0
1 1 :
2
3
i 4 1
-5	1
1 o
•~r
	"1
Sgntest
11
Name |

Sj£f>Test

Arsenic |
D_A/senic
L I 1



Wilcoxon Signed Rank

0 Worksheet wst

L	
1:
! 0
!




! ; 1 J

2
1 l.Oii1
L . . . )		

. __ „
" "I"




1 i 1 :
2. The Select Variables screen will appear.
•	Select variable (variables) from the Select Variables screen.
•	When the Options button is clicked, the following window will be shown.
113
-------
lH Single Sample. Sign;tes.t- Options
Confidence Level
Substantial Difference. S
(Used with Test Form 2)
0.55
51
Action/Compliance Limit | 0
"Select Null Hypothesis Form	
<• Median <= Compliance Limit (Form 1)
C Median >= Compliance Limit (Form 2)
f* Median :== Compliance Limit + S (Form 2)
C Median = Compliance Limit (2 Sided Alternative)
OK
Cancel
A
Specify the Confidence Level; default is 0.95.
Specify meaningful values for Substantial Difference, S and Action/Compliance
Limit.
Select the form of Null Hypothesis; default is Median <= Compliance Limit (Form
1}.
Click on OK button to continue or on Cancel button to cancel the test.
-------
Output for Single Sample Sign Test (Data with Nondetccts)
Arsenic
Single Sample Sign Test
Raw Statistics
Number of Valid Samples j	2£
Number of Distinct Samples j	10
Number of Non- Detect Data j	13
Number of Detected Data j"	11
Percent Non-DetectSj54.17% |
Minimum Non-detect-	0 2 j
Maximum Non-detectj	2
Minimum Detected!	0 5 !
Maximum Detected^	3 2 1
Mean of Detected Data]	1~23G :
Median of Detected Data |	0 7 '
SD of Detected Date ]	0 9S5 <
Number Above Limit^	0
Number Equal Limit;	0 f
Number Below Limit j	24 ¦
HO: Site Median <=5 (Form 1)
TestValuei 0 I	I
Upper CriticalValue (0.05) | 17 ,	'
T |	;
i
Conclusion with Alpha = 005
Do Not Reject HO. Conclude Median <= 5
P-Value > Alpha (0.05)	|
9.1.2.3 Single Sample Wilcoxon Signed Rank Test with NDs
1. Click Hypothesis Testing ~ Single Sample ~ With NDs ~ Wilcoxon Signed Rank
F?1 ProUCL 4.0 - [WorkSheet_a.wst] |
File Edt Configure Summary Statis&cs ROSEst. M)s Graphs Outlier Tests Goodness-of-fit
slal BlslmTil
Navigation Pane!
Hypothesis Testing
Single Sample >
Background UCL Window Help
FuD ('A'/o NDs) ~
Two Sample	Proportion
115
-------
The Select Variables Screen will appear.
•	Select variable (variables) from the Select Variables screen.
•	When the Options button is clicked, the following window will be shown.

Confidence Level | 0.95
Substantial Difference. S
(Used with Test Form 2)
Action/Compliance Limit | 0
-Select Mull Hypothesis Form	
(* Mean/Median <= Compliance Limit (Form 1)
f* Mean/Median ;= Compliance Limit (Form 2)
C Mean/Median >= Compliance Limit + S (Form 2)
Mean/Median = Compliance Limit (2 Sided Alternative)
OK	Cancel
A
o Specify the Confidence Level; default is 0.95.
o Specify meaningful values for Substantial Difference, S and Action/Compliance
Limit.
o Select the form of Null Hypothesis; default is Mean/Median <= Compliance Limit
(Form 1).
o Click on OK button to continue or on Cancel button to cancel the test.
-------
Output for Single Sample Wilcoxon Signed Rank Test (Data with Nondetects)
Single Sample Wilcoxon Signed Rank Test
Raw Statistics
Number of Valid Samples | 24
Number of Distinct Samples| 10	j
Number of Non-Detect Data; 13	i
Number of Detected Data j 11	I
Percent Non-beiecti;5i.17vi j
Minimum Non-detect
09
Maximum Non-detect
2
Minimum Detected
05
Maximum Detected
"3 2
Mean of Detected Data
"l 236 "
Median of Detected Data
0 7
SD of Detected Data
0 965
Number Above Limit
0 "
Number Equal Limit
o
	
Number Below Limit, 2-
T-plusj 0
T-nnnusj 300
HO: Site Median <:= 6 (Form 1)
Large Sample z-Test Valuej -<293 |
Critical Value (005) j 1 W5 j
P-Value | 1
Conclusion with Alpha = 0.05
Do Not Reject HO. Conclude Mean/Median <=6
P-Va I ue > Al pha (0.05)
Dataset contains multiple Nan-Detect values!
All Observations <2 are treated as Mori-Detects
-------
9.2 Two-Sample Hypotheses Testing Approaches
In this section, the two-sample hypotheses testing approaches as incorporated in ProUCL 4.0 have been
illustrated. These approaches are used to compare the parameters and distributions of the two populations
(e.g., Background vs. AOC) based upon data sets collected from those populations. Both forms (Form 1
and Form 2, Form 2 with Substantial Difference, S) of two-sample hypothesis testing approaches have
been included in ProUCL 4.0. The methods are available for full data sets as well as for data sets with
below detection limit (BDL) values.
•	Full - analyzes data sets consisting of all detected values. The following parametric and
nonparametric tests are available:
o Student's t and Satterthwaite tests to compare the means of two populations (e.g.
Background versus AOC).
o F-test to the check the equalky of dispersions of two populations,
o Two-sample nonparametric Wilcoxon-Mann-Whitney (WMW) test. This test is
equivalent to Wilcoxon Rank Sum (WRS) test,
o Quantile test is often used to compare upper tails of two data distributions. This test
is normally performed in parallel with WMW test.
•	With NDs - analyzes data sets consisting of both nondetected and detected values. The
following tests are available:
o Wilcoxon-Mann-Whitney test. All observations (including detected values) below the
highest detection limit are treated as ND (less than the highest DL) values,
o Quantile test is used to compare upper tails of two data distributions. This test is
performed in parallel with WMW test,
o Gehan's test, useful when multiple detection limits may be present.
The details of these methods can be found in the ProUCL 4.0 Technical Guide and are also available in
EPA (1997, 2006). It is re-stated that the use of informal graphical displays (e.g., side-by-side box plots,
multiple Q-Q plots) should always accompany the formal hypothesis testing approaches listed above.
This is especially warranted when the data sets may consist of observations from multiple populations
(e.g., mixture samples collected from various onsite locations) and outliers.
Note: As mentioned before, it is pointed out that, when one wants to use two-sample hypotheses tests on
data sets with NDs, ProUCL 4.0 assumes that samples from both of the groups have nondetect
observations. This may not be the case, as data from a polluted site may not have any ND observations.
ProUCL can handle such data sets. However, the user will have to provide a ND column (with 0 or 1
entries only) for the selected variable of each of the two groups. Thus when one of the groups (e.g., site
arsenic) has no ND value, the user supplies an associated ND column with all entries equal to "1. " This
will allow the user to compare two groups (e.g., arsenic in background vs. site samples) with one of the
group having some NDs and the other group having all detected data.
118
-------
9.2.1 Two-Sample Hypothesis Tests for Full Data
Full - This option is used to analyze data sets consisting of all detected values. The following two-sample
tests are available in ProUCL 4.0.
•	Student's t and Satterthwaite tests to compare the means of two populations (e.g.,
Background versus AOC).
•	F-test is also available to test the equality of dispersions of two populations.
•	Two-sample nonparametric Wilcoxon-Mann-Whitney (WMW) test.
•	Two-sample quantile test.
•	Student's t-Test
o This test can be used to compare the site mean concentration of a COPC with that of
the background mean concentration provided the populations are normally
distributed. The data sets are given by independent random observations, X|, X2,. . .,
Xn collected from a site, and independent random observations, Y|, Y2,. . ., Ym
collected from a background population. The same terminology is used for all other
two-sample tests in ProUCL 4.0.
o Student's t-test also assumes that the spread (variance) of the two populations are
approximately equal.
o The F-test can be used to the check the equality of dispersions of two populations.
•	Satterthwaite t-Test
o This test is used to compare the population means of two populations when the
variances or Spreads of those populations may not be equal. As mentioned before, the
F-distribution based test can be used to verify the equality of dispersions of two
populations.
•	Test for Equality of two Dispersions (F-test)
o This test is used to determine whether the true underlying variances of two
populations are equal. Usually the F-test is employed as a preliminary test, before
conducting the two-sample t-test for testing the equality of means of two populations.
o The assumptions underlying the F-test are that the two-samples represent
independent random samples from two normal populations. The F-test for equality of
variances is highly sensitive to departures from normality.
•	Two-Sample Nonparametric WMW Test
o This test is used to determine the approximate equality of the two continuous data
distributions. This test also assumes that the shapes (e.g., as determined by spread,
skewness, and graphical displays) of the two populations are roughly equal. The test
is often used to determine if the measures of central locations of the two populations
119
-------
are significantly different. Specifically, the test can be used to determine if the site
concentrations exceed the background concentrations,
o The Wilcoxon-Mann-Whitney test docs not assume that the data are normally or log-
normally distributed. For large samples (e.g., > 20), the distribution of the WMW test
statistic can be approximated by a normal distribution,
o This test is used to determine if measurements from one population consistently tend
to be larger (or smaller) than those from the other population.
• Two-Sample Quantile Test
o The nonparametric quantile test does not assume that the data are normally or log-
normally distributed. For large samples (e.g., > 20), the distribution of the quantile
test statistic can be approximated by a normal distribution,
o This test is used in parallel with the WMW test. This test is often used in Background
Test Form 1 to determine if the concentrations from the upper tail of site data
distribution are comparable to (lower than or equal to) that of the background data
distribution. The critical values for this Form 1 test are available in EPA, 1994. The
details of the test are given in EPA (1994, 2006).
Note: The use of the tests listed above is not recommended on log-transformed data sets, especially when
the parameters of interests are the population means. In practice, the cleanup and remediation decisions
have to be made in original scale based upon statistics and estimates computed in the original scale. The
equality of means in log-scale does not necessarily imply the equality of means in the original scale. This
topic is discussed in detail in Chapter 3 of the revised background document (EPA, 2002) for CERCLA
sites (currently under revision).
1.
Click on Hypothesis Testing ~ Two Sample

' Fte Edit Configu-e Sunmary Statistics ROSEsLNDs Graphs Outfcr Tests Goodness-of-Fit
Background UCL Wndow Help
ol«sl BlBlml
Navigation Pane! )
Name
0 Worksheet wst
0 WorkSheet_a.wst
Srtgie Sample ~
-s—j

I
"1
tTest
Wfcoxon -Mam -Whi tne y
Quantile test
I
I
. ..I	
2. Select the Full (w/o NDs) option
•	To perform a t-test, click on t Test from the drop-down menu.
•	To perform a Wilcoxon-Mann-Whitney, click on Wilcoxon-Mann-Whitney from the
drop-down menu list.
To perform a quantile test, click on Quantile test from the drop-down menu.
120
-------
9.2.1.1 Two-Sample t-Test without NDs
1. Click Hypothesis Testing ~ Two Sample ~ Full (w/o NDs) ~ t Test
SJjProUCL 4.0 - [C:\Narain\ProUCL-Data\Data\WSR EPA (2006).wst|
c@r!e EC a; Configure Summary Statistics ROSEst. NDs Graphs Cutter Tests Gjodness-of-Fit	Sac'ojroLtnd UCL Window neb
I Single S*mo!e ~ 3Z
q|<&l elalml ml
navigation Panel
Name	|
0 Worksheet.vvst
©WSREPA (2008) wst
VVSR1
97-1
1.044 '
" iToei"
WSR2
Two Sample ~ | Ft! (w/o MDs) ~ | t Test
Arsenic D Arsenic

ith NDs
~ j \ Y3 :o>. on -Minn ¦ \7hi ;n e y
Qusntile test
588
"5 89!
'us'
2. The Select Variables screen will appear.
•	Select variable (variables) from the Select Variables screen.
•	Without Group Variable: This option is used when the data values of the variable
(COPC) for the site and the background are given in separate columns.
•	With Croup Variable: This option is used when data values of the variable (COPC) for
the site and the background are given in the same column. The values are separated into
different populations (groups) by the values of an associated Group Variable. The group
variable may represent several populations (e.g., several AOCs, MWs). The user can
compare two groups at a time by using this option.
•	When using this option, the user should select a group variable by clicking the arrow next
to the Group Var option for a drop-down list of available variable. The user selects an
appropriate (meaningful) variable representing groups such as Background and AOC.
The user is allowed to use letters, numbers, or alphanumeric labels for the group names.
o When the options button is clicked, the following window will be shown.
121
-------
¦Site'vs Background Comparison
Substantial Difference. S | o~
(Used with Test Form 2)
r Confidence Coefficient	
C 99 9%
r 99.5%
r 99%
r 97.5%
c 95%
r 90%
¦Select Null Hypothesis Form-
(* AOC <= Background (Form
C AOC >= Background (Form 2)
C AOC >= Background + S (Form 2)
C AOC = Background (2 Sided)
OK
Cancel
o	Specify a useful Substantial Difference, S value. The default choice is 0.
o	Choose the Confidence level. The default choice is 95%.
o	Select the form of Null Hypothesis. The default is AOC <= Background (Form 1).
o	Click on OK button to continue or on Cancel button to cancel the option.
Click on the OK button to continue or on the Cancel button to cancel the Site versus
Background Comparison.
-------
Output for Two-Sample t-Tcst (Full Data without NDs)
Raw Statistics


Site
Background
Number of Valid Samples
77
47

Number of Distinct Samples
57 | 40


Minimum
0.09
0.22

Maximum
163.6
1 33

Mean
3.915
0.599


Median
043
054

SD
20.02
0 234

SE of Mean
2.231
0 0414






Site vs Background Two-Sample t-Test





HO: Muof Site - Mu of Background <=0





t-Test
Critical

Method
DF
Value
t (0 050)
P-Value
Pooled (Equal Variance)
122
1.134
1.657
0129
Satterthwaite (Unequal Variance)
76.1
1.454
1.665
0075
Pooled SD 15.799
Conclusion with Alpha = 0.050
' Student t (Pooled) Test: Do Not Reject HO. Conclude Site <= Background
' Satterthwaite Test. Do Not Reject HO. Conclude Site <= Background
9.2.1.2 Two-Sample Wilcoxon-Mann-Whitnev (WMW) Test without NDs
1. Click Hypothesis Testing ~ Two Sample ~ Full (w/o NDs) ~ Wilcoxon-Mann-Whitney
Test
123
-------
2.
The Select Variables Screen will appear.
Select-Variables
Variables
Name
1 ID
\ Count |
Stfe
1
10
Background
2
15
Site
A
15
c Without Group Variable
» I Background / Ambient
r
Area of Concern / Site [~
r With Group Variable
» | Variable	|
Group Var	|	3
Background / Ambient	|	3
Area of Concern / Site	[
Options
OK
Cancel
•	Select variable (variables) from the Select Variables screen.
•	Without Group Variable: This option is used when the data values of the variable
(COPC) for the site and the background are given in separate columns.
•	With Group Variable: This option is used when data values of the variable (COPC) for
the site and the background are given in the same column. The values are separated into
different populations (groups) by the values of an associated Group Variable. When using
this option, the user should select a group variable by clicking the arrow next to the
Group Var option for a drop-down list of available variables.
• ProUCL 4.0 has been written using environmental terminology such as performing
background versus site comparisons. However, all the tests and procedures in ProUCL
4.0 can be used for any other application if used properly. The user selects an appropriate
group variable representing groups such as Background and AOC. For other applications
such as comparing a new treatment drug versus older treatment drug, the group variable
may represent the two groups: old drug group and new drug group. The user is allowed to
use letters, numbers, or alphanumeric labels for the group names.
• When the Options button is clicked, the following window is shown.
124
-------
Site vs Background Comparison
Substantial Difference. S J o
(Used v/ith Test Form 2)
Confidence Coefficient	
r 99 9'A	C 97.5%
r 99.5%	95%
r 99%	c 90%
Select Null Hypothesis Form	
(* AOC <= Background (Form 1)
C AOC ;= Background (Form 2)
C AOC ;= Background + S (Form 2)
C AOC = Background (2 Sided)
OK	Cancel
o	Specify a Substantial Difference, S value. The default choice is 0.
o	Choose the Confidence level. The default choice is 95%.
o	Select the form of Null Hypothesis. The default is AOC <= Background (Form 1).
o	Click on OK button to continue or on Cancel button to cancel the selected options.
• Click on the OK button to continue or on the Cancel button to cancel the Site versus
Background Comparison.
125
-------
Output for Two-Sample Wilcoxon-Mann-Whitney Test (Full Data)
Area of Concern Data; Site
Background Data: Background
Raw Statistics
Number of Valid Samples
j Site
...
[Background
¦flb~ j
Number of Distinct Samples
1 9
I 8 !
Minimum
! 15
I 23 I
Maximum
I 100
[ 79 i
Mean
'j 487
| 49 5 j
Median
i 345
J 505 J
^D_
| 33 36
j 16 76 ;
SE of Mean
i 10.55
1 53 i
Wilcoxon-Mann-V/hhney (WMW) Test
HO: Mean/Median of Site or AOC <- Mean/Median at Background
Site Rank Sum W-Stat;
WMW Test U-Stati
' WMW Cnti^ValuMO.OfDj"
S7 5
42.5
T2
Approximate P-Value I 0.727
Conclusion with Alpha = 0.05
Do Not Reject HO. Conclude Site <= Background
9.2.1.3 Two-Sample Quantile Test for Full Data without NDs
As mentioned before, the quantile test is often used in parallel with the WMW test. Typically, both tests
arc performed on the same data set before coming to the conclusion about comparability (or non-
comparability) of the data distributions of the two populations.
1. Click Hypothesis Testing ~ Two Sample ~ Full (w/o NDs) ~ Quantile test
S^ProUCL1 -4.0,; [C:\Narain\proUCLDdta\pata\Quantne.W5t]
c§ File Edit Configure Summary Statrebcs ROSEst, NDs Graphs Outlier Tests Goodness-of-Fit jirmriinm Background UCL Window Help
j Single Sample ~ \

0
Background
'A
36*
Background
D_Background
is:
15~
Two'Sample I * Fufl (w/o NDs)
Site
D_Site
5j
"T6:
With NDs
tTest	U—
Wdcoxon-Mann-Whitney )Q
126
-------
2. The Select Variables Screen shown below will appear.
Variables
1 Name
1 ID
I Count |
1 Background
1-0
.10
Sie
1
10
Background
2
15
Sle
i
15
f7 Without Group Variable
» | Background / Ambient
Area of Concern / Site
With Group Variable
>> | Variable
Group Var
Background / Ambient
Area of Concern / Site

J
^3
Options
OK
Cancel
Select variable (variables) from the Select Variables screen.
Without Group Variable: This option is used when the data values of the variable
(COPC) for the site and the background are given in different columns.
With Group Variable: This option is used when data values of the variable (COPC) for
the site and the background are given in the same column. The values are separated into
different groups by using the values of the associated Group Variable. When using this
option, the user should select a group variable by clicking the arrow next to the Group
Var option for a drop-down list of available variables. The user selects an appropriate
group variable representing groups such as Background and AOC. The user is allowed to
use letters, numbers, or alphanumeric labels for the group names.
When the Options button is clicked, the following window will be shown.
SI Quantile, Test-Options
"Select Confidence Coefficient	
C m	C 97 5V„
C 95'/.	r 90%
OK
Cancel
A
127
-------
o Choose the Confidence level; the default choice is 95%.
o Click on OK button to continue or on Cancel button to cancel the option.
• Click on the OK button to continue or on the Cancel button to cancel the Site versus
Background Comparison.
Output for Two-Sample Quantile Test (Full Data)
Area of Concern Data: Site
Background Data: Background
Raw Statistics
i Site
Number of Valid Samples
Number of Distinct Samples
10
T
Minimum
15
Maximum
"ibo"~
Mean
4S.7-
Median
345
SD
33 36
SE of Mean
~ 10 55
Background
"10
- 2;r "
79
49.5
50.5
16.76
5~3~
Quantile Test
HO: Site Concentration <= Background Concentration (Form 1)
Approximate R Value (0 043) j
Approx i mate K Va I ue [0 043)!
Number of Site Observations in 'R' Largest'
Calculated Alphaj
0 0433 i
J433J
Conclusion with Alpha = 0.043
Do Not Reject HO. Perform Wilcoxon-ftfarn-Whitney Ranked Sum Test
128
-------
9.2.2 Two-Sample Hypothesis Testing for Data Sets with Nondetects
1. Click Hypothesis Testing^ Two Sample
^jProUCL 4.0 ? [C:\Narain\ProUCLrbata\Data\Quantile.wst]
ay, Fde Edit Configure Summary Statistics ROSEst. NDs Graphs Outlier Teste Goodness-of-Fit	Background UCL Window Help
~j Single Sample ^	
'cM !5l Blml El
0
Background
Background
Fufl (w/o NDs) ~ _
D_Background
Site
23;
j
3-S:
15,
15
5
io!
-3—I
D_S)te
] Wilcoxon-Mari-Whitney
Gehan
Quantile Test
2. Select the With NDs option. A list of available tests will appear (shown above).
•	To perform a Wilcoxon-Mann-Whitney test, click on Wilcoxon-Mann-Whitney from the
drop-down menu list.
•	To perform a Gehan test, click on Gehan from the drop-down menu.
•	To perform a quantile test, click on Quantile Test from the drop-down menu.
9.2.2.1 Two-Sample Wilcoxon-Mann-Whitney Test with Nondetects
1. Click Hypothesis Testing ~ Two Sample ~ With NDs ~ Wilcoxon-Mann-Whitney
E£JproUCl_ 4.0.- [C:\Narain\PrqUCL-Dal4\Data\Quahtile.wst]
~£? Re Edit Configure Summary Statistics ROSEst. NDs Graphs Outier Tests Goodness-of-Fit	Background UCL Window Hetp
I Sffigle Sample ~ 1~
ei|
-------
2.
The Select Variables Screen shown below will appear.
Select-Variables
Variables
Name
Tip
Background -
Site	1
Background 2
Site	A
Court
10
15
15
Without Group Variable
>> | Background / Ambient
>> I Area of Concern / Site
r With Group Variable
>> | Variable
Group Var
Background / Ambient
Area of Concern I Site
Options
OK
3]
~3
Cancel
•	Select variable (variables) from the Select Variables screen.
•	Without Group Variable: This option is used when the data values of the variable
(COPC) for the site and the background are given in separate columns.
•	With Group Variable: This option is used when data values of the variable (COPC) for
the site and the background are given in the same column. The values are separated into
different populations (groups) by the values of an associated Group Variable. When using
this option, the user should select a group variable by clicking the arrow next to the
Group Var option for a drop-down list of available variables. The user selects an
appropriate variable representing groups such as Background and AOC. The user is
allowed to use letters, numbers, or alphanumeric labels for the group names.
•	When the Options button is clicked, the following window will be shown.
130
-------
Site vs Background Comparison
Substantial Difference. S | o~
(Used with Test Form 2)
—Confidence Coefficient	
r 99.9%	C 97.5%
C 99.5%
C 99%
f? 95%
r 90%
"Select Null Hypothesis Form"
(* AOC <= Background (Form 1)
C AOC 5= Background (Form 2)
C AOC >= Background + S (Form 2)
C AOC = Background (2 Sided)
OK
Cancel
o Specify a meaningful Substantial Difference, S value. The default choice is 0.
o Choose the Confidence level. The default choice is 95%.
o Select the form of Null Hypothesis. The default is AOC <= Background (Form 1).
o Click on the OK button to continue or on the Cancel button to cancel the selected
options.
Click on OK button to continue or on Cancel button to cancel the Site versus Background
Comparison.
131
-------
Output for Two-Sample Wilcoxon-Mann-Whitney Test (with Nondetects)
Area of Concern Data: Site
Background Data: Background
Raw Statistics
j Site
Number of Valid Samples i 15
Number of Hon-Detect Data
Number of Detect Data
i8
Minimum Non-Detect
Maximum Non-Detect
| Background
in™
i12	
3
5
"300
Percent Non detects j 53 33 X
3
|J5~
\ SO 00 A
Minimum Detected
Maximum Detected
Mean of Detected Data
Median of Detected Data
11
-20lT
22
"74 43 i 15
70 f"T5
SD of Detected Data
6342
_L_
Wilcoxon-Mann-Whitney Site vs Badupuuiid Test
All observations <=300 (Max DL) are ranked the same
Wilcoxon-Mann-Whitney (WMW) Test'
HO: Mean/Median of Site or AOC <= Mean/Median of Background
Site Rank Sum W-Stat j 232 5
vMvTestlJStatjF1275~
WMW Critical Value (0 050)! 152
Approximate P-Value 0.503
Conclusion with Alpha = 0.05 •
Do Not Reject HO. Conclude She <= Background
Note: In the WMW test, all observations below the largest detection limit are considered as NDs
(potentially including some detected values) and hence they all receive the same average rank. This
action may reduce the associated power of the WMW test considerably. This in turn may lead to incorrect
conclusion. As mentioned before, all hypotheses testing approaches should be supplemented with
graphical displays such as Q-Q plots and box plots. When multiple detection limits are present, the use of
the Gehan test is preferable.
132
-------
9.2.2.2 Two-Sample Gehan Test for Data Sets with Nondetects
1. Click Hypothesis Testing ~ Two Sample ~ With NDs ~ Gchan
EJJProUCL 4.0 - [C:\Narain\ProUCL-Oflta\Data\Ouantile.wst]
Hypothesis Testing
Fie Edit Configure Summary Statistics ROS Est NDs Graphs Outfer Tests Goodness-of-Fit Background UCL Window Help
.^1 e>,| raJnlml ml	1 Single Sample >\		
r~
0 ^
1
2

| ruir V"7° NUV r
¦22E!E39^D
| Bsckground
Site
Background
D_Background | Site j D_Site j
I I I
"3S
15
15
10
.. i.
EE
Gehan
Quantile Test
2. The Select Variables Screen will appear.
Variables
Name
I ID
I Count |
Site
H9HI
1
10
Background
2
15
Site
4
15
Count I Without Group Variable
Background / Ambient
» Area of Concern / Site
i" With Group Variable
» | Variable	[~
Group Var	|
Background / Ambient	|
Area of Concern I Site	[~
Options
OK

J
3]
Cancel
Select variable (variables) from the Select Variables screen.
Without Group Variable: This option is used when the data values of the variable
(COPC) for the site and the background are given in separate columns.
With Group Variable: This option is used when data values of the variable (COPC) for
the site and the background are given in the same column. The values arc separated into
different populations (groups) by the values of an associated Group Variable. When using
this option, the user should select a group variable by clicking the arrow next to the
Group Var option for a drop-down list of available variables. The user selects a group
variable representing groups such as Background and AOC.
When the Options button is clicked, the following window will be shown.
133
-------
Siteys Background Comparison
Substantial Difference. S | q~
(Used with Test Form 2)
"Confidence Coefficient	
c 99.9%	r 97.5%
r 99.5=4	a 95=4
c 99%	r 90%
-Select Null Hypothesis Form	
(* AOC <= Background {Form 1)
C AOC "5= Background (Form 2)
C AOC ¦>= Background + S (Form 2)
C AOC = Background (2 Sided)
OK
Cancel
o	Specify a Substantial Difference, S value. The default choice is 0.
o	Choose the Confidence level. The default choice is 95%.
o	Select the form of Null Hypothesis. The default is AOC <= Background (Form 1).
o	Click on OK button to continue or on Cancel button to cancel selected options.
• Click on the OK button to continue or on the Cancel button to cancel the Site versus
Background Comparison.
134 '
-------
Output for Two-Sample Gchan Test (with Nondetccts)
Area of Concern Data: Site
Background Data: Background
Raw Statistics
! Site
Number of Valid Samples \ 10
jBackground
Number of Non-Detect Data ~j2
Number of Detect Data
Minimum Non-Detect
Maximum Non-Detect
FercenfNon detects
Minimum Detected
Maximum Detected
Mean of Detected Data
Medien of Detected Data
SD of Detected Data
4
~35
20.00%
2~~
~~43
23 S3
"22 5""
"474"
25
«T60% ~
i
27
1217 |
j. -3
9.642 |
Sitevs Background Gehan Test
HO: Muof Site or AOC >= Mu of background
Gehan z Test Value' 1769
=rl--
Critical z (0.05); -1.645 !
P-Value
0 962
Concl us i on with Alpha = 0.05
Do Not Reject HO. Conclude Site >= Background
P-Value >= alpha (0.05)
9.2.2.3 Two-Sample Quantile Test for Data Sets with Nondetects
Quantile test as described in EPA (1994) has been included in ProUCL 4.0. The detailed power of the test
with many ND values is not well studied. The conclusion of this test should also be supplemented with
graphical displays. The use of the Gehan test is preferred when the data set may consist of many NDs
with multiple detection limits.
135
-------
Click Hypothesis Testing ~ Two Sample ~ With NDs ~ Quantile Test
£2}ProUCL 4.0 - [C;\Nardin\ProU(X-Dflta\Data\QurtiUile.wst]
~Q F3e Edit Configue Summary Statistics ROSEst. NDs Graphs Outlier Tests Goodness-of-Rt	Background UCl Window Help
¦jl	I Single Sample ~
C3l> | ' Variable	|
Group Var	|	T]
Background / Ambient |	3
Area of Concern / Site	|	33
Options
OK
Csnc«l
•	Select variable (variables) from the Select Variables screen.
•	Without Group Variable: This option is used when the data values of the variable
(COPC) for the site and the background are given in separate columns.
•	With Group Variable: This option is used when data values of the variable (COPC) for
the site and the background are given in the same column. The values are separated into
different populations (groups) by the values of an associated Group Variable. When using
this option, the user should select a group variable by clicking the arrow next to the
Group Var option for a drop-down list of available variables. The user selects an
appropriate group variable representing groups such as Background and AOC.
•	When the Options button is clicked, the following window will be shown.
136
-------
BSl Quantile Test- Options [®](|
|j

Select Confidence Coefficient


r 99!/„
r 97.5'/.


95V.
r go'/.


OK
Cancel



MM
o Choose the Confidence level; the default choice is 95%.
o Click on OK button to continue or on Cancel button to cancel the option.
• Click on the OK button to continue or on the Cancel button to cancel the Site versus
Background Comparison.
Output for Two-Sample Quantile Test (with Nondetects)
Area of Concern Dab: Site
Background Data: Background
Raw Statistics
Site
Number of Valid Samples »15
| Background
lis" r
Number of Non-Detect Data
Number of Detect Data
Minimum Non-Detect
Maximum Non-Detect
Percent Won detects
7
~ *5"
• 300
5133%"
-I?
.L
3
25
)oi>V
Minimum Detected
! 11
8
|
Maximum Detected
T¥' "
22
I
Mean of Detected Data
~ 74~43
" 15 "
......
Median of Detected Data
-po~~
" 15~

SD oTDetected Data
j 68 42
7
I
Quantile Test
HO: Site Concentration <= Background Concentration (Fom 1)
Approximate R Value (005) | 4
Approxin-.ate K Value (0 05)' 4
Number of Sits Observations m 'R' Largest] 4 <
Non-Detect Values in the'R' Largest- Cannot compietE Quantile Test
137
-------
Chapter 10
Background Statistics
This chapter illustrates the computations of various parametric and nonparametric statistics and upper
limits that can be used as estimates of background threshold values (BTVs) and other not-to-excecd
values. The BTV estimation methods are available for all data sets with and without nondctcct (ND)
observations. The details of those methods are given in Chapter 5 (full data sets without NDs) and
Chapter 6 (data sets with NDs) of the revised background document for CERCLA sites (EPA, 2002).
Technical details can also be found in the Technical Guide associated with ProUCL 4.0. For each selected
variable, this option computes various upper limits such as UPLs, UTLs, and upper percentiles to estimate
the background threshold values (BTVs) and other compliance limits that are used in site versus
background evaluations.
As before, two choices for data sets are available to compute background statistics:
•	Full - computes background statistics for a Full data set without any NDs.
•	With NDs - computes background statistics for a data set with nondetected as wells as
detected values. Multiple detection limits are allowed.
The user specifies the confidence level (probability) associated with each interval estimate. The
reasonable confidence level as incorporated in ProUCL 4.0 represents a number in the interval [0.5, 1),
0.5 inclusive. The default choice is 0.95.
For data sets with and without NDs, ProUCL 4.0 can compute the following statistics that can be used as
estimates of BTVs and not-to-excecd values.
•	Parametric and nonparametric upper percentiles.
•	Parametric and nonparametric upper prediction limits (UPLs) for a single observation,
future or next k (> 1) observations, mean of next k observations. Here future k, or next k
observations may also represent k observations from another population (e.g., site)
different from the sampled (background) population (used to compute UPLs, UTLs).
•	Parametric and nonparametric upper tolerance Limits (UTLs).
•	Nonparametric IQR-based upper limits.
139
-------
10.1 Background Statistics for Full Data Sets without Nondetects
l. Click Background ~ Full (\v/o NDs) Background Statistics
Fie EditCortftgire Summary SatstjcsROSEst. fCs Graphs OutSer Tests Goodness-of-ftt Hypothesis Testrg UCLWndowHelp
enl<&l
navigation Panel
Nam9
0 Worksheet wst
0 Cadmium wst
Pop-ID ] Cadmium \
J 02Ji
1 026'
4 |5_
6 Lr
With fJDs Background Statistics
1
2. The Select Variables Screen (Chapter 3) will appear.
•	Select a variable (variables) from the Select Variables screen.
•	If needed, select a group variable by clicking the arrow below the Group by variable to
obtain a drop-down list of available variables and select an appropriate group variable.
140
-------
When the option button is clicked, the following window will be shown.
o Specify the Confidence Level; a number in the interval [0.5, 1), 0.5 inclusive. The
default choice is 0.95.
o Specify the Coverage coefficient (for a percentile) needed to compute UTLs.
Coverage represents a number in the interval (0.0, 1). The default choice is 0.9.
Remember, a UTL is an upper confidence limit (e.g., with confidence level = 0.95)
for a 90% (e.g., with coverage = 0.90) percentile,
o Specify the Different or Future K Values. The default choice is 1. It is noted that
when K = 1, the resulting interval will be a UPL for a single future (or site)
observations. In the example shown above, a value of K = 1 has been used,
o Specify the Number of Bootstrap Operations (rcsamples). The default choice is
2000.
o Click on OK button to continue or on Cancel button to cancel this option.
• Click on OK button to continue or on Cancel button to cancel the Background Statistics
Options.
141
-------
Output Scrccn for Normal Distribution (Full)

! I | | I
Cadmium

|

	
Raw Statistics
1
t
	

Number of Valid Samples!
33j


Number of Unique Samples j
28!
t

	 -
Mimniurr.|
Maxiir.urri|
0094]
'21;
	

Second Largestj
6 957
3024

Meanj
	
First Quantilej
Median^
" 0 25]
"25
	_

Third Quantilei
3 8i
J


SDj
3 751
1
1

Coefficient of Variation i
"1 2ii

	
Skewnessj
"3.607]
~"l


Normal DisbibulianTest



Shapiro Wilk Test Statistic?
C[642i


5'/. Shapiro Wilk Critical Value,
0.931


Data not Normal at 5% Significance Levd



Normal Distribution Test



Shapiro Wilk Test Statistic;
0 642


5% Shapiro Wilk Critical Value-
0 931

	
Data not Normal at 5% Significance Level


Background Statistics Assisting Nonna) Distribution



90% Percentile (z)|
7.B3


95% Percentile (z)j
S 193


99% Percentile (z)j
""11*75






95% UTL with 90% Coverage)
9 549






95% UPL (t)|
9472

Note: UPL (or upper percentile for garrnia distributed
	
		

data)'represents a preferred estimate of BTV


142
-------
Output Screen for Lognormal Distribution (Full)
III 1

Log-Transformed Satisfies

Number of Valid Samples ! 33

Number of Unique Samples! 28
Minimum: -2 36^
Maximum! 3 0*5
Secona Largest1	1 9-1
Mean.	0 459
First Ouantiie'	1.653
Median,	0 916
Third Guantile	0 709
SD^	1 308
Lognormal Distribution Test
Shapiro Wilk Te.ci Statistic 0.909
5'/. Shapiro Wilk Critical Value^ 0 931
Data not Lognormal at 5% Significance Level
Background Statistics Assuming Lognonred Distribution
&0X Percentile (~z)' " 8*58
95 V. Percentile (z); 13 &
99V, Percentile [z)< 33.13
95 Vc UPL' 15
95'/, UTL with 90Vo Coverage 15 £
Background Statistics Assuming Lognafmal Distribution
90V, Percentile izj! 8 *5S
95 a Percentife (z), 13 6
99V, Percentile (zj; 33.18
"	" " "95V; UPLr" 15
95X UTLv.ith 90V, Coverage^ 15£
Some Nonparametric Background Statistics
95V, Chebyshev UPL1 19 &2
95X Bootstrap BCA UTL with 90 V, Coveragej S 737
95V, Percentile Bootstrap UTL with 90% Coverage' 6 967
Note: UPL (or upper percentBe for garrvna distributed
data) represents a preferred esliiivde of BTV
-------
10.1.2 Gamma Distribution
1. Click Background ~ Full (w/o,NDs) Background Statistics ~ Gamma
^jProUCH.O -,[C:\Nurain>ProUCL-Data\Data\tadniium.W3tJ'
ay f=i!e =dft ConSgire Summary Statistics ROSEstNDs Graphs OutEer Tests Goodness-of-Fit Hypothess Testng
IUCL Window Htip
z>\<&I BlBlml ml
Background
Fid (w/bNDs)Bactcgiound Statistics *
Navigation Pane!
Name
Q WorkSheet.wst
O Cadmium, wst
0
1 2
3 4
5 1s1,


Pop-ID
Cadmium
i
1

1 1
J	i __ . I;
Jj . v
0 2i
0 26"
logncrmai
Non-Paramelnc
All
2. The Select Variables Screen (Chapter 3) will appear.
•	Select a variable (variables) from the Select Variables screen.
•	If needed, select a group variable by clicking the arrow below the Group by variable to
obtain a drop-down list of available variables, and select a proper group variable.
•	When the option button is clicked, the following window will be shown.
Background Statistics Options
Confidence Level
Coverage
Different or Future K Values
Number of Bootstrap Opeiations
OK
0.9
2000
Cancel
A
o
o
o
o
Specify the Confidence Level; a number in the interval [0.5, 1), 0.5 inclusive. The
default choice is 0.95.
Specify the Coverage level; a number in interval (0.0, 1). Default choice is 0.9.
Specify the next K. The default choice is 1.
Specify the Number of Bootstrap Operations. The default choice is 2000.
Click on OK button to continue or on Cancel button to cancel the option.
Click on OK button to continue or on Cancel button to cancel the Background Statistics
Options.
144
-------
Output Screen for Gamma Distribution (Full)
Cadmium
Raw Statistics
Number of Valid Samples'^	i
Number of Unique Samples;28	I
Minimum!	0094	,
Maximum! 21	!
Second Largestj	6 9S7	,
Mean!	3 024	i
First Quantile]	0 35	;
Median;	2 5
third Quantilet	3.8	;
SD'~	3751
Gamma Distribution Test
k h=t
Theta hat'
nu hat ¦
k star!
+
0 902 ;
3.352 ;
~59.54 j
- i
0S4 i
3 598 '
55 4S
5 356
0 794
5"% A-D Critical Value| 0 78 '
K-S Test Statistic; 0143 f"
5% K-S Critical Value; 0 158
Data follow Appr. Gamma Distribution at 5% Significance Level
Theta star|
nustar1
95% Percentile of Chisquare (2k)|
A-D Test Statistic!
Background Statistics Assuming Gamma Distribution	|
90% Percentilej 7 265	j
95% Percentilej" 9 637	j
99% Percentile; 15.22	j
Nonparametic Background Statistics
Clieb^sTiev UP'lT	19 62
95% BCA Bootstrap UTL with 90a Coverage!	6 737
95% Bootstrap (:/,) UTL aith 90% Coverage;	6 737
•r
Note: U PL (or upper percentile far gamma distributed
data) represents a preferred estimate of BTV
-------
10.1.3 Nonparametric Methods
l. Click Background ~ Full (w/o NDs) Background Statistics ~ Non-Paramctric
Pe? File Edit Configure Smiwnary Statutes ROS Est. NDs Graphs Outfer Tests Goodness-of-Fit Hypothesis Testing |

E3|
-------
Output Screen for Nonparamctric Option (Full)
Cadmium
Some Non-Parametric Statistics	j
Number ofVehd Samples1	33'
Number of Unique Samples 1	28;
Minimum1	0 09^
MaximunV	21j
Second Largest!	6 9671
Mean;	3 02-]
FirstQusntile	0 35<
Median-	2 5i
Third Quantile1	3 3^
_ sDr	3~751j
Variance1	14 07:
Coefficient of Variation	1.2—!
Skewness'	3 607,
Mean of Log-Transformeddatai	0 4-59j
SD of Log-Transformed data,	1 308 (
_ i	- .
Data Follow Appr. Gamma Distribution at 5X Significance Levd) [
Non-Parametric Background Statistics	j
90% Percentile, 527J
95% Percentile^ 6^68;
7!
' 1
99% Percentile. 16 37;
95% UTLwith 90% Coverage	j
Order Statistic J	32
UTL 6 §67 i
95% BCA Bootstrap UTL with 90V* Coveragej 6 737
95% Percentile Bootstrap UTL Aith 90°/, Coveragej 6 SS7l
95'/. Chebyshev UPL[" 19 62
Upper Limit Based upon IQR! 8 975
Note: UPL (or upper percentile far gamna distributed
data) represents a preferred tstiindte of BTV
-------
10.1.4 All Statistics Option
l. Click Background ~ Full Background Statistics ~ All
JJjProUCL 4.0 - [C:\Narflin\PrQUCL-Ddta\Ddta\MW89.w5tJ
,oy Fte Edit Configue SummaryStabsba ROSEstf^Os Graphs OutferTests Goodness-of-Fit HypothesisTestng
| UCL Window Help
e)|
-------
Output Screen for All Statistics Option (Full)
General
Total Number of Samples, 33 j
Number of Unique Samples! 2B
Raw Statistics
Mir.irr.um'	0 094
Maximum]	21
Second Largest1	6 967
FirstQuantile	0.35
Median	2 5
Third Quantile.	3
Mean	3
. _ ^	^
Coefficient of Variation i	1
Skewness	3
i
02- i
751 j
2iT
607 I
Log-Transformed S&tistics
Minimum'	-2 3&4
Maximum	3 0*5'
Second" Largest:	1 941
First Quantife'	-1.13
Median'	0 916
Thira Quantile;	i.334
Me-n" 0 459
	 SD, '"'l 303"
Background Statistics
Normal Distribution Test	j~
ShapiroWilkTest Statistic'	0 642 j
Shapiro V/ilk Critical Value	0 931 J
Data not Normal at 5%Signi&cance Level	|
_ . „	_ j.
Assuming Normal Distribution
95'/. UTL with 90'/. Coveragej	9.549
"	" ~95%UPL(t)|~"	9 472
90% Percentile (z)j	7 S3
' 95°/. Percentile (z)j	9.193
99V. Percentile (2) 1	11 75~
Lognormal Dbtnbuticn Test
Shapiro V/ilk Test Statistic	6 909
Shapiro V/ilk Critical Value'	6 93T
Data not Lognormal at 5% Significance Level
Assuming Lognormal Distribution
95% UTL with 90'/, Coverage!	15 4
~ * "" ~ ~95% IJPL(i)|	15
90% Percentile (2)j	" 845S
95V. Percentile (z)j	13 6
~ ^95% Percent (z)|^ 33 18 "
Gamma Distribution Test
kstar-^ 0S4
"fhetaStYrj " 3~593"
nustari 55 46
A-D Test Statistic! 0 79
Data Distribution Teste
Data Follow Appr. Gamma Distribution at 5% Significance Level
5% A-D Critical Value
K-S Test Statisticj
5% K-S Cntical Value!
0 78
"0 14T"
0158
Data follow Appx. Gamma Distributional 5^ Significance Level
Assuming Gamma Distribution
90'/. Percentile. 7 285
			_			 i		
95X Percentile: 9 637
99V. Percentile' 1522
Nonparamelric Statistics
90% Percentile] 5 27
95'/, Fercentilej 6 468
99% Percentile! 16 37
95V. UTL 
-------
10.2 Background Statistics with NDs
1. Click Background ~ With NDs Background Statistics

~2 He Fctt Configure Summary Statistics ROSEsifOs Graphs Outo Tests Goodness-of-fit Hypothecs Tesbng	(JCL Wndow Help
®\
2. Select the With NDs Background Statistics option.
•	To compute the background statistics assuming the normal distribution, click on Normal
from the drop-down menu list.
•	To compute the background statistics assuming the gamma distribution, click on Gamma
from the drop-down menu list.
•	To compute the background statistics assuming the lognormal distribution, click on
Lognormal from the drop-down menu list.
•	To compute the background statistics using distribution-free methods, click on Non-
Parametric from the drop-down menu list.
•	To compute all available background statistics in ProUCL 4.0, click on the All option
from the drop-down menu list.
10.2.1 Normal or Lognormal Distribution
1. Click Background ~ With NDs Background Statistics ~ Normal or Lognormal

2. The Select Variables Screen (Chapter 3) will appear.
•	Select a variable (variables) from the Select Variables screen.
•	If needed, select a group variable by clicking the arrow below the Group by variable to
obtain a drop-down list of available variables, and select a proper group variable.
•	When the option button is clicked, the following window will be shown.
150
-------
H Background Statistics Options
~IS®!




Confidence Level
1 tug


Coverage
| 0.9

Different or Future K Values
l i

Number of Bootstrap Operations
j 2000

OK

Cancel




A
o Specify the Confidence Level; a number in the interval [0.5, 1), 0.5 inclusive. The
default choice is 0.95.
o Specify the Coverage level; a number in the interval (0.0, 1). Default choice is 0.9.
o Specify the next K. The default choice is 1.
o Specify the Number of Boostrap Operations. The default choice is 2000.
o Click on the OK button to continue or on the Cancel button to cancel the option.
• Click on OK button to continue or on Cancel button to cancel the Background Statistics
Options.
151
-------
Output Screen for Normal Distribution (with NDs)
Arsenic
Raw Statistics
Total Number of Data j 24
Number of Non-Detect Data 13
Number of Detected Data; il
Minimum Detected! 0 5
						,		[_		
Maximum Detected: 3 2
Percent Non-Detects;54.17'/.
Minimum Non-detect 0 9
Maximum Non-detect, 2
Mean of Detected Data, 1 236
SD of Detected D a ta 0 965
Normal Distribution test with Detected Values Only
Shapiro Vvllk Test Statistic, 0.777
5% Shapiro Wilk Critical Value] 0.85
Data not Normal at 5% Significance Level
Normal Distribution Test with Detected Values Only
Shapiro Wilk Test Statistic! 0 777
5X Shapiro Wilk Critical Vaiuej 085
Data not Normal at 5.% Significance Level
Background Statistics Assuming Normal Distribution
DL/2 Substitution Method
Mear.j 1 002	|
SDr"" 0 699
	i		
95°/, IJTL 9D% Coverage: 2.2%	|
S5^~U PLTtj j T224
90% Percentile (z)j 1 897
95?; Percentile (2) j 2151
99^ Percentile (z) I 2 627
Note: DU2 is.nota recommended method.
Maximum Likelihood Estimate (MLE) Method
MLE Method is Not Appliable for This Data j
152
-------
Maximum Likelihood Estimate (MLE) Method
MLE Method is Not Appliable for This Data;
Kaplan-Meier (KM) Method
Mean]	0949
SDf	0 713
SEof Meanj	0165
S5/', IJTL with SO'/, Coverage'"	2 27
' 957, KM UPL (t);7' 2.196
95'/, KM Chebyihtv UPLj	11~2f
90V* Percentile (z)i	1863
95£ Percentile (2)]	2 122
99!/« Percentile (z) f	2 603
Note: UPL (or upper percentile for gamrra distributed (fata) represent a
preferred estimate of BTV. For an Example: KM-UPL may be used
when multiple detection limits are present
153
-------
Output Screen for Lognormal Distribution (with NDs)
Arsenic
Log-T ransfoimed Satisfies
Number of Valid Samples j	24"
Number of Unique Samples |	10
Minimum'	-0 S93
Maximum'	1 163
		i	
Second Largest^	1 03
Meanj	0215
First Quantile' 0 693
Medianj 0 203
Third Quantile
SD
-0 693
0 574~
Lognormal Distribution Test
Shapiro Wilk Test Statisticj 0 906
, , 5% Shapiro Wilk Critical ValueJ 0916
Data not Lognormal at 5% Significance Level
Background Statistics Assuming Lognormal Distribution
90% Percentile (z) |	2 587
9554 Percentile (z) |	3 186
99% Percentile (zj I	4 711
¥5%UPL!	3 383
95% UTL with 90% Coverage! 3 59
Some Nonparametric Background Statistics
95"/. Chebyshev UPL! 4 825
95V. Bootstrap BCA UTL with 90% Coverage: 18
95% Percentile Bootstrap UTL with 90% Coverage! 3 04
Note: U PL (or upper percentile for (pima distributed
data) represents a preferred estimate of BTV
154
-------
10.2.2 Gamma Distribution
1. Click Background ~ With NDs Background Statistics ~ Gamma
^JjProUCL 4.0 - (C:\Narain\ProllCL-Uqta\Data\MWU9.wst]
~3 FJe Edit Configure Simnwy Statistics ROSEstNDs Graphs OutSer Tests Goodness-of-fit Hypothesis Testing I
I LCI Window Help
2. The Select Variables Screen (Chapter 3) will appear.
•	Select a variable (variables) from the Select Variables screen.
•	If needed, select a group variable by clicking the arrow below the Group by variable to
obtain a drop-down list of available variables and select a proper group variable.
•	When the option button is clicked, the following window will be shown.
Background Statistics (Gamma)
[~
Confidence Level
Coverage
OK

09
Cancel
o Specify the Confidence Level; a number in the interval [0.5, 1), 0.5 inclusive. The
default choice is 0.95.
o Specify the Coverage level; a number in the interval (0.0, 1). Default choice is 0.9.
o Click on the OK button to continue or on the Cancel button to cancel option.
• Click on OK button to continue or on Cancel button to cancel the Background Statistics.
155
-------
Output Screen for Gamma Distribution (with NDs)
Arsenic
Raw Statistics
total Number of Data1	24
Number of Non-Oetect Data i	13
Number of Detected Data,	11
Minimum Detected	0 5
Maximum Detected;	3 2
Percent Non-Detects	54 tf'/*
Minimum Nor.-detect(	0 9
Maximum Non-dstect	2
Mean of Detected Data!	1 236
SD of Detected Data'	0 965
Gamma Distribution Test with Detected Values Onfy
k star	1 702
Theta star	0 727
nustar1	37 44
95'/. Percentile of Chisquare (2k)f	3 503
A-D Test STatistic i	0 787
5V. A-D Critical Value,	0738
K-STest Statistic j 0 254
~5V. K-SCnUcalValJe1 0'258
Data follow Appr. Gamma Distribution at 5% SkjHftcance Level
Background Statistics Assun£ng Gazrcna Distribution
	1
t
Gamma ROS Statistics wrfh Extrapdated Data
Meanj" 1 253^
Median! 1 213,
SD j 0 6521
kStarj 3 974.1
Theta Star| 0 318;
nuStarj
95^. PercentileoTChisquareC2k)|	1543j
90V. Percentile,	1 861J
95V. Percentile j	2 16!
- 99'/. Percentile^	2 80^
Kaplan Meier ( KM) Method	j
,	. "09491
___
SEofMeanj	0165'
95X UTL aO'/rCOTersgei	2 2?1
95YKfTCh"i5ihS"UPLi	4~121'
95'/rKi;t"UPL(t)i	2 19&i
90Percentile'	1 863'
, ______ ~95X Percentile (z):	2122^
—I—
99V. Percentile (2): 2 608J
Note UPL (or upper percentile for gamma distributed data) represent a j
preferredestimateof BTV. Few an Example.	may be used j
when multiple detection lirrita are presort	I
156
-------
10.2.3 Nonparametric Methods (with NDs)
1. Click Background ~ With NDs Background Statistics ~ Non-Paramctric
WJProUCL 4.0 - [C:\Ndrtiin\PioUCL-Ddta\DiitaWW89.wst]
.ny Ffle Edit Configue Summary Statistics ROSEst. NDs Graphs Outier Tests Goodness-of-Fit Hypothesis Testing
I UCl Window Help
Navigation Panel j



			i|
VYtth f*)s Background Staines ~

j Q 1 1
2 3 1
i ! 5 I 6 L

Name |
| Well ID Mn

1 1
1 1 1 II
O Worksheet wst
0 MW89 wst
l
1, 460' ; 1 • 1
l '[ T27| " -- ^ - -j
r 579; i I , ; i
2
3
Normal
Gamma
Log normal
AB
2. The Select Variables Screen (Chapter 3) will appear.
•	Select a variable (variables) from the Select Variables screen.
•	If needed, select a group variable by clicking the arrow below the Group by variable to
obtain a drop-down list of available variables and select a proper group variable.
•	When the option button is clicked, the following window will be shown.
Background Statistics (Gamma)
m
Confidence Level
Coverage
OK
0.9
Cancel
o Specify the Confidence Level; a number in the interval [0.5, 1), 0.5 inclusive. The
default choice is 0.95.
o Specify the Coverage level; a number in interval (0.0, 1). Default choice is 0.9.
o Click on the OK button to continue or on the Cancel button to cancel the option.
• Click on OK button to continue or on Cancel button to cancel the Background Statistics.
157
-------
Output Screen for Nonparametric Methods (with NDs)
Arsenic
Total Number of Data .24
Number of Non-Detect Data; 13
Number of Detected Data, 11
Minimum Detected' 0 5
Maximum Detected! 3.2
Percent Non-Detecis J54.17'/.
Minimum Non-detect; 0 9
Maximum Non-detectj 2
Mean of Detected Data i 1 236
SD of Detected Data ;
0 965
Mean of Log-Transformed Detected Dat3| -0 0255
SDof Log-Transformed DetectedData' 0 694
Data Follow Appr. Gamma Distribution at 5% Significance Level
Nonparametric Background Statistics
95°/. UTLwith 90% Coverage
Order Statistic!
23
—
Achieved CCj 92 02
uYLr'Tif-
Largest Non-detect at Order; 22
Kaplan-Meier'(KM) Method
Mean'
sdT
				„ .. ^	L
Standard Error of Meanj
95'/. IJTL 90°/. Coverage'j
~~ 95 
-------
10.2.4 All Statistics Option
1. Click Background ~ With NDs Background Statistics ~ All
2. The Select Variables Screen (Chapter 3) will appear.
• Select a variable (variables) from the Select Variables screen.
If needed, select a group variable by clicking the arrow below the Group by variable to
obtain a drop-down list of available variables, and select a proper group variable.
When the option button is clicked, the following window will be shown.
Background Statistics
Confidence Level
Coverage
Next K
0 95
0.9
OK
Cancel
o Specify the Confidence Level; a number in the interval [0.5, 1), 0.5 inclusive. The
default choice is 0.95.
o Specify the Coverage level; a number in the interval (0.0, 1). Default choice is 0.9.
o Specify the Next K. The default choice is 1.
o Click on the OK button to continue or on the Cancel button to cancel the option.
• Click on OK button to continue or on Cancel button to cancel the Background Statistics.
159
-------
Output Screen for All Statistics Option (with NDs)
Arsenic
General Statistics
Number of Valid Samples j 24 |
Number cf Unique Samples 8
I
Number of Detected Dataj 11
Number of Non-Detect Data' 13
Percent Non-Detects'54 17'/.
Raw Statistics	|
Minimum Detected;	05	I
Maximum Detected	3 2	i
Mean of Detected'	1.236	!
SDcf Detected	0.965	,
Minimum Non-Detect	0 9	[
Maximum Non-Detect-	2	I
Log-transformed Statistics
Minimum Detected! -0693
Maximum Detected1 1 163
Mean of Detected j -0 0255
SD of Detected 0654
Data with Multiple Detection Limib
Note Data hive multiple DLs - Use cf KM Method is leccmmended
For ail methods (except KM, DIJ2, =nd ROS Methods).
Observations < largest ND are treated as-NDs
Minimum Non-Detect: -0105
Maximum Non-Detect] 0 693
Single Detection Limit Scenario
Number treated es fJon-Detect with Single DLj22
Number treated as Detected with Single DL:2
Single DL Non-Detect Percentage.91 67'/,
Background Statistics
Normal Distribution Test with Ddected Values Onfy
Shapiro V/ilk Test Statistic j 0 777
5'/, Shapiro V/ilk Critical Valuej 0 S5
Data not Normal at5^ Significance Level
Assuming Normal Distribution
DL/2 Substitution Method;
Mean!
_____
95J/» UTL 90'/, Coverage,
„
90*/. Percentile (2)
95*/. Percentile (z)'
99V. Percentile (z);
a
1002
0699
~229~6
2 224
1	S97
2.151
2	627
Lognormal Distribution Test with Detected Values Only
Shapiro WiTk Test Statistic^ 0 86
5Vi Shapiro V/ilk Critical Value: 0 85
Data appear Lognormal at 5% Significance Level
Assuming Log norma) Distribution
DL/2 Substitution Method'
Mean (Log Scale);
SD (Log Scale)'
95% UTL 90^. Coverage;
gWUPLWf
90% Percentile (z)1
95*/. Percentile (z)^
W/l PercentTle~(z)"i
-016
"0 542
"2 327"
2 2
1 707
"2079"
" 3 009
160
-------
Log ROS Methoc'	j
Mean in Onginal Scale, 0.972	i
SD in Original Scale 0 71S	j
Mean in Log Scale -0 209	j
SD in Log Scale 0 571	i
95*4 UTL 90'/. Coverage 2 337	;
55V. UPL (t)~ 2 202	I
90V. Percentile (zj 1 6S6	j
95°/. Percentile (zj 2 075 j
99y--pe~f(:^|e-|2j - 3 062	"|
Data Distribution Tests with Detected V^ues Onfy	j
Data foljow Appr Gamma Distribution at 5V, Significance LeveJ	1
Nonparametric
Kaplan-Meier (KM) Method'
""	l-lean; "
_ _	^ ,	'0713
SE of Mean	0165
95V; KM UTL with 90'/. Coverage	2 27
95*/. KM Chebyshev UPLi	4 121
^•/."KMUPLO); "	2 196
90V. Percentile (2)!	1 S£3
Maximum Likelihood EstirrvatetMLE) Methoa N'A
Gamma Distribution Test with Detected Values Onfy
k star (bias corrected) 1 702
ThetaStar! 0 727 !
nustar 37 il ,
i
A-D Test Statistic 0 7S7
5V. A-D Critical Value 6 733 !
K-S Test Statistic 0254 j
5V; K-S Critical Value 0.253 i
Data follow Appx. Gamma Distribution at 5X Significance Level j
Assuming Gamma Distribution
Gamma ROS Statistics with extrapolated Data]
		 " " "	"" "" Mean*	1 2&3~;
Meaianj	[213 |
SDj*	0 652" ;
k star,	3 97
-------
Chapter 11
Computing Upper Confidence Limits (UCLs) of Mean
The UCL computation module of ProUCL 4.0 represents an update of the UCL module of ProUCL 3.0.
The detailed theory and formulae used to compute gamma and lognormal statistics are given by Land
(1971, 1975), Gilbert (1987), Singh, Singh, and Engclhardt (1997, 1999), Singh et al. (2002a), Singh et al.
(2002b), and Singh and Singh (2003).
Several parametric and nonparametric UCL computation methods for data sets with NDs have been
incorporated in ProUCL 4.0. Methods such as the Kaplan-Meier (KM) and regression on order statistics
(ROS) methods as incorporated in ProUCL 4.0 can handle multiple detection limits. For details regarding
the distributions and methods available in ProUCL 4.0, refer to the ProUCL 4.0 Technical Guide and
Singh, Maichle, and Lee (USEPA, 2006). Recommendations for the computations of UCLs for data sets
with NDs have been made based upon the findings of the simulation experiments performed by Singh,
Maichle, and Lee (USEPA, 2006).
In ProUCL 4.0, two choices are available to compute UCL statistics:
•	Full - Computes UCLs for full data sets without any nondetected values.
•	With NDs - Computes UCLs for data sets that have detected as well as BDL
observations. It is pointed out that it is not desirable to use statistical methods as
incorporated in ProUCL 4.0 on data sets consisting of all nondetect values. Discussion
about the detection sampling frequency is provided in Chapter 1 of this User Guide.
Some of the available methods can handle multiple detection limits. The program
provides a message to the user about the use of an appropriate method when multiple
detection limits may be present.
•	For full data sets without NDs and also for data sets with NDs, the following options and
choices arc available to compute UCLs of the population mean.
o The user specifies the confidence level; a number in the interval [0.5, 1), 0.5
inclusive. The default choice is 0.95.
o The program computes several nonparametric UCLs using the central limit theorem
(CLT), Chebyshev inequality, jackknife, and bootstrap re-sampling methods,
o For the bootstrap method, the user can select the number of bootstrap runs (re-
samples). The default choice for the number of bootstrap runs is 2000.
o The user is responsible for selecting an appropriate choice for the data distribution:
normal, gamma, lognormal, or nonparametric. It is desirable that user determines data
distribution using the Goodness-of-Fit test option prior to using the UCL option. The
UCL option informs the user if data are normal, gamma, lognormal, or a non-
discernable distribution. Program computes statistics depending on the user selection,
o For data sets, which are not normal, one may try the gamma UCL next. The program
will offer you advice if you chose the wrong UCL option,
o For data sets, which are neither normal nor gamma, one may try the lognormal UCL.
The program will offer you advice if you chose the wrong UCL option.
163
-------
o Data sets that are not normal, gamma, or lognormal are classified as distribution-free
nonparametric data sets. The user may use nonparametric UCL option for such data
sets. The program will offer you advice if you chose the wrong UCL option,
o The program also provides the All option. By selecting this option, the UCLs are
computed using most of the relevant methods available in ProUCL 4.0. The program
informs the user about the distribution of the underlying data set, and offers advice
regarding the use of an appropriate UCL.
o For lognormal data sets, ProUCL can compute only a 90% or a 95% Land's statistic-
based H-UCL of the mean. For all other methods, ProUCL can compute a UCL for
any confidence coefficient in the interval [0.5,1.0), 0.5 inclusive,
o If you have selected a distribution, then ProUCL will provide a recommended UCL
computation method for 0.95, confidence coefficient. Even though ProUCL can
compute UCLs for confidence coefficients in the interval [0.5, 1.0),
recommendations are provided only for 95% UCL; as EPC term is estimated by a
95% UCL of the mean.
Note: It is recommended that the user identify a few low probability outlying observations that may be
present in the data set. Outliers distort many statistics of interest including summary statistics, data
distributions, test statistics, UCLs, and estimates of BTVs. Decisions based upon distorted statistics may
be misleading and incorrect. The objective is to compute relevant statistics and estimates based upon the
majority of the data set(s) representing the dominant population^). Those few low probability outlying
observations require separate attention and investigation. The project team should decide about the
proper disposition (to include or not to include) of outliers before computing the statistics to estimate the
EPC terms and BTVs. In order to determine and compare the improper and unbalanced influence of
outliers on UCLs and background statistics, the project team may want to compute statistics using data
sets with outliers and without outliers.
11.1 UCLs for Full Data Sets
11.1.1 Normal Distribution (Full Data Sets without NDs)
1. Click UCL ~ Full ~ Normal
oS* Fde Edit Configure Summary Statistics ROSEst. h©s Graphs Outlier Tests Goodness-of-Rt Hypothesis Testing Background TOM Wrtdovv
Help
£3|
-------
Confidence Level
Confidence Level
m
UliM
OK
Cancel
o Specify the Confidence Level; a number in the interval [0.5, 1), 0.5 inclusive. The
default choice is 0.95.
o Click on OK button to continue or on Cancel button to cancel the option.
• Click on OK button to continue or on Cancel button to cancel the UCL computation
option.
165
-------
Output Screen for Normal Distribution (Full Data without NDs)
| Normal UCL Statistics for Full Data Sets
User Selected Options!
From File jC.\Narain\ProUCL-Data\Data\Aroclor 125i.wst
Full Precision lOFF
Confidence Coefficient |95%
Aroclor_Without_NoribetEcts	;
Number of Valid Samples [	4-ij	j
Number of Unique Samples |	41 j	j
Minimum]	0.21j	j
Maximum!	1900o'	|
			 		J. 	i	|
Mean,	1532,	i
Medianj	S45|	[
__ _ ,
Variance' 11255595'	|
Coefficient of Variation!	219 j	|
Skewness|	3 756]	|
' '	j	;
Shapiro Wilk Test Statistic!	0 526j	j
5V. Shapiro Wilk CriticafValue1	QS441	|
Data rot Normal.at 5'/, Significance Level,	,
95% UCL (.Assuming Normal Distribution)	j
Student's-t LCL~j~ §81.6 j
StudentVtUCL; 2382
Data appear Lognormal (0.05)
May waiitto try Lognorm^ UCLs
166
-------
11.1.2 Gamma, Lognormal, Nonparametric, All Statistics Option (Full Data without NDs)
1. Click UCL ~ Full ~ Gamma, Lognormal, Non-Paramctric, or All
£?|PrqUCL 4.0 - [C:\Narain\ProUCL-pata\Data\MW89:wst]
ay, File Edit Configure Summary Staostts P*OSEst Graphs Outlier Teste Goodness-of-Fit Hypothesis Testing Background
I Wrdow Hdp
Navigation Pane! |







cMmMmna Vv'ltD NDS >
i





6
, 1 — a --

Name |

Well ID
Mn |




I

0 Worksheet, wst
®MW89.wst
1
2
1| iSO' I i : i
1| -- 527j ; ; ' ; '
Q Normal
Gamma
LogrFormal
Non-Pirftmetnc
2. The Select Variables Screen (Chapter 3) will appear.
•	Select a variable (variables) from the Select Variables screen.
•	If desired, select a group variable by clicking the arrow below the Group by variable to
obtain a drop-down list of available variables, and select a proper group variable.
•	When the option button is clicked, the following window will be shown.
UClls
Confidence Level J IflEH
m
Number of Bootstrap Operations J 2000
OK
Cancel
o Specify the Confidence Level; a number in the interval [0.5, 1), 0.5 inclusive. The
default choice is 0.95.
o Specify the Number of Bootstrap Operations (runs). Default choice is 2000.
o Click on OK button to continue or on Cancel button to cancel the UCLs option.
• Click on OK button to continue or on Cancel button to cancel the selected UCL
computation option.
167
-------
Output Screen for Gamma Distribution (Full)
Gamma UCL Statistics for Full Date Sets
User Selected Options
From File C \N2ram\RolJCL-Data\0ataWoclor 12S1 wst
Full Precision ^FF
Confidence Coefficient 35V*
Number of Bootstrap Operations , 2000
PAroclor V/ithout UonDetects
Number of Valid Samples!
Number of Unique Samples £1
Minimum; 0 21
Maximum, 19000
Mean! "1532
Median; 1& 5
Standard Deviation! 3355
Varisnce!11255595
k star (bias corrected)! 0 2^7
Theta Start 6203
nustar^ 2172
I
Approximate Chi Square Value (05)j^ 12 12 j
Adjusted Level of Significance 0 0^51
Adjusted Chi Square Value ( 05)^ 1188
Anderson-Darling test Statistic 1 01
"~~08&4
"0.169"
Kolnwgorov-Smirnov Critical Value, 0.146
Data not Gamma distributed at bX- Significance Levei
.Anderson-Darling Critical Value!
Kolmogorov-Smjrnov Test Statistic)
95"4 UCLs (Adjusted for Skegness)
" 95"/; Adjusted-CLT UCLj 2670"
* 95%~ Modifie? tUCL] "2430
95% NorhParameiric UCLs
95% Bootstrap-TuCLj 3010
95% Hall's Bootstrap UCLj^ 5380

95V: Gamma UCLs(Assunnng Gamma Distribution)
95% Approximate Gamma UCL
2744
"991"
95% Approximate Gamma LCL
95% Adjusted Gamma UCL j 2S00
95% Adjusted Gamma LCLI 975 8
Data appear Lognormal (0.05)
May want to try Lognorma) UCLs
168
-------
Output Screen for Lognormal Distribution (Full)
Lognormal UCL Statistics for Full Data Sets
User Selected Options!
From File ^CANsrain\ProUCL-Data\Dat2\Aroclor 125i wst
Full Precision ,OFF
Confidence Coefficient [95V;
Number of Bootstrap Operations >2000
Aroc lor_Wi thouLNonDetects
~ .... . .	[
Number of Valid Samples j	£4;
Number of Unique Samples!	^1'
Minimum of log data I	-1561,
Max i mum of log data1	9 S52
Mean of log data!	tf(L\
SD of log data1	3.163,
Variance of log data,	10'
Shapiro Wilk Test Statistic	0.9^3
Shapiro Wilk 5*/. Critical Valuej	0 9±4'
Data appear Lognormal at 5'4 Significance Level	|
95% UCL (Assuming Normal Distribution)
95'4 StudentVt UC L1
ML Estimates .Assuming Lognormal Distribution
Mean |
				SD;
Coefficient of Variation j
Skewness;
Median |
80V. Quantile|
90V. Quantile
95V. Quantile,
23S2'
¦¦ -f
" 13038
19382351
U87j
3285S37;"
8777;'
1256:
" 5056;
~15935r
"137549J '
_	_ I
MVU Estimate of Median j 78 25;
MVU Estimate of Meanj 7626
MWEstimSeof SD 1&5089:
MVU Estimate cf Standard Error of Mean
5553]
UCLs (Assuming Lognormal Distribution)	i
95*>. H-UCLi	37513'
_ 	 _ 			 _ 	 -
' ""951/. Chebyshev (MVUE) UCL,	' " 31832!"
97 5'TChebyslieV(MVUE)~UCLl	±2307f
________	, 62881P
Potenti a IU CL to Use
Use 99Va Chebyshev (MVUE) UCLj
Recommended UCL exceeds the maximum observation
628611
-------
Output for Nonparametric Methods (Full Data without NDs)
. Nonparametric UCLStatistics forFiril Data Sets
User Selected Options;
From' File C \Narain\ProUCL-bata\Data\.Arocior 1254v/si
Full Precision lOFF
_i		_ _. _ _ _ _	_ 		
Confidence Coefficient ; S5'-i
Number of Bootstrap Operations ! 2000
Aroclor_W>thout_Nari Defects
Number of Valid Samples'	44;
Number of Unique Samples'	41!
Minimum^	0 21,
Maximum^	19000^
Meanj	1532 [
Medianj	94.51
. •—sol" " 3355:
Variance!	T12555951
Coefficient of Variation |	219^
SkeATiessj	3756;
Mean of log data j	4 474.
SD of log data	3.163:
	 ... . 1	. . . . i

"1

Non-Paiametric UCLs

	

95* CLT UCL
135.1
	
95* Jackknife UCL
136.1
1344
95* Standard Bootstrap UCL
"
	
955; Bootstrap-t UCL
144 5
139 7
95/J Hall's Bootstrap UCL

95% Percentile Bootstrap UCL
1361

95X BCA B ootstrap U CL
141 3


95* Chebyshev(Mean. Sd) UCL
167 3


97.5* Chebyshev(Mean, Sd) UCL
189.7


99* Chebjishev(Mean, Sd) UCL
233.7





Potential UCL to Use


Use 95* Student's-t UCL
1361


Or 95* Modified-t UCL
136 8





170
-------
Output Screen for All Statistics Option (Full Data without NDs)
; General UCL Statistics for Fudl Data Sets


User Selected Options



From File C Warair^PrcUCL-Data^DataWoclof 1254 ws


Full Precision OFF



Confidence Coefficient 95V,



Nuirber of Bootstrap Operations 2000



Af oc 1 or_Wi thout_NanOetBcb
Genera! Statistics


Number of Valia Samples
44
Number of Unique Samples
41
Raw Statistics

Log-transformed Statistics

Minimum
0 21
Minimum of Log Data
-1 5S1
Maximum
19000
Maximum of Log Data
9 852
Mean
1532
Mean of log Data
4 474
Median
Si 5
SD of log Data
3163
SD
3355


Coefficient of Variation
2 19


Sk€ATre55
3.755


Relevant UCL Statistics

Normal Distribution Test

Lognormal Distribution Test

Shapiro V/ilk Test Statistic
0 525
Shapiro V/ilk Test Statistic
0 948
Shapiro V/ilk Critical Value
094i
Shapiro V/ilk Critical Value
0 94i
Data not Normal al 5^ SigniScaice Level

Data appear Lognormal at 5% Sifpvficance Level
Assuming Nonrai Distribution

Assuming Lognormad Distribution

95V, Studenfs-t UCL
2332
95V, H-UCL
87513
95% UCLs (Adjusted far Skewness)

95V, Cheb/shev (MVUE) UCL 31S32
95V, Adjusted-CLT UCL
2£70
97 5V; Chebyshev (MVUE) UCL
42307
95*4 Modified-t UCL
2^30
99:. Chebyshev (MVUE) UCL
52831
Gamma Distribution Test

Data Distribution Tests

k star (bias corrected)
0 247
Data appear Lognormal at 5% Sigrvficase Levd
Thete Star
€203


ru star
21 72


Approximate Chi Square Value (05)
12 12
Nonparametric Statistics

Adjusted Level of Significance
00445
95Ve CLT UCL
2364
Adjusted Cfii Souare Value
11 S3
95V, Jackknife UCL
95V, Standard Bootstrap UCL
2382
2352
Anderson-Darling Test Statistic
1 01
95V, Bootstrap-t UCL
3035
Anderson-Darling 5'/, Critical Value
03Si
95V. Hall's Bootstrap UCL
5521
Kolmogorov-Smirnov Test Statistic
0 159
95V, Percentile Bootstrap UCL
2404
Kolmogorov-Sffiirncv 5% Critical Value
0 1-6
95V; ECA Bootstrap UCL
2727
Data not Gamma Distributed at 5% SfrpxficaDce Level
95X Chebyshe'/(Mean, Sd) UCL
3737


97 5V, Cheby5hcv(Mean Sd) UCL
4690
Assuming Gamma Distribution

99V, Chebyshev(Mean. Sd) UCL
6564
95V, Approxima:e Gamma UCL
27^4


95V, Adjusted Gamma UCL
2S-CO


Potential UCL to Use

Use 99:, Chebyshev (MVUE) UCL
62S31
Recommended UCL exceeds the maxmumofasovaban
171
-------
Note: Once again, it should be noted that the number of valid samples represents the total number of
samples minus (-) the missing values (if any). The number of unique or distinct samples simply represents
number of distinct observations. The information about the number of distinct samples is useful when
using bootstrap methods. Specifically, it is not desirable to use bootstrap methods on data sets with only a
few (< 4-5) distinct values.
11.2 UCL for Data Sets with NDs
1. Click UCL ~ With NDs
F3e Edit Configure Summary Statistics ROS Est. NDs Graphs Outber Tests Goodness-of-Fit Hypothesis Testing Background IU9 Window Help
epM BlBlml ml

mam
With NDs ~
I
Normal
• Gamma
Lognormal
Non-Parametric

Navigation Panel |

0 I
1 | 2
3 | 4
5
6
7
	5	

11
Name |
Well ID j
Mn |







O Worksheet wst
0MW89 wst
1
2
1
1
1
<£0
' " 527,
: : 1 | ! |

3
579
: ' ' !
i

2.	Choose the Normal, Gamma, Lognormal, Non-Parametric, or All option.
3.	The Select Variables Screen (Chapter 3) will appear.
•	Select a variable (variables) from the Select Variables screen.
•	If desired, select a group variable by clicking the arrow below the Group by variable to
obtain a drop-down list of available variables, and select a proper group variable. The
selection of this option will compute the relevant statistics separately for each group that
may be present in the data set.
•	When the option button is clicked, the following window will be shown.
[~
Confidence Level
Number of Bootstrap Operations

2000
OK
Cancel
o Specify the Confidence Level; a number in the interval [0.5, 1), 0.5 inclusive. The
default choice is 0.95.
o Specify the Number of Bootstrap Operations. The default choice is 2000.
172
-------
o Click on OK button to continue or on Cancel button to cancel the UCLs option.
Click on OK button to continue or on Cancel button to cancel the selected UCL
computation option.
Output Screen for Normal Distribution (with NDs)
Usei Selected Options
From File
Full Precision
Confidence Coefficient
Number of Bootstrap Operations
Normal UCL Statistics for Data Sets with Non-Detects
D:\example wst
OFF
95£
2000
Arsenic
Total Number of Data
	f-
Number of Non-Detect Data
Number of Delected Data
Minimum Detected
Maximum Detected
Percent Non-Detects
Minimum Non-detect
Maximum Non-detect
Mean of Detected Data
SD of Detected Data
_2°;
3~j
171
5
^972
15.002
4.3
4.5
G.12G
1.15
Note: Data have multiple DLs- Use of KM Method is recommended
Foi all methods (except KM, DL/2, Robust ROS, and Gamma ROS
the Laigest DL value is used for all NDs
173
-------
Output Screen for Normal Distribution (with NDs) - Continued


















Notmal Distribution Test with Detected Values Only





Shapiro Wilk Test Statistic
0.794





5% Shapiro Wilk Critical Value
0.892




Data not Normal at 5% Significance Level













DU2 Substitution Method








Mean
5.54







sdI
1 779






35% DU2 (t) UCL
6.228












Maximum Likelihood Method




Mean
5.774



SD
1.348



35% MLE (t) UCL
6.295






35% MLE (Tiku) UCL
6 301














Kaplan Meier(KM) Method








Mean
5.958







SD
1.104



Standard Error of Mean
0.255



95% KM (t) UCL
6.398
I

35% KM (z) UCL
6 376



35% KM (BCA) UCL
6.393





35% KM [Percentile Bootstrap) UCL
6.388







Data do not follow a Discernable Distribution (0.05)







May want to tiy Non-Parametric UCLs



174
-------
Output Screen for Gamma Distribution (with NDs)
| Gamma UCL Statistics for Data Sets with Non-Detects
UserSelectedO ptionsj
From File i D. \example wst
Full Precision I OFF
Confidence Coefficient ; 95%
Number of Bootstrap Operations j 2000
Arsenic
Total Number of Data
Number of Missing Values
Number of Non-Detect Data!
Number of Detected Data!
Minimum Detected;
20
2CT
3~
T7
	5~
Maximum Detected 9 2
Percent N on-D elects' 15.00^:
		i	
Minimum Non-detect' 4.3
Maximum Non-detect 4 5
M ean of D elected D ataj G. 12G
Median of Detected Data! 5.8
SD of Detected Data 1.15
k Star of Detected Data! 28.99
Theta Star of Detected Data
'-I-
Nu Star of Detected Data
0.211
985.5
Note: Data have multiple DLs - Use of KM Method is recommended
For all methods (except KM, DL/2. Robust ROS, and Gamma ROS),
the Largest DL value is used for all N Ds
-------
Output Screen for Gamma Distribution (with NDs) - Continued



|



	
--	
	
		
	
' " " I

Gamma Distribution T est with Detected Values Only
|



A-D Test Statistic
1.027
!
I



5%A-D Critical Value
0.737 j




K-S Test Statistic
0.214 j

5% K-S Critical Value
0 209

Data not Gamma Distributed at 5% Significance Level 1






r

Gamma ROS Statistics with Extrapolated Data

				i._	



Minimum
3.51 G





Maximum
9.2


Mean
5.804


Median
5.625


SD
1.325


k Star
18.24


Theta Star
0.318


Nu Star
729 8


35% Percentile of Chisquare (2k)
51.58


AppChi2
6G8.1




35% Gamma Approximate UCL
6.34


35% Gamma Adjusted UCL
6.384 |
i

									_	______		 I	

Kaplan Meier(KM) Method


Mean
5.958


SD
1 104


Standard Error of Mean
0.255


35% KM (t) UCL
6.398


952S KM (BCA)UCL
6.455


35% KM (Percentile Bootstrap) UCL
6.405


35% KM (Chebyshev) UCL
7.067





Data do not follow a Discernable Distribution (0.05]





May want to try Nonparametric UCLs


176
-------
Output Screen for Lognormal Distribution (with NDs)














Lognormal UCL Statistics lor Data Sets with NartDetects


U sei S elected 0 ptions









From File
DAexample.wst







Full Precision
OFF








Confidence Coefficient
'95%







Number of Bootstrap Operations
2000





























Aisenic
	
	
.....







	
	
	
—		
General Statistics
	
	
	
	
		-


Number of Valid Samples
20

Number of Detected Data



Number of Unique Samples
15

N umber of N on-D etect D ata | 3

	
	
		
—		
	

	
Percent N on-D etectsjl 5.00%





Raw Statistics







Maximum Detected
"""" 9 2


Maximum Non-detect; 4.5



Mean of Detected Data
5.126


SD of Detected Data
r ii5
i










:	
I




Log-T ransformed Statistics






Mean of Detected Data
" 1.798


SD of Detected Data: 0.168 I
i
Note Data have multiple DLs ¦ Use of KM Method is recomm'
Number treated as Non-Detect|3

For all methods (except KM, DL/2, Robust ROS,
and Gamma
Number treated as Detected! 17

those Observations < Largest ND are tieated as NDs

Single DL Non-Detect Percentage|15 00%











L


Lognormal Distribution T est with Detected Values Only




Shapiro Wik Test Statistic] 0.854 !
5% Shapiro Wilk Critical Value
| 0 892

	
	 	
.	
Data not Lognormal at 5X Significance Level
	
	

Note: Once again, it should be noted that the number of valid samples represents total number of samples
minus (-) the missing values (if any). The number of unique or distinct samples simply represents number
of distinct observations. The information about the number of distinct samples is useful when using
bootstrap methods. Specifically, it is not desirable to use bootstrap methods on data sets with only a few
(< 4-5) distinct values.
177
-------
Output Screen for Lognormal Distribution (with NDs) - Continued
D L/2 S ubstitution Method
Mean | 5.54
~SDi "T779"
95% H-Stat (DL/2) UCLj G522
Mean [in Log Scale)! 1648
SD (in Log Scale)^ 0 398
Robust BOS Method
Mean	503,
SD j 1 28
Mean (in Log Scale} 1 742
SD (in Log Scaiejj ~ 0.20f

35% Percentile Bootstiap UCL
95% BCA Bootstiap UCL
G 318
6 357
.I.":
Kaplan M eiei (KM ] M ethod
Mean, 5.958
SD' l¥
SE of Mean I 0 255

95*TMlt)UCL
G398 .
952: KM (BCA) UCL
6.43
~95Z KM (% Bootstrap) UCL
6 373
955: KM (Chebyshev) UCL
7 0G7
97 5X KM (Chebyshev) UCL
7 547
99% KM (Chebyshev) UCL
8 49
Potential UCL to Usej
Data do not follow a Discernable Distubution (0 05)
May want to try Nonparametnc UCLs
178
-------
Output Screen for Nonparametric Methods (with NDs)
1 1 -1 1 1 1 1 1
Nonparametric UCL Statistics for Data Sets with Nan-Detects
U ser S elected 0 ptions1
F rom File D1 \example. wst
			 _ i . . 			 	 _ 	 	
Full Precision ;0FF
Confidence Coefficient i 35%
Number of Bootstrap Operations J 2000
	
	
	
		
Arsenic
		



Total Number of Data
Number of Non-Delect Data
		
Number of Detected Data
Minimum Detected
Maximum Detected
Percent Non-Detects
20
3
17
5
9.2
15002
! [
t
i i
i 1
Minimum Non-detect
I
1
¦*1
co!



Maximum Non-detect
4.5

•

Mean of Detected Data
Median of Detected Data
67126
5^8

—

Variance of Detected Data
1.323
i

SD of Detected Data
1.15



CV of Detected Data
0.188



Skewness of Detected Data
1.783



Mean of Detected log data
1.798



SD of Detected Log data
0.168

~~





Note: Data have multiple DLs - Use of KM Method is recommended



For all methods (except KM. DL/2. Robust Ft OS. and Gamma ROS).



the Largest DL value is used for all NDs | j |
179
-------
Output for Nonparamctric Methods (with NDs) - Continued


















Nonparametiic T est with Detected Values Only



Data do not follow a Discernable Distribution (0.05)
		
L.	
	



Winsorization Method
0.168







Mean
57738







SD
0 624






95% Winsor (t) UCL
5.985














Kaplan Meier (KM) Method








Mean
5.958







SD
1.104






Standard Error of Mean
0 255



955i KM (t) UCL
G.398



95% KM (z) UCL
G.37G



95% KM (BCA)UCL
6 478



953; KM (Percentile Bootstrap) UCL
G.438



95% KM (Chebyshev) UCL
7 0G7



97.5% KM (Chebyshev) UCL
7 547



99% KM (Chebyshev) UCL
8.49














Potential UCL to Use





95% KM (Chebyshev) UCL
7 067



180
-------
Output Screen for All Statistics Option (with NDs)


1 1 1 1
1 1
1 i
| Geneial UCL Statistics for Data Sets with Non-D nterts
Usei Selected Options
Fiom File j D \example wst
-
-
... .
Full Precision ! Q FF
Confidence Coefficient j 955:
Number of Bootstrap Operations j 2000




Arsenic
	 -
--- 	 " - " * "
	 	
-
- - -i
--
- —
	
General Statistic
Number of Valid Samples, 20
Number of Unique Samples^ 15
Number of Detected Data
Number of Non-Detect Data
Percent Non-D elects
"""
17
3
~ 15 005;
-


Raw Statistics
Log-transformed Statistics


	
	
Minimum Detected. 5
Maximum Detected 9 2
Minmum Detected
Maximum Detected
1 £09
	
2.219



Mean of Detected; G12G
Mean of Detected
1 798

		
	
SD of Detected 115
M inmnum Non-D elect j 4 3
Maxrnum Non-Deled! 4 5
i
SD of Detected
Minimum Non-Delect
Maximum Non-Detect
0168
' ~1 459
1 504
	 _
Note Data have multiple DLs - Use of KM Method is recommended
For all methods (except KM, DL/2, Robust ROS, and Gamma ROS],
those Observations < Largest ND aie treated as NDs
Number treated as Non-Detect
Number treated as Detected
Single DL Non-D elect Percentage
3
17
Is"®)?!

	
	
UCL Statistics
		
	
	
	
Normal Distribution T est with Detected Vdues Oiif Lognormal Distribution T est with Detected Values Orfcr
Shapiro Wilk Test Statistic' 0 794 Shapiro Wrlk Test StatisticT 0.854
5£ Shapiro Wik Critical Value, 0 892 5% Shaptro V/ilk Critical Value 0.892
D ata not N ormal at 5% S ignif icance Level D ata not Lognormal at 5X S ignificance Level
	
Note: Once again, it should be noted that the number of valid samples represents the total number of
samples minus (-) the missing values (if any). The number of unique or distinct samples simply represents
number of distinct observations. The information about the number of distinct samples is useful when
using bootstrap methods. Specifically, it is not desirable to use bootstrap methods on data sets with only a
few (< 4-5) distinct values.
181
-------
Output Screen for All Statistics Option (with NDs) - Continued
1

III!

|

|
Assuming Normal D istribution
—: —
Assuming Lognoimal Distribution

	
	'

DL72 Substitution Method


DL/2 Substitution Method





Mean,
554


Mean
1 G48




"SD;
1.779


SD
0 398


.. - ¦ "

95% D172 (t) UCL; "
6 228
i


955; H-Stat (DL/2) UCL
G522
.

	i...
Maximum Likelihood Estimate(MLE) Method'
- -	-
	
.... 	
Robust ROS Method


— -


Mean
"5 774!'


Mean n Log Scale
1.742




sd;
1 348


SD in Log Scale
0 207

¦


95%MLE (I) UCL!
G 295'


Mean m Original Scale
5Kff

i


955: MLE (Tiku) UCL:
6 301


SD m Original Scale
1 2S'

'



	T	

95% Percentile Bootstrap UCL
G 314

l






35X BCA Bootstrap UCL
G 37


Gamma Distribution Test with DetectedVatues Drip
"" I ' """
Nonpaiametic T est with Detected Values Ority




k star (bias corrected)'
28 99.
D ata do not follow a Discer nable D istribution [0LO5)

'


Theta Starj
02111





I


nu star'
985 5
I


1





i
!








A-D Tesl Statistic
1 027
N onpaiametric Statntics





5% A-D Oitica) Value|
0 737,
"0737;

Kaplan-Meiei (KM) Method





K-S Tesl Statistic;


Mean
5958




" 5% K-f Critical V^je;
0209
i


SD
1 104j
.
Data not Gamma Distributed at 5% Significance Level
" 1 '
i


SE of Mean
0255

;






95% KM (t) Ua
6 398


Assuming G anuria D istribution



95% KM (z) UCL
6 376

1
Gamma ROS Statistics using Extrapolated Data-
I


95% KM (lackknife) UCL
6 387




Minimum;
3 51

95% KM (bootstrap I) UCL
G 673
i


Maximum;
9.2;


95% KM (BCA) UCL
6 43




Meanj
5 804


952 KM (percentile) UCL
6 398

i


Median j
5S25:

95* KM (Chebjishev) Ua
7 067
I


sd;
1.325

97 55; KM (Chebyshev) UCL
7 547

,


k star^
18 24

99% KM (Chebyshev) UCL
8.49

1


Theta sta?
0 318j





1


Nu staij
729 8[

Potential UCLs to Use


1 I


AppChi2
66811

95% KM (Chebyshev) UCL
7 067
I

952 Gamma Approximate UCL;
8 34|^





	J
i

95% Adjusted Gamma UCL J
G.384|





182
-------
Chapter 12
Windows
I^JProUCL 4.0 - [Worksheet.wst J |
F5e Edit Configure	Summary Statistics ROSEst. NDs Graphs Outlier Tests Goodness-of-Fit Hypothesis Testing Background
Navigation Panel	I 0 12 3 | i [ 5 8 7
Hame	11	|	j_	
0 WorkSheet.wst	I 1 i ! i , L
Click on the Window menu to reveal the drop-down options shown above.
The following Window drop-down menu options are available:
•	Cascade option: arranges windows in a cascade format. This is similar to a typical
Windows program option.
•	Tile option: resizes each window vertically or horizontally and then displays all open
windows. This is similar to a typical Windows program option.
•	The drop-down options list also includes a list of all open windows with a check mark in
front of the active window. Click on any of the windows listed to make that window
active. This is especially useful if you have more than 20 windows open, as the
navigation panel only holds the first 20 windows.
ua IVWWa Help
Cascade

Tde Vertically

Tile Horizontally

<~ 1 Worksheet.wst
—
i
183
-------
Chapter 13
Help
When the Help menu is clicked, the following window will appear.
^ProUCL 4.0 - {WorkSheet.wst]
,¦0 F3e Edit Configure Summary Statistics ROSEst. NDs Graphs Outker icsts Goodness-of-Fit Hypothesis Testng Background UC1 Window
0|<2>l BlBlffll.mi
About ProUCl
Navigation Panel ]
_
0 1 1
2 !
J
;
5
6 ! / ! 8 j j Technical Support
P
Name |
I
1



....1. 	L 1 " 1	1
0 WorkSheet.wst
I 2 i . ¦
: , - - : - J • i- -
Three options are available under Help menu:
•	About ProUCL: This option provides a brief description of ProUCL 4.0, and all
improvements made compared to ProUCL 3.0.
•	Statistical Help: This option executes an online help directory. This option provides
information about the various algorithms and formulae (with references) used in the
development of ProUCL 4.0. More information on the various topics covered under
Statistical Help is provided below.
•	Technical Support: This option will provide contact information for primary technical
support via phone and e-mail.
Statistical Help provides online help notes for the methods and options available in ProUCL 4.0. A screen
(shown below), listing the topics containing help notes, appears after clicking on the Help menu.
b lbc*t«rf4ra Urtj
ProUCL 4.0 Online Help Directoiy
fl/Wt tfi
ftm C^fraec u.CLi)
'ff start' , m !

185
-------
The online help directory is divided into the following submenus:
•	Getting Started: This chapter provides basic information on the software, including
software installation to various menu displays.
•	Summary Statistics: This chapter provides information and examples on procedures for
simple classical summary statistics for data sets with and without nondetects.
•	ROS Estimates of NDs: This chapter briefly describes the estimation (extrapolation) of
nondetects using regression on order statistics (ROS) for normal, lognormal, and gamma
distribution.
•	Graphs: This chapter provides information and examples on the graphical displays that
ProUCL 4.0 can produce: box plots, histograms, and Multi Q-Q plots.
•	Outlier Tests: This chapter provides information and examples for two classical outlier
tests available in ProUCL 4.0: Dixon's and Rosner's tests for data sets with and without
outliers.
•	Goodness-of-Fit: This chapter provides information and examples for several goodness-
of-fit tests available in ProUCL 4.0 for data sets with and without NDs.
•	Background Statistics: This chapter provides information and examples for the
computation of Background Statistics needed to estimate the BTVs and not-to-exceed
values. These statistics are sometimes used to compare point-by-point site data (not more
than 4 to 5 site samples) with the BTVs.
•	Hypotheses Testing: This chapter provides brief descriptions (with examples) of the
various single sample and two-sample hypotheses testing approaches as incorporated in
ProUCL 4.0.
o Single Sample Hypotheses Testing: This chapter provides brief description and
examples of single sample hypotheses testing approaches that are useful to compare
site concentrations with cleanup standards, compliance limits, or not-to-exceed
limits. The minimum sample size requirements for site data are briefly discussed in
Chapter 1 of this User Guide,
o Two-sample hypotheses are used to compare site data with background data (or
upgradient and downgradient wells) provided enough site and background data are
available. The minimum sample size requirements for site and background data are
briefly discussed in Chapter 1 of this User Guide.
•	Upper Confidence Limits (UCLs): This option provides information and examples for the
various UCL computation methods as incorporated in ProUCL 4.0.
186
-------
Chapter 14
Handling the Output Screens and Graphs
Copying Graphs
1. Click the graph you want to copy or save in the Navigation Panel. The graphs can be saved using
the copy option.
^ ProUCL 4.0 - [Histo Group.gst]
0@B
1 File Ed* Configure Window t«elp
I - k]
p|l BlBlml ml
Navigation Panel
0 work she et.wst
0 example.wst
Histo Group.gst
2 Histo_Arsenic (sub
(3 Histo_Arsenic (surf
Name
Histogram Plot for Group
3
cr
S Arsenic (subsurface )^| Arsenic (surface)
Log Panel
LOG: 2:03:21 PM >[Information) Box_Arsenie (4.5).gst closed!
LOG: 2.03:21 PM > [Information) Box_Arsenic (4.3 ).gst closed!
LOG: 2:03:33 PM >(Information) Histogram was generated!
LOG: 2:03:33 PM ^{Information} Histogram was generated!
2. Click Edit ~ Copy Graph.
File
Edit
Configure Window Help

Copy Graph

3. Once the user has clicked "Copy Graph," the graph is ready to be imported (pasted) into most
Microsoft Office applications (Word, Excel, and PowerPoint have been tested) by clicking
Edit ~ Paste in these Microsoft applications.
187
-------
4. It is important to note that the graph cannot be saved as its own file and must be imported into an
application to be saved. This will save the graph, but the overall file attribute and properties will
be that of the application in which the graph was saved in. For example, if the graph was saved
within Microsoft Word, the graph will be saved in a document with a .doc extension.
Printing Graphs
1.	Click the graph you want to print in the Navigation Panel.
2.	Click File ~ Page Setup.
QS Edit Configure Window Help
New
Open ...
Load Excel Data
Close
Save
Save As .,.
Page Setup
Print
Print Preview
Exit
3. Check the radio button next to Portrait or Landscape, and click OK. In some cases, with larger
headings and captions, it may be desirable to use the Landscape printing option.
4. Click File ~ Print to print the graph, and File ~ Print Preview to preview the graph before
printing.
188
-------
J Edit Configure Window Help
New
Open ,,,
Load Excel Data
Close
Save
Save As ...
Page Setup
Print
Print Preview
Exit
Printing Non-graphical Outputs
1. Click the output you want to copy or print in the Navigation Panel.
[P|proUa 4.0 - [output.ostj .1


Qd®
| He Ed* Configure iVrndow ftetp


- & x I
Ie3|
4 5000 Observations < largest NO ate treated as NDs

" i"


1 Number beated as Non-Detecl'
3*
T "


Number treated as Detected,
1 Sncie DL Detection Percentage]
17
" 05 Ottt' '
- t -

. 	 .
|JQ_
	 	
V

iiLI



Log Panel j




LOO 6 17 37 PM »(lnformalion] D lexample wst loaded1
LOG 6 1 7 45 PM ^Information] Generated results'
LOO 6 19 02 PM »(lntoimalion] Generated results'
LOG. 6 30 50 PM »[lnfotrnalionl Q-Q Plot graph for Atsernc was generated'


Done



A
189
-------
2.
Click File ~ Print.
Edit Configure Window Help
New
Open ..,
Load Excel Data
Close
Save
Save As ...
Page Setup
Print
Print Preview
Exit
Saving Output Screens as Excel Files
1.	Click on the output you want to save in the Navigation Panel List.
2.	Click File ~ Other Files... ~Export Excel (preserve formatting).
190
-------
3.
Enter the desired file name you want to use, and click Save, and save the file in the desired folder
using your browser.
Save in. IW Desktop
OS-
My Network
Places
3 My Computer
^3 My Network Places
My Recent
Documents
Desktop
SB
My Computer
"3
CP
File name.
Save as type.

Excel Workbook (* xls)
"3
Save
Cancel
191
-------
Chapter 15
Recommendations to Compute a 95% UCL (Estimate of EPC
Term) of the Population Mean, fj1t Using Symmetric and
Positively Skewed Full Data Sets without any Nondetects
This chapter describes the recommendations for the computation of a 95% UCL of the unknown
population arithmetic mean, \x\, of a contaminant data distribution based upon full data sets without any
nondctect observations. These recommendations are based upon the findings of Singh, Singh, and
Engelhardt (1997, 1999); Singh et al. (2002a); Singh, Singh, and Iaci (2002b); and Singh and Singh
(2003). These recommendations are applicable to full data sets without censoring and nondetcct (ND)
observations. Recommendations to compute UCL95 based upon data sets with NDs are summarized in
Chapter 16.
For skewed parametric as well as nonparametric data sets, there is no simple solution to compute a 95%
UCL of the population mean, //|. Contrary to the general conjecture, Singh et al. (2002a); Singh, Singh,
and laci (2002b); and Singh and Singh (2003) noted that the UCLs based upon the skewness adjusted
methods, such as the Johnson's modified t and Chen's adjusted-CZT do not provide the specified
coverage (e.g., 95 %) to the population mean even for mildly to moderately skewed (e.g., a the sd of log-
transformed data in interval [0.5, 1.0)) data sets for samples of size as large as 100. The coverage of the
population mean by the skewness-adjusted UCLs becomes much smaller than the specified coverage of
0.95 for highly skewed data sets, where skewness is defined as a function of a or a (sd of log-
transformed data).
It is noted that even though, the simulation results for highly skewed data sets of small sizes suggest that
the bootstrap t and Hall's bootstrap methods do approximately provide the adequate coverage to the
population mean, sometimes in practice these two bootstrap methods yield erratic inflated values (orders
of magnitude higher than the other UCL values) when dealing with individual highly skewed data sets of
small sizes. This is especially true when potential outliers may be present in the data set. ProUCL 4.0
provides warning messages whenever the recommendations are made regarding the use the bootstrap t
method or Hall's bootstrap method.
15.1 Normally or Approximately Normally Distributed Data Sets
•	For normally distributed data sets, a UCL based upon the Student's t-statistic provides the
optimal UCL of the population mean. Therefore, for normally distributed data sets, one
should always use a 95% UCL based upon the Student's t-statistic.
•	The 95% UCL of the mean based upon Student's t-statistic may also be used when the
Sd, sy of the log-transformed data, is less than 0.5, or when the data set approximately
follows a normal distribution. Typically, a data set is approximately normal when the
normal Q-Q plot displays a linear pattern (without outliers and jumps of significant
magnitude) and the resulting correlation coefficient is high (e.g., 0.95 or higher). The
jumps and breaks in a Q-Q plot (even with a high correlation coefficient) suggest the
presence of multiple populations in the data set under study.
193
-------
•	Student's t-UCL may also be used when the data set is symmetric (but possibly not
normally distributed). A measure of symmetry (or skewness) is &3. A value of ky close
to zero (e.g., if the absolute value of the skewness is roughly less than 0.2 or 0.3) suggests
approximate symmetry. The approximate symmetry of a data distribution can also be
judged by looking at the histogram of the data set.
15.2 Gamma Distributed Skewed Data Sets
In practice, many skewed data sets can be modeled both by a lognormal distribution and a gamma
distribution, especially when the sample size is smaller than 70-100. As well known, the 95% H-UCL of
the mean based upon a lognormal model often behaves in an erratic manner. Specifically, 95% H-UCL
often results in an unjustifiably large and impractical 95% UCL value when the sample size is small (e.g.,
n <20, 50, ..) and skewness is high. Moreover, it is also observed that a 95% UCL based upon Land H-
statistic becomes even smaller than the sample arithmetic mean. This is especially true for mildly skewed
to moderately skewed data sets of large sizes (e.g., > 50-70). In such cases, a gamma model, G(£, 6), may
be used to compute a reliable 95% UCL of the unknown population mean, ft\.
•	Many skewed data sets follow a lognormal as well as a gamma distribution. It should be
noted that the population means based upon the two models could differ significantly. A
lognormal model based upon a highly skewed (e.g., a > 2.5) data set will have an
unjustifiably large and impractical population mean, and its associated UCL. The
gamma distribution is better suited to model positively skewed environmental data sets.
•	One should always first check if a given skewed data set follows a gamma distribution. If
a data set does follow a gamma distribution or an approximate gamma distribution, one
should compute a 95% UCL based upon a gamma distribution. Use of highly skewed
(e.g., a > 2.5-3.0) lognormal distributions should be avoided. For such highly skewed
lognormally distributed data sets that cannot be modejed by a gamma or an approximate
gamma distribution, nonparametric UCL computation methods based upon the
Chebyshev inequality may be used.
•	The five bootstrap methods do not perform better than the two gamma UCL computation
methods. It is noted that the performances (in terms of coverage probabilities) of the
bootstrap t and Hall's bootstrap methods are very similar. Out of the five bootstrap
methods, bootstrap t and Hall's bootstrap methods perform the best (with coverage
probabilities for population mean closer to the nominal level of 0.95). This is especially
true when the skewness is quite high (e.g., k < 0.1) and the sample size is small (e.g., n <
10-15). Whenever the use of Hall's UCL or bootstrap t UCL is recommended, an
informative warning message about their use is also provided.
•	Contrary to the conjecture, the bootstrap BCA method does not perform better than the
Hall's method or the bootstrap t method. The coverage for the population mean, /
-------
•	From the results presented in Singh, Singh, and Iaci (2002b), and in Singh and Singh
(2003), it is concluded that for data sets which follow a gamma distribution, a 95% UCL
of the mean should be computed using the adjusted gamma UCL when the shape
parameter, k, is: 0.1 < k < 0.5, and for values of k > 0.5, a 95% UCL can be computed
using an approximate gamma UCL of the mean, //].
•	For highly skewed gamma distributed data sets with k < 0.1, the bootstrap t UCL or
Hall's bootstrap (Singh and Singh 2003) may be used when the sample size is smaller
than 15, and the adjusted gamma UCL should be used when the sample size starts
approaching or exceeding 15. The small sample size requirement increases as the
skewncss increases (that is, as k decreases, the required sample size, n, increases).
•	The bootstrap t and Hall's bootstrap methods should be used with caution as these
methods may yield erratic, unreasonably inflated, and unstable UCL values, especially in
the presence of outliers. In case Hall's bootstrap and bootstrap t methods yield inflated
and erratic UCL results, the 95% UCL of the mean should be computed based upon the
adjusted gamma 95% UCL. ProUCL 4.0 prints out a warning message associated with the
recommended use of the UCLs based upon the bootstrap t method or Hall's bootstrap
method. The recommendations for gamma distribution are summarized in Table 15-1.
Table 15-1. Computation of a UCL95 of the Unknown Mean,//,, of a Gamma Distribution
f(	Sample Size, n Recommendation
k s o.5	For all n	Approximate gamma 95% UCL
0.1 ^ k < 0.5	F°r a" n	Adjusted gamma 95% UCL
95% UCL based upon bootstrap t
or Hall's bootstrap method*
Adjusted gamma 95% UCL if available,
otherwise use approximate gamma 95% UCL
*lf bootstrap t or Hall's bootstrap methods yield erratic, inflated, and unstable UCL values (which often
happens when outliers are present), the UCL of the mean should be computed using the adjusted gamma
UCL.
15.3 Lognormally Distributed Skewed Data Sets
For lognormally, LN (//, cr), distributed data sets, the H-statistic-based H-UCL provides specified 0.95,
coverage for the population mean for all values of a. However, the H-statistic often results in unjustifiably
large UCL values that do not occur in practice. This is especially true when the skewness is high (e.g., a >
2.0). The use of a lognormal model unjustifiably accommodates large and impractical values of the mean
concentration and its UCLs. The problem associated with the use of a lognormal distribution is that the
population mean, ^i, of a lognormal model becomes impractically large for larger values of a, which in
turn results in an inflated H-UCL of the population mean,//|. Since the population mean of a lognormal
model becomes too large, none of the other methods, except for the H-UCL, provides the specified 95%
coverage for that inflated population mean,//|. This is especially true when the sample size is small and
k <0.1
k <0.1
n< 15
n* 15
195
-------
the skewncss is high. For extremely highly skewed data sets (with a > 2.5-3.0) of smaller sizes (e.g., < 70-
100), the use of a lognormal distribution-based H-UCL should be avoided (e.g., sec Singh et al. (2002a)
and Singh and Singh (2003)). Therefore, alternative UCL computation methods, such as the use of a
gamma distribution or the use of a UCL based upon nonparametric bootstrap methods or the Chebyshev
inequality-based methods, are desirable. All skewed data sets should first be tested for a gamma
distribution. For lognormally distributed data (that cannot be modeled by gamma distribution), methods
summarized in Table 15-2 may be used to compute a 95% UCL of mean.
ProUCL can compute an H-UCL for samples of sizes up to 1000. For highly skewed lognormally
distributed data sets of smaller sizes, alternative methods to compute a 95% UCL of the population mean,
//i, are summarized in Table 15-2. Since skewness is a function of o (or a), the recommendations for the
computation of the UCL of the population mean are also summarized in terms of a and the sample size,
n. Here, a is an MLE of o, and is given by the Sd of log-transformed data. Note that Table 15-2 is
applicable only to the computation of a 95% UCL of the population mean based upon lognormally
distributed data sets without nondetect observations.
Table 15-2. Computation of a 95% UCL of Mean,//; of a Lognormal Population
a
Sample Size, n
Recommendation
b < 0.5
For all n
Student's t, modified t, or H-UCL
0.5 s (j <1 0
For all n
H-UCL
1.0 s a <1.5
n <25
95% Chebyshev (MVUE) UCL
n* 25
H-UCL
1.5 s <7 <2.0
n< 20
99% Chebyshev (MVUE) UCL
20 s n < 50
95% Chebyshev (MVUE) UCL
n 2 50
H-UCL
1.5 5 a <2.0
n <20
99% Chebyshev (MVUE) UCL
20 ^ n < 50
97.5% Chebyshev (MVUE) UCL
50 5 n < 70
95% Chebyshev (MVUE) UCL
(7S 70
H-UCL
2.5 ^ a < 3.0
n< 30
Larger of 99% Chebyshev (MVUE) UCL or
99% Chebyshev (Mean, Sd)
30 £ n < 70
97.5% Chebyshev (MVUE) UCL
70 ±n< 100
95% Chebyshev (MVUE) UCL
ns 100
H-UCL
3.0 5 a £3.5
n< 15
Hall's bootstrap method*
15 ^ n < 50
Larger of 99% Chebyshev (MVUE) UCL or
99% Chebyshev (Mean, Sd)
50 sn< 100
97.5% Chebyshev (MVUE) UCL
100 £ n < 150
95% Chebyshev (MVUE) UCL
ns 150
H-UCL
a > 3.5
For all n
Use nonparametric methods*
196
-------
*If Hall's bootstrap method yields an erratic or unrealistically large UCL value, then the UCL of the mean
may be computed based upon the Chebyshev inequality.
15.4 Data Sets without a Discernable Skewed Distribution - Nonparametric
Methods for Skewed Data Sets
The use of gamma and lognonnal distributions as discussed here cover a wide range of skewed data
distributions. For skewed data sets which are neither gamma nor lognormal, one can use a nonparametric
Chebyshev UCL or Hall's bootstrap UCL (for small samples) of the mean to estimate the EPC term.
• For skewed nonparametric data sets with negative and zero values, use a 95% Chebyshev
(Mean, Sd) UCL for the population mean,
For all other nonparametric data sets with only positive values, the following method may be used to
estimate the EPC term.
•	For mildly skewed data sets with a < 0.5, one can use the Student's t-statistic or
modified t-statistic to compute a 95% UCL of mean, fit.
•	For nonparametric moderately skewed data sets (e.g., a or its estimate, a, in the interval
(0.5, 1]), one may use a 95% Chebyshev (Mean, Sd) UCL of the population mean, fi\.
•	For nonparametric moderately to highly skewed data sets (e.g., a in the interval (1.0,
2.0]), one may use a 99% Chebyshev (Mean, Sd) UCL or a 97.5% Chebyshev (Mean, Sd)
UCL of the population mean, pi\, to obtain an estimate of the EPC term.
•	For highly skewed to extremely highly skewed data sets with a in the interval (2.0, 3.0],
one may use Hall's UCL or a 99% Chebyshev (Mean, Sd) UCL to compute the EPC term.
•	Extremely skewed nonparametric data sets with a exceeding 3 provide poor coverage.
For such highly skewed data distributions, none of the methods considered provide the
specified 95% coverage for the population mean, n\. The coverage provided by the
methods decrease as a increases. For such data sets of sizes less than 30, a 95% UCL can
be computed based upon Hall's bootstrap method or bootstrap t method. Hall's bootstrap
method provides the highest coverage (but less than 0.95) when the sample size is small.
It is noted that the coverage for the population mean provided by Hall's method (and
bootstrap t method) docs not increase much as the sample size, n, increases. However, as
the sample size increases, coverage provided by the 99% Chebyshev (Mean, Sd) UCL
method also increases. Therefore, for larger samples, a UCL should be computed based
upon the 99%> Chebyshev (Mean, Sd) method. This large sample size requirement
increases as a increases. These recommendations are summarized in Table 15-3.
197
-------
Tabic 15-3. Computation of 95% UCL of Mean,/<;, Based Upon a Skewed Data Set (with all positive values) without a
Discernablc Distribution, where <7 is sd of Log-Transformed Data
a
Sample Size, n
Recommendation
a < 0.5
For all n
95% UCL based on Student's t or modified t-statistic
0 5 < a s 1.0
For all n
95% Chebyshev (Mean, Sd) UCL
1.0 < <7 £2.0
n< 50
99% Chebyshev (Mean, Sd) UCL
ns 50
97.5% Chebyshev (Mean, Sd) UCL
2.0 < o <. 3.0
n <10
Hall's Bootstrap UCL*
n 210
99% Chebyshev (Mean, Sd) UCL
30 < a £3.5
n < 30
Hall's Bootstrap UCL*
n £ 30
99% Chebyshev (Mean, Sd) UCL
a > 3.5
n< 100
Hall's Bootstrap UCL'
nt 100
99% Chebyshev (Mean, Sd) UCL
*If Hall's bootstrap method yields an erratic and unstable UCL value (e.g., happens when outliers are
present), a UCL of the population mean may be computed based upon the 99% Chebyshev (Mean, Sd)
method. The results as summarized in Tables 15-1 through 15-3 are summarized in Table 15-4, shown on
the next page.
198
-------
Table 15-4. Recommended UCL95 Computation Methods for Full-Uncensored Data Sets without Nondetect Observations
Skewness
Sample Size
95% Student t
95% modified t
95% H-UCL
95% Hall's Bootstrap
95% Chebyshev (MVUE)
97.5% Chebyshev (MVUE)
99% Chebyshev (MVUE)
95% Chebyshev (Mean, Sd)
97.5% Chebyshev (Mean, Sd)
99% Chebyshev (Mean, Sd)
95% Approx. Gamma
95% Bootstrap t
95% Adjusted Gamma
Normal or Approximate Normal (with a < 0.5) Distribution

All n
•
•











Gamma Distribution
k <0.1
n< 15



•







•

n> 15










•

•
0.1 <. k <0.5
All n












•
k 2 0.5
All n










•


Lognormal Distribution
a < 0.5
All n
•
•
•










0.5 s it <1 0
All n


•










1.0 ^ a < 1.5
n< 25




•








n s 25


•










1.5s a <2.0
n < 20






•






20 s n < 50




•








n 2 50


•










2.0 ^ a <2.5
n <20






•






20 ^ n < 50





•







50 S n < 70




•








n s 70


•










2.5 £ a < 3.0
n < 30






•


•



30 £ n < 70





•







70 £ n < 100




•








n2 100


•










3.0 SffS3.5
n< 15



•









15 ^ n < 50






•


•



50 sn< 100





•







100 sn< 150




•








ni 150


•










199
-------
Table 15-4. Recommended UCL95 Computation Methods for Full-Unccnsored Data Sets without Nondetect
Observations-Continued

Sample Size
95% Student t
95% modified t
95% H-UCL
95% Hall's Bootstrap
95% Chebyshev (MVUE)
97.5% Chebyshev (MVUE)
99% Chebyshev (MVUE)
95% Chebyshev (Mean, Sd)
97.5% Chebyshev (Mean, Sd)
99% Chebyshev (Mean, Sd)
95% Approx. Gamma
95% Bootstrap t
95% Adjusted Gamma
Nonparametric Distribution Free Methods
a <, 0.5
All n
•
•











0.5 < <7 ^ 1.0
All n







•





1.0 < a £ 2.0
n < 50









•



n 2 50








•




2.0 < a <, 3.0
n< 10



•









ni 10









•



3 0 < a <, 3.5
n < 30 •



•









rn 30









•



a > 3.5
n< 100



•









n 2 100









•



15.5 Should the Maximum Observed Concentration be Used as an
Estimate of the EPC Term?
This topic has been discussed earlier in Chapter 1. It is included here only for the convenience of the user.
In practice, a typical user tends to use the maximum sample value as an estimate of the EPC term. This is
especially true when the sample size is small or the data are highly skewed. The discussion and
suggestions as described in Chapter 1 apply to both Chapters 15 and 16. Singh and Singh (2003) studied
the max test (using the maximum observed value as an estimate of the EPC term) in their simulation
study. Previous (e.g., RAGS document (EPA, 1992)) use of the maximum observed value has been
recommended as a default option to estimate the EPC term when a 95% UCL (e.g., the H-LJCL) exceeded
the maximum value. Only two 95% UCL computation methods, namely the Student's t-UCL and Land's
H-UCL, were used previously to estimate the EPC term (e.g., EPA, 1992). ProUCL 4.0 can compute a
95% UCL of the mean using several methods based upon the normal, gamma, lognormal, and
"nonparamctric" distributions. Furthermore, since the EPC term represents the average exposure
contracted by an individual over an exposure area (EA) during a long period of time, the EPC term should
be estimated by using an average value (such as an appropriate 95% UCL of the mean) and not by the
maximum observed concentration.
200
-------
Singh and Singh (2003) also noted that for skewed data sets of small sizes (e.g., < 10-20), the max test
does not provide the specified 95% coverage to the population mean, and for larger data sets, it
overestimates the EPC term which may require unnecessary further remediation. For the distributions
considered, the maximum value is not a sufficient statistic for the unknown population mean. The use of
the maximum value as an estimate of the EPC term ignores most (except for the maximum value) of the
information contained in a data set. It is, therefore, not desirable to use the maximum observed value as
an estimate of the EPC term representing average exposure by an individual over an EA.
It is recommended that the maximum observed value NOT be used as an estimate of the EPC term.
For the sake of interested users, ProUCL displays a warning message when the recommended 95% UCL
(e.g., Hall's bootstrap UCL) of the mean exceeds the observed maximum concentration. When a 95%
UCL exceeds the maximum observed value, ProUCL recommends the use of an alternative UCL method
based upon a 97.5% or 99% Chebyshev UCL.
It should also be noted that for highly skewed data sets, the sample mean indeed can even exceed the
upper 90%, or higher, etc., percentiles, and consequently, a 95% UCL of the mean can exceed the
maximum observed value of a data set. This is especially true when one is dealing with lognormally
distributed data sets of small sizes. For such highly skewed data sets which cannot be modeled by a
gamma distribution, a 95% UCL of the mean should be computed using an appropriate nonparametric
method as summarized in Table 15-4.
201
-------
Chapter 16
Recommendations to Compute a 95% UCL of the Population
Mean, //*, Using Data Sets with Nondetects with Multiple
Detection Limits
This chapter summarizes the recommendations based on the simulation experiments conducted by Singh,
Maichle, and Lee (USEPA, 2006) to compare the performances of the UCL computation methods based
upon data sets with BDLs and multiple detection limits (DLs). ProUCL 4.0 suggests the use of
appropriate UCLs based upon the findings of Singh, Maichle, and Lee (USEPA, 2006). For convenience,
the recommended UCL95 computation methods have been tabulated in Table 16 as functions of the
sample size, skewness, and censoring intensity. General observations and recommendations regarding the
difficulties associated with data sets with NDs are described first.
16.1 General Recommendations and Suggestions
•	In practice, it is not easy to verify (perform goodness-of-fit) the distribution of a left-
censored data set with NDs. Therefore, emphasis is given on the use of nonparametric
UCL95 computation methods, which can also be used to handle multiple detection limits.
•	This is specifically true when the percentage (%) of nondetects exceeds 40%-50%.
•	Most of the parametric MLE methods assume that there is only one detection limit. It
should also be noted that the MLEs behave in an unstable manner when the % of NDs
exceeds 40%-50%. Moreover, as mentioned before, it is hard to verify and justify the
conclusion of a GOF test for data sets with nondetects in excess of 40%-50%.
•	Therefore, for data sets with many nondetects (> 40%-50%), it is suggested to use
nonparametric methods to estimate the various environmental parameters (BTVs, EPC
terms) of interest and to perform site versus background comparisons.
•	In practice, a left-censored data set often has multiple detection limits. For such methods,
the KM method can be used. ProUCL 4.0 provides UCL computation methods that can
be used on data sets with multiple detection limits including the DL/2 method, KM
method, and robust ROS method.
o As mentioned earlier, for reliable and accurate results, it is suggested that the user make
sure that the data set under study represents a single statistical population (e.g.,
background reference area, or an AOC) and not a mixture population (e.g., clean and
contaminated site areas).
•	It is recommended to identify all of the potential outliers and study them separately. The
computation of the statistics such as UCL95 and background statistics should be based
upon the majority of the data set representing a single dominant population. Decisions
about the appropriate disposition (include or not include) of outliers should be made by
all interested members of the project team. When in doubt, it is suggested to compute and
203
-------
compare all relevant statistics and estimates based upon data sets with and without the
outliers. This step will exhibit the undue influence of outliers on the statistics of interest
such as UCL95 and UPL95.
Simple classical outlier identification methods (Dixon test and Rosner test) are also
available in ProUCL 4.0. More effective robust outlier procedures (e.g., Rousseeuw and
Leroy (1987) and Singh and Nocerino (1995)) are available in Scout (1999).
In case a data set represents a mixture sample (from two or more populations), one should
partition the mixture sample into component sub-samples (e.g., Singh, Singh, and
Flatman (1994)).
Avoid the use of transformations (to achieve symmetry) while computing the upper limits
for various environmental applications, as all remediation, cleanup, background
evaluation decisions, and risk assessment decisions have to be made using statistics in the
original scale. Also, it is more accurate and easier to interpret the results computed in the
original scale. The results and statistics computed in the original scale do not suffer from
an unknown amount of transformations bias.
Specifically, avoid the use of a lognormal model even when the data appear to be
lognormally distributed. Its use often results in incorrect and unrealistic statistics of no
practical purpose or importance or significance. Several variations of estimation methods
(e.g., robust ROS and FP-ROS on log-transformed data, delta lognormal method) on log-
transformed data have been developed and used by the practitioners. This has caused
some confusion among the users of the statistical methods dealing with environmental
data sets. The proper use of a lognormal distribution (e.g., how to properly back-
transform UCL of mean in the log-scale to obtain a UCL of mean in original scale) is not
clear to many users, which in turn may result in the incorrect use and computation of an
estimate (= UCL95) of the population mean.
The parameter in the transformed space may not be of interest to make cleanup decisions.
The cleanup and remediation decisions are often made in the original raw scale;
therefore, the statistics (e.g., UCL95) computed in transformed space need to be back-
transformed in the original scale. It is not clear to a typical user how to back-transform
results in the log-scale or any other scale obtained using a Box-Cox (BC)-type
transformation to original raw scale. The transformed results often suffer from significant
amount of transformation bias.
The question now arises-how one should back-transform results from a log-space (or any
other transformed space) to the original space? Unfortunately, no defensible guidance is
available in the environmental literature to address this question. Moreover, the back-
transformation formula will change from transformation to transformation (BC-type
transformations), and the bias introduced by such transformations will remain unknown.
Therefore, in cases when a data set in the "raw" scale cannot be modeled by a parametric
distribution, it is desirable to use nonparametric methods (many available in ProUCL 4.0)
rather than testing or estimating a parameter in the transformed space.
On page (78) of Helsel (2005), the use of the robust ROS MLE method (Kroll, C.N. and
J.R. Stedinger (1996)) has been suggested to compute summary statistics. In this hybrid
-------
method, MLEs are computed using log-transformed data. Using the regression model as
given by equation (3-21) of Section 3, the MLEs of the mean (used as intercept) and sd
(used as slope) in the log-scale are used to extrapolate the NDs in the log-scale. Just like
in the robust ROS method, all of the NDs are transformed back in the original scale by
exponentiation. This results in a full data set in the original scale. One may then compute
the mean and sd using the full data set. The estimates thus obtained are called robust ROS
ML estimates (Helsel (2005), and Kroll and Stedinger (1996)). However, the
performance of such a hybrid estimation method is not well known. Moreover, for higher
censoring levels, the MLE methods sometimes behave in an unstable manner, especially
when dealing with moderately skewed to highly skewed data sets (e.g., with o>1.0).
o It should be noted that the performance of this hybrid method is unknown,
o It is not clear why this method is called a robust method,
o The stability of the MLEs obtained using the log-transformed data is doubtful,
especially for higher censoring levels,
o The BCA and (% bootstrap) UCLs based upon this method will fail to provide the
adequate coverage for the population mean for moderately skewed to highly skewed
data sets.
•	The DL/2 (t) UCL method does not provide adequate coverage (for any distribution and
sample size) for the population mean, even for censoring levels as low as 5%, 10%, 15%.
This is contrary to the conjecture and assertion (e.g., EPA (2000)) often made that the
DL/2 method can be used for lower (< 20%) censoring levels. The coverage provided by
the DL/2 (t) method deteriorates fast as the censoring intensity increases.
This DL/2 (t) UCL method is not recommended by the authors and developers of ProUCL 4.0; it is
included only for comparison or research purposes.
•	The KM method is a preferred method as it can handle multiple detection limits.
Moreover, the nonparametric UCL95 methods (KM (BCA), KM (z), KM (%), KM (t))
based upon the KM estimates provide good coverages for the population mean (e.g.,
Helsel (2005) and Singh et al. (2006)).
•	For a symmetric distribution (approximate normality), several UCL95 methods provide
good coverage (~95%) for the population mean, including the Winsorization mean,
Cohen's MLE (t), Cohen's MLE (Tiku), KM (z), KM (t), KM (%) and KM (BCA) (e.g.,
Helsel (2005) and Singh ct al. (2006)).
16.2 Recommended UCL95 Methods for Normal (Approximate Normal)
Distribution
•	For normal and approximately normal (e.g., symmetric or with sd, a < 0.5) distribution:
The most appropriate UCL95 computation methods for normal or approximately normal
distributions are the KM (t) or KM (%) methods. For symmetric distributions, both of
these methods perform equally well on left-censored data sets for all censoring levels (%
nondetects) and sample sizes.
205
-------
16.3 Recommended UCL95 Methods for Gamma Distribution
•	Highly skewed gamma distributions, G(k, 0), with shape parameter, k < 1:
o Use the nonparametric KM (Chebyshev) UCL95 method for censoring levels < 30%.
o Use the nonparametric KM (BCA) UCL95 method for censoring levels in the interval
[30%, 50%).
o Use the nonparametric KM (t) UCL95 method for censoring levels > 50%.
•	Moderately skewed gamma distributions, G(k, 0), with shape parameter, 1< k < 2:
o For censoring level < 10%, use the KM (Chebyshev) UCL95 method.
o For higher censoring levels [10%, 25%), use the KM (BCA).UCL95 method,
o For censoring levels in [25%, 40%), use the KM (%) UCL95 method,
o For censoring levels > 40%, use the KM (t) UCL95 method.
•	Mildly skewed gamma distributions, G(k, 0), with k > 2:
o Use the KM (BCA) UCL95 method for lower censoring levels (< 20%).
o For censoring levels in the interval [20%, 40%), use the KM (%) UCL95.
o For censoring > 40%, use the KM (t) UCL95 computation method.
16.4 Recommended UCL95 Methods for Lognormal Distribution
•	Mildly skewed data sets with a < 1:
o For censoring levels (< 20%) and sample of sizes less than 50-70, use the KM
(Chebyshev) UCL95.
o For censoring levels (< 20%) and samples of sizes greater than 50-70, use the KM
(BCA) UCL95.
o For censoring levels in the interval [20%, 40%) and all sample sizes, use the KM
(BCA) UCL95.
o For censoring level > 40%, use the KM (%) or KM (t) UCL95 method.
•	Data sets with a in the interval (1, 1.5]:
o For censoring levels < 50% and samples of sizes < 40, use the 97.5% KM
(Chebyshev) UCL.
o For censoring levels < 50% , samples of sizes > 40, use 95% KM (Chebyshev) UCL.
o For censoring levels > 50%, use the KM (BCA) UCL95 for samples of all sizes.
•	Highly skewed data sets with a in the interval (1.5, 2]:
o For sample sizes < 40, censoring levels <50%, use 99% KM (Chebyshev) UCL.
o For sample sizes > 40, censoring levels < 50%, use 97.5% KM (Chebyshev) UCL.
o For samples of sizes < 40-50 and censoring levels > 50%, use the 97.5% KM
(Chebyshev) UCL.
o For samples of sizes > 40-50, and censoring levels > 50%, use the 95% KM
(Chebyshev) UCL.
•	Use a similar pattern for more highly skewed data sets with a > 2.0, 3.0:
o For extremely highly skewed data sets, an appropriate estimate of the EPC term (in
terms of adequate coverage) is given by a UCL based upon the Chebyshev inequality
206
-------
and KM estimates. The confidence coefficient to be used will depend upon the
skewness. For highly skewed data sets, a higher (e.g., > 95%) confidence coefficient
may have to be used to estimate the EPC.
o As the skewness increases, the confidence coefficient also increases,
o For such highly skewed distributions (with a > 2.0, 3.0), for lower sample sizes (e.g.,
< 50-60), one may simply use 99% KM (Chebyshcv) UCL to estimate the population
mean, EPC term, and other relevant threshold (e.g., UPL, percentiles) values,
o For sample sizes greater than 60, one may use a 97.5% KM (Chebyshcv) UCL as an
estimate of the population mean or mass.
16.5 Recommended Nonparametric UCL Methods
•	For symmetric or approximately symmetric distribution-free, nonparametric data sets
with a < 0.5: Use the same UCL computation methods as for the data sets coming from
a normal or an approximate normal (symmetric) population. These methods are
summarized above in the normal UCL computation section.
•	For skewed distribution-free, nonparametric data sets with a > 0.5: Most of the
recommended UCL computation methods for a lognormal distribution, as described
above in the lognormal UCL section, do not assume the lognormality of the data set.
Therefore, the UCL computation methods, as described in the lognormal UCL
computation section, can be used on skewed nonparametric data sets that do not follow
any of the well-known parametric distributions.
The suggested parametric and nonparametric UCL95 computation methods for data sets with nondetect
observations are summarized in Table 16, shown on the next page.
207
-------
Tabic 16. Recommended UCL95 Computation Methods for Left-Ccnsorcd Data Sets with Nondetcct Observations
Skewness
Sample Size
% ND
95% KM (t)
95% KM (%)
95% KM (Chebyshev)
97.5% KM (Chebyshev)
99% KM (Chebyshev) -
95% KM (BCA)
Normal or Approximate Normal (with a < 0.5) Distribution
^ <0.5
All n
>0%
•
•




Gamma Distribution
k <. 1
All n
< 30%


•



All n
[30%, 50%)





•
All n
2 50%
•

'



1 < k <.2
All n
< 10%


•



All n
[10%, 25%)





•
All n
[25%, 40%)

•




All n
2 40%
•





k >2
All n
< 20%





•
All n
[20%, 40%)

•




All n
2 40%
•





Lognormal Distribution
a s 1.0
n £ 50-70
< 20%


•



n > 50-70





•
All n
[20%, 40%)





•
All n
2 40%
•
•




1 < a £ 1.5
n < 40
< 50%



•


n 2 40


•



All n
> 50%





•
1.5 < <7 £2.0
n < 40
< 50%




•

n 2 40



•


n < 40-50
> 50%



•


n 2 40-50


•



a > 2.0,3.0
n < 50-60
>0%




•

n > 60



•


208
-------
Table 16. Recommended UCL95 Computation Methods for Left-Censored Data Sets with Nondctcct Observations -
Continued














>
0)
£

>
.Q
4)
0)
£
>
ja
0)
x
Q
>
o>
£
(ft
>
JQ

o>
m
o
Symmetric or Approximate Symmetric Non-Discernable Distribution
(7 < 0.5
All n
>0%
•
•




Moderately Skewed to Highly Skewed Non-Discernable Distribution

n < 50-70
< 20%


•



0.5 £ <7 £ 1.0
n > 50-70





•
All n
[20%. 40%)





•

All n
£ 40%
•
•





n < 40
< 50%



•


1 < a £ 1.5
n 2 40


•




All n
> 50%





•

n < 40
< 50%




•

1.5 < a £2.0
n 2. 40



•


n < 40-50 .
> 50%



•



n 2 40-50


•



a > 2.0,3.0
n < 50-60
>0%




•

n> 60



•


. Note: In Table 16, phrase "All n" represents only valid (e.g., n > 3) and recommended (n > 8 to 10)
values of the sample size, n. As mentioned throughout the report, it is not desirable to use statistical
methods on data sets of small sizes (e.g., with n < 8 to 10). However, it should be noted that the_ sample
size requirements and recommendations (n > 8 to 10) as described in this report are not the limitations of
the methods considered in this report. One of the main reasons for the recommendation that the sample
size should be at least 8 to 10 is that the estimates and UCLs based upon small data sets, especially with
many below detection limit observations (e.g., 30%, 40%, 50%, and more), may not be reliable and
accurate enough to draw conclusions for environmental applications. It should be noted that in order to
be able to use bootstrap re-sampling methods, it is desirable to have a minimum of 10-15 observations
(e.g., n > 10-15). Therefore, the phrase "All n " in Table 16, should be interpreted as that the sample size,
n, is least 8 to 10. The software, ProUCL 4.0, will provide appropriate warning messages when a user
tries to use a method on data sets of small sizes.
Also, Hall's bootstrap and bootstrap t methods to compute a UCL based upon a full data set (without
nondetects) should be used with caution. These two bootstrap methods may yield erratic and unstable
UCL results, especially, when outliers are present. In such cases, it is desirable to use alternative UCL
209
-------
methods based upon Chebyshcv inequality. ProUCL software provides a warning message for erratic
UCL results based upon Hall's bootstrap t methods.
210
-------
Glossary
This glossary defines selected words in this User Guide to describe impractically large UCL values of the
unknown population mean, mt. In practice, the UCLs based upon Land's H-statistic (H-UCL), and
bootstrap methods such as the bootstrap t and Hall's bootstrap methods (especially when outliers are
present) can become impractically large. The UCLs, based upon these methods often become larger than
the UCLs based upon all other methods by several orders of magnitude. Such large UCL values are not
achievable as they do not occur in practice. Words like "unstable" and "unrealistic" are used to describe
such impractically large UCL values.
UCL: upper confidence limit of the unknown population mean.
Coverage = Coverage Probability: The coverage probability (e.g., = 0.95) of a UCL of the population
mean represents the confidence coefficient associated with the UCL.
Optimum: An interval is optimum if it possesses optimal properties as defined in the statistical literature.
This may mean that it is the shortest interval providing the specified coverage (e.g., 0.95) to the
population mean. For example, for normally distributed data sets, the UCL of the population mean based
upon Student's t-distribution is optimum.
Stable UCL'. The UCL of a population mean is a stable UCL if it represents a number of a practical merit,
which also has some physical meaning. That is, a stable UCL represents a realistic number (e.g.,
contaminant concentration) that can occur in practice. Also, a stable UCL provides the specified (at least
approximately, as much as possible, as close as possible to the specified value) coverage (e.g., -0.95) to
the population mean.
Reliable UCL'. This is similar to a stable UCL.
Unstable UCL = Unreliable UCL = Unrealistic UCL: The UCL of a population mean is unstable,
unrealistic, or unreliable if it is orders of magnitude higher than the other UCLs of population mean. It
represents an impractically large value that cannot be achieved in practice. For example, the use of Land's
H-statistic often results in impractically large inflated UCL value. UCLs such as the bootstrap t UCL and
Hall's UCL can be inflated by outliers, resulting in an impractically large and unstable value. All such
impractically large UCL values are called unstable, unrealistic, unreliable, or inflated UCLs in this User
Guide.
211
-------
References
Breckcnridgc, R.P. and Crockett, A.B. (1995). Determination of Background Concentrations of
Inorganics in Soils and Sediments at Hazardous Waste Sites. EPA/540/S-96/500.
Cohen, A. C. Jr. 1991. Truncated and Censored Samples. 119, Marcel Dckker Inc. New York,
NY, 1991.
Conover, W. J. (1999). Practical Nonparametric Statistics. Third Edition. John Wiley.
Department of Navy (Navy, 1999). Handbook for Statistical Analysis of Environmental Background
Data. Prepared by SWDIV and EFA West of Naval Facilities Engineering Command. July 1999.
Department of Navy (Navy, 2002a). Guidance for Environmental Background Analysis. Volume 1 Soil.
Naval Facilities Engineering Command. April 2002.
Department of Navy (Navy, 2002b). Guidance for Environmental Background Analysis. Volume 2
Sediment. Naval Facilities Engineering Command. May 2002.
Department of Navy (Navy, 2003). Guidance for Environmental Background Analysis. Volume 3
Groundwater. Naval Facilities Engineering Command. May 2002.
Gerlach, R. W., and Nocerino, J. M. (2003). Guidance for Obtaining Representative Laboratory Analytical
Subsamples from Particulate Laboratory Samples, EPA/600/R-03/027, Nov, 2003.
Gilbert, R.O. (1987), Statistical Methods for Environmental Pollution Monitoring. John Wiley and Sons,
New York.
Hahn, J. G., and Meeker, W.Q. (1991). Statistical Intervals. A Guide for Practitioners. John Wiley, New
York.
Hclscl, D.R., Hirsch, R.M. (1994). Statistical Methods in Water Resources. John Wiley.
Hinton, S.W. 1993. ALognormal Statistical Methodology Performance. ES&T Environmental
Sci. Technol., Vol. 27, No. 10, pp. 2247-2249.
Hogg, R.V., and Craig, A. (1995). Introduction to Mathematical Statistics; 5th edition. Macmillan.
Johnson, N.L., Kotz, S. and Balakrishnan, N. 1994. Continuous Univariate Distributions, Vol. 1.
Second Edition. John Wiley, New York.
Kaplan, E.L. and Meier, O. 1958. Nonparametric Estimation from Incomplete Observations.
Journal of the American Statistical Association, Vol. 53. 457-481.
Land, C. E. (1971), "Confidence Intervals for Linear Functions of the Normal Mean and Variance,"
Annals of Mathematical Statistics, 42, 1187-1205.
213
-------
Land, C. E. (1975), "Tables of Confidence Limits for Linear Functions of the Normal Mean and
Variance," in Selected Tables in Mathematical Statistics, Vol. Ill, American Mathematical
Society, Providence, R.I., 385-419.
Massachusetts DEP (1995). Guidance for Disposal Site Risk Characterization. Interim Final Policy
WSC/ORS - 94-141. July 1995.
Ohio EPA (2004). Background Calculation Methodology. Division of Emergency and Remedial
Response (DERR). DERR-00-RR-039P. June 2004.
ProUCL 3.0. (2004). A Statistical Software. National Exposure Research Lab, EPA, Las Vegas
Nevada, October 2004.
Schulz, T. W., and Griffin, S. (1999), Estimating Risk Assessment Exposure Point Concentrations when
Data are Not Normal or Lognormal. Risk Analysis, Vol. 19, No. 4, 1999. A Data Analysis
Program, Technology Support Project. EPA, NERL-LV, Las Vegas, NV 89193-3478.
Shacklette, H.T, and Boerngen, J.G. (1984). Element Concentrations in Soils an Other Surficial Materials
in the Conterminous United States, U.S. Geological Survey Professional Paper 1270.
- Shumway, R.H., R.S. Azari, and M. Kayhanian. 2002. Statistical Approaches to Estimating
Mean Water Quality Concentrations with Detection Limits. Environmental Science and
Technology, Vol. 36, pp. 3345-3353.
Singh, A., Singh, A.K., and G. Flatman (1994). Estimation of Background Levels of Contaminants. Math
Geology, Vol. 26, No, 3, pp. 361-388.
Singh, A.K., Singh, A., and Engelhardt, M. 1997. The Lognormal Distribution in Environmental
Applications. Technology Support Center Issue Paper, 182CMB97, EPA/600/R-97/006.
Singh, A. and Nocerino, J.M. 2002. Robust Estimation of the Mean and Variance Using
Environmental Data Sets with Below Detection Limit Observations, Vol. 60, pp. 69-86.
Singh, A., Singh, A.K., and Iaci, R.J. 2002. Estimation of the Exposure Point Concentration
Term Using a Gamma Distribution, EPA/600/R-02/084, October 2002.
Singh, A. and Singh, A.K. 2003. Estimation of the Exposure Point Concentration Term (95%
UCL) Using Bias-Corrected Accelerated (BCA) Bootstrap Method and Several other methods
for Normal, Lognormal, and Gamma Distributions. Draft EPA Internal Report.
Singh, A., Maichle, R., and Lee, S. 2006. On the Computation of a 95% Upper Confidence Limit
of the Unknown Population Mean Based Upon Data Sets with Below Detection Limit
Observations. EPA/600/R-06/022, March 2006.
Singh, A. and Singh, A.K. 2007. ProUCL Version 4.0 Technical Guide. Publication EPA/600/R-07/041.
April 2007.
USEPA (1989). Methods for Evaluating the Attainment of Cleanup Standards, Vol. 1, Soils and Solid
Media, Publication EPA 230/2-89/042.
214
-------
USEPA. 1992. Supplemental Guidance to RAGS: Calculating the Concentration Term. Publication EPA
9285.7-08, May 1992.
USEPA (1992a). Guidance for Data Usability in Risk Assessment. Office of Emergency and Remedial
Response Publication. 9285.7-09A. PB92-963356.
USEPA (1992b). Statistical Analysis of Ground-water Monitoring Data at RCRA Facilities. Addendum to
Interim Final Guidance. Washington, DC: Office of Solid Waste. July 1992.
USEPA (1996). Soil Screening Guidance: User's Guide. Office of Solid Waste and Emergency Response,
Washington, DC. EPA/540/R-96/018, April 1996.
USEPA (1994). Statistical Methods for Evaluating the Attainment of Cleanup Standards. Volume 3:
Reference Based Standards for Soils and Solid Media. Office of Policy, Planning, and Evaluation,
Washington, DC. EPA/230/R-94/004, June 1994.
USEPA (1997). Multi-Agency Radiation Survey and Site Investigation Manual (MARSSIM). EPA 402-
R-97-016, NUREG-1575, US EPA, Washington, DC.
USEPA (2000). Guidance for Data Quality Assessment: Practical Methods for Data Analysis, EPA
QA/G-9, QA00 Version. EPA 600-R-96-084. Quality Assurance Management Staff, Washington,
DC. Available at http://www.epa.gov/quality/qa_docs.html.
USEPA. 2002a. Calculating Upper Confidence Limits for Exposure Point Concentrations at
Hazardous Waste Sites. OSWER 9285.6-10. December 2002.
USEPA. 2002b. Guidance for Comparing Background and Chemical Concentrations in Soil for
CERCLA Sites. EPA 540-R-01-003-OSWER 9285.7-41. September 2002.
USEPA (2004), ProUCL Version 3.1, a Statistical Software, National Exposure Research Lab, EPA, Las
Vegas Nevada, October 2004.
USEPA. 2006. Data Quality Assessment: Statistical Methods for Practitioners, EPA QA/G-9S.
EPA/240/B-06/003. Office of Environmental Information, Washington, D.C. Download
from: http://www.epa.gov/quality/qs-docs/g9s-final.pdf
Wyoming DEQ (2005). Establishing Site-Specific Background Metals Concentrations in Soil Under the
Voluntary Remediation Program. Fact Sheet #24.
215
-------
-------
About the CD
The GD accompanying the hard copy of this report, EPA/600/R-07/038 (April 2007), "ProUCL Version
4.0 User Guide," contains this written report in pdf and Microsoft Word formats. The CD also includes
the report, EPA/600/R-07/041 (April 2007), "ProUCL Version 4.0 Technical Guide" in pdf and Microsoft
Word formats, and the ProUCL Version 4.0 (June 13, 2007 edition), EPA7600/C-07/007 (April 2007),
statistical software, along with some sample data sets. Note that the program .NET Framework 1.1 must
be installed on the computer to run ProUCL 4.0 (the .NET Framework 1.1 install program is included on
the CD as the file entitled, dotnetfx.exe). The entire contents of the CD should be copied to a new
directory on the user's hard drive and the ProUCL 4.0 software can be run by double-clicking on the file
entitled, ProUCL.exe.
217
-------
United States
Environmental Protection
Agency
Office of Research
and Development (8101 R)
Washington, DC 20460
Official Business
Penalty for Private Use
$300
EPA/600/R-07/038
April 2007
www.epa.gov
Please make all necessary changes on the below label,
detach or copy, and return to the address in the upper
left-hand corner
If you do not wish to receive these reports CHECK HERE ~,
detach, or copy this cover, and return to the address in the
upper left-hand corner
PRESORTED STANDARD
POSTAGE & FEES PAID
EPA
PERMIT No. G-35
Recycled/Recyclsble
Printed with vegetable-based ink on
paper that contains a minimum of
50% post-consumer fiber content
processed chlorine free
-------
About the CD
The CD accompanying the hard copy of this report, EPA/600/R-07/038 (April 2007), "ProUCL Version
4.0 User Guide," contains this written report in pdf and Microsoft Word formats. The CD also includes
the report, EPA/600/R-07/041 (April 2007), "ProUCL Version 4.0 Technical Guide" in pdf and Microsoft
Word formats, and the ProUCL Version 4.0 (June 13, 2007 edition), EPA/600/C-07/007 (April 2007),
statistical software, along with some sample data sets. Note that the program .NET Framework 1.1 must
be installed on the computer to run ProUCL 4.0 (the .NET Framework 1.1 install program is included on
the CD as the file entitled, dotnetfx.exe). The entire contents of the CD should be copied to a new
directory on the user's hard drive and the ProUCL 4.0 software can be rim by double-clicking on the file
entitled, ProUCL.exe.
-------