Assessment of Errors in Soil Sampling


v>EPA
                         United States
                         Environmental Protection
                         Agency
                         Environmental Monitoring
                         Systems Laboratory
                         P.O. Box 93478
                         Las Vegas NV 89193-3478
May 1990
                         TECHNOLOGY SUPPORT PROJECT
Assessment
of  Errors  in
Soil  Sampling
                         This fact sheet is based on "A Rationale for the Assessment
                         of Errors in the Sampling of Soils" by J. Jeffrey van Ee, Louis
                         J. Blume, and Thomas H. Starks, 1990.
 INTRODUCTION
The sampling and analysis of soils for inorganic contaminants
is a complex procedure from experimental design to the final
evaluation of all generated data. Sources of error abound but
they can be successfully mitigated by careful planning or
isolated by intelligent error assessment. Error (or variability)
can be either bias or random. Biased error is indicative of a
systematic problem that can exist in any sector of soils
analysis, from sampling to data analysis. The first step in
analysis of variability (or error) is to establish a plan that will
identify errors, trace them to the step in which they occurred,
and account for variabilities to allow direct corrective action to
mitigate them. In anticipation of errors, it is essential to ask two
questions:

1 How many and what type samples are required to assess
  the quality of data in a field sampling effort?

2. How can the information from these samples be used to
  identify and control sources of error and uncertainty in the
  measurement process?

Error assessment should be understood by the field scientist
and the analyst. To aid scientists in the estimation and
evaluation of variability, the Environmental Monitoring Systems
Laboratory-Las Vegas (EMSL-LV) has developed a computer
program called ASSESS. By applying statistical formulae to
data entered, ASSESS can trace errors to their sources and
help scientists plan future studies that avoid the pitfalls of the
past.

-------
RANDOM ERRORS
BIAS ERROR
PREVENTION
Random errors can result in variations from
tjie true vikje*that are either positive or
negative but di3 not follow a pattern of
variability.  During the measurement process,
random errors may be caused by:
1) sampling variations
2) handling discrepancies
3) transportation vagaries
4) preparation dissimilarities
5) subsampling problems
6) analytical discrepancies
7) data handling eccentricities

The greatest source of error is usually the
sampling step.  In the Comprehensive
Environmental Response, Compensation,
and Liability Act of 1980 (Superfund)
(CERCLA) and the Resource Conservation
and Recovery Act (RCRA), site
investigations, analytical, and data handling
variability are checked by the CLP protocol.
When more than one laboratory is involved,
handling, transportation, subsampling, and
preparation can be checked at the Level IV
CLP step, too. But how can the analyst know
that the sample in the jar is representative of
the surrounding samples at the site? How can
the field analyst know that the more (or less)
concentrated soil didn't stick to  the auger or
split-spoon?

It is strongly recommended that the traditional
approaches used in mitigating the error in the
last six steps be applied to sampling itself, i.e.,
use of duplicates, splits, spikes, evaluation
samples, and calibration standards.  A certain
amount of random error is inherent in samples
themselves, in fact, the total variance equals
the measurement variances plus the
population variances
               (ot2 = om2 + ap2)
We can address the variance in measurement;
the population variance, however, is a true
picture of the complexity of the  soil.
Some sources of error are systematic, that is,
in a given situation conditions exist that
consistently give positive or consistently give
negative results. This skewing of data can be
initiated early in a sampling regime, e.g., by a
sampling device that alters the composition of
the soil matrix.  It can occur in the middle of
the sampling regime, e.g., by the preferential
handling of a sampler who isn't trained in the
intricacies of sample handling and
preparation. Or it can be introduced in the
later, analytical stages, where it is easier to
trace because of interlaboratory comparisons
and frequent calibration checks.  The
pervasive quality of an early bias error is its
resistance to detection and the fact that  other
variabilities are added throughout the
process until, finally, the reported data may
be significantly non-representative of the true
value. Bias errors can be traced to:
• faulty sampling design
• skewed sampling procedure
• systematic operator error
• contamination
• degradation
• interaction with containers
• displacement of phase (or chemical
  equilibria)
• inaccurate instrument calibration
To avoid both random and bias errors (or at
least to be able to pinpoint their occurrence
and estimate their extent), it is wise to plan a
study well, anticipating possible sources of
error. The inclusion of quality assurance
samples used for quality assessment and
quality control can help isolate variability and
identify its effect.
An effective technique is to concentrate
duplicate sampling early in the study and
send the samples off for rapid CLP analysis.
Dependent on the results, it may not be
necessary to include as many quality
assessment samples after these samples
demonstrate reliability.  This allowance for
early detection of sources of error can help
the field scientist customize the remainder of
the study to meet the specific needs of the
project.

-------
QUALITY ASSESSMENT
SAMPLES
A Remedial Project Manager (RPM) must
ask: how many samples are needed to
adequately characterize the soil at this site?
The key word is "adequately." By
determining the data quality objectives
(DQOs) in advance, the RPM can assure
adequate sampling at a site. Too little
sampling, as well as too much, is a waste of
time and money. The extent of QA/QC effort
is dependent on the risk to human health, the
nearness of action levels to detection limits,
and the size, variability, and distribution of
contamination.  Ultimately, the number of
quality assessment samples is determined by
the DQOs for the site. The following table
explains various types of quality assessment
samples and their uses.
                             QUALITY ASSESSMENT SAMPLES

                             •  ALLOW STATEMENTS TO BE MADE CONCERNING THE QUALITY OF THE MEASUREMENT
                                SYSTEM

                             •  ALLOW FOR CONTROL OF DATA QUALITY TO MEET ORIGINAL DQOs

                             •  SHOULD BE DOUBLE-BLIND:

                                         Types of Samples  Description

                                      Field Evaluation (FES)  Samples of known concentration are introduced in the field as ear'y
                                                           as possible to check for measurement bias and to estimate preciso •

                                            Low Level Field  Low concentration FES samples check for contamination in
                                         Evaluation (LLFES)  sampling, transport, analysis, detection limit

                                         External Laboratory  Similar to FES but without exposure in the field, ELES can measure
                                          Evaluation (ELES)  laboratory bias and, if used in duplicate, precision

                                         Low Level External  Similar to LLFES but without field exposure, LLELES can
                                       Laboratory (LLELES)  determine the method detection limit, and presence of laboratory
                                                           contamination.

                                    Field Matrix Spikes (FMS)  Routine samples spiked with the analytes of interest in the field
                                                           check recovery and reproducibility over batches.

                                       Field Duplicates (FD)  Second samples taken near routine samples check for variability at
                                                           all steps except batch.

                                      Preparation Splits (PS)  Subsample splits are made after homogenization and are used to
                                                           estimate error occurring in the subsampling and analytical steps of
                                                           the process.


                             •  SHOULD BE SINGLE-BLIND:

                                   Field Rinsate Blanks (FRB)  Samples obtained by rinsing the decontaminated sampling
                                                           equipment with deionized water to check for contamination

                                   Preparation Rinsate (PRB)  Samples obtained by rinsing the Blanks sample preparation
                                                           apparatus with deionized water to check for contamination

                                           Trip Blanks (TB)  Used for Volatile Organic Compounds (VOCs), containers filled with
                                                           American Society for Testing and Materials Type II water are kept
                                                           with routine samples through the sampling, shipment, and analysis
                                                           phases
                                MAY BE NON-BLIND: AS IN THE INORGANIC CLP PROTOCOL

-------
SOME STATISTICAL
CONCERNS
SAMPLE
COLLECTION
CONSIDERATIONS
ASSESS
 Confidence in quality assessment sample
 data can be expressed as an interval or as an
 upper limit. All confidence levels/limits are
 based on the number of degrees of freedom
 and the limits get lower (or the intervals get
 smaller) as the number of degrees of freedom
 gets higher. The number of degrees of
 freedom for any particular statistic is
 dependent upon the experimental design.
 For example, if 15 samples are taken at a
 site, and each is divided into 2.preparation
 splits, and each split is extracted twice at a
 CLP laboratory, and 2 injections of each
 extraction are made into an Inductively
 Coupled Plasma/Mass Spectrometer (ICP/
 MS), the total number of degrees of freedom
 associated with this experimental design
 would be calculated as:
   15 samples x 2 preparations splits = 30
   X 2 CLP extractions             =60
.   X 2 injection replicates          = 120

 120 degrees of freedom for the whole
                                                                      process.  But, if only the population variability
                                                                      in tee field samples (which includes the
                                                                      sampling error) is being estimated, the
                                                                      number of degrees of freedom is 15-1, or 14.
                                                                      There are 15 independent samples but one
                                                                      degree of freedom is lost with the estimation
                                                                      of the mean. Therefore, there are 14 degrees
                                                                      of freedom for the sampling variance
                                                                      estimate. As another example, to estimate
                                                                      the variability in the extraction step, one has
                                                                      30;independent pairs of numbers, each pair
                                                                      associated with one extraction. Thus, there
                                                                      are 30 degrees of freedom associated with
                                                                      the extraction error. In fact, this error
                                                                      represents any error associated with the
                                                                      injection step as well because, obviously, no
                                                                      data are available until injections are made
                                                                      and results are reported.

                                                                      Consulting the Rationale Document or a
                                                                      statistical manual can give an RPM a feel for
                                                                      how many quality assessment samples must
                                                                      be sprinkled among the routine site
                                                                      characterization samples.
 If Level IV CLP analysis is performed on the
 soil, we can assume that very little error is in
 the analytical stage. This focuses our
 attention on sources of error in the sampling,
 handling, and preparation steps.  The two
 major considerations in collection of
 environmental samples are:
                                                                      1. Will the collected data give the answers
                                                                        necessary for a correct assessment of the
                                                                        contamination or a solution to the problem?

                                                                      2. Can sufficient sampling be done well and
                                                                        within reasonable cost and time limits?
The EMSL-LV has developed an easy-to-use
program to apply the necessary statistics to
the generated data for an accurate
determination of precision and bias.
ASSESS is a public domain, Fortran program
that is available from EMSL-LV.  It may be
applied to cases where no field evaluation
                                                                      samples are available as well as cases
                                                                      where they are.  ASSESS is user-friendly and
                                                                      its use will greatly aid both field scientists and
                                                                      RPMs in .decision-making based on soil
                                                                     ' studies.
 cf
°      I  echnology    "^
 \
 \
        support

        reject
^
                             FOR FURTHER INFORMATION:

                             This fact sheet is based on "A Rationale for
                             the AssessjosoLof Efror&jn the Sampling of
                             Soils"b/0^JeffreyvariE^J.ouis J. Blume,
                             and ThortaTiTSTaTKsri99u~.	'	'
                             This document is available in hypertext for
                             IBM PCs to enhance readability and aid
                             users at all levels in knowledge of this
                             material.
                                           For additional copies, hypertext disks, or
                                           more information, call, or write, EMSL-LV:
                                          ^
                                           Technology Support Center
                                           U.S. Environmental Protection Agency
                                           P.O. Box93478
                                           Las Vegas, NV 89193-3478
                                           (702) 798-2270
                                           FTS  545-2270
                                           FAX  545-2637

-------