United States
Environmental Protection Agency
Office of Policy, Planning
and Evaluation
Washington, DC 20460
Statistical Policy Branch
ASA/EPA Conferences on
Interpretation of
Environmental Data
IV. Compliance Sampling
October 5-6, 1987

This volume is a compendium of the papers and commentaries that were presented at
the fourth in a series of conferences on interpretation of environmental data conducted by
the American Statistical Association and the U.S. Environmental Protection Agency's
Statistical Policy Branch of the Office of Standards and Regulations/Office of Policy,
Planning, and Evaluation. The ASA Committee on Statistics and the Environment
developed this series and has general responsibility for it.
The purpose of these conferences is to provide a forum in which professionals from
the academic, private, and public sectors exchange ideas on statistical problems that
confront EPA in its charge to protect the public and the environment through regulation of
toxic exposures. They provide a unique opportunity for Agency statisticians and scientists
to interact with their counterparts in the private sector.
The eight papers and accompanying discussions in this volume of proceedings are
about "compliance sampling" to determine how well environmental standards are met.
These papers provide valuable guidance in the planning of future environmental studies.
The papers address many aspects of compliance, and are intended for statisticians involved
in planning how to ascertain general levels of compliance and identify noncompliers for
special attention. Such work is inherently statistical and must be based on anticipation of
the statistical analysis to be performed so that the necessary data can be collected. These
proceedings should help the statistician anticipate the analyses to be performed. In
addition, the papers discuss implications for new studies. No general prescriptions are
offered; none may be possible.
The emphases in these papers are quite different. No two authors have chosen the
same aspect of compliance to examine. This diversity suggests that a major challenge is
to consider carefully each study aspect in the planning process. Meeting this challenge
will require a high degree of professionalism from the statistical community.
The conference itself and these proceedings are primarily the result of the efforts of
the authors and discussants. The discussants not only described how their views differ from
those of the authors, but provided independent ideas as well. The coordination of the
conference and of the publication of the proceedings was carried out by Mary Esther
Barnes and Lee L. Decker of the ASA staff.
The views presented in this conference are those of individual writers and should not
be construed as reflecting the official position of any agency or organization.
This fourth conference, "Compliance Sampling," was held in October 1987. Others
were the first conference, "Current Assessment of Combined Toxicant Effects," in May
1986; the second, "Statistical Issues in Combining Environmental Studies," in October
1986; and the third, "Sampling and Site Selection in Environmental Studies," in May 1987.
John C. Bailar III, Editor
Chair, ASA Committee on Statistics and the Environment
Department of Epidemiology and Biostatistics, McGill University
Office of Disease Prevention and Health Promotion
U.S. Department of Health and Human Services

The general theme of the papers and associated discussions is the design and
interpretation of environmental regulations that incorporate, from the outset, statistically
valid compliance verification procedures. Statistical aspects of associated compliance
monitoring programs are considered. Collectively the papers deal with a wide variety of
environmental concerns including various novel approaches to air emissions regulations and
monitoring, spatial sampling of soil, incorporation of potential health effects
considerations into the design of monitoring programs, and considerations in the statistical
evaluation of analytical laboratory performance.
Several papers consider aspects of determining appropriate sampling frequencies.
Allan Marcus discusses how response time frames of potential biological and health effects
due to exposures may be used to decide upon appropriate monitoring interval time frames.
He demonstrates how biokinetic modeling may be used in this regard.
Neil Frank and Tom Curran discuss factors influencing required sampling frequencies
to detect particulate levels in air. They emphasize the need to specify compliance
monitoring requirements right at the time that the air quality standard is being
formulated. They suggest an adaptive monitoring approach based on site specific
requirements. Those sites that are clearly well above or well below the standard need be
sampled relatively infrequently. Those sites that straddle the standard should be sampled
more frequently to decrease the probabilities of misclassification of
attainment/nonattainment status.
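The misclassification argument above can be made concrete with a back-of-the-envelope calculation. The sketch below is a minimal illustration, not Frank and Curran's procedure: it assumes independent samples and a roughly normal sample mean, and the numbers (a standard of 50, coefficient of variation 0.5) are mine.

```python
import math

def prob_misclassify(true_mean, standard, cv, n):
    """Probability that the mean of n samples falls on the wrong side of
    the standard, assuming independent samples with coefficient of
    variation cv and an approximately normal sample mean.  Illustrative
    only: real air quality data are autocorrelated and skewed."""
    se = cv * true_mean / math.sqrt(n)
    z = (standard - true_mean) / se
    p_below = 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))
    # Calling a truly-violating site "attainment" is the error when the
    # true mean is above the standard, and vice versa below it.
    return p_below if true_mean > standard else 1.0 - p_below

# A site straddling a standard of 50 is much harder to classify than a
# clearly clean site, whatever the number of samples per quarter:
for true_mean in (40.0, 49.0):
    print(true_mean, [round(prob_misclassify(true_mean, 50.0, 0.5, n), 3)
                      for n in (30, 61, 122)])
```

The clearly clean site is classified correctly almost regardless of sampling frequency, while the straddling site keeps a substantial error probability even at daily sampling, which is the rationale for concentrating effort there.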
Tom Hammerstrom and Ron Wyzga discuss strategies to accommodate situations
when Allan Marcus' recommendations for determining sampling frequency have not been
followed, namely when monitoring data averaging time intervals are very long relative to
exposure periods that may result in adverse physiological and health consequences. For
example, air monitoring data may be averaged over one hour intervals but respiratory
symptoms may be related to the highest five minutes of exposure during that hour. The
authors model the relationships between peak five minute average concentration during an
hour and the overall one hour average concentration under various stochastic process
assumptions. They combine monitoring and modeling to predict short term peak
concentrations on the basis of observed longer term average concentrations.
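A toy version of this peak-versus-average relationship can be simulated. The AR(1) model for log concentrations below is only an illustrative stand-in for the stochastic-process assumptions of the paper, and the parameter values are mine.

```python
import math
import random

random.seed(1)

def hour_of_5min_values(rho=0.8, mu=0.0, sigma=0.3):
    """Twelve 5-minute concentrations whose logs follow an AR(1)
    process; a stand-in for the stochastic models of the paper."""
    x = random.gauss(0.0, sigma / math.sqrt(1.0 - rho * rho))
    values = []
    for _ in range(12):
        x = rho * x + random.gauss(0.0, sigma)
        values.append(math.exp(mu + x))
    return values

# Distribution of the peak 5-minute value relative to the 1-hour mean:
ratios = sorted(max(v) / (sum(v) / 12.0)
                for v in (hour_of_5min_values() for _ in range(2000)))
print("median peak/mean ratio:", round(ratios[1000], 2))
```

Under such a model the hourly average systematically understates the short-term peak, and the size of the gap depends on the within-hour autocorrelation, which is exactly what a combined monitoring-plus-modeling approach must estimate.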
Bill Nelson discusses statistical aspects of personal monitoring and monitoring
"microenvironments" such as homes and workplaces to assess total personal exposure.
Such data are very useful for the exposure assessment portions of risk assessment. Dr.
Nelson compares and contrasts personal monitoring with the more traditional area
monitoring. The availability of good personal exposure data would permit much greater
use of human epidemiologic data in place of animal toxicologic data in risk assessment.
Richard Gilbert, M. Miller, and H. Meyer discuss statistical aspects of sampling
"frequency" determination in the spatial sense. They consider the development of a soil
sampling program to estimate levels of radioactive solid contamination. They discuss the
use of multilevel acceptance sampling plans to determine the compliance status of
individual soil plots. These plans have sufficient sensitivity to distinguish between
compliant/noncompliant plots yet result in substantial sample size economies relative to
more naive single stage plans.
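The sample-size economy of multistage plans is easy to see in a stripped-down example. The double (two-stage) plan below is hypothetical; its constants are mine, not those of the Gilbert, Miller, and Meyer plan, and p denotes the probability that a single soil sample exceeds the limit.

```python
from math import comb

def binom_pmf(k, n, p):
    return comb(n, k) * p ** k * (1.0 - p) ** (n - k)

def expected_samples(p, n1=5, n2=10):
    """Expected soil samples per plot under an illustrative double
    acceptance sampling plan: accept after stage 1 on zero exceedances,
    reject on two or more, and draw a second stage of n2 samples only
    on exactly one.  Plan constants are mine, not those of the Ra-226
    plan described in the paper."""
    return n1 + binom_pmf(1, n1, p) * n2

# Most plots are settled by the small first stage, so the expected cost
# stays well below the 15 samples a single-stage plan would always take:
for p in (0.02, 0.10, 0.30):
    print(p, round(expected_samples(p), 2))
```

Clearly compliant and clearly noncompliant plots are decided cheaply at the first stage; only ambiguous plots pay for the second stage, which is the source of the economies the authors report.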

John Holley and Barry Nussbaum present an economist's approach to environmental
regulation. The "bubble" concept specifies that average environmental standards must be
maintained across a dimension such as area, time, auto fleet, or industry group. This
dimension constitutes the "bubble." Lack of compliance in one part of the bubble may be
offset by greater than minimum compliance in other parts. Emissions producers have the
option to trade, sell or purchase emissions "credits" with, from, or to other emissions
producers in the bubble. Alternatively, they may "bank" emissions "credits" for use in a
future time period. Such an approach to regulation greatly enhances the emissions
producers' flexibility, as a group, to configure their resources so as to most economically
comply with the overall standard.
Soren Bisgaard and William Hunter discuss statistical aspects of the formulation of
environmental regulations. They emphasize that the regulations, including their
associated compliance monitoring requirements, should be designed to have satisfactory
statistical characteristics. One approach to this is to design regulations that have
operating characteristic curves of desired shape. Alternative candidate formulations can
be compared in terms of the shapes of their associated operating characteristic curves.
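An operating characteristic curve of the sort Bisgaard and Hunter advocate can be sketched directly. The example below assumes a simple rule (compare the mean of n independent, roughly normal measurements with a limit); all numbers are illustrative, not drawn from their paper.

```python
import math

def oc(true_mean, limit, sigma, n):
    """Operating characteristic of a rule that declares compliance when
    the mean of n measurements is at or below the limit; measurements
    assumed independent and roughly normal with standard deviation
    sigma (all values illustrative)."""
    z = (limit - true_mean) / (sigma / math.sqrt(n))
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

# Sample size controls the steepness of the OC curve near the limit --
# the main design knob for giving a regulation the desired statistical
# behavior:
for n in (4, 16, 64):
    print(n, [round(oc(m, 10.0, 4.0, n), 3) for m in (8.0, 10.0, 12.0)])
```

Plotting such curves for alternative formulations makes the trade-off between consumer's and producer's risk explicit before a regulation is promulgated.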
Bert Price discusses yet another statistical aspect of environmental regulation:
evaluating the capabilities of analytical laboratories. He contrasts and compares
strategies to evaluate individual laboratories based only on their own bias and variability
characteristics (intralaboratory testing) with strategies that evaluate laboratories as a
group (interlaboratory testing). Price's paper has commonality with that of Bisgaard and
Hunter in that he argues that first the operating characteristic of a regulation needs to be
specified. This specification is then used to determine the types and numbers of
observations required in the associated compliance tests.
The eight papers in this volume of proceedings deal with diverse aspects of the
statistical design and interpretation of environmental regulations and associated
compliance monitoring programs. A unifying theme among them is that the statistical
objectives and characteristics of the regulations should be specified right at the planning
stage and should be drivers of the specific regulation designs rather than being
(in)consequential afterthoughts.
Paul I. Feder
Chair, ASA/EPA Conference on Compliance Sampling
Battelle Memorial Institute

Preface. JOHN C. BAILAR III, McGill University	ii
Introduction. PAUL I. FEDER, Battelle Memorial Institute	iii
Index of Authors	vi
Time Scales: Biological, Environmental, Regulatory. ALLAN H. MARCUS,
Battelle Columbus Division	1
Discussion. RICHARD C. HERTZBERG, U.S. Environmental Protection
Agency, ECAO-Cincinnati	16
Statistical Issues in Human Exposure Monitoring. WILLIAM C. NELSON,
U.S. Environmental Protection Agency, EMSL-Research Triangle Park	17
Discussion. WILLIAM F. HUNT, JR., U. S. Environmental Protection
Agency, OAQPS-Research Triangle Park	39
Designing Environmental Regulations. SOREN BISGAARD, WILLIAM G. HUNTER,
University of Wisconsin-Madison	41
Discussion. W. BARNES JOHNSON, U.S. Environmental Protection Agency,
OPPE-Washington, D.C.	51
Quality Control Issues in Testing Compliance with a Regulatory Standard:
Controlling Statistical Decision Error Rates. BERTRAM PRICE, Price
Associates, Inc.	54
Discussion. GEORGE T. FLATMAN, U.S. Environmental Protection Agency,
EMSL-Las Vegas	75
On the Design of a Sampling Plan to Verify Compliance with EPA Standards
for Radium-226 in Soil at Uranium Mill Tailings Remedial-Action Sites.
RICHARD O. GILBERT, Battelle Pacific Northwest Laboratory; MARK L.
MILLER, Roy F. Weston, Inc.; H. R. MEYER, Chem-Nuclear Systems, Inc.	77
Discussion. JEAN CHESSON, Price Associates, Inc.	111
Distributed Compliance: EPA and the Lead Bubble. JOHN W. HOLLEY, BARRY
D. NUSSBAUM, U.S. Environmental Protection Agency, OMS-Washington, D.C.	112
Discussion. N. PHILIP ROSS, U.S. Environmental Protection Agency,
OPPE-Washington, D.C.	121

Variable Sampling Schedules to Determine PM10 Status. NEIL H. FRANK,
THOMAS C. CURRAN, U. S. Environmental Protection Agency, OAQPS-
Research Triangle Park	122
Discussion. JOHN WARREN, U. S. Environmental Protection Agency, OPPE-
Washington, D.C.	128
Analysis of the Relationship Between Maximum and Average in SO2 Time
Series. THOMAS S. HAMMERSTROM, RONALD E. WYZGA,
Electric Power Research Institute	129
Discussion. R. CLIFTON BAILEY, Health Care Financing Administration	154
Summary of Conference. JOHN C. BAILAR III, McGill University and
U.S. Public Health Service	155
Appendix A: Program	160
Appendix B: Conference Participants	162
Bailar, John C.	ii, 155
Bailey, R. Clifton	154
Bisgaard, Soren	41
Chesson, Jean	111
Curran, Thomas C.	122
Feder, Paul I.	iii
Flatman, George T.	75
Frank, Neil H.	122
Gilbert, Richard O.	77
Hammerstrom, Thomas S.	129
Hertzberg, Richard C.	16
Holley, John W.	112
Hunt, Jr., William F.	39
Hunter, William G.	41
Johnson, W. Barnes	51
Marcus, Allan H.	1
Meyer, H. R.	77
Miller, Mark L.	77
Nelson, William C.	17
Nussbaum, B. D.	112
Price, Bertram	54
Ross, N. Philip	121
Warren, John	128
Wyzga, Ronald E.	129

Allan H. Marcus
Battelle Columbus Division
P.O. Box 13758
Research Triangle Park, NC 27709
E.P.A. has established primary air quality standards to protect the
general public against the adverse health effects of air pollutants, and
secondary standards to protect against other adverse environmental
impacts. Compliance with these standards is usually prescribed by an
explicit sampling protocol for the pollutant, with specified properties
of the instrumentation and its calibration, appropriate location of the
sampling device, and the frequency and averaging time of the samples.
The temporal properties of the compliance sampling protocol represent a
compromise among time scales of biological response to an environmental
insult, variation in the concentration to which the population is exposed,
and cost and precision of the sample data. Biological and health effects
issues are primary and should always be kept in mind. Inadequate sampling
schedules for compliance testing might allow fluctuating exposures of
toxicological significance to escape detection. Resources for testing
compliance are usually going to be scarce, and focusing on health effects
may allow the analyst and designer of environmental regulations to find
some path between oversampling and undersampling environmental data.
In this review I will emphasize air quality standards for lead.
Lead is a soft, dense metal whose toxic effects have long been known. In
modern times atmospheric lead has become a community problem because of
the large quantities of lead used as gasoline additives. While this
problem was substantially reduced as a result of E.P.A.'s leaded gasoline
phasedown regulations, there are still significant quantities of
atmospheric lead around primary and secondary metal smelters, battery
plants, etc., and substantial residues of previous lead emissions in
surface soil and dust. Other regulatory authorities control lead
concentrations in drinking water, in consumer products, and in the work
place. E.P.A.'s air lead regulations are spelled out in C.F.R. 40:
58 (1982). I will describe these in more detail below, along with some
alternative approaches that are being considered.
I will also very briefly describe some of the biological and
physical time scale problems arising in the effects of ozone on loss of
agricultural crop yields. This will allow us to look at a gaseous
pollutant whose effects include economic welfare as well as human health.
Atmospheric lead is largely found as inorganic lead salts on small
particles; thus many of the data collection issues are similar to those
encountered in sampling Total Suspended Particulates (TSP). A great deal

of data has been collected by the State and Local Air Monitoring Stations
(SLAMS) network. These provide information about areas where the lead
concentration and population density are highest and monitoring for
testing compliance with standards is most critical. In order for a SLAMS
station to be part of the National Air Monitoring Station (NAMS) network,
very specific criteria must be satisfied about sampler location in terms
of height above ground level, distance from the nearest major roadway,
and the spatial scale of which the station is supposed to be representative.
The siting study must also have a sufficiently long sampling period to
exhibit typical wind speeds and directions, or a sufficiently large
number of short periods to provide an average value consistent with a
24-hour exposure (CD, 1986).
The current averaging time for the lead primary National Ambient Air
Quality Standard (NAAQS) is a calendar quarter (3 months), and the air
lead NAAQS is a quarterly average of 1.5 ug/m3 that shall not be
exceeded. The lead standard proposed in 1977 was based on an averaging
time of one calendar month. The longer period has the advantage of
greater statistical stability. However, the shorter period allows some
extra protection. Clinical studies with adult male volunteer subjects
showed that blood lead concentration (PbB) changed to a new equilibrium
level after 2 or 3 months of exposure (Rabinowitz et al., 1976), and the
adjustment is likely faster in children.
"The risk of shorter term exposures to air lead concentrations elevated
above a quarterly-averaged standard that might go undetected were
considered in the 1978 standard decision to be minimized because 1) based
on the ambient air quality data available at that time, the possibilities
for significant, sustained excursions were considered small, and 2) it
was determined that direct inhalation of air lead is a relatively small
component of total airborne lead exposure" (43 FR 46246) (Cohen, 1986).
The biological reasons for reevaluating the averaging time are discussed
in the next section.
Alternative forms of the air lead standard are now being evaluated
by E.P.A.'s Office of Air Quality Planning and Standards (OAQPS). The
averaging time is only one of the components in setting an air lead
standard. The "characterizing value" for testing compliance can assume a
wide variety of forms, e.g., the maximum monthly (or quarterly) average as
used in the "deterministic" form of the standards, the maximum of the
average monthly mean over a specified number of years (e.g., 3 consecutive
years), the average of the maximum monthly averages for each year within a
specified number of years, the average of the three highest months (or
quarters) within a specified number of years, etc. Some averaging of the
extreme values certainly smooths out the data, but also conceals extreme
high-level excursions. Some attention has been given to the statistical
properties of the alternative characterizing values (Hunt, 1986). The
consequences of different characterizing values for biological exposure
indices or health effects indicators have not yet been evaluated.
A final consideration is the sampling frequency. The current normal
situation is a 24-hour average collected every 6th day. The number of
samples collected also depends on the fraction of lost days; it is not
uncommon for 25% of the data to be lost. Thus one might have only 3 or 4
valid samples per month. Hunt (1986) examined more frequent sampling
schemes: every day, every other day, every third day. He also compared
the consequences of deterministic vs. "statistical" forms of the standard,
monthly vs. quarterly characterizing values, and 25% data loss vs. no loss.
The community air lead problem in the U.S. is now more likely to be
related to point sources than to area-wide emissions, thus the following
three scenarios for location were evaluated: (1) source-oriented sites
with maximum annual quarterly averages less than 1.5 ug/m3; (2) source-
oriented sites with maximum annual quarterly average greater than 1.5
ug/m3; (3) NAMS urban maximum concentration sites. Some conclusions
suggested by his study for quarterly averaging time are:
(1) The characterizing value with the best precision for a specified
sampling frequency is the statistical quarterly average.
(2) The required sampling frequency varies by site category: ...
1.5 ug/m3; (3) for NAMS sites,
every third day. The required precision here is +/- 10% of the mean.
Hunt also found that more frequent sampling would be required if the
monthly averaging times were used. The source-oriented sites would
require every-day sampling and the NAMS sites every-other-day sampling
to achieve +/- 10% precision.
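The link between a +/- 10% precision requirement and sampling frequency follows from textbook standard-error arithmetic. The sketch below assumes independent samples and a roughly normal sample mean, which real 24-hour values violate (they are autocorrelated), so it understates the effort that Hunt's empirical study quantifies.

```python
import math

def days_needed(cv, rel_half_width=0.10, z=1.96):
    """Valid sampling days required for a 95% confidence interval on the
    mean to have half-width rel_half_width of the mean, assuming
    independent samples; autocorrelation in real 24-hour values makes
    the true requirement larger."""
    return math.ceil((z * cv / rel_half_width) ** 2)

# Site variability drives the answer: a quarter of every-6th-day
# sampling with 25% losses yields only about 11 valid days.
for cv in (0.17, 0.5, 1.0):
    print(cv, days_needed(cv))
```

At the coefficients of variation in excess of 100% seen around point sources, even daily sampling for a quarter cannot deliver +/- 10% precision under these assumptions, which motivates the shift of focus to health-based targets discussed next.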
Is such intensive sampling actually required? Are we really
interested in specified precision for atmospheric concentrations, or
should we shift the focus of compliance sampling to more relevant
indicators of biological effect? Let us examine some of these issues.
Lead is absorbed from the environment through the lungs (direct
inhalation) and through the gastro-intestinal tract (ingestion). Organic
compounds of lead may also be absorbed through the skin. Once lead is
absorbed into blood plasma through the alveoli or through the gut lumen,
it is quickly ionized and may henceforth be regarded as indistinguishable by
source. Thus the internal kinetics of lead may be deduced from
experimental data whether lead uptake is by intravenous injection,
inhalation, or ingestion. Lead is distributed from plasma to the red
blood cells, kidney, liver, skeleton, brain, and other tissues. The
fractional absorption of lead from the plasma varies greatly from tissue
to tissue, thus the time scales for transfer of lead also vary greatly.
It is often assumed that lead equilibrates quickly and completely between
plasma and red blood cells, thus the whole blood lead concentration can
be used as a surrogate indicator of internal exposure. This is not the
case.
The initial uptake of lead from plasma to the red blood cells is
very rapid, occurring within a few minutes to tens of minutes (Campbell
et al., 1984; Chamberlain, 1984; De Silva, 1981). Complete equilibration
does not occur at all concentrations, however, since the relationship
between whole blood lead and plasma lead becomes strikingly nonlinear at
higher concentrations (Manton and Cook, 1985; Marcus, 1985a). The most
plausible explanation is that there is reduced transfer of lead to the
red blood cells at higher concentrations, whether attributed to reduced
lead-binding capacity of the erythrocytes or reduced transfer rate across
the erythrocyte membrane as lead concentrations increase. This is
reinforced by multi-dose experiments on rats in which lead concentrations
in brain, kidney, and femur are proportional to dose, which is expected
if tissue concentrations equilibrate with plasma concentrations, not with
whole blood lead concentrations.
Lead concentrations in peripheral tissues can be modeled by coupled
systems of ordinary differential equations. Parameters for such systems
can be estimated by iterative nonlinear least squares methods, often with
Marquardt-type modifications to enlarge the domain of initial parameter
estimates which allow convergence to the optimal solution (Berman and
Weiss, 1978). Data sets with observations of two or more components
often sustain indirect inferences about unobserved tissue pools.
Analyses of data in (Rabinowitz et al., 1973, 1976; Griffin et al., 1975;
De Silva, 1981) reported in (Marcus, 1985abc; Chamberlain, 1985; CD,
1986) show that lead is absorbed into peripheral tissues in adult humans
within a few days. The retention of lead by tissues is much longer than
is the initial uptake. Even soft tissues such as kidney and liver appear
to retain lead for a month or so, and the skeleton retains lead for years
or tens of years (Christoffersson et al., 1986).
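The compartmental picture of this section can be illustrated with a toy simulation: blood lead approaches a new quasi-equilibrium quickly, while a skeletal pool accumulates and releases much more slowly. All rate constants below are round illustrative numbers of my choosing, not fitted values from the studies cited.

```python
def simulate_blood_lead(days, uptake, k_elim=1 / 30, k_bone=0.005,
                        k_ret=0.0002):
    """Euler integration of a toy two-compartment (blood, bone) model
    with per-day rate constants (illustrative placeholders only):
        d(blood)/dt = uptake(t) - (k_elim + k_bone)*blood + k_ret*bone
        d(bone)/dt  = k_bone*blood - k_ret*bone"""
    blood, bone, dt = 0.0, 0.0, 0.1
    daily = []
    for step in range(int(round(days / dt))):
        if step % 10 == 0:          # record once per simulated day
            daily.append(blood)
        u = uptake(step * dt)
        d_blood = u - (k_elim + k_bone) * blood + k_ret * bone
        d_bone = k_bone * blood - k_ret * bone
        blood += d_blood * dt
        bone += d_bone * dt
    return daily

# Double the uptake on day 90: blood lead settles near its new level
# within 2-3 months, while the bone pool keeps accumulating far longer.
daily = simulate_blood_lead(360, lambda t: 1.0 if t < 90 else 2.0)
print(round(daily[89], 1), round(daily[180], 1), round(daily[359], 1))
```

The asymmetry noted later in the text (rapid uptake, slow washout via the skeletal reservoir) appears naturally in even this crude model.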
The relevance of blood lead and tissue lead concentrations to overt
toxicity is not unambiguous. As in any biologically variable population,
some individuals can exhibit extremely high blood lead with only mild
lead poisoning (Chamberlain and Massey, 1972). A more direct precursor
of toxicity is the erythrocyte protoporphyrin (EP) concentration.
Elevated levels of EP show that lead has deranged the heme biosynthetic
pathway, reducing the rate of production of heme for hemoglobin. EP is
now widely used as a screening indicator for potential toxicity. An
example of the utility of EP is that after a brief massive exposure of a
British worker (Williams, 1984), zinc EP increased to very elevated
levels within a week of exposure even though the worker was still largely
asymptomatic. Even though there is considerable biological variability,
EP levels in adults increase significantly within 10 to 20 days after
beginning an experimental increase of ingested lead (Stuik, 1974; Cools
et al., 1976; Schlegel and Kufner, 1978). Thus biological effects in
adult humans occur very shortly after exposure, certainly within a month.
While the uptake of lead and the onset of potential toxicity occur
rapidly during increased exposure, the reduction of exposure does not
cause an equally rapid reduction in either body burden or toxicity
indices. Accumulation of mobilizable pools of lead in the skeleton and
other tissues creates an endogenous source of lead that is only slowly
eliminated. Thus the rapid uptake of lead during periods of increased
exposure should be emphasized in setting standards for lead.
The experimental data cited above are indeed human data, but all for
adults (almost all for males). We are not aware of any direct studies of
lead kinetics in children. One of the more useful sets of data involves
the uptake of lead by infants from formula and milk (Ryu et al., 1984,
1985). Blood lead levels and lead content of food were measured at 28-
day intervals. The results are negative but informative: blood lead
levels in these infants appeared to equilibrate so much faster that no
estimate of the kinetic parameters was possible. A very rough estimate
by Duggan (1984), based on earlier input-output studies in infants,
suggests a blood lead mean life
of 4 to 6 days. Duggan's method has many assumptions and uncertainties.
An alternative method, allometric scaling based on surface area, suggests
that if a 70 kg adult male has a blood lead mean life of 30 days, then a
7 kg infant should have a blood lead mean life of about 3 days.
The above estimates of lead kinetics in children are not strictly
acceptable. Children are kinetically somewhat different from adults,
with a somewhat larger volume of blood and a much smaller but rapidly
developing skeleton (especially dense cortical bone, which retains most of
the adult body burden of lead). Children also absorb lead from the
environment at a greater rate, as they have greater gastrointestinal
absorption of ingested lead and a more rapid ventilation rate than do
adults. A biomathematical model has been developed by Harley and Kneip
(1984) and modified for use by OAQPS. This uptake/biokinetic model is
based on lead concentrations in infant and juvenile baboons, which are
believed to constitute a valid animal model for human growth and
development. Preliminary applications of the model are described in
(Cohen, 1986; ATSDR, 1987; Marcus et al., 1987). The model includes
annual changes of kinetic parameters such as the transfer rates for
blood-to-bone, blood-to-liver, and liver-to-gastrointestinal tract, and
growth of blood, tissue, and skeleton. The model predicts a mean
residence time for lead in the blood of 2-year-old children of about 9 days.
Blood lead concentrations change substantially during childhood
(Rabinowitz et al., 1984). These changes reflect the washout of in utero
lead, the exposure of the child to changing patterns of food and water
consumption, and the exposure of the toddler to leaded soil and dust in
his or her environment. We must thus also consider the temporal
variations of exposure to environmental lead.
Air lead concentrations change very rapidly, depending on wind speed
and direction and on emissions patterns. Biological kinetics tend to
filter out the "high-frequency" variations in environmental lead, so that
only environmental variations on the order of a few days are likely to
play much of a role. The temporal patterns depend on averaging time and
sampling frequency, and thus will vary from one location to another
depending on the major lead sources at that site. Figure 1 shows the
time series for the logarithm of air lead concentration (log PbA) near a
primary lead smelter in the northwestern U.S. The data are 24-hour
concentrations sampled every third day (with a few minor slippages). We
analyzed these data using Box-Jenkins time series programs. The temporal
structure is fairly complex, with a significant autoregressive component
at lag 9 (27 days) and significant moving average components at lags 1
and 3 (3 days and 9 days). Time series analyses around point source
sites and general urban sites may thus be informative.

Direct inhalation of atmospheric lead may be only a minor part of
lead exposure attributable to air lead. Previously elevated air lead
levels may have deposited a substantial reservoir of lead in surface soil
and house dust in the environment; these are the primary pathways for air
lead in children aged 1-5 years. Little is known about temporal
variations in soil and house dust lead. Preliminary results cited in
(Laxen et al., 1987) suggest that lead levels in surface dust and soil
around redecorated houses and schools can change over periods of time of
two to six months. While lead levels in undisturbed soils can persist
for thousands of years, the turnover of lead in urban soils due to human
activities is undoubtedly much faster.
Individuals are not stationary in their environment. Thus, the lead
concentrations to which individuals are exposed must include both spatial
and temporal patterns of exposure. The picture is complex, but much is
being learned from personal exposure monitoring programs.
The amount of variation in air lead concentrations at a stationary
monitor can be extremely large. Coefficients of variation in excess of
100% are not uncommon around point sources such as lead smelters, even
when monthly or quarterly averages are used. This variability is far in
excess of that attributable to meteorological variation and is due to
fluctuations in the emissions process, e.g., due to variations in feed
stock, process control, or production rate. Furthermore, the
concentration distributions are very skewed and heavy-tailed, more nearly
log-normally distributed than normal even for long averaging times. The
stochastic properties of the process are generally unknown, although it
may be assumed that air, dust, and soil lead concentrations around point
sources that have been in operation for a long time are approximately
stationary. In most places in the United States, lead levels in all
sources of exposure, including food, water, and paint, as well as those
pathways from gasoline lead, have been declining. With these points in
mind, we can begin to construct a quantitative characterization of a
health effects target for compliance studies.
We will here briefly describe a possible approach to the problem of
choosing an averaging time that is meaningful for health effects.
Related problems such as sampling frequency then depend on the precision
with which one wishes to estimate the health effects characterization.
The basic fact is that all of the effects of interest are driven by the
environmental concentration-exposure C(t) at time t integrated over some
period of time, with an appropriate weighting factor. As people are
exposed to diverse pollutant sources, the uptake from all pathways must
be added up. If the health effect is an instantaneous one whose value at
time t is denoted X(t), and if the biokinetic processes are all linear
(as is assumed for the OAQPS uptake biokinetic model) or can be reasonably
approximated by a linear model driven by C(u) at time u, then the
biokinetic model can be represented by an aftereffect weight f(t-u) after
an interval t-u. Mathematically,

X(t) = ∫ f(t-u) C(u) du

The aftereffect function for linear compartmental models is a mixture of
exponential terms.
The time-averaged concentration-exposure at time t, denoted Y(t), is
also a moving average of the concentration C(u) at time u, with a weight
given by g(t-u) after an interval t-u. Thus compliance will be based on
the values of the variable Y(t) in adjacent intervals, where

Y(t) = ∫ g(t-u) C(u) du

The simple time-weighted average for an averaging time of length T is

g(t-u) = 1/T   if t-T < u <= t
       = 0     otherwise.
The properties of the moving average processes are easily evaluated; e.g.,
the expected value E[.], variance var[.], and covariance cov[.,.] are:

E[X(t)] = ∫ f(t-u) E[C(u)] du

var[X(t)] = ∫∫ f(t-u) f(t-v) cov[C(u),C(v)] du dv

cov[X(t),Y(s)] = ∫∫ f(t-u) g(s-v) cov[C(u),C(v)] du dv
Thus, we could formalize the problem of selecting an averaging time T by
the following mathematical problem: choose the averaging time T that
maximizes the correlation between X(t) and Y(s), for that time t at which
E[X(t)] is maximum. That is, look for the time(s) t at which we expect
the largest adverse health effect or effect indicator (e.g., blood lead).
Then find the averaging time T such that the moving average at some other time
s is as highly correlated as possible with X(t). Note that we do not
require that s = t. We may also restrict the range of values of T.
Suppose that the relevant biokinetic model is a simple one-
compartment model. The aftereffect of a unit pollutant uptake is an
exponential washout (e.g. of blood lead, to a first approximation) with
time constant k,
f(t-u) = exp(-k(t-u))    if u ≤ t
       = 0               if u > t
We will also assume that the concentration-exposure process C(t) is
stochastically second-order stationary with covariance function
cov[C(u),C(v)] = var[C] exp(-a|u-v|)
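An exposure process with this exponential covariance can be simulated as a discrete AR(1) series, which is convenient for the computer simulations mentioned later; a sketch, with all parameter values assumed for illustration:

```python
import numpy as np

# Simulate C(t) with cov[C(u),C(v)] = var_C * exp(-a|u-v|) on a daily
# grid as an AR(1) series with lag-one correlation phi = exp(-a*dt).
# All parameter values here are illustrative assumptions.
rng = np.random.default_rng(0)
a, var_C, dt, n = 0.25, 1.0, 1.0, 100_000
phi = np.exp(-a * dt)
eps = rng.normal(0.0, np.sqrt(var_C * (1.0 - phi**2)), n)
C = np.empty(n)
C[0] = rng.normal(0.0, np.sqrt(var_C))
for i in range(1, n):
    C[i] = phi * C[i - 1] + eps[i]
r5 = np.corrcoef(C[:-5], C[5:])[0, 1]        # lag-5 sample autocorrelation
print(abs(r5 - np.exp(-5 * a * dt)) < 0.03)  # near exp(-5*a*dt) ≈ 0.2865
```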

After some algebra, one finds that:
var[X(t)] = var[C] / [k(a+k)]
var[Y(t)] = var[C] 2[aT - 1 + exp(-aT)] / (aT)**2
If s-T ≤ t ≤ s then
cov[X(t),Y(s)] = var[C] [2/(ak) - exp(-a(t+T-s))/a(k-a) - exp(-a(s-t))/a(a+k)
                 + 2a exp(-k(t+T-s))/k(k-a)(a+k)] / T
If t < s-T then
cov[X(t),Y(s)] = var[C] [exp(-a(s-t-T)) - exp(-a(s-t))] / [aT(a+k)]
If t > s (for predicting from the current sampling time s to future time t)
then
cov[X(t),Y(s)] = var[C] [exp(-a(t-s))(1-exp(-aT))/a(k-a)
                 - 2a exp(-k(t-s))(1-exp(-kT))/k(a+k)(k-a)] / T
A small table of correlations between X(t) and Y(t) is shown in Table 1
for an assumed averaging time T = 30 days. Note that the correlations
between fluctuations in blood lead concentration X(t) and monthly
averaged lead concentration Y(t) are fairly high, but much worse for
children than for adults when environmental concentrations fluctuate
rapidly. These correlations are long-term averages for one subject; the
correlation in real populations will be greatly attenuated due to
differences in biological parameters and exposures among people.
Table 1. Correlation between Blood Lead and Monthly Average Lead

                           Blood Lead Kinetic Parameter k
                           1/(8 d) Child     1/(40 d) Adult
Environmental Lead
Time Constant a
  1/(4 d)                  0.7707            0.8783
  1/(10 d)                 0.8476            0.8933
  1/(25 d)                 0.9236            0.9134
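The Table 1 entries can be reproduced from the closed-form moments of this one-compartment example. The sketch below (the function name and code organization are ours; it requires k ≠ a) evaluates corr[X(t), Y(t)]:

```python
import math

def corr_xy(k, a, T):
    """Correlation of the instantaneous burden X(t) with the time-weighted
    average exposure Y(t) over [t-T, t], for the one-compartment model
    f(tau) = exp(-k*tau) and exposure covariance var[C]*exp(-a*|u-v|).
    Uses the closed-form moments for this model (requires k != a)."""
    var_x = 1.0 / (k * (a + k))                               # var[X]/var[C]
    var_y = 2.0 * (a * T - 1.0 + math.exp(-a * T)) / (a * T) ** 2
    cov = ((1.0 - math.exp(-k * T)) * (-2.0 * a) / (k * (k + a) * (k - a))
           + (1.0 - math.exp(-a * T)) / (a * (k - a))) / T    # cov at s = t
    return cov / math.sqrt(var_x * var_y)

# Table 1 entries for rapidly fluctuating environmental lead (a = 1/4 per
# day), T = 30 days: child (k = 1/8 per day) and adult (k = 1/40 per day).
print(round(corr_xy(1/8, 1/4, 30), 4))    # 0.7707
print(round(corr_xy(1/40, 1/4, 30), 4))   # 0.8783
```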
The uses of this method for assessing the relationship between
health effects and averaging time are shown in Table 2 for the sensitive
case of rapid fluctuations in air lead concentration. It is clear that
for this simple model, the averaging time T with highest correlation for
children or for adults is about 1.5/k, and that much longer or much
shorter averaging times will not capture significant excursions in blood
lead. An averaging time of 15-20 days will make Y(t) reasonably
predictive of X(t) for both adults and children.
Table 2. Assumed environmental lead correlation scale a = 1/(4 days)

Averaging Time T, Days    k = 1/(8 days)    k = 1/(40 days)
   5                      0.8792            0.5062
   7                      0.9287            0.5674
  10                      0.9560            0.6430
  14                      0.9497            0.7207
  20                      0.8900            0.8020
  30                      0.7707            0.8783
  60                      0.5451            0.9141
  90                      0.4402            0.8579
Samples collected for compliance testing have a more complicated
structure for the weight function g(t-u), namely (for h-hour samples once
every m days in an interval of T days),
g(t-u) = m/(hT)    if t0 + (j-1)H < u ≤ t0 + (j-1)H + h
where H = 24m hours
      t0 = beginning of last compliance interval before t
      j = 1, ..., T/m
and g(t-u) = 0 otherwise.
That is, g(t-u) is the sum of T/m rectangles spaced H hours apart.
Similar calculations could be done using this g(t).
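As a quick check of the normalization, the following sketch builds this g on an hourly grid (the values h = 24 hours, m = 6 days, T = 30 days are arbitrary illustrations) and verifies that the weights sum to one:

```python
import numpy as np

# Sketch of the compliance-sampling weight function g: T/m rectangles of
# height m/(h*T), each h hours wide, spaced H = 24*m hours apart. The
# values h = 24, m = 6, T = 30 are arbitrary illustrations.
h, m, T = 24, 6, 30          # sample length (hr), spacing (d), interval (d)
H = 24 * m                   # hours between sample starts
hours = np.arange(24 * T)    # hourly grid over the compliance interval
g = np.zeros(24 * T)
for j in range(T // m):      # the T/m rectangles
    start = j * H
    g[(hours >= start) & (hours < start + h)] = m / (h * T)
print(round(g.sum(), 6))     # discrete weights sum to 1.0
```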
Assessment of realistic situations will require careful attention to
both the biokinetic model represented by f(t) and the temporal
variations in exposure represented by cov[C(u),C(v)], etc. The example
presented above is the simplest representation of the interplay of
biological time scales (represented by k), environmental time scales
(represented by a), and regulatory time scales (represented by T).
Numerical evaluation of realistic examples should proceed as above. If
the underlying biokinetic model is severely nonlinear, then computer
simulations will be needed. The concentration-exposure function here
subsumes all spatial variation. Realistic human exposure models for
various microenvironments may be needed as well. Thus the function C(t)
here is a composite, including fractional absorption of environmental
lead and volume of environmental intake (e.g. m3/d of air, L/d of water,
mg/d of leaded soil and dust, g/d of food) as well as concentration.
Exposure statistics used in ozone studies include, for hourly
concentrations C(h):
Effective Mean = (Σ C(h)**p / N)**(1/p)
P7 = seasonal peak of 7-hour daily mean over 0900-1600 hr
P1 = seasonal peak hourly concentration
Total Exposure = Σ C(h)
Total Impact = (Σ C(h)**p)**(1/p)
Phenologically Weighted Cumulative Impact = (Σ w(h) C(h)**p)**(1/p)
HRSxx = number of hours in which C(h) > xx ppm ozone
SUMxx = total ozone concentration in hours with C(h) > xx ppm
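Several of these statistics are straightforward to compute from an hourly series; a sketch with made-up hourly ozone concentrations:

```python
import numpy as np

# Illustrative computation of several hourly ozone exposure statistics.
# The hourly concentrations C(h) and the exponent p are made-up values.
C = np.array([0.02, 0.05, 0.11, 0.14, 0.09, 0.13, 0.06, 0.03])  # ppm
p = 2.0

P1 = C.max()                             # peak hourly concentration
total_exposure = C.sum()                 # sum of C(h)
total_impact = (np.sum(C ** p)) ** (1.0 / p)
HRS10 = int(np.sum(C > 0.10))            # hours with C(h) > 0.10 ppm
SUM10 = float(C[C > 0.10].sum())         # ozone total over those hours
print(P1, HRS10, round(SUM10, 2))        # 0.14 3 0.38
```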
For most chemicals of interest there is not nearly enough
information on pharmacokinetics, toxicokinetics, or temporal variability
of exposure patterns to allow these calculations to be made. However, for
many criteria pollutants, the level of information is adequate and the
ratio between typical population levels and the health effects criterion
level is so close to one as to make this a serious issue. For example, in
1973, the criterion level for blood lead was 30 ug/dl, and the geometric
mean blood lead in urban children was about 15 ug/dl, of which 15 ug/dl
was assumed to be "non-air" background (i.e. regulated by some other
office). Due to the reduction of leaded gasoline during the 1970's, the
mean blood lead level for urban children had fallen to 9-10 ug/dl by
1980, and is likely to be somewhat lower today. However, better data on
health effects (e.g. erythrocyte protoporphyrin increases in iron-
deficient children, or hearing loss and neurobehavioral problems) in
children with lead burdens now suggest a much lower health criterion
level is appropriate, perhaps 10-15 ug/dl. Thus there is still very
little "margin of safety" against random excursions of lead exposure.
This is also true for other criteria pollutants, especially for
sensitive or vulnerable subpopulations. For example, asthmatics may
experience sensitivity to elevated levels of sulfur dioxide or ozone,
especially when exercising. Activity levels certainly affect the
kinetics of gaseous pollutant uptake and elimination. Subpopulation
variations in kinetics and pharmacodynamics may be important. Acute
exposure sampling in air or water (e.g. 1-day Health Advisories for
drinking water) should be sensitive to pharmacokinetic time scales.
Biokinetic information on pollutant uptake and metabolism in humans
is not often available for volatile organic compounds and for most
carcinogens. Thus large uncertainty factors for animal extrapolation and
for route-of-exposure variations are used to provide a conservative level
of exposure. The methods shown here may be less useful in such
situations. But the development of realistic, biologically motivated
pharmacokinetic models for extrapolating animal data to humans may
establish a larger role for assessment of compliance testing for these
chemicals.
I am grateful to Ms. Judy Kapadia for retyping the manuscript, and
to the reviewer for his helpful comments.
Berman M, Weiss MF. 1978. SAAM - Simulation, Analysis, and Modeling.
Manual. U.S. Public Health Service Publ. NIH-180.
Campbell BC, Meredith PA, Moore MR, Watson WS. 1984. Kinetics of lead
following intravenous administration in man. Tox Letters 21:221-233.
CD [Criteria Document]. 1986. Air quality criteria for lead.
Environmental Criteria and Assessment Office, US Environmental Protection
Agency. EPA-600/8-83/028aF (4 volumes). Res. Tri. Pk., NC.

Chamberlain AC. 1985. Prediction of response of blood lead to airborne
and dietary lead from volunteer experiments with lead isotopes. Proc Roy
Soc Lond B224:149-182.
Chamberlain MJ, Massey PMO. 1972. Mild lead poisoning with excessively
high blood lead. Brit J Industr Med 29:458-461.
Christoffersson JO, Ahlgren L, Schutz A, Skerfving S. 1986. Decrease of
skeletal lead levels in man after end of occupational exposure. Arch Env
Health 41:312-318.
Cohen J. Personal communications about OAQPS staff paper, April-Nov.
Cools A, Salle JA, Verberk MM, Zielhuis RL. 1976. Biochemical response of
male volunteers ingesting inorganic lead for 49 days. Int Arch Occup
Environ Health 38:129-139.
DeSilva PE. 1981. Determination of lead in plasma and studies on its
relationship to lead in erythrocytes. Brit J Industr Med 38:209-217.
Duggan MJ. 1983. The uptake and excretion of lead by young children. Arch
Environ Health 38:246-247.
Griffin TB, Coulston F, Wills H, Russell JC, Knelson JH. 1975. Clinical
studies of men continuously exposed to airborne lead. Environ Quality
Safety Suppl 2:254-288.
Harley NH, Kneip TH. 1985. An integrated metabolic model for lead in
humans of all ages. Report, New York Univ. Dept. Environ. Med.
Hunt WF Jr. 1986. A comparison of the precision associated with
alternative sampling plans for one versus three years of information and
monthly versus quarterly averaging times. Memorandum to John Haines,
Office of Air Quality Planning and Standards, US Environ. Protect.
Agency. Jan. 30, 1986.
Larsen RI, Heck WW. 1984. An air quality data analysis system for
interrelating effects, standards, and needed source reductions: Part 8.
An effective mean O3 crop reduction mathematical model. J Air Pollut
Control Assoc 34:1023-1034.
Larsen RI, McCurdy TR, Johnson PM. 1987. The relative importance of ozone
concentration and its variation in injuring soybean. Draft report, Atm.
Sci. Res. Lab., US Env. Protection Agency, Res. Tri. Pk., NC.
Laxen DPH, Lindsay F, Raab EM, Hunter R, Fell GS, Fulton M. 1987. The
variability of lead in dusts within the homes of young children. In
Lead in the Home Environment, ed. E. Culbard. Science Reviews, London.
Lee EH, Tingey DT, Hogsett WE. 1987a. Selection of the best exposure-
response model using various 7-hour ozone exposure statistics. Report for
Office of Air Quality Planning and Standards, US Environ. Protection
Agency.

Lee EH, Tingey DT, Hogsett WE. 1987b. Evaluation of ozone exposure
statistics in exposure-response relationships. Submitted for publication.
Manton WI, Cook JD. 1984. High accuracy (stable isotope dilution)
measurements of lead in serum and cerebrospinal fluid. Brit J Industr Med
41:313-319.
Marcus AH. 1985a. Multicompartment kinetic models for lead. Part III.
Lead in blood plasma and erythrocytes. Environ Res 36:473-489.
Marcus AH. 1985b. Multicompartment kinetic models for lead. Part II.
Linear kinetics and variable absorption in humans without excessive lead
exposures. Environ Res 36:459-472.
Marcus AH. 1985c. Testing alternative nonlinear kinetic models in
compartmental analysis. In: Eisenfeld J, DeLisi C, eds. Mathematics and
Computers in Biomedical Applications. Elsevier Science, New York, pp.
Rabinowitz M, Leviton A, Needleman H. 1984. Variability of blood lead
concentrations during infancy. Arch Environ Health 39:74-77.
Rabinowitz MB, Wetherill GW, Kopple JD. 1973. Lead metabolism in the
normal human: stable isotope studies. Science 182:725-727.
Rabinowitz MB, Wetherill GW, Kopple JD. 1976. Kinetic analysis of lead
metabolism in healthy humans. J Clin Invest 58:260-270.
Ryu JE, Ziegler EE, Nelson SE, Fomon SJ. Dietary and environmental
exposure to lead and blood lead during early infancy. In Dietary and
Environmental Lead: Human Health Effects, ed. Mahaffey K. Elsevier
Science, New York, pp. 167-209.
Schlegel H, Kufner G. 1979. Long-term observation of biochemical effects
of lead in human experiments. J Clin Chem Clin Biochem 17:225-233.
Stuik EJ. 1974. Biological response of male and female volunteers to
inorganic lead. Int Arch Arbeitsmed 33:83-97.
Williams MK. 1984. Biological tests of lead absorption following a brief
massive exposure. J Occup Med 26:532-533.


Richard C. Hertzberg
Environmental Criteria and Assessment Office, U.S. EPA, Cincinnati, OH 45268
Comments on
"Time Scales: Biological, Environmental, Regulatory," Allan H. Marcus
Summary of Presentation
Marcus presents a case for consideration of
physiologic time scales in the determination of
compliance sampling protocols. The general theme of
incorporating physiologic time into risk assessment is
certainly scientifically supportable (e.g., NAS Workshop,
1986, "Pharmacokinetics in Risk Assessment," several
authors), but has been previously proposed only for
setting standards. Marcus takes the application one
step further by showing how improper sampling can fail
to detect exposure fluctuations that have toxicological
significance.
The Regulatory Context
The modeling and data that Marcus presents seem
reasonable, but key items seem to be missing, at least if
this approach is to become used by regulatory agencies.
The examples should show that the refinement will
make a practical difference in the "cost-benefit"
evaluation, and that the required data are accessible.
The first question is: does it matter? Most
standards are set with a fair degree of conservatism, so
that slight excursions above the standard will not pose a
significant health risk. The first impression of Marcus'
proposal is that it is fine tuning, when in fact it is the
coarse control which needs to be turned. Let us
consider the example of lead. Recent research has
suggested that significant impairment of neurological
development can be caused by lead concentrations much
lower than previously thought. In fact, some scientists
have suggested that lead toxicity may be a no-threshold
phenomenon. If such is the case, then EPA's approach
to setting lead standards will change drastically, and
Marcus' example, though not necessarily his proposal,
will probably not apply. But even with the current
standard, it is not clear that results from Marcus'
method will not be lost in the usual noise of biological
data. For example, consider his figure showing the
graphs of data and model fits for 11 human subjects.
First, these results may be irrelevant to the air
pollution issue since the data follow "ingestion"
of lead, not "inhalation." Lead inhalation is in many
ways more complicated than ingestion. Also, using day
30 as an example, the fitted erythrocyte protoporphyrin
levels vary dramatically across individuals (mean = 49,
s.d. = 20.3, range = 36-73). I could not read the graphs
well, but even accounting for differing starting values,
the curve shapes also change across individuals, so that
predictions for any untested individual might be
unreliable.
The second question, that of data requirements,
cannot be answered from this presentation alone. But
some issues can be mentioned. It is not clear that the
correlations between blood lead (Table 1) and monthly
average lead are good predictors of the correlation
between monthly average lead and neurological
impairment. But is the correlation the best indicator of
performance? A better question, perhaps, is: do
changes in blood lead which could be allowed by using
the weakest sampling protocol actually result in
significantly increased incidence of neurological
dysfunction, when compared to the best compliance
sampling procedure as determined using Marcus'
scheme? It is not clear how much data would be
required to answer that question.
Also, it seems that Marcus' approach must have
pharmacokinetic data on humans. The data
requirements are then more severe for most of the
thousands of environmental chemicals, where only
animal data are available. The situation is even worse
for carcinogens, where human cancer incidence data are
not available at the low regulatory levels. In fact, the
orders-of-magnitude uncertainty in the low-dose
extrapolation of cancer bioassays easily swamps the
error due to non-optimal compliance sampling.
So where might this research go? Certainly it
should be further developed. This approach will
definitely be useful for acute regulatory levels, such as
the 1-day Health Advisories for drinking water, where
internal dose and toxicity are closely tied to
pharmacokinetics. It will probably be more significant
for sensitive subgroups, such as children and those with
respiratory disease, where the pharmacokinetics are
likely to be much different from the norm, and where
the tolerance to chemical exposure Is already low. For
those cases, scaling factors and uncertainty factors are
highly inaccurate. Most important is the example
Marcus presents, chemicals where uptake and
elimination rates are dramatically different. For
control of those chemicals, using the "average"
monitored level is clearly misleading, and some
approach such as Marcus' must be used. I would
recommend the following steps:
•	First, demonstrate the need. List at least a
few chemicals that are being improperly
monitored because of their pharmacokinetic
properties.
•	Then, show us that your method works and is
practical.

Statistical Issues 1n Human Exposure Monitoring
William C. Nelson, U.S. EPA, EMSL, Research Triangle Park
Pollutant exposure information provides a critical link in risk
assessment and therefore in environmental decision making. Traditionally,
outdoor air monitoring stations have been necessarily utilized to relate
air pollutant exposures to groups of nearby residents. This approach is
limited by (1) using only the outdoor air as an exposure surrogate when
most individuals spend relatively small proportions of time outdoors and
(2) estimating exposure of a group rather than an individual. More
recently, air monitoring of non-ambient locations, termed microenvironments,
such as residences, offices, and shops has increased. Such data, when
combined with time and activity questionnaire information, can provide
more accurate estimates of human exposure. Development of portable
personal monitors that can be used by the individual study volunteer
provides a more direct method for exposure estimation. Personal samplers
are available for relatively few pollutants, including carbon monoxide and
volatile organic compounds (VOC's) such as benzene, styrene, tetrachloroethylene,
xylene, and dichlorobenzene. EPA has recently performed carbon monoxide
exposure studies in Denver, Colorado and Washington, D.C. which have
provided new information on CO exposure for individual activities and
various microenvironments. VOC personal exposure studies in New Jersey
and California have indicated that, for some hazardous chemicals,
individuals may receive higher exposure from indoor air than from outdoor
air. Indoor sources include tobacco smoke, cleansers, insecticides,
furnishings, deodorizers, and paints. Types of exposure assessment
included in these studies are questionnaires, outdoor, indoor, personal,
and biological (breath) monitoring.
As more sophisticated exposure data become available, statistical
design and analysis questions also increase. These issues include survey
sampling, questionnaire development, errors-in-variables situations, and
estimating the relationship between the microenvironment and direct
personal exposure. Methodological development is needed for models which
permit supplementing the direct personal monitoring approach with an
activity diary, which provides an opportunity for combining these data
with microenvironment data to estimate a population exposure distribution.
Another issue is the appropriate choice between monitoring instruments
of varying precision and cost. If inter-individual exposure variability
is high, use of a less precise instrument of lower cost, which provides an
opportunity for additional study subjects, may be justified. Appropriate
choice of an exposure metric also requires more examination. In some
instances, total exposure may not be as useful as exposure above a threshold.
Because community studies using personal exposure and microenvironmental
measurements are expensive, future studies will probably use smaller
sample sizes but be more intensive. However, since such studies
provide exposure data for individuals rather than only for groups, they
may not necessarily have less statistical power.

Pollutant exposure information is a necessary component of the risk
assessment process. The traditional approach to investigating the
relationship between the pollutant level in the environment and the
concentration available for human inhalation, absorption, or ingestion
has been 1) measurements at an outdoor fixed monitoring site or 2)
mathematical model estimates of pollutant concentration from effluent
emission rate information.1
The limitations of such a preliminary exposure assessment have become
increasingly apparent. For example, recognition of the importance of
indoor pollutant sources, particularly considering the large amount of
time spent indoors, and concern for estimating total personal exposure
have led to more in-depth exposure assessments.
One of the major problems to overcome when conducting a risk assessment
is the need to estimate population exposure. Such estimates require
information on the availability of a pollutant to a population group via
one or more pathways. In many cases, the actual concentrations encountered
are influenced by a number of parameters related to activity patterns.
Some of the more important are: the time spent indoors and outdoors,
commuting, occupations, recreation, food consumption, and water supply.
For specific situations the analyses will involve one major pathway to
man (e.g. outside atmospheric levels for ozone), but for others, such as
heavy metals or pesticides, the exposure will be derived from several
different media.
A framework for approaching exposure assessments for air pollutants
has been described by the National Academy of Sciences Epidemiology of Air
Pollution Committee.2 The activities shown in Figure 1 were considered
to be necessary to conduct an in-depth exposure assessment.
As knowledge about the components of this framework, particularly
sources and effects, has increased, the need for improved data on exposures
and doses has become more critical. A literature review published in
1982 discussed a large number of research reports and technical papers
with schemes for calculating population exposures.3 However, such schemes
are imperfect, relying on the limited data available from fixed air
monitoring stations and producing estimates of "potential exposures" with
unknown accuracy. Up until the 1980's, there were few accurate field
data on the actual exposures of the population to important environmental
pollutants. Very little was known about the variation from person to
person of exposure to a given pollutant, the reasons for these variations,
or the differences in the exposures of subpopulations of a city.
Furthermore, a variety of field studies undertaken in the 1970s and early
1980s showed that the concentrations experienced by people engaged in
various activities (driving, walking on sidewalks, shopping in stores,
working in buildings, etc.) did not correlate well with the simultaneous
readings observed at fixed air-monitoring stations.4-9 Two reviews have
summarized much of the literature on personal exposures to environmental
pollution showing the difficulty of relating conventional outdoor monitoring
data to actual exposures of the population.10,11 No widely acceptable
methodology was available for predicting and projecting future exposures

of a population or for estimating how population exposures might change
in response to various regulatory actions. No satisfactory exposure
framework or models existed.
The total human exposure concept seeks to provide the missing
component in the full risk model: estimates of the total exposures of
the population to environmental pollutants, with known accuracy and
precision. Generating this new type of information requires developing
an appropriate research program and methodologies. The methodology has
been partially developed for carbon monoxide (CO), volatile organic
compounds (VOC's) and pesticides, and additional research is needed to
solve many problems for a variety of other pollutants.
The total human exposure concept defines the human being as the
target for exposure. Any pollutant in a transport medium that comes into
contact with this person, either through air, water, food, or skin, is
considered to be an exposure to that pollutant at that time.
The instantaneous exposure is expressed quantitatively as a
concentration in a particular carrier medium at a particular instant of
time, and the average exposure is the average of the concentration to the
person over some appropriate averaging time. Some pollutants, such as
CO, can reach humans through only one carrier medium, the air route of
exposure. Others, such as lead and chloroform, can reach humans through
two or more routes of exposure (e.g., air, food, and water). If multiple
routes of exposure are involved, then the total human exposure approach
seeks to determine a person's exposure (concentration in each carrier
medium at a particular instant of time) through all major routes of
exposure.
Once implemented, the total human exposure methodology seeks to
provide Information, with known precision and accuracy, on the exposures
of the general public through all environmental media, regardless of
whether the pathways of exposure are air, drinking water, food, or skin
contact. It seeks to provide reliable, quantitative data on the number
of people exposed and their levels of exposures, as well as the sources
or other contributors responsible for these exposures. In the last few
years, a number of studies have demonstrated these new techniques. The
findings have already had an impact on the Agency's policies and priorities.
As the methodology evolves, the research needs to be directed toward
identifying and better understanding the nation's highest priority
pollutant concerns.
The major goals of the Total Human Exposure Program can be summarized
as follows:
Estimate total human exposure for each pollutant of concern
Determine major sources of this exposure
Estimate health risks associated with these exposures
Determine actions to eliminate or at least reduce these risks

The total human exposure concept considers major routes of exposure
by which a pollutant may reach the human target. Then, it focuses on
those particular routes which are relevant for the pollutants of concern,
developing information on the concentrations present and the movement of
the pollutants through the exposure routes. Activity information from
diaries maintained by respondents helps identify the microenvironments of
greatest concern, and in many cases, also helps identify likely contributing
sources. Biological samples of body burden may be measured to confirm
the exposure measurements and to estimate a later step in the risk assessment
process.
In the total human exposure methodology, two complementary conceptual
approaches, the direct and the indirect, have been devised for providing
the human exposure estimates needed to plan and set priorities for reducing
risks.
Direct Approach
The "direct approach" consists of measurements of exposures of the
general population to pollutants of concern.12 A representative, probability-
based sample of the population is selected based on statistical design.
Then, for the class of pollutants under study, the pollutant concentrations
reaching the persons sampled are measured for the relevant environmental
media. A sufficient number of people are sampled using appropriate
statistical sampling techniques to permit inferences to be drawn, with
known precision, about the exposures of the larger population from which
the sample has been selected. From statistical analyses of subject
diaries which list activities and locations visited, it usually is possible
to identify the likely sources, microenvironments, and human activities
that contribute to exposures, including both traditional and nontraditional
sources.
To characterize a population's exposures, it is necessary to monitor
a relatively large number of people and to select them in a manner that
is statistically representative of the larger population. This approach
combines the survey design techniques of the social scientist with the
latest measurement technology of the chemist and engineer, using both
statistical survey methodology and environmental monitoring in a single
field survey. It uses the new miniaturized personal exposure monitors
(PEMs) that have become available over the last decade,13,14,15 and it
adopts the survey sampling techniques that have been used previously to
measure public opinion and human behavior. The U.S. EPA Office of Research
and Development (ORD) has recently conducted several major field studies
using the direct approach, namely, the Total Exposure Assessment Methodology
(TEAM) Study of VOCs, the CO field studies in Washington, D.C. and Denver,
and the non-occupational exposure to pesticides study. These studies
will be described later.
Indirect Approach
Rather than measuring personal exposures directly as in the previous
approach, the "indirect approach" attempts to construct the exposure
profile mathematically by combining information on the times people spend

in particular locations (homes, automobiles, offices, etc.) with the
concentrations expected to occur there. This approach requires a
mathematical model, information on human activity patterns, and statistical
information on the concentrations likely to occur in selected locations,
or "microenvironments". A microenvironment can be defined as a location
of relatively homogeneous pollutant concentration that a person occupies
for some time period. Examples include a house, office, school, automobile,
subway, or bus. An activity pattern is a record of time spent in specific
microenvironments.
In its simplest form the "indirect approach" seeks to compute the
integrated exposure as the sum of the individual products of the concentrations
encountered by a person in a microenvironment and the time the person
spends there. The integrated exposure permits computing the average
exposure for any averaging period by dividing by the time duration of the
averaging period. If the concentration within microenvironment j is
assumed to be constant during the period that person i occupies
microenvironment j, then the integrated exposure Ei for person i will
be the sum of the products of the concentration cj in each microenvironment
and the time spent by person i in that microenvironment:

          J
     Ei = Σ cj tij
         j=1
where Ei = integrated exposure of person i over the time period of interest;
      cj = concentration experienced in microenvironment j;
      tij = time spent by person i in microenvironment j; and
      J = total number of microenvironments occupied by person i over
          the time period of interest.
To compute the integrated exposure Ei for person i, it obviously is
necessary to estimate both cj and tij. If T is the averaging time,
the average exposure Ēi of person i is obtained by dividing by T; that is,
Ēi = Ei/T, where Ei is summed over time T.
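A minimal sketch of this computation, with made-up CO concentrations (ppm) and occupancy times (hours) for four microenvironments:

```python
# Sketch of the indirect approach: integrated exposure E_i as the sum of
# microenvironment concentration x occupancy time. The CO concentrations
# (ppm) and times (hours) below are made-up illustrative values.
conc = {"home": 2.0, "office": 1.5, "car": 9.0, "outdoors": 3.0}        # c_j
occupancy = {"home": 14.0, "office": 8.0, "car": 1.5, "outdoors": 0.5}  # t_ij

E_i = sum(conc[j] * occupancy[j] for j in occupancy)  # ppm-hours
T = sum(occupancy.values())                           # averaging time, hours
avg_exposure = E_i / T                                # time-weighted mean, ppm
print(E_i, round(avg_exposure, 3))                    # 55.0 2.292
```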
Although the direct approach is invaluable in determining exposures
and sources of exposure for the specific population sampled, the Agency
needs to be able to extrapolate to much larger populations. The indirect
approach attempts to measure and understand the basic relationships
between causative variables and resulting exposures, usually in particular
microenvironments, through "exposure modeling." An exposure model takes
data collected in the field, and then, in a separate and distinct activity,
predicts exposure. The exposure model is intended to complement results
from direct studies and to extend and extrapolate these findings to other
locales and other situations. Exposure models are not traditional
dispersion models used to predict outdoor concentrations; they are
different models designed to predict the exposure of a rather mobile
human being. Thus, they require information on typical activities and
time budgets of people, as well as information on likely concentrations
in places where people spend time.

The U.S. EPA ORD has also conducted several studies using the indirect
approach. An example of a recent exposure model is the Simulation of
Human Activities and Pollutant Exposures (SHAPE) model, which has been
designed to predict the exposures of populations to CO in urban
areas. This model is similar to the NAAQS Exposure Model (NEM). The
SHAPE model used the CO concentrations measured in the Washington-Denver
CO study to determine the contributions to exposure from commuting,
cooking, cigarette smoke, and other factors. Once a model such as SHAPE
is successfully validated (by showing that it accurately predicts exposure
distributions measured in a TEAM field study), it can be used in a new
city without a field study to make a valid prediction of that population's
exposures using that city's data on human activities, travel habits, and
outdoor concentrations. The goal of future development is to apply the
model to other pollutants (e.g., VOCs, household pesticides) making it
possible to estimate exposure frequency distributions for the entire
country, or for major regions.
Field Studies
The total human exposure field studies form a central part of the
U.S. EPA ORD exposure research program. Several studies have demonstrated
the feasibility of using statistical procedures to choose a small
representative sample of the population from which it is possible to make
inferences about the whole population. Certain subpopulations of importance
from the standpoint of their unique exposure to the pollutant under study
are "weighted" or sampled more heavily than others. In the subsequent
data analysis phases, sampling weights are used to adjust for the
overrepresentation of these groups. As a result, it is possible to draw
conclusions about the exposures of the larger population of a region with
a study that is within acceptable costs.
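The weighting adjustment described above can be sketched as follows; the subgroups, exposures, and inclusion probabilities below are invented for illustration:

```python
# Sketch of using sampling weights to correct for deliberate oversampling of a
# high-exposure subpopulation. All numbers are hypothetical.
# Weight = 1 / (inclusion probability); the weighted mean estimates the
# population mean exposure despite unequal sampling rates.

# (exposure, inclusion probability) for each sampled person:
sample = [
    (8.0, 0.10),  # oversampled commuter
    (7.0, 0.10),  # oversampled commuter
    (2.0, 0.01),  # noncommuter, sampled at a much lower rate
    (3.0, 0.01),  # noncommuter
]

weights = [1.0 / p for (_, p) in sample]
weighted_mean = sum(w * x for (x, _), w in zip(sample, weights)) / sum(weights)
unweighted_mean = sum(x for x, _ in sample) / len(sample)

print(weighted_mean, unweighted_mean)  # about 2.95 vs. 5.0
```

The unweighted mean overstates the population exposure because the heavily exposed commuters were intentionally overrepresented; the weights undo that design choice.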
Once the sample of people has been selected, their exposures to the
pollutant through various environmental media (air, water, food, skin)
are measured. Some pollutants have negligible exposure routes through
certain media, thus simplifying the study. Two large-scale total human
exposure field studies have been undertaken by U.S. EPA to demonstrate
this methodology: the TEAM study of VOCs and the Denver - Washington DC,
field study of CO.
The first set of TEAM studies (1980-84) was the most extensive
investigation of personal exposures to multiple pollutants and corresponding
body burdens. In all, more than 700 persons in 10 cities have had their
personal exposures to 20 toxic compounds in air and drinking water measured,
together with levels in exhaled breath as an indicator of blood
concentration.17-19 Because of the probability survey design used,
inferences can be made about a larger target population in certain areas:
128,000 persons in Elizabeth/Bayonne, NJ; 100,000 persons in the South
Bay Section of Los Angeles, CA; and 50,000 persons in Antioch/Pittsburg, CA.

The major findings of the TEAM Study may be summarized as follows:
1.	Great variability (2-3 orders of magnitude) of exposures occurs even
in small geographical areas (such as a college campus) monitored on the
same day.
2.	Personal and overnight indoor exposures consistently outweigh outdoor
concentrations. At the higher exposure levels, indoor concentrations may
be 10-100 times the outdoor concentrations, even in New Jersey.
3.	Drinking water and beverages in some cases are the main pathways of
exposure to chloroform and bromodichloromethane, while air is the main route
of exposure to 10 other prevalent toxic organic compounds.
4.	Breath levels are significantly correlated with previous personal
air exposures for all 10 compounds. On the other hand, breath levels are
usually not significantly correlated with outdoor levels, even when the
outdoor level is measured in the person's own backyard.
5.	Activities and sources of exposure were significantly correlated
with higher breath levels for the following chemicals:
benzene: visits to service stations, smoking, work in chemical and
paint plants;
tetrachloroethylene: visits to dry cleaners.
6.	Although questionnaires adequate for identifying household sources
were not part of the study, the following sources were hypothesized:
p-dichlorobenzene: moth crystals, deodorizers, pesticides;
chloroform: hot showers, boiling water for meals;
styrene: plastics, insulation, carpets;
xylenes and ethylbenzene: paints, gasoline.
7.	Residence near major outdoor point sources of pollution had little
effect, if any, on personal exposure.
The TEAM direct approach has four basic elements:
•	Use of a representative probability sample of the population under study
•	Direct measurement of the pollutant concentrations reaching these people through all media (air, food, water, skin contact)
•	Direct measurement of body burden to infer dosage
•	Direct recording of each person's daily activities through diaries
The Denver - Washington, DC CO Exposure Study utilized a methodology
for measuring the frequency distribution of CO exposures in a representative
sample of urban populations during 1982-83.20-22 Household data were
collected from over 4400 households in Washington, DC and over 2100

households in the Denver metropolitan area. Exposure data using personal
monitors were collected from 814 individuals in Washington, DC, and 450
individuals in Denver, together with activity data from a stratified
probability sample of the residents living in each of the two urban areas.
Established survey sampling procedures were used. The resulting exposure
data permit statistical comparisons between population subgroups (e.g.,
commuters vs. noncommuters, and residents with and without gas stoves).
The data also provide evidence for judging the accuracy of exposure
estimates calculated from fixed site monitoring data.
Additional efforts are underway to use these data to recognize indoor
sources and factors which contribute to elevated CO exposure levels and
to validate existing exposure models.
Microenvironment Models
Utilizing data collected in the Washington, DC urban-scale CO study,
two modeling and evaluation analyses have been developed. The first,
conducted by Duan, evaluates the use of microenvironmental
and activity pattern data in estimating a defined population's exposure to
CO.16 The second, conducted by Flachsbart, models the microenvironmental
situation of commuter rush-hour traffic (considering type and age of
vehicle, speed, and meteorology) and observed CO concentrations. With
the assistance of a contractor, U.S. EPA has collected data on traffic
variables, traffic volume, types of vehicles, and model year. An earlier
study measured CO in a variety of microenvironments and under a variety
of conditions.
The indirect method for estimating population exposure to CO was
compared to the direct method, based on the CO concentrations observed
while people carried personal monitors during their daily activities. The
indirect estimate agreed with that derived from personal monitoring at
low concentration levels, say 1 ppm, but diverged at levels above that.
For example, at the 5 ppm level, indirect estimates were about half the
direct estimates within the regression model utilizing these data.
Although the results are limited, it appears that when monitoring experts
design microenvironmental field surveys, there is a tendency to sample
more heavily in those settings where the concentration is expected to be
higher, thereby causing the indirect method to yield exaggerated levels.
The possibility of using microenvironmental measurements and/or activity
patterns from one city to extrapolate to those of another city is doubtful
but not yet fully evaluated.
Dosimetry Research
The development of reliable biological indicators of either specific
pollutant exposures or health effects is in its early stages. A limited
number of biomarkers such as blood levels of lead or CO have been recognized
and used for some time. Breath levels of VOCs or CO have also been
measured successfully. However, the use of other biomarkers, such as
cotinine, a metabolite of nicotine, as a tracer compound of environmental
tobacco smoke, is still in its experimental phase. This also applies to
use of the hydroxyproline-to-creatinine ratio as a measure of NO2 exposure
and also to use of DNA adducts, which form as a result of VOC exposure and
have been found to be correlated with genotoxic measures. Dosimetry
methods development, though still very new and too often not yet ready
for field application for humans, is obviously a very promising research
area.
Exhaled breath measurements have been used successfully in VOC and CO
exposure studies. Since breath samples can be obtained noninvasively,
they are preferred to blood measurements whenever they can meet the
exposure research goals. A methodology to collect expired samples on a
Tenax adsorbent has been developed and used on several hundred TEAM study
subjects. Major findings have included the discovery that breath levels
generally exceed outdoor levels, even in heavily industrialized petrochemical
manufacturing areas. Significant correlations of breath levels with
personal air exposures for certain chemicals give further proof that the
source of the high exposure is in personal activities or indoors, at home
as well as at work.
The basic advantages of monitoring breath rather than blood or tissues are:
1.	Greater acceptability by volunteers. Persons give breath samples
more readily than blood samples. The procedure is rapid and convenient,
taking only 5-10 min in all.
2.	Greater sensitivity. Since volatile organic compounds often have a
high air-to-blood partition coefficient, they will have higher concentrations
in breath than in blood under equilibrium conditions. Thus, more than
100 compounds have been detected in the breath of subjects where
simultaneously collected blood samples showed only one or two above
detectable limits.
3.	Fewer analytical problems. Several "clean-up" steps must be completed
with blood samples, including centrifuging, extraction, etc., with each
step carrying the possibility of loss or contamination of the sample.
Measurements of CO in expired air often are used as indicators of
carboxyhemoglobin (COHb) concentrations in blood, although the precise
relationship between alveolar CO and blood COHb has not been agreed upon.
The U.S. EPA exposure monitoring program therefore included a breath
monitoring component in its study of CO exposures in Denver and Washington,
DC. The purpose was (1) to estimate the distribution of alveolar CO (and
therefore blood COHb) concentrations in the nonsmoking adult residents of
the two cities; and (2) to compare the alveolar CO measurements to preceding
personal CO exposures.
The major findings of the breath monitoring program included:
1. The percent of nonsmoking adults with alveolar CO exceeding 10 ppm
(i.e., blood COHb above 2%) was 11% in Denver and 6% in Washington, DC.

2.	The correlations between breath CO and previous 8-h CO exposure were
0.5 for Denver and 0.66 for Washington, DC.
3.	The correlations between personal CO exposures at home or at work
and ambient CO at the nearest stations averaged 0.25 at Denver and 0.19
at Washington, DC. Thus, the ambient data explained little of the
variability of CO exposure.
Sampling Protocols
Statistical sampling protocols provide the design for large-scale total
human exposure field studies. They describe the procedures to be used in
identifying respondents, choosing the sample sizes, selecting the number
of persons to be contacted within various subpopulations, and other
factors. They are essential to the total human exposure research program
to ensure that a field survey will provide the information necessary to
meet its objectives. Because one's activities affect one's exposures,
another unique component of the total human exposure research program is
the development of human activity pattern data bases. Such data bases
provide a record describing what people do in time and space.
Whenever the objectives of a study are to make valid inferences beyond
the group surveyed, a statistical survey design is required. For exposure
studies, the only statistically valid procedure that is widely accepted
for making such inferences is to select a probability sample from the
target population. The survey designs used in the total exposure field
studies have been three-stage and probability-based, consisting of areas
defined by census tracts, households randomly selected within the census
tracts, and stratified sampling of screened eligible individuals.20,24
TEAM Design Considerations
It appears that some variability in the TEAM exposure data might be
due to meteorological factors such as some receptors being downwind of the
sources while others are not. A more careful experimental design that
includes consideration of these factors, including measurement of
appropriate meteorological parameters, may lead to more meaningful data
in future studies.
Other TEAM design considerations are:
1.	The intraperson temporal variation in VOC exposure is crucial in
risk assessment and should be given a high priority in future studies.
2.	Given the substantial measurement error, the estimated exposure
distributions can be substantially more heterogeneous than the true
exposure distributions. For example, the variance of the estimated
exposures is the sum of the variance of the true exposures and the
variance of the measurement errors, assuming that: a) measurement
errors are homoscedastic, and b) there is no correlation between
measurement error and true exposure. Empirical Bayes methods are
available for such adjustments.

3.	The relatively high refusal rate in the sample enrollment is of
concern. A more rigorous effort in the future to assess the impact
of the refusal on the generalizability of the sample is desirable.
For example, a subsample of the accessible part of the refusals can
be offered an incentive to participate, or be offered a less intensive
protocol for their participation; the data from the would-be refusals
can then be compared with the "regular" participants to assess the
possible magnitudes of selection bias.
4.	In future studies, the following might be used:
a.	use of closed format questionnaires,
b.	use of artificial intelligence methodology,
c.	use of automated instrument output.
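The variance decomposition in design consideration 2 can be checked with a small simulation; the distributions and parameters below are invented, not TEAM data:

```python
# Numerical check of: Var(estimated exposure) =
#   Var(true exposure) + Var(measurement error),
# assuming homoscedastic errors uncorrelated with true exposure.
# All values are simulated for illustration.
import random

random.seed(1)
n = 200_000
true_sd, err_sd = 2.0, 1.0

true = [random.gauss(10.0, true_sd) for _ in range(n)]       # true exposures
observed = [x + random.gauss(0.0, err_sd) for x in true]     # with error

def var(xs):
    """Population variance."""
    m = sum(xs) / len(xs)
    return sum((x - m) ** 2 for x in xs) / len(xs)

# Observed variance should be near 2.0**2 + 1.0**2 = 5.0, i.e. more
# heterogeneous than the true exposure distribution (variance 4.0).
print(var(observed), var(true) + err_sd ** 2)
```

This inflation is exactly what an empirical Bayes (shrinkage) adjustment is designed to remove when estimating the true exposure distribution.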
Development of Improved Microenvironmental Monitoring Designs
The direct method of personal exposure monitoring is appealing but is
expensive and burdensome to human subjects. Monitoring microenvironments
instead is less costly but estimates personal exposure only indirectly.
Obviously these approaches can be used in a complementary way to answer
specific pollutant exposure questions.
With either method, a crucial issue is how to stratify the
microenvironments into relatively homogeneous microenvironment types
(METs).12 Usually there are many possible ways to stratify the
microenvironments into METs; thus there can be many potentially distinct
METs. Obviously one cannot implement a stratification scheme with five
hundred METs in field studies. It is therefore important to develop
methods for identifying the most informative ways to stratify the
microenvironments into METs. For example, if we can only afford to
distinguish two METs in a field study, is it better to distinguish indoor
and outdoor as the two METs, or is it better to distinguish awake and
sleeping as the two METs?
Some of the more important issues which will require additional
methodological development are:
1.	How to identify the most informative ways to stratify microenvironments
into METs.
2.	How to optimize the number of METs, choosing between a larger number
of METs and fewer microenvironments for each MET, and a smaller
number of METs and more microenvironments for each MET.
3.	How to allocate the number of monitored microenvironments across
different METs: one should monitor more microenvironments for the
more crucial METs (those in which the human subjects spend more of
their time) than the less crucial METs.
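One simple way to approach the allocation issue in item 3 is a Neyman-style rule: sample each MET in proportion to its share of person-time multiplied by its within-MET variability. The METs, time shares, and standard deviations below are assumptions for illustration:

```python
# Hypothetical sketch of allocating a fixed number of monitored
# microenvironments across METs in proportion to
# (population time share) x (within-MET concentration standard deviation).

mets = {
    # MET: (fraction of person-time spent there, within-MET concentration sd)
    "indoors_home": (0.65, 1.0),
    "indoors_work": (0.25, 2.0),
    "in_vehicle":   (0.05, 4.0),
    "outdoors":     (0.05, 3.0),
}

total_monitors = 100
scores = {m: share * sd for m, (share, sd) in mets.items()}
total = sum(scores.values())
allocation = {m: round(total_monitors * s / total) for m, s in scores.items()}
print(allocation)
```

Under these assumed inputs the home MET gets the most monitors because people spend most of their time there, even though its concentrations vary least; note that independent rounding may not sum exactly to the target.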

Development and Validation of Improved Models for Estimating Personal
Exposure from Microenvironmental Monitoring Data
Methodological development is needed for models which allow
supplementing the direct personal monitoring approach with an activity
diary, enabling these data to be combined with indirect-approach
microenvironmental data to estimate personal exposure through a regression-
like model. The basic exposure model, which sums over microenvironments,

    Ei = Σ cj tij,

can be interpreted as a regression model with the concentrations being
the parameters to be estimated. To fully develop this approach, it is
necessary to make crucial assumptions about independence between individuals
and between METs. Therefore, it is very important to validate the method
empirically.
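A sketch of this regression interpretation follows, with simulated time budgets and exposures; the fitted coefficients of the no-intercept model recover the MET concentrations cj:

```python
# Regression interpretation of Ei = sum_j cj * tij: with exposures Ei and
# time budgets tij observed, the cj are coefficients of a no-intercept
# linear model. Data below are simulated, not from a field study.
import numpy as np

rng = np.random.default_rng(0)
true_c = np.array([1.5, 9.0, 2.0])        # true cj (ppm) for 3 METs

n = 500                                    # persons
t = rng.uniform(0.0, 8.0, size=(n, 3))     # tij: hours in each MET
E = t @ true_c + rng.normal(0.0, 1.0, n)   # Ei with measurement noise

# Least-squares estimate of the MET concentrations
c_hat, *_ = np.linalg.lstsq(t, E, rcond=None)
print(c_hat)  # close to [1.5, 9.0, 2.0]
```

The independence assumptions in the text matter here: if individuals within a MET share correlated concentrations, ordinary least squares still gives estimates but understates their uncertainty.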
Errors-in-Variables Problem
It is important to recognize an errors-in-variables situation which
may often occur in exposure assessment: estimating the relationship
between two variables, Y (a health effect) and X (true personal exposure),
when X is not observed but a surrogate of X, say Z, which is related to X,
is observed. Such surrogates may have systematic errors as well as zero-
centered random errors. The effects of the measurement bias are more
serious in estimation situations than in hypothesis testing.
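The attenuation caused by zero-centered random error in the surrogate can be illustrated with the classical errors-in-variables result; all parameters below are hypothetical:

```python
# Simulated demonstration of errors-in-variables attenuation: regressing a
# health response Y on a noisy surrogate Z instead of the true exposure X
# shrinks the slope by the factor Var(X) / (Var(X) + Var(error)).
import numpy as np

rng = np.random.default_rng(42)
n = 100_000
beta = 2.0                                # true effect of exposure on health
sigma_x, sigma_u = 1.0, 1.0               # sd of true exposure and of error

X = rng.normal(0.0, sigma_x, n)           # true (unobserved) exposure
Z = X + rng.normal(0.0, sigma_u, n)       # observed surrogate
Y = beta * X + rng.normal(0.0, 1.0, n)    # health response

slope_z = np.cov(Z, Y)[0, 1] / np.var(Z)  # slope of Y on Z
attenuation = sigma_x**2 / (sigma_x**2 + sigma_u**2)  # = 0.5 here
print(slope_z, beta * attenuation)        # both near 1.0, half the true 2.0
```

With equal exposure and error variances the estimated effect is halved, which is why estimation suffers more than hypothesis testing: the null of no effect is still rejected, but the effect size is badly biased.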
Choice Between Monitoring Instruments of Varying Precision and Cost
When designing monitoring programs, it is common to have available
instruments of varying quality. Measurement devices that are less
expensive to obtain and use are typically also less accurate and precise.
Strategies could be developed and evaluated that consider the costs of
measurement as well as the precision. In situations of high between-
individual exposure variability, a less precise instrument of lower cost
may be preferred if it permits an opportunity for enough additional study
subjects to be monitored.
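A rough sketch of this tradeoff under an assumed fixed budget; the instrument costs and standard deviations are invented:

```python
# Hypothetical cost-precision tradeoff: with a fixed budget, a cheaper but
# noisier instrument can give a lower-variance estimate of the population
# mean exposure when between-person variability dominates instrument error.

budget = 10_000.0
sigma_between = 3.0   # assumed between-person exposure sd

# instrument name: (cost per subject, instrument measurement sd) -- invented
instruments = {"precise": (500.0, 0.1), "cheap": (50.0, 1.5)}

def variance_of_mean(cost, meas_sd):
    """Variance of the sample mean using all the budget on one instrument."""
    n = int(budget // cost)                       # subjects affordable
    return (sigma_between**2 + meas_sd**2) / n

for name, (cost, sd) in instruments.items():
    print(name, variance_of_mean(cost, sd))
```

Here the cheap instrument buys ten times as many subjects, and since instrument noise is small relative to between-person variability, its estimate of the mean is far more precise despite the noisier readings.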
Development of Designs Appropriate for Assessing National Levels
At the present time, the data available for the assessment of personal
exposure distributions are restricted to a limited number of locales.
The generalization from existing data to a very general population such
as the national population requires a great deal of caution. However, it
is conceivable that large scale studies or monitoring programs aimed at a
nationally representative sample might be implemented in the future. It
would be useful to consider the design of such studies using data presently
available. It would also be useful to design studies of more limited
scales to be conducted in the near future as pilot studies for a possible
national study, so as to collect information which might be useful for
the design of a national study.

An issue in the design of a national study is the amount of clustering
of the sample: one has to decide how many locales to use, and how large
a sample to take for each locale. The decision depends partly on the
fixed cost in using additional locales, and partly on the intracluster
correlation for the locales. For many of the VOCs measured in the TEAM
studies, there is far more variability within locales than between locales;
in other words, there is little intracluster correlation for the locales.
This would indicate that a national study should be highly clustered,
with a few locales and a large sample for each locale. On the other
hand, if there is more variability between locales than within locales, a
national study should use many locales and a small sample for each locale.
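The clustering tradeoff just described is commonly summarized by the design effect, deff = 1 + (m - 1)ρ, for cluster size m and intracluster correlation ρ; the numbers below are illustrative, not TEAM estimates:

```python
# Sketch of how intracluster correlation (rho) drives the clustering
# decision: the design effect deff = 1 + (m - 1) * rho gives the variance
# inflation of a mean from clusters of size m relative to a simple random
# sample of the same total size. Values are illustrative.

def design_effect(m, rho):
    """Variance inflation for cluster size m, intracluster correlation rho."""
    return 1.0 + (m - 1) * rho

# Little intracluster correlation (as for many TEAM VOCs):
# large clusters cost little efficiency, so a few big locales suffice.
print(design_effect(m=500, rho=0.01))   # deff about 6

# Strong intracluster correlation: large clusters are very inefficient,
# arguing for many locales with small samples in each.
print(design_effect(m=500, rho=0.30))   # deff about 151
```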
Further analysis of the existing TEAM data base can help to address
these issues. For example, the TEAM sample to date can be identified as
a "population" from which various "samples" can be taken. The characteristics
of various sample types can be useful for the design of any followup
studies as well as for a larger new study.
Evaluating Extreme Values in Exposure Monitoring
Short term extreme values of pollutant exposure may well be more
important from a biological point of view than elevated temporal mean
values. The study of statistical properties of extreme values from
multivariate spatio-temporally dependent data is in its infancy. In
particular, the possibility of synergy necessitates the development of a
theory of multivariate extreme values. It is desirable to develop estimates
of extreme quantiles of pollutant concentration.
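As a univariate starting point, an extreme quantile can be computed from an assumed parametric fit to the concentration distribution; the lognormal parameters below are hypothetical:

```python
# Sketch of estimating an extreme quantile of a pollutant concentration
# distribution under a lognormal assumption (a common, though not
# universal, model choice). Parameters are hypothetical.
import math
from statistics import NormalDist

# Assumed fitted parameters of log-concentration
mu, sigma = 0.5, 1.0

def lognormal_quantile(p):
    """p-th quantile of a lognormal with log-scale parameters mu, sigma."""
    return math.exp(mu + sigma * NormalDist().inv_cdf(p))

print(lognormal_quantile(0.999))  # 99.9th percentile concentration
```

For dependent or multivariate data this simple calculation is only a lower bound on the difficulty; joint extremes of several pollutants (the synergy concern above) need the multivariate extreme-value theory the text calls for.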
Estimation Adjustment for Censored Monitoring Data
One should develop low exposure level extrapolation procedures and
models, and check the sensitivity of these procedures to the models
chosen. In some cases a substantial fraction of exposure monitoring data
is below the detection limit even though these low exposure levels may be
important. The problem of extrapolating from measured to unmeasured
values thus naturally arises. Basically this is a problem of fitting the
lower tail of the pollutant concentration distribution. Commonly used
procedures assume either that below detectable level values are actually
at the detection limit, or that they are zero, or that they are one-half
of the detection limit.
In many monitoring situations we may find a good fit to simple models
such as the lognormal for that part of the data which lies above the
detection limit. Then the calculation of total exposure would use a
lognormal extrapolation of the lower tail.
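A sketch of such a censored-lognormal fit by maximum likelihood, with simulated data and an assumed detection limit (values below the limit contribute their probability mass to the likelihood rather than being replaced by 0, the limit, or half the limit):

```python
# Maximum-likelihood fit of a lognormal to left-censored monitoring data.
# Each nondetect contributes P(X < DL) to the likelihood. Simulated data.
import numpy as np
from scipy.optimize import minimize
from scipy.stats import norm

rng = np.random.default_rng(7)
mu_true, sigma_true, DL = 0.0, 1.0, 0.5   # lognormal params, detection limit

x = np.exp(rng.normal(mu_true, sigma_true, 2000))
observed = x[x >= DL]                      # measured (detected) values
n_cens = int((x < DL).sum())               # count of nondetects

def neg_loglik(theta):
    mu, log_sigma = theta
    sigma = np.exp(log_sigma)              # keeps sigma positive
    ll = norm.logpdf(np.log(observed), mu, sigma).sum()
    ll += n_cens * norm.logcdf((np.log(DL) - mu) / sigma)
    return -ll

fit = minimize(neg_loglik, x0=[0.0, 0.0])
mu_hat, sigma_hat = fit.x[0], np.exp(fit.x[1])
print(mu_hat, sigma_hat)                   # near the true 0.0 and 1.0
```

With the fitted parameters, the lower tail below the detection limit can be extrapolated analytically, as the text suggests, instead of substituting an arbitrary constant for each nondetect.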
Personal exposure assessment is a critical link in the overall risk
assessment framework. Recent advances in exposure monitoring have provided
new capabilities and additional challenges to the environmental research
team, particularly to the statistician, to improve the current state of

information on microenvironment concentrations, activity patterns, and
particularly personal exposure. If these opportunities are realized,
then risk assessments can more often use human exposure and risk data in
addition to available animal toxicology information.

References
1.	Lioy, P. J. (1987) In depth exposure assessments, JAPCA, 37, 791-
2.	Epidemiology of Air Pollution, National Research Council, National
Academy Press, Washington, DC (1985), 1-334.
3.	Ott, W. R. (1982) Concepts of human exposure to air pollution,
Environ. Int., 7, 179-196.
4.	Cortese, A. D. and Spengler, J. D. (1976) Ability of fixed monitoring
stations to represent carbon monoxide exposure, J. Air Pollut.
Control Assoc., 26, 1144.
5.	Flachsbart, P. G. and Ott, W. R. (1984) Field surveys of carbon
monoxide in commercial settings using personal exposure monitors.
EPA-600/4-84-019, PB-84-211291, U.S. Environmental Protection
Agency, Washington, DC.
6.	Wallace, L. A. (1979) Use of personal monitor to measure commuter
exposure to carbon monoxide in vehicle passenger compartment.
Paper No. 79-59.2, presented at the 72nd Annual Meeting of the
Air Pollution Control Association, Cincinnati, OH.
7.	Ott, W. R. and Eliassen, R. (1973) A survey technique for determining
the representativeness of urban air monitoring stations with
respect to carbon monoxide, J. Air Pollut. Control Assoc., 23,
8.	Ott, W. R. and Flachsbart, P. (1982) Measurement of carbon monoxide
concentrations in indoor and outdoor locations using personal
exposure monitors, Environ. Int., 8, 295-304.
9.	Peterson, W. B. and Allen, R. (1982) Carbon monoxide exposures to
Los Angeles commuters, J. Air Pollut. Control Assoc., 32, 826-833.
10.	Spengler, J. D. and Soczek, M. L. (1984) Evidence for improved
ambient air quality and the need for personal exposure research,
Environ. Sci. Technol., 18, 268A-280A.
11.	Ott, W. R. (1985) Total human exposure: An emerging science focuses
on humans as receptors of environmental pollution, Environ.
Sci. Technol., 19, 880-886.
12.	Duan, N. (1982) Models for human exposure to air pollution, Environ.
Int., 8, 305-309.
13.	Mage, D. T. and Wallace, L. A., eds. (1979) Proceedings of the
Symposium on the Development and Usage of Personal Monitors for
Exposure and Health Effects Studies. EPA-600/9-79-032, PB-80-
143-894, U.S. Environmental Protection Agency, Research Triangle
Park, NC.
14.	Wallace, L. A. (1981) Recent progress in developing and using personal
monitors to measure human exposure to air pollution, Environ.
Int., 5, 73-75.
15.	Wallace, L. A. and Ott, W. R. (1982) Personal monitors: A state-of-
the-art survey, J. Air Pollut. Control Assoc., 32, 601-610.
16.	Duan, N. (1984) Application of the microenvironment type approach to
assess human exposure to carbon monoxide. Rand Corp., draft
final report submitted to the U.S. Environmental Protection
Agency, Research Triangle Park, NC.
17.	Wallace, L. A., Zweidinger, R., Erickson, M., Cooper, S., Whitaker,
D., and Pellizzari, E. D. (1982) Monitoring individual exposure:
Measurements of volatile organic compounds in breathing-zone
air, drinking water, and exhaled breath, Environ. Int., 8, 269-282.
18.	Wallace, L., Pellizzari, E., Hartwell, T., Rosenzweig, M., Erickson,
M., Sparacino, C., and Zelon, H. (1984) Personal exposures
to volatile organic compounds: I. Direct measurements in
breathing-zone air, drinking water, food, and exhaled breath,
Environ. Res., 35, 293-319.
19.	Wallace, L., Pellizzari, E., Hartwell, T., Zelon, H., Sparacino, C.,
and Whitmore, R. (1984) Analyses of exhaled breath of 335
urban residents for volatile organic compounds, in Indoor Air,
vol. 4: Chemical Characterization and Personal Exposure, pp.
15-20. Swedish Council for Building Research, Stockholm.
20.	Akland, G. G., Hartwell, T. D., Johnson, T. R., and Whitmore, R. W.
(1985) Measuring human exposure to carbon monoxide in Washington,
DC, and Denver, Colorado, during the winter of 1982-83, Environ.
Sci. Technol., 19, 911-918.
21.	Johnson, T. (1984) A study of personal exposure to carbon monoxide
in Denver, Colorado. EPA-600/4-84-015, PB-84-146-125,
Environmental Monitoring Systems Laboratory, U.S. Environmental
Protection Agency, Research Triangle Park, NC.
22.	Hartwell, T. D., Carlisle, A. C., Michie, R. M., Jr., Whitmore, R.
W., Zelon, H. S., and Whitehurst, D. A. (1984) A study of carbon
monoxide exposure of the residents in Washington, DC. Paper
No. 121.4, presented at the 77th Annual Meeting of the Air
Pollution Control Association, San Francisco, CA.
23.	Holland, D. M. and Mage, D. T. (1983) Carbon monoxide in four cities
during the winter of 1981. EPA-600/4-83-025, Environmental
Monitoring Systems Laboratory, U.S. Environmental Protection
Agency, Research Triangle Park, NC.
24.	Whitmore, R. W., Jones, S. M., and Rosenzweig, M. S. (1984) Final
sampling report for the study of personal CO exposure. EPA-
600/S4-84-034, PB-84-181-957, Environmental Monitoring
Systems Laboratory, U.S. Environmental Protection Agency,
Research Triangle Park, NC.

[Figure: the exposure-dose framework linking exposure, internal dose, biologically effective dose, and health effect.]

[Figure: total human exposure program objectives:]
•	Estimate total human exposure for each pollutant of concern
•	Determine major sources of this exposure
•	Estimate health risks associated with these exposures
•	Determine actions to reduce these risks

[Figure: example exposure sources: toxic wastes, tobacco smoke, gas stoves, dry cleaning.]

[Figure: monitoring approaches: outdoor, indoor, personal, and biological monitoring.]
William F. Hunt, Jr.
Chief, Monitoring and Research Branch
Technical Support Division
Research Triangle Park, NC 27711
William C. Nelson's paper provides an
excellent overview of exposure monitoring
and associated statistical issues. The
reader must keep in mind that the paper
is directed at estimating air pollution
in microscale environments—in the home,
at work, in automobiles, etc., as well as
in the ambient air to which the general
public has access.
While it is important to better
understand air pollution levels in each
of these microenvironments, it must be
clearly understood that the principal
focus of the nation's air pollution
control program is directed at
controlling ambient outdoor air pollution
levels to which the general public has
access. The Clean Air Act (CAA) of 1970
and the CAA of 1977 emphasized the
importance of setting and periodically
reviewing the National Ambient Air
Quality Standards (NAAQS) for the
nation's most pervasive ambient air
pollutants—particulate matter, sulfur
dioxide, carbon monoxide, nitrogen
dioxide, ozone and lead. NAAQS(s) were
set to protect against both public health
and welfare effects.
One of these pollutants, carbon
monoxide (CO), is discussed extensively
in Dr. Nelson's paper. CO is a
colorless, odorless, poisonous gas formed
when carbon in fuels is not burned
completely. Its major source is motor
vehicle exhaust, which contributes more
than two-thirds of all emissions
nationwide. In cities or areas with
heavy traffic congestion, however,
automobile exhaust can cause as much as
95 percent of all emissions, and carbon
monoxide concentrations can reach very
high levels.
In Dr. Nelson's paper, he states that
the correlations between personal CO
exposures at home or at work and ambient
CO at the nearest fixed site air
monitoring stations are weak. This does
not mean from an air pollution control
standpoint, however, that there is
something wrong with the fixed site CO
monitoring network. As stated earlier,
the air pollution control program is
directed at controlling outdoor ambient
air at locations to which the public has
access. The microscale CO monitoring
sites are generally located in areas of
highest concentration within metropolitan
areas at locations to which the general
public has access.
The Federal Motor Vehicle Control
Program has been very successful in
reducing these concentrations over time.
In fact, CO levels have dropped 32
percent between 1977 and 1986, as
measured at the nation's fixed site
monitoring networks.1 This improvement
has a corresponding benefit for people in
office buildings which use the outdoor
ambient air to introduce fresh air into
their buildings through their ventilation
systems. A major benefit occurs for
people who are driving back and forth to
work in their automobiles, for new cars
are much less polluting than older cars.
This should be clearly understood when
trying to interpret the major findings of
the breath monitoring programs that are
described in Dr. Nelson's paper.
Otherwise, the reader could mistakenly
conclude that somehow the Federal
Government may be in error in using fixed
site monitoring. Such a conclusion would
be incorrect. Further, it should be
pointed out that a fixed site network
also has the practical advantages of
identifying the source of the problem and
the amount of pollution control that
would be needed.
Another area of concern that needs to
be addressed in the future regarding the
breath monitoring program is the
relationship between alveolar CO and
blood carboxyhemoglobin (COHb). Dr.
Nelson states that the precise
relationship between alveolar CO and
blood COHb has not been agreed upon.
Given that, is there an inconsistency in
not being able to determine the
relationship between alveolar CO and
blood COHb and then using alveolar CO
measurements in Washington, D.C. and
Denver, Colorado to estimate blood COHb?
A final point, which needs to be
addressed in the breath monitoring
program, is the ability to detect volatile
organic chemicals, some of which may be
carcinogenic. What is the significance
of being able to detect 100 compounds in
breath, yet only one or two in blood
above the detectable limits? Does the
body expel the other 98 compounds that
cannot be detected in the blood? If so,
I agree with Dr. Nelson that
meteorological factors should be
incorporated into future TEAM studies,
through more careful experimental design.
The statistical issues identified under
TEAM design considerations, the
development of improved
microenvironmental monitoring designs,
errors-in-variables problem, choice
between monitoring instruments of varying
precision and cost, the development of
designs appropriate for assessing

National levels, evaluating extreme
values in exposure monitoring, and
adjusting for censored monitoring data
are all well thought out and timely. I
strongly agree with his recommendation
that when considering multiple pollutant
species, as in the case of the volatile
and semi-volatile organic chemicals, as
well as polar compounds, the possibility
of synergistic effects necessitates the
development of a theory of multivariate
extreme values.
In conclusion, Dr. Nelson's paper
provides a well thought out overview of
exposure monitoring and the associated
statistical issues. It should be an
excellent reference for people interested
in this topic. The reader should be
aware, however, of the importance of the
nation's fixed-site monitoring network in
evaluating the effectiveness of the
nation's air pollution control program.
Trends Report, 1986. U.S. Environmental
Protection Agency, Technical Support
Division, Monitoring and Reports Branch,
Research Triangle Park, NC 27711.

Designing Environmental Regulations
Søren Bisgaard and William G. Hunter*
Center for Quality and Productivity Improvement
University of Wisconsin-Madison
610 Walnut Street, Madison, Wisconsin 53705
Public debate on proposed environmental regulations
often focuses almost entirely (and naively) on the allow-
able limit for a particular pollutant, with scant attention
being paid to the statistical nature of environmental data
and to the operational definition of compliance. As a
consequence regulations may fail to accomplish their pur-
pose. A unifying framework is therefore proposed that
interrelates assessment of risk and determination of compli-
ance. A central feature is the operating characteristic
curve, which displays the discriminating power of a regula-
tion. This framework can facilitate rational discussion
among scientists, policymakers, and others concerned with
environmental regulation.
Over the past twenty years many new federal, state,
and local regulations have resulted from heightened con-
cern about the damage that we humans have done to the
environment - and might do in the future. Public debate,
unfortunately, has often focused almost exclusively on risk
assessment and the allowable limit of a pollutant.
Although this "limit part" of a regulation is important, a
regulation also includes a "statistical part" that defines
how compliance is to be determined; even though it is typi-
cally relegated to an appendix and thus may seem unimpor-
tant, it can have a profound effect on how the regulation operates.
Our purpose in this article is to introduce some new
ideas concerning the general problem of designing environ-
mental regulations, and, in particular, to consider the role
of the "statistical part" of such regulations. As a vehicle for
illustration, we use the environmental regulation of
ambient ozone. Our intent is not to provide a definitive
analysis of that particular problem. Indeed, that would
require experts familiar with the generation, dispersion,
measurements, and monitoring of ozone to analyze avail-
able data sets. Such detailed analysis would probably lead
to the adoption of somewhat different statistical assump-
tions than we use. The methodology described below,
however, can accommodate any reasonable statistical
assumptions for ambient ozone. Moreover, this methodol-
ogy can be used in the rational design of any environmental
regulation to limit exposure to any pollutant.
Ambient Ozone Standard
For illustrative purposes, then, let us consider the
ambient ozone standard (1,2). Ozone is a reactive form of
oxygen that has serious health effects. Concentrations from
about 0.15 parts per million (ppm), for example, affect
*) Deceased.
respiratory mucous membranes and other lung tissues in
sensitive individuals as well as healthy exercising persons.
In 1971, based on the best scientific studies at the time, the
Environmental Protection Agency (EPA) promulgated a
National Primary and Secondary Ambient Air Quality
Standard ruling that "an hourly average level of 0.08 parts
per million (ppm) not to be exceeded more than 1 hour
per year." Section 109(d) of the Clean Air Act calls for a
review every five years of the Primary National Ambient
Air Quality Standards. In 1977 EPA announced that it was
reviewing and updating the 1971 ozone standard. In
preparing a new criteria document, EPA provided a number
of opportunities for external review and comment. Two
drafts of the document were made available for external
review. EPA received more than 50 written responses to
the first draft and approximately 20 to the second draft.
The American Petroleum Institute (API), in particular, sub-
mitted extensive comments.
The criteria document was the subject of two meet-
ings of the Subcommittee on Scientific Criteria for Photo-
chemical Oxidants of EPA's Science Advisory Board. At
each of these meetings, which were open to the public, crit-
ical review and new information were presented for EPA's
consideration. The Agency was petitioned by the API and
29 member companies and by the City of Houston around
the time the revision was announced. Among other things,
the petition requested that EPA state the primary and
secondary standards in such a way as to permit reliable
assessment of compliance. In the Federal Register it is
noted that
EPA agrees that the present deterministic form of
the oxidant standard has several limitations and
has made reliable assessment of compliance
difficult. The revised ozone air quality standards
are stated in a statistical form that will more
accurately reflect the air quality problems in vari-
ous regions of the country and allow more reli-
able assessment of compliance with the stan-
dards. (Emphasis added)
Later, in the beginning of 1978, the EPA held a public
meeting to receive comments from interested parties on the
initial proposed revision of the standard. Here several
representatives from the State and Territorial Air Pollution
Program Administrators (STAPPA) and the Association of
Local Air Pollution Control Officials participated. After
the proposal was published in the spring of 1978, EPA held
four public meetings to receive comments on the proposed
standard revisions. In addition, 168 written comments were
received during the formal comment period. The Federal
Register summarizes the comments as follows:
The majority of comments received (132 out of
168) opposed EPA's proposed standard revision,
favoring either a more relaxed or a more

stringent standard. State air pollution control
agencies (and STAPPA) generally supported a
standard level of 0.12 ppm on the basis of their
assessment of an adequate margin of safety.
Municipal groups generally supported a standard
level of 0.12 ppm or higher, whereas most indus-
trial groups supported a standard level of 0.15
ppm or higher. Environmental groups generally
encouraged EPA to retain the 0.08 ppm standard.
As reflected in this statement, almost all of the public dis-
cussion of the ambient ozone standard (not just the 168
comments summarized here) focused on the limit part of
the regulation. In this instance, in common with similar
discussion of other environmental regulations, the statistical
part of the regulation was largely ignored.
The final rule-making made the following three changes:
(1)	The primary standard was raised to 0.12 ppm.
(2)	The secondary standard was raised to 0.12 ppm.
(3)	The definition of the point at which the standard is
attained was changed to "when the expected number
of days per calendar year with maximum hourly
average concentration above 0.12 ppm is equal to or
less than one."
The Operating Characteristic Curve
Environmental regulations have a structure similar to
that of statistical hypothesis tests. A regulation states how
data are to be used to decide whether a particular site is in
compliance with a specified standard, and a hypothesis test
states how a particular set of data are to be used to decide
whether they are in reasonable agreement with a specified
hypothesis. Borrowing the terminology and methodology
from hypothesis testing, we can say there are two types of
errors that can be made because of the stochastic nature of
environmental data: a site that is really in compliance can
be declared out of compliance (type I error) and vice versa
(type II error). Ideally the probability of committing both
types of error should be zero. In practice, however, it is not
feasible to obtain this ideal.
In the context of environmental regulations, an operat-
ing characteristic curve is the probability of declaring a site
to be in compliance (d.i.c.) plotted as a function of some
parameter θ, such as the mean level of a pollutant. This
probability, Prob{d.i.c. | θ}, can be used to determine the probabilities
of committing type I and type II errors. As long as θ is
below the stated standard, the probability of a type I error
is 1 − Prob{d.i.c. | θ}. When θ is above the stated
standard, Prob{d.i.c. | θ} is the probability of a type II
error. Using the operating characteristic curves for the old
and the new regulations for ambient ozone, we can evalu-
ate them to see what was accomplished by the revision.
The old standard stated that "an hourly average level
of 0.08 ppm [was] not to be exceeded more than 1 hour per
year." This standard was therefore defined operationally in
terms of the observations themselves. The new standard, on
the other hand, states that the expected number of days per
calendar year with a maximum hourly average concentra-
tion above 0.12 ppm should be less than one. Compliance,
however, must be determined in terms of the actual data,
not an unobserved expected number. How should this
conversion be made? In Appendix D of the new ozone
regulation, it is stated that:
In general, the average number of exceedances
per calendar year must be less than or equal to 1.
In its simplest form, the number of exceedances
at a monitoring site would be recorded for each
calendar year and then averaged over the past 3
calendar years to determine if this average is less
than or equal to 1.
Based on the stated requirements of compliance, we have
computed the operating characteristic functions for the old
and the new ozone regulations. They are plotted in Figures
1 and 2. (The last sentence in the legend for Figure 1 will
be discussed below in the section entitled "Statistical
Concepts.") To construct these curves, certain simplifying
assumptions were made, which are discussed in the section
entitled "Statistical Concepts." Before such curves are
used in practice, these assumptions need to be investigated
and probably modified.
According to the main part of the new ozone regula-
tion, the interval from 0 to 1 expected number of
exceedances of 0.12 ppm per year can be regarded as
defining "being in compliance." Suppose the decision
rule outlined above is used for a site that is operating at a
level such that the expected number of days exceeding 0.12
ppm is just below one. In that case, as was noted by Javitz
(3), with the new ozone regulation, there is a probability of
approximately 37% in any given year that such a site will
be declared out of compliance. Moreover, there is approxi-
mately a 10% chance of not detecting a violation of 2
expected days per year above the 0.12 ppm limit; that is,
the standard operates such that the probability is 10% of
not detecting a violation when the actual value is twice its
permissible value (2 instead of 1). Some individuals may
find these probabilities (37% and 10%) to be surprisingly
and unacceptably high, as we do. Others, however, may
regard them as being reasonable or too low. In this paper,
our point is not to pursue that particular debate. Rather, it
is simply to argue that, before environmental regulations
are put in place, different segments of society need to be
aware of such operating characteristics, so that informed
policy decisions can be made. It is important to realize that
the relevant operating characteristic curves can be con-
structed before a regulation is promulgated.
Statistical Concepts
Let X denote a measurement from an instrument such
that X = θ + e, where θ is the mean value of the pollutant
and e is the statistical error term with variance σ². The
term e contains not only the error arising from an imperfect
instrument but also the fluctuations in the level of the pol-
lutant itself. We assume that the measurement process is
well calibrated and that the mean value of e is zero. The
parameters θ and σ² are unknown,
but estimates of them can be obtained from data. A
prescription of how the data are to be collected is known as
the sampling plan. It addresses the questions of how many,
where, when, and how observations are to be collected.
Any function f(X) = f(X1, X2, ..., Xn) of the observa-
tions is an estimator, for example, the average of a set of
values or the number of observations in a sample above a
certain limit. The value of the function f for a given sample
is an estimate. The estimator has a distribution, which
can be determined from the distribution of the observations
and the functional form of the estimator. With the distribu-
tion of the estimator, one can answer questions of the form:
what is the probability that the estimate f(X) is smaller
than or equal to some critical value c? Symbolically this
probability can be written as P = Prob{f(X) ≤ c | θ}.
If we want to have a regulation limiting the pollution
to a certain level, it is not enough to state the limit as a par-
ticular value of a parameter. We must define compliance
operationally in terms of the observations. The condition of
compliance therefore takes the form of an estimator
f(X1, ..., Xn) being less than or equal to some critical
value c, that is, {f(X1, ..., Xn) ≤ c}. Regarded as a func-
tion of θ, the probability Prob{f(X1, ..., Xn) ≤ c | θ} is
therefore the probability that the site will be declared to be
in compliance with the regulation. It is, in fact, the
operating characteristic function.
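For any concrete estimator and distributional assumption, points on this operating characteristic function can be estimated by simulation. The following sketch uses the exceedance-count estimator described above; the numerical values of n, L, σ, and c are illustrative assumptions, not values from the regulation:

```python
# Monte Carlo estimate of Prob{ f(X1,...,Xn) <= c | theta }, where the
# estimator f counts observations above a limit L and compliance is
# declared when that count is at most c. X_i ~ N(theta, sigma^2) is an
# illustrative assumption.
import random

def oc_curve_point(theta, n=100, L=0.12, sigma=0.02, c=1, reps=2000, seed=1):
    rng = random.Random(seed)
    declared_in_compliance = 0
    for _ in range(reps):
        exceedances = sum(1 for _ in range(n) if rng.gauss(theta, sigma) > L)
        if exceedances <= c:
            declared_in_compliance += 1
    return declared_in_compliance / reps

# The curve falls from near 1 to near 0 as the true mean level theta rises:
print(oc_curve_point(0.06), oc_curve_point(0.16))
```

Tracing this function over a grid of θ values reproduces the qualitative shape of the curves in Figures 1 and 2.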
The operating characteristic function, and conse-
quently the probabilities of type I and type II errors, are fixed
by appropriate choice of the critical value and sampling
plan. It is common statistical practice to specify a max-
imum type I error probability α and then to find a critical
value c such that Prob{f(X) ≤ c | θ} meets that requirement.
For the old standard, let I(X_i) be an indicator variable that
equals one if the hourly measurement X_i exceeds the limit
L = 0.08 ppm and zero otherwise. A year consists of
approximately n = 365 × 12 = 4380 hours of observations
(data are only taken from 9:01 am to 9:00 pm LST). The
expected number of hours per year above the limit is then

θ = E{ Σ_{i=1}^{n} I(X_i) } = p_L × 4380,

where p_L = Prob{I(X) = 1}. The probability that a site is
declared to be in compliance (d.i.c.) is

P_old(θ) = Prob{d.i.c. | θ} = Prob{ Σ_{i=1}^{n} I(X_i) ≤ 1 | θ }   (1)

This probability P_old(θ), plotted as a function of θ, is the
operating characteristic curve for the old regulation (Figure
1). Note that if the old standard had been written in terms
of an allowable limit of one for the expected number of
exceedances above 0.08 ppm, the maximum type I error
would be 1.00 - 0.73 = 0.27. The old standard, however, is
actually written in terms of the observed number of
exceedances so type I and type II errors, strictly speaking,
are undefined.
The condition of compliance stated in the new regula-
tion is that the "expected number of days per calendar year
with daily maximum ozone concentration exceeding 0.12
ppm must be less than or equal to 1." Let Y_j represent the
daily maximum hourly average (j = 1, ..., 365). Suppose
the random variables Y_j are independently and identically
distributed. EPA proposed that the expected number of
days (a parameter) be estimated by a three-year moving
average of exceedances of 0.12 ppm. A site is in compli-
ance when the moving average is less than or equal to 1.
The expected number of days above the limit of L = 0.12
ppm is then

θ = E{ Σ_{j=1}^{365} I(Y_j) } = 365 × p_L.
The three-year specification of the new standard
makes it hard to compare with the previous one-year stan-
dard. If, however, one computes the conditional probability
that the number of exceedances in the present year is less
than or equal to 0, 1, 2, and 3 and multiplies that by the
probability that the number of exceedances was 3, 2, 1, and 0,
respectively, for the previous two years, one then obtains a
one-year operating characteristic function:

P_new(θ) = Prob{d.i.c. | θ} = Σ_{k=0}^{3} Prob{d.i.c. | k, θ} P(k)

P(k) = Prob{ Σ_{j=1}^{730} I(Y_j) = k } = C(730, k) p_L^k (1 − p_L)^{730−k}   (2)

Prob{d.i.c. | k, θ} = Σ_{j=0}^{3−k} C(365, j) p_L^j (1 − p_L)^{365−j}   (3)

where k = 0, 1, 2, 3. A plot of the operating characteristic
function for the new regulation, P_new versus θ, is presented
in Figure 2.
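Under the i.i.d. assumptions above, both operating characteristic functions can be evaluated directly from binomial probabilities. A sketch follows; because the exact figures depend on the simplifying assumptions used, the type I errors it produces come out near, though not necessarily identical to, the 26% and 37% values discussed in the text:

```python
# Operating characteristics for the old standard (at most 1 of n = 4380
# hourly values above 0.08 ppm) and the new standard (per equations (2)
# and (3): three-year total of daily exceedances of 0.12 ppm at most 3),
# each as a function of theta, the expected number of exceedances per year.
from math import comb

def binom_cdf(k, n, p):
    """P{ Binomial(n, p) <= k }."""
    return sum(comb(n, j) * p**j * (1 - p)**(n - j) for j in range(k + 1))

def p_old(theta, n=4380):
    return binom_cdf(1, n, theta / n)

def p_new(theta):
    pL = theta / 365
    return sum(
        comb(730, k) * pL**k * (1 - pL)**(730 - k)  # k exceedances, prior 2 years
        * binom_cdf(3 - k, 365, pL)                 # at most 3 - k this year
        for k in range(4)
    )

# Type I error at theta = 1 (just in compliance), under these assumptions:
print(1 - p_old(1.0), 1 - p_new(1.0))  # roughly 0.26 and 0.35
```

Evaluating `p_old` and `p_new` over a grid of θ values traces out the curves of Figures 1 and 2.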
Figures 1 and 2 show the operating characteristic
curves computed as a function of (1) the expected number
of hours per year above 0.08 ppm for the old ambient
ozone regulation and (2) the expected number of days
per year with a maximum hourly observation above 0.12
ppm for the new ambient ozone regulation. We observe
that the 95 % de facto limit (the parameter value for which
the site in a given year will be declared to be in compliance

with 95 % probability) is 0.36 hours per year exceeding
0.08 ppm for the old standard and 0.46 days per year
exceeding 0.12 ppm for the new standard. If the expected
number of hours of exceedances of 0.08 ppm is one (and
therefore in compliance), the probability is approximately
26% of declaring a site to be not in compliance with the old
standard. If the expected number of days exceeding 0.12
ppm is one (and therefore in compliance), the probability is
approximately 37% of declaring a site to be not in compli-
ance with the new standard. (We are unaware of any other
legal context in which type I errors of this magnitude
would be considered reasonable.)
Neither curve provides sharp discrimination between
"good" and "bad" values of 6. Note that the old standard
did not specify any parameter value above which non-
compliance was defined. The new standard, however,
specifies that one expected day is the limit, thereby creating
an inconsistency between what the regulation says and how
it operates because of the large discrepancy between the
stated limit and the operational limit.
The construction of Figures 1 and 2 only requires the
assumption that the relevant observations are approxi-
mately identically and independently distributed (for the
old standard, the relevant observations are those for the
hourly ambient ozone measurements; for the new standard,
they are the maximum hourly average measurements of the
ambient ozone measurements each day). The construction
does not require knowledge of the distribution of ambient
ozone observations. If one has an estimate of this distribu-
tional form, however, a direct comparison of the new and
old regulation is possible in terms of the concentration of
ambient ozone (in units, say, of ppm). To illustrate this
point, suppose the random variable X_i is independently
and identically distributed according to a normal distribu-
tion with mean μ and variance σ², that is, X_i ~ N(μ, σ²).
Then the probability of one observation being above the
limit L = 0.08 is

p_L = Prob{I(X) = 1} = 1 − Φ((L − μ)/σ)   (4)

where Φ(·) is the cumulative distribution function of the stan-
dard normal distribution. The probability that a site is
declared to be in compliance can be computed as a function
of μ by substituting p_L from (4) into (1).
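Equation (4) and its substitution into (1) are straightforward to carry out numerically. A sketch using the standard-library normal distribution; σ = 0.02 ppm follows the value used for Figure 3, and the μ values tried are illustrative:

```python
# p_L = 1 - Phi((L - mu)/sigma), as in equation (4), substituted into the
# old-standard compliance probability (1): at most one of n = 4380 hourly
# observations above L = 0.08 ppm.
from statistics import NormalDist

def p_exceed(mu, L=0.08, sigma=0.02):
    """Probability one hourly observation exceeds L when X ~ N(mu, sigma^2)."""
    return 1 - NormalDist().cdf((L - mu) / sigma)

def p_old_of_mu(mu, n=4380):
    """Prob{ at most 1 of n observations exceeds the limit }."""
    p = p_exceed(mu)
    return (1 - p) ** n + n * p * (1 - p) ** (n - 1)

# Compliance probability drops steeply as the mean ozone level rises:
for mu in (0.00, 0.02, 0.04):
    print(mu, p_old_of_mu(mu))
```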
For the new regulation let X_ij represent the one-hour
average (i = 1, ..., 12; j = 1, ..., 365), and
Y_j = max{X_1j, ..., X_12j}. Suppose X_ij ~ N(μ, σ²); then Y_j ~ H(y), where

H(y) = [Φ((y − μ)/σ)]^12.

By substituting p_L in (2) and (3) with

p_L = Prob{Y > 0.12} = 1 − [Φ((0.12 − μ)/σ)]^12

it will be detected with the t-based procedure. In this
respect, the t-based procedure provides more protection to
the public.
We do not conclude that procedures based on the t-
test are best. We merely point out that there are alterna-
tives to the procedures used in the old and new ozone stan-
dard. A basic principle is that information is lost when data
are collected on a continuous scale and then reduced to a
binary form. One of the advantages of procedures based on
the t-test is that they do not waste information in this way.
The most important point to be made goes beyond the
regulation of ambient ozone; it applies to regulation of all
pollutants where there is a desire to limit exposure. With
the aid of operating characteristic curves, informed judge-
ments can be made when an environmental regulation is
being developed. In particular, operating characteristic
curves for alternative forms of a regulation can be con-
structed and compared before a final one is selected. Also,
the robustness of a regulation to changes in assumptions,
such as normality and statistical independence of observa-
tions, can be investigated prior to the promulgation. Note
that environmental lawmaking, as it concerns the design of
environmental regulations, is similar to design of scientific
experiments. In both contexts, data should be collected in
such a way that clear answers will emerge to questions of
interest, and careful forethought can ensure that this desired
result is achieved.
Scientific Framework
The operating characteristic curve is only one com-
ponent in a more comprehensive scientific framework that
we would like to promote for the design of environmental
regulations. The key elements in this process are:
(a)	Dose/risk curve
(b)	Risk/benefit analysis
(c)	Decision on maximum acceptable risk
(d)	Stochastic nature of the pollution process
(e)	Calibration of measuring instruments
(f)	Sampling plan
(g)	Decision function
(h)	Distribution theory
(i)	Operating characteristic function
Currently there may be some instances in which all of these
elements are considered in some form when environmental
regulations are designed. Because the particular purposes
and techniques are not explicitly isolated and defined, how-
ever, the resulting regulations are neither as clear nor as
effective as they might otherwise be.
Often the first steps towards establishing an environ-
mental regulation are (a) to estimate the relationship
between the "dose" of a pollutant and some measure of
health risk associated with it and (b) to carry out a formal
or informal risk/benefit analysis. The problems associated
with estimating dose/risk relationships and doing
risk/benefit analyses are numerous and complex, and uncer-
tainties can never be completely eliminated. As a next step
a political decision is made - based on this uncertain
scientific and economic groundwork - as to the maximum
risk that is acceptable to society (c). As indicated in Figure
5, the maximum acceptable risk implies, through the
dose/risk curve, the maximum allowable dose. The first
three elements have received considerable attention when
environmental regulations have been formulated, but the
last six elements have not received the attention they deserve.
The maximum allowable dose defines the compliance
set Θ0 and the noncompliance set Θ1, which is its comple-
ment. The pollution process can be considered (d) as a sto-
chastic process or statistical time-series φ(θ; t). Fluctua-
tions in the measurements X can usefully be thought of as
arising from three sources: variation in the pollution level
itself, φ; the bias in the readings, b; and the measurement
error, e. Thus X = φ + b + e. Often it is assumed that φ = θ,
a fixed constant, and that variation arises only from the
measurement error e; however, all three components

characteristic curves will sometimes change with different
geographical areas to a significant degree. Although this is
an awkward fact when a legislative, administrative, or
other body is trying to enact regulations at an international,
national, or other level, it is better to face the problem as
honestly as possible and deal with it rather than pretending
that it does not exist.
Operating Characteristic Curve as a Goal, Not a Consequence
We suggest that operating characteristic curves be
published whenever an environmental regulation is
promulgated that involves a pollutant the level of which is
to be controlled. When a regulation is being developed,
operating characteristic curves for various alternative forms
of the regulation should be examined. An operating
characteristic curve with specified desirable properties
should be viewed as a goal, not as something to compute
after a regulation has been promulgated. (Nevertheless, we
note in passing that it would be informative to compute
operating characteristic curves for existing environmental regulations.)
In summary, the following procedure might be feasi-
ble. First, based on scientific and economic studies of risks
and benefits associated with exposure to a particular pollu-
tant, a political decision would be reached concerning the
compliance set in the form of an interval of the type
0 ≤ θ ≤ θ0 for a parameter of the distribution of the pollu-
tion process. Second, criteria for desirable sampling plans,
estimators, and operating characteristic curves would be
established. Third, attempts would be made to create a
sampling plan and estimators that would meet these cri-
teria. The costs associated with different sampling plans
would be estimated. One possibility is that the desired pro-
perties of the operating characteristic curve might not be
achievable at a reasonable cost. Some iteration and even-
tual compromise may be required among the stated criteria.
Finally, the promulgated regulation would be expressed in
terms of an operational definition that involves measured
quantities, not parameters.
Injecting parameters into regulations, as was done in
the new ozone standard, leads to unnecessary questions of
interpretation and complications in enforcement. In fact,
inconsistencies (such as that implied by
ties of violations not being detected (type II errors); indus-
tries would know the probabilities of being accused
incorrectly of violating standards (type I errors); and all
parties would know the costs associated with various pro-
posed environmental control schemes. We believe that the
operating characteristic curve is a simple, yet comprehen-
sive device for presenting and comparing different alterna-
tive regulations because it brings into the open many
relevant and sometimes subtle points. For many people it
is unsettling to realize that type I and type II errors will be
made, but it is unrealistic to develop regulations pretending
that such errors do not occur. In fact, one of the central
issues that should be faced in formulating effective and fair
regulations is the estimation and balancing of the probabili-
ties of such occurrences.
Acknowledgments
This research was supported by grants SES - 8018418
and DMS - 8420968 from the National Science Founda-
tion. Computing was facilitated by access to the research
computer at the Department of Statistics, University of
Wisconsin, Madison.
The t-statistic procedure is based on the estimator
f(x̄) = (L − x̄)/s, where L is the limit (0.12 ppm), x̄ the sam-
ple average, and s the sample standard deviation. The deci-
sion function is

d(f(x̄)) = { f(x̄) > c : in compliance
          { f(x̄) ≤ c : not in compliance

The critical value c is found from the requirement that

Prob{ (L − x̄)/s > c | θ = θ0 } = 1 − α   (A1)

where z0 = Φ⁻¹(1 − θ0) and θ0 is the fraction above the
limit we at most want to accept (here 1/365).
The exact operating characteristic function is found
by reference to a non-central t-distribution, but for all prac-
tical purposes the following approximation is sufficient:

P(θ) ≈ Φ( √n (Φ⁻¹(1 − θ) − c) )   (A2)

with critical value

c = Φ⁻¹(1 − θ0) − Φ⁻¹(1 − α)/√n   (A3)

The operating characteristic function in Figure 4 is con-
structed using α = 0.05, θ0 = 1/365 and n = 3 × 365. Substitut-
ing (A3) into (A2) yields

P(θ) ≈ Φ( √n (Φ⁻¹(1 − θ) − Φ⁻¹(1 − θ0)) + Φ⁻¹(1 − α) )   (A4)
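The approximation above can be evaluated with standard-library normal functions. A sketch of the t-based operating characteristic, with defaults following the Figure 4 settings (α = 0.05, θ0 = 1/365, n = 3 × 365); the function name is ours:

```python
# t-based operating characteristic approximation: oc_t(theta) is the
# probability that a site whose true exceedance fraction is theta is
# declared in compliance. The critical value c is chosen so that a site
# sitting exactly at theta0 passes with probability 1 - alpha.
from math import sqrt
from statistics import NormalDist

Phi = NormalDist().cdf
Phi_inv = NormalDist().inv_cdf

def oc_t(theta, theta0=1/365, alpha=0.05, n=3 * 365):
    c = Phi_inv(1 - theta0) - Phi_inv(1 - alpha) / sqrt(n)  # critical value
    return Phi(sqrt(n) * (Phi_inv(1 - theta) - c))          # approximation

print(oc_t(1 / 365))  # 0.95 at the boundary, by construction
print(oc_t(2 / 365))  # falls off sharply above the limit
```

The sharp drop between θ0 and 2θ0 illustrates why the continuous t-based procedure discriminates better than the exceedance-count rules.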

Figure 1. Operating characteristic curve for the 1971 ambient ozone standard (old
standard), as a function of the expected number of hours of exceedances of 0.08 ppm
per year. Note that if the old standard had been written in terms of an allowable limit
of one for the expected number of exceedances above 0.08 ppm, the maximum type I
error would be 1.00 - 0.73 = 0.27.
Figure 2. Operating characteristic curve for the 1979 ambient ozone standard (new
standard), as a function of the expected number of days of exceedances of 0.12 ppm
per year. Note that the maximum type I error is 1.00 - 0.63 = 0.37.
Figure 3. Operating characteristic curves for the old and the new standards as a func-
tion of the mean value of ozone measured in parts per million when it is assumed that
ozone measurements are normally and independently distributed with a = 0.02 ppm.
Figure 4. Operating characteristic curves for the new ozone standard and a t-statistic
alternative as a function of the expected number of exceedances per year.
Figure 5. Elements of the environmental standard-setting process: Laboratory experi-
ments and/or epidemiological studies are used to assess the dose/risk relationship. A
maximum acceptable risk is determined through a political process balancing risk and
economic factors. The maximum acceptable risk implies a limit for the "dose" which
again implies a limit for the pollution process as a function of time. Compliance with
the standard is operationally determined based on a discrete sample x taken from a
particular site. The decision about whether a site is in compliance is reached through
use of a statistic f and a decision function d. Knowing the statistical nature of the pol-
lution process, the sampling plan, and the functional form of the statistics and the
decision function, one can compute the operating characteristic function. Projecting
the operating characteristic function back on the dose/risk relationship, one can assess
the probability of encountering various levels of undetected violation of the standard.
H. Barnes
I appreciate the general points
that Dr. Bisgaard has made regarding
the development of environmental
standards. I agree that generally,
when standards are developed, most of
the technical emphasis is placed on
developing the magnitude of the absolute
number, which Dr. Bisgaard calls the
"limit part" of the standard. In
contrast, frequently little work is
expended developing the sampling program
and the rules that are used to evaluate
compliance with the limit in applica-
tion, which he calls the "statistical
part" of the standard. At EPA some
programs do a thorough and thoughtful
job of designing environmental stan-
dards. However, other EPA programs
could benefit from Dr. Bisgaard's work
because they have focused strictly on
the magnitude of the standard and have
not considered the "statistical part" of
the standard.
However, I insist that the ozone
standard and all of the National Ambient
Air Quality Standards fall into the
category of standards where both the
"limit part" and the "statistical part"
of the standard have been designed based
on extensive performance evaluations and
practical considerations.
There are other EPA programs that
have also done an excellent job of
designing and evaluating the "limit
part" and the "statistical part" of
their standards. For example, under
the Toxic Substances Control Act (TSCA)
regulations, there are procedures for
managing PCB-containing wastes. In
particular, PCB soil contamination must
be cleaned up to 50 ppm. Guidances have
been prepared that stipulate a detailed
sampling and evaluation program and
effectively describe the procedure for
verifying when the 50 ppm limit has been
achieved. Also under the TSCA mandate,
clearance tests are under development
for verifying that, after the removal
of asbestos from a building, levels are
not different from background levels.
There are, however, many programs
at EPA that have not performed the
analysis and inquiry necessary to
design the "statistical part" of their
standards. One example is the Maximum
Contaminant Levels (MCLs) which are
developed and used by EPA'S drinking
water program. MCLs are concentration
limits established for controlling
pollutants in drinking water supplies.
Extensive health effect, engineering,
and economic analysis is used to choose
the MCL concentration value. However,
relatively little work is done to ensure
that, when compliance with the MCL is
evaluated, appropriate sampling and
analysis methodologies are used to
ensure a designed level of statistical confidence.
Similarly, risk-based cleanup
standards are used in EPA's Superfund
program as targets for how much aban-
doned hazardous waste sites should be
cleaned up. These are concentration
levels either borrowed from another pro-
gram (e.g., an MCL) or developed based
on site-specific circumstances. A great
deal of effort has been expended on
discussions of how protective the actual
risk related cleanup standards should
be; however, virtually no effort has
been focused on the methodology that
will be used to evaluate attainment of
these standards. Drinking water MCLs
and Superfund cleanup standards could
benefit from the approaches offered by
Dr. Bisgaard.
Dr. Bisgaard clearly points out
that his use of the ozone standard is
only for the purpose of example and
that the message of his presentation
applies to the development of any
standard. I have responded by trying
to identify other EPA program areas
that could benefit from the perspective
offered by Dr. Bisgaard's approach.
However, it is important to realize that
the development of the "statistical
part" of an environmental standard must
consider the nature of the political
situation, pollutant behavior, sampling
constraints, and the objective of the
standard. Ignorance of these practical
considerations can limit the usefulness
of a proposed standard regardless of the
theoretical basis. The developers of
the ozone standard were quite aware of
these contingencies, and this is reflected
in the form of the "statistical part" of
the ozone standard.
Central Tendency Versus Extremes
I must agree that a standard based
on central tendency statistics will be
more robust with better operating
characteristics than a standard based on
peak statistics. The difficulty is that
EPA is not concerned with estimating or
controlling the mean ozone concentra-
tion. Ozone is a pollutant with acute
health effects and, as such, EPA's
interest lies in control of the extremes
of the population. Peak statistics were

the primary concern when the ozone
standard was developed.
EPA, in the development of NAAQS's,
has tried to balance statistical per-
formance with objectives by examining
the use of other statistics that are
more robust and yet retain control of
the extremes. For example, EPA has
suggested basing the standard on the
fourth or fifth largest value; however,
commenters maintained that EPA would
lose control of the extremes and cause
undue harm to human health. It has also
been suggested that the peak to mean
ratio (P/M) be considered. The problem
with this approach is that the P/M is
highly variable across the United States
because of variation in the "ozone
season." The objective of developing a
nationally applicable regulatory frame-
work would be quite difficult if each
locale were subject to a different standard.
Decision Errors and Power
In addition, regardless of the
standard that is chosen, decision
errors will be highest when the true
situation at a monitoring station is at
or close to the standard. As the true
situation becomes well above or below
the standard, certainty increases and
our decisions become less subject to
error. Of course, it would be most
desirable to have an operating charac-
teristic function with a large distinct
step at the standard. This operating
characteristic would have no error even
when the true situation is slightly
above or below the standard; however,
this is virtually impossible. There-
fore, when standards are compared for
their efficacy, it is important to
compare performance along the continuum
when the true situation is well above,
at, and well below the standard. One
should not restrict performance evaluation
to the area at or immediately adjacent to
the standard, since for most statistics
the performance will be quite low in this
region.
Dr. Bisgaard points out from his
Figure 2 that when a site is in compli-
ance and at the standard, expecting to
exceed the standard on one day, there
is a 37% chance that the site may be
indicated as exceeding the standard.
However, it can also be shown that when
a site is below the standard and
expects to exceed the standard on one-
half of a day, there is only about a 6%
chance that the site may be indicated
as exceeding the standard. Conversely,
it can be pointed out that when the site
is above the standard and expects to
exceed the standard on three days, there
is only a 3% chance that the site will
be found to be in compliance.
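The probabilities cited above can be roughly reproduced with a simple Poisson model of annual exceedance counts. The sketch below assumes (my assumption, not stated in the discussion) that a site is flagged when more than three exceedance days are observed in three years, matching a one-expected-exceedance-per-year standard; the resulting probabilities (roughly 35%, 7%, and 2%) are close to, though not identical with, the figures quoted from Dr. Bisgaard's Figure 2.

```python
from math import exp, factorial

def poisson_cdf(k: int, mean: float) -> float:
    """P(X <= k) for a Poisson random variable with the given mean."""
    return sum(mean**n * exp(-mean) / factorial(n) for n in range(k + 1))

def prob_indicated_exceeding(expected_per_year: float, years: int = 3) -> float:
    """Probability the site is flagged: more than `years` exceedance days
    observed in `years` years, given the expected annual count."""
    mean = expected_per_year * years
    return 1.0 - poisson_cdf(years, mean)

# Expecting 1 exceedance/year (exactly at the standard): ~35% flagged.
# Expecting 0.5/year (below the standard): ~7% flagged.
# Expecting 3/year (above the standard): ~98% flagged, i.e., ~2% pass.
for rate in (1.0, 0.5, 3.0):
    print(f"{rate:>4} expected/yr -> P(flagged) = {prob_indicated_exceeding(rate):.3f}")
```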
Dr. Bisgaard is quite correct in
pointing out that the operating charac-
teristics of a standard based on the
mean are better than a standard based
on the largest order statistic. How-
ever, as mentioned above, a standard
based on the mean does not satisfy the
objectives of the ozone standard. EPA
staff have tendered proposals to
improve the operating characteristics
of the standard. One of these involved
the development of a three-tiered
approach that would allow a site to be
judged: in attainment, not in attain-
ment, or too close to call. The
existing structure of the attainment
program was not flexible enough to
permit this approach.
Pollutant Behavior
Ozone is a pollutant which exists
in the environment at a high mean ambi-
ent level of approximately one-third the
existing standard. Effort expended
trying to drive down peak statistics
indirectly by controlling the mean would
be futile. This is because mean levels
can only be reduced to the background
mean which, relative to the standard, is
high even in the absence of man-made air
pollution.
Another point to consider is that
ozone behavior is influenced by both
annual and seasonal meteorological
effects. This is the reason that the
newest standard is based on three years
of data. The effect of an extreme year
is reduced by the averaging process
associated with a three year standard.
As mentioned above, work has also
focused on controlling the peak to mean
ratios; however, because ozone seasons
vary radically across the country, this
sort of measure would be difficult to
implement.
Dr. Bisgaard has also questioned
the new standard because of the use of
the term "expected." This terminology
was probably included in the wording
because of the many legal and policy
edits that are performed on a draft
regulation. It was not intended that
the term "expected" be applied in the
technical statistical use of the term.
The term was intended to show that EPA
had considered and reflected annual
differences in ozone conditions in the
three year form of the standard.
Dr. Bisgaard brings an interesting
and useful perspective to the develop-
ment of environmental standards. The
important idea is that an environmental
standard is more than a numerical limit
and must include a discussion of the
associated sampling approach and

decision function. I tried to extend
this central idea by adding two primary
points. First, there are several pro-
grams within EPA that can benefit from
Dr. Bisgaard's perspective; however, the
NAAQS program is fully aware of and has
considered these sampling and decision
issues in exhaustive detail. Second,
the practical issues that influence the
implementation of an environmental
standard are a primary constraint and
must be understood in order to develop a
standard that offers a useful measure of
environmental protection.
Bertram Price
Price Associates, Inc.
prepared under
EPA Contract No. 68-02-4139
Research Triangle Institute
The Quality Assurance Management Staff
Office of Research and Development
U. S. Environmental Protection Agency
Washington, D.C. 20460
Testing compliance with a regulatory standard intended to
control chemical or biological contamination is inherently a
statistical decision problem. Measurements used in compliance
tests exhibit statistical variation resulting from random
factors that affect sampling and laboratory analysis. Since a
variety of laboratories with potentially different performance
characteristics produce data used in compliance tests, a
regulatory agency must be concerned about uniformity in
compliance decisions. Compliance monitoring programs must be
designed to avoid, for example, situations where a sample
analyzed by one qualified laboratory leads to a noncompliance
decision, but there is reasonable likelihood that if the same
sample were analyzed by another qualified laboratory, the
decision would be reversed.
Two general approaches to designing compliance tests are
discussed. Both approaches have, as an objective, controlling
statistical decision error rates associated with the compliance
test. One approach, the approach typically employed, depends
on interlaboratory quality control (QC) data. The alternative,
referred to as the intralaboratory approach, is based on a
protocol which leads to unique QC data requirements in each
laboratory. An overview of the statistical issues affecting
the development and implementation of the two approaches is
presented and the approaches are compared from a regulatory
management perspective.
Testing compliance with a regulatory standard intended to
control chemical or biological contamination is inherently a

statistical decision problem. Measurements used in compliance
tests exhibit statistical variation resulting from random factors
affecting sampling and laboratory analysis. Compliance decision
errors may be identified with Type I and Type II statistical
errors (i.e., false positive and false negative compliance test
results, respectively). A regulating agency can exercise control
over the compliance testing process by establishing statistical
decision error rate objectives (i.e., error rates not to be
exceeded). From a statistical design perspective, these error
rate objectives are used to determine the number and types of
measurements required in the compliance test.
Bias and variability in measurement data are critical
factors in determining if a proposed compliance test satisfies
error rate objectives. Various quality control (QC) data
collection activities lead to estimates of bias and variability.
An interlaboratory study is the standard approach to obtaining
these estimates. (The U.S. Environmental Protection Agency
[USEPA] has employed the interlaboratory study approach
extensively to establish bias and variability criteria for test
procedures required for filing applications for National
Pollutant Discharge Elimination System [NPDES] permits - 40 CFR
Part 136, Guidelines Establishing Test Procedures for the
Analysis of Pollutants Under the Clean Water Act.) An
alternative means of estimating bias and variability that does
not require an interlaboratory study is referred to in this
report as the intralaboratory approach. The intralaboratory
approach relies on data similar to those generated in standard
laboratory QC activities to extract the information on bias and
variability needed for controlling compliance test error rates.
The purpose of this report is to describe and compare the
interlaboratory and intralaboratory approaches to collecting QC
data needed for bias and variability estimates which are used in
compliance tests. Toward that end, two statistical models, which

reflect two different attitudes toward compliance test
development, are introduced. Model 1, which treats differences
among laboratories as random effects, is appropriate when the
laboratory producing the measurements in a particular situation
is not uniquely identified, but is viewed as a randomly selected
choice from among all qualified laboratories. If Model 1 is
used, an interlaboratory study is necessary to estimate "between
laboratory" variance which is an essential component of the
compliance test. Model 2 treats laboratory differences as fixed
effects (i.e., not random, but systematic and identified with
specific laboratories). If Model 2 is used, bias adjustments and
estimates of variability required for compliance tests are
prepared in each laboratory from QC data collected in the
laboratory. Model 2 does not require estimates of bias and
variability from interlaboratory data.
The remainder of this report consists of five sections.
First, in Section 2, statistical models selected to represent the
data used in compliance tests are described. In Section 3, a
statistical test used in compliance decisions is developed. The
comparison of interlaboratory and intralaboratory approaches is
developed in two steps. Section 4 is included primarily for
purposes of exposition. The types and numbers of measurements
needed for a compliance test are derived assuming that the
critical variance components - i.e., within and between
laboratories - have known values. This section provides the
structure for comparing the interlaboratory and intralaboratory
approaches in the realistic situation where the variance
components must be estimated. The comparison is developed in
Section 5. A summary and conclusions are presented in Section 6.
Compliance tests are often complex rules defined as
combinations of measurements that exceed a quantitative standard.
However, a simple rule - an average of measurements compared to

the standard - is the basis for most tests. This rule provides
the necessary structure for developing and evaluating the
interlaboratory and intralaboratory approaches. Throughout the
subsequent discussion, the compliance standard is denoted by C0
and interpreted as a concentration - e.g., micrograms per liter.
Samples of the target medium are obtained, analyzed by chemical
or other appropriate methods and summarized as an average for use
in the test. The statistical design issues are:
o total number of measurements required;
o number and type of samples required; and
o number of replicate analyses per sample required.
The design issues are resolved by imposing requirements on the
compliance test error rates (i.e., the Type I and Type II
statistical error rates).
Many sources of variation potentially affect the data used
in a compliance test. The list includes variation due to sample
selection, laboratory, day and time of analysis, analytical
instrument, analyst, and measurement error. To simplify the
ensuing discussion, the sources have been limited to sample
selection, laboratory, and measurement error. (Measurement error
means analytical replication error or single analyst
variability.) This simplification, limiting the number of
variance components considered, does not limit the generality of
subsequent results.
The distribution of the compliance data is assumed to have
both mean and variance proportional to the true concentration.
(This characterization has been used since many types of
environmental measurements reflect these properties.) The data,
after transformation to logarithms, base e, may be described as:

EQ 1	Yi,j,k = μ + Bi + Si,j + ei,j,k
where i = 1(1)I refers to laboratory, j = 1(1)J refers to sample
and k = 1(1)K refers to analytical replication. Two different
interpretations, referred to as Model 1 and Model 2, are considered
for the factors on the right side of equation 1.
In Model 1:
μ	- ln(C), where C is the true concentration;
Bi	- the logarithm of recovery (i.e., the
	proportion of the true concentration
	recovered by the analytical method), which is
	a laboratory-specific effect treated as
	random with mean zero and variance σ²B;
Si,j	- a sample effect which is random with mean
	zero and variance σ²S; and
ei,j,k	- replication error which is random with mean
	zero and variance σ²e.
It follows that:
E[Yi,j,k] = μ
Var[Yi,j,k] = σ²B + σ²S + σ²e
and, denoting by Ȳi an average over samples and replicates,
EQ 2	Var[Ȳi] = σ²B + σ²S/J + σ²e/(J·K).
In Model 2, Bi is interpreted as a fixed effect (i.e., Bi is
the bias associated with laboratory i). All other factors have the
same interpretation used in Model 1. Therefore, in Model 2:

E[Yi,j,k] = μ + Bi
Var[Yi,j,k] = σ²S + σ²e
EQ 3	Var[Ȳi] = σ²S/J + σ²e/(J·K)
Differentiating between Model 1 and Model 2 has significant
practical implications for establishing an approach to compliance
testing. These implications are developed in detail below. For
now, it is sufficient to note that the collection of Bi's is
treated as scalar factors uniquely associated with laboratories.
If the identity of the specific laboratory conducting an analysis
is unknown because it is viewed as randomly selected from the
population of all laboratories, then Bi is treated as a random
effect. If the laboratory conducting the analysis is known, Bi
is treated as a scalar, namely the bias of the i-th laboratory.
The statistical test for compliance is based on an average
of measurements, Ȳ. Assuming that the Y's are normally distributed
(recall that Y is the natural logarithm of the measurement),
noncompliance is inferred when
EQ 4	Ȳ > T
where T and the number of measurements used in the average are
determined by specifying probabilities of various outcomes of the
test. (For simplicity in exposition in this section, the
subscripts i, j, and k used to describe the models in Section 2
are suppressed. Also, σY is used in place of the expressions in
EQ 2 and EQ 3 to represent the standard deviation of Y. The more
detailed notation of EQ 2 and EQ 3 is used in the subsequent
sections where needed.)

Let p1 and p2 be the probabilities of declaring noncompliance
when the true means are d1·C0 and d2·C0 respectively (d1, d2 > 0),
and let
μ0 = ln(C0)
D1 = ln(d1), D2 = ln(d2).
Then
EQ 5	p1 = P[ Ȳ > T : μ = μ0 + D1 ]
EQ 6	p2 = P[ Ȳ > T : μ = μ0 + D2 ]
leads to values of T and the number of measurements used to form
Ȳ by solving
EQ 7	(T - μ0 - D1)/σY = Z1-p1
EQ 8	(T - μ0 - D2)/σY = Z1-p2
where Z1-p1 and Z1-p2 are percentile points of the standard
normal distribution.
The solutions are:
EQ 9	T = σY·Z1-p1 + μ0 + D1
EQ 10	σY = (D2 - D1)/(Z1-p1 - Z1-p2).
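EQ 9 and EQ 10 can be applied mechanically: choose two design points (d1, p1) and (d2, p2), convert to the log scale, and solve for the required σY and the threshold T. A minimal sketch using Python's standard library, with hypothetical design values:

```python
from math import log
from statistics import NormalDist

def design_test(C0, d1, p1, d2, p2):
    """Solve EQ 9 and EQ 10: return (sigma_Y, T) so that the test
    'declare noncompliance when Ybar > T' fires with probability p1
    at true mean d1*C0 and probability p2 at true mean d2*C0."""
    z = NormalDist().inv_cdf
    mu0, D1, D2 = log(C0), log(d1), log(d2)
    z1, z2 = z(1 - p1), z(1 - p2)
    sigma_Y = (D2 - D1) / (z1 - z2)   # EQ 10
    T = sigma_Y * z1 + mu0 + D1       # EQ 9
    return sigma_Y, T

# Hypothetical Case (i) numbers: standard C0 = 10 ug/L, 5% false-alarm
# rate at the standard, 90% detection at 1.5 times the standard.
sigma_Y, T = design_test(C0=10.0, d1=1.0, p1=0.05, d2=1.5, p2=0.90)
print(f"required sigma_Y = {sigma_Y:.4f}, log-scale threshold T = {T:.4f}")
```

The required σY then determines the number of measurements through EQ 2 or EQ 3.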
This formulation allows considerable flexibility for
determining compliance test objectives. Consider the following
three special cases:
Case (i). When d1 = 1, p1 = α, d2 is any positive number
greater than 1, and p2 = 1 - β, the formulation reduces to the
classical hypothesis testing problem H0: μ = μ0 versus
H1: μ = μ0 + D2. The correct number of measurements establishes
the probabilities of Type I and Type II errors at α and β.
Case (ii). Let d1 = 1, d2 be a positive number less than 1,
p1 = 1 - β, and p2 = α. This formulation also reduces to the
classical hypothesis testing problem H0: μ = μ0 + D2 versus
H1: μ = μ0. (Note that μ0 + D2 < μ0, i.e., D2 < 0.)
Case (iii). Let 1 < d1 < d2. Set p1 < p2 to large values
(e.g., .90 and .99). This formulation imposes a high probability
of failing the compliance test when the mean is d1 times the
standard, and a higher probability of failing when the mean is
further above the standard.
Case (ii) imposes a more stringent regulatory program on the
regulated community than Case (i). In Case (i) , the regulated
community may establish control methods to hold the average
pollution level at the standard. In Case (ii), the pollution
level must be controlled at a concentration below the standard if
the specified error rates are to be achieved. In Case (iii), a
formal Type I error is not defined. Individual members of the
regulated community may establish the Type I error rate by
setting their own pollution control level - the lower the control
level, the lower the Type I error rate. In Case (iii), the
regulated community has another option also. There is a tradeoff
between the control level and the number of measurements used in
the compliance test. Individuals may choose to operate at a
level near the standard and increase the number of measurements
used in the compliance test over the number required to achieve
the stated probability objectives. The important difference
between Case (iii) and the two other cases is the responsibility
placed with the regulated community regarding false alarms (i.e.,

Type I errors). Since false alarms affect those regulated more
than the regulator, Case (iii) may be the most equitable approach
to compliance test formulation.
The discussion below follows the structure of Case (i)
described above. Based on the general formulation developed in
Section 3, the conclusions obtained also hold for Cases (ii) and
(iii).
The compliance test is a statistical test of:
H0: μ = μ0 = ln(C0)
H1: μ = μ0 + D2
where C0 is the compliance standard. Assuming the values of the
variance components are known, the test statistic is
Z = (Ȳi - μ0)/(σ²B + σ²S/J + σ²e/(J·K))^1/2.
Specifying the Type I error rate to be α leads to a test
that rejects H0 if
EQ 11	Z > Z1-α
where Z1-α is the (1-α)th percentile point of the standard normal
distribution. If the Type II error is specified to be β when the
alternative mean is μ0 + D2, then:
EQ 12	σ²B + σ²S/J + σ²e/(J·K) = [D2/(Z1-α + Z1-β)]².

Any combination of J and K satisfying EQ 12 will achieve the
compliance test error rate objectives. However, unique values of
J and K may be determined by minimizing the cost of the data
collection program subject to the constraint in EQ 12. Total
cost may be stated as:
EQ 13	TC = J·C1 + J·K·C2
where C1 is the unit cost of obtaining a sample and C2 is the unit
cost of one analysis.
Using the Lagrange multiplier method to minimize EQ 13
subject to the constraint imposed by EQ 12 yields:
EQ 14	K = (σe/σS)·(C1/C2)^1/2.
Under Model 2, since
E(Ȳi) = μ + Bi,
the statistic used in the compliance test must incorporate a bias
adjustment (i.e., an estimate of Bi). This can be achieved by
analyzing standard samples prepared with a known concentration C.
(Choosing C at or near C0 minimizes the effects of potential
model specification errors.) Let
EQ 16	bi,j,k = Y'i,j,k - ln C = Bi + S'i,j + ei,j,k
where the Y'i,j,k are the logarithms of the measurements on the
standard samples. Averaging over the J' standard samples and the
K' replicate analyses gives bi, with
E(bi) = Bi
so that bi is an estimate of Bi, and
Var(bi) = σ²S'/J' + σ²e/(J'·K')
where
S'i,j	- an effect associated with standard samples
	which is random with mean zero and variance
	σ²S';
J'	- the number of standard samples used to
	estimate Bi; and
K'	- the number of analyses conducted on each
	standard sample.
(Note that single analyst variability, σ²e, is assumed to have
the same value for field samples and prepared standard samples.)
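The bias adjustment of EQ 16 amounts to averaging log-recoveries from standard samples of known concentration. A minimal sketch with hypothetical QC data (J' = 5 standard samples at C = 50, each analyzed once, so K' = 1; the measurement values are invented for illustration):

```python
from math import exp, log
from statistics import mean, variance

def bias_adjustment(log_measurements, C):
    """Estimate the laboratory bias B_i (EQ 16) from log-scale measurements
    of standard samples prepared at known concentration C, together with an
    estimate of Var(b_i) based on the scatter of the deviations (K' = 1)."""
    devs = [y - log(C) for y in log_measurements]   # b_{i,j,k} = Y' - ln C
    b_i = mean(devs)                                # estimate of B_i
    var_b = variance(devs) / len(devs)              # estimated Var(b_i)
    return b_i, var_b

# Hypothetical QC measurements a bit below 50, i.e., recovery under 100%.
measurements = [47.0, 49.5, 46.0, 48.5, 47.5]
b_i, var_b = bias_adjustment([log(x) for x in measurements], C=50.0)
print(f"estimated bias b_i = {b_i:.4f} (recovery ~ {100 * exp(b_i):.1f}%)")
```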
The test statistic is
EQ 17	(Ȳi - bi - μ0)/[σ²S/J + σ²S'/J' + σ²e·(1/(J·K) + 1/(J'·K'))]^1/2.

The cost function used to allocate the samples and replicates is:
EQ 18	TC = J·C1 + J'·C3 + (J·K + J'·K')·C2
where C3 is the unit cost of preparing a standard sample.
Type I and Type II error rates - α and β - are achieved if:
EQ 19	σ²S/J + σ²S'/J' + σ²e·(1/(J·K) + 1/(J'·K')) = U
where
U = [D2/(Z1-α + Z1-β)]²
as defined in the discussion of Model 1.
Minimizing costs subject to the constraint on variance yields:
EQ 20	K = (σe/σS)·(C1/C2)^1/2,
which is identical to the solution obtained for Model 1, and
EQ 21	K' = (σe/σS')·(C3/C2)^1/2
EQ 22	J' = (σS'/U)·[σS·(C1/C3)^1/2 + 2·σe·(C2/C3)^1/2 + σS']
EQ 23	J = J'·(σS/σS')·(C3/C1)^1/2.
The solutions for K and K' are similar. Each increases with
the ratio of sampling to analytical costs and the ratio of
analytical to sampling standard deviations.
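Equations 20-23 translate directly into code. The sketch below uses hypothetical costs and standard deviations; in practice the continuous solutions for K and K' would be rounded up to at least one replicate.

```python
from math import sqrt

def allocate(sigma_e, sigma_S, sigma_Sp, C1, C2, C3, U):
    """Cost-minimizing allocation (EQ 20-23): replicates per field sample K,
    replicates per standard sample K', number of standard samples J', and
    number of field samples J, subject to the variance constraint U (EQ 19)."""
    K = (sigma_e / sigma_S) * sqrt(C1 / C2)                      # EQ 20
    Kp = (sigma_e / sigma_Sp) * sqrt(C3 / C2)                    # EQ 21
    Jp = (sigma_Sp / U) * (sigma_S * sqrt(C1 / C3)
                           + 2 * sigma_e * sqrt(C2 / C3)
                           + sigma_Sp)                           # EQ 22
    J = Jp * (sigma_S / sigma_Sp) * sqrt(C3 / C1)                # EQ 23
    return K, Kp, Jp, J

# Hypothetical values: analysis ($200) much costlier than sampling ($20)
# or standard preparation ($10); sampling variation dominates analytical.
K, Kp, Jp, J = allocate(sigma_e=0.1, sigma_S=0.3, sigma_Sp=0.2,
                        C1=20.0, C2=200.0, C3=10.0, U=0.02)
print(f"K = {K:.2f}, K' = {Kp:.2f}, J' = {Jp:.1f}, J = {J:.1f}")
```

Before rounding, the returned values satisfy the variance constraint of EQ 19 exactly.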
In this section the interlaboratory and intralaboratory
approaches for obtaining estimates of the variance components
necessary to implement the designs developed in Section 4 are

described. As in Section 4, the design objective is to control
the compliance test error rates (i.e., the Type I and Type II
error probabilities). The discussion is simplified by
considering situations where the cost of analysis is significantly
greater than the cost of sampling, and the sample-to-sample
variability is at least as large as the analytical variability:
C2 >> C1 and σ²S ≥ σ²e.
Under these conditions, K = 1 (i.e., each sample is analyzed only
once). Also, the value of K' determined from EQ 21 (i.e., the
number of replicate analyses performed on each standard sample)
will be set equal to 1, since the cost of preparing standard
samples for estimating Bi is significantly less than the cost of
analyzing those samples (i.e., C3 << C2).
When K = K' = 1, the variances used to define the test
statistic are, for Model 1 and Model 2 respectively:
EQ 24	Var(Ȳi) = σ²B + (σ²S + σ²e)/J
	= σ²B + σ²e'/J
EQ 25	Var(Ȳi - bi) = (σ²S + σ²e)/J + (σ²S' + σ²e)/J'
	= σ²e'/J + σ²e''/J'.
(The notations σ²e' and σ²e'' reflect the addition of the two
variances indicated in Equations 24 and 25.)
A compliance test designed on the basis of Model 1 requires
estimates of σ²e' and σ²B. An estimate of σ²B can be obtained
only from an interlaboratory study. σ²e' also may be estimated
using interlaboratory data, or it may be estimated from the J
measurements of field samples used to form the average when the
compliance test is performed.
As described by Youden (1975), an interlaboratory study
involves M laboratories (between 6 and 12 are used in practice)
which by assumption under Model 1 are randomly selected from the
collection of all laboratories intending to produce measurements
for compliance testing. For the discussion below, let n denote
the number of samples analyzed by each laboratory. (Youden
recommends n = 6, prepared as 3 pairs where the concentrations of
paired samples are close to each other but not identical.) Let
Wi,j = ln(Xi,j/Cj)
where {Xi,j: i = 1(1)M; j = 1(1)n} are the measurements produced by
the i-th laboratory on the j-th sample, and {Cj: j = 1(1)n} are the
concentration levels used in the study. (Youden does not
recommend using logarithms; however, the logarithmic
transformation is convenient and is consistent with other
assumptions in Youden's design.) The statistical model
describing the interlaboratory study measurements is:
EQ 26	Wi,j = Bi + e''i,j
where
Bi	is an effect associated with the i-th laboratory
	and treated as a random variable with mean zero
	and variance σ²B; and
e''i,j	is analytical error, the sum of single analyst
	error and an effect associated with variation
	among standard samples, which has mean zero and
	variance σ²e''.
Using standard ANOVA (analysis of variance) techniques, σ²B
may be estimated from the "within laboratory" and "between
laboratory" mean squares, Q1 and Q2:
EQ 27	Q1 = ΣΣ(Wi,j - W̄i)²/[M·(n-1)]
EQ 28	Q2 = n·Σ(W̄i - W̄)²/(M-1).
The estimate is:
EQ 29	s²B = (Q2 - Q1)/n
which reflects differences among the laboratories through the
sum of squares
EQ 30	Σ(Bi - B̄)².
Also, Q1 is an estimate of σ²e''.
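Equations 27-29 are a one-way random-effects ANOVA and can be computed directly. The data below are invented for illustration; note that s²B can come out negative when Q1 exceeds Q2, in which case it is customarily truncated at zero.

```python
def variance_components(W):
    """Estimate between-laboratory variance (EQ 27-29) from an
    interlaboratory study: W[i][j] = ln(X_ij / C_j) for lab i, sample j."""
    M, n = len(W), len(W[0])
    lab_means = [sum(row) / n for row in W]
    grand_mean = sum(lab_means) / M
    # EQ 27: "within laboratory" mean square
    Q1 = sum((W[i][j] - lab_means[i]) ** 2
             for i in range(M) for j in range(n)) / (M * (n - 1))
    # EQ 28: "between laboratory" mean square
    Q2 = n * sum((m - grand_mean) ** 2 for m in lab_means) / (M - 1)
    # EQ 29: estimate of sigma^2_B
    s2B = (Q2 - Q1) / n
    return Q1, Q2, s2B

# Hypothetical study: M = 3 laboratories, n = 2 samples each.
W = [[0.10, 0.20], [0.30, 0.40], [0.50, 0.60]]
Q1, Q2, s2B = variance_components(W)
print(f"Q1 = {Q1:.4f}, Q2 = {Q2:.4f}, s2B = {s2B:.4f}")
```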
The compliance test statistic may be defined either as
EQ 31a	R = (Ȳ - μ0)/(s²B + Q1/J)^1/2
or
EQ 31b	R = (Ȳ - μ0)/(s²B + s²e'/J)^1/2
where s²e' is the sample variance of the J measurements,
s²e' = Σ(Yj - Ȳ)²/(J - 1)

and {Yj = ln(Xj), j = 1(1)J} are the measurements obtained
from field samples in the laboratory selected to conduct the
analyses. (Based on the discussion at the beginning of this
section, K is always equal to 1. Therefore, the notation
describing compliance measurements has been simplified to a
single subscript.) Note that Q1 estimates the average variability
over laboratories, whereas s²e' estimates variability for the
laboratory conducting the test. Also, Q1 is an estimate of
σ²e'', the variability associated with the analysis of standard
samples; s²e' is an estimate of the variability associated with
the analysis of field samples.
The ratios in EQ 31a and EQ 31b have approximate t-distri-
butions when the null hypothesis is true. The degrees of freedom
may be estimated by methods developed by Satterthwaite (1946).
Although it is possible to approximate the degrees of freedom and
use a percentile point of the t-distribution to define the test,
that approach is complicated. Developing it at this point would be
an unnecessary diversion. Instead, noncompliance will be
inferred when
inferred when
EQ 32	R > Z1-α
where Z1-α is the (1 - α)th percentile point of the standard
normal distribution. (If R has only a few degrees of freedom,
which is likely, the Type I error rate will be larger than α.
The situation may be improved by using, for example, Z1-α/2 or
some other value of Z larger than Z1-α. If necessary, exact
values of Z could be determined using Monte Carlo methods.)
The number of samples, J, that must be analyzed for the
compliance test is obtained by specifying that the probability of
the expression in EQ 32 is equal to 1 - β when the true mean is
μ0 + D2. The value of J may be obtained by using approximations
based on the normal distribution or the noncentral t-distribution,
or by

estimates based on a Monte Carlo simulation of the exact
distribution of R.
If EQ 31a is used, the compliance test criterion (i.e., the
expression in EQ 32) becomes
EQ 33	GM(Xj) > C0·exp[Z1-α·(s²B + Q1/J)^1/2]
where GM is the geometric mean of the J compliance measurements.
The right side of the inequality is a fixed number once the
interlaboratory study is completed. The advantage of this
approach is the simplicity realized in describing the compliance
test to the regulated community in terms of one measured
quantity, the geometric mean. The disadvantage is using Q1
rather than the sample variance calculated from the compliance
test measurements, which is likely to be a better estimate of
variability for the particular laboratory conducting the test.
Under Model 2, estimates of variance from interlaboratory
study data are unnecessary. Since the laboratory conducting the
analyses for the compliance test is uniquely identified, the
laboratory factor, Bi, is a scalar, and the variance component,
σ²B, does not enter the model. The variance estimates needed for
the compliance test can be obtained from the measurements used to
compute Ȳi and bi.
The test statistic is
EQ 34	t = (Ȳi - bi - μ0)/(s²e'/J + s²e''/J')^1/2
which has an approximate t-distribution with degrees of freedom
equal to J + J' - 2 when the true mean is μ0. (The statistic
would have an exact t-distribution if σ²e' were equal to σ²e''.)
Noncompliance is inferred if

EQ 35	t > t1-α.
J and J' are determined by requiring that the probability of the
expression in EQ 35 be equal to 1 - β when the true mean is
μ0 + D2. This calculation can be made using the noncentral t-
distribution. Where σ²e' = σ²e'', the noncentrality parameter is
D2/[σe'·(1/J + 1/J')^1/2]. (Note that this formulation implies a
tradeoff between J and J' for achieving the compliance test error
rate objectives.) If σ²e' and σ²e'' are not equal, the correct
value to replace t1-α in EQ 35 and values of J and J' may be
determined using Monte Carlo methods.
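When the two variances are equal and J = J', the noncentral t requirement can be roughed out by replacing the t critical value with the normal one, giving a closed form for J. This is a sketch under that approximation (it slightly understates the J an exact noncentral t calculation would give, since t percentiles exceed normal ones at small degrees of freedom); the numerical inputs are hypothetical.

```python
from math import ceil, log
from statistics import NormalDist

def samples_needed(D2, sigma, alpha=0.05, beta=0.10):
    """Normal-approximation sample size for the Model 2 test with J = J':
    the noncentrality D2 / (sigma * sqrt(2/J)) must reach z_{1-a} + z_{1-b},
    which gives J = 2 * (sigma * (z_{1-a} + z_{1-b}) / D2)^2."""
    z = NormalDist().inv_cdf
    delta_required = z(1 - alpha) + z(1 - beta)
    J = 2 * (sigma * delta_required / D2) ** 2
    return ceil(J)

# Hypothetical numbers: detect a true mean 1.5 times the standard
# (D2 = ln 1.5) with sigma_e' = 0.4, alpha = 0.05, beta = 0.10.
print(samples_needed(D2=log(1.5), sigma=0.4))  # -> 17
```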
Both statistical models considered above are consistent with
reasonable approaches to compliance testing. The two approaches,
however, have distinctly different data requirements.
Model 1, through EQ 31a, reflects "the conventional"
approach to compliance testing. A "target value for control,"
C0, is established (e.g., either a health-based standard or a
"best available control technology" standard) and then adjusted
upward to account for both analytical variability and laboratory
differences. Using EQ 33, noncompliance is inferred when the
geometric mean of the compliance test measurements, GM(Xj), is
larger than C0 multiplied by a factor which combines estimates
reflecting variability between laboratories, σ²B, and analytical
variability within laboratories. Since an estimate of σ²B is
required in the Model 1 approach, an interlaboratory study is
required also. The role of σ²B, which reflects laboratory
differences, is to provide insurance against potentially
conflicting compliance results if one set of samples were
analyzed in two different laboratories. Systematic laboratory
differences (i.e., laboratory bias) could lead to a decision of
noncompliance based on analyses conducted in one laboratory and a

decision of compliance based on analyses of the same samples
conducted in another laboratory.
In practice, σ²B is replaced by s²B, an estimate obtained
from the interlaboratory study. The variability of this estimate
also affects the compliance test error rates. If the variance of
s²B is large, controlling the compliance test error rates becomes
complicated. Requiring that more field samples be analyzed
(i.e., increasing J) may help. However, increasing the amount of
interlaboratory QC data to reduce the variance of s²B directly
may be the only effective option. Based on interlaboratory QC
data involving 6 to 12 laboratories, which is current practice,
the error in s²B as an estimate of σ²B is likely to be as large
as 100%. If interlaboratory QC data were obtained from 30
laboratories, the estimation error still would exceed 50%.
(These results are based on a 95% confidence interval for σ²B/s²B
determined using the chi-square distribution.) Since
interlaboratory data collection involving 12 laboratories is
expensive and time consuming, it is doubtful if a much larger
effort would be feasible or could be justified.
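The magnitude of these estimation errors can be checked with a quick calculation. The sketch below is a rough illustration rather than the paper's own computation: it forms the chi-square confidence interval for a variance estimate with (number of laboratories minus one) degrees of freedom, using the Wilson-Hilferty approximation in place of exact chi-square quantiles.

```python
import math

Z975 = 1.959964  # standard normal 97.5% quantile

def chi2_quantile(z, df):
    # Wilson-Hilferty approximation to the chi-square quantile
    # corresponding to the standard normal quantile z.
    return df * (1.0 - 2.0 / (9.0 * df) + z * math.sqrt(2.0 / (9.0 * df))) ** 3

def ci_for_variance(n_labs, s2=1.0):
    """95% confidence interval for the true variance, expressed relative
    to the estimate s2, assuming s2 carries n_labs - 1 degrees of freedom."""
    df = n_labs - 1
    return (df * s2 / chi2_quantile(Z975, df),
            df * s2 / chi2_quantile(-Z975, df))

for labs in (6, 12, 30):
    lo, hi = ci_for_variance(labs)
    print(labs, round(lo, 2), round(hi, 2))
```

Under these assumptions, with 12 laboratories the interval spans roughly 0.5 to 2.9 times the estimate, and even with 30 laboratories it spans about 0.63 to 1.8, consistent in magnitude with the estimation errors quoted above.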
Using Model 2 and the intralaboratory approach, a regulatory
agency would not attempt to control potential compliance decision
errors resulting from laboratory differences by using an estimate
of "between laboratory" variability to adjust the compliance
standard. Instead, compliance data collected in each laboratory
would be adjusted to reflect the laboratory's unique bias and
variability characteristics. In many situations, bias for any
specific laboratory can be estimated as precisely as needed using
QC samples. Also, the variance of the bias estimate, which is
needed for the compliance test, can be estimated from the same
set of QC sample measurements. An estimate of analytical
variability required for the compliance test can be estimated
from the measurements generated on field samples. Therefore, all
information needed to develop the compliance test can be obtained
within the laboratory that produces the measurements for the
compliance test.
From a regulatory management perspective, both approaches
(i.e., Model 1 using interlaboratory QC data and Model 2 using
intralaboratory QC data) lead to compliance tests that satisfy
specified decision error rate objectives. However, the
intralaboratory approach based on Model 2 appears to be the more
direct approach. The design for producing data that satisfy
error rate objectives is laboratory specific, acknowledging
directly that laboratories not only have different bias factors,
but also may have different "within laboratory" variances. Each
laboratory estimates a bias adjustment factor and a variance
unique to that laboratory. Then, the number of samples required
for that specific laboratory to achieve specified error rate
objectives is determined. As a result, each laboratory produces
unbiased compliance data. Also, compliance test error rates are
identical for all laboratories conducting the test. Moreover,
the data used to estimate laboratory bias and precision are
similar to the QC measurements typically recommended for every
analytical program. In summary, the intralaboratory approach
appears, in general, to provide a greater degree of control over
compliance test error rates while using QC resources more
efficiently than the approach requiring interlaboratory QC data.
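The laboratory-specific bias adjustment described above can be sketched as follows. This is a minimal illustration, not the paper's own procedure: the QC measurements, the known reference value T, and the multiplicative form of the bias (chosen to match the paper's use of geometric means) are all assumptions.

```python
import math
import statistics

# Hypothetical QC data: a laboratory repeatedly measures a reference
# sample whose true concentration T is known.
T = 10.0
qc = [11.8, 12.4, 11.5, 12.9, 12.1, 11.7]  # hypothetical QC measurements

# Multiplicative bias factor estimated on the log scale
# (consistent with a geometric-mean model).
logs = [math.log(x / T) for x in qc]
bias_hat = math.exp(statistics.mean(logs))        # estimated bias factor
var_bias = statistics.variance(logs) / len(logs)  # variance of the log-bias estimate

# Adjust a field measurement for this laboratory's bias.
x_field = 14.6
x_adj = x_field / bias_hat

print(round(bias_hat, 3))  # -> 1.206
print(round(x_adj, 2))     # -> 12.11
```

The same QC series supplies both the bias adjustment and the variance of that adjustment, which is the point made in the text: no interlaboratory study is needed.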


George T. Flatman
U.S. Environmental Protection Agency
Dr. Bertram Price has something worth saying
and has said it well in his paper entitled,
"Quality Control Issues in Testing Compliance
with a Regulatory Standard: Controlling Sta-
tistical Decision Error Rates."
The Environmental Protection Agency is
emphasizing "Data Quality Objectives." Dr. Price
has expressed the most important of these objec-
tives in his title, "Controlling Statistical
Decision Error Rates." The paper is timely for
EPA because it demonstrates how difficult the
statistics and the implementation are for data
quality objectives.
In Section 1...Introduction, an "interlabora-
tory study approach" is suggested for establish-
ing "bias and variability criteria." This is
theoretically valid but may not be workable in
practice. In contract laboratory programs,
standards are in a much cleaner matrix (dis-
tilled water instead of leachate) and sometimes
run on cleaner instruments that have not just
run dirty specimens. Standards or blank samples
cannot avoid special treatment by being blind
samples since they are in a different matrix
than the field samples. Thus, in practice, the
same matrix and analytical instruments must be
used to make an "interlaboratory study" an un-
biased estimate of the needed "bias and vari-
ability criteria." Both the theory and the
implementation must be rigorously derived.
In Section 2...Statistical Models the enumer-
ation of the components of variation is important
for both theory and practice. More precise
enumeration of variance components than the
mutually exclusive and jointly exhaustive theory
of "between and within" is needed for adequate
sampling design. I agree with Dr. Price that
"simplification, limiting the number of variance
components, does not limit the generality of
subsequent results," but I suggest it makes
biased or aliased data collection more probable.
For example, the Superfund Interlaboratories
Studies of the Contract Labs has identified the
calibration variance of the analytical instrument
as the largest single component of longitudinal
laboratory (or interlaboratory) variance.
If this component of variation is not enumerated
explicitly, I suggest this component of variance
could be omitted, included once, or included
twice. If all the field samples and lab replicate
analyses were run between recalibrations of the
analytical instrument, the recalibration
variance would be omitted from the variances of
the data. If the analytical instrument were
recalibrated in the stream of field samples and
between lab replicate analyses, the recalibration
variance would be aliased with both the sample
and lab variances, and thus added twice into the
total variance. With these possible analysis
scenarios the recalibration component of variance
could be either omitted or included twice. This
potential for error can be minimized through the
rigorous modeling of all the process sources of
variation in the components of variance model.
This is not a criticism of the paper, but it is a
problem for the implementation of this paper by
EPA's data quality objectives.
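The omitted-versus-double-counted possibilities can be illustrated with a small simulation. All variance values here are assumed for illustration; this is not an analysis of any actual laboratory data.

```python
import random
import statistics

random.seed(1)
SIGMA2_CAL = 4.0  # assumed recalibration (calibration-to-calibration) variance
SIGMA2_ANA = 1.0  # assumed within-calibration analytical variance
TRUE_VALUE = 10.0

def duplicate_pair(recalibrate_between):
    """Measure the same sample twice, with or without recalibrating in between."""
    cal = random.gauss(0.0, SIGMA2_CAL ** 0.5)
    first = TRUE_VALUE + cal + random.gauss(0.0, SIGMA2_ANA ** 0.5)
    if recalibrate_between:
        cal = random.gauss(0.0, SIGMA2_CAL ** 0.5)
    second = TRUE_VALUE + cal + random.gauss(0.0, SIGMA2_ANA ** 0.5)
    return first, second

def mean_pair_variance(recalibrate_between, reps=20000):
    vs = [statistics.variance(duplicate_pair(recalibrate_between))
          for _ in range(reps)]
    return sum(vs) / len(vs)

# Duplicates run under a single calibration omit the recalibration variance...
print(round(mean_pair_variance(False), 1))  # near SIGMA2_ANA = 1.0
# ...while duplicates spanning a recalibration include it.
print(round(mean_pair_variance(True), 1))   # near SIGMA2_ANA + SIGMA2_CAL = 5.0
```

The replicate variance observed in the first case would understate the long-run variability of the data stream, exactly the omission the discussant warns about.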
Section 3...Statistical Test is very important
because it specifically states the null and
alternative hypotheses with their probability
alpha of type I error and probability beta of
type II error. This may appear pedantic to the
harried practitioner but, due to the importance
of the decision, is absolutely essential to data
quality objectives. Dr. Price's alternative
hypothesis and his beta-algebra are complicated
by EPA's interpretation of the law, "no exceedance
of background values or concentration
limits" (40 CFR Part 264). This requires an
interval alternative hypothesis
H1: μ > μ0
rather than Dr. Price's point hypothesis
H1: μ = μ0 + D.
Lawyers should be more aware of how they increase
the statistician's work. Beta is a function, or
curve, over all positive D.
I think it is important to mention in any
environmental testing that beta is more critical
or important than in historical hypothesis test-
ing. Classically the hypotheses are formulated
so that a type II error is to continue with the
status quo when in fact a new fertilizer, brand
of seed potato, etc., would be better. Thus, the
loss associated with the type II error is low and
its probability of occurrence can be large (e.g.,
20 percent) in agricultural experiments. This is
not true in environmental hypothesis testing!
The hypotheses usually make a type II error the
misclassification of "dirty" as "clean," with a
loss in public health and environmental protec-
tion. Thus, beta, representing the probability of
this loss in public health and environmental
protection, should be set arbitrarily low like
alpha (1% or 5%).
Sections 4 and 5...Sample Size Requirements
derive equations for numbers of field samples
and lab replicates as a function of cost and
variances. The formulas digitize the process
for precise decisions between number of field
samples and number of lab replicates. The for-
mulas indicate that an analysis instrument like
GC/MS, because of its high incremental analysis
cost and low variance, requires few replications
(K = 1), but other analysis instruments such as
radiation counters may not. These formulas have
a practical value because of the diversity of
analysis instruments and pollutants.
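The trade-off these formulas capture is the classical two-stage allocation result. The sketch below uses that classical form under assumed costs and variances; the specific equations in Dr. Price's paper may differ in detail.

```python
import math

def optimal_replicates(var_field, var_analysis, cost_field, cost_analysis):
    """Classical two-stage allocation: the number of lab replicates per
    field sample that minimizes the variance of the overall mean for a
    fixed budget is sqrt((var_analysis/var_field) * (cost_field/cost_analysis))."""
    k = math.sqrt((var_analysis / var_field) * (cost_field / cost_analysis))
    return max(1, round(k))

# GC/MS-like case (assumed numbers): expensive, precise analyses -> K = 1
print(optimal_replicates(var_field=4.0, var_analysis=0.2,
                         cost_field=50.0, cost_analysis=400.0))  # -> 1
# Radiation-counter-like case: cheap, noisy analyses -> K > 1
print(optimal_replicates(var_field=4.0, var_analysis=3.0,
                         cost_field=50.0, cost_analysis=5.0))    # -> 3
```

The qualitative conclusion matches the discussant's reading: costly, low-variance instruments push the optimum toward one analysis per field sample.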
Section 5...Sample Size Requirements: Values
of Variance Components Unknown details the rigors
of variance-components estimation through unknown
degrees of freedom and the non-central t-distribution.
It might be asked whether only the sum of var-
iances is needed for testing or "quality assurance"
(i.e., rejection of outliers). This is true, but
"quality improvement" requires the estimation of
each component of variance. The analysis is more
meaningful and usable if the individual compo-
nents have an estimate.
Section 6...Discussion and Conclusions states
that the interlaboratory QC model (variable effects)
and the intralaboratory QC model (fixed effects)
"lead to compliance tests that satisfy specified
decision error rate objectives." This theoreti-
cal position of the paper is confirmed by the
empirical findings of the Superfund Interlabora-
tories Comparison of the Contract Laboratories.
This study found that within-lab variance is of
corresponding magnitude to between-lab variance.
The appropriate test and model should be used,
corresponding to whether one lab or more than
one lab performs the actual chemical analysis
of the data.
In conclusion, Dr. Bertram Price has rigor-
ously presented the algorithms and the problems
for "Controlling Statistical Decision Error
Rates." This paper enumerates the statistical
problems in applying hypothesis testing to real-
world data. Unfortunately, hypothesis testing is
made deceptively simple in many textbooks and the
true complexity is discovered in practice through
the expensive consequences of a wrong decision.
The serious problems discussed in Dr. Price's
paper are needed to sober the superficial use of
"alphas, betas, and other probabilities" in data
quality objective statements. The paper is a
timely and rigorous summary of components-of-
variance modeling and hypothesis testing.
Acknowledgments: The discussant wishes to thank
Forest Garner and Evangelos Yfantis for their
advice, review, and insight gained from Super-
fund interlaboratories testing.
Notice: Although the thoughts expressed in this
discussion have been supported by the United
States Environmental Protection Agency, they have
not been subject to Agency review and therefore
do not necessarily reflect the views of the
Agency, and no official endorsement should be
inferred.
R.O. Gilbert, Pacific Northwest Laboratory; M.L. Miller, Roy F. Weston,
Inc.; H.R. Meyer, Chem-Nuclear Systems, Inc.
The United States government is required under the Uranium Mill Tailings
Radiation Control Act (U.S. Congress Public Law 95-604, 1978) to perform
remedial actions on inactive uranium mill-tailings sites that had been federally
supported and on properties that had been contaminated by the tailings. The
current Environmental Protection Agency (EPA) standard for 226Ra (henceforth
denoted by Ra) in soil (EPA, 1983) requires that remedial action must be taken
if the average concentration of Ra in surface (0- to 15-cm) soil over any
area of 100 square meters exceeds the background level by more than 5 pCi/g,
or if the average exceeds 15 pCi/g for subsequent 15-cm thick layers of soil
more than 15 cm below the surface. Since there are many thousands of 100-
square-meter areas that must be evaluated, the soil sampling plan should be
as economical as possible while still meeting the intent of the regulations.
After remedial action at a site has been conducted, the field sampling
procedure used to determine whether the EPA standard was met has been to
first grid the entire site into 10-m by 10-m plots. Then, in each plot,
20 plugs of surface soil were collected and physically mixed together, from
which a single 500-g composite sample was withdrawn and assayed for Ra. If
this measurement was > 5 pCi/g above background, then additional remedial
action was required. Recently, based on cost considerations and the study
described in Section 2.0, the number of soil plugs per composite sample was
reduced from 20 to 9.
In this paper we discuss a verification acceptance-sampling plan that is
being developed to reduce costs by reducing the number of composite soil samples
that must be analyzed for Ra. In Section 2.0 we report on statistical analyses
of Ra measurements on soil samples collected in the windblown mill-tailings
flood plain at Shiprock, NM. These analyses provide guidance on the number
and size of composite soil samples and on the choice of a statistical decision
rule (test) for the acceptance-sampling plan discussed in Section 4.0. In
Section 3.0, we discuss the RTRAK system, which is a 4-wheel-drive tractor
equipped with four sodium-iodide (NaI) gamma-ray detectors. The RTRAK is being
developed for measuring radionuclides that indicate the amount of Ra in surface
soil. Preliminary results on the calibration of these detectors are presented.

In this section we statistically analyze Ra measurements of composite
soil samples collected from the windblown mill-tailings flood-plain region at
Shiprock, NM. This is done to evaluate the impact on probabilities of false
positive and false negative decision errors resulting from reducing the number
of soil plugs per composite soil sample from 21 to 9 or 5 and from collecting
1, 2, or 3 composite samples per plot. We also consider how these changes
affect the accuracy of estimated mean Ra concentrations.
The Shiprock study involved collecting multiple composite soil samples
of different sizes from 10 plots in the flood-plain region after an initial
remedial action had occurred. Five sizes of composite samples were collected:
those formed by pooling either 5, 8, 9, 16, or 21 plugs of soil.
Figure 1 shows the windblown mill-tailings flood-plain region and the
location of ten 30-m by 30-m study areas from which composite soil samples
were collected. Eight- and 16-plug composite samples were formed by pooling
soil plugs that were collected over the ten 30-m by 30-m areas according to
the three sampling patterns shown in the lower half of Fig. 2. The 5-, 9-,
and 21-plug composite samples were formed by pooling soil plugs collected
from only the central 10-m by 10-m plot in each 30-m by 30-m area using the
three patterns shown in the upper half of Fig. 2.
Up to nine composite samples of each type were formed in each of the ten
areas. Each composite sample of a given type used the same pattern that had
been shifted slightly in location. For example, referring to Fig. 2, the
21-plug composite sample number 1 in a given 10-m by 10-m plot was formed by
pooling soil plugs collected at the 21 positions numbered 1 in the plot.
This design allowed replicate composite samples of a given type to be collected
without altering the basic pattern that would be used in practice.
Each soil plug was collected to a depth of 15 cm using a garden trowel.
The plugs collected for a given composite sample were placed in a bucket and
mixed vigorously by stirring and shaking. The composite sample analyzed for
Ra consisted of about 500 g of the mixed soil.

FIGURE 1. Location of the Ten 30-m by 30-m Areas in the Windblown Mill-
tailings Flood Plain Region at Shiprock, New Mexico, Within
Which Multiple Composite Soil Samples were Collected Following
Initial Removal of Surface Soil. [Figure legend: 10-m by 10-m
plots where 226Ra concentrations were expected to exceed 5 pCi/g.]

FIGURE 2. Sampling Patterns for 5-, 8-, 9-, 16-, and 21-plug
Composite Soil Samples Collected From Ten 30-m by
30-m Areas in the Windblown Mill-tailings Flood Plain
at Shiprock, New Mexico.

The Ra measurements for the composite samples are plotted in Figs. 3, 4,
and 5. The figures also give the arithmetic mean, x, the standard deviation,
s, and the number of replicate composite samples, n. We wish to determine
the extent to which the true standard deviation, o, increases when fewer than
21 plugs are used to form a composite sample. To avoid confusion, we point
out that Figs. 4 and 5 indicate that Ra measurements of most 5-, 9-, and 21-
plug samples from Areas 1, 3, and 4 are larger than measurements for the 8-
and 16-plug samples from those areas. This is believed to have occurred
because the soil in the central 10-m by 10-m plot (from which 5-, 9-, and 21-
plug composite samples were formed) had higher concentrations of Ra than the
soil in the 30-m by 30-m areas from which the 8- and 16-plug samples were
formed (see Fig. 1).
Measurements for Areas 8, 9, and 10 were below 5 pCi/g (Fig. 3) and the
standard deviations ranged from 0.2 to 0.8 pCi/g, with no apparent trends in
s with increasing number of plugs per sample. The data in Fig. 4 indicate
that 5-plug sample data sets may be more skewed than those for 9- or 21-plug
samples, at least for some plots. The measurements for Areas 1, 4, and 7 (Fig.
5) had higher means and were more variable than those for the areas in Figs.
3 and 4. In Fig. 6 are plotted the values of s from Figs. 3, 4, and 5 to
show more clearly the changes in s that occurred as the number of plugs per
composite sample changed.
In this section we first estimate the changes in σ that occur as the
number of plugs per composite sample decreases from 21 to a smaller number.
Then a model for these changes is developed for use in later sections.
A simple model for the ratio of standard deviations is obtained by assuming
that measurements of Ra in individual soil plugs are uncorrelated, that the
soil plugs are thoroughly mixed together before the 500-g aliquot is removed,
and that the standard deviation between soil plugs does not change as the

FIGURE 3. Ra Measurements (pCi/g) of 5-, 8-, 9-, 16-, and 21-plug
Composite Soil Samples Taken from Areas 8, 9, and 10 in
the Windblown Mill-tailings Flood Plain at Shiprock, New
Mexico. x and s are the Arithmetic Mean and Standard
Deviation of the n Measurements for each Data Set.
[Data panels showing n, x, and s for each composite type
are not reproduced.]

FIGURE 4. Ra Measurements (pCi/g) of 5-, 8-, 9-, 16-, and 21-
plug Composite Soil Samples Taken from Areas 2, 3, 5,
and 6 in the Windblown Mill-tailings Flood Plain at
Shiprock, New Mexico. x and s are the Arithmetic
Mean and Standard Deviation of the n Measurements for
each Data Set. [Data panels showing n, x, and s for
each composite type are not reproduced.]

FIGURE 5. 226Ra Measurements (pCi/g) of 5-, 8-, 9-, 16-, and 21-plug Composite Soil
Samples Taken from Areas 1, 4, and 7 in the Windblown Mill-tailings Flood
Plain at Shiprock, New Mexico. x and s are the Arithmetic Mean and
Standard Deviation of the n Measurements for each Data Set. [Data panels
showing n, x, and s for each composite type are not reproduced.]

FIGURE 6. Standard Deviations of Multiple Composite Samples from Areas
1 Through 10 at the Windblown Mill-tailings Flood Plain at
Shiprock, New Mexico. Mean Ra Concentrations for each
Area are Given to Illustrate that Areas with Lower Average
Concentrations tend to have Smaller and More Stable Standard
Deviations.

sampling pattern (see Fig. 2) changes. Under these assumptions we have

     σ_p1 / σ_p2 = (σ'/p1^1/2) / (σ'/p2^1/2) = (p2/p1)^1/2            (1)

where σ' is the standard deviation for individual soil plugs.
Table 1 (column 6) gives values of Eq. 1 for comparison with estimated
geometric means (GMs) and arithmetic means (AMs) of the ratios s9/s21, s5/s21,
s5/s9, and s8/s16 (columns 2 and 4), where the s values are from Figs. 3, 4,
and 5. The modeled and estimated values are in reasonably good agreement.
(Note that the estimated ratios in columns 2 and 4 of Table 1 were computed
after excluding Areas 9 and 10 since those areas had very low and uniform Ra
concentrations.)
Solving Eq. (1) for σ_p1 gives

     σ_p1 = σ_p2 (p2/p1)^1/2                                          (2)

This equation is used here to predict the standard deviation for p1-plug
composite samples using the standard deviation for p2-plug composite samples
(σ_p2), where p2 = 21 and p1 < 21.
The model used for 
TABLE 1. Comparing Estimated and Predicted Ratios of Standard
Deviations for Composite Samples Formed From Different
Numbers of Soil Plugs.

[Table body not reproduced. Columns: ratio of standard deviations*;
estimated ratios (geometric mean (GM), arithmetic mean (AM), and
standard error (SE)) computed using data from Areas 1 through 8†;
predicted ratios** computed using Equation 1.]

* σ_j = true standard deviation of j-plug composite samples.
** Computed as (p2/p1)^1/2, where p1 and p2 are the smaller and larger number
of soil plugs per composite sample, respectively.
† Areas 9 and 10 were excluded because of their very low and uniform Ra
concentrations. GSE = exp(s_g/n^1/2), where s_g is the estimated standard
deviation of the natural logarithms (n = 8).

FIGURE 7. Least-Squares Linear Regression Lines Relating the
Standard Deviation of Replicate Composite Samples
from a Plot to the Estimated Mean Concentration of
Ra for the Plot. [The fitted line shown for 9-plug
composite samples is s = 0.43 + 0.22x, r = 0.76.]

Substituting Eq. (3) in Eq. (2) gives

     σ_p1 = σ_21 (21/p1)^1/2                                          (4)

where σ_21 is given by the regression of Eq. (3). [Equation (5), which
gives the percent accuracy of the estimated plot mean, and an accompanying
display indexed by the number of soil plugs per composite sample, are not
reproduced here.]

By dividing Eq. 5 when p2 = 21 and p1 < 21 by Eq. 5 when p2 = p1 = 21, we
obtain (21/p1)^1/2, which is the factor by which the percent accuracy of 21-
plug composite samples is multiplied to get the percent accuracy of p1-plug
samples. This formula gives 1.5 and 2.0 when p1 = 9 and 5, respectively.
Notice that this factor is not study-site dependent, since it does not depend
on n or σ.
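The factor can be verified directly; a one-line sketch:

```python
def accuracy_factor(p1, p2=21):
    """Factor multiplying the percent accuracy of p2-plug composite
    samples to give that of p1-plug samples, (p2/p1)^(1/2)."""
    return (p2 / p1) ** 0.5

print(round(accuracy_factor(9), 1))  # -> 1.5
print(round(accuracy_factor(5), 1))  # -> 2.0
```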
In this section the increase in remedial-action decision errors as the
number of plugs per sample declines is quantified. These results are obtained
assuming: (1) that Eq. 4 is an appropriate model for the variance of p1-plug
composite samples (p1 < 21), (2) the estimated Ra mean concentration for a
plot based on p1-plug composite samples withdrawn from the plot is normally
distributed, and (3) the mean Ra background concentration is known.
The probabilities of making remedial action decision errors are computed
for three different decision rules:
Decision Rule 1
Take additional remedial action if x' + 1.645σ/√n (the upper 95%
confidence limit on the true plot mean) exceeds 5 pCi/g above background,
where x' is the estimated mean concentration (above background) for the plot
based on n p1-plug composite samples.
Decision Rule 2
Take additional remedial action if x' exceeds 5 pCi/g above background.
Decision Rule 3
Take additional remedial action if x' - 1.645σ/√n (the lower 95%
confidence limit on the true plot mean) exceeds 5 pCi/g above background.
Among these three rules, Rule 1 offers the greatest protection to the
public because the probabilities of taking additional remedial action are
greater than for rules 2 or 3. Rule 3 will result in fewer decisions to take
remedial action than rules 1 or 2 for plots with true mean Ra concentrations
near 5 pCi/g above background. Hence, Rule 3 will tend to reduce costs of

remedial action. Rule 2 is a compromise strategy in that the probabilities
of taking remedial action fall between those for Rules 1 and 3.
Let us define p to be the probability that a statistical test will indicate
additional remedial action is needed. When Decision Rule 1 is used, the
probability p is obtained by computing:

     Z_a = (5 - μ')(n·p1/21)^1/2 / σ_21 - 1.645                       (6)

where 5 is the EPA limit, μ' is the true plot mean above background, σ_21 is
the standard deviation of 21-plug composite samples given by Eq. (3), and p1
is the number of soil plugs used to form each of the n composite samples from
the plot. Z_a is then referred to tables of the cumulative normal
distribution to determine p.
For Decision Rule 2, the same procedure is used except that Eq. (6) is
computed with the constant 1.645 replaced by zero. For Decision Rule 3, the
negative sign before 1.645 in Eq. (6) is replaced by a positive sign.
We computed p for various values of μ' when the background Ra concentration
was assumed to be 1 pCi/g (the approximate background value for the windblown
flood plain at the Shiprock site) when n = 1, 2, or 3, and p1 = 5, 9, or 21.
The results when n = 1 are plotted in Fig. 9, and the results for one, two,
or three 9-plug composite samples are plotted in Fig. 10.
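The computation of p for the three rules can be sketched as follows. Since the Eq. (3) regression coefficients are site specific, sigma21 is supplied by the caller, and the 1.5 pCi/g value used in the illustration is an assumption rather than a Shiprock result.

```python
import math

def phi(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def p_remedial_action(mu_prime, sigma21, n=1, p1=21, rule=1):
    """Probability p that a decision rule calls for additional remedial action.
    mu_prime: true plot mean above background (pCi/g); sigma21: standard
    deviation of 21-plug composite samples (from Eq. 3); rules 1/2/3 shift
    the Eq. (6) statistic by -1.645, 0, and +1.645, as in the text."""
    shift = {1: -1.645, 2: 0.0, 3: 1.645}[rule]
    z = (5.0 - mu_prime) * math.sqrt(n * p1 / 21.0) / sigma21 + shift
    return 1.0 - phi(z)

# Illustration with an assumed sigma21 of 1.5 pCi/g:
for rule in (1, 2, 3):
    print(rule, round(p_remedial_action(3.0, 1.5, n=1, p1=9, rule=rule), 2))
```

As expected, Rule 1 gives the largest probability of taking action and Rule 3 the smallest, and fewer plugs per composite (smaller p1) raises the action probability for plots below the limit.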
These figures indicate that:
1. Decreasing the number of plugs per composite sample increases the
probability of incorrectly deciding additional remedial action is needed.

FIGURE 9. Probabilities of Taking Additional Remedial Action in a
Plot for Three Decision Rules When One 500-g Sample from
a Composite Sample Composed of Either 21, 9, or 5 Soil
Plugs from the Plot is Measured for 226Ra.

For example, if the upper confidence limit rule is used (Rule 1), if one
composite sample is collected, if the true mean for the plot is 3 pCi/g
above background, and if background is 1 pCi/g, then the probability the
rule will indicate additional remedial action is needed increases from
about 0.40 to about 0.65 if a 9-plug rather than a 21-plug composite
sample is used to estimate the plot mean (see Fig. 9).
2.	Decreasing the number of plugs per composite sample increases the
probability of incorrectly deciding additional remedial action is not
needed. For example, if the lower confidence limit rule is used (Rule
3), if one composite sample is collected, if the true plot mean is 10
pCi/g above background, and if background is 1 pCi/g, then the probability
that Rule 3 will correctly indicate additional remedial action is needed
decreases from about 0.60 to about 0.30 if a 9-plug rather than a 21-
plug sample is used (see Fig. 9).
3.	Taking more than one composite sample per plot reduces the probability
of incorrectly deciding additional remedial action is needed. For the
example in number 1 above, the probability decreases from about 0.65 to
about 0.45 if two composite samples rather than one are collected to
estimate the mean (see Fig. 10).
4.	For plots with mean concentrations near 5 pCi/g above background, the
probabilities of taking additional remedial action are highly dependent
on which decision rule is used. For example, if the upper confidence
limit rule is used (Rule 1), the probability is greater than 0.95 that
the test will indicate additional remedial action is needed when the
plot has a mean Ra concentration greater than 5 pCi/g above background.
But if the lower confidence limit rule (Rule 3) is used, and one 21-plug
composite sample is collected, the probability that the test will indicate
additional remedial action is needed does not reach 0.95 until the true
plot mean is about 20 pCi/g above background. Rule 2 falls between these
two extremes. It achieves a 0.95 probability (for one or more 21-plug
samples) when the true mean above background is about 9 or 10 pCi/g (see
Fig. 9).
The three decision rules may find application at different times in the
remedial action process. The upper confidence limit rule seems most appropriate

FIGURE 10. Probabilities of Taking Additional Remedial Action in a
Plot for Three Decision Rules if One, Two, or Three
500-g Samples from a Composite Sample Composed of 9 Soil
Plugs are Measured for 226Ra.

at initial stages when it may be prudent to assume that the plot is contaminated
until proven otherwise. The "price" of using this rule is increased remedial
action costs for plots that have true mean concentrations just under 5 pCi/g
above background. The lower confidence limit rule is more appropriate for
plots that are strongly believed to have already been cleaned to below the
EPA limit. Using this rule, the probability of taking additional remedial
action is less than 0.05 when the true plot mean is 5 pCi/g above background
or less.
The magnitude of changes in the probability of making incorrect remedial
action decisions due to changing the number of soil plugs per composite sample
from 21 to a lesser number depends on the particular statistical test used to
make the decision. For example, suppose the decision to take additional
remedial action will be made whenever the estimated plot mean above background
is greater than the EPA limit of 5 pCi/g above background (Rule 2). Also,
assume that the standard deviation of composite-sample Ra concentrations is a
known constant as modeled using the Shiprock data. Then using one or more 9-
plug rather than 21-plug composite samples increases the probability of making
decision errors (incorrectly deciding additional remedial action is or is not
needed) by no more than about 17 probability points. These maximum increases
are over relatively narrow bands of true plot means above background; between
2.5 and 4.5 pCi/g and between 6 and 13 pCi/g. These bands become smaller if
more than one composite sample per plot is used to estimate the plot mean.
If the plot mean is estimated using one or more 21- or 9-plug samples, the
probability of incorrectly deciding additional remedial action is not needed
is small (< 0.05) when the true plot mean above background exceeds about 15
pCi/g.
If Rules 1 and 3 are to yield the probabilities shown in Figs. 9 and 10,
the true standard deviation for the plot must be given by Eq. (4). At
contaminated sites where this model does not apply, special soil sampling
studies could be conducted to determine whether Eq. (4) or some other model
is applicable. Alternatively, if several composite samples are collected
from each plot then the standard deviation could be estimated directly for
each plot using those data. Then upper or lower confidence limits would be
computed using the t distribution rather than the normal distribution [see

Exner et al. (1985) for an application of the upper confidence limit test].
Use of the t distribution will generally give more decision errors, which is
the price paid when the standard deviation must be estimated. If the mean
background Ra concentration is estimated, this will also increase the standard
deviation and hence the probabilities of making decision errors.
As concerns the comparison of 21-, 9-, and 5-plug samples, the increase
in probabilities of decision errors as the number of plugs per composite sample
is reduced is, on the whole, about the same as shown in Figs. 9 and 10 when
the standard deviation, a, was assumed known. This conclusion is based on
probabilities of decision errors we obtained using the noncentral t distribution
and the methods in Wine (1964, pp. 254-260). These results are shown in
Fig. 11 for the case of two composite samples per plot.
The expected number of plots at a remediated site that are misclassified
as needing or not needing additional remedial action depends on the
probabilities of making decision errors and on the frequency distribution of
the true plot means. Fig. 12 shows the frequency distribution of estimated
Ra means for 1053 plots at the Shiprock floodplain site that had undergone an
initial remedial action (removal of soil). Each mean was estimated by the
measurement of one 20-plug composite sample from the plot. Fig. 12 shows
that 83 plots had estimated means that exceeded the EPA standard of 5 pCi/g
above background (6 pCi/g including the 1-pCi/g background).
We assume for illustration purposes that the histogram in Fig. 12 is
the distribution of true plot means. (When the RTRAK system becomes
operational, it is expected that, following remedial action, all plots will
have Ra concentrations below the EPA limit. Hence, the distribution in
Fig. 12 may be a worst case distribution.) Under this assumption we wish to
determine the effect of using 9 rather than 21 plugs of soil per composite
sample on the expected number of plots that are misclassified. Let n_i be the
number of plots in the ith frequency class, Q be the number of classes, and
p_i be the probability of a decision error for a plot with true mean in the
ith class using a chosen decision rule. Then E = Σ_{i=1}^{Q} n_i p_i is the expected
number of misclassified plots for the decision rule.
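The expected-misclassification formula can be sketched in a few lines. The class counts n_i and per-class error probabilities p_i below are hypothetical stand-ins for the values read from Figs. 9 and 12.

```python
# Illustration of E = sum(n_i * p_i), the expected number of misclassified
# plots. Counts and error probabilities are hypothetical.
def expected_misclassified(class_counts, error_probs):
    """Expected number of misclassified plots over Q frequency classes."""
    return sum(n * p for n, p in zip(class_counts, error_probs))

counts = [400, 300, 150, 80, 40]        # n_i: plots per true-mean class
p_21 = [0.01, 0.02, 0.05, 0.10, 0.20]   # p_i for 21-plug samples (hypothetical)
p_9  = [0.02, 0.04, 0.08, 0.15, 0.30]   # p_i for 9-plug samples (hypothetical)

e21 = expected_misclassified(counts, p_21)
e9 = expected_misclassified(counts, p_9)
print(e21, e9, e9 - e21)
```

As in the text, the comparison of interest is the difference in E between the two sampling designs, not the individual values.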

[Figure 11 shows, for 21-, 9-, and 5-plug samples, the probability of taking
additional remedial action versus Ra concentration (pCi/g) above background
for a plot, with the EPA limit marked. Curves are labeled Upper 95% Rule
(Rule 1), 50% Confidence Rule (Rule 2), and Lower 95% Rule (Rule 3).]

FIGURE 11. Probabilities of Taking Additional Remedial Action
in a Plot for Three Decision Rules if Two 21-, 9-,
or 5-Plug Composite Soil Samples are Collected and
the t Test is Used to Make Decisions.


[Figure 12: histogram; x-axis: Ra (pCi/g), including background of 1 pCi/g.]

FIGURE 12. Frequency Distribution of Estimated Mean Ra
Concentrations (pCi/g) in Surface Soil following
Initial Remedial Action for 1053 10-m by 10-m
Plots in the Windblown Mill-tailings Flood Plain
at Shiprock, New Mexico.

First, we computed E for the 970 plots in the Q = 12 classes in Fig. 12
that had means less than 6 pCi/g, i.e., for plots that met the EPA standard.
Using the probabilities in Fig. 9 for Rule 2 of incorrectly deciding to take
additional remedial action, we found that E = 27.4 and 40.2 for 21- and 9-
plug samples, respectively. Hence, the use of a single 9-plug rather than a
single 21-plug composite in each plot would result in an expected 13 more
plots undergoing unneeded additional remedial action.
Next, we computed E for the 83 plots in Fig. 12 that had means greater
than 6 pCi/g, i.e., for plots needing additional cleanup. Using Rule 2 and
the probabilities of incorrectly deciding no additional remedial action was
needed from Fig. 9, we found E = 12.95 and 19.5 for 21- and 9-plug samples,
respectively. That is, about 7 more plots would not receive needed remedial
action if 9- rather than 21-plug samples were used.
We note that the 83 plots in Fig. 12 that exceeded the EPA standard were
subsequently further remediated.
The results in Sections 2.3 - 2.6 were obtained by modeling the
untransformed data under the assumption that those data were normally distributed.
We used the W statistic to test for normality and lognormality (see, e.g.,
Gilbert (1987) or Conover (1980) for descriptions of this test) of the data
in Figs. 5, 6, and 7. We found that 21-plug samples were more likely to be
normally distributed than the 9- or 5-plug samples, and that 9- and 5-plug
samples were more likely to be lognormally distributed than normally
distributed. Also, the increase in the standard deviation as the mean increases
(see Fig. 7) indicates that the lognormal distribution may be a better model
for these data than the normal distribution.
In this section we investigate the extent to which the probability results
in Section 2.5 would change if the lognormal distribution rather than the
normal distribution was appropriate. To do this, the natural logarithms of
the data in Figs. 3, 4, and 5, were computed and a model was developed for
the standard deviation of the logarithms. We found that after deleting the
data for plots 9 and 10 (the standard deviations of the logarithms (s_y) for
these plots were about twice as large as for the remaining eight plots) there
was no statistically significant linear relationship between s_y and the mean
of the logarithms. This indicates that the lognormal distribution may be a
reasonable model, at least for plots with concentrations at the level of those
in plots 1 through 8. The pooled standard deviation of the logarithms for
plots 1-8 was 0.4, 0.37, and 0.3 for 5-, 9-, and 21-plug samples, respectively.
The probabilities of taking additional remedial action were computed for
Rule 2 for the case of one, two, or three 5-, 9-, and 21-plug samples using
these modeled standard deviations. This was done by computing

    Z = (ln 5 - ln μ)/σ_y

and referring to the standard normal distribution tables, where μ is the true
plot mean above background and σ_y equalled 0.4, 0.37, and 0.3 for 5-, 9-,
and 21-plug samples, respectively.
We found that for 9-plug samples, the false-positive error probabilities
for the lognormal case differed by less than two probability points from those
for the normal case for all mean Ra concentrations less than the EPA limit.
Differences in the false-negative rates were as large as 8 probability points
for mean concentrations between 8 and 10 pCi/g above background for the case
of one 9-plug composite sample per plot. These results, while limited in
scope, suggest that the false-positive and false-negative error probabilities
in Section 2.5 may be somewhat too large if the lognormal distribution is
indeed a better model for the Ra data than the normal distribution.
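The lognormal calculation for Rule 2 can be sketched as follows. It assumes, as in the text, that the logarithm of a composite measurement is normal with mean ln μ and the pooled log-scale standard deviations quoted above (one composite sample per plot).

```python
# Probability of taking additional remedial action under Rule 2 for a
# lognormal model: P = 1 - Phi((ln 5 - ln mu) / sigma_y), where mu is the
# true plot mean (pCi/g above background) and 5 is the EPA limit.
import math
from statistics import NormalDist

def p_take_action(true_mean, sigma_y, limit=5.0):
    z = (math.log(limit) - math.log(true_mean)) / sigma_y
    return 1.0 - NormalDist().cdf(z)

sigma_y = {5: 0.40, 9: 0.37, 21: 0.30}   # plugs per composite -> pooled s_y
for plugs, s in sorted(sigma_y.items()):
    # false-positive rate for a plot actually at 4 pCi/g above background
    print(plugs, round(p_take_action(4.0, s), 3))
```

At the limit itself (μ = 5 pCi/g above background) the rule acts with probability 0.5, which is what the "50% confidence rule" label means.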

The RTRAK is a 4-wheel-drive tractor equipped with four sodium-iodide
(NaI) detectors, their supporting electronics, an industrial-grade IBM PC,
and a commercial microwave auto-location system. The detectors are
independently mounted on the front of the tractor and can be hydraulically
lifted and angled. Bogey wheels support the detectors to maintain a distance
of 12 inches from the ground during monitoring. Each detector has a tapered
lead shield that restricts its field of view to about 12 inches, with overlap
between adjacent detectors. The RTRAK will take gamma-ray readings while
moving at a constant speed of 1 mph. When a reading above a prespecified
level is encountered, red paint is sprayed on the ground to mark these "hot
spots". The automatic microwave locator system provides x-y coordinates with
the count data. This will permit real-time map generation to assist in control
of contamination excavation. Preliminary data indicate that the RTRAK should
be able to detect Ra in soil at concentrations less than 5 pCi/g. Further
tests of the RTRAK's detection capabilities are underway.
The proper calibration of the RTRAK detectors is important to the success
of the remedial-action effort. The NaI detectors detect selected radon
daughter gamma peaks that are related to Ra. Hence, the RTRAK detectors do
not directly measure Ra, the radionuclide to which the EPA standard applies.
Radon is a gas, and the rate at which it escapes from the soil depends on
several factors including soil moisture, source depth distribution, soil radon
emanating fraction, barometric pressure, soil density, and soil composition.
The calibration of the detectors must take these variables into account so
that radon daughter gamma peaks can be accurately related to Ra concentrations
under field conditions.
A field calibration experiment near the Ambrosia Lake, NM, mill-tailings
pile was recently conducted as part of the effort to develop a calibration
procedure. In this experiment the RTRAK accumulated counts of Bi-214 (bismuth)
for approximately 2-second intervals while traveling at 1 mph. Red paint was
sprayed to mark the locations and distances traveled for each time interval.
For each detector, from 3 to 5 surface soil samples were collected down the
centerline of each scanned area (Fig. 13). Then, for each of these areas,

[Figure 13: diagram of the sampling layout; for each detector, rows of
soil-sample points run down the centerline of an 8-foot scanned strip.]

FIGURE 13. Pattern of Soil-Sample Locations and RTRAK
Detector Readings for Obtaining Data to
Calibrate the Detectors.

these samples were mixed and a ~500-g aliquot was removed and sealed in a
metal can that was assayed for Ra within a few days and then again following
a 30-day waiting period to permit equilibrium to be established between Rn
and Bi-214.
The data and the fitted least-squares linear regression line are displayed
in Fig. 14. The data for the 4 detectors have been combined into one data
set because there were no important differences in the 4 separate regression
lines. Also shown in Fig. 14 are the 90% confidence intervals for predicted
individual Ra measurements. The regression line and limits in Fig. 14 were
obtained by first using ordinary least-squares regression on the ln-transformed
data. Then the equation was exponentiated and plotted in Fig. 14. It is
expected that this calibration equation will be adjusted on a day-by-day basis
by taking several RTRAK-detector measurements per day at the same location in
conjunction with measurements of barometric pressure and soil moisture. This
adjustment procedure is presently being developed.
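The calibration fit described above (ordinary least squares on ln-transformed data, then exponentiation) can be sketched with synthetic data. The count rates, power-law coefficients, and noise level below are invented for illustration; the Ambrosia Lake data are not reproduced here.

```python
# OLS fit of ln(Ra) on ln(counts); exponentiating gives Ra ~ exp(a) * counts^b.
# Synthetic calibration data standing in for the Ambrosia Lake measurements.
import math
import random

random.seed(1)
counts = [20.0, 40.0, 80.0, 160.0, 320.0]   # 609-keV count rates (cps)
ra = [0.05 * c ** 1.1 * math.exp(random.gauss(0, 0.05)) for c in counts]

x = [math.log(c) for c in counts]
y = [math.log(r) for r in ra]
n = len(x)
xbar, ybar = sum(x) / n, sum(y) / n
b = sum((xi - xbar) * (yi - ybar) for xi, yi in zip(x, y)) / \
    sum((xi - xbar) ** 2 for xi in x)
a = ybar - b * xbar

print(f"calibration: Ra ~= {math.exp(a):.3f} * counts^{b:.3f}")
```

Fitting on the log scale and exponentiating, as in the text, keeps predicted concentrations positive and handles the roughly multiplicative scatter in such data.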

[Figure 14: scatter plot of laboratory Ra measurements versus RTRAK readings;
x-axis: 609 keV count rate (cps); fitted regression line with 90% confidence
limits shown.]

FIGURE 14. Least-squares Regression Line for Estimating 226Ra
Concentrations (pCi/g) in Surface Soil Based on
RTRAK-Detector Readings of Bi-214 (609 keV).

As illustrated by Fig. 14, there is not a perfect one-to-one correspondence
between RTRAK detector counts for Bi and measurements of Ra in aliquots of
soil. This uncertainty in the conversion of Bi counts to Ra concentrations,
and the fact that the EPA standard is written in terms of Ra concentrations,
suggest that soil samples should be collected in some plots and their Ra
concentrations measured in the laboratory as a further confirmation that the
EPA standard has been met. Schilling (1978) developed a compliance acceptance-
sampling plan that is useful for this purpose.
Schilling's procedure as applied here would be to (1) determine (count)
the total number (N) of 10-m by 10-m plots in the remediated region, (2) select
a limiting (small) fraction (P_L) of defective plots that will be allowed (if
undiscovered) to remain after remedial action has been completed, (3) select
the confidence (C) required that the fraction of defective plots that remain
after remedial action has been conducted does not exceed P_L, (4) enter Table
1 in Schilling (1978) or Table 17-1 in Schilling (1982) with D = NP_L to
determine the fraction (f) of plots to be sampled, (5) select n = fN plots at
random for inspection, and (6) "reject" the lot of N plots if the inspection
indicates one or more of the n plots does not meet the EPA standard. (The
meaning of "reject" is discussed below.)
In Step 6, each of the n plots would be "inspected" by collecting three
or four 9- or 21-plug composite soil samples and using these to conduct a
statistical test to decide if the plot meets the EPA standard. The choice of
three or four 9- or 21-plug samples is suggested by the results of our
statistical analyses in Section 2.0 in the windblown mill-tailings flood plain
region at Shiprock, NM.
Steps 4 and 5 can be simplified by using curves (Hawkes, 1979) that give
n at a glance for specified N, P_L, and C. Also, the Operating Characteristic
(OC) curves for this procedure (curves that give the probability of rejecting
the lot [of N plots] as a function of the true fraction of plots that exceeds
the standard) can be easily obtained using Table 2 in Schilling (1978) or
Table 17-2 in Schilling (1982).

To illustrate the 6-step procedure above, suppose C = 0.90 and P_L = 0.05
are chosen, and that the remediated region contains N = 1000 plots. Then we
find from Fig. 1 in Hawkes (1979) that n = 46 plots should be inspected. If
all 46 inspected plots are found to be non-defective, we can be 100C = 90%
confident that the true fraction of defective plots in the population of N =
1000 plots is less than 0.05, the specified value of P_L. If one or more of
the n plots fail the inspection, then our confidence is less than 0.90.
As another example, suppose there are N = 50 plots in the remediated
region of interest. Then, when C = 0.90 and P_L = 0.05, we find that n = 30
plots should be inspected. Small lots that correspond perhaps to subregions
of the entire remediated region may be needed if soil excavation in these
regions was difficult or more subject to error because of hilly terrain or
other reasons.
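Under the usual lot-sensitive formulation, the sampling fraction can also be computed directly rather than read from tables or curves, as f = 1 - (1 - C)^(1/D) with D = N·P_L; this is a sketch of what the published tables provide, and rounding conventions may differ slightly for small lots.

```python
# Lot-sensitive compliance sampling: number of plots to inspect so that,
# if all pass, we are 100C% confident the defective fraction is below P_L.
import math

def plots_to_inspect(N, P_L, C):
    D = N * P_L                        # allowed number of defective plots
    f = 1.0 - (1.0 - C) ** (1.0 / D)   # fraction of the lot to sample
    return math.ceil(f * N)

print(plots_to_inspect(1000, 0.05, 0.90))  # text example: n = 46
```

For N = 1000, P_L = 0.05, and C = 0.90 this reproduces the n = 46 of the first example above.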
The action that is taken in response to "rejecting the lot" may include
collecting three or four 9- or 21-plug composite soil samples in adjacent plots
surrounding the inspected plots that exceeded the EPA standard. The same
statistical test as used previously in the original n plots would then be
conducted in each of these plots. If any of these plots were contaminated
above the EPA limit, they would undergo remedial action and gamma scans using
the RTRAK system, and additional adjacent plots would be sampled, and so forth.
The calibration and operation of the RTRAK NaI detectors would also need to
be double checked to be sure the detectors and the entire RTRAK system are
operating properly.
An assumption underlying Schilling's procedure is that no decision error
is made when inspecting any of the n plots. However, inspection errors will
sometimes occur since "inspection", as discussed above, consists of conducting
a statistical test for each plot using only a small sample of soil from the
plot. When inspection errors can occur, the fraction of defective plots is
artificially increased, which increases the probability of rejecting the lot.
To see this, let P denote the actual fraction of plots whose mean exceeds the
EPA limit, let P_1 denote the probability of a false-positive decision on any
plot (deciding incorrectly that additional remedial action is needed), and
let P_2 denote the probability of a false-negative decision (deciding incorrectly
that no additional remedial action is needed). Then, the effective fraction
defective is P_e = P_1(1-P) + P(1-P_2). For example, if P_1 = P_2 = P = 0.05,
then P_e = 0.05(0.95) + 0.05(0.95) = 0.095, so that the compliance sampling
plan will operate as if the true proportion of defective plots is 0.095 rather
than 0.05. This means there will be a tendency to reject too many lots that
actually meet the C and P_L specifications.
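The effective-fraction-defective adjustment is a one-liner; the sketch below reproduces the 0.095 example above.

```python
# Effective fraction defective when plot inspection can itself err:
# P_e = P1 * (1 - P) + P * (1 - P2).
def effective_fraction_defective(P, P1, P2):
    return P1 * (1.0 - P) + P * (1.0 - P2)

print(effective_fraction_defective(P=0.05, P1=0.05, P2=0.05))  # about 0.095
```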
In Section 2.5 we saw, using Ra data from the Shiprock, NM, mill-tailings
site, how P_1 and P_2 change with the statistical test used, the true mean
concentration, the number of composite samples, and the amount of soil used
to form each composite sample. If remedial action has been very thorough so
that mean concentrations in all plots are substantially below the EPA limit,
then the true fraction of defective plots, P, will be zero and P_e = P_1 (since
P = 0) will be small. In that case, the probability of "rejecting the lot"
using Schilling's compliance acceptance sampling plan will be small. As
indicated above, this probability is given by the OC curve that may be obtained
using Table 2 in Schilling (1978).

In this paper we have illustrated some statistical techniques for
developing more cost-effective sampling plans for verifying that Ra
concentrations in surface soil meet EPA standards. Although the focus here
was on Ra in soil, these techniques can be used in other environmental
cleanup situations. Because of the high cost of chemical analyses for hazardous
chemicals, it is important to determine the number and type or size of
environmental samples that will give a sufficiently high probability of making
correct cleanup decisions at hazardous-waste sites. Also, it is clear from
Section 2.5 above that when the level of contamination is close to the allowed
maximum concentration limit, the probabilities of making correct cleanup
decisions depend highly on the particular statistical test used to make
decisions. Plots of probabilities such as given in Figs. 9, 10, and 11 provide
information for evaluating which test is most appropriate for making remedial-
action decisions.
A topic that is receiving much attention at the present time is the use
of in-situ measurements to reduce the number of environmental samples that
must be analyzed for radionuclides or hazardous chemicals. The RTRAK system
discussed in this paper is an example of what can be achieved in the case of
radionuclides in soil. Some in-situ measurement devices may only be sensitive
enough to determine if and where a contamination problem exists. Other devices
may be accurate enough to provide a quantitative assessment of contamination
levels. In either case, but especially for the latter case, it is important
to quantitatively assess the accuracy with which the in-situ method can measure
the contaminant of interest. The regression line in Fig. 14 illustrates this point.
It is hoped that this paper will provide additional stimulus for the use
of statistical methods in the design of environmental sampling programs for
the cleanup of sites contaminated with radionuclides and/or hazardous-waste.

Conover, W. J. 1980. Practical Nonparametric Statistics, 2nd ed., Wiley,
New York.
EPA 1983. Standard for Remedial Actions at Inactive Uranium Processing Sites;
Final Rule (40 CFR Part 192). Federal Register 48(3):590-604 (January
5, 1983).
Exner, J. H., W. D. Keffer, R. O. Gilbert, and R. R. Kinnison. 1985. "A
Sampling Strategy for Remedial Action at Hazardous Waste Sites: Clean-up
of Soil Contaminated by Tetrachlorodibenzo-p-Dioxin." Hazardous Waste and
Hazardous Materials 2:503-521.
Hawkes, C. J. 1979. "Curves for Sample Size Determination in Lot Sensitive
Sampling Plans", J. of Quality Technology 11(4):205-210.
Gilbert, R. O. 1987. Statistical Methods for Environmental Pollution
Monitoring. Van Nostrand Reinhold, Inc., New York.
Schilling, E. G. 1978. "A Lot Sensitive Sampling Plan for Compliance Testing
and Acceptance Inspection", J. of Quality Technology 10(2):47-51.
Schilling, E. G. 1982. Acceptance Sampling in Quality Control. Marcel Dekker,
Inc., New York.
Wine, R. L. 1964. Statistics for Scientists and Engineers. Prentice-Hall,
Inc., Englewood Cliffs, New Jersey.

Jean Chesson
Price Associates, Inc., 2100 M Street, NW, Washington, DC 20037
The presentation by Richard Gilbert
provides a good illustration of several
points that have been made by earlier
speakers. My discussion is organized
around three topics that have general
applicability to compliance testing,
namely, decision error rates, sampling
plans, and initial screening tests.
Decision Error Rates
The EPA standard for Cleanup of Land
and Buildings Contaminated with Residual
Radioactive Materials from Inactive Uran-
ium Processing Sites (48 FR 590) reads
"Remedial actions shall be conducted so
as to provide reasonable assurance
that, . . ." and then goes on to define
the requirements for concentrations of
radium-226 in the soil. An objective way
to "provide reasonable assurance" is to
devise a procedure which maintains stati-
stical Type II error rates at an accep-
table level. A Type II error, or false
negative, occurs when the site is decl-
ared in compliance when in fact it does
not satisfy the standard. The probab-
ility of a Type II error must be low
enough to satisfy EPA. On the other
hand, the false positive (or Type I)
error rate also needs to be kept reason-
ably low, otherwise resources will be
wasted on unnecessary remedial action.
The aim is to devise a compliance test
that will keep Type I and II errors with-
in acceptable bounds.
Developing a compliance test involves
three steps. First, a plan for collect-
ing data and a rule for interpreting it
is specified. The paper considers sever-
al sampling plans and three decision
rules for data interpretation. Second,
the decision error rates are calculated
based on a statistical model. In this
case, the model involves a normal distri-
bution, a linear relationship between the
variance and mean for composite samples,
and an assumption of independence between
individual soil plugs making up the comp-
osite. The last two components of the
model are based on empirical data.
Third, the sensitivity of the estimated
error rates to changes in the model ass-
umptions should be investigated. This is
particularly important if the same proce-
dure is going to be applied at other
sites. For example, if the estimated
error rates are very sensitive to the
model relating variance and mean, it will
be necessary to verify the relationship
at each site. Conversely, if the error
rates are relatively insensitive to
changes in the relationship, the com-
pliance test could be applied with con-
fidence to other sites without additional
verification.
Sampling Plans
The sampling plan is an integral part
of the compliance test. The paper illus-
trates how sampling occurs at several
levels. There is the choice of plots
within the site. The current plan in-
volves sampling every plot. The proposed
plan suggests sampling a subset of the
plots according to an acceptance sampling
plan. Then there is the choice of the
number and type of samples. One or more
samples may be collected per plot each
composed of one or more soil plugs.
Usually more than one combination will
achieve the required decision error
rates. The optimum choice is determined
by-the contribution of each type of sam-
ple to the total variance and by relative
costs. For example, if variability bet-
ween soil plugs is high but the cost of
collecting them is low, and the measu-
rement method is precise but expensive,
it is advantageous to analyze composite
samples composed of several soil plugs.
If the measurement method is inexpensive,
it may be preferable to analyze individ-
ual samples rather than composites.
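The cost/variance trade-off described here can be sketched numerically. The variance components and costs below are hypothetical: a composite of k plugs is assumed to have variance σ_plug²/k + σ_meas², and to cost k plug-collections plus one assay.

```python
# Hypothetical illustration of the composite-vs-individual trade-off:
# cheap plugs plus expensive assays favor composites of many plugs.
def var_of_mean(m, k, sigma_plug=2.0, sigma_meas=0.3):
    """Variance of the mean of m analyzed composites, each of k plugs."""
    return (sigma_plug ** 2 / k + sigma_meas ** 2) / m

def cost(m, k, plug_cost=1.0, assay_cost=100.0):
    """Total cost: each composite needs k plugs plus one laboratory assay."""
    return m * (k * plug_cost + assay_cost)

for m, k in [(2, 21), (2, 9), (4, 1)]:
    print(f"{m} samples x {k} plugs: "
          f"var={var_of_mean(m, k):.3f}, cost={cost(m, k):.0f}")
```

With these assumed numbers, two 21-plug composites give a smaller variance at lower cost than four individual samples, illustrating the discussant's point.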
Initial Screening Tests
The RTRAK is an interesting example of
an initial screening test. Initial scre-
ening tests may be used by the regulated
party to determine when the site is ready
for the "real" compliance test, or they
may be an integral part of the compliance
test itself. In either case, the objec-
tive is to save costs by quickly ident-
ifying cases that are very likely to pass
or to fail the clearance test. For ex-
ample, if the RTRAK indicates that the
EPA standard is not being met, additional
remedial action can be taken before final
soil sampling, thereby reducing the num-
ber of times soil samples are collected
before the test is passed. If the init-
ial screening test is incorporated in the
compliance test, i.e., if a favorable
result in the initial screening reduces
or eliminates subsequent sampling re-
quirements, then calculations of decision
error rates must take this into account.
The "reasonable assurance" stated in
the EPA rule is provided by an assessment
of the decision error rates for the en-
tire compliance test. The development
and evaluation of a practical and effec-
tive multi-stage compliance test is a
significant statistical challenge.

John W
Barry D.
U.S. EPA (EN-397F), 401 M St., S.W., Washington, D.C.
This paper discusses a particular class of strategies, "bubbles", for the
management of human exposure to environmental hazards and examines an
application of such strategies to the case of lead in gasoline. While gasoline
is by no means the only source of environmental lead, for most of the
population it has been the dominant source for many years and is certainly
the most controllable source. Lead is not only toxic to people, it is also
toxic to catalytic converters, which are used on vehicles to reduce emissions
of such conventional pollutants as carbon monoxide, hydrocarbons, and oxides
of nitrogen. The twin objectives of protecting people from lead and from the
conventional emissions of vehicles with lead-disabled catalysts led to the
first Environmental Protection Agency (EPA) regulation of the substance in
gasoline in 1979. This first regulation covered the total amount of lead
allowed in each gallon of gasoline produced by a refinery when leaded and
unleaded gasoline are considered together and averaged over a quarter. It
also set up temporary standards at a less stringent level for small refiners.
Without thinking of it in these terms, the Agency had taken the first steps
toward recognizing the need for and implementing a "bubble" policy for lead.
The paper will present some conceptual tools for discussing bubbles and then
examine the application of this management approach to gasoline lead.
Bubbles--General Principles
In general, a bubble approach to environmental regulation may be thought
of as any approach that aims at ensuring that environmental exposure to some
pollutant is reduced or controlled "on the average" while accepting some
variability across emitters in the magnitude of their contribution. "On the
average" and "emitters" are ideas that obviously require further discussion.
Purposes of bubble regulations
Regulators may use bubbles for at least four reasons. First, they may
allow institution of a stringent regulation that would be infeasible for each
entity to meet, yet might be feasible for an industry as a whole. Second,
bubbles make it possible to improve the flexibility of a regulation from the
standpoint of the regulated entities and may thus lessen any negative economic
impacts. The classic plant bubble is a case in point, providing for operating
flexibility by regulating the pollution from the entire plant rather than
that from each smokestack. Third, bubbles may improve the "fairness" of
application of the burdens associated with a regulation. In this way regulators
may mitigate the economic impact of an action upon firms that are somehow
unusually sensitive to its provisions. The final reason for using a bubble
approach is really derivative of the second and third. By minimizing and more
fairly distributing the impact of a regulation, the drafter may make badly
needed controls "possible" in a politico-economic sense. Thus the public
health may be protected by a bubble regulation in a situation where the
economic impact of a simpler regulation would make it politically impossible
to achieve.
Logical elements of a bubble
A bubble regulation always has some dimension or set of dimensions along
which compliance is distributed. The most obvious such dimension is space,
illustrated again by reference to the plant/smokestack bubble. A lack of
compliance in one location may be balanced off against greater than minimum
compliance in another location. It is important in planning the implementation
of a bubble regulation whether sources across which emissions are to be
averaged are part of a single legally responsible entity (as in the plant
model) or are each themselves separate corporate entities.
Time is another dimension along which compliance may be distributed.
Almost all of our regulations are to some degree bubbles in this sense, since
the dimension of time is always involved in our setting of compliance periods.
Time even enters into our selection of the appropriate units (as in cubic
feet per minute). This dimension becomes most important, though, in a situation
where it is actively and intentionally manipulated in the design of the
compliance strategy so as to achieve one or more of the objectives of bubbles
that were mentioned above.
In addition to dimension, any successful bubble approach must have some
thought given to what, for want of a better term, we may call an integrating
medium. This medium must assure that the results of our allowing an uneven
distribution of compliance across some dimension do not also result in sharp
differences in the consequences of exposure across that same dimension. People
in one area suffering from some kind of toxic exposure are afforded scant
comfort by knowing that in consequence of their suffering the people in another
area are not affected at all by the pollutant. So while we are attempting to
achieve fairness in distributing the economic burdens of compliance among
polluters, we must also consider the question of equity in exposure.
The integrating media in most bubbles are the classic air, water, soil,
and food. Under some circumstances we may consider the human body to be an
integrating medium, as in the case of pollutants whose effects are cumulative
in the body over a lifetime. The air may mix the emissions from stack A and
stack B so that the downwind victim experiences the average of the two.
Certain pollutants may be diffused throughout a body of water in such a way
that heavy emissions on one day may be balanced off against very light
emissions on another day with the same effect as if daily emissions had been
carefully held to an intermediate or average level.
Enforcement considerations
Measurement and/or sampling problems may arise with distributed compliance
regulations that are rarely a problem with more conventional approaches. An
example is a scheme for averaging automobile emissions across models or engine
families that was considered by the Agency some years ago. Without a bubble
approach the certification process is limited to determining whether each
engine family meets a single standard. Under a bubble approach a whole set
of issues arises around measuring the emission level of each family within
some confidence limits--questions of sample size and design and distribution
shape rear their heads. When these vehicles are tested to verify their in-use
performance, statistical concerns again arise as we consider whether the
manufacturer should be held responsible for the point estimate of certification
emissions, the lower confidence limit (to provide maximum protection for the
environment), or the upper confidence limit (to protect the manufacturer
against unpleasant surprises that may be based upon sampling error). These
statistical concerns clearly have sharply focussed policy and legal
implications.
One effect of some distributed compliance schemes is to unintentionally
compromise an environmental benefit which arises out of industry quality
assurance provisions. In the simple situation where the manufacturer must
meet a standard and face dire consequences for failing to do so, some
"headroom" is likely to be left between the actual emission level and the
somewhat higher standard. This gap benefits the environment to the extent of
the manufacturer's intolerance of risk. A redesign of such an existing
compliance scheme to a distributed compliance approach with payment of a
monetary penalty for each ton of pollutant over the overall standard may lead
to an increase in emissions by reducing the manufacturers' uncertainty, even
though emissions overall remain under the statutory standard.
The enforcement of bubble regulations may cost more than would be the
case for simpler alternatives. This is true because of the complexity of
sampling and measurement and the administrative machinery needed to carry
out enforcement. Where the bubble regulation provides significant benefits
to the industry in the form of flexibility, but costs more to administer,
the question arises as to whether the Agency or the industry should bear
the cost. An interesting example of the working out of these problems can
be seen in a groundbreaking regulation for heavy-duty engine emissions
negotiated between the Agency and various interested parties. Where a small
manufacturer finds the number of tests required by the Agency to establish
a family's emissions level too burdensome, the firm may elect a sampling
approach that uses fewer tests. The risk to the environment is held constant,
leading to a higher risk of having to pay unmerited non-compliance penalties
in exchange for the smaller sample.
Distributed compliance systems that sounded wonderful when being discussed
in theory by policy makers and economists may contribute to the development
of ulcers by the Agency's legal fraternity. The very complexity of these
schemes may become a major problem in court, where the violator can take
pot-shots at the reasonableness of the regulation and seek refuge in the
loopholes that are the unintended consequence of complexity. The statistical
aspects of the design of the regulation are put to a severe test as the
violator's attorneys and consultants question the Agency's proof that
statistical assumptions were met or question the appropriateness of the
methods chosen. Where compliance is distributed among different firms, major
difficulties may arise over the fixing of responsibility for a violation--a
problem that may be unlikely to occur with a simpler compliance scheme.
The case of lead
History and background
Lead compounds were first used in gasoline in the 1920s to
boost octane. The effects of lead on octane can be seen in the
sample response curve, Figure 1. While this curve is different
for different base gasolines, its essential feature is a
declining octane benefit per unit of lead as the total lead
concentration increases. The nature of this curve creates an
incentive for refiners to spread the amount of lead they are
allowed to use as evenly as possible over the gallons of leaded
gasoline produced. In addition to increasing octane rating,
lead compounds provide some protection from valve wear to older
engines designed with soft valve seats. This valve protection
is provided by relatively low concentrations of lead compared
to the more than two grams per leaded gallon (gplg) once used
in leaded gasoline for octane reasons.
As mentioned earlier, lead in gasoline was first regulated in
1973 both to reduce lead for health reasons and to provide for
availability of unleaded gasoline. Tougher standards for
automotive emissions of carbon monoxide (CO) and hydrocarbons
(HC) led auto makers to turn to catalytic converters as control
devices. Widely used first in 1975, these devices are very
sensitive to poisoning by lead, phosphorus, and other metallic
substances.
Types of refineries
The refining industry grew up with the automobile and is thus a
relatively old industry. Refineries are technologically
stratified by age based upon the level of technology when they
were constructed. The geographical development of the industry
has tended to follow concentrations of population. Thus the
older refineries tend to be located in the East. Newer
refineries tend to be located near emerging centers of
population and more recently developed sources of crude oil.
These newer facilities, incorporating more recent technology,
tend to be located on the West Coast.
As one might expect, refineries also vary considerably in size.
Figure 2 shows something of the size distribution of the
industry. A substantial number of these small refineries
together produce only a small part of the total gasoline
supply. In certain markets, these small facilities may play an
important role due to high transportation costs from areas
where larger and more efficient refineries are located.
The lead bubbles
Quarterly averaging. The first bubble or averaging approach
used in regulating gasoline lead emerged almost unconsciously
in the process of selecting an efficient way to monitor
compliance. Since continuous monitoring of each refinery's
output was not practical, and since requiring that each gallon
of gasoline must meet a standard was very inflexible from the
industry's standpoint, the first regulations prescribed a
compliance period during which the average concentration of
lead could not exceed the standard. The selection of a calendar
quarter represents a compromise between environmental concerns
and the industry's need for flexibility. The dimension for this
bubble, then, is time. The relatively high concentrations
dictate a short time span in order to protect public health.
The integrating media are the air and soil from which lead
emitted in automobile exhaust is taken into the human body. The
environmental concerns regarding the use of the quarter are
mitigated by the fact that the gasoline distribution system
tends to mix gasoline from different producers in the
marketplace, and the air and soil smooth out, over the course
of the quarter, the intensity of human exposure.
Trading. The second bubble occurred in a more deliberate
fashion with regulations that became effective in late 1982 and
early 1983. These regulations shifted the basis of the standard
and introduced a system of trading in lead usage rights. The
standard was changed from one pertaining to a refinery's pooled
gasoline output (unleaded and leaded considered together) to a
standard applied strictly to leaded gasoline. The original
regulation purposefully encouraged the increased production of
unleaded gasoline as this product was new to the market. By
1982, unleaded gasoline had become a permanent fixture. The
change to base the standard on leaded gasoline only was made so
that the total amount of lead in gasoline would decline with
the percentage of gasoline demand that was leaded. Under the
older pooled standard the amount of lead per leaded gallon
could increase as the percentage of leaded declined, resulting
in a slower decline in total lead use.
Accompanied by a tightening of standards and a phaseout of
special small refinery standards, the trading system provided
for an improvement in the allocation of lead usage among
refineries. This was done by permitting refineries which needed
less lead than the standard allowed to sell their excess to
other less technologically advanced refineries. Thus a modern
facility capable of producing leaded gasoline comfortably at
0.70 gplg could sell the product of its leaded gallonage and
the difference between that concentration and the standard of
1.10 gplg to one or more other refineries which found it
necessary to use more than 1.10 gplg in their leaded gasoline.
Such transactions were required to occur during the compliance
period in question and could occur either within corporate
boundaries or across them.
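The arithmetic of such a sale is simple: the rights a refinery can offer equal its leaded gallonage times the gap between the 1.10 gplg standard and its actual concentration. The following is a minimal illustrative sketch of that calculation; the function name and gallonage figure are ours, not the regulation's.

```python
# Illustrative sketch (not the Agency's actual accounting): the lead
# rights a refinery could sell in a compliance period equal its leaded
# gallonage times the gap between the standard and its actual use.
STANDARD_GPLG = 1.10  # grams of lead per leaded gallon

def sellable_rights_grams(leaded_gallons: float, actual_gplg: float) -> float:
    """Grams of lead rights available for sale; zero if over the standard."""
    return max(0.0, (STANDARD_GPLG - actual_gplg) * leaded_gallons)

# A modern facility producing 10 million leaded gallons at 0.70 gplg
# could offer roughly (1.10 - 0.70) * 10,000,000 = 4 million grams.
rights = sellable_rights_grams(10_000_000, 0.70)
```

A refinery running above 1.10 gplg would instead be a buyer, needing rights equal to the same product taken in the other direction.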
Without changing the time dimension, trading extended the
bubble or distributed compliance system for lead into the
dimension of space. Incurring no more transportation costs than
the price of a stamp, a refinery or importer in New Jersey
could purchase the right to use lead that was not needed by a
refinery or importer in Oregon and thereby legitimize actual
lead use that was over the standard. The integrating media were
essentially the same as for quarterly averaging, but greater
reliance was placed upon the homogenizing effects of the
distribution system to avoid the development of "hot spots".
Banking. Responding to a mounting body of evidence on the
negative health effects of lead and to the problem of increased
conventional pollutants from lead-poisoned emission control
systems, the Agency took further action on lead in early 1985.
As shown in Figure 3, the resulting regulations reduced the
allowable lead concentration by 91% in two stages (from 1.10
gplg to 0.50 gplg on July 1, 1985, and from 0.50 gplg to 0.10
gplg on January 1, 1986). This sharp tightening of the standard
for lead was accompanied by a system of banking which
effectively extended the lead bubble over a much longer time
span than the calendar quarter that was previously allowed.
Under the banking provisions a refiner was allowed to store
away in a bank account the difference between the standard and
either 0.10 gplg or actual lead usage, whichever was larger.
Such accumulation of rights was permitted during the four
quarters of calendar 1985. The banked lead rights were to be
available for use or transfer to another refiner or importer
during any future quarter through 1987. Thus lead rights
foregone during 1985 could be used to meet the sharply tighter
0.10 gplg standard during 1986 and 1987, after which any
remaining rights expire. The 0.10 actual lead use limitation on
rights accumulation was intended to avoid any incentive for
refiners to use less than 0.10 gplg in leaded gasoline, since
this was the level believed sufficient to protect the valves of
some older engines from excessive wear.
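The banking provision reduces to a single formula: the bankable quantity is gallonage times the gap between the quarter's standard and the larger of actual use or the 0.10 gplg floor. A hedged sketch of that rule follows; the function and figures are illustrative, not drawn from the regulation's text.

```python
# Sketch of the 1985 banking rule described above: a refiner could bank
# the difference between the quarter's standard and the LARGER of its
# actual lead use or the 0.10 gplg floor, times leaded gallonage.
VALVE_FLOOR_GPLG = 0.10  # level thought needed to protect soft valve seats

def bankable_rights_grams(standard_gplg: float, actual_gplg: float,
                          leaded_gallons: float) -> float:
    """Grams of lead rights a refiner may deposit for the quarter."""
    effective_use = max(actual_gplg, VALVE_FLOOR_GPLG)
    return max(0.0, (standard_gplg - effective_use) * leaded_gallons)

# First half of 1985 (1.10 gplg standard), running at 0.70 gplg:
#   (1.10 - 0.70) * 5,000,000 gallons = about 2 million grams banked.
h1 = bankable_rights_grams(1.10, 0.70, 5_000_000)
# Second half (0.50 gplg standard), running below the floor at 0.05:
#   the floor applies, so (0.50 - 0.10) * 5,000,000 is the bankable amount.
h2 = bankable_rights_grams(0.50, 0.05, 5_000_000)
```

The `max` against the floor is what removes any incentive to strip lead below 0.10 gplg purely to generate rights.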
The Agency's predictions of probable refiner behavior when
given the flexibility of banking are shown in Figure 4, in
which the concentrations from Figure 3 are weighted by
estimates of leaded gallonage. The shaded areas during 1985
represent the extent to which Agency economists expected
refineries to lower lead concentrations in order to bank lead
rights for later use. The shaded areas farther to the right
show the difference between the expected concentrations and the
standard during the 1986-1987 period when the banked rights
could be used to supplement the 0.10 gplg allowed under the
standard. As the figure shows, the Agency expected only partial
use of banking in the first quarter of 1985 due to the time
required for refineries to revise their planning horizons under
the new regulations. The heaviest banking was expected to occur
in the second quarter as refineries were able to take full
advantage of the regulation. The third and fourth quarters were
expected to show only slight banking due to the 55% reduction
in the standard to 0.50 gplg. Predictions for the 1986-1987
period show declining lead use in the second year as additional
octane generation capacity was expected to come into service in
anticipation of the 0.10 standard without banking.
This final step in extending a system of distributed
compliance--a bubble--to cover lead in gasoline completed what
was started by the decision to use quarters as compliance
periods, greatly extending on a temporary basis the time span
over which refineries could demonstrate compliance. Coupled
with the trading provisions to provide for distribution over
the space dimension, the package provided the industry with a
very substantial degree of flexibility in meeting a standard
which public health needs required to be as stringent as
possible. The banking and trading together provided for an
orderly adaptation by the more obsolete facilities, providing
them with the time necessary to install new equipment.
How well it worked
Use of banking and trading. From the very beginning of the
trading provisions in 1983, between one fifth and one third of
the reporting facilities found it either necessary or desirable
to purchase lead rights for use in demonstrating compliance
with the regulations. The amount of lead involved in these
transactions was at first small, amounting to about 7% of the
total lead used. By the end of 1984 this figure had climbed to
20%.
The trading provisions of the regulation unintentionally
permitted facilities blending alcohol into leaded gasoline to
claim and sell lead rights based upon their activity. These
facilities, frequently little more than large service stations,
generated lead rights in the amount of the product of the 1.10
standard and the number of gallons of alcohol they blended.
Both the lead and the gallons of leaded gasoline into which the
alcohol was blended had already been reported by others. While
these alcohol blenders increased sharply in number starting in
the second quarter of 1984, their activities generated only a
small amount of lead rights. This appearance of a new
"industry" as an unexpected consequence of the regulation
should remind the statistician or analyst that "ceteris
paribus" is not always the case. Even with all the available
information about the regulated industry to analyze, all else
will not be equal since the regulation itself will cause
perturbations, such as the new and previously nonexistent class
of blender "refiners".
The banking program provided a great deal of flexibility to the
industry, and accordingly was heavily used from its outset in
the first quarter of 1985, even though the regulations were not
made final until after the end of the quarter. About half of
the entities reporting to the Agency made deposits in that
first quarter, and the industry held the actual lead
concentration to 0.70 gplg--lower than the Agency had
predicted--thus banking more lead rights than expected. Along
with the banking came a sharp increase in trading activity. The
lead rights, because they no longer expired at the end of each
quarter, were worth more and were traded in a more rational
market where sellers had more time to seek out buyers and where
brokers arose to place buyers and sellers in touch with each
other. The higher price of lead rights led to an explosion in
the number of alcohol blenders. Major refiners' facilities,
which were previously not motivated to buy or even sell lead
rights, began to bank and trade aggressively, stocking up
rights for use in the 1986-1987 transition period at the new
more stringent standard of 0.10 gplg.
Figures 5 and 6 show the lead use outcome of banking and
trading compared to the standards and Agency predictions at the
time the standards were promulgated. Figure 5 shows
concentrations while Figure 6 introduces leaded gallonage. The
early and vigorous banking reduced concentrations to a lower
level than expected, and substantial banking continued to occur
on into the second half of the year under a half gram standard.
Actual lead use, as Figure 6 shows, was higher than predicted
in both the second and third quarters as a result of higher
than anticipated leaded gasoline usage. In all, 1985 ended with
a net collective bank balance in excess of ten billion grams.
The first quarter of 1986 saw lead rights leaving the bank at
about the rate that the Agency had predicted. The second
quarter caused some alarm with a sharp drain on the bank owing
to the unusually high leaded gallonage at a substantially
higher concentration than predicted. As Figures 5 and 6 show,
though, this early drain was partially offset by lower than
expected usage in the fourth quarter.
The environmental effect of the regulation has been an
unusually sharp and rapid decrease in a major pollutant, one
that health studies indicate may be more dangerous at lower
concentrations and to a broader segment of the human population
than used to be believed. The banking and trading appear to
have done precisely what they were intended and expected to,
trading off lead use lower than the standard in 1985 against
higher use in 1986-1987 with a total lead use over the period
about the same as if the standards had been rigidly held to. It
may be the case that a lead reduction this severe could not
have been achieved without the distributed compliance approach
that was used. It is certainly true that a transition to lower
standards was achieved with greatly reduced economic impact.
Administration and enforcement. The banking and trading
regulations were conceived with every intent that the Agency
could keep a low profile and let market mechanisms do most of
the work. While this was achieved to a substantial degree, the
need to ensure compliance involved the Agency in processing
more paperwork than the drafters of the regulations
anticipated. It is probably worthwhile to examine briefly how
this came about.
The flood of alcohol blenders swelling the ranks of the
reporting population was not expected. Blenders had first come
onto the scene with the trading provisions. By the end of 1984
they numbered something over a hundred, selling small amounts
of lead credits, generated during the quarter, to small and/or
obsolete refineries which were not otherwise able to meet the
1.10 gplg standard. In the first quarter of 1985 well over 200
additional blenders reported, drawn by the prospect of either
immediately selling their lead usage rights at the sharply
higher prices that prevailed with banking or retaining them and
speculating on the price. As the word of this opportunity
spread among distributors and service station chains, the
population of these "refineries" exploded, reaching more than
600 by the third quarter of 1985 and pushing the reporting
population above 900.
The numbers themselves would not have been such a problem for
the Agency if all of the reports had been made correctly. The
blenders, though, were new to this business. They didn't
understand the regulations, and they lacked the accounting and
legal departments which usually handled reporting for large
refineries. The most common error made by the blenders was to
attempt to bank and immediately sell to another refiner lead
rights that could not legitimately be claimed. This frequently
took the form of simply multiplying the alcohol gallonage by
the standard (1.10 or 0.50 gplg, depending on the quarter),
ignoring the restriction mentioned earlier that lead rights
could be banked only on foregone lead usage above 0.10 gplg. By
the time the blender filed a report and his error was detected
by the Agency's computer, the rights had already been sold to
another party and perhaps resold or used. In addition to the
obvious legal tangle caused by this, there was the instability
of the blender population--the party responsible for the
improperly generated rights could not always be found.
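The gap between what many blenders filed and what the regulation allowed can be made concrete. The sketch below is illustrative only (these function names are ours, not the Agency's audit logic): the erroneous claim multiplies gallonage by the full standard, while a legitimate claim nets out the 0.10 gplg floor.

```python
# Illustrative contrast between the most common blender filing error and
# a legitimate claim under the rule described above (names are ours).
FLOOR_GPLG = 0.10  # rights could be banked only on usage foregone above this

def erroneous_claim_grams(standard_gplg: float, alcohol_gallons: float) -> float:
    """What many blenders filed: gallonage times the full standard."""
    return standard_gplg * alcohol_gallons

def legitimate_claim_grams(standard_gplg: float, alcohol_gallons: float) -> float:
    """What the rule allowed: gallonage times the standard net of the floor."""
    return (standard_gplg - FLOOR_GPLG) * alcohol_gallons

# For 1,000 gallons of alcohol under the 1.10 gplg standard, the
# erroneous claim overstates the legitimate one by 0.10 * 1,000 grams.
```

Small per-filing, but multiplied across hundreds of blenders and then resold, the overstated rights became the legal tangle the text describes.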
The enforcement machinery developed by the Agency to handle
lead phasedown was shaped by certain reasonable expectations
about the reporting population--scale of operations, number of
reporting entities, relative sophistication, etc. The blenders
did not fit these expectations, and the enforcement process
developed considerable congestion until some adaptation could
take place. The computer system developed to audit reports and
especially to match up the parties in lead rights transfers did
precisely what it was designed to do and generated thick stacks
of error output where only a few errors had been expected. The
further processing of the errors had to be done manually and
required clerical and legal staffing at a level that was not
anticipated. By the time these resources were increased to the
appropriate levels the backlog of errors was substantial and
the time elapsed since the filing of the original reports made
sorting things out more difficult.
A further illustration of how the crystal ball can fail is
found in the difference between true refineries and the
blenders in scale of operations. True refineries deal in such
large quantities of gasoline and lead that for convenience all
of the report forms used thousands of gallons and kilograms of
lead as units. To report in smaller units would be to claim a
degree of precision lacking in the basic information available
to the refineries' accounting departments. The effect of
rounding to thousands, trivial to larger refineries, was
definitely not trivial to the blenders, many of whom only
blended a thousand gallons of alcohol in a quarter. The
blenders used whatever units optimized their profit with a fine
disregard for the proper placement of decimal points. Where
their gallonage was, say, 1,600 gallons, they would take
advantage of the rounding instructions on the form to claim
credits based upon 2 units of a thousand gallons each. If the
amount was 1,400, they would report in gallons rather than
thousands of gallons, often without labelling the units or
putting a decimal point in the correct position.
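This unit-shopping amounts to reporting in whichever unit rounds the gallonage upward. A minimal sketch, purely illustrative of the incentive rather than of any actual form:

```python
# Illustrative sketch of the blenders' unit-shopping: report in
# thousand-gallon units when rounding helps (1,600 -> 2 units = 2,000),
# and in raw gallons when rounding to thousands would hurt (1,400).
def flattering_claim_basis_gallons(actual_gallons: int) -> int:
    """Gallonage basis under whichever reporting unit is more favorable."""
    in_thousands = round(actual_gallons / 1000) * 1000  # form's rounding
    return max(in_thousands, actual_gallons)            # pick the larger

# 1,600 gallons rounds up to 2,000; 1,400 is best reported unrounded.
```

A form that fixed the reporting unit, or audited claims against the unrounded gallonage, would have closed this particular loophole.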
All of these difficulties of enforcement logistics came into
being as a result of the complexity of the bubble or
distributed compliance system. With a simple set of rigid
standards there would have been no blenders. Fortunately, this
was a case where the environment suffered almost no harm as a
result of the unforeseen consequences of the regulations,
however embarrassing the situation may have been to Agency
managers. This was probably mostly good luck, and should not be
counted upon to happen routinely.
Legal Considerations
The statistician frequently finds himself with a
well-thought-out concept for a procedure only to be faced with
complications in the implementation scheme. Banking and trading
proved no exception to this problem. The idea of free trade of
lead rights between parties in order to increase flexibility of
each refinery's planning was too good to resist. The government
even took great pains to stay at "arm's distance" in the
trading process. Prior experience with the Department of
Energy's entitlements program, in which the Federal government
established formula upon formula to assure that every refinery
got its "fair share," demonstrated that the Federal government
was not the best broker in the refinery industry! In this case
the EPA was staying out of the business.
So, what could go wrong? Since lead rights are valuable, there
is an incentive to cheat. The value of lead rights rose from
3/4 of a penny to slightly over 4 cents per gram of lead.
Trading and banking transactions are frequently on the order of
25 to 50 million grams. Thus the dollar amounts are in the $1
to $2 million vicinity. Consequently, monitoring and
enforcement become major issues. Monitoring and its requirement
for extra personnel and computer usage has already been
discussed. Enforcement and the legal considerations are another
matter.
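The dollar figures above follow directly from the price range and transaction sizes; a back-of-envelope sketch (constants are the quoted range, rounded):

```python
# Back-of-envelope check of the transaction values quoted above.
PRICE_LOW = 0.0075   # 3/4 of a penny per gram, in dollars
PRICE_HIGH = 0.04    # slightly over 4 cents per gram, rounded to 4 cents

def transaction_value_dollars(grams: float, price_per_gram: float) -> float:
    """Dollar value of a lead rights transaction."""
    return grams * price_per_gram

# A 25 to 50 million gram trade at the higher price works out to
# roughly $1 million to $2 million -- enough to invite cheating.
low = transaction_value_dollars(25e6, PRICE_HIGH)
high = transaction_value_dollars(50e6, PRICE_HIGH)
```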
Prior to banking and trading, the regulations were applied on a
refinery by refinery basis and enforcement was a fairly
straightforward matter. Under banking and trading the host of
possible violations increased exponentially. The types of
violations included trading rights that were improperly
generated, selling the same rights twice, and banking rights
for a future quarter that were in fact required for the current
quarter's compliance. Any of these transgressions, of course,
may have ramifications for the buyers of such lead rights. The
situation becomes very complex from an enforcement standpoint
since frequently rights are sold to an intermediary who resells
them. If the original rights were bogus, or partly bogus, who
among all the recipients has good rights and who has bad ones?
These are not like counterfeit bills; they are entirely
fungible, and determining if a particular right is legitimate
can be a nightmare. Since banking lasts over several time
periods, bogus rights can be exchanged frequently, and tracing
the source of the bad rights can be next to impossible.
Further, what action, if any, should be taken against the good
faith purchaser of such lead rights? This last question
subdivides into possible different actions depending upon
whether the purchaser just deposits the rights into his account
or, alternatively, actually uses them before they are
discovered to be bogus. The possibilities seem endless.
An interesting sidelight to these difficulties is that it is
frequently a small refiner with small amounts of rights that
causes the difficulty. More effort is expended to chase small
infractions than can be imagined, and enforcement policies
designed for use with a small number of large violators prove
awkward and unwieldy when dealing with a large number of small
violators. A second side effect, though no fault of the
designer of the regulation, is that many refineries find
themselves bankrupt in today's oil industry. Chasing after lead
rights of a bankrupt concern is generally far less than
rewarding.
Nevertheless, the system has fared remarkably well. Over ten
billion grams of lead rights were banked, roughly two years'
worth, and no one is asking for government intervention to make
lead rights trading run more smoothly. However, the point to be
made is that the statistician can ill afford to wash his hands
of the problems involved in day-to-day implementation and
enforcement of the regulations. He must guard against being the
party who suggested the program and then walked away when some
aspect didn't work as planned.
Conclusions
We have tried to provide in this paper an analytic framework
for understanding the set of compliance management mechanisms
loosely classified as "bubbles". We have seen something of the
attractive features of such approaches, especially from the
standpoint of the economic flexibility which they may make
possible, but have also seen some of the ways in which things
may go otherwise than as the drafters of the regulations
intended. The lead phasedown banking and trading system was
used to illustrate some of the concepts presented, even though
the statistical problems in this regulation were less extensive
than those with some other bubble regulations.
Distributed compliance schemes are fascinating to economists,
and they are attractive to higher Agency managers from other
professional backgrounds because of their potential to blunt
the resistance to needed environmental regulation and sugarcoat
the regulatory pill. The statistician must have a place in the
development of these regulations--the questions of measurement,
estimation, and uncertainty that are frequently involved demand
it. The proper role of the statistician is not just that of
picking up the pieces after things begin to go wrong in
implementation. Neither is it to be a nit-picking nay-sayer
whose business is to tell people why "you can't get there from
here". Rather the statistician's role should be an affirmative
one--that of a full partner in the regulation development
process. As such, members of the profession must not only serve
in the critical role of assuring a regulation's scientific
integrity (and therefore its enforceability) but must also lend
their creativity and special insights to the fundamental design
of the regulation's compliance system, finding ways to do
things where others, perhaps, cannot.

Figure 1
Gasoline octane enhancement from lead antiknock compounds
[Curve of octane number versus grams of lead per gallon, 0 to 3.0.]

Figure 2
Cumulative percentage of total gasoline production by refinery size percentile*
[Curve of percentage of total gasoline versus size percentile of refineries.]
*Quarter III, 1983

Figure 3
Standards and predicted* lead concentrations under banking and trading
[Quarterly series, grams per leaded gallon: standard in effect and concentrations predicted by model.]
*Costs and Benefits of Reducing Lead in Gasoline, Feb. 1985, p. II-63.

Figure 4
Lead usage predicted with and without banking program*
[Quarterly series, billion grams of lead: grams predicted by the standard, grams predicted by the model, lead banked for future use, and use of banked lead.]
*Leaded gallonage for 1985 and later taken from Costs and Benefits of Reducing Lead in Gasoline, Feb. 1985; earlier gallonages are actual. Predicted concentrations from p. II-63 of the above document.

Figure 5
Predicted* and actual lead concentrations under banking program
[Quarterly series, grams per leaded gallon: predicted by model versus actual.]
*Costs and Benefits of Reducing Lead in Gasoline, Feb. 1985, p. II-63.

Figure 6
Predicted and actual lead usage with banking program*
[Quarterly series, billion grams of lead: grams predicted by the model versus actual lead usage.]
*Predicted lead usage is the same as in Figure 4 and is based upon the Agency's predicted leaded gallonage. Actual gallonage was higher than predicted.
N. Phillip Ross
US Environmental Protection Agency
The concept of bubbles is intrigu-
ing; an umbrella under which trades can
be made which enable regulated indus-
tries within the bubble to meet
environmental standards—standards that
they otherwise may not have been able to
satisfy. This paper describes such a
bubble; an umbrella of time for compli-
ance with lead in gasoline standards.
The idea has logical appeal. Unfor-
tunately, the world in which it is
implemented is not always as logical.
There is an implicit concept of
uniformity that underlies the ideas of
trading and banking. It's okay to have
high levels of pollutants as long as you
balance them against low levels either
at a later point in time or by purchas-
ing "credits." Although the "average"
levels of the pollutant within the bub-
ble's boundaries may be at or below the
EPA standard, there will be many points
within the bubble where levels are well
above the standard. From a public
health point of view, this may not be
desirable. It eventually translates
into periods when the population at risk
will receive exposures to levels greater
than the standard.
As pointed out by the authors, a
major advantage to use of the bubble in
the case of lead in gasoline was that
many refiners and blenders who could not
immediately meet the standard were able
to continue operations through the pur-
chase of credits. Indeed, imposition of
the standard on many of these companies
may well have forced them out of busi-
ness. This is not a minor concern.
Enforcement of environmental standards
is exceptionally difficult. The
regulated industry must be willing to
cooperate through voluntary compli-
ance. The bubble approach, even under
conditions of non-uniformity, provides
the needed incentives to encourage
voluntary compliance. Environmental
standards which cause major economic
hardship for the regulated industry
will be difficult to enforce. Federal
enforcement resources are minimal.
Lack of a substantial enforcement
presence could result in greater pol-
lution through noncompliance. Even
though the real world does not always
conform to the basic assumption of the
bubble model, the real world will use
the approach to achieve an overall
reduction in pollution.
The lead bubble was very successful.
As the authors have pointed out, there
were problems; however, overall the
levels of lead in gasoline did go down
rapidly. This probably would not have
happened under the more traditional
approach to enforcement.
I agree with the authors' conclusion
that statisticians must learn to play
a greater role in developing the
strategies and in "finding ways to do
things where others, perhaps, cannot."
Statistical thinking involves the
consideration of uncertainty in
decisionmaking. All problems cannot be
solved statistically; however, statisti-
cal thinking can help solve problems.
Statisticians need to realize that their
roles are not limited to the design or
analysis components of a study. They
have a role to play in the process of
regulation development and in the
development of new and innovative ways
to deal with enforcement and compliance
problems—ways which are not necessarily
based on mathematically tractable models.

Neil H. Frank and Thomas C. Curran
U.S. Environmental Protection Agency, Research Triangle Park, NC 27711
Introduction
In April 1971, EPA set National Ambient Air Quality Standards
(NAAQS) for particulate matter (PM) and five other air
pollutants - nitrogen dioxide, sulfur oxides, carbon monoxide,
hydrocarbons, and photochemical oxidants.1 There are two types
of NAAQS: primary standards designed to protect human health
and secondary standards designed to protect public welfare. In
recent years, the standard for hydrocarbons has been rescinded
and standards for an additional pollutant, lead, have been
added. The reference method for measuring attainment of the PM
standards promulgated in 1971 was the "high-volume" sampler,
which collects PM up to a nominal size of 25 to 45 micrometers
(um). This measure of PM was called "Total Suspended
Particulate (TSP)" and was the indicator for the 1971 PM
standards. The primary (health-related) standards set in 1971
for particulate matter (measured as TSP) were 260 ug/m3,
averaged over a period of 24 hours and not to be exceeded more
than once per year, and 75 ug/m3 annual geometric mean. The
secondary (welfare-related) standard set in 1971 (measured as
TSP) was 150 ug/m3, averaged over a period of 24 hours and not
to be exceeded more than once per year.
The gaseous NAAQS pollutants, including carbon monoxide,
nitrogen dioxide, ozone, and sulfur dioxide, are sampled with
instruments which operate continuously, producing data for each
hour of the year. These data are subsequently processed into
various statistical indicators necessary to judge air quality
status and attainment with their respective standards. Lead and
TSP are NAAQS pollutants sampled on an intermittent basis. For
these pollutants, one integrated 24-hour measurement is
typically scheduled every sixth day. This is designed to
produce measurements which are representative of every day of
the week and season of the year. This approach has been shown
to be useful in producing unbiased estimates of quarterly and
annual average air quality, but has various limitations
regarding estimation of peak air quality values.
One shortcoming of concern was that attainment of the short-term 260 ug/m3 TSP standard could be judged using data typically collected every sixth day, and there was no specified adjustment for the effect of incomplete sampling. This was recognized as a problem in the early 1970's. If the second highest observed TSP measurement was less than 260 ug/m3, the primary health-related standard was judged as being attained. These standards were termed "deterministic."
Pursuant to the requirements of the 1977 amendments to the Clean Air Act, EPA has reviewed new scientific and technical data and has promulgated substantial revisions to the particulate matter standards.2,3 The review identified the need to shift focus from larger, total particles to smaller, inhalable particles that are more damaging to human health. The TSP indicator for particulate matter has, therefore, been replaced with a new indicator called PM10 that only includes those particles with an aerodynamic diameter smaller than or equal to a nominal 10 micrometers. A 24-hour concentration of 150 ug/m3 was selected to provide a wide margin of safety against exposure which is associated with increased mortality and aggravation of respiratory illness; an annual average concentration of 50 ug/m3 was selected to provide a reasonable margin of safety against long-term degradation in lung function. The secondary standards were set at the same levels to protect against welfare effects. The EPA review also noted that the relative protection provided by the previous short-term PM standards varied significantly with the frequency of sampling. This was identified as a flaw in both the form of the earlier TSP standard and the associated monitoring requirements. Following the recommendations of the EPA staff review, the interaction between the form of the standard and alternative monitoring requirements was considered in developing the recently promulgated PM10 standards.
Form of the New PM10 Standards
The new standards for particulate matter are stated in terms of a statistical form. The 24-hour standards were changed from a concentration level not to be exceeded more than once per year to a concentration level not to have more than one expected exceedance per year. This form corresponds to the one promulgated for the revised ozone standard in 1979.4 The annual standards were changed from an annual average concentration not to be exceeded to an expected annual average concentration. To be more consistent with pollutant exposure, the annual average statistic was also changed from a geometric mean to an arithmetic mean.
The attainment tests described for the new expected value forms of the particulate matter standards are designed to reduce the effects of year-to-year variability in pollutant concentrations due to meteorology and unusual events. For the new 24-hour PM standard, an expected annual number of exceedances would be estimated from observed data to account for the effects of incomplete sampling, following the precedents set for the ozone standard. With averaging of annual arithmetic means and estimated exceedances over a multiple-year time period, the forms of these standards will permit more accurate indicators of air quality status and will provide a more stable target for control strategy development.
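The scaling idea behind the incomplete-sampling adjustment can be sketched in a few lines. This is a simplified illustration of the approach described above, not the promulgated computation in the PM10 rules; the function name and example numbers are hypothetical.

```python
# Sketch: estimate annual exceedances of the 24-hour standard from an
# incomplete record by scaling each quarter's observed count by the
# inverse of its sampling completeness. Illustrative only; not the
# exact regulatory procedure.

def estimated_exceedances(observed, sampled_days, calendar_days):
    """Sum over quarters of: observed count * (calendar days / sampled days)."""
    return sum(obs * (cal / n)
               for obs, n, cal in zip(observed, sampled_days, calendar_days))

# A year sampled roughly every sixth day, with one observed exceedance
# in each of the first and fourth quarters:
obs = [1, 0, 0, 1]            # observed daily exceedances per quarter
n_sampled = [15, 15, 16, 15]  # days actually sampled per quarter
n_days = [90, 91, 92, 92]     # calendar days per quarter
est = estimated_exceedances(obs, n_sampled, n_days)  # 90/15 + 92/15
```

With only two observed exceedances, the adjusted estimate is roughly 12 expected exceedances per year, far above the one allowed; under everyday sampling the estimate would simply equal the observed count.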
The adjustments for incomplete data and use of multi-year time periods are significant improvements in the interpretation of the particulate matter standards. These changes increase the relative importance of the 24-hour standard and play an important role in the development of the monitoring strategy. They also help to alleviate the implicit penalty under the old form that was associated with more complete data. The review of alternative forms of the 24-hour standards identified that the ability to detect nonattainment situations improves with increasing sample size. This is true for the previous "deterministic" form and the current statistical form. With the new 24-hour attainment test, however, there is a significant increase in the probability of failing the attainment test with incomplete data sets. This sets the stage for the attainment sampling strategy.
Figure 1 presents the probability of failing the 24-hour attainment tests for the new PM10 NAAQS over a 3-year period. These failure probabilities were based on: (1) a constant 24-hour PM10 exceedance probability from an underlying concentration frequency distribution with a specified characteristic high value (the concentration whose expected number of exceedances per year is exactly one), and (2) a binomial distribution of the number of observed exceedances as a function of sample size. Lognormal distributions with standard geometric deviations (sgd) of 1.4 and 1.6 were chosen for this illustration to represent typical air quality situations. The approach used in Figure 1 and throughout this paper is similar to analyses presented elsewhere.5,6,7 This facilitates examining properties of the proposed standard in terms of the relative status of a site to the standard level (e.g., 20 percent above the standard or 10 percent below the standard) and the number of sampling days per year. It is worth noting that the percent above or below the standard is determined by the characteristic high. This is more indicative of the percent control requirements than using the expected exceedance rates.
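Failure probabilities of this kind can be approximated with a short calculation. The sketch below follows the stated assumptions (a lognormal concentration distribution and a binomial count of observed exceedances) but, as a simplification, uses a single-year version of the test, so its numbers will not exactly match the paper's 3-year figures.

```python
import math

def phi(x):
    """Standard normal CDF."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def phi_inv(q):
    """Inverse normal CDF by bisection (adequate for this sketch)."""
    lo, hi = -10.0, 10.0
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if phi(mid) < q:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def exceedance_prob(ratio, sgd):
    """Daily probability of exceeding the 24-hour standard at a lognormal
    site whose characteristic high value (expected exceedances per year
    exactly one) is `ratio` times the standard level."""
    z_char = phi_inv(1.0 - 1.0 / 365.0)  # z-score of the characteristic high
    return 1.0 - phi(z_char - math.log(ratio) / math.log(sgd))

def prob_fail(n_days, ratio, sgd):
    """One-year failure probability: the test fails when the observed
    exceedance count, scaled up by 365/n_days, exceeds one."""
    p = exceedance_prob(ratio, sgd)
    k_min = math.floor(n_days / 365.0) + 1  # smallest failing count
    return sum(math.comb(n_days, k) * p**k * (1.0 - p)**(n_days - k)
               for k in range(k_min, n_days + 1))
```

Consistent with Figure 1, the sketch shows failure probability rising with sample size for a site above the standard, and a clearly attaining site failing far less often than a clearly violating one.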
Sampling frequency was judged not to be an important factor in the ability to identify nonattainment situations for either the current or previous annual standards. This is due to the generally unbiased nature and small statistical variability of the annual mean which is used to judge attainment with this standard. The change to an expected annual mean form, however, would tend to provide better estimates of long-term pollutant behavior and provide a more stable indicator of attainment status.
With the new 24-hour attainment test, one important consequence of increased failure probabilities is the potential misclassification of true attainment areas. In Figure 1, it can be seen that these Type I errors are generally higher for small sample sizes, including those typical of previous TSP monitoring. This error is shown to be as high as 0.22 for a site which is 10 percent below the standard and has a sampling frequency of 115 days per year.
During the review of the standards, it was recognized that the ideal approach to evaluating air quality status would be to employ everyday sampling. This would minimize the potential misclassification error associated with the new PM attainment tests. From Figure 1, it can be seen that this would produce the desirable results of high failure probabilities for nonattainment sites and low failure probabilities for attainment sites. Unfortunately, existing PM monitoring technology as well as available monitoring resources do not make it convenient to monitor continuously throughout the nation. Moreover, while more data are better than less, they may not be necessary in all situations. When we revisit Figure 1, it can be seen that when a site is considerably above or below the standard, small sample sizes can also produce reasonably correct results with respect to attainment/nonattainment decisions. Thus, in order to balance the ideal and the practical, a monitoring strategy was developed which involves variable sampling schedules to determine PM10 status and attainment with the new standards.
The new strategy will permit most locations to continue sampling once in 6 days for particulate matter. Selected locations will be required to operate with systematic sampling schedules of once in 2 days or every day. With the approval of the EPA Regional Office, these schedules may also vary quarterly depending on the local seasonal behavior of PM10. Schedules of once in 3 days were not considered because of the discontinuity in failure probabilities occurring at 115 sampling days per year (95% data capture), seen in Figure 1 and discussed elsewhere.5,7
Monitoring Strategy
The previous monitoring regulations which applied to particulate matter specified that "at least one 24-hour sample (is required) every 6 days except during periods or seasons exempted by the Regional Administrator."8 The new PM10 monitoring regulations would permit monitoring agencies to continue this sampling frequency for PM10 but would require them to conduct more frequent PM10 sampling in certain areas in order to estimate air quality indicators more accurately for control strategy development and to provide more correct attainment/nonattainment determinations.9 The change in monitoring practice is largely required to overcome the deficiency of the existing sampling frequency in detecting exceedances of the 24-hour standard. The operating schedules proposed for the measurement of PM10 will consist of a short-term and a long-term monitoring plan. The short-term monitoring plan will be based on the requirements and time schedules set forth in the new PM10 Implementation Regulations for revising existing State Implementation Plans (SIPs).10 The requirements ensure that the standards will be attained and properly maintained in a timely fashion. The long-term requirements will depend on PM10 air quality status derived from future PM10 monitoring data. These are designed to ensure that adequate information is produced to evaluate PM10 air quality status and to ensure that the standards are attained and subsequently maintained.
Consistent with the new reference sampling principle, available PM10 instruments only produce one integrated measurement during each 24-hour period. Multiple instruments operating with timers are, therefore, necessary to avoid daily visits to a given location. The new standards, however, will permit approval of alternative "equivalent" methods which include the use of continuous analyzers. Because of the new monitoring requirements, instrument manufacturers are currently developing such analyzers. This will alleviate the temporary burden associated with more frequent monitoring.
Short-term Monitoring Plan
The proposed first-year monitoring requirements will be based on the requirements for revising SIPs. Areas of the country have been classified into three groups, based upon the likelihood that they are not currently attaining the PM10 standards as well as other considerations of SIP adequacy.11 Since PM10 monitoring is in the process of being established nationwide and is quite limited, a procedure was used which estimated the probability that each area of the country would not attain the new standards using existing TSP data in combination with available PM10 data. This is described elsewhere.12
Areas have been classified as Group I, II or III. Group I areas have been judged to have a high probability, p ≥ 0.95, of not being in attainment with the new standards. Group II areas have been judged to be too close to call, but still very likely to violate the new standards (0.20 ≤ p < 0.95). Group III areas have been judged to be in attainment (p < 0.20).
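The grouping rule itself reduces to two thresholds on the estimated nonattainment probability; a minimal sketch (the function name is hypothetical):

```python
def classify_area(p_nonattainment):
    """Assign an area to Group I, II, or III from its estimated
    probability of not attaining the new PM10 standards."""
    if p_nonattainment >= 0.95:
        return "I"    # high probability of nonattainment
    if p_nonattainment >= 0.20:
        return "II"   # too close to call
    return "III"      # judged to be in attainment
```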
For Group I areas, the value of a first-year intensified PM10 data collection is most important. This is because these areas are most likely to require a revised SIP. Since the 24-hour standard is expected to be controlling, the development of control strategies will require at least 1 complete year of representative data. Consequently, everyday sampling for a minimum of 1 year is required for the worst site in these areas in order to confirm a probable nonattainment status and to determine the degree of the problem.
The Group II category identifies areas which may be nonattainment but whose air quality status is essentially too close to call. For such areas, the value of additional PM10 information is important in order to properly categorize air quality status. For these areas, more intensified sampling is desirable. Based on considerations of cost and available monitoring resources, however, a more practical strategy of sampling once in 2 days at the worst site is required for the first year of monitoring.
All remaining areas in the country (defined in terms of p < 0.20) have been categorized Group III and judged not likely to violate the new standards. For such areas, the value of collecting more than a minimum amount of PM10 data is relatively low and intensified PM10 data collection is not warranted. Recognizing that there is still a small chance of being nonattainment, however, a minimum sampling program is still required at these locations. Based on considerations of failing the 24-hour attainment test and estimating an annual mean value, a minimum sampling frequency of once in 6 days is required.
The short-term strategy also contains provisions for monitoring to be intensified to everyday at the site of expected maximum concentration if exceedances of the 24-hour standard are measured during the first year of monitoring. This is intended to reduce the potential for nonattainment misclassification (Type I error) with the 24-hour PM10 attainment test. With this provision, the first observed exceedance is not adjusted for incomplete sampling and is assumed to be the only true exceedance at that location during the calendar quarter in which it occurred. The effect on misclassification error associated with a 3-year attainment test is illustrated in Figure 2. It can be seen that the sites most vulnerable to this error are slightly less than the standard. In these comparisons, for sites which are 10 percent less than the standard and are sampling once in 2 days, the Type I error is reduced from 6 percent to 1 percent. If these same sites are sampling once in 6 days, the Type I error is similarly reduced from 12 percent to 0.5 percent. There is, however, a corresponding increase in the Type II error associated with the attainment test for true nonattainment sites also close to the standard. This compromise was judged to be appropriate in developing the new rules.
Long-term Monitoring Plan
The long-term monitoring plan starts with the second year of sampling. The required sampling frequencies are based on an analysis of the ratio of measured PM10 concentrations to the controlling PM10 standard. This determination depends upon an assessment of (1) whether the annual or 24-hour standard is controlling and, if it is the latter, (2) the magnitude of the 24-hour PM10 problem. Both items are evaluated in terms of the air quality statistic called the design concentration. For the annual standard, the design concentration is the expected annual mean; for the 24-hour standard, the design concentration is the characteristic high value whose expected exceedance rate is once per year. In both cases the design concentration is the value the control strategy must be capable of reducing to the level of the standard in order to achieve attainment. The ratio to the standard is defined in terms of the design concentration and the standard level; the controlling standard is simply the standard which has the highest ratio. This is a somewhat simplified definition but is adequate for present purposes.
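Under this simplified definition, the controlling standard can be computed directly from the two design concentrations. A sketch follows; the function name is hypothetical, and the 50 and 150 ug/m3 levels are the promulgated annual and 24-hour standards.

```python
def controlling_standard(annual_mean, char_high,
                         annual_std=50.0, daily_std=150.0):
    """Return (controlling standard, its design ratio).
    annual_mean: expected annual arithmetic mean (ug/m3).
    char_high:   characteristic high value, i.e. the concentration with
                 an expected exceedance rate of once per year (ug/m3)."""
    r_annual = annual_mean / annual_std
    r_daily = char_high / daily_std
    if r_daily >= r_annual:
        return "24-hour", r_daily
    return "annual", r_annual

# Example: annual mean 45, characteristic high 180 -> the 24-hour
# standard controls, with a design ratio of 1.2 (20 percent above).
```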
The long-term strategy specifies frequencies of every day, every other day, or every sixth day. The long-term monitoring strategy is designed to optimize monitoring resources and maximize information concerning attainment status. As with the short-term strategy, the increased sampling frequency provisions only apply to the site with the expected maximum concentration in each monitoring area.
For those areas where the annual standard is controlling, 1 in 6 day monitoring would be required; this frequency has been judged to be adequate for assessing status with respect to this standard. For those areas where the 24-hour standard is controlling, the required minimum sampling frequency for the calendar year will vary according to the relative level of the most current maximum concentration site to the level of the standard. In other words, the sampling requirement applies to the site which drives attainment/nonattainment status for the monitoring area. The least frequent monitoring (1 in 6 days) would be required for those areas where the maximum concentration site is clearly above the standard (>40 percent above) or clearly below the standard (>20 percent below). For such sites a minimum amount of data collection would be adequate to verify correct attainment/nonattainment status. As the area approaches the standard, the monitoring frequency for the maximum concentration site would increase so that the misclassification of correct attainment/nonattainment status can be reduced. If the area is either 10-20 percent below or 20-40 percent above the 24-hour standard, 1 in 2 day monitoring would be required. When the area is close to the standard, i.e., 10 percent below to 20 percent above, everyday sampling would be required in order to improve the stability of the attainment/nonattainment classification.
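The schedule described above amounts to a lookup on the ratio of the 24-hour design concentration to the standard; a minimal sketch for areas where the 24-hour standard controls (the function name and the handling of exact boundary values are assumptions):

```python
def longterm_frequency(ratio):
    """Minimum sampling frequency for the maximum-concentration site,
    from the ratio of its 24-hour design concentration to the standard.
    Boundary values are assigned to the adjacent band arbitrarily."""
    if ratio < 0.80 or ratio > 1.40:
        return "1 in 6 days"   # >20% below or >40% above: clear-cut
    if ratio < 0.90 or ratio > 1.20:
        return "1 in 2 days"   # 10-20% below or 20-40% above
    return "every day"         # within 10% below to 20% above
```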
Figures 2 and 3 illustrate misclassification rates for a 3-year, 24-hour attainment test as a function of the relative status of a site to the standard and in terms of alternative sampling frequencies. As with previous analyses, underlying lognormal distributions with sgd's of 1.4 and 1.6 for attainment and nonattainment sites are utilized. For sites following the long-term incomplete sampling schedules (1 in 6 days and 1 in 2 days), misclassification rates can be maintained in or below the neighborhood of 5-10 percent.
The revisions to the PM standards improve the ability to identify nonattainment situations, provide for more stable pollutant indicators, and change the relative importance of the annual and 24-hour averaging times. With the required adjustments for incomplete sampling in the interpretation of PM data, the revised standard would correct for the variable protection afforded by the current 24-hour PM standard, and it is expected that the revised 24-hour standard will generally be controlling. Monitoring requirements have been promulgated which will similarly correct for the deficiency in the current standards. Variable frequencies are now required in order to reduce the uncertainty associated with attainment/nonattainment classification. This provides more uniform protection by the standards but at the same time conserves scarce monitoring resources. The initial requirements will place the most emphasis on areas with the highest estimated probability of violating the standards, while the long-term strategy will allow sampling frequency to vary according to the relative status of an area with respect to the standard concentration levels.
The operational difficulties associated with implementing the new requirements for everyday monitoring have generated new research initiatives to develop a continuous analyzer for PM10. Once this is available, particulate matter can be conveniently monitored everywhere on the same basis as the gaseous NAAQS pollutants.
References
1. "National Primary and Secondary Ambient Air Quality Standards," Federal Register, 36(84):8186. April 30, 1971.
2. Review of the National Ambient Air Quality Standards for Particulate Matter: Assessment of Scientific and Technical Information, OAQPS Staff Paper. U.S. Environmental Protection Agency, Research Triangle Park, N.C. 27711. EPA-450/5-82-001. January 1982.
3. "Revisions to the National Ambient Air Quality Standards for Particulate Matter," Federal Register, 52(126):24634. July 1, 1987.
4. "Revisions to the National Ambient Air Quality Standard for Photochemical Oxidants," Federal Register, 44(28):8202. February 8, 1979.
5. Frank, N. H. and T. C. Curran, "Statistical Aspects of a 24-hour National Ambient Air Quality Standard for Particulate Matter," presented at the 75th APCA Annual Meeting, New Orleans, LA, June 1982.
6. Davidson, J. E., and P. K. Hopke, "Implications of Incomplete Sampling on a Statistical Form of the Ambient Air Quality Standard for Particulate Matter," Environmental Science and Technology, 18(8), 1984.
7. Frank, N. H., S. F. Sleva and N. J. Berg, Jr., "Revising the National Ambient Air Quality Standards for Particulate Matter - A Selective Sampling Monitoring Strategy," presented at the 77th Annual Meeting of the Air Pollution Control Association, San Francisco, CA, June 1984.
8. "Ambient Air Quality Surveillance," Federal Register, 44(92):27571. May 10, 1979.
9. "Ambient Air Quality Surveillance for Particulate Matter," Federal Register, 52(126):24736. July 1, 1987.
10. "Regulations for Implementing Revised Particulate Matter Standards," Federal Register, 52(126):24672. July 1, 1987.
11. "PM10 Group I and Group II Areas," Federal Register, 52(152):29383. August 7, 1987.
12. Pace, T. G., and N. H. Frank, "Procedures for Estimating Probability of Nonattainment of a PM10 NAAQS Using Total Suspended Particulate or Inhalable Particulate Data," U.S. Environmental Protection Agency, Research Triangle Park, N.C. 1984.
[Figure 1. Probability of failing the 24-hour attainment test as a function of sampling days per year (61, 122, 183, 244, 305); numbers indicate typical sampling days per year.]

[Figure 2. Misclassification rates versus relative level to the 24-hour standard (0.5 to 1.0) for attainment sites; numbers indicate typical sampling days per year.]

[Figure 3. Misclassification rates versus relative level to the 24-hour standard (1.1 to 1.5) for nonattainment sites; numbers indicate typical sampling days per year.]

John Warren
US Environmental Protection Agency
The use of the statistical concept of
expectation for comparing monitoring data
with a standard is new and quite intri-
guing as it offers promise of extension
to other standards and regulations. The
difference between existing standards and
the new statistical standards is illus-
trated by the PM-10 standards.
Existing standards:
o The 24-hour concentration is not to
exceed 150 micrograms per cubic
meter more than once per year.
o The annual average concentration is
not to exceed 50 micrograms per
cubic meter.
New standards:
o The expected 24-hour concentration
is not to exceed 150 micrograms per
cubic meter more than once per year.
o The expected annual average concen-
tration is not to exceed 50.
The advantages of the "expected" methodology over the existing methodology are:
o It has been used in a similar fashion in generating the ozone standard and is therefore "familiar" to the user community.
o It uses actual data to generate the standard.
o There is a reduction in year-to-year variability.
o It enables the development of stable control strategy targets.
The difference between the two method-
ologies would therefore appear to be
small and hence readily adaptable to
other standards. One possible candidate
for the new methodology would seem to be
Effluent Guidelines and Standards,
Subchapter N, 40 CFR 400-471. These reg-
ulations stem from the Clean Water Act
(1972) and are based on the engineering
standards of Best Practicable Technology
(BPT) or Best Available Technology (BAT).
These guidelines cover mining industries
(minerals, iron ore, coal etc.), natural
products (timber, pulp and paper, leather
tanning etc.), and the manufacturing
industries (pharmaceutical, rubber, plas-
tics, etc.). A typical standard within
these guidelines is the Steam Electric
Power Generating Point Source Category
(Part 423.12, Effluent Limitations Using
BPT) :
BPT Effluent Limitations

    Pollutant              Maximum for    Average of daily
    or Property            any 1 day      values for 30 days
                                          shall not exceed
    Total Suspended
      Solids               100.0 mg/1     30.0 mg/1
    Oil and Grease         20.0 mg/1      15.0 mg/1
    Copper, total          1.0 mg/1       1.0 mg/1
    Iron, total            1.0 mg/1       1.0 mg/1
Although there are small differences in sampling protocols, comparison with the new and old PM-10 standards would seem to imply that a set of standards devised on an expected basis would be possible; however, it is not to be. The problem lies with the very different objectives of the regulations: state versus industry. The PM-10 standard applies to a State Implementation Plan, a negotiated agreement between EPA and the states, enforced through the National Ambient Air Quality Standards and used to identify non-attainment areas. The Effluent Guidelines, on the other hand, apply to a specific industry and are not a matter of negotiation.
The resolution of the regulatory
problems will be as difficult as the
associated statistical problems of:
o Assumption of lognormality of data
o Stability of the process over time
o Potential autocorrelation of data
o Uncertainties of data quality
o The optimal allocation of monitoring
systems in non-attainment areas.
Despite these problems, it is clear
that a statistical approach, in this case
expected values based on an underlying
lognormal distribution, is probably the
way of the future; research should be
encouraged in this field. Neil Frank and
Thomas Curran have indicated a viable
approach; where will the next step lead?

Thomas Hammerstrom and Ronald E. Wyzga
1. Introduction and Motivation
Several studies have examined the physiological and symptomatic responses of individuals to various air pollutants under controlled conditions. Exposures in these experiments are often of limited duration. These studies demonstrate response with exposures as short as five minutes. On the other hand, monitoring data rarely exist for periods as short as five minutes. Some measurement methods do not lend themselves to short-term measurements; for other methods, 5-minute data often are collected but are not saved or reported because of the massive effort that would be required. In general, the shortest time average reported with monitoring data is one hour, and for some pollutants even this time average is too short.
Where monitored data do not exist, ambient concentrations can be estimated by the use of atmospheric dispersion models. The accuracy of these models degrades as averaging times decrease, and they require meteorological and atmospheric inputs for the same time average as predicted by the model. Thus, air dispersion models are rarely used for time averages less than an hour. There is, thus, a fundamental mismatch in time periods between health response and exposure, with responses occurring after only 5 or 10 minutes of exposure while exposure data are only available for periods of an hour or more. This paper attempts to address this mismatch by examining the relationship between a short-term time average (5 minutes) and a longer term time average (60 minutes) for one pollutant (SO2) for which some data are available. Understanding the relationship between the two time averages would allow the estimation of response given longer term estimates of ambient concentration. It could also help in the setting of standards for long term averages which would help protect against peak exposures.
This paper explores the type of inferences that can be made about five minute SO2 levels, given information on hourly levels. There are three possible models for health effects which motivate these inferences:
1. there is one effect in an hour if any 5-minute exposure level exceeds a threshold,
2. each 5-minute segment corresponds to an independent Bernoulli trial with probability of an effect equal to some increasing function of the current 5-minute level,
3. each 5-minute segment is a Bernoulli trial with the probability of an effect depending on the entire recent history of the SO2 process.
Corresponding to these health models, there are three possible parameters to estimate:
1. the distribution of the maximum 5-minute level during an hour,
2. the distribution of an arbitrary 5-minute reading,
3. the joint distribution of all twelve 5-minute readings.
All three distributions are conditional distributions, given the average of all twelve 5-minute readings. The first conditional distribution is the parameter of interest if one postulates that the dose-response function for health effect is an indicator function and only one health event per hour is possible; the second is the parameter of interest if one postulates a continuous dose-response function with each 5-minute segment constituting an independent Bernoulli trial; the third conditional distribution is of interest if one postulates that the occurrence of a health effect within an hour depends continuously on the cumulative number of 5-minute peaks.
This paper discusses some approaches to each of these three estimation problems. Section 2 discusses why the problem is not amenable to solution by routine algebra. Sections 3 and 4 present results for the estimation of the maximum. Section 3 presents some ad hoc methods for modelling the maximum as a simple function of the average when both are known and discusses how to extend these methods to estimate the maximum when it is unknown. Section 4 discusses the error characteristics of these methods. Section 5 presents an ad hoc method of estimating an arbitrary 5-minute level from the hourly average; Section 6 discusses the error characteristics of this method. Finally, Section 7 presents an estimation of the joint distribution of all twelve 5-minute readings, derived from a specific distribution-theoretic model for the 5-minute time series, and discusses some of the difficulties involved with extending this model.
2. Obstacles to Theoretical Analysis
A brief discussion of why we resorted to ad hoc methods is needed to begin with. In theory, given a model for the (unconditional) joint distribution of the time-series of 5-minute readings, it is straightforward to write down the exact formula for the joint conditional distribution of the twelve 5-minute readings, given the hourly average. If X = (X1, ..., Xp) has joint density f(x) and Xbar = (sum of the Xi)/p, then the conditional joint density is given by equation 1:

    h(x | Xbar = xbar) = f(x) I(sum xi/p = xbar) / [integral over S of f(x) dx]    (1)

where S is the simplex {x : sum xi/p = xbar} and I is the indicator function. The conditional distribution of the maximum and the conditional distribution of any 5-minute reading would follow immediately from the conditional joint distribution of all twelve 5-minute levels.
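Because the simplex integral in equation 1 is rarely tractable in closed form, the conditional distributions can instead be explored by simulation. A minimal sketch follows, under the simplifying assumption of independent lognormal 5-minute readings (an assumption the authors do not make; real 5-minute series are autocorrelated), conditioning on hours whose average lands near a target value.

```python
import random

def simulate_conditional_max(target_avg, tol=0.05, gm=1.0, sgd=1.6,
                             n_hours=50_000, seed=1):
    """Crude Monte Carlo estimate of E[max of twelve 5-minute readings |
    hourly average near target_avg], for i.i.d. lognormal readings with
    geometric mean gm and standard geometric deviation sgd."""
    rng = random.Random(seed)
    maxima = []
    for _ in range(n_hours):
        # Twelve lognormal 5-minute readings: gm * sgd**Z, Z ~ N(0, 1).
        hour = [gm * sgd ** rng.gauss(0.0, 1.0) for _ in range(12)]
        avg = sum(hour) / 12.0
        # Keep hours whose average falls within tol of the target.
        if abs(avg - target_avg) <= tol * target_avg:
            maxima.append(max(hour))
    return sum(maxima) / len(maxima) if maxima else float("nan")
```

Since the hourly maximum always exceeds the hourly average, the estimated conditional mean necessarily sits above the conditioning value; how far above depends on sgd, which is the relationship the later sections model.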
Unfortunately, estimation of the unconditional joint distribution, f(x), of the 5-minute time-series is not easy. Non-parametric density estimation requires gigantic data sets when one is working in several dimensions. Parametric modelling also poses formidable computational problems. If f(x;θ) is the joint density of the 5-minute levels, then the log likelihood function, based on observing only a sequence of N hourly averages, is a sum of N terms of the form log [integral over S of f(x;θ) dx], each requiring an integral over a simplex.
ed in each 5 minute segment for long
periods of time, permitting direct
comparison of the hourly and 5-minute
levels. The first data set analyzed was from 18 monitors around the Kincaid power plant in Illinois, a coal-fired plant in Christian County, Ill., with a single 615-foot stack and a generating capacity of 1320 megawatts. The data set consists of nine months of observations from 18 stations around this plant. SO2 readings at these stations reflect the behavior of the plume from the stack. For a given monitor there are long stretches where SO2 levels are zero, indicating that the plume is not blowing toward the monitor. Such readings constitute about 72% of the hours in the data set; these were discarded before any further analysis was done. The second data set consists of SO2 data from a New York City monitoring station not near any dominant point source. The data were collected between December 15, 1981, and March 11, 1984.
3.2 Outline of Methods Used
We explored three empirical methods of
estimating the maximum 5-minute
reading from the hourly average. All
three methods postulate a simple
parametric model for the maximum as a
function of the average. The methods
differ only in how estimates of the
parameters are obtained. The first
method obtains parameter estimates
from data containing S-minuta readings
and then uses these estimates for
other data sets collected elsewhere
(and containing only 1 hour readings).
This method is motivated by the theory
that there is a universal law govern-
ing the relationship between the
maximum and the average of an SO2 time
series, with the same parameters at
all sites. The second method requires
expending effort to collect 5-minute
data for a short period of time at the
site of interest and using the data
from this period to obtain parameter
estimates that will be used over
much longer periods when sampling is
only on the 1-hour basis. The third
method fits a simple parametric model
to the maximum hourly reading in a
12-hour block as a function of the
average over the 12-hour block and
then assumes that the same model
with the same numeric estimates
describes the maximum 5-minute
level in an hour as a function of the
hourly average. (Daily cycles are
removed from the 12-hour block data
prior to estimation by dividing by
long-term averages over a fixed hour
of the clock.) For mnemonic purposes,
we will call these three methods:
1. the method of universal constants,
2. the method of short-term monitors,
and 3. the method of change of time-scale.
Estimates of the potential errors in
the method of universal constants were
obtained by using the parameter
estimates from the New York data to
fit the Kincaid data and vice versa.
Potential errors in the method of
short-term monitors were estimated by
dividing both data sets into batches
100 hours long and then using each of
the hundred-odd resulting parameter
estimates to fit 13 randomly selected
hours. The hours were chosen by
dividing the range of hourly averages
into 13 intervals and choosing one
hour from each interval. Potential
errors in the method of change of
time-scale were obtained by simply
comparing the maxima predicted using
the estimates from the 12-hour blocks
in each data set with the observed
maxima in the same data.
3.3 Parametric Models for the Maximum
The parametric models proposed here
are intended to give ad hoc approxima-
tions to the maximum. One can show
that they cannot be the true theoreti-
cal formulae. Because the maximum
necessarily increases as the average
increases, it is more convenient to
work with the ratio of the maximum to
the average than with the maximum
itself. Previous authors (Larsen et
al . , 1971) working on this problem
have used models in which log(ratio)
is linear in 1og(average). Therefore,
we began by fitting such a model to
the two data sets by ordinary least
squares. These estimates are given in
Table 1. As may readily be checked,
for both data sets, this model leads
to impossible values, fitted ratios
which are less than one, for large
values of the average. For the
Kincaid data, this occurs at rela-
tively low values of the average.
In fact, it is not thought that a
single universal set of constants
applies to the regression of log
(ratio) on log(average). Rather, it
is thought . that the atmospheric
conditions around the monitor are
classified into one of seven stability
classes; and it may be more appro-
priate to assume the parameters
of the regression are constant within
a given stability class. It is
possible that the impossible values of
the fitted maximum occur because of a
Simpson's paradox in the pooling of
data from several stability classes.
Ideally, the above model should be
fitted separately to each stability
class. Unfortunately, there were no
meteorological data available to

permit such a partition of the data.
It is possible that it would be
worthwhile to obtain such data and
redo the analysis. The difference
between the Kincaid and New York City
sites must be emphasized. The
sources and variability of pollution
are very different, and it may not be
reasonable to extrapolate from one
site to another; two data sets from
like sites should be considered in
subsequent analyses.
In order to prevent the occurrence of
impossible fitted values, we fit
models in which the log[log(ratio)] is
a linear function of the log(average).
The ordinary least square (OLS)
estimates (for New York and Kincaid)
of this line are also given in Table
1. Figures 1 and 2 show the scatter
plots of the maximum vs the average.
Both axes have logarithmic scales. If
the log of the ratio were linear in
the log of the average, one would
expect that the vertical width
of the scatterplot would remain
roughly constant as the average
varied. Instead, it appears that the
scatterplots narrow vertically as the
average increases, as would be
expected if the iterated logarithm of
the ratio were linear in the log of
the average. For both data sets, it
appears that the iterated log model
mimics the real data more accurately
than the former model does; it captures
the diminishing (on the log scale)
spread of the maximum with increasing
values of the hourly average. This
model is therefore the preferable one
to estimate the maximum.
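A minimal sketch of this fit on synthetic stand-in data (the original readings are not reproduced here): the iterated-log model is fit by OLS, and, unlike the log-log model, its fitted ratio exp(exp(a + b*log(avg))) can never fall below one.

```python
import numpy as np

# Hypothetical hourly data: x = hourly averages, m = observed 5-minute maxima.
rng = np.random.default_rng(0)
x = np.exp(rng.normal(2.0, 1.0, size=500))          # hourly averages
ratio = 1.0 + rng.lognormal(-1.0, 0.5, size=500)    # max/average ratios, all > 1
m = ratio * x

# Fit loglog(ratio) = a + b*log(average) by ordinary least squares.
y = np.log(np.log(m / x))                  # iterated log of the ratio (needs ratio > 1)
X = np.column_stack([np.ones_like(x), np.log(x)])
b_hat, *_ = np.linalg.lstsq(X, y, rcond=None)
intercept, slope = b_hat

# The back-transformed ratio exp(exp(a + b*log(avg))) is always >= 1,
# so this model cannot produce the impossible values the log-log model can.
fitted_ratio = np.exp(np.exp(intercept + slope * np.log(x)))
```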
In both data sets, the residuals were
slightly negatively skewed with the
skewness being greater in the Kincaid
data. It seems reasonable to assume
that the residuals in the New York
data were approximately normal. This
assumption is harder to maintain for
the Kincaid data. Figures 3 and 4
show the histograms and normal
probability plots for the residuals
from these two regressions.
The main purpose of the analysis is to
obtain a formula for estimating the
conditional distribution of the
unobserved 5-minute maxima from the
observed hourly averages.	The
iterated log vs log models yield the
following two formulae, given in
equations 2 and 3.
(2) Prob(5-minute max ≤ x | hourly average = y) =
F(x | y) =
Φ({loglog(x/y) + .267*log(y) + .719}/.62)
for New York
(3) Prob(5-minute max ≤ x | hourly average = y) =
F(x | y) =
G(loglog(x/y) + .258*log(y) + .191)
for Kincaid.
Here Φ is the standard normal cumulative
distribution function and G is the empirical
distribution function of the residuals
of the OLS regression of loglog ratio
on log average. We recommend using G
in place of treating these residuals
as normal. G is tabulated in table 2;
its histogram is graphed in figure 4.
Equations 2 and 3 do a reasonably good
job of modelling the observed maxima
in the two data sets from which the
values of the parameter estimates were
derived.
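As an illustrative sketch, equation 2 (with the New York constants) can be evaluated with nothing more than the error function; it is valid only for x > y, since the maximum always exceeds the average.

```python
import math

def norm_cdf(z):
    """Standard normal CDF via the error function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def prob_max_below(x, y, slope=0.267, intercept=0.719, resid_sd=0.62):
    """Equation (2): P(5-minute max <= x | hourly average = y), New York fit.

    Requires x > y, since the maximum always exceeds the average."""
    z = (math.log(math.log(x / y)) + slope * math.log(y) + intercept) / resid_sd
    return norm_cdf(z)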
Inverting equations 2 and 3 gives
simple formulae for the percentiles of
the conditional distribution of the
5-minute maxima. Notice that equation
3, table 2, and linear interpol-
ation permit estimation of percent-
iles of the Kincaid maxima from the
5'th to the 95'th. Attempts to
estimate more extreme percentiles
would require foolishly rash extrapolation.
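For the normal-residual case, inverting equation 2 is direct once a normal quantile is available; a bisection inverse keeps this sketch self-contained (New York constants from equation 2, for illustration only).

```python
import math

def norm_cdf(z):
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def norm_quantile(p, lo=-8.0, hi=8.0):
    """Invert the standard normal CDF by bisection."""
    for _ in range(80):
        mid = (lo + hi) / 2.0
        if norm_cdf(mid) < p:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0

def max_percentile(p, y, slope=0.267, intercept=0.719, resid_sd=0.62):
    """p'th percentile of the 5-minute max given hourly average y (eq. 2 inverted)."""
    z = norm_quantile(p)
    return y * math.exp(math.exp(resid_sd * z - slope * math.log(y) - intercept))
```

For the Kincaid fit, equation 3, one would replace the normal quantile with a table lookup and linear interpolation in the tabulated G.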
The log vs log models provide a
competing (and somewhat inferior)
method of estimation. They yield
conditional distributions of the
5-minute maxima given by equations 4
and 5.
(4) Prob(5-minute max ≤ x | hourly average = y) =
F(x | y) =
Φ({log(x/y) + .077*log(y) - .499}/.2)
for New York
(5) Prob(5-minute max ≤ x | hourly average = y) =
F(x | y) =
Φ({log(x/y) + .21*log(y) - 1.07}/.69)
for Kincaid.
In these regressions, we found it
acceptable to use a normal approxima-
tion for the residuals in both New
York and Kincaid.
4. Error Estimation
4.1 Errors in the Method of Universal Constants
It is not feasible to use a conven-
tional method to estimate the uncer-
tainty in the maxima fitted with this

method. The major difficulty is that
one is not looking for a well-behaved
estimator but rather for a particular
numeric value of the estimate for use
in all data sets. The standard error
of the estimate in one data set is
quite misleading as a measure of the
error that would result from using
that same estimate in another data
set. A further exacerbation results
from the high correlation between the
observations used to generate the
estimates. The conventional formulae
for the standard errors will exag-
gerate the amount of information in
the data set and yield spuriously
small standard errors. Finally, there
is the problem that one knows that the
model is theoretically incorrect and
that the true underlying distribution
is unknown so the conventional
standard error formulae based on the
modeled distribution are necessarily
in error. One would suspect that even
if the model adequately approximates
the first moment of the maximum, it
approximates the second moment less
well.
As an alternative method for estimat-
ing the uncertainty in the method of
universal constants for all data sets,
a cross-validation method was pursued.
We used the estimated parameters from
each of the New York and Kincaid data
sets to estimate the maxima for the
other data set. For each hour, the
estimated maximum was divided by the
actual maximum, and the resulting ratios
were grouped into 10 bins according
to the value of the hourly average.
Within each of these bins, we computed
the three quartiles of the quotients
of fitted over actual maxima. Figures
5 and 6 show these three quartiles of
the fitted over true ratios, plotted
against the midpoint of the hourly
averages in the bin.
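The binning computation is simple to express; below is a sketch with synthetic stand-ins for the fitted and actual maxima (the decile bin edges are an assumption, since the paper's exact edges are not given).

```python
import numpy as np

rng = np.random.default_rng(1)
hourly_avg = np.exp(rng.normal(2.0, 1.0, 2000))
actual_max = hourly_avg * (1.0 + rng.lognormal(-1.0, 0.5, 2000))
fitted_max = hourly_avg * (1.0 + rng.lognormal(-0.9, 0.5, 2000))  # stand-in fit

# Bin the hours by hourly average and take quartiles of fitted/actual per bin.
ratios = fitted_max / actual_max
edges = np.quantile(hourly_avg, np.linspace(0, 1, 11))   # 10 decile bins
bins = np.clip(np.searchsorted(edges, hourly_avg, side="right") - 1, 0, 9)
quartiles = {b: np.percentile(ratios[bins == b], [25, 50, 75]) for b in range(10)}
```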
One should recall that the Kincaid
data reflect the situation near a
point source while the New York data
reflect ambient levels far from any
point source.	Consequently, this
method of cross-validation may
exaggerate the error associated with
this procedure.	However, unless
additional 5-minute data are collected
and analyzed from a second plant and
from a second population center
station, it is difficult to determine
how much of the error is due to the
disparity of sites and how much due
to the method.
The most striking feature of these
plots is that the two cross-valida-
tions are biased (necessarily, in
opposite directions).	The higher
values of the hourly average (the
right half of the graph) are of
greater interest. For the New York
data, the first quartile of the ratio
of fitted over the actual maximum is
greater than 1; i.e. the estimated
maximum is too high three fourths of
the time. The median of the fitted
over actual ratio is, for most hourly
averages, over 1.2; i.e. the estimated
maximum is 20% too high more than half
the time. The estimated maximum is
30-40% too high at least a quarter of
the time. The situation at Kincaid is
essentially the mirror image of this:
for the higher values of the hourly
average, the third quartile of the
fitted over actual ratio is below .9;
i.e. estimated maxima are at least 10%
too low nearly three fourths of the
time. They are 30-40% too low nearly
half the time and 50-60% too low at
least a quarter of the time.
The proportionate error diminishes as
the hourly average goes up. This, of
course, is an artifact of using fitted
value/true value as the measure of
error. In absolute size (μg/m³), the
errors would not diminish as the
hourly average increases.
4.2 Errors in the Method of Short-
term Monitors.
In order to estimate the errors
associated with attempting to estimate
parameters of the ratio-average
relationship at a given site by
actually measuring 5-minute levels for
a short time, each data set was
divided into batches 100 hours long
and OLS estimates were derived for
each batch.	There are 125 such
batches in the New York data and 158
batches in the Kincaid data.
It is difficult to judge the potential
errors in estimating the maxima by simply
looking at the uncertainty in these
parameters. In order to further
clarify the errors of direct interest,
we divided the hours into 13 bins,
according to the size of their hourly
averages. For each OLS estimate from
a batch, we randomly selected one hour
from each of the 13 bins and computed
the quotient of the fitted maximum to
the true maximum for each hour. We
then computed the three quartiles of
the resulting quotients in each of the
bins. Figures 7-10 show these three
quartiles, plotted against the hourly averages.
In contrast to the previous method,
these estimators are nearly median
unbiased. That is, the median value of
the quotient is just about 1, corres-
ponding to accurate estimation. For
hourly averages greater than 1 μg/m³,
one can see that the iterated log
models lead to estimates of the maxima

that are within 20 to 40% of the
actual maxima at least half the time
for the Kincaid data and within 10% at
least half the time for the New York
data. That is, the first and third
quartiles of the fitted over actual
ratios fall at .9 and 1.1 for New
York, at .8 and 1.2 for Kincaid (at
least on the right half of the plots).
The log models have roughly the same
error rates. It is also worth
noting that, for the Kincaid data, the
log models continue to give impossible
fitted values in many cases.
Comparing these results to those
obtained from the method of universal
constants, one can see that the method
of short-term monitors offers some
improvement in accuracy over the
former method, where the estimates are
noticeably biased and errors of
20% in the estimated maximum occur
half the time. The increased accuracy
is much more noticeable with the New
York data.	At this time it is
impossible to say whether a comparable
difference in accuracy would be
present at most population center
stations and absent at most point
source stations.
4.3 Errors in the Method of Time-Scale Change
The third method suggested was to
remove a daily cycle from the observed
hourly data and then assume that the
relationship between peak and mean of
twelve hourly readings is the same as
that in twelve 5-minute readings.
A priori, one would expect this
method to be the least effective of
the three.	The correlation of
successive 5-minute readings will be
higher than that of successive hourly
averages; averages over longer time
scales should come from distributions
closer to Gaussian so the functional
form of the underlying distributions
will not be the same. In fact, the
parameter estimates obtained this way
are seriously in error, as can be seen
by comparing the estimates in Table 3
with those in Table 1.
Figure 11 shows plots of quotients of
the maximum estimated from the 1-hour
to 12-hour relation to the maximum
estimated from the actual 5-minute to
1-hour relation. Results from both
sets and both the log vs log and the
iterated log vs log model are graphed.
At high levels, the estimates in New
York are too high by 10-20%; at low
levels, they are seriously biased low.
In the Kincaid data, estimates from
the iterated log vs log model are
too high by 50-60%; the performance of
the log vs log model is even worse.
These plots, which roughly correspond
to the median accuracy using this
method, were so bad that we did no
further investigation for the Kincaid data.
A similar procedure was applied to
the New York data to predict the
maximum for the iterated log and log
models, respectively, with results
similar to those obtained from the
Kincaid data.	The predictions are
biased high; three fourths of the
time, the fitted value is at least 5
or 10% too high; half the time, the
fitted value is at least 10 or 20% too
high. Somewhat surprisingly, the log
versus log model performs somewhat
better than the iterated log versus
log model for this data set.
5.	Estimation of an Arbitrary
5-Minute SO2 Level
The second objective of the analysis
was to find a model for the condi-
tional distribution of an arbitrary
5-minute SO2 level, given the hourly
SO2 average. As an alternative to the
theoretical calculation, the following
ad hoc method was considered.
1.	Use deviations of 5-minute
SO2 levels from their hourly
averages, rather than the
5-minute levels themselves.
2.	Make deviations from dif-
ferent hours comparable by
dividing them by a suitable
scaling factor.	The usual
scaling factors, the standard
deviation or the interquartile
range within an hour, cannot be
used because one wants a method
that can be used when knowledge
of variability within an hour is
not available. The scale factor
must depend only on the hourly
average. We employed a scale
factor of the form
exp(B*log(hourly average) + A).
The slope and intercept, B and A,
were obtained by OLS regression
of log(hourly SD) on log(hourly
average), in each data set
separately. In practice, it would
be necessary to use the parameter
estimates from these two data
sets in future data sets which
contain only SO2 hourly averages.
3.	Pool all the scaled
deviations together and fit a
simple parametric model to the
resulting empirical distribution.
This three-step method was applied
separately to each data set. The
estimated conditional distribution
function is given by equation 6.

(6) Prob(5-minute SO2 level ≤ x | hourly SO2 average = y) =
F(x | y) =
Φ((x - y)/exp(B*ln(y) + A)).
The numerical values of A and B are
given in table 4.
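A sketch of equation 6 follows; the slope and intercept arguments stand in for the site-specific values in table 4 (the numbers used in the usage line below are placeholders, not the fitted values).

```python
import math

def scale_factor(y, B, A):
    """Scale factor exp(B*log(y) + A), depending only on the hourly average y."""
    return math.exp(B * math.log(y) + A)

def prob_level_below(x, y, B, A):
    """Equation (6): P(5-minute level <= x | hourly average = y),
    with a standard normal law for the scaled deviation (x - y)/scale."""
    z = (x - y) / scale_factor(y, B, A)
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

# Usage with placeholder parameters: at x = y the scaled deviation is zero,
# so the conditional probability is exactly one half.
p_at_mean = prob_level_below(2.0, 2.0, 0.6, 0.1)
```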
We found that the standard normal
distribution worked acceptably well
for both the New York and the Kincaid
data. An attempt to use a three
parameter gamma distribution to
compensate for some skewness in the
scaled deviations did not lead to
enough improvement to justify the
introduction of the extra parameters.
One should note that there is a
systematic error in this procedure
that was not present in modelling of
the maximum.	Given the serial
correlation of successive five-minute
readings, the readings in the middle
of the hour will be more highly
correlated with the hourly average
than will the first or last readings.
The model in equation 6 is intended,
at best, to predict the value of a
5-minute reading selected at random
from one of the twelve time slots
during an hour, not the value of a
5-minute reading from a specified time slot.
6. Error in the Estimation of Any
5-Minute S02 Level
There are two types of error that one
may consider here. First, there is the
error in using equation 6 to estimate
the proportion of 5-minute readings
which exceed a given level of SO2.
Second, there is the error in using
the equation to estimate the level of
SO2 that corresponds to a given
percentile of the distribution of
5-minute readings. If one is con-
cerned about the frequency of exceed-
ances of a threshold for health
effects, it is the first type of error
that is of interest. We will discuss
only the estimation of this first type
of error.
Cross-validation between the two data
sets was used to measure the error.
The estimated slope and intercept of
the scaling factor (the only unknown
parameters in the model) from the New
York data and the observed hourly
averages from the Kincaid data were used
to predict the scaling factors in the
Kincaid data. We then divided all the
observed deviations from the hourly
averages by these scaling factors. If
the parameter estimates are good,
these scaled deviations should be
close to a standard normal distribution.
We grouped these scaled deviations
into 16 bins, according to the level
of the hourly average. To quantify
how well the estimates performed, we
computed, for each of the 16 bins, the
observed proportion, p̂, of scaled
deviations which exceeded the values
-2, -1, -.5, +.5, +1, +2. This
corresponds to using as thresholds the
5'th, 15'th, 30'th, 70'th, 85'th and
95'th percentiles of the 5-minute
readings, computed using the correct
parameters. Figure 14 shows the plots
of these six p̂'s against the hourly
average. (The six curves correspond
to the nominal 5'th through 95'th
percentiles; the ordinate shows the
percentage of scaled deviations
actually less than that threshold.)
The whole procedure was then repeated,
reversing the roles of the New York
and Kincaid data sets. Figure 15
shows the plots of the p̂'s from New
York data with Kincaid parameters.
It can be seen from these two plots
that the 5-minute readings in the
Kincaid data are more dispersed about
their hourly averages than would be
expected from the New York data. At
high values of the average, a thresh-
old which one would expect to be the
70'th percentile is actually only the
55'th to 60'th percentile; what one
would expect to be the 85'th percen-
tile is actually between the 60'th and
the 70'th percentile; what one
would expect to be the 95'th percen-
tile is actually only about the 70'th
to the 80'th percentile. Consequent-
ly, if one were using the New York
data for parameter estimates, one
would noticeably underestimate the
frequency of exceedances of a threshold.
Necessarily, one finds the opposite
situation when 5-minute readings in
New York are inferred from the Kincaid
data. As shown in figure 15, a
threshold that one would expect, on
the basis of the Kincaid data, to be
only the 70'th percentile of 5-minute
readings would actually be nearly the
95'th percentile in New York.
Consequently, if one were using the
Kincaid data for parameter estimates,
one would noticeably overestimate the
frequency of exceedances.
7. Theoretical Modelling of the Joint
Distribution of 5-Minute Levels
We made some attempts to explore
theoretically motivated paramet-
ric models for the third problem
listed in the Introduction, namely
estimation of the joint distribution
of the 5-minute levels, conditional on
the hourly average. The most popular
choice of marginal distribution for

S02 levels, when averages over a
single length of time are observed, is
the lognormal. We therefore tested
the goodness-of-fit of the lognormal
distribution to the 5-minute sequences
at Kincaid and New York. The 5-minute
readings at New York appeared to fit a
lognormal distribution acceptably. (A
formal test would reject the hypo-
thesis of lognormality. However, it
appears that the deviation from the
lognormal is small enough to be of no
practical importance even though the
enormous sample size leads to formal
rejection of the model.) The 5-minute
readings at Kincaid appeared notice-
ably more leptokurtic than a lognormal
distribution. We therefore did no
further work with the Kincaid data.
Estimation of the joint conditional
distribution requires three further
assumptions. First, we assume that
the unconditional joint distribution
of all the logs of 5-minute levels is
multivariate normal.	This seems
reasonable in light of the approxi-
mate marginal lognormality. Second,
we assume that the autocorrelation
structure of the sequence of loga-
rithms of the 5-minute levels is a
simple serial correlation, the
correlation at lag i being just rho to
the i'th power. This is necessary to
keep the number of parameters in the
model down to three. In fact, the
sample correlations at lags 2 to 4 are
not too far from the second to fourth
powers of the lag 1 correlation.
Third, we assume that the hourly
average observed was the geometric
mean of the twelve 5-minute levels,
although it was in fact the arithmetic
mean. This assumption is explicitly
false: the true geometric mean is
smaller than the observed average, but
the higher the correlation between
successive 5-minute readings, the
smaller the difference between the
arithmetic and geometric means. This
assumption is made in order to get an
algebraically tractable problem and
with the hope that the high serial
correlation will make it close to
true. With these three assumptions,
it follows that the logs of the
5-minute levels and the log of
their geometric mean come from a
13-dimensional normal distribution
with a rank 12 covariance matrix.
One now finds that the desired
conditional distribution of the
vector of 12 log 5-minute readings,
given the log of the geometric mean,
is 12-dimensional normal with mean and
variance given by the standard
multivariate regression formulae.
Letting Zi = log of the i'th 5-minute
reading and Zbar = log of the geometric
mean, the mean and variance-covariance
matrix of this conditional distribution
are given by equations 7A and 7B:
(7A) E(Zi | Zbar) =
mu + Cov(Zi, Zbar)*(Zbar - mu)/Var(Zbar)
(7B) Var(Z | Zbar) =
Var(Z) - Cov(Z, Zbar)Cov(Z, Zbar)'/Var(Zbar).
In more detail, the i'th coordinate of
the vector of covariances of the logs
of the 5-minute readings and the log
of the geometric mean, Cov(Zi, Zbar), is
equal to
sigma^2 * {1 + rho + rho^2 + ... + rho^(i-1) + rho + ... + rho^(12-i)}/12,
and the variance of the log of the
geometric mean is equal to
V = sigma^2 * {12 + 2*[11*rho + 10*rho^2 + ... + rho^11]}/144.
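Under the stated correlation structure, Corr(Zi, Zj) = rho^|i-j|, the quantities in equations 7A and 7B reduce to row means and the grand mean of the joint covariance matrix; the following sketch computes the conditional moments for illustrative parameter values (not estimates from either data set).

```python
import numpy as np

def conditional_moments(mu, sigma, rho, zbar, n=12):
    """Mean and covariance of the n log 5-minute levels given Zbar,
    the log geometric mean (equations 7A and 7B).

    Assumes log levels are N(mu, sigma^2) with Corr(Z_i, Z_j) = rho**|i-j|."""
    i = np.arange(n)
    Sigma = sigma**2 * rho ** np.abs(i[:, None] - i[None, :])  # joint covariance
    cov_zi_zbar = Sigma.mean(axis=1)    # Cov(Z_i, Zbar): row means of Sigma
    var_zbar = Sigma.mean()             # Var(Zbar) = V: grand mean of Sigma
    cond_mean = mu + cov_zi_zbar * (zbar - mu) / var_zbar               # (7A)
    cond_cov = Sigma - np.outer(cov_zi_zbar, cov_zi_zbar) / var_zbar    # (7B)
    return cond_mean, cond_cov

cm, cc = conditional_moments(mu=0.0, sigma=1.0, rho=0.9, zbar=0.5)
```

As a check on the rank-12 structure, the conditional covariance annihilates the averaging vector: given Zbar, the mean of the Zi has no remaining variance.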
The problem of estimating the joint
distribution of the 5-minute levels,
given the hourly average, is now
reduced to the problem of estimating
the three parameters (mu, sigma, and
rho) in the above expressions, when
one observes only the sequence of
hourly averages. Because the sequence
of observed logs of geometric means is
also a multivariate normal sequence,
it is simple to estimate the mean,
variance, and covariance of this
sequence. Specifically, the log of
the geometric mean is normal with mean
equal to mu, with variance equal to V
above. Furthermore, the logs of the
geometric means in successive hours
are bivariate normal with covariance
equal to
C = sigma^2 * {rho + 2*rho^2 + ... + 11*rho^11 + 12*rho^12 + 11*rho^13 + ... + rho^23}/144.
Recall, however, that one actually observes the
arithmetic mean rather than the
geometric mean. All of the above
equations and distributional formula-
tions are still valid. The only
problem is that they cannot be used
for computation if the geometric means
are not observed. We suggest that the
following approximations be used when
only the arithmetic means are ob-
served. First, compute the first and
second sample moments of the observed
sequence of arithmetic means and use
these values to get method of moments
estimates of the parameters mu,
sigma, and rho. (The arithmetic means
are not lognormal so these are not
maximum likelihood estimates.) These
parameter estimates then specify
numerically the joint distribution of
the 5-minute levels, given the
geometric mean. To complete specif-
ication of this distribution, one need
only give a numeric estimate, based on
the arithmetic mean, of the geometric
mean. A reasonable choice is to set
the estimated sample geometric mean
equal to the observed sample arith-
metic mean times the ratio of the
estimated expectation of the geometric
mean to the estimated expectation of
the arithmetic mean.
Application of the above protocol
requires only expressions, in terms of
mu, sigma, and rho, for four moments:
the expectations of the sample
arithmetic and geometric means, the
variance of the sample arithmetic
mean, and the covariance of the
arithmetic means of successive hours.
Given that the logs of the 5-minute
readings are serially correlated
normal(mu, sigma^2)'s, the expected
values of the arithmetic and geometric
means are, respectively,
Ea = exp(mu + sigma^2/2) and
Eg = exp(mu + theta*sigma^2/2), where
theta = {12 + 2*[11*rho + 10*rho^2 + ... + rho^11]}/144.
The variance of the arithmetic mean is
Va = exp(2*mu + sigma^2) * {12*(exp(sigma^2) - 1) +
2*[11*(exp(sigma^2*rho) - 1) + 10*(exp(sigma^2*rho^2) - 1) + ... +
(exp(sigma^2*rho^11) - 1)]}/144.
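These moment expressions, together with the suggested rescaling of the observed arithmetic mean, can be sketched as follows (mu, sigma, rho, and the observed average are illustrative values, not estimates from either data set).

```python
import numpy as np

def moments(mu, sigma, rho, n=12):
    """Expectations of the arithmetic (Ea) and geometric (Eg) means of n
    serially correlated lognormal 5-minute levels, and Var of the
    arithmetic mean (Va)."""
    k = np.arange(1, n)                 # lags 1..11
    w = n - k                           # weights 11, 10, ..., 1
    theta = (n + 2.0 * np.sum(w * rho**k)) / n**2   # = {12 + 2*[11*rho + ...]}/144
    Ea = np.exp(mu + sigma**2 / 2.0)
    Eg = np.exp(mu + theta * sigma**2 / 2.0)
    Va = np.exp(2 * mu + sigma**2) * (
        n * (np.exp(sigma**2) - 1.0)
        + 2.0 * np.sum(w * (np.exp(sigma**2 * rho**k) - 1.0))
    ) / n**2
    return Ea, Eg, Va

# Estimate the unobserved geometric mean from an observed arithmetic mean
# by rescaling with the ratio of the two expectations.
Ea, Eg, Va = moments(mu=0.0, sigma=1.0, rho=0.9)
observed_arith_mean = 1.8                    # hypothetical hourly average
estimated_geo_mean = observed_arith_mean * Eg / Ea
```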
estimating the maximum. If standards
are to be established with the inten-
tion of limiting the health effects
associated with high short-term
exposures, then these limits on the
accuracy in prediction must be borne
in mind in the setting of standards.
Given the ad hoc nature of the
parametric models used, one might
try other parametrizations--e.g.
estimate the transfer function
between the time series of hourly
averages and the time series of
hourly maxima—to see if better
approximations can be obtained.
Because the iterated log model does a
fairly good job of estimating the
maxima in the data set from which the
parameters were estimated and because
the marginal distributions at the two
sites considered are not even of the
same form, we think it unlikely that
other choices of parametrization will
lead to much reduction in the cross-
validation errors.
The task of estimating the conditional
distribution of an arbitrary 5-minute
level, given the hourly average,
appears to be equally difficult. It
appears that using ad hoc parameter
estimates obtained from one site to
predict 5-minute levels at another
site leads to biased predictions. In
the two data sets compared here, it
was impossible to tell reliably
whether a given level would be
exceeded 5% or 30% of the time.
Estimation of the joint distribution
of all twelve 5-minute levels, given
their average, appears feasible only
if one is prepared to assume a
lognormal unconditional distribution
for these readings. There are data sets
for which this is demonstrably not true.
Thus, it again appears that the most
reliable estimates can be obtained
only by observing at least enough of
the 5-minute sequence to check
lognormality roughly.
References
(1) Grandell, Jan (1984), Stochastic Models of Air Pollutant Concentration, Springer-Verlag, Berlin.
(2) Johnson, Norman and Kotz, Samuel (1970), Continuous Univariate Distributions, vol. 1, John Wiley & Sons, New York.
(3) Larsen, Ralph (1971), A Mathematical Model for Relating Air Quality Measurements to Air Quality Standards, U.S. Environmental Protection Agency, Office of Air Programs, Research Triangle Park, North Carolina.
(4) Legrand, Michael (1974), Statistical Studies of Urban Air Pollution: Sulfur Dioxide and Smoke, in Statistical and Mathematical Aspects of Pollution Problems (John Pratt, ed.), Marcel Dekker, Inc., New York.
(5) Pollack, I. (1975), Studies of Pollutant Frequency Distributions, U.S. Environmental Protection Agency, Office of Research and Development, Publication EPA-650/4-75-004, Research Triangle Park, North Carolina.

Table 1
Descriptive Statistics (Hr Avg, Hr SD, Log(Avg), Log(SD));
Regression of Log(Ratio) on Log(Average), with the value of the
average above which the fitted ratio falls below 1;
Regression of LogLog(Ratio) on Log(Average).
[Most numeric entries are not legible in this reproduction.]

Table 2
Distribution of Residuals at Kincaid
Value of residual    Percentile
-2.03                .05
-1.43                .10
-.70                 .25
.23                  .50
.76                  .75
1.24                 .90
1.43                 .95
Table 3
Regressions from Method of Change of Time Scale
Model     Data Set    Slope     Intercept
Iterated  New York    -0.0854   -0.415
Log       New York    -0.0528    0.716
Iterated  Kincaid     -0.12      0.606
Log       Kincaid     -0.170     2.010

Table 4
Fitted Models for Spread of 5-Minute Levels
Regression of Log(SD) on Log(Avg)
Station   Slope   Intercept   R Squared
NY        .687    -.972       .49
Kincaid   .645     .114       .53
Regression of SD on Average
Station   Slope   Intercept   R Squared
NY        .114     1.109      .33
Kincaid   1.197   -11.169     .67

[Figures 5-10: first quartile, median, and third quartile of the
fitted-over-actual maxima, plotted against the hourly average.
Figure 11: quotients of the maxima estimated from the 1-hr/12-hr
relation to those estimated from the 5-min/1-hr relation.
Figures 14-15: observed percentages falling below the nominal
5'th through 95'th percentile thresholds, plotted against the
hourly average. The graphics themselves are not legible in this
reproduction.]

Health Care Financing Administration,
2-D-2 Meadows East, 6325 Security Blvd., Baltimore, MD 21207
A recent editorial suggested that there be no
new data collection until present data sets are
thoroughly analyzed. This is a tough standard. Even if
one attempted to thoroughly analyze present data
sets, there would always be the possibility for more
analysis. This is especially true when one considers
analyses based on multiple data sets - meta-analysis.
The authors are to be commended for their
extensive data analyses. Of course some of us
remain disappointed that certain parametric and
nonparametric models were not explored because of
complexity. In stating the reasons for not doing
certain analyses, I think the authors take a narrow
view of what is possible. The issues may be more
ones of cost, time, or expected return. This in no
way undermines the value of the extensive empirical
exploration of the data undertaken by the authors.
The authors set a task of establishing a
relationship between studies in which data are
recorded in short, 5-minute, intervals and the more
common choice of hourly summaries. They are
especially interested in establishing this
relationship because they believe it is necessary to
have information on the short time records to
establish health effects.
When the basic process is observed from
several points of view, different measurements,
such as the 5-minute and the hourly measurements,
should be expressible in terms of the common
process observed. The perspective of a common
process being observed from different points of
view provides the framework or model to work from.
From this perspective, distinctly different
measurements or measurement processes generally
are not equally informative of the process and the
statistical properties or these measurement
processes are not the same. In analyzing the data,
It Is important to remember that the measurement
process Is part of the observation and more than
one quantity may be needed to describe the process.
The model for the process generally will be a
combination of stochastic and deterministic
components. An issue underlying the effort to
evaluate different methods of observation is that
precision as well as costs differ.
To deal with the basic problem, it helps to
have a model that consists of the underlying
process to be observed and the measurements
used to observe the process. An evaluation with
such a model may suggest alternative measurement
strategies. For example, the measurement strategy
may consist of obtaining a fixed quantity over a
random time interval instead of obtaining a measure
over a fixed time interval. The idea is clearly
suggested by the analogy with a Poisson counting
process. In counting statistics, two strategies are
commonly used. One uses a fixed interval and
obtains the count while the other specifies a count
and measures the time to obtain this count. These
strategies can be evaluated to compare costs and
precision for a given situation.
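The two counting strategies described above can be compared directly by simulation; the sketch below, with illustrative rate, window, and count values, estimates the rate both ways and reports the mean and variance of each estimator.

```python
import random

def estimate_fixed_time(rate, window, rng):
    """Strategy 1: count Poisson events in a fixed window; rate = count / window."""
    count, t = 0, rng.expovariate(rate)
    while t <= window:
        count += 1
        t += rng.expovariate(rate)
    return count / window

def estimate_fixed_count(rate, n, rng):
    """Strategy 2: wait for a fixed count n; rate = n / elapsed time."""
    elapsed = sum(rng.expovariate(rate) for _ in range(n))
    return n / elapsed

def mean_var(xs):
    m = sum(xs) / len(xs)
    return m, sum((x - m) ** 2 for x in xs) / (len(xs) - 1)

rng = random.Random(42)
true_rate = 2.0          # illustrative event rate
ft = [estimate_fixed_time(true_rate, 10.0, rng) for _ in range(2000)]
fc = [estimate_fixed_count(true_rate, 20, rng) for _ in range(2000)]
print("fixed time  (mean, var):", mean_var(ft))
print("fixed count (mean, var):", mean_var(fc))
```

Given a cost per unit of observation time and a cost per analyzed count, the same simulation can be rerun to pick whichever strategy buys more precision per dollar in a given situation.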
The main concomitant measures explored
were time of day and a meteorological factor, wind
direction. These and other concomitant measures
need to be part of the model. I would like to see
more attention paid to concomitant factors at the
two sites.
The authors state in their conclusions, "the
theory that there is a simple relationship between
the 5-minute and hourly averages, governed by the
same constants for all sites, is not borne out by the
two data sets examined."
The conclusions and recommendations are
fundamentally sound. The authors recommend
calibrating a model for each site. In this way
differences among observed processes are properly
recognized even if they are not explicitly modeled.
The opinions are those of the author and do not
necessarily reflect the opinions of the Health Care
Financing Administration.

John C. Bailar III
Department of Epidemiology & Biostatistics, McGill University
Montreal, PQ, Canada H3A 1A2
Office of Disease Prevention & Health Promotion, U. S. Public Health Serv.
Switzer Building, Room 2132, 330 C Street, S.W., Washington, D.C. 20201
This summary of the conference is intended to
provide some brief and integrated commentary on the
eight papers and eight discussions presented here
(1-16), plus some perspective on broader issues raised
by the papers as a group but not covered by any one of
them. I will say much about unsolved problems. Of course,
the more one knows about a situation, the easier it is
to critique specific points and point to things that
should be done. This is good for bringing out issues,
but it can be bad if it creates an impression that
problems dominate solutions. I do not want my
comments here to be taken as a general indictment of
compliance sampling, a field that has recently made
much progress and is clearly making more.
The focus of the conference was compliance
sampling; this term includes both a) the general
assessment of how well we are doing in the
management of hazards and b) the generation of data
for individual action to enforce relevant laws and
regulations. My basic view, as a citizen and scientist
rather than a regulator, is that regulations should
provide and should be interpreted as firm limits rather
than targets, though they are often abused or
misinterpreted as targets. Examples include the
approaches of many states and cities to the control of
criteria air pollutants, and the apparent attitude of
parts of private industry that penalties for violations
are a business expense, to be balanced against
production volume and costs so as to maximize overall
profits. Carol Jones (17) has commented on the
effects of penalties on the probabilities of violations,
and at this Conference Holley (11) has discussed such
approaches in the context of bubbles.
But these two purposes of compliance sampling —
overall assessment and enforcement — are broad and
vague. There was very little said at the Conference
about the ultimate purposes, or even the penultimate
purposes, of these activities. This is a potentially
serious gap, because what we do (or should do) in
compliance sampling can be profoundly affected by
matters beyond the short term goals of accurate
assessment of the distribution and level of specific
hazardous agents. Is our ultimate goal to protect
human health? If so, what does that mean for the
design of a program in compliance sampling, given our
limits on time, money, attention, and other resources?
How are concerns about cancer to be balanced against
concerns about (say) birth defects, or heart disease?
How are concerns about health in the U.S. to be
balanced against health in other countries? How are
we to balance short-term protection of our own health
against protection far into the future, even across
generations not yet born? How should we view and
assess the quality of outdoor (ambient) air vs. indoor
air (Hunt, 4)? There are similar very broad questions
about direct health effects vs. the indirect health
effects of unemployment and poverty, or restricted
choices of important consumer goods, on protection of
health. How are such matters to be developed in a
context of concern about protection of non-health
values, such as limiting the role of government in
controlling private behavior or in facilitating
compensation for harm actually inflicted (perhaps at
much lower overall cost to society), the effects of
unenforced or unenforceable directives on respect for
the law in other areas, and many other matters? I
recognize that such issues are generally to be dealt
with at the highest political and social levels, but their
resolution can have a profound effect on compliance
sampling, and compliance samplers should understand
the issues and express themselves as knowledgable
professionals. .Whether an inspector chooses to return
to a plant that was in violation last month or to visit a
new plant may depend on how much the agency
depends on quiet negotiation vs. threats of legal
action. Whether limited resources are used to sample
for agents with acute, lethal, and readily identifiable
toxicity or for more common but less characteristic
and less devastating chronic disease may depend on
what recourse is available when injury is suspected.
Intensity of sampling (and of enforcement) in some
critical industry may even depend on the state of the
industry, and the state of the economy more generally.
The importance of defining the goals of compliance
sampling in the broadest way is clear. But we have not
dealt very well even with defining goals at more
technical levels. Suppose that a well-conceived
regulation sets a maximum exposure limit of 10 ppm.
Should compliance sampling be designed to give only a
yes/no answer, perhaps expressed as a Bernoulli
variable, about whether some stream, or factory, or
city is in violation? Should we instead try to
determine the mean exposure over some defined region
of time and space? The mean and variance, or the
tails generally? Should we go only for the order
statistics, especially the extremes (which will
generally provide a moving target as problems are
solved and compliance improves)? Do we need the
whole probability distribution of values? Surely a
yes/no answer can lead to much nonsense, as it did in
some erroneous interpretations by the news media of a
recent NAS report on drinking water, and some aspects
of the probability distribution of values need more
attention than others, but surely there is also a point
where we have learned enough about that distribution,
and must invest additional resources in the study of
other problems.
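The choice among these summaries is consequential even for a single small data set; the sketch below, using entirely hypothetical readings and an illustrative 10 ppm limit, shows how the yes/no verdict, the mean, the peak, and the exceedance fraction each tell a different story.

```python
# Hypothetical exposure readings (ppm) against an illustrative 10 ppm limit.
readings = [3.1, 4.7, 5.2, 6.0, 6.8, 7.4, 8.1, 9.3, 9.9, 12.4]
limit = 10.0

mean = sum(readings) / len(readings)                      # average exposure
peak = max(readings)                                      # extreme order statistic
exceed_frac = sum(r > limit for r in readings) / len(readings)
in_violation = peak > limit                               # bare yes/no verdict

print(f"mean={mean:.2f} ppm, peak={peak:.1f} ppm, "
      f"fraction over limit={exceed_frac:.2f}, in violation: {in_violation}")
```

A regulator watching only the mean (here about 7.3 ppm) sees comfortable compliance; one watching the peak sees a violation, which is precisely why the goal must be stated before the sampling plan is designed.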
Gilbert et al. illustrate this general need for precise
goals in their discussion (9) of sampling soil for
radioactivity. Was the underlying goal to determine
whether radiation levels at any square inch of surface
were above the standard? Was it to average, or
integrate, over some unspecified larger area? Was it
to determine means and variances, or other aspects of
the distribution? Here, maybe the goal was in fact to
determine means for small areas, but we would still
need to know more about the problem, especially about
the small-scale variability of contamination, to
determine an appropriate sampling plan. For example,
if contamination is nearly uniform within each area for

which a mean is required, one test per sampled area
may be enough. Conversely, if there is much chance of
having one very small, very hot rock (of, say, 10 ,
10 , or 10 pCi/g) one might have to sample on a
much finer grid. The general issue here is the scope or
range for averaging (or otherwise "smoothing") results.
Chesson (10) has also commented on needs for relating
statistical procedures to specific problems and
contexts. Holley's work on the bubble (11) deals with a
kind of averaging, but this Conference as a whole has
given rather little attention to even this level of goals.
Likewise, there was little discussion of how
strategies for compliance sampling must accommodate
the likelihood of legal challenge. A probable freedom
from such challenge may well have given Gilbert (9)
considerable latitude to be complex, to use a great
deal of peripheral information, and to interpret EPA's
raw standard as he settled on the scope and
distribution of sampled areas, to decide that he could
ignore possible variation over time, and to develop a
special sampling protocol.
At this point, one may begin to wonder about the
role of statistics (and statisticians) in compliance
sampling. I believe very strongly that the most visible,
and apparently the most characteristic aspects of
statistics - modeling of random variation, algebra, and
computation - are only a small (though essential) part
of the field. Statistics is, rather, the art and science
of interpreting quantitative data that are subject to
error, and indeed, in the study of environmental
hazards, random error may account for only a tiny part
of the uncertainty. Ross's discussion (12) brings out
clearly the real potential of statistics in the design of
bubbles as well as the way bubbles ignore some
important distributional issues.
I turn now to three sets of generic problems in
compliance sampling: those in policy and concept, in
unpredictable (stochastic) influences on the data, and
in applications of theory. These sets of problems are
broad and deep, and statistical thinking has a large and
critical role in each.
Policy and Conceptual Aspects of Compliance Sampling
The first set is related to policy and concepts. I
have already referred to the differences between broad
public goals and more narrowly statistical goals, but
there are many intermediate questions about what it is
that one wants to accomplish, and what is feasible.
Approaches to evaluation in many fields fall rather
well into three categories: evaluation of structure, of
process, and of outcome. Each can be defined at
multiple levels, but here it may be most useful to
equate structure to the chemical methods, engineering
and mechanical structures, and other aspects of the
generation of hazardous agent; process to the emission
or other release of the hazard into the community, its
transport after release, and exposure levels where
people are in fact exposed; and outcome to the human
health endpoints (or other endpoints) that are the more
fundamental objects of concern. Compliance sampling
focuses on process (in this context), but it is not clear
that there has been much hard policy thinking about
whether this is the best way to attain the still rather
fuzzy goals of the activity.
One aspect of this matter is the need to consider
sensitive subgroups of the population. Such subgroups
may not always be evident (as seems likely with some
carcinogens), and their existence may not even be
suspected, but somehow we must recognize not only
that some people get sick from exposures that do not
affect others, but that not all persons have the same
probability of responding to some toxic agent.
A related point is "conservatism" in regulation, and
its reflections in compliance sampling. Conservatism
has several purposes, including the protection of
sensitive subgroups, and the need to provide a cushion
against random and nonrandom excursions of exposure
to higher levels. I believe that its main use, however,
is to protect us against our ignorance, not against our
failures. We simply don't know what goes on within
the human body at low exposure levels of carcinogens
and other toxic agents, and choice of the wrong
statistical model could lead to risk estimates that are
wrong by orders of magnitude. Unfortunately,
underestimates of risk will tend to be far more serious
than overestimates if one works on a log scale, as is
implied by the phrase "orders of magnitude."
Implications of conservatism for compliance sampling
are substantial. It does little good to set conservative
limits for exposure if sampling, and hence
enforcement, do not follow. It is not at all clear that
regulatory agencies have been consistently attentive to
the logical link between conservatism in risk
assessment and conservatism in enforcement; indeed,
some agencies may have it backwards, and believe that
conservative exposure limits actually reduce the need
for compliance sampling. There is scope here for a
new study of how to trade off the risks and costs of
(say) a higher exposure limit plus more rigorous
sampling to assure compliance vs. a lower exposure
limit that is to be less vigorously enforced.
Another policy and conceptual issue in compliance
sampling has to do with distributional effects. When
dose-response curves are linear at low doses, the mean
exposure level in a population determines the expected
number of adverse events, but it may still matter a
great deal how the risk is distributed over the
population. For example, it is no longer acceptable (at
least in the U.S.) to concentrate the risks of toxic
exposures on the lowest economic and social groups.
Nor does one often hear arguments in favor of placing
a new toxic hazard in an area already contaminated on
grounds that a little more would not make much
difference, even though this might be rational if there
is reason to think that the risk is concentrated on a
small, sensitive subpopulation that has already been
"exhausted" by prior exposures.
Time does not permit more than a listing of some
other policy issues in compliance sampling. How
should ambient "natural" exposures to some agents,
such as ozone, be accommodated in protocols for
compliance sampling? What do we mean, in
operational terms, by an "instantaneous" exposure?
Marcus gave a strong start to the conference with his
discussion of the need to design compliance sampling
programs in light of the different time scales for
environmental exposure, biologic response, and
regulatory action (1), while Hertzberg (2) has pointed
to some of the practical problems of doing so. How
should, or how can, model uncertainty be built into
sampling plans, including models of distribution and
exposure as well as models of outcome?
Stochastic Aspects of Compliance Sampling
Issues to this point have not depended on any aspect
of uncertainty in measurement or on random
variability in the substance under study. The steps
from a precise deterministic model to an uncertain
stochastic model introduce new issues. What are the
roles of deterministic vs. stochastic models, and how

should those roles affect compliance sampling? It is
perhaps understandable that in enforcement actions,
compliance data are treated as free of random
variation, but surely this matter needs some careful attention.
Another issue arises from gaps in the data — gaps
that are sometimes by design and sometimes not.
There was little attention to this matter in this
conference. Though every applied statistician is
familiar with the problem, fewer are aware of the
theoretical and applied approaches that have been
worked out in recent years. These range from
modeling the whole data set and using iterative
maximum likelihood methods to estimate missing
values (the E-M algorithm) to the straightforward
duplication of some nearby value ("hot deck"
methods), which may be in error but not as far off as
ignoring the missing observations, which in practice
generally treats them as if they had the mean value
for that variable. Little and Rubin (18) provide an
introduction to this topic, and techniques analogous to
kriging, a method often used in geostatistics, may also
be useful (19).
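Two of the simpler repair strategies mentioned above can be contrasted on a toy series; the values and gap pattern below are illustrative only.

```python
# Toy hourly series with gaps (None marks a missing reading); values illustrative.
series = [4.0, 5.0, None, 7.0, 8.0, None, None, 3.0]

def fill_with_mean(xs):
    """Replace each gap with the mean of the observed values."""
    observed = [x for x in xs if x is not None]
    m = sum(observed) / len(observed)
    return [m if x is None else x for x in xs]

def fill_with_nearest_prior(xs):
    """Replace each gap by duplicating the most recent observed value."""
    out, last = [], None
    for x in xs:
        if x is not None:
            last = x
        out.append(last if x is None else x)
    return out

print(fill_with_mean(series))
print(fill_with_nearest_prior(series))
```

Mean filling flattens the series toward its center, while nearby-value duplication preserves local level; which error matters more depends on whether the analysis targets means or excursions.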
Unfortunately, the probability distributions of
greatest interest in compliance sampling may often be
hard to work with at a practical level. They tend to be
"lumpy" in both space and time, with extreme
variability, long tails to the right, and big coefficients
of variation. Correlation functions over space and
time (as in kriging) are important, but may themselves
need to be estimated anew in each specific application,
with detailed attention to local circumstances.
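The practical force of "long tails and big coefficients of variation" is easy to demonstrate; the sketch below contrasts a symmetric sample with a long-tailed (lognormal) one sharing the same median, using purely illustrative parameters.

```python
import math
import random

def coeff_of_variation(xs):
    m = sum(xs) / len(xs)
    sd = (sum((x - m) ** 2 for x in xs) / (len(xs) - 1)) ** 0.5
    return sd / m

rng = random.Random(7)
# Symmetric sample vs a long-tailed lognormal sample, both with median near 10.
symmetric = [rng.gauss(10.0, 1.0) for _ in range(5000)]
long_tailed = [rng.lognormvariate(math.log(10.0), 1.0) for _ in range(5000)]

print(f"symmetric:   mean={sum(symmetric)/5000:.1f}  "
      f"CV={coeff_of_variation(symmetric):.2f}")
print(f"long-tailed: mean={sum(long_tailed)/5000:.1f}  "
      f"CV={coeff_of_variation(long_tailed):.2f}")
```

The long-tailed sample's mean sits well above its median and its coefficient of variation is an order of magnitude larger, which is exactly the behavior that strains Gaussian-based methods.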
One practical consequence of dealing with "difficult"
distributions is the loss of applicability of the Gaussian
distribution (or at least loss of some confidence in its
applicability), even in the form of the central limit
theorem. Another is the loss of applicability of linear
approaches, which have many well-known practical
advantages with both continuous data and discrete
(even non-ordered) classifications. Nonlinear analogs
of, say, the general linear model and the loglinear or
logit approaches have neither the theoretical
underpinnings, nor the range of packaged general-use
computer programs, nor the background of use and the
familiarity of the linear approaches.
Given a set of data and a need to "average," what
kind of average is appropriate? Some obvious
questions have to do with ordinary weighted averages,
others with moving averages. Still other questions
have to do with the form of the averaging function:
arithmetic, harmonic, geometric, etc. Geometric
means are sometimes used in compliance sampling, as
Wyzga has noted here (15), but they may often be quite
unsuitable precisely because their advantage in some
other situations - that they reduce the importance of
high outliers - obscures the values of most concern.
When health is at issue, I want a mean that will attend
more to the upper tail than the lower tail. If six values
on six successive days are (for example) 1, 2, 3, 4, 6,
and 12, the geometric mean is 3.46, distinctly less than
the arithmetic mean of 4.67, but it is the 6 and 12 that
may matter most. An average that works opposite to
the geometric mean seems better, such as the root
mean square (5.92 in the example above) or root mean
cube (6.98 above). I was glad indeed to learn recently
that the geometric mean has been abandoned in
measures of air particulates.
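The arithmetic in this example is easy to verify; the sketch below recomputes the four averages for the six values given above.

```python
import math

# Recomputing the six-day example from the text: 1, 2, 3, 4, 6, 12.
values = [1, 2, 3, 4, 6, 12]
n = len(values)

arithmetic = sum(values) / n
geometric = math.prod(values) ** (1 / n)
root_mean_square = (sum(v ** 2 for v in values) / n) ** 0.5
root_mean_cube = (sum(v ** 3 for v in values) / n) ** (1 / 3)

print(f"AM={arithmetic:.2f}  GM={geometric:.2f}  "
      f"RMS={root_mean_square:.2f}  RMC={root_mean_cube:.2f}")
```

The geometric mean (3.46) sits below the arithmetic mean (4.67), while the root mean square (5.92) and root mean cube (6.98) lean toward the high days that matter most for health.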
Many statistical approaches incorporate an
assumption that the variance of an observation is
independent of its true value. This may rarely be the
case. However, lack of uniformity in variance may
often have little consequence, and in some other cases
it can be readily dealt with (such as by log or square
root transforms). But there may be serious
consequences if the nonuniformity leaves the
statistical methods with properties that are not
understood or are not acceptable. For example, in the
6-value numeric example above, if variances are
proportional to the observed values, a log transform
may produce values of approximately equal variance;
however, the arithmetic mean of logged values is
equivalent to the geometric mean of the original
values, so that a different approach may be better.
Problems are even greater, of course, when it is biases
rather than random error that may depend on the
unknown true values. Nelson's paper here (3) is rich in
these and other statistical questions as well as policy
questions.
Empirical Aspects of Compliance Sampling
The compliance sampler must attend to a wide
variety of issues of direct, practical significance that
derive from the context in which the data are to be
collected and used. One is that results must be
prepared so as to withstand legal challenge and,
sometimes, political attack. A practical consequence
is that much flexibility and much scope of application
of informed judgment are lost. There may also be
extra costs for sample identification, replicate
measurement, and extra record keeping that help to
validate individual values but reduce resources for
other sampling that may contribute as much to the
public health. This is in part a consequence of
competing objectives within the general scope of
compliance sampling. What is the optimum mix of
finding indicators of many preventable problems and
applying gentle persuasion to remove them vs. nailing
down a smaller number of problems and ensuring that
the data can be used in strong legal action if need be?
A second broadly empirical issue is the whole range
of chemical and physical limitations on the detection
and accurate measurement of hazardous substances.
This is not the problem it once was — indeed, some
observers believe that increased sensitivity of methods
has led to the opposite problem of overdetection and
overcontrol — but some substances are still difficult
to measure at low concentrations by methods that are
accurate, fast, and inexpensive. Thus, measurement
remains a serious problem. An example is USDA's
program for assessing pesticide residues in meat and
meat products, which is limited by high costs to about
300 samples per year for the general surveillance of
each major category (e.g., "beef cattle.") Thus there is
a close link between the setting of standards (what is
likely to be harmful, to whom, in what degree, and
with what probability?) and the enforcing of standards
(what violations are to be found, to what degree of
precision, and with what probability?). A standard not
enforceable because of limits on laboratory methods is
no better, and may be worse, than no standard at all,
and should be a candidate for replacement by some
other method of controlling risk (e.g., process
standards, or engineering controls). Sometimes, of
course, deliberately insensitive methods can be
cultivated and put to use. An example is FDA's
"sensitivity of the method" approach to carcinogens in
foods. Another real example, though slightly less
serious here, was the step taken by the State of
Maryland to improve its performance in enforcing
federal highway speed limits: Move radar detectors
from the flat straightaways to places where many

drivers slow down anyway, such as sharp curves and the
tops of hills, as other states had done long before. The
incidence of detection of speed violations dropped
markedly, and Maryland was suddenly in compliance
with Federal standards. Creative design of a
compliance sampling plan can produce pretty much
whatever the designer wants, and I take it that a part
of our task here is to develop approaches that
discourage, inhibit, and/or expose the cynical
manipulation of sampling procedures.
Sometimes, methods exist but for other reasons the
data have not been collected. One example is the
distribution of various foreign substances in human
tissues. These include heavy metals, pesticides, and
radioactive decay products; none of these had been
adequately studied to determine the probability
distribution of body burdens in the general population.
Reasons are varied and deep, but include cost,
problems of storage, control of access to banks of
human tissues (an expendable resource), and ultimately
the problems of procuring enough of the right kind of
material from a fully representative sample of people.
The need for detailed human data will surely grow with
the growth of new approaches to risk assessment
(especially of carcinogens), and compliance sampling
may well be involved. Toxicokinetics, in particular,
often demands human data; mechanisms can be
examined in other species, but human sensitivity,
human rate parameters, and human exposure can be
determined only by study of human circumstances and,
sometimes, human specimens.
Compliance sampling is indeed an activity loaded
with problems. Overall, there is a clear need for
substantially more thought and research on the
empirical issues raised by compliance sampling. Wyzga
(15) and Bailey (16) provide a fresh view of many of these issues.
Overview of the Overview
Where do we go from here? It is easy to call for
more and better compliance sampling, and to show how
we could then do more and better things. That will not
get us far in this age of constrained resources. I
believe that we need some other things first, or instead.
First is a broader and deeper view of compliance
sampling. Many agencies and programs do such
sampling, but almost always with a narrow focus on the
enforcement of one or another regulation. This view
should be broader — to include other substances, other
agencies, and other objectives (including research) —
and it should be deeper, so that issues of compliance
sampling are considered at each stage from initial
legislation onward, and plans are integrated with all
other relevant aspects of Agency activities.
Compliance sampling simply must not be treated like a
poor relative — tolerated but not really welcome, and
largely ignored until its general shabbiness or some
genuine scandal forces a response.
A broader view of compliance sampling might, for
example, support Nelson's comments on extensions
from existing data to broader groups, even to national
populations (3). Nelson's paper as a whole is unusually
rich in both statistical questions and policy questions.
While the matter seems to have received little specific
discussion, it seems to me that the maximum useful
geographic range or population size for compliance
sampling, and maybe the optimum too, is the same as
the maximum feasible scope of specific control
measures. Thus, national data may be most critical in
drafting or revising national laws and regulations, but
local data are indispensable for understanding local
needs, monitoring local successes, and enforcing local standards.
Another aspect of broadening our view of
compliance sampling is the need to optimize sampling
strategies for attaining specific, carefully elaborated
goals. Thus, there might be reason in public policy to
extend the use of weighted sampling, with more effort
to collect samples likely to be out of compliance. This
approach seems to have substantial informal use,
especially when inspectors have considerable latitude
to make decisions in the field, but has had less in the
way of formal attention.
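One standard way to make such weighted sampling formal is the Horvitz-Thompson correction, in which each inspected site is weighted by the inverse of its inclusion probability; the sketch below uses a wholly hypothetical roster of sites.

```python
import random

# Wholly hypothetical roster: 10 suspect sites (sampled heavily, and in fact
# in violation) and 90 routine sites (sampled lightly, in compliance).
sites = [(0.8, True)] * 10 + [(0.1, False)] * 90

def ht_estimate(rng):
    """Horvitz-Thompson estimate of the overall violation rate."""
    total = 0.0
    for p_incl, violating in sites:
        if rng.random() < p_incl:            # this site gets inspected
            total += violating / p_incl      # weight by 1 / inclusion probability
    return total / len(sites)

rng = random.Random(3)
estimates = [ht_estimate(rng) for _ in range(1000)]
print(f"average estimate over 1000 draws: {sum(estimates)/1000:.3f} "
      f"(true rate 0.100)")
```

Concentrating inspections on likely violators improves detection while the inverse-probability weights keep the overall rate estimate unbiased, which is what formal attention to the design buys over informal field judgment.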
Still another aspect is the need for empirical study
of the probability distributions that arise in the
samples, and the development of sampling plans and
analytic approaches that accommodate those
distributions. Should one take a "point" sample of just
the size needed for testing, or take a more distributed
sample, mix it, and test an aliquot? Is there a larger
role for two-stage sampling, in which the selection of
a general area for examination is followed by the
selection of subareas? Or a role for two-stage
testing, in which aliquots of several samples are mixed
and tested for the presence of some offending
substance, with further testing of individual samples
only if the group result is positive?
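The pooled-testing idea in the last question is the classic two-stage (Dorfman) scheme; its expected cost has a simple closed form, sketched below with illustrative contamination rates.

```python
# Two-stage (Dorfman) group testing: pools of k specimens are tested once;
# a positive pool triggers k individual retests.
def expected_tests_per_specimen(p, k):
    """p = chance a specimen is contaminated; k = pool size."""
    pool_positive = 1 - (1 - p) ** k
    return 1 / k + pool_positive

for p in (0.01, 0.05, 0.20):
    best_k = min(range(2, 51), key=lambda k: expected_tests_per_specimen(p, k))
    cost = expected_tests_per_specimen(p, best_k)
    print(f"contamination rate {p:.2f}: best pool size {best_k}, "
          f"{cost:.3f} tests per specimen")
```

When contamination is rare, pooling cuts the expected laboratory workload severalfold; as the rate rises, the optimal pool shrinks and the savings fade.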
Perhaps the most fundamental need in developing a
more comprehensive view of compliance sampling is
for careful consideration of the role of genuinely
random sampling, as opposed to haphazard or
subjectively selected samples of convenience. One of
the biggest surprises to me at this Conference was the
lack of attention to the need to guarantee genuinely
random sampling, though it provides the only
acceptable justification for the statistical measures,
such as p-values and confidence limits, that have been
tossed about quite freely here. As a part of this, there
is a clear need for new approaches to the computation
of variances and other functions of the data, which will
force demands for some kinds of randomization in the
sampling. Gilbert's problem in particular (9) calls for
highly sophisticated statistical modeling and analysis.
Second is a deeper consideration of how compliance
sampling can be made more productive than in just the
detection of violations, and how it can support broader
Agency and national objectives. I have already
referred to several aspects of this, but some points
still require comment. One is the value of designing
compliance programs (including sampling) that
encourage both more and better monitoring and also
encourage what might be called supercompliance.
Response to the findings of a particular sample or
pattern of samples may be yes-or-no, but surely one
should put greater weight on finding the bigger
violations. Frank has referred to this (13), with special
comment about the potential value of variable
frequency (and intensity) in sampling, while Warren
(14) has noted some practical obstacles.
Some statistical tools do exist to aid in increasing
the broad utility of data from compliance sampling.
Bisgaard (5) and Price (7) have each presented reasons
for more careful attention to the operating
characteristics (OCs) of programs for compliance
sampling. OCs might in fact be a good way to
communicate with Agency administrators and others
about the consequences of choosing one or another
approach to monitoring, though Johnson (6) has
emphasized the need for attention to the upper tail of
exposure rather than the mean. It seems to me that
the question of tail vs. mean may well depend on the

health endpoint in question; an effect such as cancer
that is considered a function of lifetime exposure may
well be approached by means, while effects that really
depend on short-term peaks should be regulated in
terms of peaks, though this may create some problems
when both kinds of endpoints must be managed in the
same exposure setting. Bisgaard and Hunter (5) are
firmly on the right track with their insistence on a
more comprehensive view that integrates sampling
protocols, calibration of the tools and processes, and a
decision function to determine responses. This also
underlines the need for clear articulation of goals;
otherwise, Bisgaard's approach cannot be
implemented. Johnson (6) also points to the need for
adequate attention to other matters, too, including the
political situation, pollutant behavior, sampling
constraints, and the objectives of the standard.
Flatman (8) also emphasizes the need for constant
attention to the practicalities of solutions to real, and
different, problems.
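For a simple attribute plan (pass the source if at most c of n samples exceed the limit), the operating characteristic discussed above is just a binomial tail sum; the plan parameters below are illustrative, not drawn from any of the papers.

```python
from math import comb

# Illustrative attribute plan: take n samples, pass the source if at most
# c of them exceed the limit. The OC is the probability of passing as a
# function of the true exceedance probability p.
def prob_accept(p, n, c):
    return sum(comb(n, k) * p ** k * (1 - p) ** (n - k) for k in range(c + 1))

n, c = 20, 1
for p in (0.01, 0.05, 0.10, 0.20):
    print(f"true exceedance rate {p:.2f}: P(pass) = {prob_accept(p, n, c):.3f}")
```

Plotting such curves for competing values of n and c is a direct way to show an administrator the chance of passing a marginal violator versus failing a compliant source under each candidate plan.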
Other statistical tools of potential value in
compliance sampling can be found in the
epidemiologist's approach to diagnostic testing, with
an insistence that policy decisions about testing be
based on sound data on sensitivity, specificity, and
positive and negative predictive values. These
concepts have proved invaluable in policy decisions
about medical screening, and they have similar
potential to sharpen decisions about environmental monitoring.
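These screening quantities follow directly from Bayes' rule; the sketch below uses illustrative sensitivity, specificity, and prevalence values to show how rare true violations drag down the positive predictive value.

```python
# Predictive values from sensitivity, specificity, and prevalence via
# Bayes' rule; all numbers below are illustrative, not from any real program.
def predictive_values(sensitivity, specificity, prevalence):
    tp = sensitivity * prevalence              # true positives
    fp = (1 - specificity) * (1 - prevalence)  # false positives
    fn = (1 - sensitivity) * prevalence        # false negatives
    tn = specificity * (1 - prevalence)        # true negatives
    return tp / (tp + fp), tn / (tn + fn)

# A fairly accurate test applied where true violations are rare:
ppv, npv = predictive_values(sensitivity=0.95, specificity=0.95, prevalence=0.02)
print(f"PPV = {ppv:.3f}, NPV = {npv:.4f}")
```

Even a test that is right 95% of the time yields mostly false alarms when only 2% of samples are truly out of compliance, which is why these quantities belong in any policy decision about testing.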
Third, and my final point, is a plea that regulatory
agencies explore the potential of statistical decision
theory in their approaches to compliance sampling,
including explicit consideration of the value of new
information. The emphasis this will put on such
matters as prior distributions, objective functions, cost
functions, and balancing of disparate endpoints — all
of which are already major elements in setting policy
about compliance sampling — can only be good.
Among other benefits, decision theory will tend to
direct Agency attention to those points where the
biggest improvements can be made, and away from
both fine-tuning of little things with little potential
profit and spinning wheels over big things that can't be
settled anyway.
This would again direct attention to how prior
distributions for the probability, location, and degree
of violation are developed and used. Thus, Gilbert
samples from plots that are next to plots already
known to be in violation; the frequency of air sampling
is tied to the frequency of past violations; and
experienced plant inspectors come to know where the
bodies may be buried and how to look for them.
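The value-of-new-information idea urged above can be sketched with a toy inspect-or-not decision; the prior probability and all costs below are invented for illustration:

```python
# Sketch of the decision-theoretic framing: a regulator chooses
# between inspecting a site (fixed cost) and not inspecting (a missed
# violation incurs a damage cost).  All numbers are illustrative.

def expected_cost(p_violation, inspect_cost, damage_cost):
    """Expected cost of each action under the prior, plus the
    expected value of perfect information (EVPI)."""
    cost_inspect = inspect_cost                # paid regardless of status
    cost_ignore = p_violation * damage_cost    # paid only if violating
    best_now = min(cost_inspect, cost_ignore)
    # With perfect information, inspection is bought only for true
    # violators, so only the prior fraction of sites incurs the cost.
    cost_informed = p_violation * inspect_cost
    evpi = best_now - cost_informed
    return cost_inspect, cost_ignore, evpi

ci, cg, evpi = expected_cost(p_violation=0.2, inspect_cost=10.0,
                             damage_cost=100.0)
print(ci, cg, evpi)  # 10.0 20.0 8.0
```

Under these made-up numbers the regulator would inspect (expected cost 10 versus 20), and perfect knowledge of violation status would be worth 8 cost units per site, an upper bound on what any sampling program could justify spending there.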
Overall, this Conference was eminently successful in
bringing out a broad range of problems, issues, and
research needs. It has also provided some answers,
though the most important products of our work here
will continue to unfold for years to come. Our Chair,
speakers, and discussants deserve much thanks for a
job well done.
References
1. Marcus AH. Time scales: Biological, environmental, regulatory. This conference.
2. Hertzberg RC. Discussion of paper by Marcus. This conference.
3. Nelson WC. Statistical issues in human exposure monitoring. This conference.
4. Hunt WF. Discussion of paper by Nelson. This conference.
5. Bisgaard S, Hunter WG. Designing environmental regulations. This conference.
6. Johnson WB. Discussion of paper by Bisgaard and Hunter. This conference.
7. Price B. Quality control issues in testing compliance with a regulatory standard: Controlling statistical decision error rates. This conference.
8. Flatman GT. Discussion of paper by Price. This conference.
9. Gilbert RO, Miller ML, Meyer HR. On the design of a sampling plan to verify compliance with EPA standards for radium-226 in soil at uranium mill tailings remedial action sites. This conference.
10. Chesson J. Discussion of paper by Gilbert, Miller, and Meyer. This conference.
11. Holley JW, Nussbaum BD. Distributed compliance: EPA and the lead bubble. This conference.
12. Ross NP. Discussion of paper by Holley and Nussbaum. This conference.
13. Frank NH, Curran TC. Variable sampling schedules to determine PM10 status. This conference.
14. Warren J. Discussion of paper by Frank and Curran. This conference.
15. Hammerstrom TS, Wyzga RE. Analysis of the relationship between maximum and average in SO2 time series. This conference.
16. Bailey RC. Discussion of paper by Hammerstrom and Wyzga. This conference.
17. Jones CA. Models of regulatory enforcement and compliance, with an application to the OSHA asbestos standard. Unpublished doctoral dissertation, Harvard University Economics Department.
18. Little RJA, Rubin DB. Statistical Analysis with Missing Data. John Wiley, 1987.
19. Jernigan RW. A Primer on Kriging. Statistical Policy Branch, US Environmental Protection Agency, 1986.

Monday, October 5
9:00 a.m.	Paul I. Feder, Conference Chairman, Battelle Columbus Division
Dorothy G. Wellington, U.S. Environmental Protection Agency
9:10 a.m. Time Scales: Biological, Environmental, Regulatory
Allan H. Marcus, Battelle Columbus Division
Richard C. Hertzberg, U.S. EPA, ECAO-Cincinnati
10:15 a.m. BREAK
10:30 a.m. Some Statistical Issues in Human Exposure Monitoring
William C. Nelson, U.S. EPA, EMSL-Research Triangle Park
William F. Hunt, Jr., U.S. EPA, OAQPS-Research Triangle Park
12:00 noon LUNCHEON
1:00 p.m.	Designing Environmental Regulations
Soren Bisgaard, University of Wisconsin-Madison
W. Barnes Johnson, U.S. EPA, OPPE-Washington, D.C.
2:15 p.m.	BREAK
2:30 p.m.	Quality Control Issues in Testing Compliance with a Regulatory Standard:
Controlling Statistical Decision Error Rates
Bertram Price, Price Associates, Inc.
George T. Flatman, U.S. EPA, EMSL-Las Vegas
3:40 p.m.	On the Design of a Sampling Plan to Verify Compliance with EPA
Standards for Radium-226 in Soil at Uranium Mill Tailings Remedial
Action Sites
Richard O. Gilbert, Battelle Pacific Northwest Laboratories; Mark L.
Miller, Roy F. Weston, Inc.; H.R. Meyer, Chem-Nuclear, Inc.
Jean Chesson, Price Associates, Inc.
5:00 p.m.	RECEPTION

Tuesday, October 6
9:00 a.m.	Distributed Compliance—EPA and the Lead Bubble
John W. Holley, Barry D. Nussbaum, U.S. EPA, OMS-Washington, D.C.
N. Philip Ross, U.S. EPA, OPPE-Washington, D.C.
10:15 a.m. BREAK
10:30 a.m. Variable Sampling Schedules to Determine PM10 Status
Neil H. Frank, Thomas C. Curran, U.S. EPA, OAQPS-Research Triangle Park
John Warren, U.S. EPA, OPPE-Washington, D.C.
12:00 noon LUNCHEON
1:00 p.m.	The Relationship Between Peak and Longer Term Exposures to Air
Ronald E. Wyzga, Electric Power Research Institute, Thomas S.
Hammerstrom, H. Daniel Roth, Roth Associates
R. Clifton Bailey, U.S. EPA, OWRS-Washington, D.C.
2:15 p.m. BREAK
2:30 p.m. John C. Bailar III, McGill University, Department of Epidemiology and Biostatistics
This conference is the final in a series of research conferences on interpretation of
environmental data organized by the American Statistical Association and supported by a
cooperative agreement between ASA and the Office of Standards and Regulations, under
the Assistant Administrator for Policy Planning and Evaluation, U.S. Environmental
Protection Agency.
Conference Chairman and Organizer:
Paul I. Feder, Battelle Columbus Division

APPENDIX B: Conference Participants
Ruth Allen
401 M Street, S.W., RD-680
Washington, DC 20460
Stewart J. Anderson
CIBA-GEIGY Corporation
556 Morris Avenue, SIC 249
Summit, NJ 07901
John C. Bailar III
(McGill University)
468 N Street, S.W.
Washington, DC 20024
R. Clifton Bailey
Environmental Protection Agency
401 M Street, S.W., WH-586
Washington, DC 20460
T. O. Berner
Battelle Columbus Division
2030 M Street, N.W., Suite 700
Washington, DC 20036
Soren Bisgaard
University of Wisconsin
Center for Quality & Productivity
Warf Building, 610 Walnut Street
Madison, WI 53705
Jill Braden
Westat, Inc.
1650 Research Boulevard
Rockville, MD 20852
Chao Chen
401 M Street, S.W., RD-689
Washington, DC 20460
Jean Chesson
Price Associates, Inc.
2100 M Street, N.W., Suite 400
Washington, DC 20037
James M. Daley
(U.S. EPA)
12206 Jennel Drive
Bristow, VA 22012
Susan Dillman
401 M Street, S.W., TS-798
Washington, DC 20460
Paul I. Feder
Battelle Columbus Division
505 King Avenue
Columbus, OH 43201
George T. Flatman
P.O. Box 93478
Las Vegas, NV 89193-3478
Paul Flyer
Westat, Inc.
1650 Research Boulevard
Rockville, MD 20852
Ruth E. Foster
401 M Street, S.W.
Washington, DC 20460
Neil H. Frank
Research Triangle Park, NC 27711
Richard O. Gilbert
Battelle Pacific Northwest Lab
P.O. Box 999
Richland, WA 99352
J. Hatfield
Battelle Columbus Division
2030 M Street, N.W., Suite 700
Washington, DC 20036
Richard C. Hertzberg
Cincinnati, OH 45268
John W. Holley
9700 Water Oak Drive
Fairfax, VA 22031
William F. Hunt, Jr.
Research Triangle Park, NC 27711

Thomas Jacob
Viar and Company
209 Madison
Alexandria, VA 22314
Robert Jernigan
American University
Department of Mathematics
and Statistics
Washington, DC 20016
W. Barnes Johnson
401 M Street, S.W., PM-223
Washington, DC 20460
Herbert Lacayo
401 M Street, S.W., PM-223
Washington, DC 20460
Emanuel Landau
American Public Health Association
1015 15th Street, N.W.
Washington, DC 20005
Darlene M. Looney
CIBA-GEIGY Corporation
556 Morris Avenue, SIC 257
Summit, NJ 07901
Allan H. Marcus
Battelle Columbus Division
P.O. Box 13758
Research Triangle Park, NC 27709-2297
Lisa E. Moore
26 W. Martin Luther King Jr. Drive
Cincinnati, OH 45268
William C. Nelson
Research Triangle Park, NC 27711
Barry D. Nussbaum
401 M Street, S.W., EN-397F
Washington, DC 20460
Harold J. Petrimoulx
Environmental Resources
Management, Inc.
999 West Chester Pike
West Chester, PA 19382
Bertram Price
Price Associates, Inc.
2100 M Street, N.W., Suite 400
Washington, DC 20037
Dan Reinhart
401 M Street, S.W., TS-798
Washington, DC 20460
Alan C. Rogers
401 M Street, S.W.
Washington, DC 20460
John Rogers
Westat, Inc.
1650 Research Boulevard
Rockville, MD 20852
N. Philip Ross
401 M Street, S.W., PM-223
Washington, DC 20460
Brad Schultz
401 M Street, S.W., TS-798
Washington, DC 20460
John Schwemberger
401 M Street, S.W., TS-798
Washington, DC 20460
Paul G. Wakim
American Petroleum Institute
1220 L Street, N.W.
Washington, DC 20005
John Warren
401 M Street, S.W., PM-223
Washington, DC 20460
Dorothy G. Wellington
401 M Street, S.W., PM-223
Washington, DC 20460
Herbert L. Wiser
401 M Street, S.W., ANR-443
Washington, DC 20460
Ronald E. Wyzga
Electric Power Research Institute
P.O. Box 10412
Palo Alto, CA 94303
Conference Coordinator:
Mary Esther Barnes
American Statistical Association
1429 Duke Street
Alexandria, VA 22314-3402

ABSTRACT
The general theme of the papers and associated discussions is the design and
interpretation of environmental regulations that incorporate, from the outset, statistically
valid compliance verification procedures. Statistical aspects of associated compliance
monitoring programs are considered. Collectively, the papers deal with a wide variety of
environmental concerns including various novel approaches to air emissions regulations and
monitoring, spatial sampling of soil, incorporation of potential health effects
considerations into the design of monitoring programs, and considerations in the statistical
evaluation of analytical laboratory performance.