Choosing a Percentile of Acute Dietary Exposure as a Threshold of Regulatory Concern


CHOOSING A PERCENTILE OF
ACUTE DIETARY EXPOSURE AS A
THRESHOLD OF REGULATORY
CONCERN
Office of Pesticide Programs
U.S. Environmental Protection Agency
Washington, D.C. 20460
March 16, 2000
F ax-On-Demand
Fax Number: (202) 401-0527
Item: 6046

-------
List of Acronyms
aPAD
acute Population Adjusted Dose
a RfD
acute Reference Dose
CEC
Critical Exposure Contribution
CSFII
Continuing Survey of Food Intake by Individuals
DEEM
Dietary Exposure Evaluation Model
DRES
Dietary Risk Evaluation System
FFDCA
Federal Food, Drug, and Cosmetic Act
FIFRA
Federal Insecticide, Fungicide, and Rodenticide Act
IWG
Implementation Working Group
NCHS
National Center for Health Statistics
NOAEL
No Observed Adverse Effect Level
LOD
Limit of Detection
LOQ
Limit of Quantitation
OPP
Office of Pesticide Programs
PDP
Pesticide Data Program
RfD
Reference Dose
SCS
Supplemental Children's Survey

-------
Table of Contents
EXECUTIVE SUMMARY
I.	OPP's Present Practice and Policy for Acute Dietary Risk Assessment
	6
A.	Introduction	6
B.	Previous Review of OPP's Interim Policy
	8
C.	OPP's Current Approach to Dietary Risk Assessment	9
1.	Chronic vs. Acute Exposure and Risk Assessment
	9
2.	The Risk Equation	10
3.	Data Bases Used in Probabilistic Dietary Exposure Estimates	11
(a)	Food Consumption: USDA Continuing Survey of Food Intake by Individuals
	12
(b)	Residue Data Sources: Field Trials, Monitoring, and Market Basket Surveys
	12
4.	DRES and DEEM 	
5.	DRES 95th Percentile Dietary Food Exposure vs. Monte Carlo 99.9th Percentile Dietary
Food Exposure 	15
II. Issues Related to the Methodology and Data Bases Used in Acute Dietary Risk Assessment	17
A.	Treatment of High-End Consumption Values ("Outliers") in USDA CSFII Survey
	18
B.	Treatment of High-End Residue Values ("Outliers") in Crop Field Trial or Monitoring DatalJ
C.	Representativeness 	20
D.	Size of the Input Data Bases.
72
E.	Potential Variability/Uncertainty in Recipe Translations
	24
F. Impact of Agency Default Assumptions on the Choice of a Percentile Exposure Estimate for
the Threshold of Concern	25
III.	Issues Related to Public Health Policy	27
IV.	Policy Not Rules	34
V.	Bibliography	34
APPENDIX 	36
(Primer on Interpretation of Exposure Distribution Curves)

-------
CHOOSING A PERCENTILE OF ACUTE DIETARY EXPOSURE
AS A THRESHOLD OF REGULATORY CONCERN
EXECUTIVE SUMMARY
EPA is responsible for regulating the nature and amount of pesticide residues in food
under the Federal Food, Drug and Cosmetic Act (FFDCA). FFDCA sec. 408 authorizes EPA to
set a tolerance or an exemption from the requirement of a tolerance if the Agency determines that
the residues would be "safe." The Agency performs various types of risk assessments to evaluate
the safety of pesticides in food, including analyses to determine the nature and the amounts of
pesticides that people might be exposed to over a single day. This paper discusses how EPA
generally applies the statutory safety standard to acute dietary risk assessments as to pesticide
residues in foods.
The Environmental Protection Agency's Office of Pesticide Programs (OPP) previously
announced that, on an interim basis, it intended to use the 99.9th percentile of the distribution of
estimated acute dietary food exposures for calculating a threshold of concern when probabilistic
assessment techniques are used to model the distribution. OPP stated that it would compare this
percentile of estimated exposure to the Population Adjusted Dose (PAD), a value that reflects an
amount of a pesticide to which a person may safely be exposed in one day. The Agency published
a notice in the Federal Register on April 7, 1999 citing the availability of an interim policy and
requested public comment so that the views of all interested parties would be considered (US
EPA, 1999a).
Based in part on the comments received, this science policy paper was revised and is now
being issued in its revised format. This revised document explains OPP's policy and details some
of the various concerns that have been raised, additional associated public health-related issues, as
well as OPP's plans for further evaluation and implementation. This policy has broad applicability
to many pesticides.
OPP's current approach with respect to assessing and regulating the food uses of
pesticides, when using a probabilistic method of estimating acute dietary exposure, is as follows:
If the 99.9th percentile of acute exposure from food, as estimated by probabilistic
(e.g., Monte Carlo) analysis, is equal to or less than the acute Population
Adjusted Dose (aPAD) for the pesticide, then OPP would generally consider its
threshold of concern in applying that the safety standard of FFDCA sec.
408(B)(2)(A) not to be exceeded with respect to acute risk from food. However,
if the analysis indicates that estimated exposure at the 99.9th percentile exceeds
the PAD, OPP would generally conduct a sensitivity analysis to determine to what
4

-------
extent the estimated exposures at the high-end percentiles may be affected by
unusually high food consumption or residue values. To the extent that one or a
few values from the input data sets seem to "drive" the exposure estimates at the
high end of exposure, OPP would consider whether these values are representative
and should be used as the primary basis for regulatory decision making. In either
scenario, EPA would consider submissions by interested parties that question the
appropriateness of the use of the 99.9th percentile in calculating the threshold of
concern for the particular risk assessment in question or question its use generally.
It is important to note here that the above position refers to the 99.9th percentile of
exposure and not consumption. The 99.9th percentile of exposure represents the joining of each
individual's consumption data set with randomly selected residue values from the residue data set.
The consumption values associated with the 99.9th percentile of exposure do not necessarily
represent the 99.9th percentile of consumption since it is both the selected consumption value and
residue concentration which is responsible for determining exposure.
At this time, OPP's current policy is used only with daily exposures to a single chemical
through the food pathway only. Estimates of exposure through drinking water and residential
uses are not sufficiently developed to warrant inclusion in a probabilistic assessment. Establishing
the threshold of concern for the food pathway using the 99.9th percentile of exposure is
considered to be a "first step" toward regulation of exposures on an aggregate, and then
cumulative, basis.
OPP recognizes that different types of risk assessments will generally be needed for
aggregate and cumulative evaluations and that these assessments might also be associated with
different regulatory thresholds. Although OPP is moving toward regulating on the basis of
probabilistic aggregate and cumulative exposure assessments, a decision has not yet been made
regarding how the appropriate threshold of concern should be calculated for these types of
assessments. When exposures through drinking water and residential uses are sufficiently refined
to be incorporated into probabilistic evaluations, they will be aggregated and assessed, and may
use a different population percentile.
Section I of this paper provides an overview of OPP's present practice for acute dietary
risk assessment for residues in food. It describes the statutory, regulatory, and policy framework
for this policy, as well as prior reviews and comments. In addition, this section provides
background information on dietary risk assessment in general and explains how the previous
system (DRES—Dietary Risk Evaluation System) and the current system (DEEM—Dietary
Exposure Evaluation Model) work, as well as what input data sources are used and how.
Section II addresses some of the specific issues and concerns raised about using exposures
at the estimated 99.9th percentile in calculating the threshold of concern. One issue is whether
the nature of the databases available (i.e., robustness, adequacy, etc.) should preclude the use of
5

-------
the estimated 99.9th percentile for regulatory purposes since some consider the uncertainties
associated with this population percentile to be too great. Examples of data used are USDA's
food consumption survey data, registrant crop field trials, USDA Pesticide Data Program (PDP)
data, FDA monitoring data, market basket surveys, etc. Other issues include the treatment of
data "outliers," representativeness and adequacy of the databases, and the impact of Agency
default values on exposure estimates. Concerns, therefore, exist about whether the estimates of
the 99.9th percentile of exposure are sufficiently representative of actual exposure to be
meaningful. This paper summarizes these concerns and how OPP has addressed them.
Section III addresses the issue of protectiveness of the estimated 99.9th percentile of
exposure with respect to the general public health. One view is that using the estimated 99.9th
percentile of exposure is insufficiently conservative because very large numbers of people could
be exposed every day to pesticide intakes which are estimated to exceed the Agency's "level of
concern." This section also explores the contrary view - that the policy is over-protective
because of the conservative assumptions used in the estimation methods and the retention of
potentially unrepresentative values in the data base. The section discusses as well the view that,
whether it over- or under-estimates actual exposure, the estimated 99.9th percentile of exposure
is simply too uncertain to be used in risk management decisions.
Section III also explains that OPP weighs a number of factors in considering which
percentile to use: the size of the exposed population and the proportion that might receive daily
doses above the benchmark of safety, the acute Population Adjusted Dose (aPAD); the level of
confidence OPP has in its exposure estimates; and the extent to which such estimates may
overstate potential exposure because they incorporate conservative assumptions or rely on
atypical and unrealistic data. Further, to the extent understood, OPP considers by how much
individual exposures would be estimated to exceed the aPAD.
Section III also briefly addresses the issues associated with exploratory analysis conducted
by OPP with the DEEM software and the 99.9th percentile issue. Further details and specifics of
this analysis are provided in the associated response to public comments.
Section IV provides a list of the documents referenced in this paper.
The Appendix, entitled "Primer on Interpretation of Exposure Distribution Curves," is a
"plain English" guide to Monte Carlo analysis and interpretation of its results.
I.
OPP's Present Practice and Policy for Acute Dietary Risk Assessment
A. Introduction
6

-------
Under the Federal Food, Drug, and Cosmetic Act (FFDCA), the Environmental Protection
Agency may authorize a tolerance or exemption from the requirement of a tolerance, to allow a
pesticide residue in food, only if the Agency determines that such residues would be "safe"
(FFDCA sec. 408(b)(2)(A)(I)). The term "safe" is defined as a "reasonable certainty that no harm
will result from aggregate exposure to the pesticide chemical residue, including dietary exposures
and all other exposures for which there is reliable information" (FFDCA sec. 408(b)(2)(A)(ii)).
To determine whether food is safe to eat, OPP must assess the potential risks from
pesticide residues in food. The size of the potential risks depends on the toxicity of the pesticide
(how much harm, if any, is caused by specific amounts of the pesticide) and the magnitude of the
exposure to the pesticide. Exposure to a pesticide in the food supply depends, in turn, on two
factors: the amount of the pesticide present in food and how much food a person eats. It is
impossible to know precisely how much food every individual in the country consumes, either
over a lifetime or even on a single day. Similarly, it is impossible to know how much residue each
specific item of food contains. Thus, the Agency must use available and reliable, representative
data to develop estimates of such exposure.
In evaluating the potential risks from pesticides in the diet, OPP assesses both chronic
(long term) exposure and acute (short term) exposure. For chronic exposure, OPP estimates the
average amount of pesticide residue a person might consume over extended periods, potentially
ranging from several months to a lifetime. For acute exposure, OPP is instead interested in the
amount that might be ingested on a single day. To evaluate acute dietary exposure, OPP now
uses a probabilistic exposure modeling technique, an example of which is "Monte Carlo analysis."
For the purpose of discussion, this paper will use the term "Monte Carlo" keeping in mind that
other probabilistic techniques may be used as well. This probabilistic assessment technique
estimates the different levels of exposure people experience as the result of differences in the
types and amount of foods they eat, as well as variations in the level of pesticide residue that may
be present, among other factors.
Over the last several years, OPP has been working to expand its capability of evaluating
acute dietary exposure and risk using probabilistic techniques of assessment. In early 1998, OPP
released an interim policy and a series of guiding principles regarding the use of probabilistic risk
assessment techniques (U.S. EPA, 1998a). In part, this policy was based on an earlier Agency
policy regarding the use of probabilistic techniques in risk assessment. Specifically, in a 1997
memorandum from Deputy Administrator Fred Hansen, EPA stated that probabilistic analysis
techniques, "given adequate supporting data and credible assumptions, can be viable statistical
tools for analyzing variability and uncertainty in risk assessments" (U.S. EPA, 1997a). The
Agency also enumerated a set of conditions to be considered in judging the acceptability of a
probabilistic analysis for review and evaluation; these conditions relate to transparency,
reproducibility, and the use of sound scientific methods (U.S.EPA, 1997a). This Agency policy
document noted that Monte Carlo analysis is the only probabilistic technique that has been
accepted so far, but EPA would be open to considering other probabilistic techniques.
7

-------
Among other things, the 1998 OPP draft guidance document on probabilistic health
assessments indicated that, when probabilistic exposure assessments were available for acute
dietary risk, OPP would refer to the 99.9th percentile of estimated exposure in making its risk
management decisions. In general, OPP would compare this level of exposure to a safety
benchmark, i.e., the acute Population Adjusted Dose (aPAD), in determining whether a particular
regulatory action would be consistent with the statutory safety standard established by FQPA
(U.S. EPA, 1998a).
B. Previous Review of OPP's Interim Policy
In March 1998, OPP brought its interim policy to the FIFRA Scientific Advisory Panel
(SAP). The SAP generally agreed with the probabilistic approach proposed by OPP. They
considered, among other things, the use of a 99.9th population percentile of exposure and
expressed divergent views on whether using the 99.9th percentile is an (adequately) conservative
approach. They noted that, in their view, if the 99.9th percentile is utilized, a percentage of the
population could still be exposed daily to estimated levels that exceed the regulatory threshold of
concern. They further noted that, even though the percentage was small (0.1%), the number of
people represented by that percentage was very large because the exposed group is potentially the
entire population of the country. The following additional remarks were made by the Panel:
• To judge whether any given percentile criterion is conservative for acute effects or
not, it would be necessary to consider the margin of safety which is already
incorporated into the toxicological portion of the risk evaluation.
• To identify the level of risk, variability not only in exposure levels but also in
human thresholds for the toxic effects under consideration would be needed. That
is, a probabilistic "toxicity" component of a risk assessment should be incorporated
into the analysis as well.
• By recognizing and separately modeling subpopulations, it may be possible to
choose a lower, less statistically tenuous percentile in calculating a threshold of
concern for one or more of these subpopulations. This lower percentile may also
be warranted, they indicate, if the risk assessment contains a number of
"conservative" assumptions that might result in overestimates of risk even at the
99.9th percentile.
The Agency's approach to acute dietary risk assessment has been discussed extensively by
the Tolerance Reassessment Advisory Committee. In addition, in January 1998 the FQPA
Implementation Working Group (IWG), an informal coalition of agricultural commodity groups
and food processing and agricultural chemical trade associations, submitted comments to OPP. In
particular, the IWG asserted that the residue and food consumption data sets used as input for
probabilistic exposure techniques contained data points which were "outliers" and which made the
resulting estimates of exposure distribution appear unrealistically high. IWG also argued that
8

-------
other conservative assumptions (assumptions that would likely overstate potential exposure) used
in developing the exposure estimates made the use of the 99.9th percentile an inappropriate point
for use in calculating a threshold of concern.
Following the SAP review and after considering public comment, OPP revised portions of
its interim policy document on probabilistic exposure assessment in OPP. This revised document
incorporated many of the changes recommended by the SAP in its March 1998 meeting discussed
above. On November 5, 1998, EPA announced in the Federal Register (63 FR 59780) the
availability of the revised document as a draft science policy paper entitled "Guidance for
Submission of Probabilistic Human Health Exposure Assessments to the Office of Pesticide
Programs" (U.S. EPA, 1998a). As its title indicates, the science policy paper contained guidance
on the submission of exposure assessments; it also stated that OPP would separately present an
explanation of its policy decision to refer to the 99.9th percentile of estimated acute dietary
exposure in making its risk management decisions. The current document addresses this latter
issue.
The Agency and the U. S. Department of Agriculture (USDA) have also discussed the
OPP policy with respect to determining a level of concern for regulatory decisions about the risks
of acute exposure to pesticides in the diet. USD A has commented that the use of data bases
which contain too few data points to project high-end percentiles of consumption of a particular
food or levels of residues in a specific commodity with statistical confidence raises questions
about the estimates of high-end exposure developed using probabilistic assessment techniques.
EPA has considered USDA's comments and revised this paper to explain and address its
concerns.
C. OPP's Current Approach to Dietary Risk Assessment
1. Chronic vs. Acute Exposure and Risk Assessment
OPP typically performs a dietary exposure assessment for two different exposure time
frames — short term or "acute" exposures and long-term or "chronic" exposures; each assessment
is calculated differently. In chronic exposure assessment, the risk assessor is attempting to
estimate a person's average dietary exposure over the long-term (e.g., several months to a
lifetime). Consequently, the use of both average (or mean) residue value for each food
commodity and average (or mean) consumption of food commodities is generally regarded as
appropriate. Estimates of exposure through drinking water are subsequently combined with these
estimates of exposure through food to calculate combined dietary exposure through food and
water. In acute dietary exposure assessment, however, the risk assessor is trying to estimate the
9

-------
range of exposures that individuals could encounter on a single day and determine the exposure to
which "high-end" persons could be subjected (where "high-end" is defined as a plausible estimate
of exposure for those individuals at the upper end of the exposure distribution). OPP is using
Monte Carlo techniques (and its current 99.9th percentile approach) for these acute food
exposure assessments only. OPP is not using Monte Carlo techniques at
this time for chronic exposures due to the limitations of the existing food
consumption data. EPA and USD A, however, are exploring statistical
techniques that may allow such analyses in the future. The Monte Carlo
Guidance document provides additional information regarding the tiering
process used in acute assessments, for both probabilistic and non-
probabilistic assessments (U.S. EPA, 1998a).
2. The Risk Equation
Dietary risk can be expressed as a function of toxicity and
exposure.
RISK = f (toxicity, exposure)
That is, to determine risk - which can be either acute (one-day) or chronic
(long-term) - one "combines" a value representative of the toxicity for
the pesticide with the amount of pesticide to which an individual is
exposed. The notation above is not meant to imply that the two
quantities represented by "toxicity" and "exposure" are necessarily
multiplied together, just that toxicity and exposure are two quantities
which together determine risk.1
The toxicity part of the risk function is typically expressed as an
acute reference dose (aRfD, in units of mg/kg body weight per day). An
aRfD is an amount of toxicant (in mg/kg bw/day) to which a person can
be safely exposed for one day. In general, an aRfD is set at a level at least
100 times smaller than the no-observed-adverse-effect level (NOAEL, in
units of mg/kg bw/day), if the NOAEL used is from a controlled
toxicological study in laboratory animals. The NOAEL is defined as the largest amount of
toxicant (in units of mg/kg bw/day) which produces no observed adverse effects. The factor of
100 is a generally applied adjustment, (sometimes called a "safety factor" or, more frequently, an
"uncertainty factor") to account for the potential that humans could be more sensitive to the toxic
effects of a compound than laboratory test animals (10 X) and that some humans could be more
1 When toxicity is measured in terms of cancer-causing potential (e.g., in terms of a slope
factor as a Q*) then toxicity and exposure are multiplied together. When toxicity is expressed in
terms of an RfD, then the reciprocal of the RfD (which is representative of toxicity) is multiplied
by the exposure to obtain the estimated risk.
Units of Measure
mg/kg bw/dav. Milligrams
of pesticide per kilogram of
body weight per day.
us. Microgram.
g. Grams
Measures of Toxicity
aRfD. Acute Reference
Dose. An amount of
toxicant (in mg/kg bw/day)
to which a person can be
safely exposed for one day
NOAEL. No Observed
Adverse Effect Level.
Largest amount of toxicant
(again in mg/kg bw/day) in a
controlled toxicological study
which produces no adverse
effect in a test animal
Uncertainty Factor. A series
of safety factors by which
the NOAEL is reduced to
obtain the aRfD. Usually,
these consist of an
interspecies factor (lOx) and
an intra-species factor (lOx).
10

-------
sensitive than others (10 X).
The dietary food exposure part of the function is derived from two distinct pieces of
information: the amount of pesticide residue that is present in and on food (i.e., the residue level)
and the types and amounts of food in a person's diet (i.e., food consumption). The residue
information comes mainly from the crop field trials submitted by pesticide manufacturers and
USDA or from monitoring data collected by the USDA and FDA (see Section I.C.3.(b)).
Consumption information comes primarily from USDA surveys of what people eat (see Section
I.C.3.(a)).
The basic equations for acute dietary food risk assessment are:
Exposure (mglkg bw/day) = Consumption {kg food/kg bw/day) x Residue {mg pesticide/kg food)
aRfD {mglkg bw/day) = 	N0AEL		
interspp. factor (lOx) x intraspp. (lOx) factor
Population Adjusted Dose {aPAD) {mglkg bw/day) = 	aRfD {mglkg bw/day)	
any additional FQPA-only Factor
% aPAD = ExPosure (mSlkg bw/day) ^ 1Q()
aPAD (mglkg bw/day)
Once the aRfD is derived, OPP identifies the population for which the assessment is being
done. If this population includes the fetus, infants and/or children, a determination concerning
retention, reduction, increase, or removal of the FQPA 10X Safety Factor must be made
considering potential pre- and post-natal toxicity and the completeness of the exposure and
toxicity databases. If a decision is made to retain an additional safety factor for reasons solely
based on the FQPA (e.g., increased susceptibility in infants and children), the aRfD would be
modified and the resulting allowable exposure is termed the acute Population Adjusted Dose
(aPAD).
The value of the "%aPAD" reflects the relative size of the aPAD and the estimated
exposure. If the estimated exposure is less than the aPAD, the value will be below 100%.
Conversely, if the exposure is estimated to exceed the aPAD, the value will be greater than 100%.
Traditionally, if the "%aPAD" is less than 100%, the estimated exposure is considered "safe".
11

-------
3. Data Bases Used in Probabilistic Dietary Exposure
Estimates
Currently, OPP is developing acute, probabilistic dietary
exposure assessments using Monte Carlo techniques that require data
on (1) the distribution of daily consumption of specific commodities
(wheat, corn, apples, etc.) by specific individuals (in g commodity/kg
bw/day), and (2) the distribution of concentrations of a specific
pesticide in those food commodities (in //g pesticide/g commodity).
The latter information is generally obtained from crop field trials,
USDA's Pesticide Data Program (PDP) data or FDA monitoring
data, market basket surveys conducted by the registrants, and other
sources while the former is collected by USDA in its Continuing
Survey of Food Intake by Individuals (CSFII). These two input data
sources, the USDA CSFII and the residue data sources, are discussed
below.
(a) Food Consumption: USDA Continuing Survey of
Food Intake by Individuals
The food survey data used in the OPP's probabilistic
exposure and risk assessments are collected by the U.S. Department
of Agriculture and are currently from the 1989-91 Continuing Survey
of Food Intake by Individuals (CSFII)2. The 1989-91 CSFII,
conducted as three separate 1-year surveys in 1989, 1990 and 1991,
was designed to measure what Americans eat and drink. The USDA
has been conducting such food surveys since the 1930's by means of
personal interviews in which interviewers ask individuals to recall
everything they ate and drank over the previous 24 hours. The uses
of Food Survey Research Group survey data are varied and include
the assessment of dietary intakes, dietary trends and food
consumption economics; the development of policies for food
assistance, food labeling and food safety programs; and the
implementation of dietary guidance and nutrition education programs.
Information from the surveys also is widely used across the U.S. to
develop nutrition and education programs, to assess dietary changes associated with participation
in food programs, to develop food fortification and enrichment policies, to monitor the safety of
the food supply, and to assess demand for agricultural products and marketing facilities. In
What Does the Term Distribution
Mean?
Think back to the classic bell curve
we learned about at some point in
our school days. When grades
were being determined, some of us
had scores that were either on the
low -end or high-end of the range
while most of us had scores in the
middle. If the frequency of
occurrence were plotted, the
resulting distribution of grades
would resemble the bell curve:
Essentially what the bell curve tells
c
Letter Grade
us is that most things are near the
middle - there are far fewer
occurrences at the extremes.
The distribution of the pesticides
residues does not typically follow
this bell-curve shape. Instead, the
curve is generally right-skewed
with a long tail to the right. In the
above diagram, it would be as if the
vast majority of grades were A's
and A-'s with very few C's, D's,
and F's.
For an additional explanation of
how this exposure distribution
curve is interpreted in risk-based
decision-making, see the Appendix
to this document
2
Data from the recently completed 1994-96 CSFII have now been released by USDA and are expected to be
incorporated into the OPP's risk assessments beginning in Spring, 2000.
12

-------
accordance with federal data reporting guidelines, USDA identifies and cautions users of its
databases about the lack of adequate numbers of data points for certain statistical projections.
For example, some of the commodities for which EPA sets tolerances are eaten so infrequently
that USDA cautions against using the survey data to estimate high-end percentiles of consumption
of such commodities, e.g., the 95th percentile or greater.
CSFII (1989-91) data are derived from information provided by 15,128 individuals who
participated in the survey. One-day food and nutrient intake data for individuals of all ages were
collected between April 1989 and March 1992. Individuals who took part in the survey were
asked to provide three consecutive days of dietary data. The first day's data were collected in a
personal in-home interview using a 1-day dietary recall. The second and third days' data were
collected using a self-administered 2-day dietary record. Intake amounts were reported and
energy and nutrient intakes were calculated using the USDA Nutrient Data Base for Individual
Intake Surveys. Subject to the cautions about statistical treatment of data, the data collected for
such large numbers of survey participants, who have been scientifically selected so that results
could be projected from the sample to the U.S. population, constitute a reliable and representative
national sample.
(b) Residue Data Sources: Field Trials, Monitoring, and Market Basket Surveys
In addition to the food consumption data provided by USDA's CSFII, information on the
distribution of residue levels in foods is necessary in order to calculate exposure and risk in a
probabilistic manner. Data on the distribution of residues on foods for use in OPP's probabilistic
exposure and risk assessments can be obtained from a variety of sources including: (1) crop field
trials; (2) FDA enforcement monitoring; (3) USDA PDP monitoring; (4) specialized market
basket surveys (usually conducted by a pesticide registrant); and (5) studies on the effects of
commercial processing, peeling, washing, cooking or other activities that may affect residue
levels. Crop field trials are experimental trials, usually performed by a pesticide company or
USDA, in which the maximum usage scenario (with respect to application rate, number of
applications, pre-harvest interval, etc.) is simulated. These OPP-required experimental trials are
conducted according to Agency guidelines, primarily to determine maximum residues that may be
present in fruit, vegetable, grain and other food and feed crops at the earliest point where these
food commodities could enter commerce. These data are used to establish legally enforceable
pesticide tolerance limits.
In contrast to the pesticide residue data collected during the experimental field trials, FDA
and USDA pesticide monitoring data (as well as registrant-sponsored, market basket survey data)
represent residue data in crops collected from commercial trading channels (wholesalers,
warehouses, distribution centers, retailers, etc.). These data better represent pesticide residues to
which consumers are actually exposed because they measure residues in food in commercial
channels that is closer to the consumer than food sampled following experimental field trials
conducted under maximum application scenarios.
13

-------
OPP prefers to use data from FDA or USD A PDP monitoring data or market basket
surveys, when available, in calculating pesticide exposure estimates. However, these data are not
always available or appropriate for use; when this is the case, OPP uses pesticide residue data
collected from the experimental field trials. As the field trial data represent residues resulting
from a maximum application scenario to which only very few crops are actually subjected, OPP
may refine these data to take into account other factors such as residue degradation as a result of
transport or storage, or variabilities in farming practices such as use of longer than label pre-
harvest intervals and lower than label application rates (U.S. EPA, 1999b). In addition, OPP's
exposure estimates can be modified or adjusted, as appropriate, to take into account decreasing or
increasing concentrations in processed commodities as a result of commercial processing practices
or (generally) decreased residues as a result of cooking or in-home preparation such as washing,
peeling, coring etc. (U.S. EPA, 1999c) Finally, information on the percent of the crop which is
treated, if available, is also used to adjust the probability of encountering a treated commodity.
4. DRES and DEEM
Until 1998, OPP used a software program called the Dietary Risk Evaluation System
(DRES) to conduct its acute dietary risk assessments for pesticide residues in foods. Acute
assessments conducted with DRES assumed that 100% of a given crop with registered uses of a
pesticide was treated with that pesticide and that all such treated crop items contained pesticide
residues at the maximum legal (tolerance) level. The resulting DRES acute risk estimates were
considered "high-end" or "bounding" estimates.3 However, it was not possible to know where the
pesticide exposure estimates from the DRES software fit in the overall distribution of exposures
due to the limits of the tools being used. Thus, risk management decisions were being made not
only without a full picture of the distribution of risk among the population, but also without full
knowledge of where in the distribution of risk the DRES risk estimate lay.
OPP is now using the Dietary Exposure Evaluation Model (DEEM) computer software
program for its dietary exposure and risk assessments for pesticide residues in food. Like the
DRES model, DEEM calculates acute and chronic risk using the inputs of: pesticide residues in
and on food, food consumption, and toxicity. Also, like DRES, DEEM is able to calculate an
estimate of the risk to the general U.S. population in addition to 26 population subgroups,
including five subgroups for infants and children:
• U.S. population • children 7-12 • Females 13+, pregnant/not
nursing
A "high-end" estimate is conceptually one that falls between the 90th percentile of the actual exposure
distribution but below the exposure to the person in the population who has the highest exposure. It is a plausible
estimate of the individual exposure for those persons at the upper end of the exposure distribution. A "bounding
estimate," on the other hand, purposely overestimates the exposure or dose in an actual population for the purpose of
developing a statement that the risk is "not greater than..." (U.S. EPA, 1992a. Guidance on Risk Characterization for
Risk Managers and Risk Assessors).
14

-------
U. S. population -spring
U. S. population—summer
U.S. population—autumn
U. S. population—winter
all infants
nursing infants (<1 yr)
non-nursing infants (<1
year)
children 1-6
Hispanics
Non-Hispanic Whites
Non-Hispanic Blacks
Non-Hispanic (other than
Black or White)
Females 13-19, not
pregnant or nursing
Females 20+, not pregnant
or nursing
Females 13-50
Pacific
Females 13+nursing
Males, 13-19
Males 20+
Seniors 55+
Northeast
Midwest
South
West
Unlike DRES, DEEM can generate probabilistic assessments of acute dietary food
exposure. DEEM uses a mathematical technique called Monte Carlo analysis to generate
estimates of the distribution of pesticide dietary exposures. That is, it uses all the individual food
consumption and pesticide residue level data points included in a data set to determine the
combined (or joint) distribution of exposures (and associated risk). For more information on the
interpretation of exposure distribution curves generated by the DEEM software, see Appendix I -
"Primer on Interpretation of Exposure Distribution Curves." At this time, OPP uses the DEEM
probabilistic (Monte Carlo) model to develop probabilistic exposure estimates only for acute
assessments.
The Monte Carlo technique provides a relatively new tool in exposure assessment for
more accurately estimating the complete distribution of exposures, and provides probabilistic and
statistical assessment of dietary risk using more refined information than was used previously.
This analysis uses the actual distribution of pesticide residue levels from either the experimental
field trials performed by the registrant or monitoring or market basket surveys, whereas in DRES
only a single, high-end residue value was used. Also, it can incorporate information on the
percentage of the crop which is treated. That is, it includes the actual distribution of possible
consumption and residue values and weighs these possible values by their probability of
occurrence. Using Monte Carlo, OPP does not assume (as was previously done with DRES) that
100% of the crops with registered uses are treated with the pesticide of interest or that all
residues are present in crops at maximum legal (tolerance) levels. Rather than the crude "high-
end," single point estimates provided by DRES, Monte Carlo provides more accurate information
on the range and probability of possible exposure and their associated risk values.
Monte Carlo techniques are, in and of themselves, neither more conservative nor less
conservative than the DRES system they supplement: the "conservatism" is determined, in part,
by the risk manager when he or she determines the appropriate percentile of the model's output
15

-------
distribution (e.g., 99.9th percentile) to be used for regulation in conjunction with the nature of
inputs selected and assumptions used. Monte Carlo and probabilistic techniques are simply tools
that potentially allow the risk assessor and manager to see a more accurate distribution of risks
among the general population and subpopulations.
5. DRES 95th Percentile Dietary Food Exposure vs. Monte Carlo 99.9th Percentile
Dietary Food Exposure
The Agency has in the past used the estimated 95th percentile of exposure in calculating a
threshold of concern with an acute DRES analysis. Concerns have been raised about what is
seen by some as a significant "raising of the bar" by now choosing to refer to the estimated 99.9th
percentile of exposure from a Monte Carlo analysis. While it may appear at first that the Agency
is taking a more stringent approach, this is actually not so. Estimated exposure at the 99.9th
percentile calculated by DEEM probabilistic techniques is significantly lower than exposure
calculated by DEEM using DRES-type non-probabilistic assumptions at the 95th percentile for
most cases reviewed by OPP to date. There are several reasons for this. An acute DRES analysis
assumes that residues are present at tolerance levels in all crops that have registered uses and
100% of the crop is treated at maximum label rates and harvested at minimum pre-harvest
interval.
In general, Monte Carlo techniques will provide lower and more realistic estimates of
exposure than previous DRES techniques when:
• a lower percentage of the crop is treated (e.g., 10% rather than 100%);
• The bulk of residue values from crop field trials are present at low levels and there
are only a few high values; and
• a greater number of crops are registered (e.g., 10 crops instead of 2 crops).
For example, a given food item (e.g., cherries) can have several dozen or more individual
residue values generated from experimental field trials for a certain pesticide. In an acute DRES
analysis, only the highest residue value (or tolerance level) would be used and all registered crops
would be assumed to be treated and contain these high residue values. In a Monte Carlo run, the
entire set of actual residue data points generated in the crop field trials and the percent of the crop
which was treated would be considered. The differences between the estimated exposure
numbers generated by these two techniques can be substantial, with the Monte Carlo generated
estimated exposures (at the 99.9th percentile) frequently many times lower than DRES-like
estimated exposures (at the 95th percentile). Table 1 illustrates some of these extensive
differences in exposure estimates for a widely used agricultural pesticide which was recently
evaluated by OPP:
16

-------
TABLE 1. Comparison of DEEM 95th Percentile Exposure and %aRfD Estimates From a Tier 1 Analysis to
Monte Carlo 99.9th Percentile Exposure and %aRfD Estimates from a Tier 3 Analysis for One-Widely Used
Agricultural Pesticide
(expressed on a per capita basis using 1989-91 CSFII Data)
Population
Subgroup
Exposure (mg/kg bw/day)
%aRfDa
DEEM
95th Percentile
Estimate
(Tier 1)
DEEM
Monte Carlo
99.9th Percentile
Estimate
(Tier 3)
DEEM
95th Percentile
Estimate
(Tier 1)
DEEM
Monte Carlo
99.9th Percentile
Estimate
(Tier 3)
U.S. Population
0.0192
0.0013
770
50
Infants
0.0375
0.0007
1500
38
Children 1-6
0.0402
0.0017
1610
67
Females 20+/np/nnb
0.0126
0.0011
510
45
Males 20+
0.0119
0.0014
480
55
a The %aRfD represents the portion of the acute "risk cup" which is occupied. The %aRfD is obtained by dividing
the estimated exposure at any given percentile (e.g., 95th or 99.9th percentile) by the aRfD. It should be
remembered that the aRfD may be modified to reflect the decision with regard to the FQPA lOx Safety Factor. This
modification results in an acute Population Adjusted Dose (aPAD). Comparison of the estimated exposure to the
resulting aPAD is then done to determine the acceptability of that exposure.
b Females 20+, not pregnant, not nursing
As can be seen, estimated exposures (and corresponding %aRfDs) are significantly lower
at the 99.9th percentile DEEM/Monte Carlo analysis than they are at the 95th percentile using a
DRES-like analysis. This is almost invariably the case. In fact, at all comparable percentiles, the
exposure estimates derived from DEEM/Monte Carlo are lower than the corresponding DRES-
like estimates. The advantage of this probabilistic technique is that it can refine the exposure and
risk estimates by more fully incorporating all available information and minimizing reliance on
values chosen more for their regulatory and administrative convenience than their scientific merit.
In short, DEEM/Monte Carlo analysis tends to provide a lower (but more reliable)
estimate of actual exposure in exactly those situations where DRES (or DRES-like analysis) is
least realistic. OPP will continue to use the estimated 95th percentile of exposure in calculating a
threshold of concern when actual tolerance levels and 100% crop treated assumptions are used
during exposure assessment, but recognizes that this approach can significantly overestimate
actual exposure levels. In those cases where exposure estimates at the 95th percentile using less
refined assumptions are greater than the regulatory threshold of concern, OPP's policy is to use
Monte Carlo techniques to assess estimated exposure at the 99.9th percentile using more refined
data. In practice, risk assessments done at the estimated 99.9th percentile using more refined data
almost invariably result in lower estimated exposures (and corresponding estimated risk) than
assessments performed at the estimated 95th percentile of exposure using less refined data.
17

-------
II. Issues Related to the Methodology and Data Bases Used in Acute Dietary Risk
Assessment
Concerns have been raised among the academic, public health, industry, and grower
communities with regard to the appropriateness of the estimated 99.9th percentile of exposure as
the default decision point for regulation when using probabilistic techniques for acute dietary risk
assessment. Specifically, these concerns include: the presence of "outliers" in the pesticide
residue and food consumption data; the representativeness of the data sets used in Agency risk
assessments; the limited size of the input data bases; the reliance on "uncertain" consumption
values which fall at the extreme tails of the distribution when generating exposure estimates;
potential variability/uncertainty in the recipes used to convert the reported food items (on an "as
consumed" basis) to agricultural commodities used in the DEEM software; and the degree to
which OPP's 99.9th percentile estimate incorporates conservative default assumptions. Because
of these areas of concern, issues have also been raised about the interpretation of the output
developed by the Monte Carlo technique. Some contend that, if the input data are not reliable and
representative, neither are the outputs of any technique using such data. Therefore, they contend
that the Agency should not use the 99.9th percentile of estimated exposure as a starting point for
regulatory decision making and/or should make adjustments in the data sets which are inputs to
the exposure assessment. Specific concerns addressed in this section include the following:
•	Inclusion of high-end values from USDA food consumption input data sets (see
section HA.);
•	OPP use of upper-end residue data (see section II.B.);
•	OPP consideration of the size and representativeness of the data bases for
consumption estimates and residue profiles (see sections II.C. and D.);
A.
Potential inaccuracies in recipe translations (see section
II.E.); and
OPP use of conservative default assumptions in
treatment of pesticide residue data (see section II.F.).
Treatment of High-End Consumption Values
("Outliers") in USDA CSFII Survey
Concern has been expressed that the USDA's food consumption
data have not been properly evaluated to identify potential errors in the
What's An Outlier?
In a data set, an outlier is a
number that greatly differs
(or is substantially
removed) from the bulk of
a data set. That is, it is a
value that is much larger
or much smaller than most
of the other numbers in
the set. It does not
necessarily represent an
invalid data point, but may
simply represent an
unusual or rare, but still
very real, occurrence.
18

-------
data sets or to assess the potential impacts of outliers on the estimated 99.9th percentile of
exposure. As a result, some contend that errors are propagated throughout OPP's Monte Carlo
analysis resulting in distributions which inappropriately and artificially inflate estimates of risks at
the upper ends of the distribution. Consequently, some believe that tests for outliers should be
conducted, and "outliers" should be removed from the data set, so that the high end of estimated
risk is not defined by the outliers. They state that OPP's failure to do this means that the results
may not be reliable or scientifically-based.
The Agency shares this basic concern and does not want to use data which are not reliable
measures of food consumption. By the same token, OPP does not want to ignore data which
measure real, but perhaps relatively infrequent, consumption events. Anytime a survey as
extensive as USDA's CSFII is conducted, high consumers of a particular food item will be found
and reported. It is also true that some portion of the food consumption reports in the initial
database may be erroneous. Thus, outliers may be present in the raw survey data. Given that
very high food energy intakes do occur in the American population (even though they are not
common), considerable judgment is required to determine whether a high-end value should be
declared in error and discarded or should be retained.
To ensure that the 1994-96 CSFII data base is free of erroneous or unreliable data points,
the USD A extensively validated and cross-checked any questionable survey results prior to their
insertion into the 1994-96 CSFII database. For example, USDA survey interviewers were trained
to probe for additional information when unusual intakes of various kinds are reported, and to ask
questions clarifying large reported amounts, and also if the day's intake was typical or not. If the
reported intake was not typical, queries were made about what was atypical, such as the
occurrence of a holiday, a social occasion, or the like. On preliminary review of survey data,
USDA identified high intakes in various areas (i.e., high consumption of certain foods or high
energy intake) and evaluated the reported intake for feasibility, including notations made by the
survey interviewer relating to the perceived validity of the reported consumption. USDA
carefully reviewed all of the data resulting from its 1994-96 CSFII survey to assure that the
reported results are as close to reality as possible. USDA paid particular attention to data points
that may be "outliers," and that may have been needed to be removed to characterize food
consumption accurately. All reported high intake values retained in the 1994-96 CSFII database
have been checked by USDA and resolution or adjudication of values outside specified ranges
have been accomplished. Thus, the USDA 1994-96 CSFII data base has been properly evaluated
and contains accurate and reliable consumption values that OPP will use in assessment of human
dietary exposure to pesticide residues. OPP has similar confidence in the 1989-91 CSFII survey
data.
OPP does, however, recognize that unusually high intakes can potentially "drive"
calculated exposure and risk estimates and believes that it may be inappropriate to base risk
management decisions on unusual consumption values, particularly if these consumption values
dominate high-end exposure estimates. Therefore, OPP has decided that risk characterizations
will include a "sensitivity analysis" which will take advantage of a recent upgrade to the DEEM
19

-------
software program which is now capable of generating a "Critical Exposure Contribution" (CEC)
analysis when run in the acute Monte Carlo mode. The CEC provides insights into the sources
contributing to the exposure estimated for the most highly exposed people in the exposure
distribution. This listing contains a detailed exposure analysis for individuals having a total
exposure greater than a user-specified "CEC exposure value" (at present, typically around the
99.9th percentile of exposure) in the user distribution profile. The display includes key
demographic information (gender, age, body weight), the food(s) consumed, amount consumed,
the residue value, the total daily exposure estimate, and the exposure estimate by food. Thus, the
CEC provides OPP with comprehensive information on foods (and food-forms) that account for
the largest portion of the person's estimated exposure. If OPP finds that the high-end exposures
are principally driven by suspect high-end consumption values, OPP's risk mitigation decisions
can appropriately consider and weigh these factors.
B. Treatment of High-End Residue Values ("Outliers") in Crop Field Trial or
Monitoring Data
As with the food consumption data, some have stated that the residue data included in
Monte Carlo assessments have not been properly evaluated to identify potential errors in the data
sets or to assess the potential impacts of these outliers on the estimate of the 99.9th percentile
risk. OPP acknowledges that it is not uncommon, when field trial residues comprise the data sets
used in a probabilistic assessment, that these data include one or more residue values which are
significantly higher than the other measured concentrations.4 Just as with food consumption data,
it is important to assure that these data are as accurate as possible. Retaining an erroneous high-
end value may result in overestimating exposure, but discarding accurate high-end values may
lead to an underestimate of exposure.
Even though OPP may previously have reviewed and relied on a data set, each pesticide
residue point in the residue data sets included as input to any Monte Carlo analysis is carefully
reviewed and verified by OPP staff scientists because of the recognized potential impacts outliers
could have on the high-end exposure estimates. OPP's longstanding approach to outliers has
been articulated in the recent draft document "Guidance for Submission of Probabilistic Exposure
Assessments to the Office of Pesticide Programs" (U.S. EPA, 1998a) The decision to discard an
outlier is based on a scientific or quality assurance basis, and is only made with extreme caution,
particularly for environmental data sets which can often contain legitimate extreme values. OPP
believes that statistical tests can be used to identify possible outlier data points which require
further investigation, but that it is inappropriate to eliminate outliers from analysis on this basis
alone unless further review of the suspect data points reveals a significant mistake in protocol
which renders a generated residue value irrelevant to label conditions (e.g., wrong tank mix
4 Frequently, high-end field trial values are the same data initially provided to the Agency by a pesticide
company to support OPP's original decision to allow marketing for its product. In fact, these high end residue values
likely were used in establishing EPA tolerances. Occasionally, these outliers have represented a sizable fraction of the
submitted data sets.
20

-------
concentration, mistaken application rate, too early a PHI, too many applications, etc.) or there is
some other basis to conclude that the data point is not appropriate for use. This is particularly
true in cases where the data points in question have been used by the Agency in establishing a
tolerance or other regulatory limit.
Occasionally, high-end values may be found among the data from the USD A or FDA
monitoring programs. The Agency relies on the extensive QA/QC procedures followed by
USDA's PDP program and the FDA program to determine which data points should be retained.
Therefore, OPP normally does not discard any of these values, absent other evidence of their
invalidity. For example, the monitoring data provided by USDA's Pesticide Data Program are
collected under rigorous QA/QC procedures which include method validation (determination of
limit of detection and limit of quantitation for each pesticide/crop combination), confirmation of
residue identity by alternate detection system, use of blanks, spikes, and internal and external
standards, as well as verification of the analyst's performance (check samples, audits, etc.).
Similarly, FDA uses official analytical methods that include blanks and fortifications, and requires
confirmation of residues of regulatory significance by use of an alternate detection system and
verification of results by a different analysis. These measures are intended to ensure integrity of
monitoring data from the sample collection to data reporting.
As with high-end consumption values from the USD A food consumption survey, OPP
scientists will, as part of the risk characterization, inform the risk manager if high-end residue
values are driving the upper ends of the exposure using DEEM's CEC analysis. If specific
pesticide residues on certain crops substantially contribute to the majority of the exposures above
the specified percentile, the risk manager can incorporate this information into the risk decision
and determine an appropriate Agency response.
C. Representativeness
OPP combines both residue data (from crop field trials, monitoring programs, and market
basket surveys) and food consumption data (from USDA CSFII surveys), using the DEEM
computer software program to generate estimates of the distribution of daily exposures to
pesticide residues in food for the general U. S. population and 26 specific subgroups within the U.
S. population. The reliability of the estimate of the distribution of exposures depends on the
quality of the data used in the model. The data sets used should be sufficiently representative to
support reliable estimates.
The relationship between representativeness and the reliability of estimates of the
distribution of exposure is easy to understand. Even if all of the values in a data base are accurate
(see discussion in II A. and IIB.), the use of a data set in a probabilistic assessment will produce
unreliable exposure estimates to the extent that the data sample is unrepresentative of the larger
population it purports to represent. For example, if no one from low income groups were
interviewed about their eating habits, the survey results would miss the very real impact that
income has on dietary choices.
21

-------
The food consumption data used by OPP are collected by the USD A through a survey
that is carefully designed to assure that the results would be representative of the U. S.
population. The survey design specifically requires that samples be collected from people who
differ in ways that could affect the types and amounts of foods they eat. For example, the survey
covers people of different ages, genders, ethnicity, regions of the country, and socioeconomic
status. People who are selected for interviews are contacted on different days of the week,
scattered throughout the year to capture differences due to the time of year or day of the week. A
number of other aspects of the survey are also controlled in order to maximize the prospect that
the results are representative not only of the entire U. S. population, but also particular
subgroups, including those for which OPP generates acute dietary food exposure distributions.
While the USDA food consumption surveys are designed to be generally representative of
the U. S. population, it is clear that some factors that can influence dietary choices are not
addressed in the survey design. For example, the CSFII surveys do not purport to be
representative of people in institutional living arrangements (colleges, nursing homes, etc.) or of
different religions or health status. In addition, concern has been expressed about how
"representative" the survey results are at the high ends of consumption. This concern, in effect,
involves the size of the food consumption data bases. The Agency addresses this concern in
Section II.D. of this issue paper and presents a summary of the results of further analysis of the
methodology in Section III.
The various data bases on pesticide residues in food raise different set of issues with
respect to "representativeness." If market basket or monitoring data are not available, OPP will
use residue data sets generated by the registrants and submitted to the Agency for tolerance-
setting purposes. The field trial studies are designed to follow the directions on the product
labeling and are required to be performed in different areas of the country where the crop on
which the pesticide is being used is grown. Multiple field trial sites are required if the crop is a
significant component of the diet (e.g. wheat, corn, tomatoes, etc.) and if it is grown in
geographically and climatically distinct regions. Of necessity, these are conducted at maximum
label rates and minimum label pre-harvest intervals in order to establish maximum legal residue
limits (tolerances) on food. Thus, the data set resulting from the required field trials represents
the distribution of residues that are likely to be found in a particular raw agricultural commodity
following a maximum label application scenario. But, due to the design of these field trial
studies, the data are not likely to be representative of the residue values in food, as consumed. As
discussed in section I.C.3., adjustments can be made to these data to better represent the amount
of pesticides actually used (incorporating, for example, the range of typical application rates and
typical PHI's and percent crop treated). Further adjustments and refinements can also be made to
better reflect actual exposures; these can include cooking studies, residue degradation studies,
washing/home processing studies, etc.
An even more representative picture of the amounts of pesticides in food to which the
U.S. population is exposed can be obtained when OPP uses data from market basket surveys or
from USDA PDP or FDA monitoring. These data sources are considered to be more
22

-------
"representative" of actual exposure to consumers than field trials conducted under conditions
using maximum rate, minimum PHI, and other use conditions likely to lead to the highest lawful
residue. Market basket surveys, for example, are statistically designed and are conducted on a
single-serving basis at the point of sale to consumer. These types of studies, thus, best reflect
those residues to which consumers are actually exposed. The USD A, too, exercises great care to
assure that the food items sampled in its PDP program are representative of the large majority of
that type of agricultural commodity consumed in the country. These monitoring data are also
designed to be statistically representative of commodities which are typically available throughout
the year, except that they represent five pound composite samples (and not single-serving items)
collected at distribution points just before release to supermarkets and grocery stores. In
addition, PDP commodities are washed, peeled, de-stemmed, or cored, as appropriate, prior to
laboratory analysis to represent typical consumer practices. FDA surveillance monitoring data are
geared more to tolerance enforcement and not toward OPP's risk assessment needs. Collection
occurs as close to the farm gate as possible and the program is not designed to generate
statistically representative samples for use in risk assessments. Due to sampling and collection
methodologies, residues measured under the FDA surveillance monitoring program likely
overestimate pesticide residues to which consumers are exposed. Nevertheless, they are
considered more representative of residue levels to which consumers are exposed than the
experimental field trial data submitted for tolerance-setting purposes.
D. Size of the Input Data Bases.
In addition to being accurate and representative, the data sets should also be sufficiently
large to permit characterization of the overall exposure of the population of interest. As noted
earlier, each person in the USDA's 1989-91 CSFII survey currently being used by OPP was asked
to contribute information for three different days. While not every respondent provided the full
three days of responses, the 1989-91 USDA food consumption survey includes full (three-day)
data from 11,912 individuals. The CSFII data base used in the DEEM software collectively
represents food records for a total of 35,736 unique person days. Moreover, since each person
typically eats many different types of food during a day, there are a great many data points for
consumption of specific foods in the data base. Table 2 summarizes the same information for
several of the 27 populations that OPP evaluates in its probabilistic exposure assessments.
TABLE 2. Size of 1989-91 and 1994-96 CSFII (with SCS) Data Base
Size of 1989 - 91 CSFII Data Base
(all three days)
Size of 1994-96 CSFII Database with
Supplemental Children's Survey
(both days)
Population Group
Number of
Respondents
Number of
Intakes
Number of
Respondents
Number of
Intakes
23

-------
U.S. Population
11,912
35,736
20,607
41,214
Infants <1
202
606
1,486
2,972
Children 1-5
1,067
3,201
6,487
12,974
Children 6-11
1,172
3,516
1,913
3,826
Females 12-49
3,459
10,377
3,021
6,042
Males 20+
3,381
10,143
4,751
9,502
Note: OPP is currently using the 1989-91 CSFII dataset, but expects to start using the combined 1994-
96 CSFII database and the Supplemental Children's Survey in the second quarter of calender year
2000.
Despite the overall large scope of the USDA CSFII database, some contend that USDA
survey population sample sizes are of insufficient size to provide reliable estimates at the high end
of exposure and risk, and that there is a need for specific OPP criteria for a minimum number of
samples before an estimate is derived and used in establishing policy. The major focus of this
discussion has been the small number of data points at the extremes of the consumption
distribution for any given commodity. In other words, a very small number of people may have
reported having eaten a food containing a particular commodity. Some state, in particular, that
for many infrequently-consumed commodities or for small population subgroups, an adequate
number of individuals is not available to calculate a high-end consumption percentile. They say,
therefore, that the percentile exposure represented by high-end consumers of infrequently eaten
foods is highly uncertain. They further note that USDA has identified minimum population sample
size criteria for estimating various percentiles of food consumption, and recommend that USDA
flag estimates that do not meet these criteria. They believe that OPP should not use data points
that would fall at a percentile which would be flagged by the USDA. Rather, they argue that such
high-end (and "uncertain") values should be discarded (or otherwise adjusted) prior to using the
data set to perform probabilistic exposure analyses.
OPP recognizes that there are limits with respect to the USDA food consumption data
base which would affect the reliability of estimates of high-end consumption of particular
commodities. In particular, for many infrequently-consumed commodities and for small
population subgroups, an adequate number of individuals may not be available to calculate a
reliable high-end consumption percentile5. However, the ability to define a high-end consumption
5 The Agency and USDA, together, recognized this as an issue and initiated a Supplemental Children's Survey
(SCS) for the 1994-1996 USDA CSFII. The 1994-96 CSFII contains data for approximately 5700 children up to 18
years of age and the CSFII-SCS will provide intakes on approximately 5000 additional children through 9 years of age,
based on OPP sample size needs. The sample design is the same as that used for the 1994-96 CSFII so that the data
from the SCS can be merged with data from the 1994-96 CSFII. The size of the merged dataset (1994-96 CSFII and
SCS combined) is shown in the right-hand columns of Table 2. In addition, OPP notes that the next food consumption
survey will be conducted jointly by USDA and CDC's National Center for Health Statistics (NCHS) and is expected to
24

-------
percentile for each commodity in an exposure assessment is not necessarily critical to ensuring
that the exposure assessment can define high-end exposure. As discussed more fully below, the
concern about the size of the input data base is not directed at whether the data sets are adequate
to define high-end percentiles of pesticide residue levels in a particular food or consumption of a
specific food form. Rather, the concern is whether the data bases are sufficiently large to
characterize accurately the distribution of daily pesticide exposures from all foods which an
individual eats in any given day. The distinction between estimating high-end percentiles of
exposure and high-end percentiles of consumption (or residue) for a particular commodity is an
important one. It may be that the part of the exposure distribution which is derived from (or
includes) any single (presumably uncertain) upper-end USD A consumption value, does not
necessarily produce an invalid exposure value. Many of the upper-end exposure estimates might
not contain upper-end USD A consumption values and thus these uncertain USD A consumption
values may not be driving the high-end Agency exposure estimates at all.
Although this concern might need to be heightened when OPP's probabilistic exposure
assessments involve only a single commodity with few residue data points, one or even a few very
high food consumption values do not appear likely to be the primary driver(s) of exposure and
disproportionately influence the outcome of the DEEM exposure estimates (see Section III of this
document for a brief summary of the testing results. A more detailed explanation can be found in
the Response to Comments document associated with this policy document). The more
commodities which are included in the analysis, the more unlikely it is that the upper-end
exposure values are driven by upper-end USD A consumption values.
Finally, as discussed above with respect to the accuracy and representativeness of the
input data bases, OPP will perform a sensitivity analysis on all probabilistic assessments of dietary
exposure. The CEC module will identify the critical input data points, and the Agency can decide
whether to rely on the estimates of high-end exposure in its risk management decisions.
E. Potential Variability/Uncertainty in Recipe Translations
Another area of expressed concern is potential variability/uncertainty in the recipe
translations used to convert foods on an "as eaten" basis (e.g., pizza) to foods on an agricultural
commodity basis (e.g., wheat flour, tomato paste/puree, milk, beef, etc.). In other words, the
USDA CSFII survey requests individuals to report the foods consumed (as consumed) on any
given day, but these foods have to be subsequently expressed on an appropriate agricultural
commodity basis so that processing factors, appropriate matching, and other considerations can
be incorporated into the analysis. This recipe translation information is an intrinsic component of
the DEEM software, and concern has been expressed about the potential for these standard recipe
translations to inadequately reflect the full range of actual recipes used in practice. For example, if
the standard recipe for 100 grams of pizza is assumed to contain 15 grams of tomato paste instead
of only 10 grams (which may have been present in the particular piece of pizza eaten), then tomato
sample an expanded number of persons.
25

-------
paste consumption would in this instance be overestimated. The concern is that it is impossible
for a single standard recipe to be universally applicable and to account for all "real-world"
variations which may exist.
At present, the recipe translation files used by the DEEM software are proprietary and not
available for review. USDA and OPP have, however, collaboratively developed recipe
translations which are expected to be publically-released early in the year 2000 and to be
incorporated shortly after that time into the DEEM software. These translations have been peer
reviewed by government and private industry food and nutrition experts.
While OPP acknowledges that it is impossible for one standard recipe to reflect the
variability that exists in ingredients selected and quantities used in the many kitchens and food
manufacturing facilities across the U.S., OPP does not expect these to be significant sources of
error in its exposure estimates. Firstly, in many instances examined to date it is the fresh
agricultural commodity such as raw apples and tomatoes which are found to be the primary risk
drivers, and commodities in these forms are not subject to translation error (they are not
translated). Only rarely would canned baked apples (as in apple pie) or tomato paste/puree (as in
pizza sauce), for example, be expected to be risk drivers. Secondly, even considering the
variability which exists in the multitude of actual recipes which are used, this variability is unlikely
to be particularly significant or to introduce substantial error in our exposure estimate. If, for
example, the standard recipe for 100 grams of pizza dough contains 70 grams of flour, it is
unlikely that the "true" amount differs from this by more than 20-30% as there are defined limits
as to how far a recipe used in practice can deviate from a standard recipe and still produce an
identifiable and edible product. Finally, variability in recipes can result in either overestimating or
underestimating consumption amounts and exposures and the non-systematic nature of this error
(for each of many translations) is not expected to contribute to large errors which occur
predominantly in any one direction in estimated exposure.
F. Impact of Agency Default Assumptions on the Choice of a Percentile
Exposure Estimate for the Threshold of Concern
Some contend that the use of conservative default assumptions by OPP in its treatment of
pesticide residue data results in the estimate of the 99.9th percentile exposure being significantly
higher than the actual 99.9th percentile exposure. They point specifically to OPP's use of
maximum rate/minimum pre-harvest interval field trial data in the exposure assessment, OPP's
treatment of non-detects, and its use of 95th percentile data from monitoring studies. OPP agrees
that these three assumptions would lead to an overestimate of dietary exposure, but OPP uses
these assumptions only in its early- tiered, screening assessments. OPP's policy is to rely on a
screening estimate of exposure only if the estimate indicates that risk would be acceptable.
Because such screening estimates overstate exposure, OPP refines its exposure and risk
assessments using more realistic data. These refined, higher-tiered estimates do not use the
conservative default assumptions likely to overestimate risk. For more information on these tiers,
26

-------
see "Guidance for the Submission of Probabilistic Human Health Exposure Assessments to the
Office of Pesticide Programs " (U.S. EPA, 1998a).
With respect to residue data, OPP prefers to use data (when appropriate and available)
from market basket surveys, or PDP or FDA surveillance monitoring data in conducting its
pesticide exposure assessments rather than from field trials. However, these market basket or
monitoring data are not always available or appropriate for use. When exposure data are obtained
from field trials, these data can be modified or adjusted to take into account decreasing or
increasing concentrations in processed commodities as a result of commercial processing practices
or decreased residues as a result of cooking or in-home preparation such as washing, peeling,
coring, etc. OPP is also able to incorporate information about lower than label-specified
application rates and longer than label-specified pre-harvest intervals, if available (U.S. EPA,
1999c).
Concerning the treatment of non-detects, OPP has issued two papers containing interim
guidance for handling these type of data: "Assigning Values to Nondetected/ Nonquantified
Pesticide Residues in Human Health Dietary Exposure Assessments" (U.S. EPA, 1998b); and "A
Statistical Method for Incorporating Nondetected Pesticide Residues into Human Health Dietary
Exposure Assessments" (U.S. EPA, 1998c). In the first paper, the Agency describes its policy
to use '/2 the Limit of Detection (LOD) (in place of the full LOD or the limit of quantitation
(LOQ)) for treated commodities in cases where the limit of detection has been adequately
documented. As explained in the science policy paper, empirical data indicate that it is not
unreasonable to assume that treated non-detects contain residues equal to V2 LOD. The Agency
will also be performing sensitivity analyses which can determine whether the assumed residue
value assigned to the non-detect values is driving the risk estimate. In the second paper, OPP has
presented a method for dealing with residue data sets in which many of the observations are
below detectable levels. OPP believes that these refinements in the treatment of non-detects will
alleviate many of the perceived overly-conservative biases in exposure estimates with regard to
the assigning of values to non-detects6.
With respect to concern about use of the 95th percentile residue value from monitoring
data, this policy has recently been revised. Prior to the availability of the probabilistic software, a
registrant could, for blended commodities such as corn, soybean, wheat, etc., use either average
crop field trial concentrations (which reflect the maximum label use scenario) adjusted for percent
crop treated or 95th percentile FDA or USDA monitoring data in its acute risk assessment.
Either of these would be entered as a point estimate for use in an acute risk assessment. The use
of 95th percentile monitoring data had been introduced as an alternative to the use of average
field trial concentrations since OPP believed that this would be a more realistic (but still
6 These policy documents have been revised as per previous public comment and combined into one document
entitled "Assigning Values to Non-Detected/Non-quantified Pesticide Residues in Human Dietary Exposure
Assessments." This is expected to be published in its revised format simultaneously with the publication of the current
Percentile Policy document.
27

-------
conservative) estimate of actual exposures which would take into account actual use practices.
Because probabilistic software is now available, OPP need not rely solely on point estimates of
residue values in its acute dietary risk assessments, and this policy has been revised accordingly.
OPP no longer uses a 95th percentile point estimate from monitoring data for blended
commodities but instead, uses the entire range of monitoring data and therefore incorporates the
entire distribution in its exposure assessment using all of the available monitoring data. Thus, the
concern about OPP reliance on upper-end (95th percentile) monitoring data for blended
commodities in its risk assessments is no longer justified — OPP uses the full set of monitoring
data, thereby fully incorporating the most refined concentration data available.
III. Issues Related to Public Health Policy
The FQPA directs EPA to set or retain tolerances for a pesticide only if the Agency
determines that there is "a reasonable certainty of no harm" from dietary and other non-
occupational sources of exposure. OPP will use available and reliable scientific information to
characterize the toxicity and exposures of a food use pesticide in deciding whether a particular
pesticide meets the FQPA safety standard. Put simply, OPP's goal is to regulate pesticides in
such a manner that everyone is reasonably certain to experience no harm as a result of dietary and
other non-occupational exposures to pesticides.
To implement this statutory standard, OPP has to make a risk management judgment
about what level of pesticide residue in food is consistent with this standard. The reasonable
certainty of no harm standard informs OPP judgment on the overall risk management decision as
well as on component parts of the decision. OPP has decided to express its risk management
judgment for acute dietary risks in quantitative scientific form, as a "threshold of regulatory
concern." By a threshold of concern, OPP means that exposures below the threshold generally
would not be grounds for pursuing risk mitigation but that exposures above the threshold would,
at a minimum, have to be seriously examined to determine whether they exceeded the statutory
standard.
A threshold of regulatory concern for acute dietary risks, when based on probabilistic
exposure estimates, has two elements: (1) a percentile (or proportion) of the population for use in
estimating exposure to a pesticide residue; and (2) a benchmark forjudging a safe level of
exposure. Due to differences in the various inputs to exposure assessments, OPP will vary the
population percentile of exposure used to estimate exposure so as to ensure that in attempting to
protect the general population and significant subpopulations (i.e., by not underestimating the
exposure to such groups) OPP does not unreasonably overprotect these groups (i.e. by
unreasonably overestimating exposure). For exposure estimates using DRES-type high-end
assumptions, risk assessors should generally use the 95th percentile of exposure as a reasonable
high-end exposure. For probabilistic assessments using more realistic inputs, assessors should
generally use in the first instance the 99.9th percentile of exposure. The aPAD will be used as
the benchmark of safety. The rest of this section discusses more fully OPP's rationale for
choosing the 99.9th percentile and the concerns that have been expressed about that choice.
28

-------
In adopting this policy, OPP recognizes that the choice of the population percentile for
formulating a threshold of concern involves a balancing of a number of factors. OPP considered a
variety of factors in formulating the policy and this policy recommends that these factors be
considered in decisions regarding the population percentile in individual risk assessment. The first
consideration is the size of the exposed population and the proportion which might receive daily
doses above the benchmark of safety, the aPAD. A second consideration is the level of
confidence OPP has in its exposure estimates, and the extent to which such estimates may
overstate (or understate) potential exposure because they incorporate conservative assumptions
or rely on atypical and unrealistic data. A third consideration is the degree to which individual
exposures would be estimated to exceed the aPAD to the extent this can be understood. A final
consideration is the degree of public health protection incorporated into the determination of the
aRfD and the aPAD.
Because of the need to balance a variety of factors in selection of a population percentile
for calculating a threshold of concern, OPP is issuing its views regarding population percentiles as
non-binding policy guidance rather than as a binding rule. Complex risk assessment and risk
management issues such as those involved in this policy seldom can be reduced to meaningful
rule-style commands. Rather, the scientist and risk manager need to have flexibility in considering
a variety of factors and outcomes. This policy is intended to focus the analysis on factors deemed
most critical without barring consideration of other factors which may be found to be relevant.
As a policy, this guidance does not - in fact, as a legal matter, cannot - draw bright lines or
preclude reconsideration of basic principles. EPA would retain the option to depart from the
policy. Further, affected parties remain free to challenge the specific application of the policy or
the underpinnings of the policy itself.
In initially determining where to establish the threshold of regulatory concern, OPP
considered a number of issues and past practices, including the 1992 Guidelines for Exposure
Assessment (U.S. EPA 1992b). These Guidelines established a broad framework for Agency
exposure assessments by describing the general concepts of exposure assessment and by providing
guidance on the planning and conduct of an exposure assessment, including the characterization
of uncertainty. Specifically regarding the use of high percentile values, the Agency in these
exposure assessment guidelines has stated the following:
Although the Agency has not specifically set policy on this matter, exposure assessors should observe the
following caution when using simulated distributions. The actual percentile cutoff above which a
simulation should be considered a bounding estimate may be expected to vary depending upon the size
of the population. Since bounding estimates are established to develop statements that exposures, doses,
and risks are "not greater than...", it is prudent that the percentile cutoff bound expected exposures for
the population being evaluated. For example, if there are 100 persons in the population, it may be
prudent to consider simulated exposures above the 1 in 500 level [sic] or 1 in 1000 level (i.e., above the
99.5th or 99.9th percentile, respectively) to be bounding estimates. Due to uncertainties in simulated
distributions, assessors should be cautious about using estimates above the 99.9th percentile for estimates
of high-end exposures, regardless of the size of the population. The Agency or individual program offices
may issue more direct policy for setting the exact cutoff value for use as high-end and bounding estimates
in simulations.
29

-------
Taking the Agency guidance into account, and giving significant weight to the size of the
exposed population, OPP uses as a threshold of regulatory concern the 99.9th percentile of
estimated daily acute exposure from food only, using probabilistic exposure estimation
techniques, should be equal to or less than the acute Population Adjusted Dose (aPAD). Under
this policy, when the 99.9th percentile of estimated exposure is equal to or less than the aPAD,
the estimated exposure of the vast majority of people would not exceed safe levels. Only those
individuals whose exposure is estimated to be in the very high end of the exposure distribution
might theoretically receive amounts of pesticide in their food that even approached the level
where concern would exist. Based on OPP's experience reviewing Monte Carlo acute dietary
exposure estimates, it appears that those with significantly lower exposures (i.e., at lower
percentiles of estimated exposure) would be consuming levels of pesticide in their food potentially
several orders of magnitude below the aPAD.
Some people, however, have argued that OPP's policy is not sufficiently protective, even
at the 99.9th percentile of exposure. Because the group eating food containing pesticide residues
is very close, if not equal, to the entire population of the country, currently about 270 million
people, they argue that if the 99.9th percentile of exposure is equal to the aPAD, very large
numbers of people, including many children, could be exposed at levels which exceed the aPAD
each day. Thus, they argue for additional protections to be provided in the risk assessment or risk
management process.
While OPP recognizes that, under this policy, a large number of people - including infants
and children - would theoretically be exposed at levels potentially exceeding the aPAD, the
Agency believes, for several reasons, that allowing this level of estimated exposure would not
raise public health concerns. First, OPP believes that actual exposure is unlikely to be greater
than that estimated, and in most cases would actually be somewhat lower than the estimates based
on data currently available. As discussed in this and other papers, OPP uses a tiered approach and
the best available data to develop estimates of exposure to pesticide residues in food. Monte
Carlo techniques are used in OPP's highest, most refined tiers - the tiers which are designed to
provide the most realistic estimates of exposure. Nonetheless, at the present time, because of data
limitations, even the most refined estimates of exposure may still include residue values that are
higher than people actually consume for one or more commodities. For some pesticide-
commodity pairs, for example, OPP may not have residue monitoring data, such as USDA's PDP
data. In such cases, therefore, the estimated exposure at the 99.9th percentile (or any other
percentile) may overstate potential exposure, and some portion of the most highly exposed 0.1%
of the population would, in actuality, be exposed at levels less than the aPAD. Nevertheless, OPP
does not believe that this significantly overstates exposure and, in any case, the sensitivity
analysis of Critical Exposure Contribution module will identify the extent to which high residue
values for specific commodities account for the upper-end of exposure. In those instances where
data limitations result in an exposure estimate which is highly conservative, it may be necessary
to generate better data for the specific commodity of concern. Thus, just as when OPP uses the
95th percentile with non-probabilistic exposure assessments OPP is not suggesting that OPP is
leaving 5 percent of the population unprotected, OPP is not by choosing the 99.9th percentile for
30

-------
probabilistic exposure assessments concluding that only 99.9 percent of the population deserves
protection. Rather, it is OPP's view that, with probabilistic assessments, the use of the 99.9th
percentile generally produces a reasonable high-end exposure such that if that exposure does not
exceed the safe level, OPP can conclude there is a reasonable certainty of no harm to the general
population and all significant population groups.
Second, exposure at a level above the aPAD would pose public health concerns only to
the extent that such exposures might result in harm. Certainly it would be difficult to justify
allowing individuals to get doses of a pesticide at such levels, if OPP expected all such exposures
to result in harm. From information about the general shape of the distribution curve of dietary
exposures, OPP expects the vast majority of individuals estimated to be exposed to residues over
the aRfD or aPAD would be exposed only to levels slightly greater than the aRfD or aPAD. OPP
believes that its risk estimation methods incorporate sufficiently conservative (health protective)
approaches, so that the overall approach provides sufficient protection for the small percentage of
people (those above the 99.9th percentile) who may be exposed at levels slightly above the aPAD.
OPP sets the aRfD well below (usually 100 to 1000 times lower than) the appropriately-chosen
"No Observed Adverse Effect Level" (NOAEL) in the most relevant laboratory animal toxicity
study; the aPAD may be the same as the aRfD or even lower depending upon the FQPA Safety
Factor decision. Due to conservative modeling assumptions, it is possible that no actual person is
exposed to levels predicted by the model. This is why OPP has included sensitivity analysis,
commodity contribution analysis, and other techniques to critically examine the results of the
analysis prior to making regulatory decisions.
Third, given the size of the exposed population, the occurrence of an "exceedance" would
(at the 99.9th percentile level) be very infrequent for the typical individual. For example, at the
99.9th percentile, the time between exceedances, on average, would be once every 2 to 3 years.
Depending on an individual's diet, an exceedence may occur more or less frequently.
Collectively, this information gives the Agency confidence that its approach to protecting people
from risks associated with single-day exposure to pesticides in their diet is adequately protective.
Some have asserted that OPP's policy is overly protective. One concern is that OPP's
exposure methodology significantly overestimates actual exposure to the extent that the
underlying data bases include outliers or unrepresentative (and unrealistically high) field trial
residue data. A closely allied concern is that exposure estimates overstate exposure when the
methodology uses unrealistic, conservative assumptions. These concerns are addressed in section
II of this paper and, as discussed there, efforts are made to use only reliable, realistic data.
Because of the careful quality control measures taken by USDA and FDA in the generation of
food consumption and residue monitoring data, OPP typically accepts those data sets compiled by
the respective agencies as being reliable and realistic. OPP conducts its own review of residue
data from field trials and adjusts these data to better reflect actual residues on food. As discussed
in section II.F., OPP believes that it does not use overly conservative assumptions. In sum, OPP
does not believe that the data bases used, and the ways in which they are used to develop
probabilistic exposure estimates, will produce significant overestimates of exposure at the 99.9th
31

-------
percentile.
Another concern is that the data bases available for use in probabilistic exposure estimates
yield estimates at the 99.9th percentile that are unacceptably uncertain, and, because of the
uncertainty, OPP should use a lower percentile (e.g. the 99.5th, 99th, or 95th percentile) of
exposure in its expression of the threshold of concern. While OPP agrees that the estimates of the
99.9th percentile of exposure have some uncertainty due to the use of high-end consumption
and/or residue data, it does not know whether the probabilistic assessments understate or
overstate actual exposure at the 99.9th percentile. Nor does OPP expect that one high-end
residue or consumption value will "drive" high-end exposure estimates. In order to evaluate the
possible impact of high-end values, OPP will perform sensitivity analyses at the threshold of
concern to determine what data account for the largest part of the estimated exposure.
Recognizing that there was considerable concern about both the scientific and regulatory
judgments underpinning the policy, OPP has performed further analysis of the methodology to
provide both the Agency's staff and the public with a better understanding of the most critical
elements of the methodology. Many of the scientific concerns resulted from questions about how
this relatively new approach to assessing acute dietary exposure would be performed and what
aspects of the methodology had the greatest impact on the outcome. In fact, reliance on
probabilistic exposure modeling techniques in regulatory decision making has very few
precedents. These analyses were conducted to evaluate a variety of statistical attributes of the
distributions produced using the Monte Carlo technique. The activities were designed to provide
a better understanding of the most critical elements of the methodology, several of which
addressed the issue of the high consumption individual and his effect on the tails of the
distribution. Briefly, OPP concluded as a result of these analyses:
1. For even a reasonably large dataset as would be expected for a major agrichemical
with extensive nation-wide use, approximately 1,000 iterations of the DEEM
software are adequate to produce reasonably stable exposure estimates at the
99.9th percentile (generally varying less than 1-3%);
2. Given an adequate number of iterations, exposure estimates from the DEEM
software are reasonably reproducible (i.e., any randomly selected run is unlikely to
be more than 2% from the "true value" where the "true value" represents the
DEEM exposure estimate which would result if an infinite number of iterations
were performed.); and
3. Extreme consumption events are not pervasive in the USDA's CSFII survey and
are unlikely to have a significant effect in controlling the exposure estimate at the
99.9th percentile (i.e., acting as primary "risk drivers"). OPP will, in any case, use
the capabilities of the DEEM software to identify any extreme eating occasions
which might occur and fully characterize any exposure estimates which appear to
be driven by high or unusual reported consumption.
32

-------
If OPP adopted a policy that relied on a percentile of exposure lower than the estimated
99.9th percentile, it would need to justify its decision in public health terms as being consistent
with the FQPA statutory standard. As indicated above in the discussion of whether the estimated
99.9th percentile of exposure is adequately protective, the choice of any percentile less than 100%
assumes that, to the extent that estimates understate or correspond to actual, real world exposure,
some portion of the exposed population could receive an amount of pesticide in excess of the
aPAD. As lower percentiles are considered, the estimated size of the population potentially
exposed to levels greater than the aPAD increases. Furthermore, if a lower percentile of
regulatory concern were selected, a greater proportion of the population would be exposed more
frequently to a one-day intake of pesticide residue that exceeds aPAD by a greater margin. For
example, at the estimated 99th percentile of exposure, on average, individuals would experience
an exceedance roughly once over several months. Moreover, the size of the exposed population
potentially exceeding the aPAD at the 99th or 95th percentiles would be 10 and 50 times larger,
respectively, than the number at the 99.9th percentile.
In OPP's view, the above analysis raises concerns about routinely using a lower percentile
than the 99.9th. If, because of uncertainties associated with using the 99.9th percentile, OPP
decided to use a lower percentile of exposure as its threshold of concern, it would still have some
uncertainty in its assessment of acute dietary risk from pesticide residues in food. At the lower
percentiles, the predicted incidence of these exceedances is quite high, and there is a substantial
possibility that some significant number of people would be receiving doses that are considerably
higher than the aPAD. But OPP would not have much certainty about either the number of
people above the aPAD or, more importantly, how close to the aRfD or NOAEL their exposures
might come. Therefore, if OPP chose a lower percentile as its threshold of concern, it would also
need to consider whether other steps (e.g. use of an additional safety/uncertainty factor) would be
needed to assure that the FQPA safety standard was satisfied.
Nevertheless, the selection of the estimated 99.9th percentile of exposure for use in
calculating the "threshold of regulatory concern" should not be regarded as an immutable "bright
line" from which deviation is not possible, regardless of the nature of ancillary data and
information. OPP retains some discretion to choose a different percentile for regulatory concern if
the conditions or situation warrant. A risk assessment should fully characterize the nature of the
analysis for consideration by the risk manager and his or her selection of an appropriate regulatory
threshold. A number of criteria should be considered including, for example, the exposure
estimate's perceived degree of conservatism considering in particular the identity of the risk
"drivers," the reliability and characteristics of the input data, the size of the affected populations,
the results of a sensitivity analysis, etc. In this manner, the risk manager can evaluate how
supportable the 99.9th percentile exposure estimate is and evaluate whether or not it is appropriate
to deviate (up or down) from the 99.9th percentile. Specifically, a full and adequate
characterization of the risk estimates might include a review of the following (in approximate
order of relative importance):
33

-------
whether a "high-end" consumption value actually acts as a "driver" in the
risk assessment, (in many cases, high-end consumption values may not be
actual "drivers" (I.e., significant contributors) in the risk assessment and
thus may not be the primary reason behind high estimated exposures at
the tails of the distribution)
how extreme the upper tails of the consumption curve are. (for example, is
the 95th percentile consumption value greater than four times the mean
consumption?; is the 99thpercentile value greater than six times the mean
consumption?)
how far the presumed high-end consumption value is from where it would
be expected to be given the pattern (or distribution) of reported
consumption values in the lower percentiles, (e.g., if a distribution can be
reasonably establishedfor the reported consumption values in the lower
percentiles (e.g., 70th through 95thpercentiles), how extreme would the
suspected outlier be in an appropriate Q-Q or other statistical plot)
the size of the affected subpopulation and how likely exposure estimates
for the subpopulation would be subject to undue effects of outliers, (a
high-end value would be expected to have more influence on the upper-
end exposure estimates in a small subpopulation than it would in a large
subpopulation)
from a dietary standpoint, how likely the high-end value is to be a valid
reported consumption event, (for example, although they may be equally
extreme from a probabilistic standpoint, consumption of three gingko
fruits in a day might be considered more reasonable than consumption of
10 apples)
the nature of the inputs both in the overall assessment and (particularly) for
the drivers, (this would include, for example, whether input residue data
includedfield trials us. PDP data us. market basket survey data; the use of
default us. actual processing factors; the extent to which single-serving
values are measured us. established by "decomposing"7, the nature and
degree of percent crop treated data, etc.)
comparison of exposure and consumption estimates using the 1989-91 data
7 "Decompositing" is a mathematical deconvolution procedure used by OPP to produce
estimates of pesticide residue levels in single items of produce based on the distribution of
residues measured in composite samples where the residues measured in the composite samples
represent average residues in a group of generally ten or more items.
34

-------
and 1994-96 data, (if both the 1989-91 and 1994-96 CSFII data sets
produce similar estimates of exposure and contain similar extremes of
consumption, it is more likely that high-end reported consumption is
indeed an actual value or at least no affecting the exposure assessment in
any significant way)
In sum, OPP believes that the risk assessor should adequately characterize the nature of
the assessment (including any biases and uncertainties) and perform a sensitivity analysis, where
appropriate, such that the reasonableness of the upper-end percentile estimates (including the
99.9th) can be properly gauged. Any risk assessment performed by OPP should characterize the
effect of any high-end points (on the consumption) on the regulatory percentiles of possible
regulatory interest. Likewise, it is important for the risk manager, in turn, to consider the entire
set of data and information available in deciding if the 99.9th percentile is an appropriate
demarcation point for use in regulation. In particular, any risk management decisions should
consider the effect of any high-end data values (consumption or residue) or other relevant factors
and, when appropriate, be flexible with respect to the regulatory threshold selected. Nevertheless,
based on the several dozen risk assessments and sensitivity analyses we have performed to date
using probabilistic techniques, we do not expect this review to warrant a departure from the
99.9th percentile in the vast majority of cases.
IV.	Policy Not Rules
The policy document discussed in this notice is intended to provide guidance to EPA
personnel and decision-makers, and to the public. As a guidance document and not a rule, the
policy in this guidance is not binding on either EPA or any outside parties. Although this guidance
provides a starting point for EPA risk assessments, EPA will depart from its policy where the
facts or circumstances warrant. In such cases, EPA will explain why a different course was taken.
Similarly, outside parties remain free to assert that a policy is not appropriate for a specific
pesticide or that the circumstances surrounding a specific risk assessment demonstrate that a
policy should be abandoned.
V.	Bibliography
U.S. EPA 1992a. Memorandum from H. Habicht to Assistant Administrators. "Guidance on
Risk Characterization for Risk Managers and Risk Assessors." February 26, 1992.
U.S. EPA 1992b. Federal Register Notice. "Final Guidelines for Exposure Assessment."
57 FR 22888. May 29, 1992.
U.S. EPA 1997a. "Guiding Principles for Monte Carlo Analysis," Appendix A: Policy for Use
of Probabilistic Analysis in Risk Assessment at the U.S. Environmental Protection Agency. Office
of Research and Development. March 1997. EPA/63 0/R-97/001.
35

-------
U.S. EPA 1997b. Memorandum from W. Wood to D. Patton. "Proposed Policy for Use of
Monte Carlo Analysis in Agency Risk Assessments." January 29, 1997.
U.S. EPA 1998a. Federal Register Notice. "Guidance for Submission of Probabilistic Human
Health Exposure Assessments to the Office of Pesticide Programs." Draft. 63 FR 59780
November 4, 1998.
U.S. EPA 1998b. Federal Register Notice. "Assigning Values to Nondetected/Nonquantified
Pesticide Residues in Human Health Dietary Exposure Assessments." Draft. 63 FR 67063-67066.
November 4, 1998.
U.S. EPA 1998c. Federal Register Notice. "A Statistical Method for Incorporating Nondetected
Pesticide Residues into Human Health Dietary Exposure Assessments." Draft. 63 FR 67063-
67066. December 4, 1998.
U.S. EPA 1999a. Federal Register Notice. "Choosing a Percentile of Acute Dietary Exposure as
a Threshold of Regulatory Concern." Draft. 57 FR 16962. April 7, 1999.
U.S. EPA 1999b. Federal Register Notice. "Guidance of the Conduct of Bridging Studies for
Use in Acute Dietary Probabilistic Risk Assessment" (dated 7/29/99) and "Guidance for the
Conduct of Residue Decline Studies for Use in Probabilistic Acute Dietary Probabilistic Risk
Assessment" (dated 7/29/99). Draft. 64 FR 42372. August 4, 1999.
U.S. EPA. 1999c. Federal Register Notice. "Data for Refining Anticipated Residue Estimates
Used in Dietary Risk Assessment for Organophosphate Pesticides" (dated 3/26/99). Draft. 64
FR 16967. April 7, 1999.
36

-------
APPENDIX
Primer on Interpretation of Exposure Distribution Curves
Traditionally, OPP has selected a regulatory threshold of concern (e.g., an individual cancer risk of
no greater than a range of 1-in-a-million or a %aRfD of no more than 100%) which could not be exceeded.
The risk threshold was derived by calculating a high-end (or bounding) point estimate of exposure using
certain high-end exposure assumptions and combining it with the toxicological endpoint to determine whether
a hypothetical "high-end" individual exceeded the regulatory threshold of concern. If so, exposures were
deemed to be unacceptable and mitigation actions were generally sought. However, it was not known whether
OPP's "high-end" exposure estimate represented the 95th, 99th, 99.9th, or 99.999th percentile individual or
if the high-end exposure estimates were well beyond the exposures received by even the maximally exposed
individual (i.e., if high-end exposure estimates were above the 100th percentile).
With the advent of Monte Carlo analysis, OPP is no longer limited to assessing exposure and risks
to the population using methodologies which produce only a single high-end point estimate. Monte Carlo
analyses permit the risk assessor to not only produce more accurate estimates of exposure but to produce
estimates of exposure across the entire population which incorporate the probabilities of being subjected to
these exposures. This distribution of exposures can be represented graphically as a probability density
function similar to the classic bell curve8. An example of one of these curves is illustrated on the following
page:
8 As noted in a side bar on p. 12 of the main document, pesticide exposures (or concentrations) do
not typically follow the classic "bell shaped" curve, but instead have a large "hump" to the left and a long tail
to the right. This is closer to what is termed a log-normal distribution of exposures. The use of a bell-curve
in this primer permits ready recalculation of the sample tabled values from first principles of classic statistics.
If this example were to have used a more realistic log-normal distribution of exposures, then calculation of
the appropriate values would have been considerably more complex and would have likely led to considerably
more confusion among even those familiar with basic statistics. For the purposes of this primer, OPP
believes it makes sense to rely on basic (normal distribution) statistics so as to be accessible to the widest
audience. Although use of a log-normal distribution and all its attendant mathematical complexities would
have been more realistic, OPP believes that this would have severely hampered understanding and would have
contributed minimally to comprehension of the necessary principles at the basic level.
37

-------
50th Percentile Exposure
= 50 |xg/kg bw/day
99.9th Percentile Exposure
= 80.9 |xg/kg bw/day
84.1th Percentile Exposure
= 60 |xg/kg bw/day
99th Percentile Exposure
= 73.3 |xg/kg bw/day
50
60
73.3 80.9
Exposure (^ig/kg bw/day)
The total area under the curve represents the entire population of interest. As one moves
progressively from the extreme left side of the distribution (from an exposure of zero) to the right, an ever
higher proportion of the population falls under the curve. As can be seen in the hypothetical example, 50% of
the population is exposed to levels of 50 »ig/kg bw/day or less, 84.1% of the population is exposed to levels
of 60 ,ig/kg/day or less, 99% of the population is exposed to levels of 73.3 »ig/kg bw/day or less, and 99.9%
of the population is exposed to levels of 80.9 ,: «g/kg bw/day or less.
In the above bell curve example, the exposures across the population range from a low of about 10
Mg/kg bw/day to a high of about 90 »«g/kg bw/day, with a mean (or average) exposure of 50 ,: «g/kg bw/day.
Also, exposures at the 50th, 84.1th, 99th, and 99.9th percentiles are 50-, 60-, 73.3-, and 80.9- ,: «g/kg bw/day,
respectively. Each of these exposure values (in ,: «g/kg bw/day) can be converted to a %aPAD (to be
compared to the threshold of concern) which is calculated by dividing each exposure value by aPAD. These
38

-------
%aPADs are calculated and shown in the table below:
Hypothetical Calculation of %aPAD
at Various Percentile Thresholds of Concern
Percentile
Threshold of
Regulatory
Concern
Estimated Exposure at
Specified Percentile
Threshold of Concern3
(/zg/kg bw/day)
aPAD"
(/ig/kg bw/day)
%aPADc
99.9
80.9
75
108%
99
73.3
98%
84.1
60
80%
50
50
67%
a Obtained as output from DEEM software program
b This is calculated by dividing the NOAEL observed in animal studies (in /ig/kg bw/day) by the
appropriate uncertainty factors, and a decision with regard to the FPQA Safety Factor. In this
hypothetical case, the NOAEL is 7500 /ig/kg bw/day , the uncertainty factor is 100, and the
FQPA Safety Factor has been removed.
The %aPAD is calculated by dividing the estimated exposure at any given percentile by the
aPAD. In general, a %aPAD of 100% or less is not considered to be of concern.
From the above table it is apparent that the %aPAD corresponding to the 99.9th percentile of
exposure (at 108%) exceeds the threshold of concern. It is also apparent that "acceptable" exposures would
occur at the 99th percentile. Thus, exposures would be deemed excessive if the 99.9th percentile is
considered to be the threshold of concern and either risk mitigation or refinement of exposure estimates would
be considered.
39

-------