-------
                                     13
been shown to exhibit similar relationships to a specific contaminant over
several geographical locations or have had significant relationships in one
area.  Further, this section discusses, by pollutant, the known logical
cause-effect relationships.
     Photochemical Oxidants (Ozone)
     Meteorological effects upon the formation and accumulation of oxidants
are well documented, primarily because this particular pollutant has been
the subject of intense investigation.  Since photochemical reactions upon
primary pollutants (NOV and HC) occur in the presence of ultraviolet
                      /\
radiation (UV), oxidant concentrations and available solar radiation
have been found to be positively correlated [3,16,23],  Despite the good re-
lationship, intense solar radiation must be coupled with other meteorological
conditions to cause high oxidant conditions.  For example, in the Los Angeles
area, a hot, desert wind condition known as "Santa Ana" winds, is associated
with abundant solar radiation, but the overall meteorological condition
carries the pollutants out to sea.
     A related parameter, surface temperature, is a function of solar radia-
tion and air-mass characteristics.  Accordingly, temperature also correlates
positively with oxidants [1,4,16,23,25].  Generally, as the surface temperature
increases, oxidant levels increase.  However,  there is usually some limiting
temperature above which conditions become atmospherically unstable, thus
allowing pollutants to escape vertically.
-------
                                   .14
      Low wind speeds correlate with elevated oxidant levels [3,4,16,23]
 because stagnant early-morning conditions represent a greater potential for
 accumulating primary pollutants.  While this condition is essential for resul-
 ting in high oxidant concentrations within major source areas, it does
 not necessarily hold true for outlying receptor areas.  In fact, in
 analyzing data in a distant receptor location, strong, persistent winds blow-
 ing from the source area may be a prerequisite for resultant high oxidant
 concentrations.  As illustrated in Figure 2.1, data from San Bernardino
 (located approximately 60 miles east of Los Angeles) exhibit a strong rela-
 tionship between increased wind velocity and increased oxidant levels.
      Limited mixing conditions as a result of low inversion base heights
 have been shown to be of considerable  importance in causing elevated  oxidant
 levels  [1,25,44].  (For a discussion of "inversion" and "mixing height," see
 Section 2.1.3).   However, surface-based inversions tend to break under sur-
 face heating more  readily thaji persistent,  elevated inversions.    Once the
 inversion is broken, a much greater potential for mixing is achieved.   It is
 therefore advisable  to treat surface inversion conditions as a separate condition
 when relating mixing height to oxidant concentrations.
     Temperature and pressure  conditions  aloft (i.e., 850-mbi 500-mb levels)
have been shown to correlate with oxidant levels [5,12,14,23,24,25,26,44].
Conditions that relate to a synoptic weather pattern associated with a warm
dome of high pressure aloft, usually result in weak ground-level  circulation
under a  very stable atmosphere.   In this  regard, warm temperatures aloft
(associated with much-above-average contour heights) at the  various  atmos-
pheric  levels  are conducive to elevated oxidant levels.
-------
               15
                                                                          O) -i
                                                                          OeC
                                                                          CD +J
                                                                          s- to
                                                                          S- (0
                                                                          3 O
                                                                          O O
                                                                          O
                                                                          4- 3
                                                                          O O
                                                                             CO
                                                                          (!)
                                                                          E CU
                                                                          tO O
                                                                          +-> CO
                                                                          i- O)
                                                                          O »->
                                                                          O S-
                                                                          r 3
                                                                          O) O
                                                                          > CJ
                                                                          CO
                                                                          >   .
                                                                             CO
                                                                          4J O)
                                                                          C CO
                                                                          « CO
                                                                          X O
 c:
 O
 j
                                                                          C  S- ^-?
                                                                          S-    !-
                                                                          CU  tO r
                                                                          £Q 
                                                                             O  3
                                                                             a cr
                                                                          co
                                                                          CU
                                                                          S-
                                            CT>
 (uiijdd)
INVQIXO
-------
                                   16
    Carbon Monoxide
    Unlike photochemical oxidants which require examination of meteorological
conditions conducive to primary pollutant accumulation, photochemical reactions,
and transport, carbon monoxide build-up generally occurs only under extremely
stable, surface inversion conditions.  Inversion base height and wind speed
have been shown to correlate with CO concentration levels [35].  These
parameters may help define a type of overall weather pattern resulting in in-
creased CO concentrations.  Such a pattern is usually typified by clear,
windless nights causing significant radiational cooling.  Thus, by examining
temperature, wind speed, and inversion depth, it is possible that the type
of condition conducive to elevated CO concentrations can be determined.
    Sulfur Dioxide
    Because S02 emissions are almost exclusively point-source oriented, the
governing relationships between meteorological parameters and S02 concen-
tration will vary according to the specific features of the source(s),
such as stack height.  Some success relating S02 concentrations to wind
speed, temperature, and/or mixing height has been obtained.[19,40,41,43] ;
however, some difficulties can arise due to extreme variabilities in SOg
emissions (see discussion in Section 2.2.1).
     Total Suspended Particulates (TSP)
     Basically, the type of meteorological condition conducive to increased
TSP levels is the same stagnant classification necessary for the accumula-
tion of primary pollutants, namely low wind speeds [16,19].  However, the de-
pendency upon wind velocity is a function of distance from major source
regions.  As in the case of oxidant transport, TSP levels in non-urban  areas
may increase as a function of the persistence of source-to-receotor v.'inds.
-------
                               17
As a further complication, any dry, windy area may experience the greatest
participate problem under extremely strong wind conditions when natural
particles are carried aloft (i.e., duststorm conditions).
     For urban areas, stable, anticyclonic conditions are prerequisites for
high TSP levels.   Some success in correlating to TSP has  been shown  with
temperature [16,33], pressure [15] and 850-mb temperatures [11].
 2.1.2   Availability of Meteorological  Data
     Perhaps  the single most  limiting  factor in  performing successful air
 quality  data adjustment  is the  availability of  meteorological data.  Certain
 areas  of the  country, primarily  the larger  urban centers, are  fortunate to
 have extensive sources of meteorological  data.   Other areas more removed from
 the urban complexes  are  subject to limited  meteorological data.
     The following list will  serve as  a  guide to potential sources  of meteo-
 logical  data:
 Upper  Air Data
     National Climatic-Center (catalogue  listing #TDF 56)
     Research Libraries  (e.g., Daily Weather Map Series)
 Surface Data
     National Climatic Center (catalogue  listing #TD1440)
     Pollution Control Agencies
     Flood Control Agencies
     U.S. Forest Service  stations
     Schools/Universities
-------
                                   .18
 2.1.3  Representativeness  of Meteorological Data
      The success  of any methodology  for  producing weather-adjusted air
 quality trends  depends upon  the  representativeness of available meteorolo-
 gical  data.   "Representativeness", in  this context, implies those meteorolo-
 gical  data which  can adequately  define meteorological conditions at the air
 monitoring location.  Ideally, both meteorological and air quality data,
 measured at the same location, provide the best combination of conditions.
 Usually, however, such data  are  not measured at common locations (or at
 least  a  variety of meteorological parameters are not measured at the same
 location).  It  then  becomes  a problem of locating sources of meteorological
 data   and the subsequent selection of appropriate parameters.
     This section will describe  some of the conditions necessary to estimate
 the representativeness of meteorological data from sources which are not co-
 located  with available air quality data.  Although the treatment of data
 "representativeness" is of proportions to warrant a separate study (cur-
 rently in progress by the EPA), an overview is presented here in order to
 highlight pertinent  considerations supporting meteorological adjustment tech-
 niques.  Meteorological  parameters will be considered with respect to three
conditions:  (V) horizontal distance, (2) topographical  effects, and
 (3) weather-type effects.
1.  Surface temperature
     In general, surface  temperature  is an  excellent  meteorological  parameter
to utilize, partly due to  the availability of data  and  partly  due  to  known
correlations  with specific contaminants.   In  flat  terrain, isolated  from
land/sea breeze effects,  surface  temperature  can be representative for large
distances (>  100 km).  In  irregular terrain,  caution  must be taken to  ensure
-------
                                    19
the representativeness of the temperature data; however, it is possible that
data from an elevated location may serve as an indicator of warming  or
cooling aloft, which in turn can affect the stability of the  air mass.
     Locations near large bodies of water are not usually representative  of
locations 10-25 km inland when local sea breezes  prevail.   It is recommended
that sources of temperature data be used which represent similar micro-
climatology.
2.  Surface winds
     Perhaps the most consistently usable meteorological parameter is surface
winds.  Relationships between low wind speeds and pollutant concentration in-
creases are well documented.  Surface winds can also be used to define
source-receptor transport patterns.  Consequently, wind data from distant
locations can be useful, if pollutant transport is a primary mechanism for
affecting a particular monitoring site.
     Extreme variability in wind conditions can occur in areas with elevated
terrain and.to a lesser extent, in micro-meteorological regimes  (i.e., land/sea
breezes).   It is not advisable, therefore,  to use a  single source Of  wind
information.  Rather, an area  composite  of  available stations should  be used.
3.   Upper air data
     The major advantage of using  upper  atmospheric  temperature/humidity/and
wind data  is  the  tendency  toward  homogeneity  of  conditions at or above 850 mb
 ("1500 m).   It is  sufficient to say that such conditions  are probably repre-
 sentative of areas within  a 100-km radius.  Because  the horizontal  variations
 in meteorological  conditions tend to decrease with  altitude, such parameters
 may be most useful in mountainous  terrain,  where surface  measurements may
 reflect extremely localized conditions.
-------
                                   20
4.   Inversion and mixing height data
     Unlike standard atmospheric conditions, where temperature decreases
with height, a temperature inversion is defined as a layer of air in which
temperature increases with height.  Thus the lifting of an air parcel from
the surface, undergoing adiabatic cooling, is inhibited from rising above
the inversion due to the changes in buoyant properties between the parcel
and its surrounding environment.  This is especially important in air
pollution considerations because of the vertical dilution limitations
caused by the inversion.
     Another term reflecting limitations to vertical dilution is "mixing
height."   In,this report,  "mixing height" is defined as that level at which
an  air parcel with  a given surface temperature is lifted and cooled at the
adiabatic  lapse  rate of 9.8°C  km"  until it achieves the same temperature
as  the surrounding  ambient air with a more stable lapse rate.  The exis-
tence of a definite mixing height does not imply the existence of an inver-
sion; however, the  existence of a low-level inversion does identify limited
mixing.
     When inversion/mixing height data are available, definite implications to
pollutant concentrations can be made.  While vertical air-mass properties may
be representative within 100 km, the effects of mixing are governed by surface
heating (i.e., as the surface  temperature increases, the level  at which equi-
librium occurs between the lifted air parcel and the ambient air also increases)
and general atmospheric stability.  As a result, the cautions mentioned for the
widespread geographical use of temperature parameters also apply for inversion/
mixing height data.
-------
                                   21
 5.  Weather types
      No  one single meteorological  parameter can adequately describe the pre-
 vailing  weather conditions  at  the  time of a particular air quality measurement.
,To  get a true  picture  of the cause-effect phenomena of meteorology upon air
 quality, several  independent or  interrelated  variables should be gathered
 to  obtain the  most representative  meteorological  assessment at  a given air
 quality monitoring location.   While  individual parameters may not be represen-
 tative due to  terrain  or localized influences, their use with other parameters
 may be significant.  For example,  a  certain mixing height may not indicate a
 definite relationship  to a  pollutant concentration; however, the mixing
 height coupled with  temperature, relative humidity, and wind speed may  per-
 fectly describe a weather condition  which results in a consistent pollutant
 distribution and concentration.   It  suffices  to  say that combinations of
 meteorological variables tend  to become  more  representative of  dominating
 weather  types than  the individual  parameters, provided that logical meteoro-
 logical  processes are  considered.
     No  specific list describing the representativeness  of available meteor-
ological data can be compiled for every possible location  across the country.
For every rule given, there will  always be an  exception.   The  general  "common
sense" guidelines presented here are intended to stimulate  the  reader toward
a goal of thoughtful, rather than haphazard, selection  of available
meteorological data.   An empirical  method is presented in. Section 4.0  to
assist in the selection process.
-------
                                   22
 2.2  VARIATIONS IN AIR QUALITY DATA DUE TO METEOROLOGY
      The-relationship of specific pollutants  to various meteorological
 parameters, cited in Table 2.1, suggest that  variations in  pollutant
 concentrations, especially on a daily basis,  are primarily  due to
 meteorology.  Except in the case of specific  point  sources, where
 daily changes in emissions can affect air quality in a substantial man-
 ner,  the general uniformity in daily emissions over  most  urban areas dictates
 that  short-term changes of measured concentrations are  caused  by  meteoro-
 logical fluctuations.  To quantify the amount of variation  attributed  to
meteorology  is a difficult and complex problem.  Certainly if techniques  using
meteorological parameters  could explain all of the  daily pollutant variance,
then  one would have achieved  a perfect forecast mechanism.  Nothing approaching
this  level of achievement  has  been  accomplished to  date.  This does not imply
that such near-perfect relationships  do not exist;  but rather the  implication
is that the explicit combination of meteorological  parameters necessary to
determine the variation is too complex to identify and measure exactly.
     The longer the period of  analysis,  the greater the potential  for pollutant
 variances  to be  complicated by both meteorological  and emission factors.   For
 example,  in  one  study  [39], the statistical variance for year-to-year nation-
 wide N02  values  during 1970-1975 was  found to  be  ±20% for the annual Inhour
 maximum,  ±13% for  the  99th percentile of hourly concentrations, and ±11% for
 the annual mean.   No attempt, however,  was made to  distinguish that part of the
 total variance due to  meteorology,  illustrating the complexity of the problem.
      While the quantitative  assessment of meteorological  variations  affecting
 a variety of pollutants can  perhaps  best be  dealt  with in  a  lengthy  investi-
 gation of historical variations of specific  meteorological parameters (beyond
-------
                                     23
 the  scope  of  this  report),  the following subsections will explain some of
 the  known  meteorological  factors affecting air pollutant concentrations as
 well as  the subsequent  effects inherent in the type of air quality para-
 meter(s) chosen for analysis.
 2.2.1   Variation in Meteorological  Effect-Among Pollutants
      The  effect of  meteorology on pollutant concentrations may differ
 from  one  pollutant  to another.  For example, higher wind speeds tend to
 result  in lower concentrations for  primary pollutants such as CO.  But this
 is not  necessarily  true  for  oxidants or S02 which may be affected to a greater
 degree  by large point sources and transport phenomena.  In addition, seasonal
 effects can account for  significant variability of pollution concentration.
For example, in the winter, S02  concentrations  tend to increase,  but oxidant
concentrations  decrease.
      Table  2.2 presents  sample seasonal patterns of three primary pollu-
 tants resulting from fuel  combustion, S02, CO, and NO, and that of the
 secondary pollutant, oxidant.  In Table 2.3 sample diurnal .patterns of the
 same  pollutants as  those in  Table  2.2 are presented.  From these two tables,
 one can deduce a role of meteorology in pollutant concentration levels.
      The  three fuel combustion pollutants, S02, CO, and NO, exhibit higher
 concentrations in winter and lower  concentrations  in summer.  This is a
 function  not only of meteorological conditions conducive to pollutant
 build-up, but  also  of emissions.  Colder winter temperatures bring greater
 fuel-burning requirements, which in turn lead to greater seasonal dif-
 ferences  in pollutant concentrations.  On the other hand, the diurnal pattern
 of these fuel combustion pollutants may be explained more by meteorology than
 by emissions.  The  concentrations of the three pollutants generally increase in
-------
                                   24
   Table 2.2   Sample Seasonal  Patterns  of Pollutant Concentrations
                For Urban Areas
Pollutant
so2
CO
NO
°x
Spring
(M.A.M)
Medi urn
Medium
Medi urn
Medi urn
Summer
(J,J,A)
Low
Low
Low
High
Fall
(S,0,N)
Medium
Medium
Medium
Medium
Winter
(D,J,F)
High
High
High
Low
   Table 2.3   Sample Diurnal Patterns of Pollutant Concentrations
Pollutant
so2
CO*
NO
°x
Morning
(6-12)
High
High
High
Medium
Afternoon
(12-18)
Low
Medium
Low
High
Evening
(18-24)
Medium
High
Medi urn
Low
Night
(0-6)
Medi urn
Medi urn
Medi urn
Low
*  CO typically (but not at all  locales)  has  three  peaks:  around 0700;
   around 1700; and, with surprising regularity,  around 2400.
-------
                                  25
the morning and decrease in the afternoon, despite the fact that the emission
levels remain high in that period.  The probable cause of this difference in
morning and afternoon concentrations will be found in meteorology.   Under
stable, limited mixing, early-morning meteorology, concentrations will  in-
crease.  In the afternoon, solar radiation and wind speeds increase the at-
mospheric ventilation, thus resulting in dilution of pollutant concentrations.
     The secondary pollutant, oxidant, is formed by complex photochemical re-
actions of precursor pollutants, hydrocarbons and oxides of nitrogen.   The
oxidant concentration level is higher in summer than in winter (Table  2.2),
and is higher in the afternoon than in the morning (Table 2.3).   The high
oxidant concentrations during the summer season and afternoon period are pri-
marily due to the unfavorable meteorology (high temperature and  strong  inso-
lation) during those periods rather than due to the slightly increased
hydrocarbon emissions (i.e., increase in evaporative emissions).
     Smelters and power plants are often responsible, for much of the S0?
emissions.  Peak S02 concentrations occur when the wind blows from these
point  sources to a recepter site. Therefore, S02 concentrations originating from
point  sources are more sensitive  to wind direction than wind speed.  To fur-
ther complicate matters, breakdowns of modern, efficient pollution control
equipment can result in substantially higher concentrations at a monitoring
location under a'variety of meteorological conditions.  Thus in certain
areas  of the country, such as Los Angeles,, current SCL concentrations may
be more of a function of industrial control-equipment-breakdown status than
of any specific set or sets of meteorological conditions.
-------
                                     26
     Consideration must be given to the elevation of point-source emissions.
Ground-level emissions, such as CO from vehicular traffic, would be affected
by meteorological conditions which caused strong low-level inversions with
very light wind speeds.  Stack emissions could easily penetrate those types
of inversion conditions.  Strong off-the-surface inversions might trap stack
emissions, but would not be conducive to high concentrations of ground-based
emissions.
     The type of pollutant examined will affect the horizontal  difference
and geographical complexity of the meteorology to be considered.  The
concentration of primary pollutants is determined by local emissions
and local meteorological conditions.  Therefore, meteorological ad-
justment of air quality trends for the primary pollutants should be made by
using the local meteorological data collected as near as possible to the air
monitoring station.  On the other hand, the concentration level of secon-
dary pollutants is  determined by regional  emissions  and  meso-scale  or
synoptic scale meteorological conditions.   In the long-range transport of
secondary pollutants, such as oxidant, it may be necessary to examine the
meteorological conditions nearer the source area than the receptor area.  In
some analyses, this may require investigation of meteorological data 100 km
or more from the monitoring location.
2.2.2  Variation in Meteorological Effect Among Air Quality Parameters
     Currently used air quality parameters can be categorized into three types:
mean concentration at various averaging times; extreme concentrations such as
daily maximum, annual maximum or annual second maximum;  and frequency of
-------
                                 27
hourly or daily maximum hourly concentrations exceeding air quality stan-
dards.  Meteorology-may affect these air quality parameters differently.
     The level of a maximum concentration is largely dominated by a particu-
lar meteorological condition, rather than an emission level.   Variation in
annual 1-hour maximum concentrations reflects the variability in annual "worst-
case" meteorology.  On the other hand, the level of an average concentration
is a more stable parameter:  the longer the averaging period, the more stable
the parameter.  Anomalous meteorological years tend to be smoothed out in
annual average pollutant statistics, although significant seasonal anomalies
can seriously affect the results.  Regardless of the parameter selected,  it
is difficult to extract direct emission-related air quality trends without
an examination of the underlying meteorological conditions.
     Similarly, the number of violations of a standard, or the frequency of
concentrations exceeding a given threshold level can be affected in much the
same manner as a maximum concentration or an average concentration, depend-
ing upon the level of the standard or that of the threshold chosen.  When
the threshold level chosen is very high, frequency of concentrations ex-
ceeding the threshold may vary greatly from year to year, similar to the
annual maxfmum concentration.  When the threshold level is low, the fre-
quency of concentrations exceeding the threshold may vary as that of an average
concentration.
     Whichever air quality parameter is used, a specific set of logical
 meteorological parameters must be specified.   It is rather unlikely that an
 identified meteorological parameter used in analyzing an air quality parameter
 (such as annual average concentrations) could  be used to explain another air
 quality parameter.  For example, an abnormally large number of winter surface
-------
                                     28
inversions might explain an increase in annual CO averages, but could not,
of itself, explain an unusually high single CO hourly average.   Specific
meteorological parameters must be identified to explain the individual  case.
-------
                                  29
     3.0  APPROACHES TO METEOROLOGICAL ADJUSTMENT OF AIR QUALITY  TRENDS
     The adjustment of observed air quality for meteorological  variability
to reflect true trends requires three basic elements:
     (1)  Selecting an air quality index:   What annual  measurement of
          air quality do we wish to adjust?  Candidates include (a) values
          corresponding to the standards (e.g., the second-highest eight-
          hour average CO value in the year); (b) averages of values re-
          lated to the standards (e.g., the average of daily maximum
          eight-hour-average CO over the year); (c) the averages  of values
          related to the standards over a portion of the days of  the year
          (e.g. only the high-pollution season);(d)  the number  of days or
          hours of the year in which the standard is exceeded;  and (e) var-
          iations  on the above themes.
     (2)  Relating meteorological  variables to  air quality:   If air
          quality  measurements are to be adjusted for meteorology, the
          effect of available meteorological  variables  on air quality
          must be  expressed at least qualitatively,  but preferably quan-
          titatively.
     (3)  Adjusting measured air quality:   The  relationship  between air
         .quality  and meteorological  variables  must  be  used  to  adjust
          observed values.   Preferably, the adjusted values  should repre-
          sent annual  pollution measurements  which would have occurred if
          meteorology  were  the same each year.
-------
                                   30
       These three components of a meteorological  adjustment procedure are
  closely interrelated; the best choice of one component may depend on the
  choices we make of the other components.  In this section, we discuss some
  choices suggested in the literature, indicate their relationship to one
  another, and outline their strengths and weaknesses.   In Section 4.0, we
  recommend an approach which is an extension of the best ideas in the liter-
  ature.
      The remainder of  this  section is organized by the way  in which  the
 proposed methodology relates meteorology to air quality:
      (a)  Postulation from basic principles;
      (b)  Regression techniques;
      (c)  Classification techniques;  or
      (d)  Meteorologically invariant  statistics.
 3.1  POSTULATION FROM BASIC PRINCIPLES
      Section 2.0 discusses qualitatively typical relationships of indi-
 vidual meteorological variables to the concentration of various pollutants.
 Using these and similar ideas, one might combine meteorological variables
 to develop an "index" indicating the  likelihood of high pollution levels
 in a given day or year.
     For example, an Environmental Protection Agency (EPA) report  [42]  used a
method of meteorological adjustment of oxidant trends in the Los Angeles area
which compared annual ventilation criteria and yearly hydrocarbon emissions to
the annual average of daily one-hour oxidant maxima (a composite of three key
monitoring locations).  Ventilation data represented the number of days per
-------
                                   31
 year with specified limited mixing and low wind speeds (Los Angeles County
 APCD  "Rule 57" days).  A day meets the "Rule 57" criteria if the following
 all occur on that day:
      (1)  Morning inversion < 1500 feet MSL;
      (2)  Afternoon mixing height £ 3500 feet MSL; and
      (3)  6 A.M. - 12 noon average wind speed £ 5 mph.
     One potential difficulty in the annual agglomeration of Rule 57 days
against annual oxidant data is that poor ventilation in the off-season winter
months does not relate well to high concentrations of oxidant.  As a result,
seasonal biases  can occur  by achieving an unusually high number of winter
Rule 57 days or  an unusually low number of summer Rule 57 days.  This problem
is  partly solved by treating the "smog season" alone rather than the full year,
     Horie and Trijonis [20] used a conceptually similar approach to estimate
the effect of meteorology on annual values of oxidant and N02 in Los Angeles.
If a parameter value in a given year was  sufficiently above its long-term
average,  it was assigned a "+1" value;  if sufficiently below, a "-1" value.
A "score" was calculated for each year to indicate which years had an above-
or below-average propensity for high pollutant concentrations.  The results
were used qualitatively to comment on the likely underlying pollution trends.
       Another index was developed by Davidson [15] using the product of
  mixing height and wind speed in, Los Angeles.  This combination reflected
  atmospheric conditions conducive to photochemical pollution.  Thus, the
  lower the index value, the greater the seasonal potential for increased
  oxidant concentrations.  Ozone trends in the Los Angeles area were norma-
  lized based on the seasonal index value.  Results showed that adjusted
-------
                                   32
seasonal daily one-hour maxima were improving slightly less rapidly than
actual trend data due to the more favorable meteorological conditions in
the early 1970' s.
     An index for S02 near Rotterdam was used by van Dop and Kruizinga [43]
based upon a measure of the strength of the inversion (A6), the maximum
mixing height (L), and the average daytime surface windspeed (U).   The
form was
A =
A8
                                      + C2/LU
where C, and C2 were chosen so that the two terms have equal values for
average conditions.  The daily value of A was averaged over each year for
use in normalizing that year.
     Zeldin [44,46] developed a method for meteorological adjustment using
weighted class- intervals for specific meteorological parameters.  Based on
known cause-effect relationships between oxidant and meteorological parameters
of inversion strength, temperatures aloft, inversion height, and surface pres-
sure  gradients, a stratified system was devised which numerically evaluated
the meteorological potential for each day.  Table 3.1 illustrates the
"scoring" structure.  A high score indicated a high pollution potential.
    Zeldin averaged daily meteorological  index-values for the smog season months
of June through September.  Resultant values were therefore a numerical evalua-
tion of the severity of the entire smog season.  Statistical comparisons were
then made to oxidant data for various cities in the Los Angeles Basin.  In one
study [44], correlations were performed using the number of hours exceeding
the California Air Resources Board (ARE) stage-one episode level of 0.20 ppm.
-------
                                    33
Point
Score
0
1
2
3
1 (°C) ! (°C) ' (feet)
Inv. Strength t 95Q_mb Temo. Inversion
< 5.0 | < 5.
5.1 - 7.0 , 5.1 -
 7.1 - 9.0 ; 8.1 -
i
- j
9.1 - n.o ! n.i -
i
i i
4 11.1 - 13.0 i 14.1 -
I
 5
6
i
i
i
i
8
13.1 - 15.0 17.1 -
!
!
f
I
!
15.1 - 17.0 j 20.1 -
i
i
17.1 - 19.0 ; 24.1 -
i
19.1 - 21.0  28.1 -
9 21.1 - 23.0 i 32.1 -
i >
0 5,000+
8.0 4,001 - 5,000
1
;
11.0 | 3,001 - 4,000
14.0 ! 2,501 - 3,000
i
2,001 - 2,500
17.0
i Surface
20.0 I 1,501 - 2,000
i
24.0 1,001 - 1,500
i
|
28.0 I 701 - 1,000
i
!
32.0 501 - 700
36.0 301 - 500
i
(millibars)
Prpssure Gradient
< -10.0
< +20.0
- 9.0 to - 9.9
+16.0 to +19.9
- 8.0 to - 8.9
+12.0 to +15.9
- 7.0 to - 7.9
+12.0 to +11.9
- 6.0 to - 6.9
+ 8.0 to + 9.9
- 5.0 to - 5.9
+ 6.0 to + 7.9
- 4.0 to - 4.9
+ 4.0 to + 5.9
+ 2.0 to + 3.9
- 3.0 to - 3.9
+ 1.0 to + 1.9
- 2.0 to - 2.9
0.0 to + 0.9
i
  10
> 23.1
36.1
150 -   300 !   - 1.9 to - 0.1
            l
Table 3.1   Zeldin's Point Classification Table.  (Each of the four meteorological
            variables yielded a score.   The daily score was the sum of the
            scores for each variable.)
-------
                                34
 In a second study [46], similar evaluations were made using the averages
 of the daily maximum hourly average.  Results for common locations with
 both air quality parameters exhibited similar trends.
      Using the -postulation approach, some of the indices evolve in rather
 arbitrary units which cannot be related directly to pollutant concentra-
 tions.  To achieve a more desirable result, most of the referenced authors
 used a single-variable linear regression of the annual pollution values
 against annual values (or averages) of the meteorological index in order
 to translate arbitrary meteorological index values into pollution concen-
 tration units (Fig. 3.1).  Suppose the resulting best-fit equation is
                                y " ax + b   ,                      (3-1)
 where y is the annual pollution measure and x the annual meteorological
 index.  Then this equation is intended to give the value of y which would
 be "expected" to result in a year when meteorology yielded a value x.  it is
 assumed that the differences between the expected value and observed value
 in a given year (the "residuals") are primarily the result of emission
 trends.  Zeldin [44,46] used this concept directly and plotted the resid-
 uals to elucidate the true trend.
      Mtist of the other references cited in this section took the more pre-
 ferable approach of attempting to indicate what the pollution level  would
 have been in each year if the meteorology index took its average value
 each year.   The  references are uniform in a lack of specificity as to how
 this adjustment  of measured values  was accomplished.   It appears that
 the procedure used is  as  follows:    Let  x" be the overall  mean  value  of
      ^
       Verified by conversation  for two  references  [15,41].
-------
                                            35
(Annual
Pollution
Measure)
                                                                        Best-fit Line:
                                                                        y = ax + b
                                                         x (Value  of Meteorological  Index)
                 Figure 3J   Annual Value of Pollution Measure Versus  Annual  Values
                              of Meteorological  Index.   (Five years  are assumed  in
                              this figure.)
-------
                                 36
 the meteorological  index (over all  years),  and "x* the average value
 for the k   year.  Assuming Eq. (3-1) holds,
                             a  x  +  b
 is  the  pollution  level  due  to  meteorology  in year  k and
                             a x" +  b
 is  the  average pollution  level due  to meteorology  over all years.  The
 excess  amount of  pollution  due to meteorology in year k is found by sub-
 tracting these two quantities, yielding
                            a (x* -  x)
                                    /N.
The meteorologically adjusted  value yk for year k  is found by subtracting
this excess from  the observed  value y.:
                       yk = yk - a (x  - x)
(3-2)
     Equation (3-2) has built-in insurance against the use of a poor
meteorological index.  If the correlation between pollution levels and
the meteorological index is zero, the coefficient "a" will be zero, and the
adjusted value will be the observed value.
      «
      Equation  (3-2) can be  interpreted as the residual error in the re-
gression for year k,
                         yk  - a 3^ - b
added to the overall average of y,
                          y = a x" + b
Thus, except for a change of origin, a plot of the residuals will look
exactly like a plot of values adjusted by Eq. (3-2).
-------
                                  37
      The major technical weakness of this approach is that the slope of
 the regression line (the constant "a") is affected by the underlying emis-
 sion trend,  as well as  by meteorology.  To see that this  is  true,  one need
 only examine the  extreme case  where  only two years  are  considered.   Since
 the regression line in  this  case  will  go through both points,  the  residuals
 will  be  zero.  .Thus, this  method  will  always indicate no  trend in  the
 meteorologically  adjusted  values, irrespective of their value,  for two
years.  This problem is  discussed in more general terms in the  following
subsection.
3.2  THE REGRESSION APPROACH
      In  the  previous subsection,  we  described  meteorological indexes which
 were  postulated from basic principles.   The underlying objective of  this
 subsection is  the derivation and  use of  such indexes by objective methods.
 Meteorological Adjustment  for  Two Time Periods
      A study of S02 trends in  Oslo, Norway, is typical of this  approach [17],
A study was  undertaken  to meteorologically adjust S02 air quality to com-
 pare  the periods  1959-1963 and 1969-1973.  The meteorological  conditions
 during the former period were  considerably different from those during the
 later period;  hence one  could  not expect a change in air quality to be
directly related to a change in emission potential.
     Data from the earlier period (1959-1963)  were used to do a linear re-
gression analysis for daily values.   It was discovered that two variables
dominated the estimate of S02 concentration:   (1) a temperature difference
between a low-altitude and high-altitude measuring  station,  and (2) the
temperature at the lower station.   For example, a typical  regression  equa-
tion for one station was
-------
                                      38
                        qSQ   =61.5  (Tg-T.,)  -  n.6T] + 472
(3-3)
 where
                                                          Q
      qso  = daily mean value of S02 concentration  in  yg/m  at the  particular
             station
        T2 = temperature at higher station  at 7  P.M.
        TI  = temperature at lower station at 7 P.M.  .
      The temperature-difference term is related to  the  strength  of the  inver-
 sion  and hence  the dispersion,  while the temperature  term is  related  to the
 variation  in  the  emission  of S02 due to space heating.    Since the daily tem-
 perature data for the  later time period is  known, the daily mean S02  expected
 for the meteorological  conditions  during that time  period can be estimated
 by Eq.  (3-3).   This was done for each day  on which  data was available in
 the later time  period; the predicted values were consistently higher  than
 observed values.   Thus, it was  easy to conclude that  emission  potential
 was reduced.
     A  quantitative statement was made in the report that the S02  pollution
was reduced 50%to 60%.  According to a conversation with one of the authors
of the  report, this latter  statement was derived by looking at the ratio of
the coefficient on the  temperature-difference term in the early time period
      This equation explained the observed values of S02 concentration with
a multiple correlation coefficient of .80; that is, the correlation between
values predicted by this equation and observed values for the period was
0.80.
-------
                                   39
 to the coefficient of the temperature-difference term in a similarly derived
 equation for the later time period.  The intuitive justification for such a
 statement is that the coefficient measures the degree to which a given tem-
 perature inversion will  be translated into S02 concentrations.  Thus a 50%
 or 60% reduction in that coefficient might be thought of as a meteorologically
 adjusted measure of the trend .in air quality.  This approach to quantifying
 the reduction is not easily generalized or of clear mathematical validity.
      Meisel suggested a more general approach to comparing the air quality
 in two time periods [28],  He suggested that equations relating daily air
 quality to daily meteorology be derived for the two time periods.  In the
 first period, suppose the best-fit equation is
where q, is an air quality parameter, m-j , ... m^ are meteorological param-
eters and F-, is some function of
suppose the best-fit equation is
  eters and  F-,  is  some  function  of  those  parameters.   In  the second period,
Then we examine the ratio of the equations:
                                P2(m-\ .....
                                                                     (3.5)
This can be calculated for any set of meteorological conditions.  If emis-
sion potential is unchanged, we would expect air quality to be about the
-------
                              40
same in both periods, assuming that all significant meteorological param-
eters have been identified.  Hence, the ratio q2/q-| will be about 1 for
all meteorological conditions if the emission potential is unchanged.  If
q2/q, is approximately constant over all meteorological conditions, but at
a value other than 1, then emission potential has increased or declined by
an amount given by that constant.  (If the constant is 0.90, meteorologically
adjusted air quality, i.e., emission potential, is 90% of its former value;
that is, it has improved 10%.)
     If q2/q-| is highly variable, then the improvement in emission potential
depends on the meteorological condition, i.e., the meteorological parameters
used do not adequately relate to the pollutant, and a more detailed examin-
ation is required.  One might find, for example, that there is a degradation
in adjusted air quality when the wind blows from the direction of a newly
constructed power plant, but an improvement otherwise.
     Mathematically,' one could calculate q2/q-| bY Eq- (3-5)  for the meteoro-
logical conditions of each day during both periods.  The median of all
these values might be taken as representing the meteorologically adjusted
change in air quality between the two periods.  The 95th and 5th percentile
values of q2/q-| might be used to indicate the range of uncertainty in this es-
timate.
Meteorological Adjustment for Multiple Time Periods
     Sidik and Neustadter [33] proposed a linear regression to correct a
series of annual pollution values for the Cleveland area.  A linear regres-
sion of pollutant levels was performed against 29 predictor variables; the
variables were meteorological, except for two rough indicators of economic
-------
activitys a seasonality variable, and the number of days from the inception
of the monitoring.  Because this last variable increases linearly with time,
it has the potential of accounting for a linear trend in emissions.
     A  similar method of meteorological  adjustment was  described by the
EPA  [41]  using seasonally  adjusted averages of wind  speed, temperature, and
rainfall  for  the  Los Angeles area.   Statistical analyses were .performed with
a nine-station  composite of seasonally adjusted  values  of  SC^  and TSP.  It
was  shown that SOp  was  correlated to inverse  wind  speed and  that TSP  was
negatively correlated with rainfall.   Observed quarterly levels were  nor-
malized to the average  meteorological  conditions for each  calendar quarter.
A variable corresponding to the year was used to model  a linear emissions
trend.   Since only  one  meteorological  variable was considered  significant
 for each pollutant, that variable was used to adjust observed  values  using
 Eq. (3-2).
      In the last two studies referenced, a "dummy"  variable  was  used  to rep-
 resent a linear trend in emissions.   This procedure  deserves further  discussion.
 In reference to Figure 3.1, we noted earlier that  direct regression over
 several years could result in a coefficient which  incorporated changes in
 emissions rather than meteorology alone.  In Appendix A, we  generalize this
 problem to the regression  of daily values in several years.   Our conclusion
 is that it is inadvisable to use data for several  years in a regression against
 meteorological  variables alone.  Instead, one should either use dummy variables
 or do  a separate regression in each year.
      If dummy variables are used, we  recommend using one for each year rather
  than a single linear trend variable.   (See Appendix A  for details.)   If the
-------
                                    42
trend is not linear (e.g., is U-shaped), a linear trend variable can create
misleading results.
     If a separate regression for each year is performed, a method such as
that recommended by Meisel and discussed above must be applied to each sue-
                                                                                  y
cessive pair of years to obtain an adjusted trend.
Cautions Regarding Linear Regression
        Because linear regression is one of the best understood and easiest
to use statistical analysis techniques, it can be a.powerful tool in meteoro-
logical adjustment, as discussed.  Some cautions are in order, however; it is
not always the best approach.
     The most serious limitation of linear regression is that its effective-
ness depends on the relationship between the meteorological variables and
pollutant concentration being approximately linear.  As we discussed in
Section 2.0, many meteorological variables are nonlinearly related to pol-
lutant concentrations.  If nonlinearly related variables are avoided, only
a portion of the variation in pollutant concentrations explained by meteor-
ology will be modeled.  This omission can potentially result in distortions:
year-to-year differences  in nonlinear variables would not be reflected
in the adjusted values.
      A nonlinear variable which is monotonically related to the pollution
 concentration will result in a correlation with that variable.   The use of such   .
 a variable in a linear regression can result in biased residuals.  (See
 Figure 3.2.)  The resulting equation will predict high for some ranges of
 the meteorological variable and low for others.  This condition can often
-------
                                  43
                                            linear
                                                   underlying nonlinear
                                                   relationship
Figure 3-2   Example of Linear and Non-Linear  Relationships.   (The meteoro-
             logical variable x is nonlinearly related  to y, but fit with a
             linear equation.  The linear equation will  consistently over-
             predict for large values of x.)
-------
                                     44
be  detected by plotting the residuals  (difference between predicted and actual
values of y) versus y.  If the regression is legitimate, there should appear
to  be no relationship between the two.
     Nonlinearities can be handled through redefinition of variables, in
order to maintain the linear regression methodology.  For example, it is
common to use the logarithms of a meteorological variable.  More generally,
tabular definitions are possible, as in Table 3.1.  In this case, Zeldin has
transformed nonlinearly related variables into a linear point score.   One
then simply uses the redefined variable in a linear regression.
     Nonlinear transformation of the concentration variable y are also
possible, but care is required.  If we predict, for example, log y with
40% of the variance explained, we are not explaining the same 40% of the
variance in y.  The predictions must be transformed into predictions of y
to  calculate the error in predicting that quantity.  Explaining more of
the variance in log y than in y is not sufficient justification for using
log y.  Instead, the percentage of variance explained in y by the log y regres-
sion should be computed.
     On the other hand, since most pollution values can be reasonably ap-
proximated as lognormally distributed, the use of the log of the concentration
may cause the residuals to be less biased.  The equations in this section and
in Appendix A are valid if y is interpreted as the log of the pollutant con-
centration.
     Another typical  error in linear regression is to use too many varia-
bles.   One can overfit the data by adding variables which do not signifi-
cantly improve that fit;  this practice can cause anomalous  results.   A
-------
                                 45
further difficulty in using too many variables is that dependencies among
the variables can make the coefficients uninterpretable.  If linear regres-
sion is used, we recommend stepwise linear regression to avoid using any-
more variables than necessary.
 Nonlinear Regression
      In  general,  if a  relationship  is  nonlinear,  it  is  better  to  perform
 nonlinear regression directly rather than  use  a number  of  tricks  to make
 the  problem  linear.  However,  software for nonlinear regression is not  as
 common or automatic as  linear regression.   Techniques of the following
 sub-section,  however,  allow nonlinear  effects  to  be  incorporated  without
 nonlinear regression.
      CLASSIFICATION APPROACHES TO '-METEOROLOGICAL ADJUSTMENT
     The basic difficulty with the regression approach is finding a simple
relationship between meteorology and air quality which holds for the full
range of meteorological variables.  The approach of this subsection is to
reduce this difficulty by breaking up the problem.  In particular, we focus
attention on days falling in meteorological classes which are conducive to
high pollution levels.
     Such a method  of adjusting oxidant data in  the San Francisco area was
employed by the Bay Area Air Pollution Control District [1].  By taking two
meteorological parameters known to have a significant relationship to oxi-
dant levels on a daily basis, adverse days were  selected for seven locations.
For each location,  a limiting temperature and an inversion  base height were used
to classify days into the "adverse" category.  Annual averages of the daily  .
-------
                                 46
maximum hourly concentrations were computed based on only those days clas-
sified as  "adverse."  The averages from year-to-year thus represented the
meteorologically adjusted oxidant trend.
     The applicability of this technique is apparent.  A specific subset
of days containing similar meteorological conditions approaches the ideal
situation  of holding meteorology constant.  The trend resulting from the
analyses can be regarded as a good representation of the downward oxidant
trend due  to effectiveness of emission control strategies.
     One potential problem, however, in employing this method is the varia-
bility of  meteorological conditions within the prescribed subset of days.
If the criteria selected are too broad, the variability within categorized
subsets may not adequately describe the desired "adverse" effect.  In
essence, meteorology is not held constant, and ultimately, results cannot
be regarded with confidence.  On the other hand, criteria set too strin-
gently can reduce the number of such days to the point of losing statisti-
cal confidence in the results.  Appropriate class definition is critical.
Again, even with good definitions, very large deviations from normal weather
patterns can seriously impact the accuracy of this model.  (See Appendix A.)
     A method for weather-correcting oxidant data in the San Gabriel  Valley
portion of the Los Angeles  Basin was developed by Kerr [24].   He classified
September meteorological  data for the years  1968 to 1973 into five meteoro-
logical  types:   (1) closed  high-pressure center aloft,  (2)  inversion  <  1100 ft.,
(3)  inversion 1100-3500 ft.,  (4)  surface Santa Ana wind  type, and (5)  closed
low or deep trough aloft.   These  definitions  involved some  subjectivity and
-------
                                  47
were not mathematically explicit.  A table was constructed depicting a fre-
quency distribution by weather category per year.  For each category, multi-
year averages of the maximum hourly averages were determined.
     Once the distribution of days by category was established, the weighted
averages of each year of record were computed; that is, Kerr summed the yearly
frequency of occurrence over all the categories  and multiplied by the concen-
tration mean for each category.  Thus, the expected concentration.value for
a year9 given the meteorology of that year, could be  calculated.  The resi-
duals  reflecting the difference between this  predicted value and the observed
value  were  used as  a measure of meteorologically adjusted trends.
3.4 METEOROLOGICALLY  INVARIANT STATISTICS
     Another method, presented by  Zeldin  et al.[46],  explores  the possi-
bility of performing meteorological  adjustment to  air quality  data  without
the use of  meteorological  data!
       In this  particular  study,  oxidant  data  were  analyzed in  the Los Angeles
area.   It  is  known  that meteorological variability affects annual ozone  con-
 centrations.   However,  even in  the most  favorable  of meteorological years,
 there  will  still occur a  number of particularly adverse days under  which
 elevated ozone concentrations  will occur.  The author found that,  by using
 the fifth-highest maximum hourly average at a given site for a particular
 year,  and then comparing that value to the same parameter in other years, the
 general meteorological patterns under which the ozone concentration occurred
 were nearly identical.  Thus no specific meteorological parameters were used,
 but the values reflect underlying emission trends.
-------
                                 48
     This method is basically untested.   It is not known whether the metho-
dology can be utilized more universally; however,  it does offer potential
use in areas where meteorological data are sparse.  If proven to be an ef-
fective method of providing a meteorological adjustment of air quality
data, the method allows one major advantage simplicity.  Since this ap-
proach, strictly speaking, exploits meteorological invariance rather than
meteorological adjustment, we will not pursue it further here.
-------
                                   49
              4.0  RECOMMENDED PROCEDURES  FOR METEOROLOGICAL
                           ADJUSTMENT OF AIR QUALITY
      This section describes in detail a recommended methodology to achieve
a meteorological adjustment of air quality data,  which in turn can result
in an air quality trend based on meteorologically adjusted emissions,
i.e., emission potential.  Guidelines are  presented for the definition
of distinct meteorological classes and for the use of these classes to
determine a meteorologically adjusted trend.  By determining average
concentrations for each class, and then by relating the data to a "typical"
meteorological year, a pollutant concentration trend can be obtained which
provides an estimate of what would have occurred if all years had closely
comparable meteorology.
      It  should  be noted  that  these recommended procedures, while  based on
 extensions  of ideas  generated previously  in  the  literature, have  not been
 tested.   Thus,  until  adequate evaluation  programs  have  been undertaken,
 results  of these recommended  procedures should be  qualified.
  4.1  SELECTION  OF METEOROLOGICAL CLASSES
      In  Section  2.0, we  discussed meteorological parameters which affect
  given pollutants and  the  nature of  those effects.  Presuming that one has
  a  list  of  candidate meteorological  variables for  defining meteorological
  classes,  it  remains  to  define the classes  using all  or some of these
  candidate  variables.
      A  "meteorological  class" is a  set of  days which have a  specific set
  of meteorological  characteristics.   Such classes  can be naturally occurring
-------
                               50
weather categories such as those listed in Table 4.1.  In this type of
classification process it is quite possible for two weather classes to have
the same average pollution level, i.e., the same "pollution potential."
As in Table 4.1, classes 1 and 5, although meteorologically dissimilar,
both result in low pollutant concentrations.  It is clearly unnecessary,
therefore, to be concerned with fine distinctions between meteorological
classes which always result in low pollution concentrations.
     The simplest categorization possible is into two classes:  the "adverse"
class and the "non-adverse" class, as in the Bay Area APCD work [1] reviewed
in Section 3.3.  The adverse-class approach has the disadvantage of requiring
a very accurate definition of meteorologically adverse days.   If the defi-
nition misses many high-pollution days or includes too many low-pollution
days, the resulting trend estimate may be unreliable.  If a greater number
of classes are defined, the requirement for precision in defining each class
is reduced.
      The  definition  of meteorological  classes may  depend  to some extent
 on  the use made of the results  of the  analysis.   It  is  possible to  have
 distinctly different trends  in  each  meteorological class  if the major
 emission  sources  differ for  each  class  (due  to wind  direction, for  example).
 Even  if two  classes  which  are  meteorologically distinct have  approximately
 the same  average  pollution levels, one may wish  to preserve the distinction;
 doing so  may allow isolating the  effect on  the overall  trend  of specific
 emission  sources.
-------
51
JQ
S-
cu o
P 0
-P UO
Q_
O
Q.
O
CU E
Q. >j
t
-o
S- CU
CU -P
.E  ro
PI 	 1 T-
re I--H O
CUOJ O
31  i oo
oo cu
>,  -r-
CO i 
III flrt
LJLJ  E
oo S-
OO CU
I -p
0 3
o
r CO
03
o cu
i .c:
CD-P
o s-
,- i. cu
o o -E
S-M- -P
O ra
cu cu
"p r*^
cu
03
i CU
=d- cu
CJ3
cu
JO
1
*
to
r
oo
o
E S- CU
i- CU -E
3 o "^
i "~ O
S. -E
CU -P -P
_E -i- 00
P 3 «3
i- CU
O 03
E CU CU
s- s-
O1 03 3
E 00
O S- oo 03
s- cu cu cu
p > i- i-
CO O Q. 03
S-
O)
0
cu
^3
00
cu
o.
E
en oo
r- 03
.E CQ
CU -P
O) 03
S- CU
o3 $
_l CD
3
0
en oo
3 E
o o
S- T-
C~ t _^
P T-
TI3
S- E
r- O
03 O
p_ 
03 03
P E
cu
E 03
I- -p
P E
E ro
o 
0 =
S- >>
T3 ro
a
P CU
O -E
re -P
JE
05
r
j^
r-~
(O
0
r- (O
2t-
fO
4-5
I S-
o CU
3 >
CO O
-P .E
ra a>
CU 3
S- 0
CD S-
P
S^
CU i
0 E
cu cu
3 -P -P
00 S-
00 T3 CU
CU E 00
S- 03 CU
a. -a
J= -r- S-
m to cu
i- (O >
re CQ o
"O 4-*
>
-a i?
o cu
E >
«v «\
E E
0 O
0 0
E E
CU S-
5- CU
0 -P
«4- 1-
rO
S- E
-0 -i-
>> s-'cu'
r !- >
CU 03 n3
> s
r- CU
P E -P
fO *r" fO
r- S- CU
CU 03 _E
o: E 
C\J
ro
cu
s-
03
S-
cu
o
cu
5-
3
00
00
cu
a.
-E
ro
I.
re
s-
cu
o
C^
0)
0
4-J
r
(O 4-3
S i-
S- CU
CU 00
J= CU
] VJ
p
o
JE
 n
C__
r-
03
CU
E
r-
S-
03
PS
o
CU
1^
«»- E
- O
a o
IE
cu
P M
oo ra
i 
O E
s: -i-
oo
03
cu
s-
s-
cu
>
0
CD
3
o
S-
1
s-
cu
>
o
.E
O)
1
{-)
I
03 -P
E S-
i- CU
CU 00
JT: cu
1 -a
j~
t-
03
^
 A
S-
 1
ra
cu
E
r*
S-
03
E
"O
CD
^
tj-. C
^ 0
"0 0
E £
cu
A -p
t \ >l
oo 03
r
O E
S -r-
"*
-o
cu
00
o
r
O
5-
O 03
CU
-E S-
O) ro
3
O S-
s- cu
p >
o
a.
cu 3
cu o
a i
s-
cu
>>
o
3
o
r
cu
a.
i
o
i^
p -P
Q. J-
O CU
E 00
>> CU
CO T3
S-
r
ro
CU
E
r-
S-
03
E
a. o3
CU "D
cu
T3 CU
^"
n -{_)
P
00 -P
r- 3
O O
E-E
O)
« 3
i O
0 i-
O J=
0 -P
LO
-------
                                   52
     There are two basic ways to define meteorological  classes,  differing
largely in emphasis:
     (1)  Define the  classes through experience and meteorological
          expertise,  guided by examination of historical  data;  or
     (2)  Define the  classes using a data-directed empirical  procedure,
          guided by experience and meteorological  expertise.
We examine these two approaches in the following two subsections.
4.1.1  Definition Through Experience and Expertise
     The experience gained in working with meteorological and air quality
data can serve as a basis for defining categories.
     If a meteorological pollution-potential index is defined from basic
principles, as exemplified in Section 3.1, classes can be defined by
use of the index.  The range of values taken by the index can be divided
into intervals; each interval then corresponds to a meteorological class.
For example,  using Zeldin's  point scoring system  (Table 3.1), a given day
has a score between 0 and 40, based on the values of four meteorological
variables.  Days with a  score over 34 might form  one class, days with a
score between 29 and 34  another class, and so  on.  When classes  are defined
this way,  they should all have  different average  pollution levels.
     Classes  might be defined using knowledge  of  obvious meteorological
classes common  to a given area, as was illustrated in Table 4.1.  Another
example is "Rule 57" days, discussed in  Section 3.1.  If, as in Table 4.1,
the class  definitions are not mathematically explicit, the analyst must be
prepared  to classify every day  used in the analysis by inspection of the
meteorological  data.
-------
                                  53
     If the subjective approach to defining classes is taken, it is important
to test if there are significant differences in average pollution levels between
at least some classes; otherwise, the categorization is inappropriate for
trend adjustment.  Thus, air quality data are used to check  the classes
defined.
4.1.2  Empirical Derivation of Meteorological Classes
     Because the cause-effect relationships between meteorology and air
pollution are sometimes counter-intuitive, it may be necessary to use data
to guide the definition of meteorological classes more explicitly.  The data
required are the daily meteorological variables and the corresponding daily
                 *
pollutant values.
   .. Some computationally demanding methods of deriving such classes could
be employed.  For example, a program called AID, developed at the University
of Michigan, can be used to analyze meteorological data and corresponding
pollution levels [34].  It will generate from such data definitions of
meteorological classes so that the pollutant levels on days within each
class are as uniform'as possible and such that the levels in each class are
as distinct as possible.  Other approaches to this class-definition problem
are available in the literature [49], but not as standard statistical packages.
     In Appendix B, we describe a procedure which uses common statistical
software packages (for example, SPSS and BMD) and human judgment to implement
a powerful  approach to the data-directed derivation of meteorological
classes which discriminate pollution levels.  To illustrate the basic approach,
we will use as an example work by Davidson [14].
     *
      The daily value chosen is usually related to a standard, e.g., the daily
maximum one-hour-average oxidant concentration.
-------
                                54
     Davidson attempted to define meteorological classes in Los Angeles which
corresponded to different oxidant concentrations.  He used data to guide his
definition of those classes.  He plotted scatterplots of one meteorological
variable versus another.  On such a plot, the values of the two variables
for a given day define a point.  He identified the pollution potential
on that day by indicating that point on the plot which corresponded to
an alert day (maximum one-hour average oxidant above 35 pphm) or a non-alert
day.  In Figure 4.1, an example of an alert day  ("A") and a non-alert day
("N")  is  indicated.   By examination of such a plot,'regions corresponding
largely to alert days can be "fenced off"; similarly, regions corresponding
to non-alert days can be defined.  In Figure 4.1, regions C and A are
directly defined; that is, given a day with particular values of the
meteorological variables, the region to which it corresponds can be deter-
mined from the plot.
     Area B in Figure 4.1 does not have a clear  character, having many alert
and non-alert days.  Thus, when a day fell in region B, Davidson used other
meteorological variables to further differentiate meteorological classes
such that every day could be explicitly classified.  His full classification
scheme  is shown in Figure 4.2.
     This 'procedure can  be represented as a  decision tree  (Figure  4 .3).
Such an end result has a number of advantages.   The most important is  that
it allows  one or two variables  to be used at each "decision box"  in  the tree,
so that visual  derivation of the boundaries  is  possible; yet a larger  number
of variables can be used in the overall  classification.   It also  allows
the natural incorporation of subjective judgment and experience in guiding
the choice of boundaries and variables.
-------
                                           55
-xl
     o-
    £  +15
     ft)
    C3
     o
     o
s_
T
88
                0600 PST 2500-Foot Temperature,Los Angeles Int'l Airport, Deg. F.
                          '       '              X2
                     Figure 4.1
                   First Stage of  Discrimination  Employed
                   by Davidson.
-------
                                                           56
                           CHART I
+20
+15
+10
*! £
       -5
       -10
   O 8)
   in u.
   uw-
   S O
   =ts
   tr 4>
           -  Area A    \ Area B \    Area C
         <55
                60
                      65
                            70   75
                                       80
                                             85  >88
            0600 PST 2500-Foot Temperature, Los
            Angeles Int'l Airport, Deg. F,
                                                              +4.0
                                                              +3.0
                                                         ^   +2.0
                                                              +1 .0
                                                    a. £-
                                                    gl-3.0
                                                                              CHART  II
                                                                                Area B
                                                                                 Area B,,
                                                                              Go To Chart III
                                                                   -20  -15
                                                                                                      +10
                                                             0700 PST Temp. Gradient, Los Angeles Int'l
                                                             Airport-Palmdale, Deg. F.
                                                       Chart III
                                   +13
                                           0800 PST Visibility.  Los Angeles Int'l
                                           Airport, Miles
            Figure 4,2   Full Prediction  Scheme by Davidson.
                            area is indicated.
                                                                      The  prediction  for each
-------
                             57
   3.2%
Alert Days
x, and Xo
 Area B or
  Area C?
               73.3%
            Alert Days
             x3 and x
                                   in Sub-Area B, ?
                            11.835
                       Alert Days
                           and
                                             Sub-Area D.. or D. ?
                                                       n     L
                                      Class  1
                                       14.4%
                                   Alert Days
                                      Class 2
                                        62.9%
                                    Alert  Days
      Figure 4.3   The Decision Scheme of Figure 4.2  Drawn as a  Decision Tree.
-------
                                   58
      Notice that there are two meteorologically adverse classes which
 correspond to two different "natural" meteorological classes which cause
 high pollution levels.  These could be described qualitatively as follows
 (referring to Figure 4.2):
       (1)  'A high 2500-foot temperature at 0600 Pacific Standard Time at LA
           International Airport  (x-j) in combination with a large 24-hour
           500-millibar height change at Vandenberg  (x2); or
       (2)  Intermediate values, of X2» or jointly high values of x-, and x~;
           low-to-intermediate values of the  LA-Palmdale pressure gradient
           at 0700 PST  (or low values of temperature difference between LA
           and Palmdale at 0700); and a large temperature increase between
           0600 and  0700 PST at Palmdale in combination with low visibility
           at LA  International Airport at 0800.
One might  treat the  combination of  the two classes as a single class  ("adverse
days") or  treat the  two subclasses  separately in determining trend by methods
discussed  later.  In fact, all five classes resulting from the tree could be
used.  We recommend that the subclasses be treated separately rather than
grouped.
     Appendix B elaborates in some  detail a similar procedure for deriving
such a tree.
4.2  A METEOROLOGICAL ADJUSTMENT  PROCEDURE
     Once the appropriate meteorological  parameters have been selected and
the specific meteorological  classes have been determined,  a procedure to
adjust a multi-year set of air quality data for meteorological  variation
must be utilized.
-------
                               59
     A meteorological adjustment can be performed rather simply by using a
"typical-year" approach.  This can be accomplished by determining the average
frequency of occurrence for each usable category over the analysis period
(e.g., 5 years).  These values represent the "average" occurrences, thereby
delineating the basis for a "typical" meteorological year.  From the typical-
year frequencies, observed concentrations by category and year can be used
to calculate the concentration values for each year which would have occurred
 if the year had typical meteorology.   In other words, average concentrations
 are restated for each year assuming  each year  has  the same meteorology.  This
 results  in  normalized values  which relate  to the same scale of values repre-
 sented by the actual observations.   Thus actual trend data and the  meteoro-
 logically   adjusted  trend  data  can be  plotted  in a super-imposed manner to
 achieve  an  easy-to-interpret  comparison, the adjusted values  showing  the
 underlying  trend in  emission  potential  and the difference indicating  the
 effect of favorable  or  adverse  meteorology.  It also  becomes meaningful to
 calculate meteorologically adjusted  percentage improvements.
      To  achieve this objective, we  proceed as  follows:
      Tabulate
      nkm =  the number of  da^s 1'n meteorological class k  in year m
      Ckm =  the average  concentration over  the  n.   days  in class k in  year m
       M =  the number of years  under consideration
       K =  the number of meteorological  classes used
             M
      "*=  J, "»
            (the number  of  days  in class  k  in  the  entire M-year  period)
-------
                                   60
            (the total  number of days in all  K classes  and all  M years).
 Then
                         m
'km
(4-1)
 is the observed average concentration in all K classes for year m.  These
 values for each year represent a measure of the unadjusted trend.
     The quantity
                        Pk = Nk/N
 (4-2)
 is the "typical" fraction of class k days occurring in a year (based upon the
 frequency of occurrence of class k days over the M-year period).  Then a
 "typical" year will be one where days in the kth class occur with a frequency
 Pk'
     The average pollutant concentration over all K classes in year m if
the distribution of meteorological classes was "typical" is then
                           _
                        Cm =     Pk Ckm  '
 (4-3)
These values represent the meteorologically adjusted trend, i.e.,  the average
concentration if each year had the same distribution of meteorological  classes,
 Determination  of  "Typical-Year"  Conditions  (for  Days Exceeding Threshold)
     A  method  similar to that of concentration normalization can be  employed
 to  determine "typical-year" conditions for  the number of days exceeding some
-------
                                61
designated threshold level (e.g., the NAAQS).  Using the same category struc-
ture as constructed for the previous analysis, computations can be based on
the percentage of days exceeding the threshold value by category.
     If we let E,   be the number of exceedences of the threshold level for
                Kill
category k in year m, then "Ckm can be replaced by
                           r   _ Ekm
                            km-\m  '
the fractional number of exceedences for category k in year m.  Following
the procedures outlined previously, we compute
                            "  _  K
                            Em = J-, Ekm pk  '
which  now  represents the meteorologically adjusted fraction of days exceeding
the threshold level  in year m.   To obtain the meteorologically adjusted number
                                                           />.
of days  per year  exceeding the  threshold, we just multiply Em by  365.
Limitations on Averaging Period
     We  have discussed the methodology as an average of daily values over the
entire year.  Naturally there will be, in any practical situation, days for
which  all  data are missing or for which sufficient data are missing to make
the estimate of the  standard-related value (such as the maximum 1-hour average)
suspicious.  Further, some pollutants such as TSP are typically sampled rather
infrequently  (e.g.,  every 6th day).  There is nothing in the methodology de-
veloped  which changes its validity if the analysis is limited to  days of the
year  in  which data are available and the averages are taken over  those days
of the year.  When the missing  data are randomly spaced throughout the year,
-------
                             62
the probabilities of each meteorological class occurring should be determined
from as full a set of meteorological data as possible and not be limited to
those days on which pollution data are available; however, the average con-
centration within each class can (and indeed must) be limited to those days
in that class on which data are available.
     The same logic applies to pollutants for which there is a strong sea-
sonal effect.  For example, high oxidant concentrations are centered about
the late summer, and carbon monoxide concentrations are higher in the months
of October through February.  One could thus look at the trend from year to
year in the pollution season.  In that case, the averaging is again done
over the available data in the season to which the analysis is restricted.
In this case, however, the relative frequency of meteorological classes
should be determined only for meteorological data in the seasons being
analyzed.
Limitations of Data Avai1abi1ity
     A minimum of five years of data is recommended to determine a reasonable
"typical year."  Once this has been established, continuing trend analyses
can be performed in reference to the already established typical-year data.
If, on the other hand, it appears,,at some future time, that the typical-year
meteorology of the original data set was no longer representative of the
longer-term period, then the P. 's can be re-computed to establish a new
typical-year data base.  The recommendation for five years of meteorological
data does not necessarily imply that five years of pollutant data are re-
quired.
-------
                                     63
 Number of Categories  Used
      In -some  cases we may  be  interested  only  in trends of the highest pollution
 levels.  A possible procedure is  to  limit analysis to those categories
 comprising approximately 50%  of the  sample days containing the higher concen-
 tration.   For example,  if  we  classify the days into six categories, and if
 half the  days occurred  in  high-pollution classes  1 through 3 while the other
 half occurred in  low-pollution classes 4 through  6, then the latter group
 would be  excluded from  further consideration.  If this approach is taken,
 the formulas  of this  subsection can  be interpreted to be over the K worst
 meteorological classes.
      Caution, however, must be exercised if the classes used are not
 inclusive of  all  days in the  year.   If 50% of the days are excluded at
.one locale and 60% at another, the two trends may not be strictly comparable.
 If trends are stronger  for the more  adverse classes,  then including more
 lower-pollution days  will  make either an uptrend  or downtrend less pro-
 nounced.   Thus, if trends  at  several locales  are  to be compared, it is
 perhaps best  to use  classes which encompass all days, and perhaps to note
 trends in individual  meteorological  classes as well.
      A way to minimize  (and  possibly eliminate) the uncertainty  caused by
 differing absolute trends  in each class  is  to average the logarithms of the
 concentration; i.e.,  wherever we  have  used C"km in the equations, log Ckm
 can be substituted.   Then, the year-to-year change  in adjusted  pollutant
 levels is. related to the  percentage changes  in each category rather than of
 the.absolute  changes in each category.
 Adjustment of Means within Classes
     There  is  an  implicit  assumption in  the methodology discussed that  each
 meteorological class is sufficiently well-defined that the  meteorological
-------
                                64
 conditions  occurring  from year  to year  do  notivary  significantly within
 the class.   If they do,  then  one can meteorologically  adjust  the average
 concentration  within  each class.  Since this  is  a second-order effect,
 it does  not deserve extensive discussion.   We,however,  note that the linear
 regression  approach can  be used to  adjust  the mean  within  a class.   (It is
 much more reasonable  to  assume  that a linear  relationship  holds within a
 class than  for all  days.)
     Given a derived regression  equation for data within a  class (such as
shown in Appendix A, Equation (A-3)), how should  it  best be used for meteoro-
 logical  adjustment  of observed  values?  We  can extend Equation (3-2) (which
 related  to  a single variable) to multiple  variables.  Let
     7.  = the average value of the  i    meteorological  variable over all
           years of interest(within  the  class)
     x^  = the average value of the  i    variable in  the k  year (within
           the class)
     y^  = the observed average of daily pollution values over year k
            (within  the class).
Then the meteorologically adjusted values for each year  (for the class  being
 adjusted) are
                                     a.  (x.  -  x*
The  technique discussed in Appendix A should be used to derive the regression
coefficients for each meteorological class separately.
-------
                                   65
4.3  CHOICE OF THE AIR QUALITY MEASURE TO BE ADJUSTED                             ?
.		                                                          ^
     We have noted previously that one element of meteorological adjustment
procedure is the basic decision of which air quality measure (or index) to
adjust.  We have delayed a specific discussion of the tradeoffs involved in
this choice to this point in the report in order to place it in the perspec-
tive of available methodology.  (Some general considerations were addressed
in Section 2.2.2 on the differences in meteorological effects on various air
quality parameters.)  In our recommended procedure, we settled  on two mea-
sures:
      (1)  The annual  average of daily standard-related'values  (daily 1-^hour       ^
          maximum, daily 8-hour maximum, daily average);  and
                                                                                 dl
-   -  (2)  The fraction  of days in  the year  exceeding  a pre-specified             V
          threshold  (e.g.,  the fraction  of  days  each  year the  daily
          maximum 1-hour average  oxidant exceeded  the secondary standard).
We will  discuss  the  motivation  for these particular choices of air  quality
measures and then discuss  other  alternatives.
      Since  the  daily standard-related value (e.g., the daily 1-hour maximum)
 is a  measure of the  health effect on that  given  day, then the average  of
 daily values yields  an overall  average impact for the year or for the  por-
 tion of the year where the problem .exists.   This is a useful measure of
 overall progress in pollution control for all meteorological classes and
 for all pollutant levels;  it is, however,  not directly related to progress
 in meeting the air quality standards.  These standards emphasize the more
 meteorologically adverse (high pollution-potential)  days.
-------
                                 66
     The second measure, the number of days per year for which the pollutant
concentrations exceed a specified value, is a measure oriented directly
toward progress in meeting the standards.  It can be derived separately for
primary and secondary standards and can also be limited, to the pollutant
season.  A meteorologically adjusted trend in the number of days in which
the standard is exceeded is a very direct measure of progress in meeting
the standards, yet it is not quite so particular to a single day's meteor-
ology as is the annual maximum of the standard-related;value.
     An adaptation of the first measure which makes it similar to the
second measure was suggested in Section 3.0limitation of the meteorological
classes considered to the more adverse classes.  This limitation is more or
less automatic with the second measure:  Meteorological  classes with no
exceedences will not be used and thus need not be defined.
Other Alternatives
     Many standards are stated in terms of the annual maximum of a short-
term average, and these standard-related values are regularly reported.  One
could ask for the meteorologically adjusted value of such a quantity.  The
difficulty in doing so directly is that such values often occur due to un-
usual and extreme meteorology and are not easily amenable to adjustment.
It would, for example, be difficult to identify a "typical" worst case to
which one might adjust.
     However, the measures of the annual average standard-related value and
the exceedences of a threshold value can be combined to give an estimate of
the meteorologically adjusted standard-related value.  Specifically, suppose
-------
                                67
we are interested in the annual maximum 1-hour average of a pollutant.
Suppose we calculate the meteorologically adjusted annual mean of the daily
maximum 1-hour average, say y.  Suppose we have calculated the meteorologi-
cally adjusted probability,Psof the 1-hour maximum in a day exceeding the
threshold,r.  Then we have two parameters of the meteorologically adjusted
distribution of the daily maximum 1-hour average:  The mean y and the
(1-P)   percentile value T.  If we assume a two-parameter distribution for
the pollutant, e.g., log-normal, then the distribution is fully determined.
Any percentile of the distribution, including the percentile corresponding
to the annual maximum,can be estimated.  (The highest day of 365 can be
                     f*h
estimated as the 99.7   percentile.)
     Another alternative in measuring the pollution trend is the use of
other than daily measures.  One might propose to adjust, for example,
monthly, quarterly, or annual data.  We have discussed approaches to doing
so which adjust such longer-time-period measures by the number of adverse
days in the period or by the value of a meteorological parameter or index.
When daily values are used, the meteorology for the day is directly related
to the pollution level for the day.  When data are aggregated over a larger
time period, the direct correspondence is lost.
     One could further disaggregate and go to hourly values rather than
daily values.  If the methodology were extended in this direction, the im-
plication would be that hourly meteorology is related to hourly pollutant
values, an assumption seldom satisfied.  Further, hourly values of pollu-
tant and meteorological measurements are certainly not independent, while
day-to-day  levels of  pollution and meteorological variables are considerably
more independent.
-------
                                    68
      4.4  AN EXAMPLE OF METEOROLOGICAL ADJUSTMENT OF AIR QUALITY  DATA
     This subsection presents a simple hypothetical example  to  illustrate
the type of trend analysis possible and to provide numerical  examples.
SO? Point Source to Receptor
       In this hypothetical  situation,  a  point source  of  S02 emissions is
  located 10 miles  due east  of an  air monitoring  facility.  Over a five-
  year period, control efforts have  supposedly reduced S02  emissions, but
  increased workload demands upon  the facility have  resulted in an increase
  in production rates.  The problem now is to determine which  of the two
  counter-balancing measures has had the  greatest impact  upon  the ambient
  S02 air quality.   Examination of the  annual  average  S02 concentration
  levels  from the air monitoring facility  has  been inconclusive due  to the
  year-to-year meteorological  variations  (see  Table  4.2).
  Table 4.2   Annual S02 Average Concentrations for  the Hypothetical Receptor Site
   Annual S02
   Average
   Concentration
       (PPM)
                                         YEAR
                          1968      1969      1970      1971      1972
.082
.088
.100
.077
                                        .089
-------
                                      69
      A search for meteorological  data indicates that wind records  are
 available at both the source (S)  and receptor (R)  locations.   Using  1968-1972
 data, mutually exclusive categories  are selected as  follows,  based upon
 the severity of wind conditions causing transport of the  S02  emissions
 from S to R:
      Category 1:   Wind direction  at  both  S  and R are from the east for
                   at least 12  hours  per day.
      Category 2:   Wind direction  at  both  S  and R is  from  the  east  for
                   at least 4 to 11 hours  per  day.
      Category 3:   Wind direction  at  either  S  or R, but not both, contains
                   an easterly  component for at least 4 hours  per day.
      Category 4:   Wind direction  at  either  S  or R, or both, contains an
                   easterly component for  1  to 4 hours per day.
      Category 5:   Wind directions at both S and R  have no easterly
                   component during the  entire day.
      For  the  five  years  of analysis,  the  average daily S02 concentration
 is  evaluated  against.the wind  data as specified in the five categories.
 Results are shown  in  Table  4.3.
     Assume we are interested only in the most adverse 50% of the days.   The
category cut-off point is determined by summing the P.  values until  the  top
categories approximate 0.50.  In this instance, in categories  1 through  3,
  (1)
ZPk  = °-487-  Therefore, categories  4 and 5 will not be  processed  any  further.
      The  next step in the  process is to compute the  mean  concentrations
 for "typical-year" meteorology.   Table  4.4  gives the tabular  layout  for  the
 computation  of the "typical-year" values.
-------
                                                70
 03
 -Q
 S-
 o
f
o_
-a
 c
 ro
 w>
 cu
r*
4J
UJ
  ^*i CU
|O  I/)
~"EO
 C  i
 CU  O
40  +J
     CU
     O
     a.
 (O
 s-
 (O
     CU
     4->
     s-
     o
I   Q.
CO
  *
 a
r
 to
E
S-
-
*CM .
V  *
Q.
K  *
O.
Icf
-^
CM
r--.
en
r
en
r
O
en
r
en
to
cr>
r
co
to
cn
""
S
'"_
£=
=i-
cn
CD
'
to
0
CM
cn
r***
cn
cn
"
CO
^"
co
co
CM
LO
^
LO
CM
^^
CM
to
r-
CM
^^
^t*
r
CO
0
CM
O
CO
r
r
CM
to
O
r
'
cn
o
co"
CO
""
to
to
r
s
^J-
to
f**x
' 
O
CM
co
co
r
^^
co
to
CO
^j-
r
s*^
r
CM
to
LO
r
'
[>^
CO
CM
CD
CO
to
LO
CO
CO
'
to
cn
0
cr>
r--
LO
CO
cn
CD
LO
r
p~
0
O
i 
CO
o
"~
to
cn
0
^^_
CO
*~~
LO
o
«
^^^
LO
CO
r
^
0
co
o
*~~
CO
cn
CO
cn
LO
o
0
0
to
r^
LO
o
r
r
"~
co
LO
0
^
to
r
O
to
o
^^
LO
0
1
cn
to
CD
^^
j^
cn
cn
LO
o
to
CM
~
*
CM
to
*
CD
O
O
CO
CM
O
o
LO
LO
LO
O
j^-
«^-
^f
O ,
» 	
5:
1 
=*
O
c
*^^
to
P-.
co
CO
CD
,_r
to
LO 2;
CO
o
CQ
C\J
r-.
cn
CO
0
CD"
^»
co
^
rv.
0
<
5
CO
o
0
^^
cn
CO
co
co
CO
o
s^
CO
CO
CM
co
o
j^
LO
CO
E| !oE
Js^
^.
(^
J^
^
II
** *
CO
z.
o
cn
in
CU
1
o
cn
CU
eo
0
CU
CU
i.
4->
4J
in
i.
 03
 O
 CU
 in
 to
jQ
            CU
           o
           CT)
O r^
fO ro
I £
Q. a.
o
Q.
03
o
r
a.
03 |
CU CU
5 5
           (/)
          I~"
         CM
          a.
                                            AH093IV3
-------
                                     71
Table 4.4  Tabular Entries Showing the Product of the Top-Three-Gategory
                                     (2)
           A Priori Probabilities, Pkv  ',and the Average Category Concentration,
                  (C  is the sum of the individual category entries and repre-
           'knr
           sents the normalized concentration by year.)
                                    YEAR
1
2
3
/^
C
 m
i
1968
.019
.033
.053
.105
1969
.020
.032
.072
.124
1970
.024
.038
.064
.126
1971
.022
.038
.069
.129
1972
.018
.036
.067
.121
            o
            CD
        For  comparative  purposes,  the  same  procedure for meteorological
  normalization has been computed for the full five categories.  Results
  are shown  in Table  4.5.
                   Table  4.5   Same as Table 4.4, except over all "five
                               categories, using P^O ),instead of
1
2
3
4
5
s\
Cm
1968
.009
.017
.026
.021
.005
.078
1969
.010
.016
.035
.024
.007
.092
1970
.012
.019
.032
.021
.007
.091
1971
.011
.019
.034
.018
.007
.089
1972
.010
.018
1
. 033 j
.020 ;
.006
.087
-------
                                    72
      Figure 4.4 shows the actual and meteorologically adjusted trend data.
The adjusted trend over categories 1  to 3 indicated that S02 conditions  de-
teriorated until 1971, after which improvement began.   One can see that  the
effects of increased production appear  to have overshadowed the control
efforts in the first few years, but that the control  efforts did indeed
have an effect upon improving the nearby air quality.   Notice,too, that
this conclusion is not discernible from the actual  data.
      Also shown in Figure 4.4 is the annual adjusted and unadjusted trend
over all categories.  In this analysis, the worst condition occurred in  1969,
with slow  improvement thereafter.  The  importance of this effect  can
be  attributed  to the impact of remote sources affecting background levels.
Since the  inclusion of categories 4 and 5 contain meteorological  conditions
not conducive  to transport from  the primary source facility, the  change
in  trend indication is due to the impact from other sources being trans-
ported  over much larger distances.  Therefore, one can conclude that
an  overall improvement in background levels since 1969, due most  likely
to  effective control efforts at  the distant  sources, has made a  signifi-
cant impact on the air quality at the monitoring location.  Even  though
the shape  of the unadjusted curves for both analyses are about the same,
the meteorological adjustment methodology is able to distinguish both
the effects from the primary source of SC^ and the remote transport
affecting  background levels.
     Let us now consider the evaluation of the trend  of S02 NAAQS  violations
(0.14 ppm for a 24-hour average).  An  examination of  the data  from the
-------
                                   73
CO      CM
                                en
                                o
                                     co
                                     o
                                                         O
un
O
                                                                                     in
                                                                                    -a
                                                                                     1
                                                                                     
                                                                             o
                                                                             r^
                                                                             en
                                                                             10
                                                                             CT>
                                                                             CO
                                                                             t£>
                                                                             cr>
                                                                                 C  CO
                                                                                i-  3
                                                                                 3 T>
                                                                                 o -o
                                                                                 Q. (O .
                                                                                 E  O
                                                                                 a T-
                                                                                 x  01
                                                                                 LU  o
                                                                                 r  O
                                                                                 CO  S-
                                                                                 o  o
                                                                                 r-  CU
                                                                                 (-> +J
                                                                                 I*
                                                                                 o -o
                                                                                 Q. 0)
                                                                                 >>*-»
                                                                                 Z  fO
                                                                                 H- 'G
                                                                                 o  o
                                                                                    CO
                                                                                 CO  CO
                                                                                     CO 4->
                                                                                     QJ -r-
                                                                                    o; 3
                                                                                    cu
                                                                                    O)
(uidd)  uoi^Baq.u9DUOO [enuuv
                                                   OS
-------
                                       74
    receptor location indicates the following number of violations per
    year:
                   Table 4.6   Number of Annual SOg NAAQS Violations
                               for Hypothetical Receptor Site
1968
1969
1970
19.71
1972
         Number of
         SC>   violations
62
48
76
32
57
         Using  the  five  categories  established previously, Table  4.5 is duplicated
    below, except the C"km values have been replaced by Ekm (the  number  of violations),
   Also  shown  in parentheses are the  values of Efkm  (= Ekm/nktn):
      Table 4.7   Tabular Layout (similar to Table 4.5} for Weather
                  Correcting the Number of Violation Days.  (n|<
                  is the Total Number of Sample Days per Category,
                  and PL1'is the A Priori Probability of Each Category.)
1
2
3
4
5
1968
30*/30**
(1.000)
37/27
(.0.730)
103/5
(0.049)
126/0
(0)
61/0
(0)
1969
14/14
(1.000)
21/12
(0.571)
135/21
(0.155)
97/1
(0)
76/0
(0)
1970
12/12
(1.000)
63/51
(0.809)
118/13
(0.110)
105/0
(0)
41/0
(0)
YEAR
1971
5/5
(1.000)
20/15
(0.750)
108/11
(0.102)
161/0
(0)
47/0
(0)
1972
18/17
(0.944)
41/28
(0.683)
115/12
(0.104)
m/o
(0)
55/0
(0)
nk
79
188
579
600
280
pf
.046
.106
.335
.349
.162
UJ
fe
o
       Number of days in category.
     **
       Number of days in category which violated standards.
-------
                                           75
          The next step is to compute the expected frequency of violations per year
     under typical meteorological conditions.  This is done by summing the product
     of the P^ and E.   values, as follows:
          Table 4.8   Normalization, by Category, of Violation Days
                      (Same format as Table 4\5]
O
CD
LU
CO
1
2
3
4
5
Normalized frequency
of violation days,Em
Normalized number of
days exceeding SO?
standards (E -365)
1968
.046
.077
.016
.000
.000
.139
50.7
1969
.046
.061
.052
.003
.000
.162
59.1
YEAR
1970
.046
.086
.037
.000
.000
.160
61.7
1971
.046
.080
.034
.000
.000
.160
58.4
1972
.043
.072
.035
.000
.000
.150
54.8
           The final step is now to calculate the number of days per "typical"  year
      that violations occurred.  Resultant frequencies are multiplied by the number
      of days per year.  Final results are shown in the bottom line of Table 4.8.
           Whereas the actual data indicate  significant variation in year-to-year
      exceedences of the S02 standards, the normalized values clearly indicate  that
      conditions worsened through 1970, after which a noticeable improvement of
      violation-days occurred.
-------
                                    76
     In summary, the recommended procedures offer a flexible approach  for
meteorologically adjusting both pollutant concentration levels  and specified
number of violation days.  The most difficult aspect of the entire metho-
dology will be the determination of appropriate meteorological  classes.
However, our recommended procedures, based on experience and empirical
techniques, can assist in achieving an effective classification system.
Once classification is accomplished, the "typical-year" meteorology approach
can be applied without elaborate computation.
-------
                                       77
                                   REFERENCES
 1.   Bay Area Air  Pollution  Control  District,  "A Study of Oxidant Concentration
     Trends in the BAAPCD (1962-1972)  Based  on Temperature and Inversion Criteria,"
     January 1973.
 2.   o.  L.  Blumenthal,  et al.5  "Three  Dimensional Pollutant Gradient Study--!972
     Program," Meteorology Research, Inc., Altadena, May 18, 1973.
 3-   S.  M.  Bruntz, W. S.  Cleveland,  T.  E.  Graedel and B. Kleiner, Science. Vol.
     186, 257 (1974).
 4.   	,  W.  S.  Cleveland, B. Keliner and  J. L. Warner, "The Dependence
     of Ambient Ozone on  Solar Radiation,  Wind, Temperature, and Mixing Height,"
     Preprint Volume, Amer.  Meteor.  Soc. Symposium  on Atmospheric Diffusion and
     Air Pollution, 1974.
 5.   California Air Resources  Board, (CARB)  California Air Quality  Data  (CAQD),
     Vol. IV, No.  1, pp.  3-5.
 6.   	, Vol.  IV,  No. 2, pp.  3-5.
 7.         , Vol.  IV,  No. 3, pp.  4-7.
 8.   	, "Changes  and Trends  in Carbon Monoxide Levels," Vol. VII, No. 3,
     pp. 2-4.
 9.   	, "Effect of Wind Speed on  Pollutant Concentration," Vol. VI, No. 4,
     pp. 11-12.
10.   	, "Historical Occurrence of Temperatures Aloft Some  5000 Feet Above
     Sea Level,"  Vol.  VII, No. 4, pp. 10-12.
11.   	, "Impact of Meteorological Conditions  on Particulate  Concentrations,"
     Vol. VII, No. 4,   pp. 13-14.
12.   	, "Oxidant Prediction," Vol. VI, No.  4, pp. 5-8.
13.   	5 "Weekday vs. Weekend Oxidant Concentrations-II," Vol. VI,  No.  4,
      pp. 10-11.
14.   A.  Davidson,  "An Objective Ozone Forecast System for July Through October in
     the Los  Angeles Basin," Los Angeles Air Pollution Control District, Technical
     Services  Division Report, 1974.
15-   A.  Davidson, et  al.,  "Air Quality Trends in Los Angeles  County," Southern
     California Air Pollution  Control District, Metropolitan Zone,  December 1975.
-------
                                        78
 16.
 19.
 20.
 21.
 22.
 23.
 24.
 25.
 26.
 27.
28.
 Enviro  Control,  Inc.,  "Study  in Trends in National Air Pollution and Related
 Effects,  Phase I," prepared under Contract No. 68-01-0471 to the Environmental
 Protection Agency, May 1973.
 K.  E. Gronskei,  E. Joranger and F. Gram, "Assessment of Air Quality in Oslo,
 Norway,"  Published as  Appendix D to the NATO/CCMS Air Pollution Document
 "Guidelines  to Assessment of  Air Quality (Revised) SOX, TSP, CO, HC, NOX,
 Oxidants," Norwegian Institute for Air Research, Kjeller, Norway, February
 1973.
 E.  L. Fisher, et al.,  "Forecasting Pollutant Dosages," Los Angeles County
 Air Pollution Control  District, prepared for the California Air Resources
 Board under  Standard Agreement ARB-1112, June 1973.
 P.  H. Guldberg,  et al., "Air  Quality Impact of the Energy Shortage," J. Appl.
 Meteor.,  16, January 1977.
 Y.  Horie, and J. Trijonis, "Analysis and Interpretation of Trends in Air
 Quality and  Population  Exposure in the Los Angeles Basin," TSC-PD-A152-1,
 Technology Service Corporation, Santa Monica, Calif., March 1977 (to U.S.
 Environmental Protection Agency, Research Triangle Park, N.C.).
 	»   and A.  S. Chaplin, "Population Exposure to Oxidants and Nitrogen
 Dioxide in Los Angeles, Volume III:   Long Term Trends, 1965-1974," U.S.
 EPA/OAQPS Publication  #EPA-450/3-77-004c, Technology Service Corporation,
 Santa Monica, CA,  January 1977.
 R.  M. Ingels, "Trends  in Atmospheric Concentration of Oxides of Nitrogen
 1957-1961,"  Los  Angeles County Air Pollution Control District, August 1962.
 T.  R. Karl and G. A.  DeMarrais, "Meteorological  Conditions Conducive to High
 Levels of Ozone," International Conference on Photochemical  Oxidant Pollution
 and Its Control, EPA-600/3-77-001a,  January 1977.
 R.  E. Kerr,  Jr., "Effects of Weather on Air Quality Measurements:   San Gabriel
 Valley Ozone - Meteorological  Model," Working Paper, June 1974.
 J.J.R. Kinney, "Predicting Oxidant Concentrations in the Los Angeles Basin,"
 California Air Resources Board, March 1974.
 J.  R. Kinosian and S.  Duckworth, "Oxidant Trends in the South Coast Air Basin
 1963-1972,"  California Air Resources Board,  April 1973.
 M.  H. McCutchan  and M.  J.  Schroeder, "Classification of Meteorological  Pat-
 terns in Southern California by Discriminant Analysis,"  Journal of Applied
 Meteorology, Vol. 12,  571  (1972).
 W. S. Meisel, "Empirical Techniques  for Analyzing Air Quality and Meteorological
 Data.  Part  I:  The Role of Empirical  Methods in Air Quality and Meteorological
Analyses," U.S.   Environmental  Protection Agency Publication  #EPA-600/4-76-029a,
 Technology Service Corporation, Santa Monica, CA,  July  1976.
-------
                                         79
29.   M.  K.  Liu,  et al.,  "Effects  of Atmospheric  Parameters  on  the  Concentration
     of Photochemical Air Pollutants,"  J.  Appl.  Meteor.,  15 (August  1976).
30.   0.  T.  Peterson and  E.  C.  Flowers,  "Interactions  between Air Pollution and
     Solar Radiation," Solar Energy, Vol.  19,  1977.
31.   J.  H.  Price,  "A Study of Factors Associated with High  Urban Ozone Concentrations
     in  Texas,"  Texas Air Control  Board,  Presented  at the Air  Pollution Control
     Association Conference on Ozone/Oxidants, Dallas,  Texas,  March  1976.
32.  E.  A.  Schuck, J. N.  Pitts, Jr., and  J.K.S.  Wan,  Air  &  Wat. Pollut. Int. J.,
     Vol.  105  689 (1966).
33.   s.  M.  Sidik,  and H.  E. Neustadter, "Meteorological Adjustment of Yearly Mean
     Values for Air Pollutant Concentration  Comparisons," NASA Technical Note
     TN  D-82535  National  Aeronautics and  Space Administration, Washington, D.C.,
     July 1976,
34.  J.  A.  Sonquist, E.  L.  Baker,  J. N. Morgan,  Searching for  Structure, University
     of Michigan,  1971.
35    G.  C.  Tiao, 6.E.P.  Box, and  W.  J.  Hamming,  "A  Statistical Analysis of the Los
     Angeles Ambient Carbon Monoxide Data 1955-1972," J.  Air Poll. Control Assoc., Vol.
     25  (Nov.  1975).
36.   	9 M. S. Phadke, and G.E.P.  Box, "Some  Empirical Models for the Los
     Angeles Photochemical  Smog Data,"  J.  Air Poll. Control Assoc., Vol. 26 (May 1976).
37v  J.  C.  Trijonis,"An  Economic  Air Pollution Control  Model - Application:  Photo-
     chemical  Smog in Los Angeles County  in  1975" Ph.D. Thesis, California Institute
     of Technology, May  1972.
38.  	, et al., "Emissions and Air Quality  Trends  in the  South  Coast Air Basin,"
     EQL Memo No.  16, Environmental  Quality  Laboratory, California Institute of
     Technology, Pasadena, CA , January 1976.
39.  	, "Empirical Studies of Nitrogen Dioxide Air Quality  and NOp/Precursor
     Relationships," Report in preparation to the U.S.  Environmental Protection
     Agency under Contract No. TSC 68-02-2299.
40.  U.S. Environmental  Protection Agency, "Monitoring and  Air Quality Trends
     Report, 1973," EPA-450/1-74-007, October 1974.
41.  	,  "Monitoring and  Air  Quality  Trends Report,  1974," EPA-450/1-76-001,
     February  1976.
42.  	,  "National Air Quality  and Emission Trends Report, 1975," EPA-450/1-76-002,
     November  1976.
-------
                                         80
43.  H. van Dop, and S. Kruizinga,  "The Decrease of Sulphur Dioxide  Concentrations
     near Rotterdam and Their Relation to Some Meteorological  Parameters  During
     Thirteen Consecutive Winters (1961-1974)," Atmospheric Environment,  Vol.  10,
     pg. 1 (1976),
44.  M. D. Zeldin and D. Thomas, "Ozone Trends in the Eastern  Los  Angeles Basin
     Corrected for Meteorological Variations," presented at the International
     Conference on Environmental Sensing and Assessment, Las Vegas,  Nevada,  1975.
45.  	, A. Davidson, M. F. Bronelle, and J. E. Dickinson, "A  Meteorological
     Assessment of Ozone and Sulfate Concentrations in Southern California,"
     Southern California Air Pollution Control District, Report #E&P 76-1,
     August 1976.
46.  	, A. M. Winer, D. Grosjean, and J. N. Pitts, Jr., "Weather Adjusted
     Oxidant Trends in the South Coast Air Basin," Statewide Air Pollution Research
     Center, Riverside, Calif., May 1977 (to be submitted to Atmospheric Environment),
47   California Air Resources Board, "Sulfate Predictions in South Coast Air Basin,"
     Vol. VIII, No. 2, pp. 4-10.
48.  L. Breiman, Statistics:  With  a View Toward Applications. Houghton-Mifflin,
     1973.       	  	
49.  H.J. Payne  and W. S. Meisel,  "An Algorithm for Constructing  Optimal  Binary
     Decision Trees," IEEE Trans, on Computers, Vol.  C-26,  No.  9,  September  1977,
     pp.905-916.
-------
             81
          APPENDIX A
MULTIPLE-YEAR LINEAR REGRESSION
     Suppose we wish to determine a linear equation relating a meteorological
variable to a pollutant concentration.  Suppose further that we had several
years of data.  Simply regressing the meteorological variable against the
pollution concentration for all the data would not be appropriate.   The co-
efficients of the meteorological variables might be misleading if emissions
change from year to year.  Figure A-l exemplifies this problem.  Suppose in
two consecutive years, the pollutant level was predicted well by a meteoro-
logical variable x.  Suppose the best-fit equations were
                                 y = ax + b-,
and
                                 y = a'x + bg   »
respectively, where a
-------
                                                 82
 (pollutant
concentration)
                             - Data from year 1
                            X - Data from year 2
                  Best
                  for
:-fit
                                                                            y = ex + d
                                      Best-fit line to
                                      both years
                                                                            y = a'x + b.
                                                                            y = ax
                                                                            Best-fit line
                                                                            for year 1
                                                                    x (met. variable)
                     Figure A-l    An Example  of  the Difference in Fitting Each Year
                                  Separately  Rather than Fitting All the Data.
-------
                           87
Figure 1-2   Data from Region R-,  (i.e., for which  x-, _<  a1
             Plotted for Another  Meteorological Variable,
-------
                                 88
                       ^11"   X2 Jl k  £and xl <  al 
R-J2 is a final region.  Continue this process on any remaining  ill-defined
(with respect to pollutant) regions such as R,,.
     The procedure continues until
     (1)  all regions are relatively well-defined with respect  to pollutant
levels, i.e., the standard deviation in pollutant levels is low or there
are very few high- pollution values in the region; or
     (2)  there are too few samples: in an uncertain region to divide it
further and retain statistical  significance.   (Formally, a statistical test
could be applied for statistical significance of a proposed split [48].)
       Suppose Rn is split once more by parameter x3 into
Rm:  xsl c
R112:  X3 > c
                                                b>
                                           X2 !b>  x]  1
and these are all final  classes. -
     The result is a decision tree defining  the  meteorological categories
as illustrated in Figure B-3.
Extensions
     We have used one variable at  each  step.  As  in the example used in
Section 4.1.2 of the text, we could use two  (or more) variables in any step.
The ideas are easy to extend,  but  the software (e.g.,  University of Michigan's
AID and TSC's ADSS program)  required is less standard.
-------
                               89
Figure B-3   Decision Tree Approach Toward Defining Meteorological
                Classes Based on Meteorological Parameters X-,, X£> and
-------
                                   TECHNICAL REPORT DATA
                            (Please read Instructions on the reverse before completing)
 1, REPORT NO.
     EPA-450/3-78-024
                              2.
                                                            3. RECIPIENT'S ACCESSION-NO.
 4. TITLE AND SUBTITLE
 Use  of Meteorological Data  in  Air Quality Trend
 Analysis
5. REPORT DATE
   May 1978
S. PERFORMING ORGANIZATION CODE
 7. AUTHOR(S)
  Melvin  D.  Zeldin and William  S.  Meisel
                                                           8. PERFORMING ORGANIZATION REPORT NO.
9. PERFORMING ORGANIZATION NAME AND ADDRESS
  Technology Service Corporation
  2811  Wilshire Blvd.
  Santa Monica, California  90403
                                                            10. PROGRAM ELEMENT NO.
11. CONTRACT/GRANT NO.
 '   68-02-2318
 12. SPONSORING AGENCY NAME AND ADDRESS
                                                            13. TYPE OF REPORT AND PERIOD COVERED
 U.S.  Environmental Protection  Agency
 Office  of Air and Waste Management
 Office  of Air Quality Planning and Standards
 Research  Triangle Park, North  Carolina  90403
14. SPONSORING AGENCY CODE
    200/04
 15. SUPPLEMENTARY NOTES
 16. ABSTRACT
      Meteorological influences  upon air quality trend  analyses can complicate"
 the evaluation of air pollution control efforts.   It is  important to isolate
 the meteorological effects  in order to determine air quality trends as a function
 of emissions.   This report  surveys  existing methods for  meteorologically adjusting
 air quality trends, including a review of known relationships between specific  air
 pollutants  and various meteorological  parameters,  and  presents a recommended
 methodology to normalize air quality trends with respect to "typical" year meteor-
 ology.   Procedures involve  the  determination of mutually exclusive meteorological
 classes  and the treatment of air quality variables  stratified according to the
 established meteorological  classes.  A hypothetical example is included to illustrate
 both the mathematical processes and the interpretation of the methodological results.
17.
                                KEY WORDS AND DOCUMENT ANALYSIS
                  DESCRIPTORS
                                              b.lDENTIFIERS/OPEN ENDED TERMS
             c.  COSATI Field/Group
 Trend Analysis
 Air Pollution
 Meteorology
13, DISTRIBUTION STATEMENT
 Unlimited
                                              19. SECURITY CLASS (ThisReport)'
                                                   Unclassified
              21. NO. OF PAGES
                    96
                                              20. SECURITY CLASS (Thispage)
                                                   Unclassified
                                                                         22. PRICE
EPA Form 2220-1 (9-73)
-------