United States
 Environmental Protection
 Agency
 Environmental Sciences Research
 Laboratory
 Research Triangle Park NC 27711
 Research and Development
 EPA-600/S2-80-195  Apr. 1981
 Project  Summary
 Modified  Factor
 Analysis of Selected
 RAPS  Aerosol  Data
 Stephanie C. Davis
  Target transformation factor anal-
 ysis (TTFA) has been applied to a
 subset of the aerosol-composition
 data acquired during the Regional Air
 Pollution Study (RAPS) for St. Louis,
 Missouri. The RAPS program collected
 a large number of samples with 10
 continuously operated dichotomous
 samplers from March 1975 to March
 1977. The purpose  of the present
 study was to evaluate the capability of
 TTFA to resolve sources of airborne
 particulate  matter in a large set of
 ambient-aerosol samples. Only the
 samples from July and August 1976,
 both fine and coarse  fractions, were
 examined in this study. To determine
 the most appropriate way to apply
 TTFA, two separate sets of data were
 analyzed—all the samples collected
.during the  two months at a single
 station and  all the samples collected
 during a single week  at all 10 RAPS
 stations. A more detailed source res-
 olution was obtained when the  data
 were further divided into subsets
 corresponding to the fine- and coarse-
 particle fractions. Because of the large
 number of different sources in the St.
 Louis area, superior results were ob-
 tained from the examination of the
 variation in aerosol composition with
 time at a single location rather than
 the spatial variation over multiple sites
 during a shorter time period.
  This Project Summary was devel-
 oped by EPA's Environmental Sciences
 Research Laboratory, Research Tri-
angle Park. NC. to  announce  key
findings of the research project that is
fully documented in a separate report
of the same title (see Project Report
ordering information at back).

Introduction
  In 1971, the first National Ambient
Air Quality Standards were established,
setting  limits on permissible levels of
total suspended  particulate   matter.
Because these regulations concentrated
only on the absolute amounts of sus-
pended particulate matter, control efforts
were directed primarily toward the
removal of the larger, more massive
particles that comprise most of the total
mass of particle emissions. There is
growing evidence, however, that the
smaller particles, although represent-
ing a smaller fraction of the total mass,
present the greater threat to public
health and welfare. Because of their
light-scattering properties, the smaller
particles also contribute to a larger
extent to the reduction in visibility. As a
result of the growing concern over the
possible health effects of smaller diam-
eter particulate matter in the atmos-
phere, there has been an increase in the
development and  use of aerosol sam-
pling equipment that measures the
concentrations of particles in the in-
halable and fine particle-size fractions.
New regulations, now under review by
the Environmental Protection Agency,
may include standards not only on the
mass of total suspended particles but
also on the levels of inhalable and fine
particles.

-------
  The effectiveness of these regula-
tions would be enhanced greatly if the
sources and relative emission rates of
these particles were known. Determin-
ing the sources of airborne paniculate
matter is a very difficult problem be-
cause of the complexity of urban eco-
systems. The problem is aggravated by
the presence of fugitive sources and the
lack of understanding of the modes of
formation and transport of airborne par-
ticles. The advent of receptor meth-
odology and recent developments in the
areas of trace-element analysis and
multivariate statistical techniques now
permit a much more detailed analysis of
ambient aerosol samples. By providing
detailed information on the sources of
fine and inhalable particles, such tech-
niques could become a major part of any
strategy for controlling airborne panic-
ulate matter.

The Study
  One approach to the problem  of
determining the contribution of each
source to the total level of suspended
particulate matter  is the method of
source-emissions inventories and at-
mospheric dispersion models. Using
estimated emission rates for all known
sources along with a model for their
behavior following emission, this method
attempts to determine the contribution
of each  source to the  total aerosol
concentration. The results of source-
emissions inventories are too often
unsatisfactory because of a lack of
accurate  source-emission profiles and
the inability to include fugitive sources.
The development of improved aerosol
sampling equipment permittedthe
development of receptor methods for
the determination of source contribu-
tions. These methods employ statistical
analysis  techniques to  calculate the
ambient aerosol samples. Receptor
methods  have primarily taken the form
of the chemical element balance (CEB)
method. In this method, it is assumed
that the number and composition of the
sources are known. The observed ele-
mental concentrations in a set of am-
bient aerosol samples are fit by means
of a regression analysis. Early applica-
tions  of the chemical element balance
were to ambient aerosol samples from
Pasadena, California; Ghent, Belgium;
Heidelberg, Germany; and Chicago,
Illinois.
  These early applications of the chem-
ical element balance method  suffered
from a lack of reliable data for the source
emission profiles needed for the anal-
ysis. Recent improvements in the quality
of source profile determinations have
produced good agreement between the
calculated and measured elemental
concentrations for airborne particulate
matter  in Washington, DC, and for
dichotomous sampler data in St. Louis,
Missouri, and the Great Smoky Moun-
tains. The scope of the chemical element
balance method was further expanded
in an extensive study of the Portland,
Oregon, aerosol as part of the PACS
program. This study included compo-
nents such as vegetative and grass
burning and chemical species such as
volatile and nonvolatile carbon, nitrate,
and sulfate that had not previously been
used. By including the larger number of
source components, very good agree-
ment with the measured concentrations
was achieved for the 23 elements and
chemical species included in the anal-
ysis.
  Chemical element  balance methods
have a  major drawback in that they
require an a priori knowledge of both the
number and composition of the sus-
pected  source emissions. The only
quantities calculated are the contribu-
tion of each of the presumed sources to
each of the measured samples. Though
extensive efforts by the groups in Mary-
land and Oregon have produced much
better determinations of the elemental
profiles  of a number of sources, these
determinations are time consuming,
expensive, and, in many cases, inappli-
cable to the resolution of aerosol sources
in different locations. In addition, source
determinations  of in-stack material
cannot account for chemical and phys-
ical transformations of volatile species
following emission.
  Multivariate statistical techniques,
including factor and cluster analyses,
represent a different approach to aero-
sol source resolution. A major advantage
of these methods is that they require no
prior assumption about the nature of the
system under study. In an early applica-
tion of multivariate statistical tech-
niques to elemental  source resolution,
factor and cluster analyses were used to
identify sources of airborne particulate
matter in the Boston urban aerosol. In
this  study, the data were first trans-
formed  into a  standardized form that
removed the effects of using different
metrics in describing the data. The
identification of sources was made by
associating the largest calculated factor
loading  with  the marker element for
each source. In a similar fashion, factor
and  cluster analyses were used to
characterize the aerosol of Tucson,
Arizona.  Factor analysis  was used to
identify sources of paniculate matter in
St. Louis, Missouri. Factor analysis of a
matrix of standardized variables re-
solves only the variance in the system,
not the actual data values. In addition,
the calculated factor scores provide only
a measure of the relative importance of
each source rather than the actual
amount of contributed material.
  Recently, a new form of factor anal-
ysis called target transformation factor
analysis (TTFA) has been employed.
This technique  permits  an analysis
similar in nature to the chemical ele-
ment balance method, but without the
presumptions as  to  the number or
nature of the sources in the system.
Unlike thenjrior applications of factor
analysis, TTFA allows the investigator to
use his understanding of the system to
determine the best representation of
the actual source concentration profiles.
TTFA also permits the direct calculation
of the contribution of each source to
each sample.
  Target transformation factor analysis
has  been applied to  a subset  of the
aerosol composition data acquired during
the Regional Air Pollution Study (RAPS)
for St. Louis, Missouri.  In the RAPS
program,  automated dichotomous sam-
plers were operated over a two-year
period at 10 sites in the St.  Louis
metropolitan area. Ambient aerosol(
samples were collected in fine, 2.4 |/m,'
and  coarse, 2.4 to 20 fjm, fractions.
Samples were analyzed at the Lawrence
Berkeley  Laboratory for total mass by
beta-gauge measurements and for 27
elements by x-ray fluorescence.
  To determine the most appropriate
way to apply target  transformation
factor analysis to a large set of aerosol
samples,  two subsets of this data were
analyzed—all the samples collected
during July and August at one station
and  the samples collected during a
single week at all 10 RAPS stations. The
data  from station 112 were selected for
the first subset and the data from the
week beginning July 31, 1976, for the
second. The relative completeness of
the data in these two  subsets was the
primary criterion for their  selection.
  Station 112 was  located near Wash-
ington University, west of downtown St.
Louis. During the 62 days of July and
August, filters were changed at 12-hour
intervals, producing a total of 124

-------
samples in each of the fine and coarse
fractions. Data were missing for 48 of
the samples, leaving a total of 200. Of
the 27 elements determined for each
sample, a majority of the determinations
of 10 elements were below the detec-
tion limits. Since a  complete and accu-
rate data set is required to perform a
factor analysis, these 10 elements were
eliminated  from the analysis. Table 1
lists the full 27 elements and the number
of values below detection  limits. Note
that the element arsenic was excluded
because of the large number of deter-
minations below detection limits. Arsenic
determinations by  x-ray fluorescence
are often unreliable  because  of an
interference between the arsenic K x-
ray and the lead L x-ray. A  neutron
activation analysis of these samples
should produce better arsenic deter-
minations. Reliable data for arsenic may
be important to the differentiation of
coal fly ash and crustal material, two
materials with very  similar source
profiles. The remaining 200 samples
were  analyzed in two separate  parts.
Two different divisions  in the data were
made to determine which division per-

Table 1.     RAPS Station 112. July
           and August 1976.

            Number of Values Below
  Element       Detection Limits
Al
Si
P*
S
Cl
K
Ca
Ti
V*
Cr*
Mn
Fe
Ni
Cu
Zn
Ga*
As*
Se
Br
Rb*
Sr
Cd*
Sn*
Sb*
Ba
Hg*
Pb
'Elements
15
2
179
0
35
0
0
39
162
151
29
0
95
39
0
194
196
113
1
130
73
161
160
180
131
196
0
excluded from the analyses.
mitted the extraction of the most in-
formation. Separate analyses were then
performed on each group. First, the data
were divided into groups corresponding
approximately to the months of July and
August. In the second analysis, the data
were divided into groups corresponding
to the fine and coarse fractions of both
months.
  During the week beginning July 31,
1976, filters were changed at 12-hour
intervals at five of the stations and at 6-
hour intervals at four of the stations. A
total of 182 samples  were collected in
each of the fine and coarse fractions.
Data were missing for 43 samples, and
another 50 samples had a  majority of
determinations below the detection
limits; these 93 samples were excluded
from the analysis. Of the 27 elements, a
majority of the determinations of 12 had
values below detection limits. These 12
elements were excluded. Table 2 lists
the number of values below detection
limits in the data for one week. Again,
two separate divisions in the data were
made and each individually analyzed.
The data were first divided into two
groups composed of days Saturday,

Table 2.    One Week Beginning July
           31, 1976.

            Number of Values Below
    Element     Detection Limits
Al
Si
P*
S
Cl
K
Ti
Ca
V*
Cr*
Mn
Fe
Ni*
Cu
Zn
Ga*
As*
Se
Br
Rb*
Sr*
Cd*
Sn*
Sb*
Ba
Hg*
Pb
135
27
265
16
115
0
170
1
318
288
149
1
267
120
18
321
316
201
0
294
224
273
258
309
291
320
3
                                       'Elements excluded from the analyses.
Sunday, and Monday (SSM), and of days
Tuesday,  Wednesday, Thursday, and
Friday (TWTF). The first group, with 115
samples, corresponds to days 213, 214,
and 215,  and the second group, with
156 samples, to days 216 through 219.
The ana lysis was then repeated with the
data divided into groups corresponding
to the fine and coarse fractions with 139
and 132 samples, respectively.
  The 27 elements determined for each
sample account for only a small fraction
of the total sample mass. The remaining
mass consists primarily of oxygen,
nitrogen, and carbon.  Determining the
source of particulate carbon is of grow-
ing concern because  of the potential
mutagenicity associated with carbo-
naceous particles. Measurements have
shown that both soot and organic carbon
can account for a sizeable fraction of the
total mass of airborne particulate matter.
Though no measurements of carbon are
included in the RAPS data, that portion
of the sample mass must be accounted
for  by the other sources. This can often
lead to distortions  in the scaling factors
produced  by the  multiple regression
analysis.  In  order to produce the best
possible source resolutions, it is vital to
have both accurate TSP measurements
and determinations for as many elements
as possible.
  For use in the target transformation
portion of the analysis as test vectors, a
set  of 17 potential source profiles was
assembled. These include nine different
crustal factors, two determinations of
coal fly ash, three automobile factors,
and three industrial sources. The 17 test
vectors are listed in Table 3.

Conclusions
  Using these data, target transforma-
tion analysis was  applied, and several
conclusions were made; they are as
follows:
  • Target transformation factor anal-
     ysis (TTFA) can be used to resolve
     sources of airborne particulate
     matter from large sets of ambient
     aerosol samples. With this method,
     the contribution of each source to
     each sample can be calculated. In
     addition, TTFA requires no a priori
     assumptions  as to the number or
     nature of the sources in the system.
     This method could be an important
     tool in the achievement of man-
     dated limits on suspended partic-
     ulate  matter in the urban areas of
    this  country  by identifying those
     sources where imposition of control

-------
Table 3.    Source Profiles Used as Test Vectors, mg/g
                                                        Soils
Element
Al
Si
P
S
Cl
K
Ca
Ti
V
Cr
Mn
Fe
Ni
Cu
Zn
Ga
As
Se
Br
Rb
Sr
Co-
Sr)
Sb
Ba
Hg
Pb

Element
Al
Si
P
S
Cl
K
Ca
Ti
V
Cr
Mn
Fe
Ni
Cu
Zn
Ga
As
Se
Br
Rb
Sr
Cd
Sn
Sb
Ba
H9
Pb
Shale*
80
273
0.7
2.4
0.18
26.6
22.1
4.6
0.13
0.09
0.85
47.2
0.068
0.045
0.095
0.02
0.013
0.0006
0.004
0.14
0.3
0.0003
0.006
0.0015
0.58
0.0004
0.02
Fly Ash
NBS
127
210


0.042
16.1
47
7.4
0.24
0.13
0.5
62
0.16

0.2

0.058
0.01
0.012
0.13
1.7


0.007
2.7

0.075
Sandstone
25
368
0.17
0.24
0.01
10.7
39.1
1.5
0.02
0.035
10
9.8
0.002
0.001
0.016
0.02
0.001
0.00005
0.001
0.06
0.002



0.01

0.007
Carbonate
4.2
24
0.4
1.2
0.15
2.7
302
0.4
0.02
0.011
1.1
3.8
0.02
0.004
0.02
0.004
0.001
0.00008
0.0062
0.003
0.61


0.0002
0.01

0.009
IAEA
82
332
1


19
22
5
0.15
.03
15
45
0.013
0.077
0.4
0.02
0.09
0.001
0.005
0.14
0.33
0.0015

0.56
0.6
0.001
0.13
Soil 1
82
200



15
15
4
0.06

1.1
32
0.04
0.08










0.6

0.2
Soil 2
69
260



25
10
3
0.04

0.6
20
0.02
0.02










1

0.06
Motor Vehicles
Gladney
133
208



13.6
10.6
8.1
0.26
0.19
0.39
123
0.0006

0.23
0.078
0.16
0.025
0.006
0.11
0.81


0.006
1.14

0.056
Auto 1*




0.1

0.054




0.05

0.005
0.01


0.0005
0.38


0.002

0.0008
0.013

1
Auto 2


0.9
6
19





0.6
1


3



37





0.9

148
Auto 3




68






4


1.4
•


79







400
Cement
24
107



5.3
460
1.4



11















Soil3
71
330
0.8
0.85
0.1
13.6
13.7
4.6
0.1
0.2
0.85
38
0.04
0.02
0.05
0.03
0.005
0.00001
0.005
0.1
0.3



0.5

0.01
Industrial
Paint







422


4
204















Rock
82
282
1.1
0.26
0.13
20.9
41.5
5.7
0.14
0.1
0.95
56.3
0.075
0.055
0.07
.015
0.002
0.00005
0.003
0.09
0.4
0.002
0.002
0.0002
0.43
0.0001
0.013

Steel
11
7
1



12



31
570















Soil 4
60
350






0.14
0.06
0.99
37.8


0.08

0.005
0.0006





0.0006
































* Normalized to Pb - 1.0.

-------
     technology would be the most cost-
     effective approach.
   • TTFA can resolve strong sources
     that are present in only a few of the
     samples, and a source-profile vec-
     tor for the source can be predicted
     from the data. Other methods for
     performing quantitative source
     resolutions  cannot  account for
     such sources unless their presence
     is initially known and a measure-
     ment of the source composition  is
     available.
   • The presence  of several  samples
     with one very strong source was
     found to conceal the presence of
     other, weaker, factors in the full
     data set. Removal of the affected
     samples permitted the resolution
     of additional sources.
   • Source profiles of coal-burning
     power plant emissions may be
     identified by TTFA. Distinguishing
     crustal material from fly ash would
     be  more certain if sources were
     available.
   • TTFA can resolve sources with
     dependence on the same element
     such  as lead in particles from
     motor vehicle exhaust and lead
     and/or zinc smelters.
   • The analysis of the July and August
     data from one RAPS station pro-
     duced a reasonable source resolu-
     tion. The data from one week at all
     RAPS stations seem to be composed
     from a number of sources that are
     significantly different at each sta-
     tion. The large number of sources
     contributing to the 10 RAPS stations
     precluded the resolution of any but
     the most important sources.
   • By dividing both data sets intofine-
     and coarse-fraction subsets, addi-
     tional sources were  resolved, in-
     dicating that the sources contribute
     primarily to either the fine or coarse
     fractions, but not both.

 Recommendations
   With some additional development,
 target transformation factor  analysis
 could be a valuable tool in the efforts of
 local authorities to devise cost-effective
 strategies for the control of airborne
 particulate matter. To ensure the credi-
 bility of any sort of statistical analysis, it
 is important that there be some indication
 Df the reliability of the calculated results.
 n the present study, the actual nature of
 •he sources was not known, so the
^accuracy of the quantitative results
P:ould not be determined. Testing, under
 more controlled conditions, is essential
 to ensure that the calculated results are
 both accurate and reliable. One way to
 evaluate the accuracy of this method
 would be to analyze computer-generated
 data sets containing randomly generated
 errors that approximate  typical ana-
 lytical uncertainties and source-com-
 position variability. Comparison of the
 starting data with the calculated results
 would give an indication of the reliability
 of the analysis. Another way to verify
 the quantitative  results would be to
 analyze samples prepared in the lab
 from  known amounts of well-charac-
 terized materials. Comparison of the
 calculated results with the known  con-
 centration of each sample would deter-
 mine the quality of the source resolution.
 A third, and more complex, way to verify
 the quantitative  results would be to
 compare the results of the factor anal-
 ysis of real samples with results obtained
 by x-ray diffraction and/or microscopic
 examinations of the same samples.
  Further development of target trans-
 formation factor analysis would include
 error  models to give a better indication
 of the reliability of the calculated source
 profiles and to provide limits on the
 uncertainties of the calculated contribu-
 tions  of each factor. In addition, more
 objective and explicit criteria for deter-
 mining the number of correct factors to
 be retained are needed. The analyses of
 computer-generated  data  sets could
 shed additional light on the still nebulous
 process of determining the number of
 factors.
  Performing a source resolution of
 ambient aerosol samples requires a
 complete and accurate data set. If all the
 sources that contribute to a system are
 to be resolved,  it  is important that the
 data set include as many  elements as
 possible. In the  present  study, the
 measured elements accounted for less
 than half of the total measured mass of
 each  sample. The inclusion  of other
 elements such as carbon and other
 gaseous species  such as SO2 would
 have greatly facilitated the analyses. In
 addition, the exclusion of the element
 arsenic because of the large number of
 determinations below the detection
 limits inhibited the resolution of a factor
for coal fly ash. Future air quality studies
should attempt to produce accurate data
for both  the total  level of suspended
particulate matter and as many elements
and chemical species as practical.

-------
This Project Summary was authored by Stephanie C. Davis of WAPORA, Inc..
  Cincinnati. OH 45233.
T. G. Dzubay and C. W. Lewis are the EPA Project Officers (see below).
The complete report,  entitled "Modified Factor Analysis of Selected RAPS
 Aerosol Data," was authored by Daniel J. Alpert and Philip K.  Hopke of the
 Ihstitute for Environmental Studies  and Nuclear  Engineering  Program.
  University of Illinois. Urbane. IL 61801.
The above report (Order No. PB 81-120 784; Cost: $9.50. subject to change) will
 be available only from:
        National Technical Information Service
        5285 Port Royal Road
        Springfield. VA 22161
        Telephone: 703-487-4650
The EPA Project Officers can be contacted at:
        Environmental Sciences Research Laboratory
        U.S. Environmental Protection Agency
        Research Triangle Park. NC 27711
                                                                                0 US. OOVERNMENTPWNTINOOFFICE 1W1-757-01Z/7086

-------
United States
Environmental Protection
Agency
Center for Environmental Research
Information
Cincinnati OH 45268
Postage and
Fees Paid
Environmental
Protection
Agency
EPA 335
Official Business
Penalty for Private Use $300
      PS   0000329
      U S tlNVIR PROTECTION  AGENCX
      REGION 5  LIBKAKSf
      230 S  DEARBORN STRtET
      CHICAGO  JL  60604

-------