910-R-03-004
Receptor Model Analyses of Aerosol PM2.5 Data from the




      IMPROVE Monitor at Denali National Park
               EPA Technical Report 910-R-03-004

-------
Receptor Model Analyses of Aerosol PM2.5 Data from the IMPROVE Monitor at Denali




National Park









EPA Technical Report 910-R-03-004




Authors: Robert Kotchenruther and Rob Wilson, U.S. Environmental Protection Agency




Region-10, Office of Environmental Assessment, 1200 Sixth Ave., Seattle, WA 98101.









1. Summary




       The UNMIX and PMF receptor models were used to identify sources of PM2.5 at Denali




National Park.  The models used fourteen years of IMPROVE data, from 1988 to 2002.  Both




models indicated four sources and gave similar results, however, the model performance in both




cases was poor due to low filter mass loadings, which lead to high levels of uncertainty in the




chemical analyses and relatively poor fitting statistics in the models.  Despite the poor fitting




statistics in the models, three of four sources were identified.  These sources were identified as




biomass burning, soil dust, and sulfate and nitrate haze. PMF was found to be better than




UNMIX in isolating source signatures and gave results with higher confidence. The fourth and




smallest source could not be reliably quantified by UNMIX, but was quantified by PMF. This




source remains unidentified.









2. Receptor Models




       The two receptor models that were available for this analysis were the U.S.




Environmental Protection Agency's (EPA) UNMIX model and the Positive Matrix Factorization




(PMF) model.  We present below the results of both model analyses.  It is advantageous to

-------
compare and contrast the results of both models because their approach to source apportionment




is derived from very different mathematical methods, and therefore, when taken together offer




solutions with some measure of independence from the mathematics involved. We did not use




the EPA's Chemical Mass Balance receptor model due to the lack of available source profile




data. We used UNMIX and PMF because they do not require source data and provide a




technically valid approach when the number of samples is large.









2.1. The UNMIX Receptor Model




       Version 2.3 of the UNMIX (EPA UNMIX 2.3 User Guide, 2002) multivariate receptor




model was used in this analysis.  Information about, and copies of, the software can be obtained




from Gary Norris at the EPA (Norris.Gary@epa.gov). UNMIX uses geometric features in the




data called "edges" to constrain the model and determine source apportionment. These "edges"




are formed when a particular source contribution falls to zero at the receptor. The edges are, in




fact, boundaries in the speciated data that are formed when it is plotted in an n-dimensional




'source space', where n is the number of sources.




       As with any model, UNMIX has strengths and weaknesses. The strengths of UNMIX are




that it does not require prior knowledge of the number or composition of sources and can




independently determine the number of sources.  The weaknesses of UNMIX are that it has




difficulty identifying ubiquitous sources (where the contribution rarely falls to zero), very




infrequent sources, and relatively small sources  (contributing less than about 10% to the total




mass).  Additionally, the UNMIX solution is highly dependant on the species that are selected




and UNMIX assumes that source compositions don't change over time.  One feature of UNMIX

-------
that is a drawback in this instance, but could be a benefit elsewhere, is that it does not use or




require information about measurement uncertainty.









2.2. The Positive Matrix Factorization (PMF) Receptor Model




       Positive matrix factorization (PMF) is a form of principal component analysis developed




by Pentti Paatero at the University of Helsinki (Paatero and Tapper, 1994) and uses a weighted




least squares approach to determine source profiles (User's Guide for Positive Matrix




Factorization, 2000).  PMF has many of the same strengths and weaknesses as UNMIX, with




several important differences. Like UNMIX, PMF does not require prior knowledge of source




compositions.  However, several advantages of PMF in contrast to UNMIX are that it makes use




of the measurement uncertainty to weight data, does not require source contributions to




occasionally fall to zero, and is better able to identify small sources.  On the other hand, one




drawback of using PMF is that one must a priori declare the number of sources prior to running




the model. As with UNMIX, PMF also assumes that the source compositions don't change over




time.









3. IMPROVE Data and Data Processing




       IMPROVE aerosol data from Denali National Park were downloaded from the




IMPROVE web site (http://vista.cira.colostate.edu/improve/).  The dataset contained 1481




aerosol samples spanning the dates 3/2/88 to 5/26/02.  Each sample consisted of chemical and




elemental mass analyses for approximately 40 species, PM2.5 mass, PMio mass, and numerous




derived quantities. Also listed with each species was the associated measurement uncertainty




and minimum detection limit (MDL).

-------
       Prior to modeling, we analyzed the Denali data looking for the predominant mass




constituents. It is widely assumed that the largest sources impacting Denali National Park are




biomass burning from summer wildfires, soil dust, and sulfate and nitrate haze (most often




associated with emissions from industrial sources).  A rough approximation of these source




categories can be calculated directly from the IMPROVE dataset, by summing elemental ('EC' in




the IMPROVE dataset) and organic carbon ('OMC') for biomass burning, by taking 'SOIL' for




soil dust, and by  summing ammonium nitrate ('NH4NO3') and ammonium sulfate ('(NFL^SOV)




for nitrate and sulfate haze.  Figures la,  Ib, and Ic, show the seasonal distributions of




'EC'+'OMC', 'SOIL', and 'NH^Os'+'CNH^SOV, respectively. Figure 2 shows the monthly




average mass distribution of these categories and the distribution of the remaining uncategorized




mass. This analysis can only be considered a very rough source apportionment because it




assumes that these three source categories have no other significant constituents  (e.g., assumes




NH4NO3 and (NH4)2SO4 are the only constituents of haze) and that the above constituents can




solely be attributed to these three sources.  While this rough source apportionment supports




assumptions about the relative importance of sources impacting Denali National  Park, a more




refined source attribution using receptor models is needed to affirm these results and identify any




unexpected sources.




       In this receptor modeling analysis, only measurements associated with aerosol fine mass




(PM2.5) were considered for modeling and are listed in Table 1.




       Measurements in the IMPROVE dataset that were reported as less than the minimum




detection limit (MDL) were replaced with one half the reported MDL. The uncertainty of the




replaced data was also set to one half the MDL, unless the reported analytical uncertainty was




larger.  Efforts were made to exclude data with large uncertainty relative to the measurement.

-------
While this is not particularly important for PMF modeling, it was important for UNMIX because




UNMIX does not make use of measurement uncertainty.  First, data were excluded if the




uncertainty was more than twice the measured value. After this, the ratio of uncertainty (a) to




measurement (x), a/x, was calculated for each measurement and the average o/x computed for




each species.  Species were excluded from further analyses if the average a/x exceeded 0.7.




Excluding these species had two effects, it eliminated species with high relative measurement




error (important for UNMIX), and it excluded species where most of the measurements were




replaced with half the MDL (important for both models).  Table 1 lists the average a/x for each




species and those species chosen for receptor modeling analyses.




      Lastly, aerosol samples were excluded if any of the remaining species had missing




values.  The resulting dataset retained  1194 of the initial 1481 aerosol samples and 20 of the




initial 39 fine mass species. Figure 3a shows the 1194 fine mass measurements plotted




sequentially, and Figure 3b shows them plotted with the years overlapping to show the seasonal




cycle. The data processing outlined above is similar to that described by Lee et al. (1999) and is




consistent with guidelines established in the users' manuals of both PMF and UNMIX.









4. Model Analysis and Results




4.1. UNMIX Analysis




      The matrix of 20 species and 1194 samples was input into UNMIX for source-receptor




analysis. The model was set to consider fine mass measurements as the total mass, and results




were normalized to the fine mass measurements.  Data weighting factors were kept at their




default values, which decreased the influence of data with the lowest 15% of mass in the model.




The model identified four sources, however, as discussed above, the model's diagnostic output

-------
indicated that the model performance was poor. The four-source solution listed a minimum




correlation coefficient (r2) of 0.19, minimum signal to noise ratio of 7.52, and an overall




"strength" of 1.33.  Recommendations are that these values should be larger than 0.80, 2.0, and




3.0, respectively. Our conclusion was that confidence in the solution should be low. The most




likely reason for the poor performance of UNMIX is that PM2.5 impacts at Denali National Park




are, in general, very small. The mass loadings on many filter samples were low, which caused




the relative uncertainty in mass analyses in many cases to be high.  Indeed, we eliminated from




consideration nearly half (19 of 39) of the available species because of excessively large




uncertainty relative to the measurements. A good portion of the remaining data also had  high




levels of uncertainty (See Table 1). Because UNMIX makes no use of uncertainty information,




one would expect model performance to suffer under these conditions.




       Table 2 lists the source profiles as mass fractions, the estimated uncertainty in mass




fraction, and the relative certainty of each species mass fraction. The relative certainty was




calculated as the mass fraction divided by twice the uncertainty. Table 2 shows that despite the




model's poor performance, three of the four sources found by the model showed a reasonable




amount of confidence in source composition (i.e., high relative certainty).




       The  relative certainty in the fourth source was less than one for all species, meaning that




the uncertainty was more than half the mass fraction. This result is expected because UNMIX




has difficulty identifying small sources, those with less than 10% of the total mass (see UNMIX




users' manual).  The average sample loading was 1.96 ng/rn3 PM2.5 and the average fine mass




attributed to each of the sources was 0.87, 0.44, 0.59, and 0.03 |J,g/m3 for sources 1, 2,  3,  and 4




respectively. Hence, UNMIX attributed only 1.5% of the total fine mass to Source 4, much less

-------
than 10%. Source 4 was not further considered in the UNMIX analysis because of high




uncertainty in the mass fractions and the low attributed percent of total fine mass.




       Figures 4a, 4b, and 4c show the fine mass attributed by UNMIX to sources 1, 2, and 3,




respectively, with each year plotted overlapping to show the seasonal cycle. Figures 5a, 5b, and




5c show the mass fraction distribution attributed by UNMIX to sources 1, 2, and 3, respectively




(please note, the vertical scales differ in many of the figures presented in this report). For a




given source, by inspecting the relative abundance of species, which species have the highest




relative certainly, and the seasonal distribution of mass, we can surmise a general source




category.




       The species with the highest mass fractions in Source 1 are EC1, H, OC4, OP, and




(NH4)2SO4 and those with the largest relative certainties are EC1, H, K, and OC4. These species




are indicative of biomass burning, and Figure 4a shows the seasonal pattern of high mass impacts




one would expect for wildfires in Alaska (Kasischke  et al., 2000).




       Source 2 has Al, S, Si, and (NH/t^SC^ as the highest mass fractions and Al, Ca, Fe, H, K,




Si, and Ti as the species with the highest relative certainty.  The EPA Speciate database lists a




composite of soil dust having an elemental composition of Si  17.0%, Al 6.3%, Fe 3.0%, and Ca




0.8%, or the elemental ratios Al/Si, Fe/Si,  and Ca/Si of 0.37, 0.18, and  0.05, respectively.




Roughly the same elemental ratios appear in Source 2:  0.45, 0.29, and 0.12, respectively.




UNMIX also adds significant contributions from sulfur and carbon containing species.




However, because of the ubiquitous nature of sulfate  and carbonaceous species in the data record




(see Figure 1) and UNMIX's difficulty with ubiquitous sources (EPA UNMIX 2.3 User Guide,




2002), it is possible that the model is introducing bias in the results for sulfate and carbonaceous




species in Source 2.

-------
       Figure 4b shows that the highest impacts from Source 2 were between April and June.




This corresponds well with the annual spring thaw in this region of Alaska (on average in April,




from National Weather Service data), with the months of minimum precipitation (January




through May), and with the months of highest wind speed (May and June). However, this time




of year also corresponds with the highest frequency of trans-Pacific dust transport events (Husar




et al., 1998).  Apportioning soil dust between local and trans-Pacific transport sources is beyond




the scope of this study.




       Source 3 has Na, NFLJSTOs, S, and (NH/t^SC^ as the highest mass fractions and Ca, H, K,




S, and (NH4)2SO4 as the species with the highest relative certainties.  The seasonal pattern




shown in Figure 4c is indicative of impacts from winter and springtime Arctic haze (Polissar et




al., 1999). Arctic haze refers to aerosol, originating from industrial sources in Europe, Asia, and




North America, which becomes trapped over the Arctic due to large scale winter and springtime




synoptic weather patterns. The mass fractions for Source 3 also shows significant contributions




from Na and carbon containing species, which again may be artifacts of the UNMIX model.









4.2. PMF Analysis




       The matrix of measurements, 20 species and 1194 samples, and a matrix of measurement




uncertainties of the same size and corresponding to the measurements was input into the PMF




model for source-receptor analysis. One difficulty in running the PMF model is determining the




optimal number of sources the model should solve for. One way to solve this problem is to run




PMF using the number of sources determined by UNMIX.  Another method that relies solely on




PMF is described by Lee at al. (1999).  Briefly, the method involves running PMF for




sequentially larger numbers of sources, then plotting the maximum value of the rotation matrix

-------
(a diagnostic output of PMF) for each run versus the number of sources solved for. A significant




increase in the maximum rotation matrix value indicates the optimal number of sources, Lee et




al. recommend using the number of sources just prior to the increase. In conducting this analysis




we ran the model five times and generated solutions for two, three, four, five, and six sources.




Figure 6 shows the maximum rotation matrix value for each model solution. A significant




increase occurs between the four- and five-source solutions, so this method suggests that the




four-source solution is optimal and agrees with the results of UNMIX. In the subsequent




analysis we will  consider only PMF's four-source solution.




       In  order to account for all of the mass measured at the receptor location, some




investigators have introduced scaling factors into the PMF model (Dr. Tim Larson, University of




Washington, Personal Communication).  The use of scaling factors attempts to create the best




match between source contributions determined by the model, and measurements.  They are




determined through a multiple linear regression, where source contributions are regressed to the




total measured mass. The resulting regression slope for each source is the scaling factor for that




source. The scaling factors are used as follows: the initial masses attributed to each source are




multiplied by the scaling factor for that source, and the fractional compositions determined for




that source are divided by the scaling factor.  The use of scaling factors  makes the assumption




that the model has identified all the sources impacting the receptor.




       We followed these  steps in our analysis and determined the scaling factors for sources 1,




2, 3, and 4 to be  6.17, 0.96, 1.41, and 0.22, respectively. The r2 of the multiple linear regression




was 0.94 and the slope was 0.96, which indicates that there is a good fit between the measured




masses and that the sum of the regression adjusted sources.

-------
       We summed the mass fractions for each source, after rescaling as above, and these sums




were 0.22, 0.51, 2.02, and 0.73 for sources 1, 2, 3, and 4, respectively. The sum for Source 3




was greater than one, which is not physically possible. Hence, the scaling factor determined by




multiple linear regression was too small for Source 3. We decided to further adjust Source 3




such that the sum of mass fractions equaled 1.0.  We achieved this by using a new scaling factor




of 2.83 for  Source 3.  Using this new scaling factor caused the r2 for multiple linear regression to




fall from 0.94 to 0.91 and the slope of the regression increased from 0.96 to 1.04.




       As with UNMIX, the PMF model performance was relatively poor, likely due to low




overall mass on the filters causing higher uncertainty in the species analyzed.  Table 3 lists the




model output for PMF's four-source solution.  Listed are the regression adjusted mass fractions,




the uncertainty, and the relative certainty in each species mass fraction.  Despite PMF's poor




performance, the relative certainty of the source profiles was generally higher for PMF than for




UNMIX, and PMF does a much better job quantifying a fourth source. Figures 7a, 7b, 7c, and




7d show the adjusted fine mass attributed to sources  1, 2, 3, and 4, respectively,  plotted with each




year overlapping to show the seasonal cycle.  Figures 8a, 8b, 8c, and 8d  show the distribution of




mass fractions for sources 1, 2, 3, and 4, respectively.




       PMF also  outputs information on the amount of variability in each species explained by




each source. This explained variability (EV) can be  a useful tool for qualitative  source




attribution. Those species that have high EV are on some level the most important in




determining that source in the model. Table 4 lists the EV for each species and the variability




left unexplained.  Figures 9a, 9b, 9c, and 9d show the distribution of EV for sources 1, 2, 3, and




4, respectively. Figure 9e shows the amount of variability in each species that remained




unexplained.
                                                                                       10

-------
        As with UNMIX, general source categories can be surmised by inspecting the relative




abundance of species within a source, which species have the highest relative certainty in mass




fraction, and the seasonal distribution.  Additionally, the EV of each species can be evaluated.




       Inspecting the distributions for Source 1 in Figures 8a and 9a, EC1, H, OC4, OP, and




(NH4)2SO4 dominate the mass fraction and EC1, H, K, OC4, and OP have the highest EV. These




patterns, along with the seasonal distribution depicted in Figure 7a, are consistent with a biomass




burning source.




       The mass fraction distribution for Source 2 in Figure 8b shows that Al, Fe, Si, and




(NH4)2SO4 dominate the mass fraction and the distribution of EV in Figure 9b shows that Al, Ca,




Fe, K, Si, and Ti have the highest EV.  These species, the seasonal distribution depicted in Figure




5b, and the ratios Al/Si, Fe/Si, and Ca/Si of 0.33, 0.30, and 0.14, suggest a soil dust source.




       For Source 3, Figures 8c and 9c show that S and (NH4)2SO4 dominate both the mass




fraction and EV and suggest primary or secondary industrial sources.




       For Source 4, Ca, EC1, Na, NFLjNOs, OC4, and Zn dominate the mass fraction depicted




in Figure 8d.  However, Ca, Cu, Pb, and Zn have the highest certainty in mass fraction and, as




depicted in Figure 9d, Cu, Pb, and Zn have the highest EV. We are at present unsure how to




attribute this source.




       One hypothesis we considered for Source 4 was fugitive dust from the Red Dog zinc and




lead mine north of Kotzebue, Alaska.  The concentrated product of this mine is primarily sub-20




micrometers in size, and therefore could potentially be transported long distances.  However, the




mine concentrates are primarily zinc sulfide and lead sulfide and there was virtually no sulfur




attributed to Source 4. Therefore, we discounted this possibility.
                                                                                      11

-------
4.3. Comparison of PMF and UNMIX Results




       The mass fraction distribution pattern for Source 1 seen in Figures 5a and 8a for UNMIX




and PMF, respectively, are nearly identical.  However, the magnitude of the UNMIX mass




fractions are roughly twice that of PMF. Despite this difference, the total mass attributed to




Source 1 by each model agrees quite well. This is seen by comparing the seasonal distributions




for Source 1 represented in Figures 4a and 7a, for UNMIX and PMF, respectively.  They are




nearly identical in pattern and scale.  Figure  10 shows a scatter plot of the masses attributed to




Source 1 by UNMIX plotted against that by PMF (please note, UNMIX allows small negative




values in source contributions in order to reduce bias in the results).  A linear regression of the




data gives a slope of 0.92 and r2 of 0.91, with UNMIX tending to assign slightly more mass to




Source 1 than PMF. The chemical composition of Source 1, the  seasonal distribution of mass,




and the good agreement between UNMIX and PMF give us a high confidence in identifying




Source 1 as biomass burning.




       There are larger differences between  PMF  and UNMIX for Source 2. Figure 11 shows a




scatter plot of the masses attributed to Source 2 by UNMIX plotted against Source 2 for PMF. A




linear correlation of the data shows a strong correlation, with an r2 of 0.97, but the slope is 0.40.




This indicates that both models track the same source almost perfectly, but UNMIX attributes




more than twice the mass than PMF to this source. An analysis of the mass  fraction distribution




allocated by each model provides some explanation of the difference. The primary elemental




constituents of soil are Al, Si, Ca, Fe, and Ti. The mass fraction  ratios of Al, Fe, Ca, and Ti to Si




are similar between PMF and UNMIX solutions. These ratios are: Al/Si, 0.33 and 0.45, Fe/Si,




0.30 and 0.29, Ca/Si, 0.14 and 0.12, and Ti/Si, 0.03 and 0.03, for PMF and UNMIX,
                                                                                      12

-------
respectively. While these ratios are similar, the magnitude of the mass fractions attributed to




these species differed.




       IMPROVE data protocols recommend calculating a total soil component using the




formula 2.20*A1 + 2.49*Si + 1.63*Ca + 2.42*Fe + 1.94*Ti. The scalar factors multiplying each




of these species take into account the most common form of oxides of these species found in soil.




Using this formula, the mass fraction attributed to  soil dust in Source 2 is 43.1% for UNMIX and




89.6% for PMF. Further analyses showed that UNMIX attributed 32.7% of the mass fraction to




the sum of (NH/t^SC^ and NFLtNOs, and 7.2% to the sum of carbonaceous species,




EC1+OC4+OP, whereas PMF attributed only 7.2% and 0.8%, respectively, to these summed




sources. It is plausible that there is a source that combines soil, sulfate, nitrate, and carbonaceous




species. One example is trans-Pacific transport of mixed Asian dust and industrial pollution.




However,  due to UNMIX's difficulty with ubiquitous sources (i.e., sulfate, nitrate, and




carbonaceous species in this case), it is more plausible to explain model differences as UNMIX




introducing a ubiquitous source bias while PMF demonstrates a better capacity to isolate the soil




signature.  Hence, we recommend using the PMF mass fractions for Source 2.




       We have a high confidence in identifying Source 2 as soil dust due to the chemical




composition of the mass fractions, the seasonal distribution of attributed mass, and the good




correlation between UNMIX and  PMF (despite the difference in absolute mass attributed).




       There are large differences between PMF and UNMIX for Source 3. Figure 12 shows a




scatter plot of the masses attributed to Source 3 by UNMIX plotted against Source 3 for PMF. A




linear regression of the data gives a poor fit, with a r2 of 0.50 and a slope of 0.40.




       Despite this poor regression, there are some similarities between PMF and UNMIX in




their mass fraction distributions.  The ratio in mass fraction for S/(NH4)2SO4 is 0.25 for both
                                                                                     13

-------
PMF and UNMIX, and the ratio M^NCVCNH^SC^ is 0.08 and 0.16 for PMF and UNMIX,




respectively. While these ratios in mass factions are similar, their percent contribution is very




different. The percent contribution of nitrogen and sulfur species, (NH4)2SO4+NH4NO3+S, is




94.9 and 65.9 for PMF and UNMIX, respectively, the percent contribution of carbonaceous




species, EC1+OC4+OP, is 1.1 and 7.8 for PMF and UNMIX, respectively, and the percent




contribution of metal  species is 3.9 and 9.7 for PMF and UNMIX, respectively.  Hence, Source 3




for PMF is almost entirely made up  of nitrogen and sulfur species, whereas for UNMIX they are




only about two thirds of the mass. UNMIX attributes about seven times as much mass as PMF




to carbonaceous species and over twice the mass to metal species.  These complex differences




between PMF and UNMIX likely  contribute both to the poor correlation seen in Figure 12 and to




the increased mass attributed to Source 3 by UNMIX. Based on the differences between PMF




and UNMIX discussed above for Source 2, it is likely that Source 3 for UNMIX includes some




bias as discussed above and perhaps some mixing of sources, whereas Source 3  for PMF is a




better representation of the actual  source profile.




       Therefore, overall, PMF is better able to separate the sources, as well as  quantify a fourth




source.  Having said that, the PMF result likely overestimates the contribution of some of these




sources. This is because, having regressed the results of PMF to the total fine mass, we made the




assumption that four sources were all that impact the receptor.  This assumption neglects those




sources that PMF cannot resolve, for example, sources whose composition changed significantly




over time or sources that impacted the receptor too infrequently.




       Figure 13 depicts the monthly average contribution to total fine mass for each PMF




source.  The actual contribution of these sources likely lies somewhere between that depicted in




Figure 13 and the rough estimate depicted in Figure 2.
                                                                                     14

-------
5.  Discussion




       As part of the Clean Air Act, the Regional Haze Rule requires States to mitigate




anthropogenic sources of visibility degradation such that visibility in national parks reaches




background levels within 60 years.  Hence, it is important to establish the magnitude and




chemical composition of both background (i.e., natural and non-Alaskan) and man made sources.




       Receptor modeling in this study has quantified four sources and we have identified three.




Each of the three identified sources is potentially a mixture of Alaskan man made, Alaskan




natural, and non-Alaskan sources.




       Impacts from biomass burning during summer months at Denali are likely from Alaskan




wildfires.  However, there is a significant contribution to aerosol mass from biomass burning




during winter months, on average about 1 ng/m3 (see Figure 13), which is likely not from




wildfires.  This  wintertime biomass burning source could represents a hemispheric background,




local wood stoves, or some combination of these and other sources.




       The highest soil dust impacts occur in April and May, contributing on average about 0.5




Hg/m3. There are a number of potential sources for this dust, including wind generated dust from




within Alaska, dust brought in from Asia through trans-Pacific transport, and dust generated




from vehicular traffic on unpaved roads.




       Sulfate and nitrate haze have their highest impacts in January through March, with




monthly averages around 1 ng/m3.  Arctic haze potentially makes up a large portion of sulfates




and nitrates measured during the winter months, but the partitioning between Arctic haze and




local sources needs to be established.  Additionally, during summer months, the concentration of
                                                                                      15

-------
sulfate and nitrate aerosol approaches 0.5 ng/m3 and cannot be attributed to Arctic haze. The




source of this aerosol during summer has not been determined.




       A regional haze monitoring pilot study has recently been proposed by the State of Alaska




Department of Environmental Conservation.  This study will compare aerosol measurements




within Denali Park with those made concurrently over a longitudinal transect across Alaska.




Concurrent aerosol measurements gathered over a broad area of Alaska can establish what




portion of the aerosol measured in Denali Park is the result of local sources, and what portion is




due to regional/synoptic scale phenomena like Arctic haze and trans-Pacific transport. However,




it must be noted that the magnitude and frequency of these regional/synoptic scale phenomena




likely changes from year to year.  Hence, in order to accurately quantify the local and




background impacts to visibility degradation,  a multiyear study is recommended.




       In addition to a regional impacts study, a more detailed chemical, elemental, and isotopic




analysis of the aerosol data could allow 'fingerprints' of specific sources to be established and




better quantified.  This, in conjunction with further receptor modeling, could help in determining




sources more specifically.









6. References




       Henry, R. C.,  and G. A. Norris, EPA UNMIX 2.3 User Guide, U. S. Environmental




Protection Agency, 2002.




       HusarR.B., etal., The Asian Dust Events of April 1998, J. Geophys. Res., 106, 18317-




18330,2001.




       Kasischke, E.S., K. P. O'Neill, N. H. F. French, and L. L. Bourgeau-Chavez, Controls on




patterns of biomass burning in Alaskan boreal forests, pages 173-196 in E. S. Kasischke and B.
                                                                                      16

-------
J. Stocks, editors, "Fire, Climate Change, and Carbon Cycling in the North American Boreal




Forest", 2000, Springer-Verlag, New York.




       Lee et al., Application of positive matrix factorization in source apportionment of particle




pollutants in Hong Kong, Atmospheric Environment, 33, 3201-3212, 1999.




       Paatero, P., and U. Tapper, Positive matrix factorization: a non-negative factor model




with optimal utilization of error estimates of data values, Environmetrics, 5, 111-126, 1994.




       Paatero, P., User's Guide for Positive Matrix Factorization Programs PMF2 and PMF3,




University of Helsinki, 2000.




       Polissar, A. V., P. K. Hopke, P. Paatero, Y. J. Kaufmann, D. K. Hall, B. A. Bodhaine, E.




G. Button, and J. M. Harris, The aerosol at Barrow, Alaska: long-term trends and source




locations, Atmospheric Environment, 33, 2441-2458, 1999.
                                                                                     17

-------
Table 1. Measured Aerosol Fine Mass (PM2s) Species from the Denali National Park IMPROVE Sampler, the
Average Value of Species' Relative Measurement Uncertainty*, and Those Species Chosen For Receptor
Modeling**.
Available Species for Receptor Modeling
Average Value of Relative
Species Used in Receptor
	Modeling***'^
Aluminum, Al
Arsenic, As
Bromine, Br
Calcium, Ca
Chloride, Chi
Chlorine, Cl
Chromium, Cr
Copper, Cu
Elemental Carbon Fraction 1, EC1
Elemental Carbon Fraction 2, EC2
Elemental Carbon Fraction 3, ECS
Iron, Fe
Hydrogen, H
Potassium, K
PM2 5 Fine Mass, MF
Magnesium, Mg
Manganese, Mn
Molybdenum, Mo
Nitrite, N2
Sodium, Na
Nickel, Ni
Ammonium Nitrate, NH4NO3
Organic Carbon Fraction 1, OC1
Organic Carbon Fraction 2, OC2
Organic Carbon Fraction 3, OC3
Organic Carbon Fraction 4, OC4
Organic Carbon Fraction from Pyrolesis, OP
Phosphorus, P
Lead, Pb
Rubidium, Rb
Sulfur, S
Selenium, Se
Silicon, Si
Ammonium Sulfate, (NH4)2SO4
Strontium, Sr
Titanium, Ti
Vanadium, V
Zinc, Zn
Zirconium, Zr
          0.56
          0.81
          0.26
          0.14
          0.80
          0.77
          0.78
          0.66
          0.45
          0.76
          1.03
          0.09
          0.09
          0.15
          0.19
          0.84
          0.72
          0.92
          0.99
          0.51
          0.88
          0.50
          0.98
          0.86
          0.80
          0.69
          0.67
          0.96
          0.41
          0.75
          0.06
          0.78
          0.12
          0.12
          0.68
          0.46
          0.77
          0.18
          0.92
          Al

          Br
          Ca
          Cu
         EC1
          Fe
          H
          K
          MF
          Na
       1.29(NO3)
       1.4(OC4)
        1.4(OP)

          Pb
          Si
       1.375(SO4)
          Sr
          Ti

          Zn
*The relative measurement uncertainty is defined here as the ratio of the uncertainty to measured value.
**Species were chosen for receptor modeling if their relative measurement uncertainty was less than 0.7.
***Some species were multiplied by constants to account for the mass of the most prevalent form. For example,
nitrate (NO3+) usually exists as ammonium nitrate (NH4NO3), and the molecular weight of NH4NO3 divided by the
molecular weight of NO3+ is 1.29.
                                                                                                   18

-------

_S£ecies__

Al
Br
Ca
Cu
EC1
Fe
H
K
Na
NH4NO3
OC4
OP
Pb
S
Si
(NH4)2SO4
Sr
Ti
Zn
	 SourceJ^omposition 	
Mass
Fraction
0.0003
0.0001
0.0007
0.0000
0.0957
0.0010
0.0553
0.0045
0.0012
0.0146
0.0983
0.0996
0.0000
0.0186
0.0031
0.0685
0.0000
0.0002
0.0003

0.0005
0.0000
0.0003
0.0000
0.0104
0.0004
0.0030
0.0005
0.0012
0.0027
0.0066
0.0133
0.0000
0.0042
0.0012
0.0162
0.0000
0.0001
0.0000
Relative
0.3
2.9
1.3
1.1
4.6
1.4
9.2
5.0
0.5
2.7
7.4
3.8
0.8
2.2
1.3
2.1
0.3
1.4
2.7
	 Spjrrce^Cpjnposition^^^
Mass
Fraction
0.0437
0.0003
0.0117
0.0001
0.0188
0.0283
0.0297
0.0139
0.0014
0.0095
0.0197
0.0337
0.0003
0.0782
0.0975
0.3179
0.0001
0.0025
0.0002
JJncejtainty__
0.0037
0.0001
0.0010
0.0000
0.0054
0.0022
0.0032
0.0009
0.0035
0.0040
0.0051
0.0067
0.0000
0.0100
0.0093
0.0417
0.0000
0.0002
0.0001
Relative
5.8
2.6
6.1
1.4
1.7
6.6
4.6
7.5
0.2
1.2
1.9
2.5
3.2
3.9
5.2
3.8
2.3
7.2
1.2
	 SjDurceJ^CojripjDsition
Mass
Fraction
-0.0011
0.0008
0.0085
0.0003
0.0400
0.0025
0.0349
0.0053
0.0726
0.0769
0.0251
0.0131
0.0008
0.1150
0.0044
0.4674
0.0000
0.0006
0.0021

0.0009
0.0001
0.0006
0.0000
0.0048
0.0006
0.0026
0.0004
0.0081
0.0072
0.0043
0.0053
0.0001
0.0071
0.0020
0.0304
0.0000
0.0001
0.0004
Relative
-0.6
5.5
6.8
3.3
4.1
2.3
6.7
6.6
4.5
5.4
2.9
1.2
3.3
8.1
1.1
7.7
0.2
3.3
2.8
Source 4 Composition
Mass Relative
0.0005
0.0049
0.0228
0.0017
0.0540
0.0122
0.0734
0.0118
0.0076
0.0951
0.1109
0.0998
0.0026
0.0922
0.0577
0.2955
0.0094
0.0045
0.0021
0.0615
0.0511
0.2147
0.0238
0.3508
0.0404
0.4535
0.0714
0.1331
0.8698
0.9705
0.6422
0.0271
0.4520
0.3395
2.3902
0.1346
0.0376
0.0261
0.0
0.0
0.1
0.0
0.1
0.2
0.1
0.1
0.0
0.1
0.1
0.1
0.0
0.1
0.1
0.1
0.0
0.1
0.0
*Here relative certainty is defined as the mass fraction over twice the uncertainty.
                                                                                                                                                  19

-------
TableS^SourceCompositi^^                                              Data from Denali National Park.
^»npr*ipc ^IniiTV'p 1 f^nTnnncitimi ^IniiTV'p ^ f^nTnnncitimi
_Jc_!l_?..^^

Al
Br
Ca
Cu
EC1
Fe
H
K
Na
NH4NO3
OC4
OP
Pb
S
Si
(NH4)2SO4
Sr
Ti
Zn
Mass
Fraction
0.00000
0.00010
0.00020
0.00001
0.04827
0.00050
0.05921
0.00303
0.00041
0.00498
0.06000
0.04670
0.00000
0.01108
0.00180
0.03060
0.00000
0.00009
0.00003
JJncertainty_
0.00000
0.00000
0.00002
0.00000
0.00080
0.00001
0.00020
0.00004
0.00006
0.00019
0.00104
0.00087
0.00000
0.00015
0.00005
0.00060
0.00000
0.00001
0.00000
Relative
0.0
30.6
6.4
3.6
30.3
20.9
149.3
38.8
3.2
12.8
28.9
26.8
0.0
37.9
16.6
25.4
3.8
7.1
8.1
Mass
Fraction
0.07062
0.00020
0.03007
0.00006
0.00001
0.06319
0.00269
0.02658
0.00000
0.00001
0.00098
0.00662
0.00023
0.01911
0.21171
0.07227
0.00023
0.00615
0.00011
JJncertainty_
0.00041
0.00001
0.00023
0.00001
0.00015
0.00021
0.00055
0.00024
0.00003
0.00015
0.00368
0.00299
0.00001
0.00064
0.00084
0.00369
0.00001
0.00006
0.00001
Relative
86.7
11.4
66.4
5.3
0.0
148.6
2.5
55.2
0.0
0.0
0.1
1.1
11.6
14.8
126.6
9.8
16.6
49.7
5.2
Source^Comrjositipn
Mass
Fraction
0.00000
0.00050
0.00465
0.00008
0.01117
0.00251
0.00091
0.00289
0.02116
0.05735
0.00001
0.00001
0.00038
0.17841
0.00667
0.71317
0.00007
0.00006
0.00000
JJncejtainty_
0.00000
0.00001
0.00006
0.00000
0.00141
0.00004
0.00029
0.00007
0.00031
0.00067
0.00014
0.00020
0.00000
0.00058
0.00015
0.00227
0.00000
0.00002
0.00000
Relative
0.0
50.0
36.2
13.6
4.0
35.8
1.6
19.3
33.9
43.0
0.0
0.0
39.2
154.9
22.3
156.8
12.8
1.9
0.0
Source 4 Composition
Mass Relative
0.00000
0.00176
0.06712
0.00382
0.15314
0.00832
0.01236
0.01211
0.11288
0.23058
0.05428
0.00008
0.00501
0.00079
0.01320
0.00074
0.00041
0.00142
0.04916
0.00005
0.00006
0.00100
0.00006
0.01726
0.00035
0.00361
0.00081
0.00454
0.00932
0.02577
0.00167
0.00009
0.00440
0.00163
0.01237
0.00004
0.00020
0.00019
0.0
14.2
33.7
30.2
4.4
11.9
1.7
7.5
12.4
12.4
1.1
0.0
28.0
0.1
4.0
0.0
5.1
3.6
126.2
*Here relative certainty is defined as the mass fraction over twice the uncertainty.
                                                                                                                                       20

-------
Table 4. The Fraction of Variability in the Mass of Each Species that is Explained by a Four-Source Solution Using
                    Ur^
Species 	 V^riaMh^ofJp^desExpjaine^biSource*

Al
Br
Ca
Cu
EC1
Fe
H
K
Na
NH4NO3
OC4
OP
Pb
S
Si
(NH4)2SO4
Sr
Ti
Zn
Source 1
0.000
0.198
0.025
0.031
0.583
0.074
0.895
0.322
0.020
0.102
0.684
0.604
0.000
0.143
0.071
0.097
0.040
0.074
0.049
Source 2
0.595
0.050
0.362
0.040
0.000
0.650
0.007
0.314
0.000
0.000
0.002
0.013
0.068
0.033
0.655
0.027
0.219
0.478
0.021
Source 3
0.000
0.380
0.231
0.169
0.066
0.151
0.007
0.142
0.344
0.455
0.000
0.000
0.360
0.753
0.113
0.791
0.228
0.022
0.000
Source 4
0.000
0.064
0.154
0.322
0.042
0.027
0.005
0.030
0.093
0.089
0.014
0.000
0.198
0.000
0.012
0.000
0.064
0.025
0.850
_UrKxplained_
0.405
0.307
0.228
0.437
0.309
0.098
0.086
0.192
0.544
0.354
0.300
0.383
0.374
0.071
0.149
0.084
0.449
0.401
0.080
* Assumes a total variability in each species of 1.0.
                                                                                                     21

-------
    (0
    (D
30^
25-
20-
10-
5-
(a) IMPROVE 'EC+OMC



J F M >


	 ^
\ M J


I



111 ImLlLuLL j Liirfjiju ^L^^t^
I"" ""i""" 	 1 	 	 	 1 	 " 	 "1
J A S O N D J
            (b) IMPROVE'SOIL1
         3-
JFMAMJJAS
 (c) IMPROVE 'NH4NO3+(NH4)2SO4'
                                                 O
\     \
N   D
         0
           JFMAMJJASONDJ
   Measurement Date (month of year, multiple years overlapping)
Figure 1. IMPROVE Fine Mass (PM2.s) Data Plotted with Multiple Years Overlapping. Three
Source Categories are Depicted, (a) IMPROVE 'EC'+'OMC' as an Estimate for Biomass Burning,
(b) IMPROVE 'SOIL' as an Estimate for Soil Dust, and (c) IMPROVE 'NF^MV+'CNH^SOV as
an Estimate for Sulfate and Nitrate Haze.
                                                                    22

-------
     CD
     CD
     D)
     (0
4.5-,
4.0-
3.5-
3.0-
2.5-
2.0-
1.5
1.0
0.5
0.0
                    | OMC+EC
                    I SOIL
                    I NH4N03+(NH4)2S04
                    I remainder
                                                     S    O   N   D
                                  Month of Year
Figure 2. Monthly Averages of IMPROVE Fine Mass (PM2.5) Data. Four Categories are
Depicted, IMPROVE 'EC'+'OMC' as an Estimate for Biomass Burning, IMPROVE 'SOIL' as an
Estimate for Soil Dust, IMPROVE 'NH4NO3'+'(NH4)2SO4' as an Estimate for Sulfate and Nitrate
Haze, and the Remaining Mass on the Filter.
                                                                          23

-------
          1988     1990     1992     1994     1996     1998
                                Measurement Date (year)
2000
2002
                 FMAMJJASONDJ
               Measurement Date (month of year, multiple years overlapping)
Figure 3.  PM2.5 Aerosol Fine Mass Measurements From the Denali National Park IMPROVE
Sampler Plotted (a) sequentially and (b) with years overlapping.
                                                                               24

-------
    f_
     (/)  30-
     0)  20-
     O  10-1
     p
           (a) UNMIX Source 1
               I    I     I     I    I     I     I    I     I     I    I     I
          JFMAMJJASONDJ
    ro
         8-
         6-
     (0
     0
     8
     2
    4!
         2-
           (b) UNMIX Source 2
M
M
         6-,
           (c) UNMIX Source 3
O    N   D
          JFMAMJJASONDJ
     Measurement Date (month of year, multiple years overlapping)
Figure 4.  Seasonal Distribution of Aerosol Fine Mass (PM2.5) Attributed to Sources 1 (a), 2 (b),
and 3 (c), by the UNMIX Receptor Model.
                                                                     25

-------
        0.10n
    g  0.08J


    TO  0.06J
        0.04-
        0.02-
    (0
        0.00-
(a) UNMIX Source 1
         0.5n
     §  0.4^
     C/)

     (0
         0.1 q


         0.0
             (c) UNMIX Source 3

                                                       co  co -g  co H
                                                              I
0.35n
0.30-j
Q 0.25 -I
(0 0.20 -j
1 0.10^
0.05-j
nnn:
(b) UNMIX Source 2 |


,_, ^^n^ ^m













K)
CO
%





                                                       co  co -g  co H
                                                              I
                                          8
                                      Species
                                          co  co  -g  co H

                                                 5  "
                                                 K)
                                                 CO
                                                 2
Figure 5. Source Composition Mass Fractions for Sources 1 (a), 2 (b), and 3 (c), Determined by
the UNMIX Receptor Model.
                                                                            26

-------
       0.10-,
       0.08-
       0.06-
     X

    •fe
       0.04-
     g
    to
       0.02-
       O.OO-l	f-
              Two Sources   Three Sources   Four Sources    Five Sources    Six Sources

                             Number of Sources Specified in PMF
Figure 6. Rotation Matrix Maximum Value for Different Numbers of Sources in the PMF
Receptor Model.
                                                                                  27

-------
    s
     (D
    •f
     '
     co
     co
     co
     co
     (D
40-,
35-
30-
25-
20-
15-
10-
 5-
 0
              (a) PMF Source 1
 3-

 2-

 1-

 0
        F    M    A   M
    (b) PMF Source 2
                                                      WK!>VJ^«AIV^VM^%A.M

                                                            O    N    D
   J    F    M    A
    (c) PMF Source 2

                                                                            .,
                                                                  ^    r
                                                            O    N    D
         0.0
            JFMAMJJASONDJ
                Measurement Date (month of year, multiple years overlapping)
Figure 7.  Seasonal Distribution of Aerosol Fine Mass (PM2.5) Attributed to Sources 1 (a), 2 (b),
3 (c), and 4 (d) by the PMF Receptor Model.
                                                                                28

-------
0 0.06 -]
t5 0.05 -
£ 0-04-
w 0.03-
c§ 0.02-
^ 0.01 -
0.00 J
0.25 -,
.Q 0.20-
03 0.15-
1 1
(/) ' ~
c§ 0.05-
^ 0.00-
c 0.80-,
O
t$ 0.60-
LJ- 0.40-
c8 0.20-
0.00-
^~ 0.25 —
"I °-2°-
E 0.15-
LJ_
(g 0.10-
g 0.05-
nnn_
ia

; n

vir source




(b) PMF Source

^

s?
n n
S'PPJS1
i

2
1




. — .
i — i

8
^
Z
(c) PMF Source 3






I§
I





o





5





i — i
0)




1 — 1


CO
s
n „
C/) ^ CO H N


n
— ""* Q) C O fl> Q)
(d) PMF Source 4


n

, — ,
i i
i i

8

§
n
°

°"



1
*

CO
S
1 1


                > ro  o p
<*>
— =5
                                              8
                                          Species
Figure 8.  Source Composition Mass Fractions (PM2.s) for Sources 1 (a), 2 (b), 3 (c), and 4 (d)
Determined by the PMF Receptor Model.
                                                                                29

-------
                _, (a) PMF Source 1
IB .-6- °-8:
.ES °-6-
4-S °A--
iB5 °-2:
n n
•D >> 0.6-
0) .-^
•i ^ 0.4-
J5 co
8-'§ °-2-
W > oo "
•S.^0-8:
.ES °-6-
J5 .cp_ o.4-
^< .g 0.2-
n n
•S.-^0-8:
.ES °-6-
J5 .ro o.4-
^< .g 0.2-
n n
^"8 0.6 -,
."^ C.
!E ro 04
(0 Q.
co S 0.2-
^00
n
>
>
(c
ro
))F
S
>M
— i
O m -n n:
C O 
-------
        35
        30-
     co
     CO
         5,
         0-
        -5-
          -5
UNMIX(x) and PMF(y) source 1 fine mass
1:1 Line
Linear Regression, y = 0.92 x + 0.66, r2 = 0.91
          5       10       15       20       25
          UNMIX Source 1 Fine Mass (ng/m3)
30
35
Figure 10.  Aerosol Fine Mass (PM2.5) Attributed to Source 1 by the UNMIX Receptor Model,
plotted on the x axis, and by the PMF Receptor Model, plotted on the y axis.

-------
        6-,
        5-
        4-
     Cfl
     
-------
    6-,


    5-


    4-


(0  3-

i
d)  2 _

IT
CO

8  1
        o-
        -1-
        -2-
                  UNMIX(x) and PMF(y) source 3 fine mass
                  1:1 Line
                  Linear Regression, y = 0.50x + 0.31, r2 = 0.40
          -2-101234

                           UNMIX Source 3 Fine Mass (ng/m3)
Figure 12.  Aerosol Fine Mass (PM2.s) Attributed to Source 3 by the UNMIX Receptor Model,

plotted on the x axis, and by the PMF Receptor Model, plotted on the y axis.
                                                                               33

-------
    co

     CD
     (D
     D)
     2
     (D
5.0
4.5
4.0
3.5
3.0
2.5
2.0
1.5
1.0
0.5
0.0
PMF Source 1 (biomass burning)
PMF Source 2 (soil)
PMF Source 3 (industrial)
PMF Source 4 (?)
               J    F   M   A   M
                      \
                      J
    O 0)
         100 H
          80-
    3|
    c<40^
       020^
           0
                    \
                    F
               M   A   M    J    J   A
                        Month of Year
 \^
o
                                        N   D
Figure 13.  The Monthly Average Contribution of Sources as Determined by PMF Receptor
Modeling, Depicted as Both Total Mass and Percent Contribution.
                                                                      34

-------