United States        Air and Radiation      EPA420-R-96-009
Environmental Protection               March 1996
Agency
Methodology for
Estimating Emission
Rates from
State-Specific
IM240 Data
                     > Printed on Recycled Paper

-------
              METHODOLOGY FOR ESTIMATING
                   BASIC EMISSION RATES
                    FROM STATE-SPECIFIC
                          IM240 DATA

                        FINAL REPORT
                           Prepared for:

                        Ms. Connie Radwan
                      Office of Mobile Sources
                 U.S. Environmental Protection Agency
                        2565 Plymouth Road
                        Ann Arbor, Ml 48105
                           Prepared by:

                    E.H. Pechan & Associates, inc.
                       5537-C Hempstead Way
                        Springfield, VA 22151
                           March 8,1996

                     EPA Contract No. 68-D3-0035
                       Work Assignment II-68
                   Pechan Report No. 96.03.004/1768
THIS DOCUMENT HAS NOT BEEN PEER OR ADMINISTRATIVELY REVIEWED WITHIN EPA AND IS
FOR AGENCY USE/DISTRIBUTION ONLY. DO NOT QUOTE, CITE, OR DISTRIBUTE. MENTION OF
TRADE NAMES OR COMMERCIAL PRODUCTS DOES NOT CONSTITUTE ENDORSEMENT OR
RECOMMENDATION FOR USE.

-------
                          CONTENTS
                                                           Page

ACRONYMS AND ABBREVIATIONS	 v

CHAPTER I
INTRODUCTION 	 1

CHAPTER II
IM240-TO-FTP CORRELATION  	 3
   A.  DEVELOPMENT OF IM240 LANE-TO-FTP BAG 3 CORRELATION	 3
   B.  DEVELOPMENT OF FTP CORRELATIONS 	 4

CHAPTER III
METHODOLOGY FOR INCORPORATING STATE-GENERATED IM240 DATA	 7
   A.  DIRECT INCORPORATION OF STATE DATA 	 7
   B.  METHOD FOR DEVELOPING BER EQUATIONS	 8

CHAPTER IV
EVALUATION OF PROCEDURE  	 11

CHAPTER V
RECOMMENDATIONS FOR FURTHER ACTION	 13
   A.  RECOMMENDATIONS OF DATA ELEMENTS TO BE COLLECTED	 13
   B.  STRATIFICATION OF DATA SAMPLE 	 13
   C.  EFFECTS OF VEHICLE PRECONDITIONING	 13

REFERENCES 	 15
                               11

-------
               ACRONYMS AND ABBREVIATIONS
BERs         basic emission rates
DR           deterioration rate
EPA          U.S. Environmental Protection Agency
FTP          Federal Test Procedure
I/M        inspection and maintenance
ZML        .  zero-mile level
VINs         Vehicle Identification Numbers

-------
                                 CHAPTER  I
                              INTRODUCTION
    In the development of the U.S. Environmental Protection Agency's (EPA) MOBILE5
emission factor model, emission data from the Federal Test Procedure (FTP) were
supplemented with emission data collected from the IM240 test procedure. A statistical
correlation was performed between the FTP and IM240 data sets so that the larger data
set of IM240 tests could be used in developing basic emission rates (BERs). Historically,
the BERs in the MOBILE model had been based primarily on FTP data, with correction
factors applied to the emissions measured at the standard conditions of the FTP.

    The purpose of this report is to assist EPA in developing a methodology to use State-
generated IM240 test data to develop locality-specific emission factors. The emphasis of
this report is to make recommendations that apply to new efforts that are to be performed
in the future by the States that use IM240 tests in their vehicle emissions inspections.

    For States with enhanced inspection and maintenance (I/M)  programs, there will be
two main situations where IM240 emission test data are gathered. One is where the
IM240 test is the primary test method in the I/M program. All subject vehicles receive an
IM240 test annually or biennially. The other situation is where IM240 testing is being
performed as part of an evaluation program where a minimum of 0.1 percent of the
vehicles subject to inspection in a given year are tested. The testing shall be done on a
representative random sample of vehicles and is required  in all enhanced I/M program
areas as a check on the effectiveness of the I/M program.  With the recent emphasis in
many States on attempting to meet enhanced I/M requirements through test-and-repair
networks, it is uncertain whether IM240 testing will be the primary test method.
Assuming that widespread use of IM240 test equipment is unlikely, more stringent
requirements might be placed on the IM240-based evaluation program testing to ensure
that this data set is representative of the vehicle fleet emissions in any given
State/nonattainment area.

-------
                                 CHAPTER  II
                     IM240-TO-FTP CORRELATION
    This chapter discusses the proposed methodology for developing the correlation
 coefficients to be used in converting the IM240 emissions data to FTP-equivalent emissions
 data. These correlation coefficients were derived from a data base consisting of IM240 and
 FTP tests performed on a subset of the Hammond, Indiana Lane Data Base vehicles at an
 EPA contractor's lab in South Bend, Indiana.

    Our proposed methodology for correlating the IM240 data to FTP data consists of the
 following basic steps: (1) developing a correlation between the IM240 lane data base and
 Bag 3 of the FTP; and (2) developing a correlation between Bag 3 of the FTP and each of
 the other Bags as well as the total FTP.

 A.  DEVELOPMENT OF  IM240 LANE-TO-FTP BAG 3 CORRELATION

    Based on comments in both the Sierra (Sierra, 1994) and SAI (SAI, 1994) reports
 evaluating the methodology used to derive the BERs used in MOBILESa, Pechan
 recommends eliminating the fuel and temperature adjustments used previously to
 correlate IM240 lane scores to IM240 lab scores. Although EPA's intent in applying fuel
 and temperature adjustments was to make the IM240-to-FTP correlations more generic
 (i.e., less region-specific), the resultant effects of the fuel and temperature corrections
 appear to be minimal. Instead, Pechan recommends developing a direct correlation
 between IM240 lane test data and the corresponding FTP Bag 3 data, since the IM240
 driving cycle was developed from the Bag 3 FTP driving cycle. As such, a good correlation
 should be observed between IM240 and FTP Bag 3 emissions.

    Although variations in fuel RVP and temperature can affect emissions, in this
 instance, the temperature and fuel corrections appeared to make little difference when
 compared with the unadjusted lane data. A number of other factors may also influence
 differences between lane and lab IM240 scores such as (1) differences in preconditioning
 between the two tests; (2) inconsistent dynamometer settings; (3) vehicle changes between
 tests; and (4) operator effects. As illustrated in the Sierra report (Sierra, 1994), several
 different approaches to the lane versus lab temperature and fuel adjustments yielded
almost no difference from the approach used for MOBILESa.  With no clear trends in the
data by month or season, the small sample sizes available for some months, in combination
with the variability added by applying the seasonal adjustments, Pechan recommends that
no lane to lab adjustments be applied to the IM240 data.

   Equation (1) illustrates the regression equation.

                 Bag 3 = a + b * FTP                                        (1)

-------
 B.  DEVELOPMENT OF FTP CORRELATIONS

    The next step of the proposed methodology is to regress the FTP Bag 3 data against the
 FTP Bag 2 data, developing a relationship that will then be applied to the IM240 data that
 has been correlated to the FTP Bag 3.  Preliminary analysis using EPA's FTP data base as
 a whole (i.e., not stratified) indicated a relatively good correlation between Bag 3 and Bag
 2. Using linear regression, an R2 value of 0.86 was obtained for HC, 0.88 for CO, and 0.91
 for NOX. Presumably, these correlation coefficients would improve even more if the data
 base is first stratified to obtain separate correlation equations for each technology group,
 model year group, and emitter class combination. A log fit of the data should also be
 investigated to determine whether a better fit of the data is obtained with a log regression.

    Finally, the Bag 3 FTP data needs to be related to Bag 1,  the cold start Bag. Since
 both Bag 1 and Bag 3 follow the same cycle, differences between these two Bags should be
 attributable to the cold start.  For MOBILESa, the cold start offset was calculated as the
 mean value of the difference between the FTP and the IM240 for the set of normal emitters
 with an FTP value greater than the IM240 value. For the vehicle-specific correlations, a
 cold start offset equation was developed where the cold start offset was calculated as a
 function of mileage, assuming that as a vehicle ages, cold start emissions will increase
 because the catalyst takes longer to reach light-off.

    Our proposed procedure includes incorporating mileage in the calculation of the cold
 start offset. For the proposed procedure, the difference between the FTP Bag 1 and FTP
 Bag 3 would be calculated for all vehicles in the FTP data set. This cold start offset would
 then be regressed against mileage for each technology group, model year group, and
 emitter category.  Emissions computed using the resulting regression equation would then
 be added to the calculated Bag 3 results to provide an estimated Bag 1 for each of the
 vehicles in the IM240 data base.  These steps are illustrated in equations (2) through (4):

                 Bag 2  =  c  + d  * Bag 3                                         (2)


          CS  = Bag 1 - Bag 3 = e + f *  ODOM                                  (3)


             Bag 1 =Bag 3 + e + / * ODOM                                     (4)


where:
    CS      =   cold start offset, and
    ODOM  =   odometer reading (miles)

Once the regression coefficients a through /have been calculated, FTP emissions by Bag
can be estimated for each of the IM240 lane test data points.  It should be noted that the
above equations do not imply that only linear regressions are appropriate.  The actual data
should be examined using logarithmic or log-linear regressions,  as well, and the regression
type yielding the best fit to the data should be selected.

-------
    For States that are only performing IM240 tests for their evaluation program (i.e., with
a sample size of 0.1 percent of the set of vehicles subject to an I/M test), the sample size of
data collected in the evaluation program will likely be too small to perform the described
stratification of data.  In such cases, it may be advantageous for States to sample a greater
share of vehicles. Alternatively, a smaller number of data stratifications might be used,
depending on where gaps are  found in the data. For example, if the sample size of the
1981-1982 model year group is very small relative to the 1983 and later model year group,
then the model year group stratification might be eliminated.

-------
                                CHAPTER III
         METHODOLOGY FOR INCORPORATING STATE-
                       GENERATED IM240 DATA
A.  DIRECT INCORPORATION OF STATE DATA

    Before the State-collected IM240 data are converted to FTP-equivalent data, the
State's data base of IM240 emissions needs to be quality assured. In other words, any data
records which might be missing certain pieces of data (such as mileage, model year,
technology type, etc.) should be eliminated, so that the resulting BER equations are not
inappropriately skewed. Vehicles with mileage listed as "0" or above a certain level should
be removed from the data base. The upper bounds of acceptable mileage should be
determined by plotting a histogram to determine an obvious point after which the data
obviously drops off. Any data that are entered into the State's data base using codes, such
as a code indicating which of the four technology types applies to a given vehicle, should be
checked to make sure that all codes are valid.

    The correlations developed in the previous chapter would then be applied to each
vehicle in the State's IM240 data base. The procedure for adjusting each IM240 record is
summarized below.

    •    Calculate the FTP Bag 3 equivalent for each record using the technology group,
        model year group, and emitter category regression coefficients developed above in
        equation (1).

    •    Estimate FTP Bag 2 emissions by applying the FTP Bag 3-to-FTP Bag 2
        correlation to the FTP Bag 3 emissions that were calculated in the previous step,
        again according to technology type, model year group, and emitter category (i.e.,
        calculate Bag 2 emissions using equation (2)).

    •    Estimate Bag 1 emissions by adding the appropriate cold start offset to the
        calculated Bag 3 emissions as a function of the vehicle mileage, with different
        equations applying to each technology group, model year group, and emitter
        category combination (equation (4)).

    •    Calculate the total FTP-equivalent emissions using the following equation (i.e.,
        weight each of the Bags according to the fraction of total FTP mileage
        accumulated in each mode):
 FTP = 0.206 * Bag  1 + 0.521  * Bag 2 + 0.273 *  Bag 3                         (5)

-------
    These IM240-to-FTP correlations using the Hammond data are needed because the
 States will not be performing FTP tests, and, therefore, would not be able to develop State-
 specific IM240-to-FTP correlations.

 B.  METHOD FOR DEVELOPING BER EQUATIONS

    Once FTP-equivalent emissions have been calculated for each vehicle in the State
 IM240 data base, BER equations would need to be developed for each model year group,
 technology type, and emitter category combination to be input to the TECH model.  This is
 accomplished by performing a regression of the FTP-equivalent emissions against mileage
 for each of the subcategories of data. The y-intercept of the regression would represent the
 zero-mile level (ZML) for the BER equation, while the slope would represent the
 deterioration rate (DR).

    Two options are available at this point: (1) use the BERs as calculated for input to the
 TECH model; or (2) adjust the FTP-equivalent emissions data to remove the effects of the
 I/M program.

    If option (1) is selected, the BERs input to MOBILE and the emission factors output by
 MOBILE would represent in-use emissions resulting from the State's current I/M program
 averaged over the time that the testing was performed. For vehicles that have failed their
 initial test, only the data from the final test should be included  (whether the vehicle passed
 the test at this point or was given a waiver).  Any records containing test data prior to the
 final pass/waiver determination should be removed from the data base before computing
 the BER equations.

    If option (2) is selected, Vehicle Identification Numbers (VTNs) should be used to track
 vehicles that have failed the IM240 test over successive test cycles. In this case, test data
 from vehicles that have failed the IM240 test in the first test cycle should be identified.
 Emissions from the initial test in the first cycle should be compared with the emissions
 from the final retest for that vehicle in the  first cycle (i.e., after the vehicle passes the
 IM240 test or is given a waiver).  The reduction (or increase) in  emissions between these
 two tests should be calculated. Next, the vehicle's initial test data from the next test cycle
 should be located. Assuming that the effect of any repairs made to the vehicle would
 deteriorate at the same rate as the vehicle's initial deterioration rate, the difference in
 emissions between the initial and final tests in the first cycle should be added to the
 emissions from the second cycle.  This process would be repeated in successive test cycles.
These adjustments should be performed using the FTP-equivalent emissions data. Finally,
 using the adjusted data from all test cycles (excluding data other than each vehicle's initial
 test in a given cycle), BER equations could  be developed by technology type/model year
group/emitter category using linear regressions of the FTP-equivalent emissions versus
mileage. The resulting equations would be used as input to the TECH model.

    By using the results from both option (1)  and  option (2), a State could estimate the
overall effects of its I/M program on vehicle emissions.  The output from both options (1)
and (2) would be input (separately) to the TECH model. The resulting output BER
equations could be used as input to the MOBILE model, with each of the two BER equation
sets (i.e., with and without I/M) in a separate MOBILE input file.  The data would be
accessed by appropriately setting the control flag  "NEWFLG" within the MOBILE input

-------
file.  By keeping all other inputs within the two MOBILE input files identical, the
difference in the emission factors output by MOBILE for these two files will represent the
effect of the I/M program in place at the time the data were collected.

    This methodology could only be used by States using IM240 as their primary motor
vehicle emission test method. States that are only performing testing on a sample of
vehicles would not have the ability to trace a vehicle's pre- and post-repair emissions.

-------
                                 CHAPTER IV
                     EVALUATION OF PROCEDURE
    The approach discussed in this report is analyzed here. First, the methodology for
 calculating FTP-adjusted emissions is examined. This is followed by an evaluation of the
 two options for developing BERs from the FTP-equivalent emissions data.

    The primary strength of applying the proposed procedure to calculate FTP-adjusted
 emissions is that the method would be compatible with the current structure of
 MOBILESa. All correction factors within the MOBILE model that are based on the FTP
 test cycle would still be valid (such as the speed adjustments). In addition, the fact that
 the data are State-specific, rather than the MOBILE default BERs developed from the
 vehicle fleet in another State, adds a degree of reality to the emission factors that are
 calculated with these State-specific BERs.

    One weakness of this approach stems from the lack of FTP testing at the State level.
 IM240 emissions from a State-specific data base are being adjusted to FTP-equivalents
 using FTP correlations that were based on the vehicle fleet from a different region. This
 will likely bias the FTP-equivalent emissions in that they will not be truly representative
 of the State in which the data were collected, but will be somewhat influenced by the
 vehicle fleet in other regions (e.g., Hammond, Indiana). As indicated earlier in the report,
 temperature and fuel effects are not expected to cause significant differences in the
 resultant data adjusted to FTP equivalents. However, differences in the vehicle mix
 between the FTP test site and the State of interest may be a more important consideration.
 If the average age and/or accumulated mileage in each of the technology type/model year
 group/emitter category groupings in the State data differ significantly from the
 corresponding data used to calculate the IM240-to-FTP conversions, the effects may be
 more severe than any fuel/temperature effects. It might be possible to evaluate whether
 this is an important issue by weighting the data from the IM240-to-FTP data base such
 that the average age or mileage of each subgrouping is significantly higher or lower than
 the current data base, and then recalculating the correlation coefficients.

    The strength of using the first option  for developing BERs, in which the effects of the
 State's I/M program are included, is that the actual emissions resulting from the I/M
program will be seen.  This would show whether an I/M program is achieving the emissions
benefits needed in the area projected for meeting attainment and rate of progress
demonstrations.

    The primary advantage of the second  option for developing BERs, in which the effects
of the State's I/M program are excluded from the BER equations, is that it makes it
possible within the current structure of the MOBILE model to calculate no-I/M emission
factors. It also enables an area to model emission factors that represent a change to the
I/M program from the program characteristics included in the program at the time the


                                        8

-------
IM240 test data were collected. Regional influences, other than the effects of I/M, can be
seen by comparing these emission factors with MOBILE emission factors calculated with
default BER equations. This also provides a consistent baseline for developing an emission
inventory for the State, using the State-specific BERs for areas with an I/M program from
option 1, and option 2 BERs for areas without an I/M program. Also, if a State has
different I/M program characteristics in different areas of the State, the State-specific
BERs would represent a combination of the two.  One way to alleviate these problems
would be to develop separate sets of BERs for  each area. However, this would significantly
decrease sample sizes, and in some instances,  a valid analysis  may not be possible if the
sample size is too small.

-------
                                CHAPTER V
          RECOMMENDATIONS FOR  FURTHER ACTION
 A.  RECOMMENDATIONS OF DATA ELEMENTS TO BE COLLECTED

    Section 51.365 of the I/M Final Rule (57 FR 52950, 1992) provides a list of the
 minimum data elements. Data items of interest for this project are listed below:

        Date of the test;
        Emission test start time and the time final emission scores are determined;
        VIN;
        Gross vehicle weight rating;
        Vehicle model year, make, and type;
        Odometer reading;
        Category of test performed;
        Fuel type;
        Type of vehicle preconditioning (if any); and
        Emission scores.

    The information on test times and dates could be used with information from the
 National Climatic Center to determine hourly (or every 3rd hour) temperatures from a
 nearby meteorological site. Temperature data could also be collected at the test site.

    The work scope also mentions fuel samples as a potentially required item. For
 reformulated gasoline areas, it may be possible to use the data collected by EPA to enforce
 the reformulated gasoline program.

 B.  STRATIFICATION OF DATA SAMPLE

    Stratification of the data are always desirable,  but if there are too few records to
 analyze in each strata, other problems are introduced. Stratifying data allows for
 differences that might be masked if the data were considered as a whole, much in the way
 that extreme values are muted when considering a mean value. However, if there are
 insufficient numbers of records in the strata, the benefits of stratification can  be lost.

    The vehicles in a specified region may have certain characteristics that  are more
 prevalent (fleet age, cars versus trucks, 4-wheel drives, etc.) in that State. If so, it should
 be worthwhile and possible (numberwise) to stratify the data by that factor  to assure that
the effects are not overlooked.
                                       10

-------
C.  EFFECTS OF VEHICLE PRECONDITIONING

    There seems to be an implicit assumption in the EEA and Sierra studies that all
IM240 tests are performed with the vehicle in a fully warmed-up (hot stabilized) condition.
This assumption may be invalid.

    A presentation made by a Toyota representative pointed out (Babcock, 1994) that if
there is no vehicle preconditioning before running an IM240 test, there may be many false
failures. It has been suggested that each vehicle undergo a set amount of preconditioning
before being given an IM240 test.  EPA staff suggest that this is an unnecessary step that
would add time to every test, and that the two ways to pass test procedure is going to allow
vehicles to pass based on Bag 2 cutpoints, even in instances where a vehicle begins the
IM240 procedure with a highly loaded evaporative canister.  Canister purging will occur in
the first 93 seconds of the test. The concern about the need for preconditioning was raised
by test results based on testing performed in Toyota's lab. (This testing was performed for
a small number of vehicles, though.) EPA feels that it has a much more extensive data set
of IM240 results from test lanes that match what the State programs will experience, EPA
finds that more than 50 percent of vehicles tested pass the composite test without
preconditioning.

    The Toyota presentation, on the other hand, suggested that based on Toyota's data,
without preconditioning, there will be many false failures. In general, Toyota thinks that
without preconditioning, the average test result will be biased high. This would be a
concern if States are going to be using IM240 results as an indicator of overall emission
levels.  The upshot of the above is that in a situation where IM240 test results for a small
sample of vehicles is being used to represent a State's emission rates, consistent
preconditioning could be important in ensuring that emission test results are not biased.

    Another consideration is that the data collected by States that choose to implement a
fast-pass criteria will differ from those that perform testing for the entire 240 seconds of
the test. The emissions data in such situations will likely be skewed high.

    The effects of both issues (preconditioning and fast-pass tests) could be mitigated by
using full IM240 tests with preconditioning, at least on a sample of the vehicles. It is
highly recommended that both preconditioning and full IM240 tests be used by  States that
are only testing the required sample of vehicles.
                                         11

-------
                               REFERENCES
Babcock, 1994: Robert Babcock, Toyota USA, Presentation at Tenth Annual Mobile
    Sources/Clean Air Conference, Estes Park, CO, September 27-29, 1994.

57 FR 52950, 1992:  Federal Register, Vol. 57, No. 215, p. 52950, "Inspection/Maintenance
    Program Requirements," Final Rule, November 5, 1992.

SAI, 1994: Systems Applications International, "Investigation of MOBILESa Emission
    Factors - Assessment of Exhaust and Nonexhaust Factor Methodologies and
    Oxygenate Effects," prepared for American Petroleum Institute, February 7, 1994.

Sierra, 1994: Sierra Research, Inc., "Investigation of MOBILESa Emission Factors --
    Evaluation of IM240-to-FTP Correlation and Base Emission Rate Equations," prepared
    for American Petroleum Institute, June 1994.
                                       12

-------
                        TECHNICAL REPORT ABSTRACT


REPORT TITLE: Methodology for Estimating Basic Emission Rates from State-Specific
               IM240 Data

REPORT DATE: March 8, 1996

CONTRACT NO.: 68-D3-0035

PRIME CONTRACTOR: E.H. Pechan & Associates, Inc.

WORK ASSIGNMENT NO./DELIVERY ORDER NO. (if applicable): 11-68

PROJECT OFFICER:  Mary Wilkins

PROJECT OFFICER ADDRESS: U.S. EPA, MD-12, RTP, NC 27711 TEL.: (919) 541-5229

PROGRAM OFFICE: Office of Mobile Sources

NO. OF PAGES IN REPORT:  16

DOES THIS REPORT CONTAIN CONFIDENTIAL BUSINESS INFORMATION

YES	           NO	X	

REPORT ABSTRACT - Include a brief (200 words or less) factual summary of the scope
and nature of the work performed and referenced in the report.

    In the development of EPA's MOBILES emission factor model, emission data from the
Federal Test Procedure (FTP) were supplemented with emission data collected from the
IM240 test procedure.  A statistical correlation was performed between the FTP and IM240
data sets so that the larger data set of IM240 tests could be used in developing basic
emission rates (BERs).

    Two reports were provided that detail shortcomings in the approach and statistical
analyses used to develop the MOBILES exhaust BERs.  The objective of this work
assignment is to assist EPA in improving the use of IM240 data used in developing the
MOBILE model exhaust BERs  (for LDGVs only). Since the time that the MOBILES BERs
were developed, additional FTP and IM240 emission test data have been collected, which
EPA plans to incorporate into the development of exhaust BERs for MOBILE6. This report
documents a proposed methodology for developing exhaust BER equations from State-
collected IM240 data that can be used as input to EPA's TECH model.

KEY WORDS/DESCRIPTORS - Select the scientific or engineering terms that identify the
major concept of the research and are sufficiently specific and precise to be used as index
entries for cataloging.

    MOBILES; MOBILE6; TECHS; FTP; IM240; I/M programs

-------