EIS Requirements Document

Issue Description: Enhancement to Option Group / Option Set Functionality
EIAG Point(s) of Contact: Rich Mason

Summary

In the Source Classification Code table, EIAG can define SCCs that have a hierarchical nature. That is
there may be a "General" group, but then more specific SCCs within the same group. These relationships
are defined by the "Option Group / Option Set" (OGOS) fields in the SOURCE_CLASSIFICATION_CODE
table. During the selection process, if all data were compiled across multiple data sets, there is the
potential for the double-counting of data sources. For example: The EPA may report data to a "General"
SCC, but the State / Local / or Tribal (SLT) organization submits to the detail SCCs. Without the OGOS
evaluation, both sets of data would be included in the selection.

The current OGOS rules employed in the Selection assumes that if a SLT submits data, they are
summitting data for the entire group and no additional data sets are to be used to "back-fill" any SCCs
within the same option set. The desired function is for the selection to back-fill any SCCs within the same
option set.

Definitions
Option Group:

The Option Group defines the general categories for which an SCC hierarchy exists. Examples:
"Cattle", "Commercial", "Poultry", "Goat", etc.

Option Set:

An Option Set is a group (1 or more) SCCs that exist within an Option Group. There may be
multiple levels of an Option Set. The highest-ranking option set is determined alphabetically.
Using the "Goat" Option Group example from above, the option sets (in priority order) may look
like this:

Option Group

Option Set

SCC Level 3 Description

SCC Level 4 Description

Goat

A

Goats Waste Emissions

Not Elsewhere Classified

Goat

B

Goats Waste Emissions

Milk Goats

Goat

B

Goats Waste Emissions

Angora Goats

Ranking Dataset

The Ranking Data Set is the defined by the Selection Definition in EIS. When creating a selection,
the list of data sets to be used within the selection must be specified. These are assigned in
priority order. The data set that has the highest priority that is found for the given Option
Group will define what level of Option Set to use.


-------
Business Rules to be Used

1.	The OGOS rules cannot be applied to the Point Data Category

2.	If no OGOS exist for an SCC, the following hierarchy will be used:

a.	Highest Ranking Data Set

b.	Location

c.	SCC

d.	Emissions Type Code

e.	Pollutant

3.	If a OGOS does exits, the selection process will determine what Option Sets within an Option
Group to use based on the following:

a.	Location

b.	Pollutant

c.	Option Group

d.	Minimum Option Set Level (Length to Evaluate and the Group Letter. "3-B" for
Example)

e.	Data Set Ranking

f.	Highest Ranking Option Set within the Group

4.	The structure of the Option Sets will be assumed to be as follows:

a.	The first position must be a capital letter ("A", "B", "C", etc.) and must be populated if
there is an Option Group Defined for the SCC.

b.	The second position (if provided) must be a number between 1 and 9.

c.	The third position (if provided) must be a capital letter ("A", "B", "C", etc.)

d.	If there is a second position provided, there must also be a third position provided.

e.	No more than 3 levels may be specified

f.	The first-position letter assignment should be thought of as "This is the set that I want
to have a higher priority over another" rather than ""A" is assigned to the total
grouping".

5.	The ranking of the Option Sets will be as Follows:

a.	The lowest alphabetical value for the first position of the Option Set value

i. An "A1A" SCC will rank higher than a "B" SCC within the Option Group

b.	Option Sets with 1 character rank higher than those with 3 characters

i. If SLTs supply SCCs with both an "A" and an "A1A", then only the "A" SCCs will
be used.

c.	Option Sets with 3-character Option Sets are Grouped by the first 2 characters in the
Option Set description. Within each of these groups, the third position will determine
the ranking.

i. Example:

1.	In the selection definition, the SLT data set ranks higher than the EPA
data set.

2.	An SLT reports SCCs with the following Option Sets within an Option
Group: A1A, A2A, A2B, A3B, and A3C.

3.	The EPA Submits data for this Option Group with the following Option
Sets: A, B, A1A, A2A, A3 A, A4A


-------
4.	Since the SLT has the priority, it will set what level of Option Sets to be
used. In this case, it is the second level of the Option Sets (since they
did not supply any "A" or "B" Option Set SCCs). The Option Sets to
Evaluate within the Option Group are:

a.	A1 (From SLT)

b.	A2 (From SLT)

c.	A3 (From SLT)

d.	A4 (From EPA)

5.	The highest-ranking Option Sets within each of these groups are:

a.	A1A (From SLT)

b.	A2A (From SLT)

c.	A3B (From SLT)

d.	A4A (From EPA)

Examples

The following examples are to illustrate how different data scenarios would be implemented with these
rules. They are strictly for informational purposes only and do not reflect any specific known data
submission scenario. A key to the table in all the examples are:

•	~ = The agency Submitted Data

•	I	]= The data point will be selected

Example 1:

Selection Definition (Listed in Priority Order):

1.	SLT Data

2.	EPA Dataset 1

3.	EPA Dataset 2

Submitted Data by Each Organization
Option Set = RWC_Hydronic

SCCs in
Option Set

SCC Description

Option
Set Code

SLT
Data

EPA
Dataset 1

EPA
Dataset 2

2104008600

Hydronic heater: general, all types

A



~



2104008640

Hydronic heater: meets NESCAUM
phase II standards

B

~



~

2104008630

Hydronic heater: pellet-fired

B





~

2104008610

Hydronic heater: outdoor

B





~

2104008620

Hydronic heater: indoor

B

~





What Data Gets Included and Why (or Why Not)?

•	None of the "A" Option Set SCCs will be chosen because the SLT Data is the highest-ranking data
set. The highest-ranking Option Set Code they submitted was "B", thus making the "Minimal
Option Set Level" for this example "1 - B" (one character long, set code of "B")

•	2104008640 - Selected because it was submitted by the SLT and is in the "B" Option Set


-------
•	2104008630 - Selected because it is the highest-ranking data set for the SCC and is in the "B"
Option Set

•	2104008610 - Selected because it is the highest-ranking data set for the SCC and is in the "B"
Option Set

•	2104008620 - Selected because it was submitted by the SLT and is in the "B" Option Set

Example 2:

Selection Definition (Listed in Priority Order):

1.	SLT Data

2.	EPA Dataset 1

3.	EPA Dataset 2

Submitted Data by Each Organization
Option Set = WW Treatment

SCCs in
Option Set

SCC Description

Option
Set Code

SLT
Data

EPA
Dataset 1

EPA
Dataset 2

2630010000

Industrial |Total Processed

B1A




-------
3. EPA Dataset 2

Submitted Data by Each Organization
Option Set = P and P Product Tran

SCCs in
Option Set

SCC Description

Option
Set Code

SLT
Data

EPA
Dataset 1

EPA
Dataset 2

2505000000

All Transport Types |Total: All
Products

A




-------
SCCs in
Option Set

SCC Description

Option
Set Code

SLT
Data

EPA
Dataset 1

EPA
Dataset 2

2505000900

All Transport Types |Tank Cleaning

B





~

2505010000

Rail Tank Car|Total: All Products

B1A

~





2505020000

Marine Vessel |Total: All Products

B2A



~

~

2505030000

Truck|Total: All Products

B3A



~

~

2505040000

Pipeline|Total: All Products

B4A



~

~

2505010030

Rail Tank Car|Crude Oil

BIB



~



2505010060

Rail Tank Car| Residual Oil

BIB



~



2505020030

Marine Vessel | Crude Oil

B2B

~





2505020041

Marine Vessel | Lube Oil-Barge

B2B



~



2505030030

Truck|Crude Oil

B3B



~

~

2505030060

Truck| Residual Oil

B3B

~

~

~

2505040030

Pipeline | Crude Oil

B4B

~





2505040060

Pipeline | Residual Oil

B4B





~

What Data Gets Included and Why (or Why Not)?

•	The first thing to consider in this example is determining the Minimum Option Set Level. Out of
this Option Group, the SLT is the highest-ranking data set. The level where they submitted data
is "3-B".

•	None of the Option Set "A" or "B" values will be selected because of the Minimum Option Set
Level being "3-B".

•	The 2 "BIB" values are not selected because the SLT reported values at the "B1A" level.

•	SCC 2505010000 is selected because it is the highest-ranking dataset within the "B1A" group in
the "3-B" Minimum Option Set Level.

•	None of the B2A, B3A, or B4A SCC are selected because there is a higher-ranking data set that
reported data at a different level.

•	2505020030 and 2505020041 are selected because they are the highest-ranking data sets in the
B2B grouping.

•	2505030030 and 2505030060 are selected because they are the highest-ranking data sets in the
B3B grouping of SCCs.

•	2505040030 and 2505040060 are selected because they are the highest-ranking data sets in the
B4B grouping of SCCs.


-------