EPA-450/3-76-011 August 1975 CDHS EXECUTIVE SUMMARY REPORT U.S. ENVIRONMENTAL PROTECTION AGENCY Office of Air and Waste Management Office of Air Quality Planning and Standards Research Triangle Park, North Carolina 27711 ------- EPA-450/3-76-011 CDHS EXECUTIVE SUMMARY REPORT by International Business Machines Corporation 18100 Frederick Pike Gaithersburg, Maryland 20760 Contract No. 68-02-2054 _ EPA Project Officer: Lloyd Hedgepeth Prepared for ENVIRONMENTAL PROTECTION AGENCY Office of Air and Waste Management Office of Air Quality Planning and Standards Research Triangle Park, N.C. 27711 August 1975 ------- This report is issued by the Environmental Protection Agency to report technical data~of interest to a limited number of readers. Copies are available free of charge to Federal employees, current contractors and grantees, and nonprofit organizations - as supplies permit - from the Air Pollution Technical Information Center, Environmental Protection Agency, Research Triangle Park, North Carolina 27711; or, for a fee, from the National Technical Information Service, 5285 Port Royal Road, Springfield, Virginia 22161. This report W(as<*furnished to the Environmental Protection Agency by International Business Machines Corporation, Gaithersburg, Maryland 20760, in fulfillment of Contract No. 68-02-2054.' The contents of this report are reproduced herein as received from International Business Machines Corporation. The opinions, findings, and conclusions expressed are those of the author and not necessarily those of the Environmental Protection Agency. Mention of company or product names is not to be considered as an endorsement by the Environmental Protection Agency. Publication No. EPA-450/3-76-011 11 ------- NOTICE This report discusses the following three subsystems of the Comprehensive Data Handling System (CDHS): Air Quality Data Handling Subsystem-II (AQDHS-II) Emissions Inventory/Permits and Registration Subsystem (EIS/P&R) Enforcement Management Subsystem (EMS) While the EMS was originally included as a subsystem of the CDHS, installa- tion, maintenance and other matters dealing with EMS are handled by the Division of Stationary Source Enforcement (DSSE) . Any inquiries regarding the EMS should be directed to: Office of Enforcement and General Counsel Office of General Enforcement Waterside Mall, 3220L Washington, D. C. 20460 All inquiries regarding AQDHS-II or EIS/P&R should be directed to the Office of Air Quality Planning and Standards. ------- 1.0 THE COMPREHENSIVE DATA HANDLING SYSTEM - AN OVERVIEW This section provides a conceptual description of the Comprehensive Data Handling System (CDHS) to establish its role in air pollution control and to give the user an insight into the overall application of the system. This foundation will allow the user to incoporate CDHS subsystems into his normal routine with greater appreciation of the advantages and benefits that can be obtained. 1.1 INTRODUCTION Air Quality Management is today one of the most important of the public disciplines. It assumes this importance because of its relative newness and its pervasive influence on our society. While the social importance has certainly caused some helpful fiscal backing, it has also created severe challenges for Air Quality Managers. Working under severe time constraints with limited resources the state Air Quality Manager has had to: a. explore this new area of concern, b. establish procedures for enforcing air pollution control, c. prepare and implement long term policies which win lead to improvement of the air quality, and always in the harsh spotlight of public scrutiny. The exercise of air pollution control is highly dependent upon the collection, processing and analysis of large amounts of data. This data, which describes both the sources of pollutants - the emission inventory and the resultant ------- ambient concentrations - the air quality file, is constantly changing and growing in amount. Proper maintenance and processing imposes special resource requirements on the state and local agency. The Environmental Protection Agency has recognized the work load imposed on state and local agencies as the result of data collection, processing and reporting functions arising from existing Federal regulations. EPA has responded by creating a data handling system patterned to satisfy both state and Federal needs for collecting, processing, and reporting air qualityand emissions data. The need to manage resources and schedules is also recognized as an inherent part of this system. The EPA solution is the Comprehensive Data Handling System (CDHS) which is.comprised of: . . . The Emission Inventory/Permits and Registration Subsystem (EIS/P&R) The Air Quality Data Handling Subsystem (AQDHS II) The Enforcement Management Subsystem (EMS) The CDHS has been designed to support state and local agencies by providing an automated means for managing their emissions and air quality data in support of the many functions of air pollution control. A significant feature of the system is the capability to automatically respond to the Federal reporting requirement for both air quality and emissions data. Figure 1.1-1, CDHS System Summary, provides a ready reference relating air pollution control functions to CDHS support capabilities. The manner in which CDHS support capabilities are applied at the state and local level are purposely left to the descretion of the Air Pollution Control Manager so that he can have full control of his operation while exploiting the advantages of automation. EPA's approach to assisting both state and local governments in air quality management has been to regard air po.llution control as an overall system in and of itself. This system is complex with many internal interrelationships, with distinct feedback loops with many external influences. The system depends on the collection, evaluation and reporting of a large amount of data. Figure 1.1-1 illustrates these features and highlights the data relationships between ------- AIR POLLUTION CONTROL FUNCTIONS Status Recommendations Analysis and Evaluation System Data Dalii System Status Data Management Policy Regulations Directed Actions Recommendations Proposed Laws Reports Standards Laws Time Schedule Enforcement Functions I Variance System J ( Permit System J (Inspection ^ System J (Enforcement A System ) (Surveillance \ System J (Air Quality System I Complaint System J EPA Legislature Hearings Hearings Public Recommended Actions Laws Enforcement Powers Courts Legal Agency Complaints Figure 1.1-1. An Air Pollution Control System ------- management, analysis and the enforcement functions. The source enforcement functions generate data describing emission sources, air quality and related information. This data, in turn, is analyzed, evaluated and reduced to management reports measuring the success of control activities. Management translates the findings into policies and, where necessary, regulations. The nature of the end product in air quality management suggests that measurements are needed both at specific locations and at less specific or global locations. This monitoring must be of sufficient resolution so as to adequately describe the air quality found; it must also be measured with sufficient rapidity to be of more than historical value. With these considerations in mind, the system structure has been adapted for CDHS to address the many interfaces and feedbacks of the air pollution system as well as the pure data handling functions. 1.2 COMPREHENSIVE DATA HANDLING SYSTEM (CDHS) Management of air quality requires constant surveillance of air quality and careful recording of all significant pollution sources with respect to: a. Location by name and address b. Categorization by emission type and quantity c. Identification with processes d. Relation to other facility attributes. Measurements are required to trace progress of particular emission control attempts and of the general state of air quality. Involved long range plans are required to change and control specific plant processes. Detailed analysis is required to predict future air pollution conditions and to determine the contributory effect of any particular emission. These considerations must, in ------- urn, be related to Air Pollution Control Implementation Plans to ensure that nforcement actions are achieving desired goals. These and the many unmentioned asks in air quality control add up to a massive data handling requirement. he CDHS was designed to meet these specific requirements by responding to the omplex relationships of air pollution control as depicted in Figure 1.2-1. It irovides computerized data storage and retrieval subsystems to serve in irocessing the details of point and area emission inventories as well as high volumes of repetitive air quality measurements. Files and procedures necessary :o manage the long range scheduling inherent in bringing emission levels under :ontrol are provided as part of the system. U.1 subsystems have been developed with state and local needs in mind. Consequently, two system features are significantly important. All systems contain the capability for agencies to locally devise qualification criteria for accessing the data bases. This means that each agency can selectively retrieve data records (or portions thereof) according to their special needs. All systems can accommodate a wide variety of data (e.g., multiple sources, pollutants and air quality) which are selected locally. All subsystems enable users to select records from the data base and process the answers using subsystem programs to produce several classes of reports, including detail and summary reports. Provisions are made to simplify the addition of programs to extend the type, content, and formats of reports that can be generated from the data bases. In addition, the extended version of EIS permits an agency to record, store, and access information whose content and format are locally selected. This capability gives the agency a means for handling special information beyond the normal scope or of different content from generally applicable air pollution control data. The subsystems are capable of accepting machine readable information formatted ac.cording to the CDHS defined or the Federal (EPA) formats for emission and air quality data. The internal files are designed to accept multiple formats ------- Emission Inventory/ Permits and Regis- tration Subsystem (Data) Emissions Permits Compliance Complaints Prosecution Inspection Surveillance Reports Comprehensive Data Handling System Enforcement Management Subsystem (Schedules) Actions Permits Inspections Prosecution Complaints Compliance Reports Air Quality Data Handling Subsystem (Air Quality) Trends Models Surveillance Reports V Evaluation — Decision — Control Figure 1.2-1. CDHS Design Overview ------- including those defined by the system, National Emissions Data System (NEDS), the Storage and Retrieval of Aerometric Data (SAROAD) System or the local agency. Full compatibility with Federal requirements is, therefore, achieved while allowing agencies to handle data in local formats. The retrieval language format is common to all subsystems, thus providing a uniform user interface. In all cases, this facility takes simple but powerful logical expressions of the type : IF a condition is true, THEN select the record, e.g., IF STACK ='100' and converts them into computer program statements which test the data base." The language allows for compound logical queries which give the user the ability to selectively retrieve from one to many records. This capability lets the user rapidly respond to many different types of questions related to air pollution control. If the test is successful, a copy of the qualifying record is added to an output file called an answer set from which appropriate reports are generated. CDHS serves an Air Quality Control Agency by: a. Allowing better utilization of professional personnel by providing organized data handling systems. b. Assisting management by giving day-to-day visibility of the status of operations such as air quality monitoring, source monitoring, permits and registrations, inspections, complaints, surveillance, compliance, schedules, and legal actions. c. Providing a mechanism for appropriately formatting the basic data for inputs to Federal reports. ------- CDHS, or any of its subsystems, is available without charge to state and local agencies through the appropriate EPA Regional Office. EPA encourages the use of CDHS by state and local agencies and conducts a project to provide installation assistance and detailed system orientation training when requested by state and local agencies. System maintenance and further development is scheduled for at least the next several years. CDHS will store, update, retrieve and maintain a history of information to support the following agency functions: a. Emission Inventory Data Base Maintenance b. Air Quality Data Base Maintenance c. Scheduled Activity Monitoring d. Permit System Operation and Enforcement e. Compliance System Operation and Enforcement f. Inspection System Operation g. Complaint System Operation. CDHS supports these activities by a combination of independent subsystems: the Emissions Inventory/Permits and Registration Subsystem (EIS/P&R), the Enforcement Management Subsystem (EMS), and the Air Quality Data Handling Subsystem (AQDHS-II). A conceptualization of the system is shown in Figure 1.2-2, The Emissions Inventory Subsystem, with its extension to handle Permits and Registrations, (EIS/P&R), provides a means for monitoring point and area source engineering and emissions data. It has special capabilities for recording permit data and can handle narrative information such as rules and regulations. EIS/P&R also provides means for handling special data of local importance while maintaining full compatibility (for reporting purposes) with Federal requirements. EIS/P&R is especially significant since it can be used to support such agency activities as permit control, source inventory, .legal actions, and the monitoring and recording of enforcement and inspection activities. ------- The Comprehensive Data Handling System State Local EPA Air Pollution Control Figure 1.2-2. CDHS Concept ------- The Enforcement Management Subsystem (EMS) provides the means for recording and monitoring the many scheduled activities needed for air pollution control. EMS assists management by reporting scheduled actions (and providing individual Action Cards) as well as generating form letters to organize activities such as permit applications, meetings with industry, and enforcement notices for which appropriate standard letters can be locally devised. The Air Quality Data Handling Subsystem (AQDHS-II), as its name implies, records and processes air quality data. In addition it provides a means for monitoring special site information such as sensor descriptions. AQDHS-II contains a full set of statistical processing routines designed to reduce air quality data into a meaningful form in presentation. The reports generated by AQDHS-II assist agencies in monitoring the progress and effectiveness of air pollution control. AQDHS-II simplifies reporting requirements by automatically generating appropriate Federal air quality reports. The CDHS subsystems are straightforward and simple by design. They derive their power from the ease of execution of three basic capabilities: data storage, data retrieval, and report generation. The general flow of information through the system is shown in Figure 1.2-2. The greatest utility of CDHS comes from its capability to selectively retrieve and display in various formats the contents of one or more data elements in a source record. Using this capability users can answer such requests as, "Show me all data on.firms whose principal product is steel." or "Show me all data for plants who are. within a square bounded by four UTM coordinates and whose stacks are less than 100 feet and emit S02." or "How many facilities have stacks taller than 100 feet?" or "What is the distribution of emissions in Montgomery County by major categories of pollutant sources?" using EIS/P&R. AQDHS-II supplies information supporting answers to such questions as: "Are there any centers of high pollution concentration in Western City?," "Has the particulate standard been exceeded at the City Hall site this year?," "Is there any periodicity to the concentration of hydrocarbon at the corner of 5th Avenue and Vine?," "Has the 10 ------- sulfur oxide concentrations in Carroll County decreased in the last two years?" or "What are the monthly mean carbon monoxide concentrations along Interstate 70S?" EMS on the other hand helps agencies keep track of scheduled events and supplies answers to such questions as: "What inspectors will be out next week and where will they go?," "Do we have any court cases scheduled in September?," "Are there any sources who are delinquent in requesting permits?," or "Do we have any special surveillance activities next month?" The fact that separate computer program facilities are used to extract data from files and to list the data makes the system generally amenable to local extension with specialized programs. These locally developed programs resolve special local problems and analyze particular atmospheric conditions. 1.3 EMISSION INVENTORY/PERMIT AND REGISTRATION SUBSYSTEM (EIS/P&R) The Emission Inventory/Permit and Registration Subsystem (EIS/P&R) is designed to help resolve agency problems associated with maintaining and controlling sources of pollution (Figure 1.3-1). The subsystem provides the basic capabilities needed to record data describing sources of pollution in terms directly applicable to control and enforcement rules and regulations. The re- corded data includes engineering data describing the ultimate source of pollution as well as the flow through the system to the atmosphere. The subsystem also accommodates descriptive data related to permits or registrations pertaining to each source. Since the emission inventory identifies sources of pollution which are the ultimate objectives of control regulations and enforcement actions, the subsystem has been designed to support not only the basic functions but also a number of related functions. Prime among these are: 11 ------- AREA SOURCES POINT SOURCES EIS/P&R Inspection Enforcement j Management Federal Reports Legal Actions Control Evaluation Complaints Emergency Action Compliance Source Surveillance PERMITS :.,->Vl^J - vX~ ' iS^i ifl U*-' v Figure 1.3-1. EIS/P&R Concept ------- a. Inspections b. Complaints c. Legal compliance d. Source surveillance. Inspections are supported by the subsystem and, in turn, support the subsystem. EIS/P&R provides a readily accessible medium for recording the results of inspections. The results may appear in the system as verified standard entries or as narrative accounts. The subsystem is supported in the sense that the content of the files are given additional validity and, thus, greater utility because of the verifications achieved during inspections. EIS/P&R can be easily used to verify or identify sources against which complaints have been made. For example, a geographical search of the file could be used to find the possible source or sources that may have given rise to a complaint. Conversely, the data on file may eliminate sources from consideration. Again the flexibility of the system can be used to record the complaint in narrative form once the appropriate source has been isolated. Legal actions are also supported by EIS/P&R which provides a means for recording facts that could be introduced as evidence. Such facts could include as a standard part of the recorded data such items as measured and allowable emissions, pertinent permit information, and appropriate engineering data. Additional material can be included in EIS/P&R to cover such facts as registered complaints, source surveillance reports, and cross references to admissable legal documents. This information can be related to court calendars either by schedules incorporated in the special fields of the EIS/P&R record or by manual cross reference to the schedules maintained by the EMS. All of this capability provides greater management control as well as the means for presenting pending actions for management review. EIS/P&R supplements EMS arid the Compliance Data System by providing a direct association between sources, permits and compliance schedules. Compliance schedules can be inserted as a part of the emission inventory record by using 13 ------- the specialized storage and retrieval capabilities of the subsystem. Thus, all aspects related to the degree of compliance of a source can be recorded within a single record of the source thereby permitting the retrieval and reporting of all pertinent data and compliance data. Such reports can be expected to be useful in establishing inspection schedules, monitoring compliance, highlighting problems, and guiding enforcement actions. Source surveillance information consisting of emission measurements (e.g., stack tests) and special air quality measurements can also be recorded as part of a source record. This information can then be readily available for review when considering source enforcement actions of various types. Of course the interface with EMS for scheduling purposes helps organize the acquisition and use of valuable data. The EIS/P&R consists of a number of programmed packages that perform the following functions: a. Create an Emissions data base from NEDS formatted data b. Create an Emissions/Permit data base from EIS/P&R formatted data c. Add to, change, or delete data in the file d. Selectively retrieve data from the file e. Calculate estimated emission from stored emission factors f. Generate detail emission reports g. Generate detail permit .reports 14 ------- h. Generate summary emission reports i. Generate Federal (NEDS) reports. There are two types of input to the subsystem. They are cards punched according to the National Emissions Data System (NEDS) format, and cards punched according to the EIS/P&R format. Both types are used for creating the original file, while the EIS/P&R formats are generally used for adding to, changing, or deleting data from the file. Major subsystem functions are file maintenance, internal data edits, data retrieval and report generation. File maintenance includes those functions that permit an up-to-date file to be maintained. Three types' of file maintenance transaction are allowed—add, change, and delete. The subsystems can: a. Add new source records b. Add data to existing source records c. Change a data value in a source record d. Delete a source record e. Delete a portion of a source record. In ordinary use, most data values in a record can be used in a query statement to develop simple or compound logical inquiries about the data contents of a source record. Such queries can be used to retrieve from one to all source records, depending only on the type of source information desired. Information retrieved from the master file can be presented in three ways, namely: a. Detailed reports b. Summary reports c. NEDS reports. 15 ------- Detailed reports present all data about the source. Summary reports present a summary of emissions from several sources. NEDS reports are simply the source data presented in NEDS input formats. 1.4 ENFORCEMENT MANAGEMENT SUBSYSTEM (EMS) The Enforcement Management Subsystem was developed in 1972 to provide state and local air pollution agencies with an effective means of controlling enforcement activities (Figure 1.4-1). The subsystem emphasizes management control of the enforcement function and establishes standardized methods of handling data. Presently, a number of changes and additions to improve operation of the system are underway, including a mechanized link with the Compliance Data System (CDS) which will permit state agencies to meet many EPA reporting requirements with a minimum of additional effort. Another improvement will permit better selection of output data. Installation of the system at an air pollution agency should improve the efficiency of the enforcement function and increase management's ability to control these operations. A number of states have been operating the system for varying periods of time. In addition, four states are in the process of installing the EMS. All enforcement staff members of an air pollution agency should benefit from the use of the EMS. Agency management will obtain improved operational control over the enforcement function. They will be able to prepare many periodic reports more efficiently than before. Engineers, inspectors and other staff members should also benefit through easy access to specific source information and improved control and visibility over their routine activities. 16 ------- EMS Schedules Compliance Permits Meetings Action Performance Assessment Source Surveillance Inspections Form Letters Figure 1.4-1. EMS Concept ------- EMS is concerned directly with enforcement activities such as inspections, variances, permits, and any action which the agency undertakes. It is concerned with both historical and future actions, and makes future workloads highly visible to management and section heads through the Future Schedule Summary Report. It improves the agency's ability to evaluate the upcoming workload in terms of available personnel, and permits scheduling changes in activities. For example, at any time, an agency can determine the status of its activities with respect to a given source. These activities can include a history of past contact, current activities, and other actions planned for years into the future, such as a compliance schedule or a routine every-two-years checkback inspection. This whole sequence of activity is available for inspection by staff members and management, showing what has been done, what is due, and what is planned. The availability of this record of all present and planned contact between agency personnel and a specific source generally represents a significant improvement over most state files. One member of a state agency described the EMS as a sophisticated tickler file. That description is largely correct. The ability to store future schedules, or to plan inspections or legally mandated increments of progress, and to present these activities sorted out by time or by staff group, has proven to be useful. Middle management should be able to look ahead and evaluate the upcoming actions that his group will be expected to perform within a given time period. Looking at the realities of daily agency activity, it becomes apparent that technical personnel spend significant time digging out data to support external requests for informatives. Answers to citizens, to the legislature, to other state officials, to the EPA, and to others are needed. An up-to-date EMS data base permits the staff to leave much of the "digging out" of the information to the system. For example, one report permits a staff member to prepare a report listing all agency activity of a designated type, such as inspections, and allows the staff member to select the time period covered, even including future periods if desired. 18 ------- EMS is a dynamic system, not one which simply maintains a static data base for reporting purposes. It features the Action Card. The Action Card is a "turn- around document," in that it is produced by the system and is also designated to be reentered with new information. For example, an increment of progress is scheduled for a certain date. The system prepares one of these cards identifying the upcoming increment of progress. This card goes to the person who performs the needed action. Next, the person who is responsible for the activity can write the results right on the card. If it is a routine action, the staff member need circle only one or two numbers. However, he has the capability of adding English language comments or even scheduling a reinspection or other activity by means of the card. When he has done this, these results are punched into the card itself and it is fed back into the system which updates its own records. If an additional action has been called for, the system will produce a second card at the appropriate time and the process is repeated. A new feature of the EMS is now under development. This will provide a direct link to the national Compliance Data System. We believe it will substantially reduce the effort required by state agencies to meet federally mandated reporting requirements of increments of progress. The link works this way: at reporting time, the operator from the state agency who is running the system schedules a special computer program to be run, using his EMS data base. The special "converter" computer program extracts appropriate information from the state's EMS base, reformats it into CDS input format, and punches out a deck of cards representing that information. State personnel will have an opportunity to review the data being extracted through the printed report to ensure that all data which they need to pass on is included and that it is completed correctly. The reports capable of being produced by the system are as follows: a. Action card b. Action Summary Report c. Edit and Update Report 19 ------- d. Future Schedule Summary Report e. Geographic Locator Report f. Source Action Summary Report g. Source Registration. Additional retrieval capabilities are now being implemented and these will be available within a short period of time. It will be possible, however, to install the existing system and directly go to the retrieval programming without substantial effort. Of particular interest to heads of State agencies, is the basic capability of EMS to provide individual service to local agencies. For example, the centralized operation of EMS by the state provides a respository for the information pertaining to the local agencies operations. Periodic generation of ^reports for distribution provides a tailored schedule monitoring system for each local agency. In this way both state and local agencies can contribute data to the system and utilize its outputs. 1.5 THE AIR QUALITY DATA HANDLING SUBSYSTEM (AQDHS-II) The AQDHS-II is, as a subsystem, similar to all other components of CDHS and is designed to assist agencies resolve problems dependent upon the distribution and trends of air quality. It has basic design functions which are supplemented by built-in flexibility to support related enforcement functions (Figure 1.5-1). Its basic functions include data storage, update and retrieval capabilities, as well as a broad range of statistical processing all of which support air quality monitoring. Related enforcement functions which are supported include: a. Control strategy .evaluation and development b. Recording source surveillance measurements c. Generation of SAROAD reports 20 ------- AQDHS-II Air Quality Management Control Strategy Evaluation Federal Reports Legal Actions Compliance Enforcement Emergency Actions Complaints Trends Projections Figure 1.5-1. AQDHS-II Concept 21 ------- d. Providing data for modeling e. Providing data for legal actions. AQDHS-II stores and maintains measurements of air quality which are input to the subsystem in either of two formats. The SAROAD format can be used and is recommended as a source of data for the initial generation of a file. The AQDHS-II format provides a full range of input capability including adding, changing or deleting information in the file. The more significant processing functions of the subsystem are: a. Data checking b. Statistical calculations c. Automatic generation of Federal reports d. Report generation. Input data is checked for validity in those instances where such action is possible. These checks are supplemented by normal data processing checks to ensure that data is entered in the proper form. For example, numeric fields are checked to see that only numbers have been key punched. Statistical calculations include both arithmetic and geometric means and standard deviations, maximum and minimum values, sliding averages for selectable intervals, and a broad range of percentile values. A special function provides for flags attached to air quality data to indicate whether the data has been reported to the Storage and Retrieval of Aerometric Data (SAROAD) system and whether the data has been changed since last updated to SAROAD. This is valuable in reducing resource needs when generating the quarterly SAROAD input. In addition to the SAROAD input, AQDHS-II generates three major reports for routine agency use. There is a detail report of raw data showing file contents on a site-by-site basis. There is a sliding average report showing a sliding average of a selected pollutant for the usual intervals taken over a twenty-four hour 22 ------- period. Finally, there is a statistical analysis report which contains the results of the statistical processing including the three highest values in the sample set. All reports are titled with descriptive headings. 1.6 IMPLEMENTATION CONSIDERATION All subsystems within CDHS have been designed using simplicity as the major principle. This has resulted in subsystems which can be operated with a minimum of modification on almost any general purpose computer. All programs are written in either ANS COBOL or ANS FORTRAN. All input transactions have standard punched card formats. All system files are organized for either magnetic tapes or disk storage. CDHS implementation activities will take various amounts of time depending on local circumstances. However, a general schedule can be made which should be reasonably applicable. Such a schedule is shown (Figure 1.6-1) which allows somewhat less than five months to accomplish necessary actions to install one or more CDHS subsystems. All components of the system were developed so that they can be installed and used by any agency having access to a minimally equipped computer. The basic equipment requirements are: a. Three tape drives (or disk equivalent) b. 100K Bytes (or equivalent) of CPU memory c. Card Reader d. Printer e. Keypunch 23 ------- Decision to Proceed Computer Arrangements Funding Requests Authority to Proceed Assign Key Personnel Develop Schedule Request Subsystem(s) Request Data Installation and Demonstration Bring System On-line Staff Training Start Routine Use EPA On-Call Services 12345 Figure 1.6-1. Sample Implementation Schedule ------- The CDHS components can, with little modification, be operated on any computer with the capability to: a. Compile ANS COBOL and ANS FORTRAN b. Sort files c. Accept single sequential inputs and single sequential output files d. Print: reports and punch cards. All operations are in batch mode; that is, a deck is submitted for a computer run and at some later time the results are returned to the user. Use of the • system is scheduled and processed, and output is returned to the requestor according to local computer center procedures. Each subsystem builds and maintains its own file from well defined input cards which carry a code directing the computer to add, change, or delete data from its files. Data in the files are automatically maintained in the most useful order by reference to identification data in each record. All system master files are ordered sequentially and may be stored on magnetic tape or disk. Programs have only one master file input and a maximum of one updated master file output. Intermediate "answer" files may be retained so that several different reports may be prepared from the same source records. A file entry can be made as soon as identifying information (plant information for EIS/P&R and EMS and site information for AQDHS-II) is available. Additional data may be added to or changed in the file by creating a change transaction with the proper identifying fields and providing the new or changed data. It is, however, more efficient to collect changes and additions to the file and perform file update activity periodically. 25 ------- The arrangement of individual files of data have been structured to accommodate a large data base which is organized for ease of maintenance and use. This data base will logically include information provided from many diverse sources. Although this would normally create significant duplication of data within the individual files, organization in the hierarchical fashion serves to eliminate much of this duplication and consequently enhances the effective file storage capability. Programs to build and maintain these hierarchical files are available as standard elements of CDHS. These programs offer the data base user a standard symbolic interface which permits him to manage and coordinate individual records in the system without developing an in-depth knowledge of the actual file organization. The ability to group files and data from many diverse sources into one family of logical structures, coupled with the ability to provide conformity between these structures, makes this file organization scheme a highly attractive one for the air quality management system. The CDHS master files are arranged using simple hierarchical structures. This means that in describing a particular phenomenon, the data which is characteristic of all aspects is presented first. Data which relate .only to small facets are associated with the general descriptive information by being filed behind the header information. For example, the emissions data structure places process and engineering descriptive information subordinate to a point source which, in turn, is subordinate to a plant. This type of filing is similar to manual filing systems in which a header ledger card is followed by trailer cards which describe particular events. In one access to the file all generic and detail descriptions can be retrieved. 26 ------- In EIS/P&R, the hierarchical structure is plant, point, process, and permits. In AQDHS-II, the major control is pollutant and location while subset records describe measurements taken through time. The EMS describes a particular pollution source at its highest level of control. Detail data describes action taken to bring the emissions under control. 1.7 PERSONNEL SUPPORT Two types of personnel support are required. The assistance of a computer specialist, a System Coordinator, is necessary to provide those services which deal directly with computer equipment, performing tasks such as establishing local machine procedures, allocating direct access space providing file backup support and other tasks requiring familiarity with computer procedures. This systems support is not normally a full time task and is not involved with the day-to-day use of the contents of files. Other uses for systems support are to provide assistance as required for coding special COBOL language retrievals, and restoring the system in the event of a failure. The requirement for programmer personnel has been minimized. Data support personnel, headed by a Data Base Coordinator, are responsible for data oriented operations including securing input data, coding data input forms, coding retrieval statements, keeping track of the latest version of files, scheduling computer runs, and similar activities designed to ensure efficient use of the system. These people who must rely on the CDHS User's Guides supplemented by locally prepared instructions, may have little or no computer background training. The primary responsibilities of data support personnel are to maintain complete, current data in the files and to use CDHS facilities to prepare reports satisfying management requests. The system requires ,no specialized training for CDHS operations personnel. However, local procedures will be required to organize the use of the subsystems. These deal with standard procedural decisions such as how many 27 ------- generations of each file should be kept, how to minimize use of storage devices, when to update the files, and how to maintain data integrity. A user should designate an individual whose responsibility includes the collection, storage and organization of data; that is, a Data Base Coordinator (DEC) is essential to the subsystem. The DEC sees that inspectors, engineers, and others responsible for collecting data present their data for inclusion in the EIS/P&R data base. He is also responsible for coordinating such activities as: a. Assignment of identification numbers b. Assignment of storage devices c. Organizing and scheduling subsystem use d. Generating periodic reports. For small to medium size data files with moderate usage, one individual is probably more than sufficient. However, as the frequency of use increases and the files expand, the function may exceed the capabilities of one man. In any event he must be supported by keypunching which can be provided in many ways including: a. A keypunch section b. Keypunching by data collectors c. Use of centralized keypunch services d. Contracted services. 1.8 RESOURCES The cost or resources required to operate CDHS vary in proportion to the size of the data base frequency of update and retrieval, the response time requirement, and the rates charged by the local computer center. For most agencies, CDHS 28 ------- will use only a small proportion of a computer's daily schedule. Such usage expressed as CPU time may be on the order of one half-hour a week per subsystem for regular updates by a small agency with relatively infrequent need to extract data from the files. . Larger agencies with many sources and sensor sites could be expected to use the system more frequently, and for planning purposes, should expect to need 1-2 hours per week per subsystem. 1.9 SYSTEM IMPLEMENTATION SCHEDULE Successful installation of CDHS or any one of its subsystems is more assured if an agency carefully considers several scheduling activities and decisions. The first decision, of course, is that which affirms the need for one or more CDHS subsystems. Subsequently, the following actions should be taken: a. Arrangements should be made for appropriate access to a suitable computer. Normally this is achieved by negotiation with the head of the state (or county, or city) central data processing center. These negotiations should consider both the time and frequency of computer use as well as the mechanisms and costs for obtaining access. b. Based upon cost estimates developed during these negotiations, funding authorization should be established. c. At this point, key personnel should be identified and given the responsibility for the implementation plan. Key personnel consist, at a minimum, of the Data Base Coordinator, and the System Coordinator. In all agencies a manager should be identified and given the appropriate managerial responsibility. The Data Base Coordinator should be assigned from the agency staff. The System Coordinator may be assigned from the agency staff, but may be, by special arrangement, a part of the data processing center staff. 29 ------- d. The responsible key personnel should now schedule required actions consisting of at least the following elements: 1. Request for subsystem(s) from EPA through the Regional Office 2. Request for initial data base from EPA, NADB through the Regional Office 3. Subsystem(s) installation and demonstration 4. Bringing the subsystem(s) on-line by creating the initial data base 5. Provision of subsystem(s) training for appropriate agency staff in the selected subsystem(s) 6. Inauguration of routine use. The accomplishment of this action will lead to an easy and well organized installation of CDHS. 30 ------- TECHNICAL REPORT DATA (Please read Instructions on the reverse before completing) 1. REPORT NO. EPA-450/3-76-011 3. RECIPIENT'S ACCESSION-NO. 4. TITLE AND SUBTITLE CDHS Executive Summary Report 5. REPORT DATE March 1976 6. PERFORMING ORGANIZATION CODE 7. AUTHOR(S) The I Corporation - Federal Systems Division Gaithersburg, Maryland 20760 8. PERFORMING ORGANIZATION REPORT NO. 9. PERFORMING ORGANIZATION NAME AND ADDRESS 10. PROGRAM ELEMENT NO. The IBM Corporation - Federal Gaithersburg, Maryland 20760 Systems Division 11. CONTRACT/GRANT NO. 68-02-2054 12. SP QNSORING.AGENCY NAME AJ\I D_A D D R E S S . „ I). S. Environmental Protection Agency Office of Air and Waste Management Office of Air Quality Planning and Standards Research Triangle Park, NC 27711 13. TYPE OF REPORT AND PERIOD COVERED Final 14. SPONSORING AGENCY CODE 15. SUPPLEMENTARY NOTES 16. ABSTRACT Management of air quality requires constant surveillance of air quality and careful recording of all significant pollution sources. Measurements are required to trace progress of specific emission control attempts and of the general state of air quality. Detailed analysis is required to predict future air pollution conditions and to determine the contributory effect of any specific emission. These considerations must, in turn, be related to Air Pollution Control Implementation Plans to ensure that enforcement actions are achieving desired goals. These and the many unmentioned tasks in air quality control add up to a massive data handling requirement. The CDHS was designed to provide using States (via the EPA Regional Offices) with a universal and compatible system for managing air data. It will also provide an efficient mechanism for timely, complete and accurate input of air quality data to the National Systems. This report contains a brief description of the CDHS system and is intended to provide an executive overview of the system. 17. KEY WORDS AND DOCUMENT ANALYSIS DESCRIPTORS Computer Programs Computer Software Data Processing Air Pollution Data Handling Computer Systems Programs b.IDENTIFIERS/OPEN ENDED TERMS c. COSATI Field/Group CDHS AQDHS-II EIS/P&R EMS SAROAD NEDS CDS 13. DISTRIBUTION STATEMENT Release unlimited 19. SECURITY CLASS (This Report) Unclassified 21. NO. OF PAGES 35 20. SECURITY CLASS (This pagei Unclassified 22. PRICE EPA Form 2220-1 (9-73) ------- INSTRUCTIONS 1. REPORT NUMBER Insert the EPA report number as it appears on the cover of the publication. 2. LEAVE BLANK 3. RECIPIENTS ACCESSION NUMBER Reserved for use by each report recipient. 4. TITLE AND SUBTITLE Title should indicate clearly and briefly the subject coverage of the report, and be displayed prominently. Set subtitle, if used, in smaller type or otherwise subordinate it to main title. When a report is prepared in more than one volume, repeat the primary title, add volume number and include subtitle for the specific title. 5. REPORT DATE Each report shall carry a date indicating at least month and year. Indicate the basis on which it was selected (e.g., date of issue, date of approval, date of preparation, etc.). 6. PERFORMING ORGANIZATION CODE Leave blank. 7. AUTHORtS) Give name(s) in conventional order (John R, Doe, J. Robert Doe, etc.), List author's affiliation if it differs from the performing organi- zation. 8. PERFORMING ORGANIZATION REPORT NUMBER Insert if performing organization wishes to assign this number. 9. PERFORMING ORGANIZATION NAME AND ADDRESS Give name, street, city, state, and ZIP code. List no more than two levels of an organizational hirearchy. 10. PROGRAM ELEMENT NUMBER Use the program element number under which the report was prepared. Subordinate numbers may be included in parentheses. 11. CONTRACT/GRANT NUMBER Insert contractor grant number under which report was prepared. 12. SPONSORING AGENCY NAME AND ADDRESS . . . . Include ZIP code. : 13. TYPE OF REPORT AND PERIOD COVERED Indicate interim final, etc., and if applicable, dates covered. 14. SPONSORING AGENCY CODE Leave blank. 15. SUPPLEMENTARY NOTES Enter information not included elsewhere but useful, such as: Prepared in cooperation with, Translation of, Presented at conference of, To be published in, Supersedes, Supplements, etc. 16. ABSTRACT Include a brief (200 words or less) factual summary of the most significant information contained in the report. If the report contains a significant bibliography or literature survey, mention it here. 17. KEY WORDS AND DOCUMENT ANALYSIS (a) DESCRIPTORS - Select from the Thesaurus of Engineering and Scientific Terms the proper authorized terms that identify the major concept of the research and are sufficiently specific and precise to be used as index entries for cataloging. (b) IDENTIFIERS AND OPEN-ENDED TERMS - Use identifiers for project names, code names, equipment designators, etc. Use open- ended terms written in descriptor form for those subjects for which no descriptor exists. (c) COSATI FIELD GROUP - Field and group assignments are to be taken from the 1965 COSATI Subject Category List. Since the ma- jority of documents are multidisciplinary in nature, the Primary Field/Group assignment(s) will be specific discipline, area of human endeavor, or type of physical object. The application(s) will be cross-referenced with secondary Field/Group assignments that will t'ollcm the primary posting(s). 18. DISTRIBUTION STATEMENT Denote releasability to the public or limitation for reasons other than security for example "Release Unlimited." Cite any availability to the public, with address and price. 19. & 20. SECURITY CLASSIFICATION DO NOT submit classified reports to the National Technical Information service. 21. NUMBER OF PAGES Insert the total number of pages, including this one and unnumbered pages, but exclude distribution list, if any! 22. PRICE Insert the price set by the National Technical Information Service or the Government Printing Office, if known. EPA Form 2220-1 (9-73) (Reverse) ------- |