United States Environmental Protection Agency Atmospheric Research and ^ Exposure Assessment Laboratory -y Research Triangle Park NC 27711 / Research and Development EPA/600/S3-88/043 Mar. 1989 &EPA Project Summary Meteorological Processor for Regulatory Models (MPRM-1.1)-User's Guide John S. Irwin, James O. Paumier, and Roger W. Brode Version 1.1 of MPRM provides a general purpose computer processor for organizing available meteor- ological data into a format suitable for use by air quality dispersion models. Specifically, the processor is designed to accommodate those dispersion models that have gained EPA approval for use in regulatory decision making. A unique feature of the processor is the ability to employ user collected meteorological measurements as well as those routinely collected by the National Weather Service (NWS). MPRM-1.1 will support the following dispersion models listed in the Guideline on Air Quality Models (Revised) (EPA, 1986), as well as three screening models: • Those requiring RAMMET for- matted data: BLP, RAM, ISCST, MPTER, CRSTER, and COMPLEX1. • Those requiring STAR formatted data: COM (with either 16 or 36 wind direction sectors), ISCLT, and VALLEY (long-term). • Those requiring special formats: CALINE-3 and RTOM (default). MPRM can be envisioned as a three-stage system. The first stage retrieves the meteorological data from computer tape or disk files and processes the data through various quality assessment checks. The second stage collects all data available for a 24-hour period (up- per air observations, hourly surface weather observations, and data col- lected as part of an on-site meteorological measurement pro- gram) and stores these data in a combined (merged) format The third stage reads the merged meteor- ological data and performs the necessary processing to produce a meteorological data file suitable for use by the specified dispersion model. This Project Summary was devel- oped by EPA's Atmospheric Research and Exposure Assessment Laboratory, Research Triangle Park, NC, to announce key findings of the research project that Is fully documented in a separate report of the same title (see Project Report ordering information at back). Introduction EPA has recently issued guidance on the use of meteorological data, collected via an on-site measurement program, for regulatory modeling applications. The meteorological processors currently available from EPA do not have the capability of processing user collected on-site meteorological data as directed by the guidance. Therefore, MPRM-1.1 has been designed to construct meteor- ological data files of upper air, mixing height, surface observations, and on- site data for air pollution dispersion models that are routinely used in regulatory decision making by EPA. Specifically, the processor is designed to accommodate those dispersion models recommended for use in the Guideline on Air Quality Models (Revised in 1986). ------- Why This Design? Recognizing that the list of approved dispersion models is apt to change, and that those changes are likely to call for use of various processing methods, the new processor is designed to have a highly modular structure. Additionally, the new processor is designed to avoid computer conflict problems. It was further decided to develop an input structure that would ultimately support use of a menu data entry system. This would allow construction of the input through a computer-controlled question and answer session, and possibly facilitate the usability of the processor by a variety of users. What Does MPRM Do? MPRM can be envisioned as a three- stage processing system, as depicted in Figure 1. During the first stage, the processor extracts upper air, mixing height, and surface data from the raw data files delivered from NCDC and on- site data from the raw data files developed from the on-site meas- urement program. The extracted data are processed through a series of quality assessment checks. As a result, reports of missing and suspect values are generated. During the second stage, the processor combines the available data for each midnight-to-midnight 24- hour period (twice-daily upper air soundings and mixing height data, hourly surface weather observations, and hourly on-site data) and stores these data in a combined (merged) format. During the third and final stage, the processor reads the merged data and develops a meteorological data file for the dispersion model selected by the user. Extraction and Quality Assessment (Stage 1 Processing) The goal of this first stage of pro- cessing is to: • Read the on-site and NWS meteor- ological data files • Find the data within the time period specified by the user • Store these data in American Standard Code for Information Inter- change (ASCII) data files • Scan the stored values and report occurrences of missing or suspect values. MPRM can currently process hourly surface observations in CD-144 format, upper air soundings in TD-5600 format, and mixing height data in TD-9689 for- mat. Persons experienced with RAMMET have a working knowledge of hourly surface weather (CD-144) data and twice-daily mixing height (TD-9689) data. NCDC can provide CD-144 and TD-9689 data on computer magnetic tape or on 5 1/4-inch diskettes in a format suitable for use in IBM compatible personal computers (PCs). MPRM in- stalled on a mainframe computer can process these data either from the computer tape or from mass storage data files. MPRM installed on a PC can process these data either from data files on the hard disk or from 5 1/4-inch diskettes. We would at least suggest an IBM-AT or equivalent with 640K random access memory (RAM) and a hard disk. As of 1984, NCDC had converted to new data storage formats for surface I Meteorological Processor for Regulatory Models Stage 1 Processing Extract and Quality Assess Data Stage 2 Processing Combine (Merge) Data Files Stage 3 Processing Create Data File for Modeling Figure 1. Overview of processing stages within MPFtM. observations and upper air soundings namely, elemental formats TD-3280 an< TD-6201, respectively. NCDC nov converts from these formats to servic« requests for CD-144 and TD-560( formatted data. MPRM is designet internally for ready adaptation to thes< new formats. One, if not the first, upgrade will expand the data formats supportec by MPRM to include TD-3290 and TD 6201 formats. Because there is no standard forma for storage of on-site meteorologica data, MPRM is designed to process i variety of on-site data formats by having the user define the structure of the inpu data. The two major restrictions t< consider in processing the on-site dat: are 1) the order in which the data value; are presented and 2) the data file mus be a standard ASCII data file. In principle MPRM will be able to process the on site data as long as the data values fo each observation are ordered correcth (date and time, then meteorologica values) and the observation can be rea< using a FORTRAN FORMAT statement. An additional capability of this firs stage is assessing the quality of the dat. by checking for possible missing o suspect values. Any occurrences c missing or suspect data values an reported before the upper air soundings mixing height data, surface observations and on-site data are combined. The output files from this first stage c processing can be edited using standan text editors routinely available oi computer systems. The only foreseeabd problem in editing these data files is the some text editors have a limitation on th size of the file to be edited. This probler is of most concern when the editing i performed on a PC where the text editc is typically limited to the available RAN/ For example, because of the RAJ limitation, a file consisting of 700 Kb of year of hourly surface observations i CD-144 format could not be edited. . possible solution to this problem is t break the file into parts that can be edite with software designed for this purpose The modified larger file is recreated b concatenating the smaller files. Combining Data (Stage 2 Processing) The goal of this second stage c processing is to: • Combine into one file the availabl on-site and NWS meteorological dal ------- files created during Stage 1 process- ing • Store the data in a more compact format. For the specification of the dispersive state of the atmosphere, it is most convenient to consider the physics of the atmosphere on a daily basis, i.e., a 24- hour period. Estimation of the depth of convective mixing is in reality the summation of effects starting with the heating of the surface shortly after sunrise. Thus, the merging of the available data for each 24-hour period is the next logical step in the processing before developing the characterization of the input meteorological data files for the dispersion models. The merged data are stored in un- formatted form because this format provides a more efficient usage of storage than the formatted ASCII data file storage used during the first stage of processing. The ASCII files are convenient for text editors but are no longer needed once the quality assessment and editing are completed. Creating a Model Input File (Stage 3 Processing) The goal of this third stage of processing is to: • Create a meteorological data file for use with a regulatory dispersion model chosen by the user. MPRM can generate any one of several output formats to meet the input requirements of the regulatory dispersion model chosen by the user. The RAMMET format has been selected as the default output with default methods for processing wind, temperature, stability category and mixing heights. These methods employ the NWS hourly surface weather observations and NCDC twice-daily mixing heights and dupli- cate the processing performed by the RAMMET meteorological processor. 4 Since no "modeling* 'has beten^ performed prior to the third stage of* processing, it is anticipated* that future changes to the modeling guidelines will have the most impact on this stage of meteorological processing. Acceptance of new algorithms for mixing height estimation, or methods for characterizing the variation of wind speed and wind direction with height, would require that new computer algorithms be supplied within MPRM for use at this stage of processing. Acceptance of a new dispersion model might require changes to the output subroutine within MPRM in order to provide the meteorological data in the format required by the new ac- cepted dispersion model. In consid- eration of these possibilities, MPRM has a highly modularized design. This allows upgrading of specific parts of the computer code without having to rede- sign the processor. Relationship of MPRM to EPA Air Pollution Modeling Guidance The data processing methods incor- porated into MPRM are intended to implement the recommendations con- tained in the EPA on-site meteor- ological program guidance document. These recommendations include the determination of Pasquill stability cate- gories from on-site measurements, based on the recommendations in the EPA Guideline on Air Quality Models (Revised). As data processing recom- mendations are modified, MPRM will be upgraded to reflect the latest guidance. Moreover, any discrepancies that might exist between MPRM and current regu- latory guidance should not be construed as guidance, but as errors within the MPRM system. It is not the purpose of this user's guide to provide a comprehensive sum- mary of all relevant guidance on disper- sion modeling for regulatory applications. Other recommendations from the guid- ance document for conduct of an on- site meteorological measurement pro- gram may be relevant to a particular application. For example, the on-site meteorological program guidance docu- ment contains recommendations on in- strument siting and quality assurance. An important recommendation is that a minimum of 90 percent valid data •recovery exist for each variable before a data set can be used for regulatory modeling. The issue of handling missing data values for dispersion modeling is discussed below in more detail. Please note that data substitution cannot be .used to reach the 90 percent data recovery rate required by regulatory modeling guidance. Missing Values, the Bane of All Data Sets When the meteorological conditions are insufficiently declared, a dispersion model will not be able to produce concentration estimates. In the case of dispersion models currently accepted for regulatory applications, the situation is aggravated by the fact that none of the hourly dispersion models can continue processing if the meteorological record is not continuous. The only way to continue processing is to present the dispersion model with an unbroken meteorological record having no missing values. Substitutions based on other on-site data, if available and deemed to be representative, may provide the best solution for providing an unbroken meteorological record. If the situation is such that only 1 hour's data is missing, it might be practical to use linear inter- polation between adjacent hours to estimate the missing meteorological conditions. As the time period of missing values increases, the usefulness as well as the reasonableness of linear inter- polation to fill in missing values becomes increasingly more dubious. Rational- izations involving use of monthly mean values from climatological records are sometimes employed. The fact remains that, given the right circumstances, any technique employed for filling in missing values can prove to be inadequate. A clear consensus has yet to be reached on how best to resolve the dilemma created by a broken meteor- ological record. A possible solution may involve making substitutions for missing meteorological values for isolated 1- hour periods, and treating longer breaks in the meteorological record within the dispersion model. At the very least, the dispersion models might be modified to process available valid data and to skip (or output a missing value indicator for the concentration estimate) those hours when processing could not continue due to missing values in the meteorological record. While the guidance document for conduct of on-site meteorological measurement programs contains a recommended hierarchy of data substi- tution strategies for regulatory ap- plications, the implementation of these recommendations requires expert judg- ment of "representativeness" of the data substitutions. Yet to be developed is a system of numerical rules having sufficient expertise that we can con- fidently recommend their use in a universal sense for automatic processing of missing data values. Since MPRM is constrained to include only processing methods that have been accepted for use in regulatory applications and since we have yet to develop a set of numerical techniques for universal use in handling missing values, MPRM has no automatic method for ------- correcting missing values. If and when techniques are accepted for handling missing values on an automatic basis, they will be incorporated into MPRM, unless of course the resolution is within the dispersion model algorithm. The EPA author, John S. Irwin, is with the Atmospheric Research and Exposure Assessment Laboratory, Research Triangle Park, NC 27711; James 0. Paumier is with the Computer Sciences Corporation, P.O. Box 12767, Research Triangle Park, NC 27709; and Roger W. Brode is also an EPA author with the Office of Air Quality Planning and Standards, Research Triangle Park, NC 27711 D. Bruce Turner is the EPA Project Officer (see below). The complete report, entitled, "Meteorological Processor for Regulatory Models (MPRM-1.1) User's Guide," (Order No. PB 89-127 5261 AS; Cost: $28.95, subject to change) will be available only from: National Technical Information Service 5285 Port Royal Road Springfield, VA 22161 Telephone: 703-487-4650 The EPA Project Officer can be contacted at: Atmospheric Research and Exposure Assessment Laboratory U.S. Environmental Protection Agency Research Triangle Park, NC 27711 States Environmental Protection Agency Center for Environmental Resear Information Cincinnati OH 45268 Boi* Business - Penalty for Private Use $300 EPA/600/S3-88/043 ------- |