1999 EPA Conference o 3 3 3 fD l/l r* fit and Information w Repository Material Permanent Collection &EPA ------- SP 2^2 - ' ~< -J Welcome Letter Agenda \J$ EPA Headquarters and Chemical Libraries EPA West Bldg Roonrl Mailcode 340411 !30i Constitution Av§ Washington DC 2 202-566-0556 Repository Mat Permanent Coll EJBD ARCHIVE EPA 230- R- 99- 002 Abstracts Evaluation Form Registrants ------- af\- ooi- \t> N O ------- ------- Welcome to the 1999 EPA Conference on Environmental Statistics and Information It is my pleasure to welcome you on behalf of EPA's Center for Environmental Information and Statistics to the 1999 EPA Conference on Environmental Statistics and Information. For many "old timers" to this conference, you will immediately recognize that we have changed the name and scope of the conference. Just as EPA is currently planning a new Office of Information, we have expanded our coverage to information in addition to statistics. We welcome new attendees to the conference which covers all aspects of data, information, computer technology, data systems and statistics. We have the data gatherers, the data administrators, the data users, the data analyzers, the data simulators, the data reinventors, the data modelers, the data miners, the data assessors, and those who still can't figure out if data are singular or plural. We hope that all of the attendees will better understand the interrelationships among these groups as the new Office of Information will encompass so many of these activities. This year's theme is "EPA's Vision for the 21st Century". I am well aware that many of this year's conferences on any topic will have similar names. (I also fear any ho-hum building constructed this year will- shortly be labeled "turn of the century" architecture.) However, as EPA is undergoing major changes with the creation of the Office of Information, the Agency is trying to position itself to meet the burgeoning information challenges of the new century, so I feel the theme is truly appropriate. I hope that this conference will enable us to better understand those challenges and approaches to meeting them. We have an exciting collection of plenary sessions, featured talks, concurrent presentations, training sessions, poster/computer sessions and panel discussions. However, as with most meetings, some of the informal opportunities to meet and chat with your colleagues can frequently be the most productive aspect of the conference. I encourage you to take full advantage of this year's campus-type setting to continue your dialogues with your associates. I owe a great deal of thanks to the planning and arrangements committees for their efforts to organize this conference. Special thanks also go to Margaret Conomos and Connie Lorenz who assisted me in putting it all together, and to Temple University's Institute for Survey Research that handled the details and coordination. We encourage you to have a good time, learn a lot, and tell us about any enhancements you would like to see in the future. f \^ \^ 1 fc*"t*.. i ix V Barry D. Nussbaum 1999 Conference Chair Conference Planning Committee Arrangements Committee Henry Kahn Susan Auby Elizabeth Margosches Joan Bundy George Flatman Trudy McCoy Ruth Allen Ed Lloyd John Warren ------- ------- ------- Agenda for the 1999 EPA Conference on Environmental Statistics and Information Monday, May 10,1999 3:00-6:00 REGISTRATION AND CHECK-IN Foyer 4:00-6:00 CONCURRENT TRAINING SESSIONS Woollcott Smith (Temple University) and Peter Petraitis -.. . n 111 *. rn i \ HI I i. »* Dining Room (University of Pennsylvania) - Workshop on Monte Carlo Methods in Environmental Statistics . Joe Anderson (EPA/OIRM) - EPA's Web Site and You Room D 6:00-7:00 Cash Bar Foyer Tuesday, May 11,1999 8:30-9:00 Welcoming Remarks and Introduction of Speakers . Wendy Cleland-Hamnett, Director, EPA Center for Environmental Information and Statistics Peter Goodwin, Dean, Graduate School, Temple University Dining Room 9:00-9:30 Al Morris, Director, Office of Environmental Data, EPA Region Dining Room III, Philadelphia, PA - Information, Statistics and the Region 9:30-10:30 Keynote Address Jay Hakes, Administrator, Energy Information Administration Dining Room 10:00-12:00 Statistical Training Session (Videotapes) RoomC ------- Tuesday, May 11,1999 (Continued) 10:30-10:45 Break 10:45-12:00 CONCURRENT PRESENTATIONS Statistics, Information, and GPRA (Chair: George Bonina, EPA) . Judith Calem Lieberman, (EPA/OCFO), Analytic Challenges and the Government Performance and Results Act George Bonina (EPA) - Reinventing Environmental Information RoomD Local Applications of EPA Data (Chair: Ron Shafer, EPA/CEIS) . Henry Topper (EPA/OPPT) - The Baltimore Community Environmental Partnership: Lessons Learned Kimberly Nelson (Pennsylvania Department of Environmental Protection) - The Department of Environmental Protection Compliance Reporting System . N. Bouwes, Steven M. Hassur (EPA/OPPT), S. Keane, E. Fechner Levy, B. Firlie, and R. Walkling (Abt Associates, Inc.) - Risk-Screening Environmental Indicators Model Dining Room Statistical Methods for Lab and Air Quality Data Analysis (Chair: Larry Cox, EPA/ORD/NERL) . Mary Lou Thompson and Kerrie Nelson (University of Washington) - Statistical Modeling of Multiply- Censored Data . Peter Craigmile (University of Washington) - Trend Estimation Using Wavelets . Joel H. Reynolds (University of Washington) - Meteorological Adjustment of Surface Ozone for Trend Analysis: Pick an Answer, Any Answer RoomB 12:00-1:15 Lunch ------- Tuesday, May 11,1999 (Continued) 1:15-2:30 CONCURRENT PRESENTATIONS Databases: The Manager's View (Chair: Phil Lindenstruth, EPA) Room D . Panel: Phil Lindenstruth - STORET, Abraham Siegel - SDWIS, Mike A. Mundell - PCS (EPA) Ensuring the Quality of Environmental Information Dining Room (Chair: Nancy Wentworth, EPA/ORD) Nancy Wentworth (EPA/ORD) - Quality Assurance and Environmental Information Malcolm Bertoni (Research Triangle Institute) - Using SimSITE to Illustrate Sampling Techniques Models and Model Assessment of Environmental Data Room B (Chair: Mary Lou Thompson, University of Washington) Rafael Ponce (University of Washington) - Development of a Linked Pharmacokinetic-Pharmacodynamic Model of Methylmercury-Induced Developmental Neurotoxicity Samantha Bates, Cullen, A. C., and A. E. Raftery (University of Washington) - Bayesian Model Assessment . Marianne Turley, E. David Ford, and Joel Reynolds (University of Washington) - Pareto Optimal Multi- Criteria Model Assessment 2:30-4:30 Statistical Training Session (Videotapes) Room C 2:30-2:45 Break ------- Tuesday, May 11,1999 (Continued) 2:45-4:00 CONCURRENT PRESENTATIONS Use of the Internet for Sharing Statistics Dining Room (Chair: Steve Hufford, EPA/CEIS) Pat Garvey (EPA/OIRM) - Envirofacts Warehouse: Environmental Data On The Internet - Empowering the Citizen toward Environmental Protection and Awareness Anne Frondorf (USGS) - National Biological Information Infrastructure Chris Miller (NOAA) - National Environmental Data Index . Bob Shepanek (EPA/ORD) - EPA's Environmental Information Management System Epidemiology and Risk Assessment Cumulative and/or Aggregate Room B Risk Assessment (Chair: Ruth Allen, EPA) . David Miller (EPA/OPPTS/OPP) - Food Quality Protection Act and its Implementation: an Overview of Statistical and Probabilistic Issues Facing the Office of Pesticide Programs Hans D. Allender (EPA/OPP/HED) - Finding a Statistical Distribution to Use in the Monte Carlo Exposure Assessment of Livestock Commodities . Breeda Reilly (EPA/CEPPO) - Applying Epidemiology to Study the Prevention of Major Chemical Accidents Statistical Research Issues in Quality Assurance (Chair: John Room D Warren, EPA/ORD) John Warren - Integrating Data Quality Indicators (DQIs) into Data Quality Objectives (DQOs) Charles White (EPA/OW/OST) - A Performance Evaluation of the Method Detection Limit ------- Tuesday, May 11,1999 (Continued) 4:15-5:15 PLENARY SESSION (Chair: Wendy Cleland-Hamnett, EPA/CEIS) Dining Room . Corrinne Caldwell, Acting Provost & Vice President, Temple University - Welcoming Remarks . Tom Curran (EPA/OAR) - Data, Information and Statistics: Putting it All Together for Decision-Making 5:15-8:00 POSTER AND COMPUTER SESSIONS . George T. Flatman (EPA/ORD/NERL) - Satellite data for Landscape Ecology . Lawrence Lehrman (EPA/RMD/OIS) - Cluster analysis of fish species and land use . Connie Lorenz (EPA/CEIS) - All the Stats that are Fit to Surf . Brand Neimann (EPA) - Digital Library Demonstration . Stuart H. Kerzner (EPA/Region III) - Information Visualization - Turning Data into Information You Can Easily Understand . Maliha S. Nash (EPA/LEB) - Geostatistical Analysis of Ecological Indicators . Arthur Lubin (EPA/OSEA) - Environmental Random Stratified Sampling Designs Developed Via Cluster Analysis . Heather Case (EPA Customer Service) - What the CEIS National Telephone Survey will be able to Tell EPA's Information Providers . Susannah Dillman (EPA/OPPTS/OPPT) - Methods to Minimize Human Error in Reporting Analysis Results . John S. Graves (EPA/Region III) - Using Perl Scripts to Import Data into GIS: An Example Using USGS Ground Water Site Inventory Data Rich Heiberger (Temple University) - Demonstration of ESS, S-Plus, and Trellis Graphics . William P. Smith (EPA/OPPE/CES) - CD Toxic Release Inventory (TRI) Data Explorer Room A 5:30-6:30 Wine and Cheese Party - Hosted by William Tash, Vice Provost for Research, Temple University Pool House ------- Wednesday, May 12,1999 8:00-9:00 CONCURRENT PRESENTATIONS Data Integration and Quality: Vision for the Future (Chair: Ruth Allen, EPA) . Susan Devesa, D. Grauman, W. Blot, G. Pennello, R. Hoover, and J. Fraumeni (NCI) - Atlas of Cancer Mortality in the United States, 1950-94 Ruth Allen (EPA) - Surveillance Improvement Report Dining Room Analysis of Cleanups (Chair: Michael J. Messner, EPA/OGWDW) Room B . Michael J. Messner (EPA/OGWDW) - Cryptosporidium Occurrence in the Nation's Drinking Water Sources . Bimal Sinha (EPA/OPPE/CES) - Statistical Estimation of Average Reid Vapor Pressure of Regular Gasoline Sampling and Design Issues in Environmental Studies (Chair: Tony Olsen, EPA/NHEERL) . David Marker (WESTAT) - Sample Designs for Environmental Data Collection: Ranked Set Sampling and Composite Sampling . Paul D. Sampson (University of Washington) - Monitoring network design with applications to regional air quality Room D 9:00-9:15 Break 9:15-11:15 Statistical Training Session (Videotapes) Room C ------- Wednesday, May 12,1999 (Continued) 9:15-10:45 CONCURRENT PRESENTATIONS Some Analyses and Potential Analyses at EPA (Chair: Doreen Sterling, EPA/CEIS) . Mike Barrette (EPA/OE) - Integrating Data for Planning and Targeting . Tom DeMoss and Tom Pheiffer (EPA) - The Mid Atlantic Integrated Assessment Program (MAIA) . John Moses (EPA/CEIS) - Strategy to Address Evolving Environmental Information Needs RoomD Measurement Issues Related to our Water Supply (Chair: Barnes Johnson (EPA/OSWER/OSW) . Andrew Schulman, Jennifer Wu, and Benjamin Smith (EPA/OGWDW) - Forays into the Unforgiving - Occurrence Estimation in the Realm of Data with Multiple Censoring Points (arsenic in the public water supply) . Henry Kahn, Helen L. Jacobs, and Kathleen A. Stralka (EPA/OW/EAD) - Estimated Water Consumption In The U.S. Based On The CSFII . Virginia A.Colten-Bradley (EPA/OSWER/OSW) - Development of a Neural Network Tool for Evaluation of Waste Management Unit Designs Dining Room Assessing Risk (Chair: Elizabeth Margosches (EPA/OPPTS) . Mary Marion (EPA/OPPT) - Simulation and Acute Dietary Risk Assessments . David Pawel (EPA/OAR) - Proposed EPA Methodology for Assessing Risks from Indoor Radon . Elizabeth H. Margosches, Ph.D., Jennifer Seed, Ph.D., and Khoan T. Dinh, Ph.D. (EPA/OPPTS) - Health Data: How Do We Use It To Protect the Public/Environment? Margaret Conomos (EPA/CEIS) - Discussant RoomB ------- Wednesday, May 12,1999 (Continued) 10:45-11:00 Break 11:00-11:30 PLENARY SESSION (Chair: Barry Nussbaum, EPA/CEIS) Woollcott Smith (Temple University) - A Walk on the Wild Side of Statistical Communication Dining Room 11:30-12:30 PLENARY SESSION (Chair: Doreen Sterling, EPA/CEIS) Robert English (EPA) - Proposed Information Management Office Dining Room 12:30-2:00 Lunch 2:00-4:00 Statistical Training Session (Videotapes) Room C 2:00-3:30 CONCURRENT PRESENTATIONS The Visual Presentation of Data (Chair: Al Morris, EPA, Dining Room Region III) . Al Morris (EPA) - Enviroviz-Turning Numbers into Visual Relationships . David Mintz (EPA/OAR/OAQPS) - Methods for Displaying Temporal and Spatial Trends . Daniel Carr (George Mason University) - Two Templates for Visualizing Georeferenced Statistical Summaries Listening To Our Information Customers (Co-Chairs: Brendan Room D Doyle (EPA/CEIS) and Margaret Morgan-Hubbard (EPA/Office of Communications) . Panel: Margaret Morgan-Hubbard, Director, EPA Office of Communications, Brendan Doyle, (Acting) Director, CEIS Customer Survey and Access Division, Emma McNamara, (Acting) Director, EIMD, OIRM, and Pat Bonner, EPA Customer Service 8 ------- Wednesday, May 12,1999 (Continued) Application of Sampling in Aquatic Resources (Chair: Henry Kahn, EPA/OW/EAD) . Henry Kahn and Silvestre Colon (EPA/OW/EAD) - Composite Sampling Analysis of Contaminant Levels in Fish . Anthony R.Olsen (EPA/NHEERL) - National Fish Tissue Contaminant Lake Survey: A New Spatially-Restricted Survey Design Barnes Johnson (EPA/OSWER) - How to Survey Water Designs 3:30-3:45 Break Room B 3:45-5:15 PLENARY SESSION: Statistics and Information at EPA as we Start a New Century: Where Are We Going? (Chair: Phil Ross, EPA/CEIS) . Larry Cox (EPA/ORD/NERL) . Karen Klima (EPA/IWI) . Heather Case (EPA/CEIS) G.P. Patil (Pennsylvania State University) Dining Room Thursday, May 13,1999 8:30-10:30 TRAINING Steven P. Millard (PSI) - Applying Monte Carlo Simulation Techniques with S-PLUS Dining Room 9:00-10:30 PRESENTATIONS (Note this time overlaps with above training) The Data Come In, the Data Go Out Room D . Rick Westlund (EPA/OP) - Reducing Paperwork Burdens at EPA . Charlotte Cottrill (EPA/ORD) - EMPACT's Role in the 21st Century 9 ------- Thursday, May 13,1999 (Continued) 10:30-10:45 Break WRAP-UP SESSION (Chair: Barry Nussbaum, EP/VCEIS) Dining Room 10-45 11 -45 * William Raub, Deputy Assistant Secretary for Science Policy, Department of Health and Human Services - Perspectives on Data and Information from the Department of Health and Human Services 11:45-12:00 Door Prize and Closing Remarks Dining Room 10 ------- stats sched wpd Page 1 ' SCHEDULE FOR CEIS EXHIBIT SUPPORT FOR THE CEIS STATS CONFERENCE IN PHILLY MAY 10-13 Note: The last people manning the exhibit each day will need to assure the PC's are safely secured and that the people manning the booth the next morning will know where the PC's are and will have access to them. MONDAY. MAY 10 Exhibit set up - 2:00 - 4:00 p.m. Connie Lorenz *Margaret Conomos TUESDAY. MAY 11 Staff for Exhibit (Including assuring PC set up, Internet access, etc.) 10:00 -12:00 a.m. *Nathan Wilkes Connie Lorenz 1:15-3:00 p.m. Lee Ellis Margaret Conomos 3:00-5:15 p.m. Ed Brandt Lee Ellis 5:15-8:00 p.m. (Will allow for runs to the wine and cheese as needed) Connie Lorenz Nathan Wilkes Margaret Conomos WEDNESDAY. MAY 12 Staff for Exhibit 8:00 -10:00 a.m. Lee Ellis Ed Brandt 10:00-12:00 a.m. Connie Lorenz ------- stats sched wpd Page 2 Nathan Wilkes WEDNESDAY. MAY 12 fConf d) 2:00 -5:15 p.m. Margaret Conomos Lee Ellis THURSDAY. MAY 13 Staff for Exhibit 8:30 -10:30 a.m. Lee Ellis 10:30-12:00 a.m. *Ed Brandt *Margaret Conomos Note: Ed and Margaret will break the exhibit down, package it up and assure its return, along with OP's lap top PC's, to Waterside Mall. ------- ------- Notes ------- ABSTRACTS 1999 EPA Conference on Environmental Statistics and Information 4:00-6:00 Monday, May 10 CONCURRENT TRAINING SESSIONS WORKSHOP ON MONTE CARLO METHODS IN ENVIRONMENTAL STATISTICS Woollcott Smith Statistics Department, Temple University Peter Petraitis Biology Department, University of Pennsylvania This workshop is divided into two parts: 1. Smith will present an overview of modern computer intensive Monte Carlo methods. The review will include the statistical motivation as well as technical and philosophical advantages and disadvantages in using these methods in administrative and legal settings. We will briefly describe how these methods are used to attack hard statistical problems in missing data imputation, measurement error and Bayesian analysis. Finally the details of randomization and simulation methods will be illustrated using a basic aired comparison design. 2. Petraitis will present a case study on the pros and cons of using randomization methods as an alternative to analysis of variance and the analysis of covariance. ------- 10:45-12:00 Tuesday, May 11,1999 STATISTICS, INFORMATION, AND GPRA (CHAIR: GEORGE BONINA, EPA) ANALYTIC CHALLENGES AND THE GOVERNMENT PERFORMANCE AND RESULTS ACT Judith Calem Lieberman OCFO, US Environmental Protection Agency The Government Performance and Results Act (GPRA) of 1993 set into motion a spate of activity in Agency strategic planning and accountability. In essence a legal constitution for good management, the GPRA requires federal agencies to set goals, measure performance, and report on the degree to which goals are met. It also places emphasis on attaining results rather than tracking program activities. The Office of the Chief Financial Officer has been leading EPA's effort to meet GPRA's statutory requirements, which includes development of a 5-year strategic plan, annual performance plans (and budgets), and annual performance reports. During the first cycle of GPRA implementation, several analytical challenges have been revealed. The most significant ones relate to identification of outcome goals, development of performance measures, validation/verification of performance data, and comparison of performance with annual goals. Working through these challenges will require a good understanding of the Agency's mission, a little creativity and the analytical skills to understand the impact of program activities on environmental results. LOCAL APPLICATIONS OF EPA DATA (CHAIR: RON SHAFER, EPA/CEIS) THE BALTIMORE COMMUNITY ENVIRONMENTAL PARTNERSHIP: LESSONS LEARNED Henry Topper US Environmental Protection Agency In this case study, participants in the Baltimore Community Environmental Partnership will describe their experiences and present lessons they have learned. The experience presented will be based on a three-year project involving a Partnership among the residents, governments, and businesses in south Baltimore and northern Anne Arundel County. This Partnership worked together to begin addressing the long term environmental and economic concerns in four neighborhoods in south Baltimore and northern Anne Arundel County. For many years, both residents and businesses in this heavily industrialized section of the metropolitan Baltimore area have expressed concerns about health and the environment in their neighborhoods. By working ------- together in a Partnership, the community completed a comprehensive review of all aspects of its environment and has begun work to implement a plan to make real improvements. The Partnership has taken a holistic view of community problems and has developed efforts to address a broad range of issues facing the community including health concerns, housing issues, illegal dumping, subsistence fishing, park restoration and enhancement, community gardening, economic development, air quality, and crime. Based on this holistic approach, the Partnership has begun to develop an understanding of the complexity of the environmental stresses facing the community and the need for a multifaceted approach to improving community health and building a sustainable community. In the area of community health, Partnership committees are now working to address the issues of indoor air, fish consumption, truck traffic, and industrial toxic releases. As a part of this effort, the Air Committee of the Partnership completed a comprehensive screening analysis of air releases from all the businesses and facilities in and around the Partnership area. This analysis, based on exposure modeling, has given the community information on the cumulative concentrations of toxics from all sources in each of the four Partnership neighborhoods. The Air Committee has developed a protocol to compare these modeled concentrations with established health effect values to determine areas for pollution prevention. The committee has also developed a protocol and screened for potential combined effects of multiple chemicals that have similar target organs, e.g. all the chemicals that are respiratory tract irritants. As a result of the work of the Air Committee, the community now has some key parts of the information it needs to monitor and improve the local environment. THE DEPARTMENT OF ENVIRONMENTAL PROTECTION COMPLIANCE REPORTING SYSTEM Kimberly Nelson Pennsylvania Department of Environmental Protection The Pennsylvania Department of Environmental Protection (DEP) has made significant strides in improving data management. PA DEP has successfully integrated across more than 12 programs data to present a holistic view of the people and places it regulates. The data reside in the DEP client/site database which is fully integrated with departmentwide application processing and compliance reporting systems. The DEP compliance reporting system is one of the few systems in the country that can track multi-media inspections, violations, penalties and enforcement actions for a single facility and is the only system in the country that is on-line for citizens to track compliance activities. The client/site system also is integrated with the department's new Pennsylvania Facility Analysis System, a web based GIS application that went on-line for the public in March. Currently, the department is focusing priority attention on an Environmental Futures Team whose charge it is to develop a plan for measuring environmental outcomes. ------- OPPT'S RISK-SCREENING ENVIRONMENTAL INDICATORS MODEL* Bouwes, N. and Hassur, S. Office of Pollution Prevention and Toxics, U.S. Environmental Protection Agency S. Keane, £, Fechner Levy, B. Firlie, and Walkling, R. Abt Associates, Inc. The Toxics Release Inventory (TRJ) provides raw data on the quantities of chemicals released by US manufacturing facilities, but these raw data alone do not provide information about the relative toxicity or exposure potential of these releases. The Office of Pollution Prevention and Toxics (OPPT) of the US EPA has created the Risk-Screening Environmental Indicators Model to provide a risk-based perspective of these releases, in a PC-based model. The Indicators Model integrates toxicity scores with a measure of exposure potential and the size of the potentially exposed population to calculate individual Indicator Elements for each combination of facility, chemical, and release media reported under TRI. Each year of reporting generates approximately 250,000 of these Elements which are summed to provide overall Indicator Values. The Indicator Elements can also be summed to create sub-Indicators that rank relative impacts by medium, chemical, geographic area, industry sector or a combination of these and other variables. This flexibility provides the analyst with the opportunity to examine trends year-to- year, and to rank and prioritize chemicals, industries and regions for strategic planning, risk- related targeting for enforcement and compliance purposes, and community-based environmental protection. The model also permits the user to investigate the relative influence of toxicity, exposure and population on the results. *Work supported under EPA Contract Number 68-W6-0021, WA#3-02. STATISTICAL METHODS FOR LAB AND AIR QUALITY DATA ANALYSIS (CHAIR: LARRY COX, EPA/ORD/NERL) STATISTICAL MODELING OF MULTIPLY CENSORED DATA Mary Lou Thompson and Kerrie Nelson The National Research Center for Statistics and the Environment, University of Washington Laboratory analyses in a variety of contexts may result in doubly left censored measurements, i.e. amounts of contaminants of concern may be reported by the laboratory as "non-detects" or "trace". The analysis of singly censored observations has received attention in the biostatistical (e.g. in the context of survival analysis) and in the environmental literature. We consider maximum likelihood and semi-parametric approaches to linear models in the doubly censored setting. ------- TREND ESTIMATION USING WAVELETS Peter Craigmile Department of Statistics, University of Washington A common problem in the analysis of environmental time series is how to deal with a possible trend component, which is usually thought of as large scale (or low frequency) variations or patterns in the series that might be best modeled separately from the rest of the series. Trend is often confounded with low frequency stochastic fluctuations, particularly in the case of models such as fractionally differenced processes (FDPs), which can account for long memory independence (slowly decaying auto-correlation) and can be extended to encompass non- stationary processes exhibiting quite significant low frequency components. In mis talk we assume a model of polynomial trend plus fractionally differenced noise and apply the discrete wavelet transform (DWT) to separate a time series into pieces that can be used to estimate both the FDP parameters and the trend. The estimation of the FDP parameters is based on an approximation maximum likelihood approach that is made possible by the fact that the DWT decorrelates FDPs approximately. Once the FDP parameters have been estimated, we can then test for a non-zero trend. After outlining the work that we have done to date on testing for non- zero trends, we demonstrate our methodology by applying it to an air quality time series. METEOROLOGICAL ADJUSTMENT OF SURFACE OZONE FOR TREND ANALYSIS: PICK AN ANSWER, ANY ANSWER Joel H. Reynolds NRCSE, Department of Statistics, University of Washington A variety of statistical methods for meteorological adjustment of surface ozone have been proposed in the literature over the last decade. As part of a larger review of the literature, we summarize and compare six different methods applied to the analysis of surface ozone observations in the Chicago region from the 1981 -1991 period: nonlinear regression, regression tree models, extreme events models, time-series filtering, nonlinear additive time-series models, and canonical covariance analysis. Differences in the resulting trend analyses are discussed in terms of differences in each analysis' spatial domain and choice of ozone statistic. The review highlights the need for development of techniques for extreme value analysis of space-time processes. ------- 1:15-2:30 Tuesday, May 11,1999 DATABASE: THE MANAGER'S VIEW (PANEL SESSION) Philip Lindenstruth, Michael A. Mundell, and Abraham Siegel US Environmental Protection Agency The Panel will present for discussion several issues involved in the administration of a national database. These issues start with requirements for the database and addresses optional data fields, data quality, data ownership, database management issues, and support for the system during its life cycle. Those on the Panel would like their initial presentations to stimulate a discussion of these issues with the attendees. ENSURING THE QUALITY OF ENVIRONMENTAL INFORMATION (CHAIR: NANCY WENTWORTH, EPA/ORD) USING SIMSITE TO ILLUSTRATE SAMPLING TECHNIQUES Malcolm J. Bertoni Center for Environmental Measurements and Quality Assurance, Research Triangle Institute The Simulated Site Interactive Training Environment (SimSITE) is a computer-based training support system that helps environmental scientists and engineers learn how to plan a field investigation at a hazardous waste site. Through the use of a graphical user interface provided by the ArcView geographic information system (GIS), training participants apply concepts such as Data Quality Objectives (DQOs), Data Quality Indicators (DQIs), statistical sampling design, and Data Quality Assessment (DQA). SimSITE contains statistical design and analysis tools and sampling simulation routines that allow the participants to develop and implement sampling plans that satisfy their DQOs. SimSITE then generates a data set (including sampling and measurement errors), and allows the participants to make decisions about whether or not to clean up areas of the artificial site, based on their statistical analysis of the data. At the end of the simulation, the features of the underlying true contamination are revealed to illustrate the phenomenon of decision errors. During this interactive presentation, the features and classroom uses of SimSITE will be demonstrated. ------- MODELS AND MODEL ASSESSMENT OF ENVIRONMENTAL DATA (CHAIR: MARY LOU THOMPSON, UNIVERSITY OF WASHINGTON) DEVELOPMENT OF A LINKED PHARMACOKINETIC- PHARMACODYNAMIC MODEL OF METHYLMERCURY-INDUCED DEVELOPMENTAL NEUROTOXICITY T.A. Lewandowski, S.M. Bartell, R.A. Ponce, C.H. Pierce, and E.M. Faustman Department of Environmental Health, University of Washington Methyl mercury (MeHg) has been shown to cause adverse developmental effects in human and animal conceptuses exposed in utero. A toxicological model of the disposition and cellular action of MeHg in the developing fetus can be used to estimate health outcomes for various levels of exposure. Modeling can also incorporate differences in dose rate, chemical species, or inter-species variability. A linked toxicokinetic and toxicodynamic model for MeHg has been developed for the rat based on work performed in our laboratory. The toxicokinetic model incorporates many of the changes in organ size and blood flow associated with gestation. In the toxicokinetic model, changes in the population of committed fetal neural cells have been estimated based on the observed effects of MeHg on rates of cellular death, proliferation and differentiation in vitro. We are currently determining these rates in vivo using BrdU-Hoechst flow cytometry. The toxicokinetic model demonstrates an adequate fit to experimental toxicokinetic data. For example, 3 days after a dose of 1 mg/kg (given on day 16 of gestation), the model predicts fetal brain and fetal blood levels within 10% of the values observed by Wannag (1976). In terms of toxicodynamic effects, the model predicts 20% and 65% decreases in the number of committed neural cells (on gestational day 15, relative to untreated baseline) at fetal brain concentrations of 10 and 50 umol/kg. It is anticipated that the existing model can be extended to address other species (i.e., humans) and other developmental toxicants which act by similar mechanisms (i.e., cell cycle disruption). Sponsored by the following grants: USEPA R825358 and CR825173 and NIEHS T32ESO-7032. BAYESIAN MODEL ASSESSMENT Samantha Bates and A. E. Raftery Department of Statistics, University of Washington, Cullen, A.C. Graduate School of Public Affairs, University of Washington In this paper we discuss a Bayesian method of analysis which incorporates both prior knowledge of the distributions of the inputs to a deterministic model and any available data on the model inputs and outputs. This method uses Monte Carlo simulation from the prior distributions for the inputs and resampling of these simulations with weights determined by the observed data under ------- the sample importance resampling scheme of Rubin. The method yields posterior distributions for the output from which to find distributions for quantities of interest. The method also allows the separation of the contributions of variability and uncertainty on the posterior distribution of soil concentration. We will present an application of this method to modeling poly-chlorinated biphenyl (PCB) concentrations in various media at a Superfund site in New Bedford Harbor (NBH), MA. Dredging during this clean-up of the Harbor exposes inhabitants of the surrounding region to PCB contaminated air, soil and plants. A deterministic model for PCB concentration in soil was developed by Cullen (1992). The Bayesian method is used to find distributions for the PCB concentration in soil at this site. In addition we will contrast the results of this Bayesian method with those of a traditional Monte Carlo approach and a trial-and-error approach. PARETO OPTIMAL MULTI-CRITERIA MODEL ASSESSMENT Marianne Turley, E. David Ford, and Joel Reynolds University of Washington Evolutionary computation (EC) is an optimization technique for finding Pareto optimal solutions to multiple objective functions. It borrows ideas from evolutionary theory to direct the optimization search through the parameter space. We applied this optimization to process models to improve model assessment by requiring a solution, a model parameterization, to achieve multiple criteria simultaneously. In this talk, I will discuss the algorithm, two alternative search errors and some examples. 2:45-4:00 Tuesday, May 11, 1999 USE OF THE INTERNET FOR SHARING STATISTICS (CHAIR: STEVE HUFFORD, EPA) ENVIROFACTS WAREHOUSE: ENVIRONMENTAL DATA ON THE INTERNET - EMPOWERING THE CITIZEN TOWARD ENVIRONMENTAL PROTECTION AND AWARENESS Pat Garvey US Environmental Protection Agency Governments and the courts are acknowledging more and more that the Public has a right to know what is being discharged and released to the environment. The U S Congress and the Executive Branch have taken decisive action to ensure this public right to access of data and information. The U.S. EPA created the Envirofacts Warehouse to provide the public with direct access to the vast amounts of information and data in its national program environmental data systems. The 8 ------- Envirofacts Warehouse helps EPA fulfill its responsibility to make information available to the public, as required by federal legislation and Executive Order. Envirofacts is available from the Internet, (www.epa.gov/enviro) allowing EPA to disseminate information quickly and easily. Envirofacts Warehouse contains: a relational database of the national databases on Superfund (abandoned hazardous waste) sites, hazardous waste handlers, discharges to water, toxic releases, air releases, and drinking water suppliers, the relational database also contains the facility index system, the Envirofacts Master Chemical Integrator, locational reference tables, and, spatial data and demographic data from the other sources. Internet applications are available and part of the Envirofacts Warehouse Internet site to provide easily designed queries to the databases and to create maps and other reports. The Presentation shows the capabilities and reasons for the Envirofacts Warehouse. The presentation demonstrate the features and principles behind the design of the Web site, the database design and model and demonstrates the various application features and query options from the Web. The presentation will demonstrate: How On-line Queries and Results are useful to the concerned public, interested organizations, governmental regulatory staff and to Environmental Officer of a plant, facility or company; CIS Mapping capabilities and Outputs that are On-line and what are the CIS capabilities in the future; Data refresh schedules and the importance of On-line Documentation; and C'usiomer Feedback procedures for data quality and user needs. The presentation will address the US EPA directions and program initiatives in public access of governmental data and community empowerment with environmental data. NATIONAL BIOLOGICAL INFORMATION INFRASTRUCTURE Anne Frondorf U.S. Geological Survey This presentation will provide a brief description/overview of the National Biological Information Infrastructure (NBII) program, a collaborative effort to build a distributed, Internet- based federation of biological science data, information and analytical tools. Examples of the types of data and information available from the NBII and the types of different agencies and organizations and partnerships involved in building the NBII will be provided. Two key elements of the NBII "infrastructure" (i.e. the standards-related activities that help to support and pull together this distributed data network) will be highlighted. These are the ------- development of a biological metadata content standard (and an accompanying biological metadata clearinghouse network) and the continued development of the Integrated Taxonomic Information System (ITIS) as a standard reference for biological nomenclature and taxonomy ITIS is a partnership among USGS, EPA, NOAA, USDA, and the Smithsonian Institution. EPA'S ENVIRONMENTAL INFORMATION MANAGEMENT SYSTEM Bob Shepanek Office of Research and Development, US Environment Protection Agency Presented is an integrated vision for scientific information management approaches supporting monitoring and assessment activities within the US EPA's, Office of Research and Development (ORD). This vision was developed based upon lessons-learned from the implementation of several scientific information management systems and from development of the ORD's strategic and implementation plans for scientific information management. The vision reflects that effective management of scientific information must address technical, cultural and management challenges. Technical challenges include management and integration of metadata, data, and the modeling, analysis, and visualization tools used as part of assessment activities. Cultural challenges relate mainly to the protection of intellectual capital produced by individual investigators. Management issues include commitment of adequate resources for systems development and operation, support for related policies and procedures, and appropriate incentives for involvement by staff and project participants. EPIDEMIOLOGY AND RISK ASSESSMENT CUMULATIVE AND/OR AGGREGATE RISK ASSESSMENT (CHAIR: RUTH ALLEN, EPA) FOOD QUALITY PROTECTION ACT AND ITS IMPLEMENTATION: AN OVERVIEW OF STATISTICAL AND PROBABILISTIC ISSUES FACING THE OFFICE OF PESTICIDE PROGRAMS David Miller US Environmental Protection Agency With the passage of the Food Quality Protection Act, the Agency's Office of Pesticide Programs is now required to aggregate risks from pesticides across exposure pathways and to accumulate risks from pesticides across chemicals. As a result and in an attempt to develop better risk and exposure estimates that consider the probabilities associated with simultaneous exposures, the Office of Pesticide Programs is now using probabilistic (Monte Carlo) techniques in its risk and exposure assessments. This had necessitated that OPP develop further refinements to its risk assessment procedures. This presentation will provide an overview of FQPA and discuss its major science impacts. It will review the traditional (deterministic) type methods used by OPP in exposure and risk assessments as well as the probabilistic techniques now being used with increasing frequency. Finally, it will review some of the statistical and policy issues which are 10 ------- now being considered by the Office as it implements the probabilistic risk analysis framework now in place. FINDING A STATISTICAL DISTRIBUTION TO USE IN THE MONTE CARLO EXPOSURE ASSESSMENT OF LIVESTOCK COMMODITIES Hans D. Allender, Ph.D., P.E. US Environmental Protection Agency The presentation develops a methodology to find a frequency distribution of animals' contamination because of the ingestion of pesticide-contaminated food. Given the percentage of crops treated (%CT), the methodology calculates the distribution of animals that will be exposed. Determination of the frequency distribution can be used later in connection with the application of a Monte Carlo Analysis to the Exposure Assessment of humans to contaminated animal products. The flexibility of the method allows the construction of frequency distributions to multiple cases with different %CT. A non-agricultural example explains the process in a way that everyone can relate to the calculations. The ubiquitous spreadsheet is used as the preferred medium to obtain random numbers, recalculate probabilities, generate totals, and produce graphics. A detail explanation of how the spreadsheet is constructed ensures the audience the possibility of duplicating the exercise. The simplicity of the methodology makes the process easy to replicate and to extend to similar situations. It also allows the study of severe contamination by pointing out the percentage of animals which diet has been contaminated from different sources. In summary, the article indicates a way of calculating a realistic statistical distribution of animal contamination based on ingestion of contaminated food. Also, the procedure can be extended to non-agricultural situations. APPLYING EPIDEMIOLOGY TO STUDY THE PREVENTION OF MAJOR CHEMICAL ACCIDENTS Breeda Reilly Chemical Emergency Preparedness and Prevention Office, US Environmental Protection Agency Mandated by the Clean Air Act Amendments of 1990, accident histories from some 69,000 chemical facilities in the United States will become available in the fall of 1999. This presentation describes the challenges of using the tools of epidemiology with this data to investigate drivers of severity and frequency of accidents. This study was proposed by Center for Risk Management and Decision Processes at the Wharton School and is a major focus of an EPA cooperative agreement. The Major Accident Epidemiology Project aims to contribute to the process of determining which plants are most likely to incur major events, by ascertaining whether certain predictors (characteristics of manufacturing plants or of the companies that own them) are associated with increased probability of a major event. This knowledge can be helpful in two ways: (1) plants with such risk factors can be monitored more closely (by the companies themselves as well as by regulators and other stakeholders); and (2) these associations may provide clues about characteristics of companies' organizational systems that act as underlying causes of major events. 11 ------- STATISTICAL RESEARCH ISSUES IN QUALITY ASSURANCE (CHAIR: JOHN WARREN, (EPA/ORD) INTEGRATING DATA QUALITY INDICATORS (DQIS) INTO DATA QUALITY OBJECTIVES (DQOS) John Warren Quality Assurance Division, Office of Research and Development, US Environmental Protection Agency EPA Order 5360.1 CHG 1 (July 1998) requires all EPA organizations to use a systematic planning process to develop acceptance or performance criteria for the collection, evaluation, or use of environmental data. Systematic planning identifies the expected outcome of the project, the technical goals, the cost and schedule, and the acceptance criteria for the final result. The Data Quality Objectives (DQO) Process is the Agency's recommended planning process when data are being used to select between two opposing conditions, such as decision-making or determining compliance with a standard. The outputs of this planning process (the data quality objectives themselves) define the performance criteria. The DQO Process is a seven-step planning approach based on the scientific method that is used to prepare for data collection activities such as environmental monitoring efforts and research. It provides the criteria that a data collection design should satisfy, where to collect samples; tolerable decision error rates; and the number of samples to collect. Data Quality Indicators (DQIs) are the individual performance characteristics specified in the mandatory Quality Assurance Project Plan (QAPP) that accompanies any environmental data collection. Typical DQIs include precision, completeness, comparability, and sensitivity. This discussion centers on how the Agency can effectively make the link between DQOs and DQI A PERFORMANCE EVALUATION OF THE METHOD DETECTION LIMIT Charles White US Environmental Agency Performance criteria specified in the original (1981) publication are evaluated using EPA data. Data available for preliminary evaluation include over thirty combinations of pollutant by chemical analytical technique. 12 ------- 5:15-8:00 Tuesday, May 11,1999 POSTER AND COMPUTER SESSIONS INFORMATION VISUALIZATION - TURNING DATA INTO INFORMATION YOU CAN EASILY UNDERSTAND Stuart H. Kerzner US Environmental Protection Agency, Region III The poster shows "EnviroSnax", which are graphics showing tidbits of environmental information in ways that are easy to understand and highlight past or future environmental impacts on the Region. They are used for management briefings, public use, press releases and presentations. WHAT THE CEIS NATIONAL TELEPHONE SURVEY WILL BE ABLE TO TELL EPA'S INFORMATION PROVIDERS Heather Case EPA Customer Service, US Environmental Protection Agency This presentation will describe the potential uses of the results from a national telephone survey recently completed by the CEIS. The national telephone survey, which began in February 1999, was designed to: identify and describe environmental information customers within the U.S. population; identify the public's high interest environmental topics; and determine the public's access preferences for obtaining and using information. The survey results will be used to guide CEIS information product and service development. The survey results will be available for peer review in mid-August 1999. This presentation will highlight potential uses by information providers in the Programs and Regions. METHODS TO MINIMIZE HUMAN ERROR IN REPORTING ANALYSIS RESULTS Susannah Dillman US Environmental Protection Agency Using "Paste Special" multiple graphs and tables in Excel can be linked to the report in WordPerfect 8 and updated all at once. 13 ------- USING PERL SCRIPTS TO IMPORT DATA INTO CIS: AN EXAMPLE USING USGS GROUND WATER SITE INVENTORY DATA John S. Graves US Environmental Protection Agency, Region HI One of the primary tools in EPA Region III for evaluating environmental data is the Geographic Information System or GIS. A difficulty in using a GIS is that environmental data is not always readily available in a GIS format. The Perl computer language was used to translate U.S. Geological Survey ground water data into a format, which could then be imported into a GIS. This poster presents relevant portions of the Perl script used with explanations of the data processing steps undertaken as well as examples of GIS generated plots from the resulting data in EPA Region III. DEMONSTRATION OF ESS, S-PLUS, AND TRELLIS GRAPHICS Richard M. Heiberger Department of Statistics, Temple University ESS [Emacs Speaks Statistics] is a GNU Emacs interface for interactive statistical programming and data analysis. Languages supported include S-Plus, XLispStat, and SAS. ESS provides a standard interface between statistical programs and statistical processes and has as one of its goals an increase in efficiency for statistical programming and data analysis, over the usual tools. ESS displays source code in these languages with syntactic indentation and highlighting of source code. ESS interacts "directly" with the statistical package. ESS allows intelligent interaction with the transcript of previous interactive session. Trellis is a graphical display system that uses multiple panels to simultaneously view relationships between different variables in your multivariate dataset through conditioning. Trellis was developed at Bell Labs as part of S-Plus. We will have a live demonstration of ESS, S-Plus, and trellis graphics. I will analyze and display several examples of continuous and discrete multivariate and time series data sets. CD TOXIC RELEASE INVENTORY (TRI) DATA EXPLORER William P. Smith Center for Environmental Information and Statistics, US Environmental Protection Agency The TRI Data Explorer is a web product designed to provide the user quick and easy queries to EPA's TRI Chemical release data for years 1988-1997. The Explorer's portal to TRI chemical release data is through multiple data views which provide detailed and comprehensive chemical reports at all geographic levels down to the facility level by year or across years. In addition for each chemical the explorer provides interesting information such as factoids and information on the top 100 releasing facilities and counties. 14 ------- The TRJ Explorer will help our customers find information on topics such as: the chemicals released in their county during the year; the facilities that are releasing these chemicals in the county, state or the nation; the top chemicals released in their county, the state, or the nation; and, the top 100 ranking facilities and counties in the nation that release a given chemical, or all chemicals. And much more. The application runs on the web at hup://athena.was.epa.gov:2002/~wsmith/tri2/explorer.htm. or on CD for running off-line without the Internet. The CD application will be demonstrated. 8:00-9:00 Wednesday, May 12 DATA INTEGRATION AND QUALITY: VISION FOR THE FUTURE (CHAIR: RUTH ALLEN, EPA) ATLAS OF CANCER MORTALITY IN THE UNITED STATES, 1950-94 Susan Devesa, D. Grauman, W. Blot, G. Pennello, R. Hoover, and J. Fraumeni Division of Cancer Epidemiology and Genetics, National Cancer Institute The geographic patterns of cancer around the world and within countries have provided important clues to the environmental determinants of cancer. In the mid-1970s the NCI prepared county-based maps of cancer mortality in the U.S. that identified distinctive variations and hot- spots for specific tumors, thus prompting a series of analytic studies of cancer in high-risk areas of the country. We have prepared an updated atlas of cancer mortality in the United States during 1950-94. based on mortality data from the National Center for Health Statistics and population estimates from the Census Bureau. Rates per 100.000 person-years, directly standardized using the 1970 US population, were calculated by race (whites, blacks) and gender for 40 forms of cancer. The new atlas includes more than 140 computerized color-coded maps showing variation in rates during 1970-94 at the county (more than 3000 counties) or State Economic Area (more than 500 units) level. Summary tables and figures are also presented. Selected maps for the 1950-69 period are also included. Accompanying text describes the observed variations and suggests explanations based in part on the findings of analytic studies stimulated by the previous atlases. The geographic patterns of cancer displayed in this atlas should help to target further research into the causes and control of cancer. 15 ------- ANALYSIS OF CLEANUPS (CHAIR: MIKE MESSNER, EPA/OGWDW) CRYPTOSPORIDIUM OCCURRENCE IN THE NATION'S DRINKING WATER SOURCES Michael J Messner, Ph.D. US Environmental Protection Agency Cryptosporidium is a microbial pathogen which occurs in most of the nations surface waters. Information on cryptosporidium occurrence will be used in estimating the costs and benefits of ruture drinking water regulations. A recently completed survey generated monthly estimates of cryptosporidium concentrations in the source waters of over 400 large drinking water utilities. With only two months of validated data in hand, it appears that 80 to 90 percent of the water volumes analyzed yielded zero oocysts. On its face, this sparsely of nonzero results appears to severely limit the data's usefulness. In this presentation, a Bayesian approach is outlined for estimating hierarchical model parameters and their uncertainties. Time permitting, the approach will be illustrated using a small simulated data set. SAMPLING AND DESIGN ISSUES IN ENVIRONMENTAL STUDIES (CHAIR: TONY OLSEN, EPA/NHEERL) SAMPLE DESIGNS FOR ENVIRONMENTAL DATA COLLECTION: RANKED SET SAMPLING AND COMPOSITE SAMPLING David Marker Westat Historically environmental statistics and survey sampling have had relatively limited interaction. Most environmental studies use pre-existing data collection locations, collect from known hot spots, and/or purposively select data collection locations. Efficient survey sampling that can support the evaluation of a wide range of hypotheses has been used to a lesser degree with environmental data than in health, education, or many other types of data. This talk will describe two NRCSE funded research activities that try to bridge this gap between survey sampling and environmental statistics. Ranked set sampling (RSS) is a method to potentially increase precision and reduce costs by using "rough but cheap" information to obtain a more representative sample before the real, more expensive sampling is done. We have explored under what conditions RSS becomes cost- effective for ecological and environmental field studies where the "rough but cheap" measurement has a cost. 16 ------- We are continuing to explore when alternative forms of two-phase sampling are preferable to RSS. Composite sampling has been proposed in environmental settings where the costs of measurement are high. It is hoped that by compositing data collected from multiple locations the cost savings will outweigh the loss of information on the individual locations. Unfortunately it is not clear how often this trade- off is successful. NRCSE has funded the collection of side-by-side individual and composite samples so that this trade-off can be explored with real data from a national survey of over 800 houses. The data collection protocol and types of planned analyses will be discussed for this ongoing activity. 9:15-10:45 Wednesday, May 12 SOME ANALYSES AND POTENTIAL ANALYSES AT EPA (CHAIR: DOREEN STERLING, EPA/CEIS) INTEGRATING DATA FOR PLANNING AND TARGETING Michael Barrette US Environmental Protection Agency For each major regulatory program implemented by EPA, the program office has designed databases to house the information critical to the program's needs. In a changing world, data users are now interested in looking at environmental information holistically, which means that databases must relate to each other. To plan its enforcement and compliance activities, EPA makes use of integrated data within the Integrated Data for Enforcement Analysis (IDEA) system. This system provides access to more than 15 databases maintained by EPA and other government agencies. When trying to compare across databases, of course many discrepancies and data errors are found. In this presentation several topics related to data quality and integration will be examined: What is the critical step needed in order to integrate information across databases at the facility level? Discussion will focus on EPA's data integration strategies. What are key methods that have used existing data to find high-priority sector and geographic issues? Discussion will focus on recent efforts to identify priority areas and sectors for inspection targeting. How can data integration be used to find violators? Discussion will focus on some concrete examples showing how comparison of databases can lead facilities that are improperly regulated. 17 ------- THE MID ATLANTIC INTEGRATED ASSESSMENT PROGRAM (MAIA) Tom DeMoss Environmental Services, U.S. Environmental Protection Agency, Region III Tom Pheiffer Atlantic Ecology Division, NHEERL, U.S. Environmental Protection Agency The MAIA program is an integrated environmental assessment program being conducted by USEPA, Region III, and US EPA's Office of Research and Development, partnership with other Federal and State Agencies. Objectives of the MAIA program are to build partnerships and get all stakeholders involved in helping to (1) identify questions needed for assessing major ecological resource area, such as ground water, surface water, forests, estuaries, wetlands, and landscapes; (2) characterize the health of each resource are, based upon exposure and effect information; (3) identify possible associations with stressors, including landscape attributes, that may explain impaired conditions for both specific resources and the overall ecosystem; (4) target geographic areas and critical resources for protection and restoration, and (5) monitor environmental management progress. Our experience with partners uncovered certain key principles of effective watershed management. They were (1) agreement on geologic boundaries and or units of assessment; (2) conduct an assessment of their biological condition of resources; (3) target management to real impairment based upon the biological assessments including TMDL, nutrients and habitat restoration; (4) have watershed approach be holistic or segment by segment bases upon nature of problem; (5) have five-year rotation to monitoring and to assessments to allow time for change of environment and for progress from management action; (6) buy-in stakeholders so assessment and monitoring plans use all available resources and innovative options; (7) success will be more cost-effective monitoring and management fixes. Successful State partnering involves early buy in well before products are developed. MAIA's emphasis on aquatic biology and habitat is a departure from the water quality standards/TMDL mentality and requires open dialogue with state biologists who must educate their managers on the importance of habitat preservation and restoration as the new wave of management of their aquatic resources. STRATEGY TO ADDRESS EVOLVING ENVIRONMENTAL INFORMATION NEEDS John Moses Center for Environmental Information and Statistics (CEIS), US Environmental Protection Agency While primarily a regulatory agency, the U.S. Environmental Protection Agency is devoting an increasing amount of its resources to responding to public requests for information about environmental quality, pollution sources, and human health and ecosystem concerns. 18 ------- Additionally, the Agency must report annually to Congress on its progress in protecting human health and safeguarding the natural environment, as required under the Government Performance and Results Act (GPRA). Yet, in many cases, the data EPA needs to respond to public questions and to report on its progress are not readily available. The Evolving Information Needs Strategy addresses the gaps between the data the Agency needs and the data it currently has. Working with EPA Regional and Program Offices and external stakeholders, CEIS developed a two-phase strategy to identify and address some of the Agency's key environmental information gaps. Phase I is a general screening analysis for identifying major gaps in 26 key environmental problem areas and for setting priorities among these problem areas. Phase II is a methodology for performing a more in-depth analysis of and recommendations to address the gaps associated with each environmental problem area. This paper reports on the Phase I screening analysis, conducted from June through April 1999. MEASUREMENT ISSUES RELATED TO OUR WATER SUPPLY (CHAIR: BARNES JOHNSON (EPA/OSWER/OSW) FORAYS INTO THE UNFORGIVING- OCCURRENCE ESTIMATION IN THE REALM OF DATA WITH MULTIPLE CENSORING POINTS Andrew Schulman, Jennifer Wu, and Ben Smith US Environmental Protection Agency Under the Safe Drinking Water Act, the Agency is charged with establishing standards for allowable levels of contaminants in the Nation's public water systems. Central to the selection of the regulatory level is the determination of the relative benefits and costs likely to be achieved. Benefits and costs are directly proportional to the level of current occurrence. Consequently, sound decision making requires the best possible estimation of occurrence be utilized. In developing a new regulation for arsenic, the Agency has data from over twenty States covering a time span of up to twenty years. Because the current regulation is at a much higher concentration than new options under investigation, however, many State data sets are heavily censored by detection limits within the range of required estimation. This paper will discuss the data and the approaches EPA is considering for the assimilation of the data into national and intra-system occurrence estimation. ESTIMATED WATER CONSUMPTION IN THE U.S. BASED ON THE CSFII Henry D. Kahn, Helen L. Jacobs, and Kathleen A. Stralka US Environmental Protection Agency Knowledge of drinking water intake is fundamental to the mission of the Office of Water and an important component of a number of programs at EPA. This presentation provides a summary of 19 ------- our recent efforts to generate up-to-date estimates of water intake by the population of the United States. To obtain current estimated water consumption distributions, we have analyzed the United States Department of Agriculture's (USDA's) Combined 1994-96 Continuing Survey of Food Intake by Individuals (CSFII) data set. Per capita water intake is estimated for three sources of water: municipal/tap, bottled, and other sources of water (i.e., private well, private cistern, or private or public well). For each source of water, distributions are generated for direct and indirect water consumption. The distributions by age, gender, race, socioeconomic status, and geographical region and separately for pregnant and lactating women are also estimated. Survey design and statistical methodology are discussed. We anticipate that the water consumption distributions will be used in a wide range of applications including: rules limiting amounts of microbes; disinfectant by-products (DBF) rules; radon and other drinking water contaminant rules; protection of sensitive populations and other exposure assessments. DEVELOPMENT OF A NEURAL NETWORK TOOL FOR EVALUATION OF WASTE MANAGEMENT UNIT DESIGNS Virginia Cohen-Bradley Economics, Methods, and Risk Analysis Division, Office of Solid Waste, US Environmental Protection Agency Samuel Figuli, Julia Lewis, and Katrin Arnold HyroGeoLogic, Inc. The Office of Solid Waste recently completed a neural network software tool designed for evaluating leachate concentrations in four different waste management units, with three different liner types. The purpose of the tool is to help non-hazardous industrial waste facilities determine the concentration for the constituent of concern that can be disposed of safely in a specific waste management unit design. The neural network software, EPA's Industrial Waste Management Evaluation Model (IWEM) is based upon EPA's ground-water fate-and-transport model, EPACMTP. EPACMTP was designed for national-level risk assessments. It is run in Monte Carlo mode, using hydrologic data representative of the United States. Seven parameters judged to be the most significant in EPACMTP were used to build four different neural network tools, one for each of the waste management units: landfill, surface impoundment, waste piles, and land application units. IWEM has a multi-layer perceptron architecture and was trained in back-propagation mode from target output generated by the Monte Carlo-style analyses with EPACMTP. Several different approaches to producing training- and test-data sets were used. In general, the comparison between the neural network and the EPACMTP results is good. The accuracy of the neural networks varies with the location of the EPACMTP response surface that is being simulated. 20 ------- ASSESSING RISK (CHAIR: ELIZABETH MARGOSCHES (EPA/OPPTS) PROPOSED EPA METHODOLOGY FOR ASSESSING RISKS FROM INDOOR RADON David Pawel, Ph.D. US Environmental Protection Agency Radon has been determined to be the second leading cause of Jung cancer after cigarette smoking (NAS 1998). Based on methodology published by the National Academy of Sciences (NAS) in its BEIRIV report (NAS 1988) and in its "Comparative Dosimetry" report (NAS 1991), EPA has previously estimated that 13,600 lung cancer deaths in the U.S. each year are radon related (EPA 1992). Subsequently, the Agency sponsored a study by the NAS, which reviewed the large body of evidence about radon that has become available since their earlier reports. The new NAS study, BEIR VI (NAS 1998), confirmed that radon is a serious public health problem, and provided new estimates of radon risk and of radon-attributable lung cancer deaths, which were somewhat higher than EPA had projected previously, particularly for never smokers. The BEIR VI committee concluded, moreover, that about one-third of these cases are preventable if all homes above 4 pCi/L are remediated. We will discuss proposed revisions to EPA's methodology for calculating radon-related risk estimates in light of BEIR VI and the Agency's own previous analysis. These include estimates of attributable risk and risk per working level month (WLM). Attributable risk is the proportion of lung cancer deaths attributable to radon. Risk per WLM is the number of expected radon- induced cancer deaths for the current population divided by the corresponding total of past and future exposures. We will describe life table methods for calculating these quantities, and show how changes in smoking patterns might impact these estimates of risk. It is anticipated that this methodology would be used by EPA in a number of contexts, including: (1) updating its public information aimed at reducing residential radon exposures; (2) its assessment of risk from radon in drinking water; and (3) its assessment of risks associated with radium contaminated sites. HEALTH DATA: HOW DO WE USE IT TO PROTECT THE PUBLIC/ENVIRONMENT? Elizabeth H. Margosches, Ph.D., Jennifer Seed, Ph.D., and Khoan T. Dinh, Ph.D., US Environmental Protection Agency This talk will describe the types of data typically available for analysis by the EPA's Office of Pollution Prevention and Toxics, and how they are used. These data are submitted under various statutes or gathered from the open literature and are used to help decide to what degree the public or the environment may be at risk of incurring adverse effects if certain exposures occur. The decisions include such considerations as whether the available studies are experimental or observed in situ and how inferences may be made from various animal studies to the wild or to humans as well as inferences from one effect to another. Sampling and data collection issues, 21 ------- missing data, and data modeling are all critical statistical aspects of this activity. An example will be given those focuses on generalizing inferences from a dose-response model. 2:00-3:30 Wednesday, May 12 THE VISUAL PRESENTATION OF DATA (CHAIR: AL MORRIS, REGION III, EPA) ENVIROVIZ-TURNING NUMBERS INTO VISUAL RELATIONSHIPS Alvin R. Morris Director, Office of Environmental Data, US Environmental Protection Agency, Region HI We're drowning in data ever hear that plaintive wail? While we may not be drowning, we are faced with under-utilizing data. Another more recent challenge facing us- in the spring of next year is to prove to the congress, the public and others that we are using the funds they provide to actually improve the environment-how much, where and for what price. Data visualization can help solve both those challenges. This presentation will be the first presentation of outputs of a prototype program we named EnviroViz. A program that dynamically links air and water ambient and major point sources to: where they are located: in the Region, state, county, and watershed shows the 6-year trend for each of 7 air and 46 water parameters (stressors) the GPRA goals to the sub-objective level and shows for each sub-objective the associated FTE, contract $, state and tribal grants It's a new approach to more easily understanding the meanings embedded in environmental data and can be applied in many areas-please come see and comment. METHODS FOR DISPLAYING TEMPORAL AND SPATIAL TRENDS David Mintz Air Quality Trends Analysis Group, US Environmental Protection Agency EPA's Office of Air Quality Planning and Standards is tasked with developing an annual report on the nation's air quality. This report, entitled National Air Quality and Emissions Trends Report, uses various graphing techniques to present temporal and spatial trends in the data. This paper discusses the methods employed in the report, their strong points, and their limitations. Much of the graphical design is based on the principles of Edward Tufte and other leading authorities on the visual display of information. 22 ------- TWO TEMPLATES FOR VISUALIZING GEOREFERENCED STATISTICAL SUMMARIES Daniel B. Carr Center for Computational Statistics, George Mason University This paper presents two new templates for visualizing spatially-index statistical summaries. The first template called conditioned choropleth (CC) maps represents a powerful interactive extension of classed choropleth maps. The basic layout is a 3 x 3 matrix of panels containing nine juxtaposed maps. One conditioning variable corresponds to rows and the other to columns. The analyst controls the highlighting of map regions by manipulating row and column sliders that define acceptable intervals for the conditioning variables. A small tab in each panel shows a value summarizing the highlighted region values. The presence or absence of main effects and interaction are evident at a glance. Other analyst interactions including dynamic class interval selection and simultaneous pan and zoom for all panels. The examples emphasize study of human mortality rates for health service areas conditioned on environmental and demographic variables. The second template, called linked micromap (LM) plots, provides an alternative to traditional classed choropleth maps. The new design trades off region boundary resolution for more accurate or extensive statistical summaries. These summaries can be bar plots, dot plots, box plots, time series, line high plots for over a hundred variables and so on. Color provides a local link of each region's (or site's) statistical summary and it's spatial position in the micromap. Examples show numerous variations of this template. The discussion addresses pattern discovery and working in progress for drilling down from state to county to census tract. LISTENING TO OUR INFORMATION CUSTOMERS (PANEL SESSION) Margaret Morgan-Hubbard, Director, EPA Office of Communications Brendan Doyle, (Acting) Director, CEIS Customer Survey and Access Division Emma McNamara, (Acting) Director, EIMD, OIRM (invited), and Pat Bonner, EPA Customer Service US Environmental Protection Agency This session will give participants an overview of how EPA and CEIS are surveying the Agency's current and potential environmental information customers to better understand their needs and access preferences. Several examples of how customer feedback is helping to shape various EPA information products and services will be introduced. CEIS will re-cap lessons learned from the Center's customer surveys over the past two years and give an update on their national customer telephone survey (results due this fall). Session participants will have an opportunity to express their interests in using the Center's survey data for their own analyses and programs. The basics of using customer feedback on your products or services will also be covered. The panel agenda will include: 23 ------- Introductions: Brendan Doyle, CEIS Margaret Morgan-Hubbard, OCEMR: The importance of focussing your information product or service on your customers' needs and a vision for serving EPA's environmental information customers in the future. Brendan Doyle: Overview of what we've learned so far by implementing the CEIS customer survey plan and what we hope to learn from our national information customer telephone survey this fall. Emma McNamara: EPA's web sites- incorporating customer input and customer service principles into developing and maintaining a Web site. Pat Bonner: EPA Customer feedback 101- will discuss how "Hearing the Voice of the Customer" guidelines can help you to obtain useful customer feedback on your products, processes and services. APPLICATION OF SAMPLING IN AQUATIC RESOURCES (CHAIR: HENRY KAHN, EPA/OW/EAD) COMPOSITE SAMPLING ANALYSIS OF CONTAMINANT LEVELS IN FISH Henry D. Kahn and Silvestre Colon US Environmental Agency Samples of fish formed by physically mixing, i.e., grinding together, a number of fish into a combined, aggregate sample are referred to as "composite samples". Chemical analysis of composite samples is a cost-effective mechanism for estimating mean levels when the cost of analysis is high and the cost of obtaining sample units, such as individual fish, is relatively low. A possible concern in the analysis of composite sampling data is the absence of measurement results on individual units that comprise the composite. This presentation considers a set of data on contaminant levels in measured in composite samples offish and individual fish that constitute the composite samples. The results allow for comparison of composite and individual analyses. Additional topics discussed are: estimation of variance components associated with the composite samples, using measurements made on subsamples of the composites and effects of fish length and weight on contaminant levels. 24 ------- NATIONAL FISH TISSUE CONTAMINANT LAKE SURVEY: A NEW SPATIALLY-RESTRICTED SURVEY DESIGN Anthony R. Olsen NHEERL Western Ecology Division, U.S. Environmental Protection Agency In 1998, the U.S. Environmental Protection Agency initiated a national study offish tissue contaminants in lakes and reservoirs. The study requires the development of a survey design to meet the study objectives. For the national lake study, a list frame of waterbodies greater than 1 hectare is available. The frame provides information on the lake surface area and its geographic location, in the form of a geographic information system (GIS) coverage. However, the frame includes waterbodies that do not meet the definition of the target population. The frame includes 270,761 waterbodies. This paper develops the survey designs for the study and discusses how an underlying discrete global grid can be used to control the spatial distribution of the sample and to address the imperfection of the frame. The survey design does not use finite population sampling theory, but a continuous population in a bounded area theory that parallels it. The spatially-restricted design enables the concept of a systematic sample to be implemented while maintaining the ability to obtain design-based estimates and variance estimates. 8:30-10:30 Thursday, May 13 APPLYING MONTE CARLO SIMULATION TECHNIQUES WITH S- PLUS Steven P. Millard Probability Statistics and Information (PSI) Monte Carlo Simulation covers a broad range of topics, including simply generating random numbers, probabilistic risk assessment, bootstrapping to obtain the distribution of (and hence confidence intervals for) some statistic for which the distribution is unknown or not assumed, and permutation tests. This talk will discuss the concepts behind each of these main topics, then use examples to show you how to implement these methods using S-PLUS and ENVIRONMENTAL STATS for S-PLUS. 25 ------- 9:00-10:30 8:30-10:30 Thursday, May 13 THE DATA COME IN, THE DATA GO OUT REDUCING PAPERWORK BURDENS AT EPA Rick Westlund Office of Policy, US Environmental Protection Agency In the March 1995 Reinventing Environmental Regulation report, EPA established a long term commitment to identify and eliminate obsolete, duplicative, and unnecessary monitoring, reporting, and record keeping requirements. To date, EPA has removed more than 25 million baseline burden hours, and built an internal watchdog culture dedicated to avoiding unnecessary new paperwork burdens. Although total burden has continued to creep upward due to new statutory requirements and new right-to-know collections, EPA programs continue to develop creative approaches to chip away at burden without endangering environmental objectives. In addition, EPA is developing many enterprise-wide initiatives designed as strategic investments with the potential for much larger burden reductions three to five years from now. In the last several years the Agency has accelerated its efforts to improve information collection management, with a particular focus on reducing burdens associated with reporting and record keeping, while at the same time enhancing data quality, coordinating our data activities with States, improving our collection and display technologies, and compiling our data into a single Internet site. We have taken major steps, but there is still more to do. The public's right-to-know is now a fundamental cornerstone of our work at EPA, and we have all worked hard to put information into the hands of the American people in the belief that this is one of the best ways to protect public health and the environment. In the course of doing so, we have learned that the Agency's effective management of its data is central to the measurement of our progress in delivering the protections the American people expect. As we embark on a new era of information technology and enhanced public access to data, we are committed to minimizing our paperwork burden on the public while ensuring that our data are timely, accurate, useful to the public, and able to effectively inform our own decision making. The Agency has several initiatives underway to redesign or refocus the way we manage information collection with primary goals to reduce burden on the public while accomplishing our environmental protection mission. The most encompassing initiative is the recently launched reorganization plan involving the formation of a new information organization that will bring together all Agency information programs to better manage our information resources with an expressed goal of reducing burden on the public while enhancing the data quality and integrity as it is used within the Agency and made available to others outside the Agency. Another major initiative, started over a year ago after the 1997 Information Streamlining Plan, is the continued development of the Reinventing Environmental Information (REI) initiative. In its early stages, the plan focuses on data quality and building infrastructure, but burden reduction savings will become more apparent as the efficiencies in reporting options become available. 26 ------- The Agency has been very active working with the States on burden reduction especially through partnership workgroups with the Environmental Council of States (ECOS). The workgroup is identifying burden reduction opportunities by defining what information is and should be collected, how information is transmitted, and how information is used. The workgroup is also engaging industry, the public and others to help draft a tactical approach to burden reduction. Within the Agency, the program offices are developing a range of streamlining and reinvention initiatives to reduce burdens. They range from whole program streamlining as in the Office of Solid Waste's comprehensive review of the RCRA program to the Office of Air's reengineering of the pre-production certification program for new motor vehicles. 10:45-11:45 Thursday May 13 PERSPECTIVES ON DATA AND INFORMATION FROM THE DEPARTMENT OF HEALTH AND HUMAN SERVICES William F. Raub, Ph.D. Deputy Assistant Secretary for Science Policy Department of Health and Human Services The Department of Health and Human Services (DHHS) employs a wide variety of data and information systems as it seeks to enhance the well-being of Americans by providing for effective health and human services and by fostering strong, sustained advances in the sciences underlying medicine, public health, and social services. DHHS data-oriented efforts range from (a) collection of national vital and health statistics to (b) systematic surveillance focused on specific diseases and disorders to (c) special surveys oriented to particular public health issues and'or particular population groups. A major contemporary challenge is to improve surveillance for new and reemerging infectious diseases in general while improving preparedness to detect and respond to potential acts of biological terrorism. N c*«>j TWW * US- ' -^ it _ i j«. ^i "^** O-»- ------- Analytic Challenges and the Government Performance and Results Act Judith Calem Lieberman EPA/OCFO 13* EPA CONFERENCE ON STATISTICS AND INFORMATION May 10-13, 1999 Philadelphia, PA OUTLINE The GPRA-deflnition, objectives, requirements, who's involved 4 Analytic Tasks Analytic Challenges Efforts to Improve EPA's Data Quality ------- Government Performance and Results Act (GPRA) Legislation requiring agencies to set goals, measure performance, and report on the degree to which goals are met A "legal constitution for good management" Seeks to improve the efficiency, effectiveness, and public accountability of federal agencies Promotes transparency in decision-making with a focus on program results (environmental outcomes) GPRA REQUIREMENTS Strategic Plan Description. Mission and Long-term Goals Activities and Resources External Factors After 'Managing for Results Analytic Challenges in Measuring Performance' (GAO. 5/97) ------- GPRA REQUIREMENTS Performance Plan Description. Annual Performance Goals/Targets Links Budget with Goals Performance Measures to Assess Progress After 'Managing for Resulu Analytic Challenges in Measuring Performance' (CAO, 3/97) 5 GPRA REQUIREMENTS Performance Report Description: Assessment of Performance Unmet Goals After 'Managing for Resulu Analytic Challenges in Measuring Performance- (CAO, 3/97} 6 ------- WHO'S INVOLVED IN GPRA IN EPA? Managers and staff in the Program Offices who are directly involved in strategic planning, accountability and budget formulation/execution All staff UJ h- CD 3 EPA's Planning Architecture Goals (10) Objectives (41) Sub-objectives (118) (si / \ I Annual Performance Goals (APGs) Annual Performance Measures (APMs) APG APG D ------- ANALYTIC STAGES GPRA Requirement Strategic Plan Performance Plan Performance Report Analytic Stage Identify Goals Develop Performance Measures Validate and Verify Performance Data Analyze and Report Results After'Managing for Results Analytic Challenges in Measuring Performance* (CAO, 5/97) 9 ANALYTIC STAGES GPRA Requirement Strategic Plan Performance Plan Analytic Stage 1 Identify Goals Challenges: Program mission makes it difficult to define outcomes APGs describe annual progress-difficult to characterize as outcomes After'Managing for Results Analytic Challenges in Measuring Performance' (GAO. 5/97) 10 ------- EXAMPLES Environmental Outcome Sub-objective- By 2010, visibility in some eastern national parks and wilderness areas (Class I areas) will improve by as much as 30% from 1995 levels Sub-objective written as an outcome and APG as an activity: D Sub-objective By 2010, make the air safer to breathe for an additional 74 million Americans living in areas expected to violate the revised standards by attaining and maintaining the new NAAQSforPM2.5 D APG (1999) Deploy PM 2.5 ambient monitors at 1500 sites ANALYTIC STAGES GPRA Requirement Performance Plan Analytic Stage 2 Develop Performance Measures Challenges: Outcomes may take years to develop Need Data Clear relationship to goal Cover key aspects of program After 'Managing for Results Analytic Challenges in Measuring Performance- (GAO. 5/97) 12 ------- ANALYTIC STAGES GPRA Requirement Performance Plan Performance Report Analytic Stage 3 Validate and Verify Performance Data Challenges Data limitations Quality of 3rd party data After'Managing Tor Results Anilytic Challenges in Measuring Performance' (GAO, 3/97) 13 EXAMPLE FY 2000 APG: Air toxics emissions nationwide from stationary and mobile sources combined will be reduced by 5% from 1999 (for a cumulative reduction of 30% from the 1993 level of 1 3 million tons) Corresponding Performance Measures1 D Combined stationary and mobile source reductions in air toxics emissions (5%) D Reductions in national highway vehicle benzene emissions (21,871 tons) QReductions in national highway vehicle butadiene emissions (3,498 tons) 14 ------- ANALYTIC STAGES GPRA Requirement Performance Report Analytic Stage 4 Analyze and Report Results Challenges Understanding impact of program activities on results Understanding roles of different players After'Managing for Results Analytic Challenge] in Measuring Performance' (CAO. 5/97) I i EFFORTS TO IMPROVE EPA'S DATA QUALITY Greater reliance on electronic data interchange Trend in making data available electronically so data can be reviewed by its source(s) Use of external review Boards to review environmental analyses Development of standardized guidance or regulatory definitions of key terms to promote consistency Use of customer surveys to identify data quality and data management problems and action plans 16 ------- CONCLUSION GPRA emphasizes goals and objectives that have environmental outcomes GPRA encourages EPA to rely more heavily on performance data to inform program and resource allocation decisions Poor data quality will limit the usefulness of the data in informing planning decisions Program evaluation is an important analytical tool to understand the impact of program activities on results 17 REFERENCES Managing for Results: Analytic Challenges in Measuring Performance (GAO, May 1997) The Results Act: An Evaluator's Guide to Assessing Agency Annual Performance Plans (GAO, April 1998) Managing for Results: Measuring Program Results that are Under Limited Federal Control (GAO, December 1998) IS ------- Monte Carlo and Permutation Methods in Statistics Woollcott K Smith EPA Statisticians Meeting Sugarloaf, Temple University May 10,1999 ------- Monte Carlo and Permutation Methods in Statistics Outline What's Monte Carlo about Monte Carlo methods? An Example: Paired Comparisons Rules and Guidelines for Simulation Based Methods Some Powerful Applications Missing data: multiple imputation Nonlinear errors in variables, SIMEX Smith's statistical ecology applications - NOT Rattlesnakes Generic la-la Monte Carlo Diagram Analysis of Paired Data * HMO tonntnoH EWgMDff » out of TVS motr CFFCCTIVC ««» TO naws uavtH. VUIUUIY wwu rcmmmit nttrtam. fat emmf. m (0*ranut ttmg OSHH. rut no mtim us «tmom.r uttmta TO uat tMxer* amr on LOT UWPV rm cuwun» vunuanv HG 10 Getting Semi-Real Minitab Lakes Data Set Consisting of 149 Lakes Pairs Alkalinity reading around 1930 Alkalinity reading around 1980 Exercise Trellis Graphics Alkalinity data by Lake Type IT Spring fed Lakes Paired Data 5- ------- qq-Plot for Paired pH Differences qq-Plot for Paired Alkalinity Differences I1 I- I- Paired Analysis I960 1930 gmnaa »* |0( 428.91 44256 .1385 2 44242 669 S3 .2721 5 714.98 539S6 75.3 14 717.72 669.63 48.09 10 980.18 787.71 193.47 16 71&25 715.05 32 1 785.34 750.47 14.87 3 791.93 39098 ~ 100.96 15 593.85 578.8 15.05 4 371.4 308.32 65.08 12 672.55 711.87 -3832 7 492.18 46073 3145 6 S94t S4693 4717 9 34505 41531 -70.26 1S 441 55 397 15 44 4 8 86384 608.32 55.52 11 2 5 7 13 Paired t-test Assumptions: d's are iid normal random variables t = = 2.1996 p - value = Pr(|t|> 2.1996) = 0.0439 Exact Wilcoxon signed-rank test Assumptions d's are iid symmetric random variables Rank the absolute differences V= sum of ranks associated with positive differences = 109 p-value = Pr(v < 27 or v > 109) =.0335 Exact binomial test sign test Assumptions d's are iid AND Pr ( d> 0) =.5 X" number of positive differences ^observed = 13 p - value = Pr(X < 3 or X > 13) =.0768 ------- Finally a Monte Carlo Randomization Test Assumptions: d's are i.i.d symmetric random variables Assign signs at random to d| Choose a good test statistic, sample mean of d Repeat procedure n-1 times Paired Comparison Monte Carlo Hypothesis Test Diagram Finally a Monte Carlo Result Repeat procedure 19 times 1+0 p value = n Repeat procedure 999 times p - value = 1000 20 = 0.05 1000 From the Trivial to an Impossible Multivariate Ecological Paired Comparison Three replicate Hester- Dendy collectors were placed at both downstream and upstream locations to study the effect of a municipal outfall. Complex, highly variable data set, total of 96 taxonomic groups. The 1996 upstream collectors contained 1,080 individuals and 32 taxonomic groups, while a year earlier the same location had only 65 individuals and 13 taxonomic groups. ------- Paired Comparison over Time Reach 1 > ir.d Downstream (Reach 2) 1200 :; - 1000 "*0 800 :o - BOO " - 400 CO - 200 00 H 0 CO j I /\ A / \ A / / \/' ^s> v v##+t Reach 1 Total Reach 2 Total Ecological Distance Measure The distance between two samples is defined as the number of species not shared: that is, the number of species that are present in exactly one of the two samples. False absence: A species is absent from the sample but present in the population. Smith, Solow and Preston ( Biometrics, 1996) and many others have detailed the statistical problems associated with presence-absence measures. Observed between Location and within Location Distance Paired ComparisonMonteCarloHvpothesis Test Diagram All possible permutations Random permutations (paired example) Jackknife Single observation removed in each sample Blocks of observations removed Bootstrap Sampling with replacement Parametric Bootstrap Sample from the estimated parametric model Gibbs' Sampling and Markov Chain Monte Carlo All permutations. Any nonparametric text Lehmann, E. H.(1975) An Example: Exact test on Markov chains Smith. W, A.R. Solow (1996) An Exact McNemar Test for Paired Binary Markov Chains. Biometrics. Software: StalExact Random Permutation Methods Sokal, R.R. and F.J. Rolf (1995). Biometry. : Chapter 18 Many, B.F.J. (1995) Randomization and Monte Carlo Methods In Biology. Software: Splus. SAS, Resampling Stats , Fortran. ,, ------- Jackknife and Bootstrap Efron .B. and Tfoshirani (1993) An Introduction to the Bootstrap Efron. B. (1982> The Jackknife. Bootstrap and Other Resampling Methods Gray, H. L. and Schucany, W. R. (1972) The Generalized Jackknife Statistic An Application to Ecological Measures Confidence Intervals for Similarity Measures using the Two Sample Jackknife: Smith, Kravitz and Grassle (1979) Multivariate Methods in Ecological Work. Expected Species Shared- ESS Normalized Expected Species Shared - NESS Jackknifed NESS- JNESS 'Hidden Species Area Curves: Clara Chu's Temple Thesis Properties of Samplers Reputable Will the sampler produce the same set of samples each rime? That is, it's not a Monte Carlo Sampler. Checkable Could a reasonable analyst duplicate the procedure? But not the exact results. Could a reasonable analyst read and understand the procedure? Computationally Inexpensive SIMEX- Simulation Extrapolation Raymond Carroll (199&). Measurement Error in Epidemiologic Studies. Encyclopedia o/Biostatistics Simple Measurement Error Model Y= \ + x2 + £ Example of Errors in Variable Model Paired Comparison Monte Carlo Hypothesis Test Diagram SIMEX Plot Means of simulated estimates with error ------- ------- Notes ------- REGISTRANTS The 1999 EPA Conference on Environmental Statistics and Information SugarLoaf Conference Center Philadelphia, Pennsylvania May 10-13.1999 Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail RUTH ALLEN OPP/HED/CEB US EPA 703-305-7191 301-402-4279 Allen.ruth@epamail.epa.gov HANS ALLENDER US EPA 703-305-7883 703-605-0645 AI lender.hans@epamail .epa.gov JOSEPH ANDERSON US EPA 202-260-3016 LARA P. AUTRY OAR/OAQPS/EMAD US EPA 919-541-5544 919-541-1039 Autrv.lara@epa.gov MICHAEL BARRETTE US EPA 202-564-7019 Barrette.michael@epamail.epa.gov SAMANTHA BATES UNIV OF WASHINGTON Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail MALCOLM BERTONI RESEARCH TRIANGLE INSTITUTE 202-728-2067 202-728-2095 MJB@rti.org CINDY BETHELL US EPA GEORGE BONINA OIRM 202-260-6227 Bonma.george@epa.gov PATRICIA BONNER US EPA 202-260-0599 Bonner.patricia@epamail.epa.gov ED BRANDT CEIS/IAIAD US EPA 202-260-6217 Brandt.edward@epamail.epa.gov LORI BRUNSMAN OPPTS/OPP/HED US EPA 703-308-2902 703-605-0645 Brunsman.lori@.epamail.epa.gov ------- Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail DANIEL CARR GEORGE MASON UNIV 703-993-1671 703-993-1521 HEATHER ANNE CASE OP/CEIS US EPA 202-260-2360 202-260-4903 Case.heather@epamail.epa.gov WENDY CLELAND-HAMNETT OP/CEIS US EPA 206-260-4030 202-260-0275 Cleland-Hamnett.wendv@epa.gov SILVESTRE COLON OFFICE OF WATER US EPA 202-260-3066 202-260-7185 Colon.silvestre@epamail.epa.gov VIRGINIA A.COLTEN-BRADLEY OSWER/EMRAD US EPA 703-308-8613 703-308-0509 Colten-bradlev.virginia@epamail.epa.gov MARGARET CONOMOS OPPE/CEIS US EPA 202-260-3958 202-260-4968 Conomos.margaret@epa.gov LAWRENCE COX ORD/NERL US EPA 919-541-2648 919-541-7588 Cox.larrv@epamail.epa.gov PETER CRAIGMILE UNIV OF WASHINGTON Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail DAVID CROSBY AMERICAN UNIVERISITY 202-885-3155 Dcrosbv@american edu THOMAS CURRAN OAR/OAQPS US EPA 919-541-5694 919-541-4028 Curran.thomas@epamail.epa.gov THOMAS DEMOSS US EPA MAIA 410-305-2739 410-305-3095 Demoss.tom@epa.gov SUSAN DEVESA NATIONAL CANCER INSTITUTE NIH 301-496-8104 301-402-0081 Devesas@epndce.nci.nih.gov SUSAN DILLMAN OPPTS/OPPT/NPCD US EPA 202-260-5375 202-260-0001 Dillman susan@epa.gov KHOAN TAN DINH US EPA 202-260-3891 202-260-1283 Dinh.khoan@epamail.epa.gov DONALD DOERFLER ORD/ERC/NHEERL US EPA 919-541-7741 Doerfler.donald@epamail.epa.gov BRENDAN DOYLE US EPA 202-260-2693 202-260-4968 Dovle.brendan@epamail.ena.gov ------- LEE ELLIS CEIS US EPA Phone 202-260-6123 Fax 202-260-4968 E-mail Ellis.lee@epamail.epa.gov ROBERT ENGLISH INFO TRANS/ORG PLANNING US EPA Phone 202-260-5995 Fax 202-260-3655 E-mail English.robert@epamail.epa.gov DAVID FARRAR OPP US EPA 703-305-5721 703-305-6309 Farrar.david@epamail.epa.gov TERENCE FITZ-SIMONS US EPA 919-541-0889 Phone Fax E-mail Phone Fax E-mail GEORGE T. FLATMAN ORD/NERL-CRD US EPA Phone 702-798-2528 Fax 702-798-2208 E-mail George.flatman@epamail.epa.gov JOHN F. FOX OST US EPA Phone 202-260-9889 Fax 202-260-7185 E-mail Fox.iohn@epamail.epa.gov MARY FRANKENBERRY OPPTS/OPP/EFED US EPA Phone 703-305-5694 Fax 703-305-6309 E-mail Frankenberrv.marv@epamail.epa.gov ANNE FRONDORF US GEOLOGICAL SURVEY Phone 703-648-4205 Fax 703-648-4224 E-mail Anne_frondorf@usgs.gov Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail WILLIAM GARETZ OPPE/CEIS US EPA 202-260-2684 Garetz.william@,epamail.epa.gov PAT GARVEY OIRM/EIMD US EPA 202-260-3103 202-401-8390 Garvey.pat@epamail.epa.gov SUSAN P. GEYER CEIS US EPA 202-260-6637 Gever.susan@epa.gov MELISSA GONZALES ORD/NHEERL US EPA 919-966-7549 919-966-7584 Gonzales.melissa@epa.gov PETER GOODWIN DEAN GRADUATE SCHOOL TEMPLE UNIVERSITY BRIAN GREGORY OAR/ORIA/IED/CHB US EPA 0 202-564-9024 202-565-2038 Gregory.brian@epamail.epa.gov JAY HAKES ENERGY INFORMATION ADMINISTRATION ------- Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail STEVEN M. HASSUR OPPT US EPA 202-260-1735 202-260-0981 Hassur.steven@epamail.epa.gov RICHARD HEIBERGER TEMPLE UNIVERSITY KAREN KLIMA US EPA JAMES HEMBY OAQPS US EPA 919-541-5459 919-541-2464 Hembv.iames@epa.gov DAVID M. HOLLAND ORD/NHEERL US EPA 919-541-3126 919-541-1486 Holiand.david@epamail.epa.gov STEVE HUFFORD CEIS US EPA 202-260-9732 202-260-4968 Hufford.steve@epamail.epa.gov BARNES JOHNSON OSWER/OSW US EPA 703-308-8855 703-308-0511 Johnson.barnes@epamail.epa.gov HENRY KAHN OW/EAD US EPA 202-260-5408 202-260-7185 Kahn.henrv@epamail.epa.gov R. CATHERINE KING US EPA OECEJ 215-814-0871 215-814-2905 Ktng.catherine@epamail.epa.gov Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail ARTHUR T. KOINES OP/CEIS US EPA 202-260-4030 202-260-0275 Koines.artliur@epamail.epa.gov MEL KOLLANDER INSTITUTE FOR SURVEY RESEARCH 202-537-6845 202-537-6873 LEE KYLE OGWOW US EPA 202-260-1154 202-401-3041 Kyle.lee@epamail.epa.gov PEPI HERBERTLACAYO CEIS US EPA 202-260-2714 202-260-4968 Lacavo pepi@epamail.epa.gov RASHMI LAL OP/CEIS US EPA 202-260-3007 202-260-8550 Rashmi.lal@epamail.epa.gov JADE LEE EPA OFFICE OF WATER 202-260-1996 202-260-7185 Lee.jade@epa.gov JUDY LEE WASTE/CHEM MGMT DIV 215-814-3401 215-814-3113 Lee.judy@epamail.epa.gov ------- Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail LAWRENCE LEHRMAN RMD/OIS US EPA Lehrman.lawrence@epamail.epa.gov ELEANOR LEONARD OP/CEIS US EPA 202-260-9753 703-525-3455 Elleonard@aol.com JUDITH C. LIEBERMAN OCEO US EPA 202-260-8638 202-401-1515 Lieberman.iudv@epamail.epa.gov PHILIP LINDENSTRUTH OFFICE OF WATER US EPA 0 202-260-6549 202-260-7024 Lindenstruth.phil@epamail.epa.gov CONNIE LORENZ OP/CEIS/CSAD US EPA 202-260-4660 202-260-4903 ARTHUR LUBIN OSEA US EPA 312-886-6226 312-353-0374 Lubin.arthur@epamail.epa.gov ALLAN MARCUS NCEA US EPA 919-541-0643 919-541-1818 Marcus.allan@epa.gov Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail ELIZABETH MARGOSCHES OPPTS/OPPT US EPA 202-260-1511 202-260-1279 Margosches@epamail.epa.gov MARY A. MARION OPPTS/OPP/HED US EPA 703-308-2854 Marion.marv@epamail.epa.gov DAVID MARKER UNTV OF WASHINGTON ETHAN MCMAHON OP/CEIS US EPA 202-260-8549 Mcmahon.ethan@epamail.epa.gov MICHAEL MESSNER OGWDW US EPA 202-260-8107 Messner.michael@epamail.epa.gov STEVEN P. MILLARD PSI 206-528-4877 206-528-4802 Smillard@.probstatinfo.com CHRISTOPHER MILLER NOAA Phone Fax E-mail DAVID MILLER OPPTS/OPP US EPA 703-305-5352 703-305-5147 Miller.davidJ@epamail.epa.gov ------- DAVID MINTZ OAR/OAQPS US EPA Phone 919-541-5224 Fax 919-541-1903 E-mail Mintz.david@epa.gov MARGARET MORGAN -HUBBARD DIRECTOR, OFFICE OF COMMUNICATION US EPA 202-260-5965 Morgan-hubbard .margaret @epamail.epa.gov Phone Fax E-mail AL MORRIS OFFICE OF ENVIRON DATA Phone 215-814-5701 Fax 215-814-5718 E-mail morris.alvin@epa.gov REBECCA MOSER CEIS US EPA 202-260-6780 202-260-4903 Phone Fax E-mail JOHN MOSES OP/CEIS US EPA Phone 202-260-6380 Fax 202-401-7617 E-mail Moses.iohn@epamail.epa.gov NICKNAPOLI US EPA Phone 215-816-2621 Fax 215-814-2783 E-mail Napoli.nick@epamail.epa.gov MALIHA S. NASH ORD/NERL-CRD US EPA Phone 702-798-2528 Fax 702-798-2208 E-mail Nash .maliha@epamai 1 .epa.gov Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail KJMBERLY NELSON PA DEPT OF ENVIR PROTECTION 717-787-3534 717-783-8926 Nelson.kimberly@dep.state.pa.us BARRY NUSSBAUM OPPE/CEIS US EPA 202-260-1493 202-460-4968 Nussbaum.barrv@epamail.epa.gov ROB O'BRIEN BATTELLE 509-375-6769 509-375-2604 Robert.obrien@pnl.gov ANTHONY R. OLSEN US EPA NHEERL 541-754-4790 541-754-4716 Tolsen@mail.cor.epa.gov G. P. PATIL PENNSYLVANIA STUNIV 814-865-9442 814-863-7114 Gpp@stat.psu.edu ROBERT M. PATTERSON COLLEGE OF ENGINEERING TEMPLE UNIVERITY 215-204-1665 215-204-6936 rpatterson@thunder.temple.edu DAVID PAWEL ORIA US EPA 202-564-9202 PETER PETRAITIS UNIVERISTY OF PENNSYLVANIA ------- Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail ANNE POLITIS CEIS/IAIAD US EPA 202-260-5345 202-260-4903 Pol itis.anne@epamai 1 .epa.gov RAFAEL PONCE UNTV OF WASHINGTON WILLIAM F. RAUB DEPUTY ASST SECTY SCI POLICY DEPT HEALTH & HUMAN SERV BREEDA REILLY CEPPO US EPA 202-260-0716 Reillv.breeda@,eparnail.epa.gov JOSEPH RETZER OP US EPA 202-260-2472 Relzer.ioseph@epamail.epa.gov JOEL REYNOLDS UNIV OF WASHINGTON EDNA RODRIGUEZ OP/CEIS/CSAD US EPA 202-260-3301 202-260-4903 Rodriguez.edna@epamail.epa.gov N. PHILLIP ROSS CEIS US EPA 202-260-5244 202-260-8550 Ross.Nphillip@epamail.epa.gov Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail K.RISTEN RYDING OEA US EPA 206-553-6918 PAUL SAMPSON UNIV OF WASHINGTON DINA SCHREINEMACHERS EBB/HSD/NHEERL/ORD US EPA 919-966-5875 919-966-7584 Schreinemachers.dina@epamail.epa.gov ANDREW SCHULMAN OGWDW/SRMD/TAB US EPA 202-260-4197 202-260-3762 Schulman.andrew@epamail.epa.gov RONALD SHAFER OP/CEIS US EPA 202-260-6766 202-260-4968 Shafer.ronald@epamail.epa.gov BOB SHEPANEK ORD/NCEA US EPA 202-564-3348 202-565-0061 Shepanek.robert@epamail.epa.gov CAROLYN SHETTLE INSTITUTE FOR SURVEY RESEARCH 202-537-6793 202-537-6873 cshettle@ioip.com ABRAHAM SIEGEL OW/OGWDW US EPA 202-260-2804 Siegel .abraham@epamai I .epa.gov ------- Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail BIMAL SINHA OPPE/CEIS US EPA 202-260-2681 Sinha.bimal(5)epamail.epa gov BENJAMIN SMITH US EPA 202-260-3026 202-260-3762 Smith.ben@epamail.epa.gov WILLIAM P. SMITH OPPE/CEIS US EPA 202-260-2697 202-260-4968 Sm ith. wi I l@epamai I .epa. gov WOOLCOTT SMITH TEMPLE UNIVERSITY MINDI SNOPARSKY HYDROGEOLOGY EPA 215-814-3316 Snoparsky.mindi@epamial.epa.gov JOHN A. SORRENTINO TEMPLE UNIVERSITY 215-204-8164 Soirento@astro.ocis.temple.edu DOREEN STERLING CEIS US EPA 202-260-2766 202-260-8550 Sterling.doreen@epamail.epa.gov WILLIAM TASH VICE-PROVOST TEMPLE UNIVERSITY Phone Fax E-mail MARY LOU THOMPSON UNIV OF WASHINGTON Phone 206-616-2723 Fax 206-616-2724 E-mail Mlt@biostat.washington.edu HENRY TOPPER OPPT US EPA Phone 202-260-6750 Fax 202-260-2217 E-mail topper.henrv@epa.gov MARIANNETURLEY UNIV OF WASHINGTON Phone 206-616-9288 Fax 206-616-9443 E-mail Marianne@cqs.washington.edu DIANNE WALKER US EPA REGION III Phone 215-814-3297 Fax 215-814-2134 E-mail Walker.dianne@epamail.epa.gov JOHN WARREN ORD/NCERQA;QAD us EPA Phone 202-260-9464 Fax 202-401-7922 E-mail Warren iohn@epamail.epa.gov NANCY WENTWORTH ORD/NCERQA/QAD US EPA Phone 202-564-6830 Fax 202-565-2441 E-mail Wentworth.nancy@epamail.epa.gov ELLEN WERNER INSTITUTE FOR SURVEY RESEARCH Phone 202-537-6735 Fax 202-537-6873 E-mail Ewemer@ioip.com RICK WESTLUND US EPA OFFICE OF POLICY Phone 202-260-2745 Fax 202-260-9322 E-mail Westlund.rick@epa.gov ------- CHARLES WHITE OW/OSTVEAD US EPA 0 Phone 202-260-5411 Fax 202-260-7185 E-mail White chuck@epamail.epa.gov NATHAN WILKES OFFICE OF POLICY US EPA Phone 202-260-4910 Fax 202-260-4903 E-mail Wilkes.nathan.epa.gov JENNIFER WU OW/OGWDW/SBMD US EPA Phone 202-260-0425 Fax 202-260-3762 E-mail Wu. iennifer@epamail .eoa gov ------- ------- Notes ------- Welcome to the 1999 EPA Conference on Environmental Statistics and Information It is my pleasure to welcome you on behalf of EPA's Center for Environmental Information and Statistics to the 1999 EPA Conference on Environmental Statistics and Information. For many "old timers" to this conference, you will immediately recognize that we have changed the name and scope of the conference. Just as EPA is currently planning a new Office of Information, we have expanded our coverage to information in addition to statistics. We welcome new attendees to the conference which covers all aspects of data, information, computer technology, data systems and statistics. We have the data gatherers, the data administrators, the data users, the data analyzers, the data simulators, the data reinventors, the data modelers, the data miners, the data assessors, and those who still can't figure out if data are singular or plural. We hope that all of the attendees will better understand the interrelationships among these groups as the new Office of Information will encompass so many of these activities. This year's theme is "EPA's Vision for the 21st Century". I am well aware that many of this year's conferences on any topic will have similar names. (I also fear any no-hum building constructed this year will shortly be labeled "turn of the century" architecture.) However, as EPA is undergoing major changes with the creation of the Office of Information, the Agency is trying to position itself to meet the burgeoning information challenges of the new century, so I feel the theme is truly appropriate. I hope that this conference will enable us to better understand those challenges and approaches to meeting them. We have an exciting collection of plenary sessions, featured talks, concurrent presentations, training sessions, poster/computer sessions and panel discussions. However, as with most meetings, some of the informal opportunities to meet and chat with your colleagues can frequently be the most productive aspect of the conference. I encourage you to take full advantage of this year's campus-type setting to continue your dialogues with your associates. I owe a great deal of thanks to the planning and arrangements committees for their efforts to organize this conference. Special thanks also go to Margaret Conomos and Connie Lorenz who assisted me in putting it all together, and to Temple University's Institute for Survey Research that handled the details and coordination. We encourage you to have a good time, learn a lot, and tell us about any enhancements you would like to see in the future. 6A1AL Barry D. Nussbaum 1999 Conference Chair Conference Planning Committee Arrangements Committee Susan Auby Eliga"b'efh MOfgosches Joan Bundy GetfrgeTlatman Trudy McCoy Kfth- Allen Ed Lloyd John Warren ------- Agenda for the 1999 EPA Conference on Environmental Statistics and Information Monday, May 10,1999 3:00-6:00 REGISTRATION AND CHECK-IN Foyer 4:00-6:00 tfpNfilJRRENT TRAINING SESSIONS Woollccrtt Smith (Temple University) and Peter Petraitis jy . R (U^uVfcteity of Pennsylvania) - Workshop on Monte Carlo Methods in Environmental Statistics . Joe Anderson (EPA/OIRM) - EPA's Web Site and You Room p 6:00-7:00 Cash Bar Foyer Tuesday, May 11,1999 8:30-9:00 .Welcoming Remarks and Introduction of Speakers Wendy Cleland-Hamnett, Director, EPA Center for Environmental Information and Statistics t Peter Goodwin, Dean, Graduate School, Temple University Dining Room 9:00-9:30 Al Morris, Director, Office of Environmental Data, EPA Region Dining Room III, Philadelphia, PA - Information, Statistics and the Region 9:30-10:30 Keynote Address Jay Hakes, Administrator, Energy Information Administration Dining Room 10:00-12:00 Statistical Training Session (Videotapes) RoomC ------- Tuesday, May 11,1999 (Continued) 10:30-10:45 Break 10:45-12:00 CONCURRENT PRESENTATIONS Statistics, Information, and GPRA (Chair: George Bonina, EPA) Room D . Judith Calem Lieberman, (EPA/OCFO), Analytic Challenges and the Government Performance and Results Act George Bonina (EPA) - Reinventing Environmental Information Local Applications of EPA Data (Chair: Ron Shafer, EPA/CEIS) Dining Room . Henry Topper (EPA/OPPT) - The Baltimore Community Environmental Partnership: Lessons Learned Kimberly Nelson (Pennsylvania Department of Environmental Protection) - The Department of Environmental Protection Compliance Reporting System . N. Bouwes, Steven M. Hassur (EPA/OPPT), S. Keane, E. Fechner Levy, B. Firlie, and R. Walkling (Abt Associates, Inc.) - Risk-Screening Environmental Indicators Model Statistical Methods for Lab and Air Quality Data Analysis Room B (Chair: Larry Cox, EPA/ORD/NERL) . Mary Lou Thompson and Kerrie Nelson (University of Washington) - Statistical Modeling of Multiply- Censored Data . Peter Craigmile (University of Washington) - Trend Estimation Using Wavelets Joel H. Reynolds (University of Washington) - Meteorological Adjustment of Surface Ozone for Trend Analysis: Pick an Answer, Any Answer 12:00-1:15 Lunch ------- Tuesday, May 11,1999 (Continued) 1:15-2:30 CONCURRENT PRESENTATIONS Databases: The Manager's View (Chair: Phil Lindenstruth, EPA) Room D . Panel: Phil Lindenstruth - STORET, Abraham Siegel - SDWIS, Mike A. Mundell - PCS (EPA) Ensuring the Quality of Environmental Information Dining Room (Chair: Nancy Wenrworth, EPA/ORD) Nancy Wentworth (EPA/ORD) - Quality Assurance and Environmental Information Malcolm Bertoni (Research Triangle Institute) - Using SimSITE to Illustrate Sampling Techniques Models and Model Assessment of Environmental Data Room B (Chair: Mary Lou Thompson, University of Washington) Rafael Ponce (University of Washington) - Development of a Linked Pharmacokinetic-Pharmacodynamic Model of Methylmercury-Induced Developmental Neurotoxicity Samantha Bates, Cullen, A. C, and A. E. Raftery (University of Washington) - Bayesian Model Assessment - Marianne Turley, E. David Ford, and Joel Reynolds (University of Washington) - Pareto Optimal Multi- Criteria Model Assessment 2:30-4:30 Statistical Training Session (Videotapes) Room C 2:30-2:45 Break ------- Tuesday, May 11,1999 (Continued) 2:45-4:00 CONCURRENT PRESENTATIONS Use of the Internet for Sharing Statistics Dining Room (Chair: Steve Hufford, EPA/CEIS) . Pat Garvey (EPA/OIRM) - Envirofacts Warehouse: Environmental Data On The Internet Empowering the Citizen toward Environmental Protection and Awareness Anne Frondorf (USGS) - National Biological Information Infrastructure . Chris Miller (NOAA) - National Environmental Data Index . Bob Shepanek (EPA/ORD) - EPA's Environmental Information Management System Epidemiology and Risk Assessment Cumulative and/or Aggregate Room B Risk Assessment (Chair: Ruth Allen, EPA) . David Miller (EPA/OPPTS/OPP) - Food Quality Protection Act and its Implementation: an Overview of Statistical and Probabilistic Issues Facing the Office of Pesticide Programs . Hans D. Allender (EPA/OPP/HED) - Finding a Statistical Distribution to Use in the Monte Carlo Exposure Assessment of Livestock Commodities . Breeda Reilly (EPA/CEPPO) - Applying Epidemiology to Study the Prevention of Major Chemical Accidents Statistical Research Issues in Quality Assurance (Chair: John Room D Warren, EPA/ORD) John Warren - Integrating Data Quality Indicators (DQIs) into Data Quality Objectives (DQOs) . Charles White (EPA/OW/OST) - A Performance Evaluation of the Method Detection Limit ------- Tuesday, May 11,1999 (Continued) 4:15-5:15 PLENARY SESSION (Chair: Wendy Cleland-Hamnett, EPA/CEIS) Dining Room Corrinne Caldwell, Acting Provost & Vice President, Temple University - Welcoming Remarks . Tom Curran (EPA/OAR) - Data, Information and Statistics: Putting it All Together for Decision-Making 5:15-8:00 POSTER AND COMPUTER SESSIONS Room A . George T. Flatman (EPA/ORD/NERL) - Satellite data for Landscape Ecology Lawrence Lehrman (EPA/RMD/OIS) - Cluster analysis of fish species and land use . Connie Lorenz (EPA/CEIS) - All the Stats that are Fit to Surf Brand Neimann (EPA) - Digital Library Demonstration Stuart H. Kerzner (EPA/Region III) - Information Visualization - Turning Data into Information You Can Easily Understand . Maliha S. Nash (EPA/LEB) - Geostatistical Analysis of Ecological Indicators Arthur Lubin (EPA/OSEA) - Environmental Random Stratified Sampling Designs Developed Via Cluster Analysis . Heather Case (EPA Customer Service) - What the CEIS National Telephone Survey will be able to Tell EPA's Information Providers . Susannah Dillman (EPA/OPPTS/OPPT) - Methods to Minimize Human Error in Reporting Analysis Results John S. Graves (EPA/Region III) - Using Perl Scripts to Import Data into CIS: An Example Using USGS Ground Water Site Inventory Data Rich Heiberger (Temple University) - Demonstration of ESS, S-Plus, and Trellis Graphics . William P. Smith (EPA/OPPE/CES) - CD Toxic Release Inventory (TRI) Data Explorer 5:30-6:30 Wine and Cheese Party - Hosted by William Tash, Vice Provost Pool House for Research, Temple University ------- Wednesday, May 12,1999 8:00-9:00 CONCURRENT PRESENTATIONS Data Integration and Quality: Vision for the Future (Chair: Ruth Allen, EPA) Susan Oevesa, D. Grauman, W. Blot, G. Pennello, R. Hoover, and J. Fraumeni (NCI) - Atlas of Cancer Mortality in the United States, 1950-94 Ruth Allen (EPA) - Surveillance Improvement Report Dining Room Analysis of Cleanups (Chair: Michael J. Messner, EPA/OGWDW) Room B . Michael J. Messner (EPA/OGWDW) - Cryptosporidium Occurrence in the Nation's Drinking Water Sources . Bimal Sinha (EPA/OPPE/CES) - Statistical Estimation of Average Reid Vapor Pressure of Regular Gasoline Sampling and Design Issues in Environmental Studies (Chair: Tony Olsen, EPA/NHEERL) . David Marker (WESTAT) - Sample Designs for Environmental Data Collection: Ranked Set Sampling and Composite Sampling Paul D. Sampson (University of Washington) - Monitoring network design with applications to regional air quality RoomD 9:00-9:15 Break 9:15-11:15 Statistical Training Session (Videotapes) RoomC ------- Wednesday, May 12,1999 (Continued) 9:15-10:45 CONCURRENT PRESENTATIONS Some Analyses and Potential Analyses at EPA (Chair: Doreen Sterling, EPA/CEIS) . Mike Barrette (EPA/OE) - Integrating Data for Planning and Targeting . Tom DeMoss and Tom Pheiffer (EPA) - The Mid Atlantic Integrated Assessment Program (MAIA) . John Moses (EPA/CEIS) - Strategy to Address Evolving Environmental Information Needs RoomD Measurement Issues Related to our Water Supply (Chair: Barnes Johnson (EPA/OSWER/OSW) Andrew Schulman, Jennifer Wu, and Benjamin Smith (EPA/OGWDW) - Forays into the Unforgiving - Occurrence Estimation in the Realm of Data with Multiple Censoring Points (arsenic in the public water supply) Henry Kahn, Helen L. Jacobs, and Kathleen A. Stralka (EPA/OW/EAD) - Estimated Water Consumption In The U.S. Based On The CSFII . Virginia A.Colten-Bradley (EPA/OSWER/OSW) - Development of a Neural Network Tool for Evaluation of Waste Management Unit Designs Dining Room Assessing Risk (Chair: Elizabeth Margosches (EPA/OPPTS) . Mary Marion (EPA/OPPT) - Simulation and Acute Dietary Risk Assessments . David Pawel (EPA/OAR) - Proposed EPA Methodology for Assessing Risks from Indoor Radon . Elizabeth H. Margosches, Ph.D., Jennifer Seed, Ph.D., and Khoan T. Dinh, Ph.D. (EPA/OPPTS) - Health Data: How Do We Use It To Protect the Public/Environment? Margaret Conomos (EPA/CEIS) - Discussant RoomB ------- Wednesday, May 12,1999 (Continued) 10:45-11:00 Break 11:00-11:30 PLENARY SESSION (Chair: Barry Nussbaum, EPA/CEIS) . Woollcott Smith (Temple University) - A Walk on the Wild Side of Statistical Communication Dining Room 11:30-12:30 PLENARY SESSION (Chair: Doreen Sterling, EPA/CEIS) - Robert English (EPA) - Proposed Information Management Office Dining Room 12:30-2:00 Lunch 2:00-4:00 Statistical Training Session (Videotapes) RoomC 2:00-3:30 CONCURRENT PRESENTATIONS The Visual Presentation of Data (Chair: Al Morris, EPA, Dining Room Region III) Al Morris (EPA) - Enviroviz-Tuming Numbers into Visual Relationships . David Mintz (EPA/OAR/OAQPS) - Methods for Displaying Temporal and Spatial Trends . Daniel Carr (George Mason University) - Two Templates for Visualizing Georeferenced Statistical Summaries Listening To Our Information Customers (Co-Chairs: Brendan Room D Doyle (EPA/CEIS) and Margaret Morgan-Hubbard (EPA/Office of Communications) Panel: Margaret Morgan-Hubbard, Director, EPA Office of Communications, Brendan Doyle, (Acting) Director, CEIS Customer Survey and Access Division, Emma McNamara, (Acting) Director, EIMD, OIRM, and Pat Bonner, EPA Customer Service 8 ------- Wednesday, May 12,1999 (Continued) Application of Sampling in Aquatic Resources (Chair: Henry Kahn, EPA/OW/EAD) . Henry Kahn and Silvestre Colon (EPA/OW/EAD) - Composite Sampling Analysis of Contaminant Levels in Fish . Anthony R.Olsen (EPA/NHEERL) - National Fish Tissue Contaminant Lake Survey: A New Spatially-Restricted Survey Design . Barnes Johnson (EPA/OSWER) - How to Survey Water Designs 3:30-3:45 Break RoomB 3:45-5:15 PLENARY SESSION: Statistics and Information at EPA as we Start a New Century: Where Are We Going? (Chair: Phil Ross, EPA/CEIS) . Larry Cox (EPA/ORD/NERL) . Karen Klima (EPA/IW1) . Heather Case (EPA/CEIS) . G.P. Patil (Pennsylvania Slate University) Dining Room Thursday, May 13,1999 8:30-10:30 TRAINING Steven P. Millard (PSI) - Applying Monte Carlo Simulation Techniques with S-PLUS Dining Room 9:00-10:30 PRESENTATIONS (Note this time overlaps with above training) The Data Come In, the Data Go Out Room D Rick Westlund (EPA/OP) - Reducing Paperwork Burdens at EPA . Charlotte Cottrill (EPA/ORD) - EMPACT's Role in the 21st Century ------- Thursday, May 13,1999 (Continued) 10:30-10:45 Break WRAP-UP SESSION (Chair: Barry Nussbaum, EPA/CEIS) Dining Room 10-45 11 -45 " William R&ub, Deputy Assistant Secretary for Science Policy, Department of Health and Human Services - Perspectives on Data and Information from the Department of Health and Human Services 11:45-12:00 Door Prize and Closing Remarks Dining Room 10 ------- ABSTRACTS 1999 EPA Conference on Environmental Statistics and Information 4:00-6:00 Monday, May 10 CONCURRENT TRAINING SESSIONS WORKSHOP ON MONTE CARLO METHODS IN ENVIRONMENTAL STATISTICS Woollcott Smith Statistics Department, Temple University Peter Petraitis Biology Department, University of Pennsylvania This workshop is divided into two parts: 1. Smith will present an overview of modern computer intensive Monte Carlo methods. The review will include the statistical motivation as well as technical and philosophical advantages and disadvantages in using these methods in administrative and legal settings. We will briefly describe how these methods are used to attack hard statistical problems in missing data imputation, measurement error and Bayesian analysis. Finally the details of randomization and simulation methods will be illustrated using a basic aired comparison design. 2. Petraitis will present a case study on the pros and cons of using randomization methods as an alternative to analysis of variance and the analysis of covariance. ------- 10:45-12:00 Tuesday, May 11,1999 STATISTICS, INFORMATION, AND GPRA (CHAIR: GEORGE BONINA, EPA) ANALYTIC CHALLENGES AND THE GOVERNMENT PERFORMANCE AND RESULTS ACT Judith Calem Lieberman OCFO, US Environmental Protection Agency The Government Performance and Results Act (GPRA) of 1993 set into motion a spate of activity in Agency strategic planning and accountability. In essence a legal constitution for good management, the GPRA requires federal agencies to set goals, measure performance, and report on the degree to which goals are met. It also places emphasis on attaining results rather than tracking program activities. The Office of the Chief Financial Officer has been leading EPA's effort to meet GPRA's statutory requirements, which includes development of a 5-year strategic plan, annual performance plans (and budgets), and annual performance reports. During the first cycle of GPRA implementation, several analytical challenges have been revealed. The most significant ones relate to identification of outcome goals, development of performance measures, validation/verification of performance data, and comparison of performance with annual goals. Working through these challenges will require a good understanding of the Agency's mission, a little creativity and the analytical skills to understand the impact of program activities on emironmental results. LOCAL APPLICATIONS OF EPA DATA (CHAIR: RON SHAFER, EPA/CEIS) THE BALTIMORE COMMUNITY ENVIRONMENTAL PARTNERSHIP: LESSONS LEARNED Henry Topper US Environmental Protection Agency In this case study, participants in the Baltimore Community Environmental Partnership will describe their experiences and present lessons they have learned. The experience presented will be based on a three-year project involving a Partnership among the residents, governments, and businesses in south Baltimore and northern Anne Arundel County. This Partnership worked together to begin addressing the long term environmental and economic concerns in four neighborhoods in south Baltimore and northern Anne Arundel County. For many years, both residents and businesses in this heavily industrialized section of the metropolitan Baltimore area have expressed concerns about health and the environment in their neighborhoods. By working ------- together in a Partnership, the community completed a comprehensive review of all aspects of its environment and has begun work to implement a plan to make real improvements. The Partnership has taken a holistic view of community problems and has developed efforts to address a broad range of issues facing the community including health concerns, housing issues, illegal dumping, subsistence fishing, park restoration and enhancement, community gardening, economic development, air quality, and crime. Based on this holistic approach, the Partnership has begun to develop an understanding of the complexity of the environmental stresses facing the community and the need for a multifaceted approach to improving community health and building a sustainable community. In the area of community health, Partnership committees are now working to address the issues of indoor air, fish consumption, truck traffic, and industrial toxic releases. As a part of this effort, the Air Committee of the Partnership completed a comprehensive screening analysis of air releases from all the businesses and facilities in and around the Partnership area. This analysis, based on exposure modeling, has given the community information on the cumulative concentrations of toxics from all sources in each of the four Partnership neighborhoods. The Air Committee has developed a protocol to compare these modeled concentrations with established health effect values to determine areas for pollution prevention. The committee has also developed a protocol and screened for potential combined effects of multiple chemicals that have similar target organs, e.g. all the chemicals that are respiratory tract irritants. As a result of the work of the Air Committee, the community now has some key parts of the information it needs to monitor and improve the local environment. THE DEPARTMENT OF ENVIRONMENTAL PROTECTION COMPLIANCE REPORTING SYSTEM Kimberly Nelson Pennsylvania Department of Environmental Protection The Pennsylvania Department of Environmental Protection (DEP) has made significant strides in improving data management. PA DEP has successfully integrated across more than 12 programs data to present a holistic view of the people and places it regulates. The data reside in the DEP client/site database which is fully integrated with departmentwide application processing and compliance reporting systems. The DEP compliance reporting system is one of the few systems in the country that can track multi-media inspections, violations, penalties and enforcement actions for a single facility and is the only system in the country that is on-line for citizens to track compliance activities. The client/site system also is integrated with the department's new Pennsylvania Facility Analysis System, a web based CIS application that went on-line for the public in March. Currently, the department is focusing priority attention on an Environmental Futures Team whose charge it is to develop a plan for measuring environmental outcomes. ------- OPPT'S RISK-SCREENING ENVIRONMENTAL INDICATORS MODEL* Bouwes, N. and Hassur, S. Office of Pollution Prevention and Toxics, U.S. Environmental Protection Agency S. Keane, E, Fechner Levy, B. Firlie, and Walkling, R. Abt Associates, Inc. The Toxics Release Inventory (TRI) provides raw data on the quantities of chemicals released by US manufacturing facilities, but these raw data alone do not provide information about the relative toxicity or exposure potential of these releases. The Office of Pollution Prevention and Toxics (OPPT) of the US EPA has created the Risk-Screening Environmental Indicators Model to provide a risk-based perspective of these releases, in a PC-based model. The Indicators Model integrates toxicity scores with a measure of exposure potential and the size of the potentially exposed population to calculate individual Indicator Elements for each combination of facility, chemical, and release media reported under TRI. Each year of reporting generates approximately 250,000 of these Elements which are summed to provide overall Indicator Values. The Indicator Elements can also be summed to create sub-Indicators that rank relative impacts by medium, chemical, geographic area, industry sector or a combination of these and other variables. This flexibility provides the analyst with the opportunity to examine trends year-to- year, and to rank and prioritize chemicals, industries and regions for strategic planning, risk- related targeting for enforcement and compliance purposes, and community-based environmental protection. The model also permits the user to investigate the relative influence of toxicity, exposure and population on the results. 'Work supported under EPA Contract Number 68-W6-0021, WA#3-02. STATISTICAL METHODS FOR LAB AND AIR QUALITY DATA ANALYSIS (CHAIR: LARRY COX, EPA/ORD/NERL) STATISTICAL MODELING OF MULTIPLY CENSORED DATA Mary Lou Thompson and Kerrie Nelson The National Research Center for Statistics and the Environment, University of Washington Laboratory analyses in a variety of contexts may result in doubly left censored measurements, i.e. amounts of contaminants of concern may be reported by the laboratory as "non-detects" or "trace". The analysis of singly censored observations has received attention in the biostatistical (e.g. in the context of survival analysis) and in the environmental literature. We consider maximum likelihood and semi-parametric approaches to linear models in the doubly censored setting. ------- TREND ESTIMATION USING WAVELETS Peter Craigmile Department of Statistics, University of Washington A common problem in the analysis of environmental time series is how to deal with a possible trend component, which is usually thought of as large scale (or low frequency) variations or patterns in the series that might be best modeled separately from the rest of the series. Trend is often confounded with low frequency stochastic fluctuations, particularly in the case of models such as fractionally differenced processes (FDPs), which can account for long memory independence (slowly decaying auto-correlation) and can be extended to encompass non- stationary processes exhibiting quite significant low frequency components. In this talk we assume a model of polynomial trend plus fractionally differenced noise and apply the discrete wavelet transform (DWT) to separate a time series into pieces that can be used to estimate both the FDP parameters and the trend. The estimation of the FDP parameters is based on an approximation maximum likelihood approach that is made possible by the fact that the DWT decorrelates FDPs approximately. Once the FDP parameters have been estimated, we can then test for a non-zero trend. After outlining the work that we have done to date on testing for non- zero trends, we demonstrate our methodology by applying it to an air quality time series. METEOROLOGICAL ADJUSTMENT OF SURFACE OZONE FOR TREND ANALYSIS: PICK AN ANSWER, ANY ANSWER Joel H. Reynolds NRCSE, Department of Statistics, University of Washington A variety of statistical methods for meteorological adjustment of surface ozone have been proposed in the literature over the last decade. As part of a larger review of the literature, we summarize and compare six different methods applied to the analysis of surface ozone observations in the Chicago region from the 1981 -1991 period: nonlinear regression, regression tree models, extreme events models, time-series filtering, nonlinear additive time-series models, and canonical covariance analysis. Differences in the resulting trend analyses are discussed in terms of differences in each analysis' spatial domain and choice of ozone statistic. The review highlights the need for development of techniques for extreme value analysis of space-time processes. ------- 1:15-2:30 Tuesday, May 11,1999 DATABASE: THE MANAGER'S VIEW (PANEL SESSION) Philip Lindenstrutb, Michael A. Mundell, and Abraham Siegel US Environmental Protection Agency The Panel will present for discussion several issues involved in the administration of a national database. These issues start with requirements for the database and addresses optional data fields, data quality, data ownership, database management issues, and support for the system during its life cycle. Those on the Panel would like their initial presentations to stimulate a discussion of these issues with the attendees. ENSURING THE QUALITY OF ENVIRONMENTAL INFORMATION (CHAIR: NANCY WENTWORTH, EPA/ORD) USING SIMSITE TO ILLUSTRATE SAMPLING TECHNIQUES Malcolm J. Bertoni Center for Environmental Measurements and Quality Assurance, Research Triangle Institute The Simulated Site Interactive Training Environment (SimSITE) is a computer-based training support system that helps environmental scientists and engineers learn how to plan a field investigation at a hazardous waste site. Through the use of a graphical user interface provided by the ArcView geographic information system (CIS), training participants apply concepts such as Data Quality Objectives (DQOs), Data Quality Indicators (DQIs), statistical sampling design, and Data Quality Assessment (DQA). SimSITE contains statistical design and analysis tools and sampling simulation routines that allow the participants to develop and implement sampling plans that satisfy their DQOs. SimSITE then generates a data set (including sampling and measurement errors), and allows the participants to make decisions about whether or not to clean up areas of the artificial site, based on their statistical analysis of the data. At the end of the simulation, the features of the underlying true contamination are revealed to illustrate the phenomenon of decision errors. During this interactive presentation, the features and classroom uses of SimSITE will be demonstrated. ------- MODELS AND MODEL ASSESSMENT OF ENVIRONMENTAL DATA (CHAIR: MARY LOU THOMPSON, UNIVERSITY OF WASHINGTON) DEVELOPMENT OF A LINKED PHARMACOKINETIC- PHARMACODYNAMIC MODEL OF METHYLMERCURY-INDUCED DEVELOPMENTAL NEUROTOXICITY T.A. Lewandowski, S.M. Bartell, R.A. Ponce, C.H. Pierce, and E.M. Faustman Department of Environmental Health, University of Washington Methyl mercury (MeHg) has been shown to cause adverse developmental effects in human and animal conceptuses exposed in utero. A toxicological model of the disposition and cellular action of MeHg in the developing fetus can be used to estimate health outcomes for various levels of exposure. Modeling can also incorporate differences in dose rate, chemical species, or inter-species variability. A linked toxicokinetic and toxicodynamic model for MeHg has been developed for the rat based on work performed in our laboratory. The toxicokinetic model incorporates many of the changes in organ size and blood flow associated with gestation. In the toxicokinetic model, changes in the population of committed fetal neural cells have been estimated based on the observed effects of MeHg on rates of cellular death, proliferation and differentiation in vitro. We are currently determining these rates in vivo using BrdU-Hoechst flow cytometry. The toxicokinetic model demonstrates an adequate fit to experimental loxicokinetic data. For example, 3 days after a dose of 1 mg/kg (given on day 16 of gestation), the model predicts fetal brain and fetal blood levels within 10% of the values observed by Wannag (1976). In terms of toxicodynamic effects, the model predicts 20% and 65% decreases in the number of committed neural cells (on gestational day 15, relative to untreated baseline) at fetal brain concentrations of 10 and 50 umol/kg. It is anticipated that the existing model can be extended to address other species (i.e., humans) and other developmental toxicants which act by similar mechanisms (i.e., cell cycle disruption). Sponsored by the following grants: USEPA R825358 and CR825173 and NIEHS T32ESO-7032. BAYESIAN MODEL ASSESSMENT Samantha Bates and A. E. Raftery Department of Statistics, University of Washington, Cullen, A.C. Graduate School of Public Affairs, University of Washington In this paper we discuss a Bayesian method of analysis which incorporates both prior knowledge of the distributions of the inputs to a deterministic model and any available data on the model inputs and outputs. This method uses Monte Carlo simulation from the prior distributions for the inputs and resampling of these simulations with weights determined by the observed data under ------- the sample importance resampling scheme of Rubin. The method yields posterior distributions for the output from which to find distributions for quantities of interest. The method also allows the separation of the contributions of variability and uncertainty on the posterior distribution of soil concentration. We will present an application of this method to modeling poly-chlorinated biphenyl (PCB) concentrations in various media at a Superfund site in New Bedford Harbor (NBH), MA. Dredging during this clean-up of the Harbor exposes inhabitants of the surrounding region to PCB contaminated air, soil and plants. A deterministic model for PCB concentration in soil was developed by Cullen (1992). The Bayesian method is used to find distributions for the PCB concentration in soil at this site. In addition we will contrast the results of this Bayesian method with those of a traditional Monte Carlo approach and a trial-and-error approach. i PARETO OPTIMAL MULTI-CRITERIA MODEL ASSESSMENT Marianne Turley, E. David Ford, and Joel Reynolds University of Washington Evolutionary computation (EC) is an optimization technique for finding Pareto optimal solutions to multiple objective functions. It borrows ideas from evolutionary theory to direct the optimization search through the parameter space. We applied this optimization to process models to improve model assessment by requiring a solution, a model parameterization, to achieve multiple criteria simultaneously. In this talk, I will discuss the algorithm, two alternative search errors and some examples. 2.45-4:00 Tuesday, May 11, 1999 USE OF THE INTERNET FOR SHARING STATISTICS (CHAIR: STEVE HUFFORD, EPA) ENVIROFACTS WAREHOUSE: ENVIRONMENTAL DATA ON THE INTERNET - EMPOWERING THE CITIZEN TOWARD ENVIRONMENTAL PROTECTION AND AWARENESS Pat Garvey US Environmental Protection Agency Governments and the courts are acknowledging more and more that the Public has a right to know what is being discharged and released to the environment. The U S Congress and the Executive Branch have taken decisive action to ensure this public right to access of data and information. The U.S. EPA created the Envirofacts Warehouse to provide the public with direct access to the vast amounts of information and data in its national program environmental data systems. The 8 ------- Envirofacts Warehouse helps EPA fulfill its responsibility to make information available to the public, as required by federal legislation and Executive Order. Envirofacts is available from the Internet, (www.epa.gov/enviro) allowing EPA to disseminate information quickly and easily. Envirofacts Warehouse contains: a relational database of the national databases on Superfund (abandoned hazardous waste) sites, hazardous waste handlers, discharges to water, toxic releases, air releases, and drinking water suppliers, the relational database also contains the facility index system, the Envirofacts Master Chemical Integrator, locational reference tables, and, spatial data and demographic data from the other sources. Internet applications are available and part of the Envirofacts Warehouse Internet site to provide easily designed queries to the databases and to create maps and other reports. The Presentation shows the capabilities and reasons for the Envirofacts Warehouse. The presentation demonstrate the features and principles behind the design of the Web site, the database design and model and demonstrates the various application features and query options from the Web. The presentation will demonstrate: How On-line Queries and Results are useful to the concerned public, interested organizations, governmental regulatory staff and to Environmental Officer of a plant, facility or company; CIS Mapping capabilities and Outputs that are On-line and what are the CIS capabilities in the future; Data refresh schedules and the importance of On-line Documentation; and Customer Feedback procedures for data quality and user needs. The presentation will address the US EPA directions and program initiatives in public access of governmental data and community empowerment with environmental data. NATIONAL BIOLOGICAL INFORMATION INFRASTRUCTURE Anne Frondorf U.S. Geological Survey This presentation will provide a brief description/overview of the National Biological Information Infrastructure (NBII) program, a collaborative effort to build a distributed, Internet- based federation of biological science data, information and analytical tools. Examples of the types of data and information available from the NBII and the types of different agencies and organizations and partnerships involved in building the NBII will be provided. Two key elements of the NBII "infrastructure" (i.e. the standards-related activities that help to support and pull together this distributed data network) will be highlighted. These are the ------- development of a biological metadata content standard (and an accompanying biological metadata clearinghouse network) and the continued development of the Integrated Taxonomic Information System (ITIS) as a standard reference for biological nomenclature and taxonomy ITIS is a partnership among USGS, EPA, NOAA, USDA, and the Smithsonian Institution. EPA'S ENVIRONMENTAL INFORMATION MANAGEMENT SYSTEM Bob Shepanek Office of Research and Development, US Environment Protection Agency Presented is an integrated vision for scientific information management approaches supporting monitoring and assessment activities within the US EPA's, Office of Research and Development (ORD). This vision was developed based upon lessons-learned from the implementation of several scientific information management systems and from development of the ORD's strategic and implementation plans for scientific information management. The vision reflects that effective management of scientific information must address technical, cultural and management challenges. Technical challenges include management and integration of metadata, data, and the modeling, analysis, and visualization tools used as part of assessment activities. Cultural challenges relate mainly to the protection of intellectual capital produced by individual investigators. Management issues include commitment of adequate resources for systems development and operation, support for related policies and procedures, and appropriate incentives for involvement by staff and project participants. EPIDEMIOLOGY AND RISK ASSESSMENT CUMULATIVE AND/OR AGGREGATE RISK ASSESSMENT (CHAIR: RUTH ALLEN, EPA) FOOD QUALITY PROTECTION ACT AND ITS IMPLEMENTATION: AN OVERVIEW OF STATISTICAL AND PROBABILISTIC ISSUES FACING THE OFFICE OF PESTICIDE PROGRAMS David Miller US Environmental Protection Agency With the passage of the Food Quality Protection Act, the Agency's Office of Pesticide Programs is now required to aggregate risks from pesticides across exposure pathways and to accumulate risks from pesticides across chemicals. As a result and in an attempt to develop better risk and exposure estimates that consider the probabilities associated with simultaneous exposures, the Office of Pesticide Programs is now using probabilistic (Monte Carlo) techniques in its risk and exposure assessments. This had necessitated that OPP develop further refinements to its risk assessment procedures. This presentation will provide an overview of FQPA and discuss its major science impacts. It will review the traditional (deterministic) type methods used by OPP in exposure and risk assessments as well as the probabilistic techniques now being used with increasing frequency. Finally, it will review some of the statistical and policy issues which are 10 ------- now being considered by the Office as it implements the probabilistic risk analysis framework now in place. FINDING A STATISTICAL DISTRIBUTION TO USE IN THE MONTE CARLO EXPOSURE ASSESSMENT OF LIVESTOCK COMMODITIES Hans D. Allender, Ph.D., P.E. US Environmental Protection Agency The presentation develops a methodology to find a frequency distribution of animals' contamination because of the ingestion of pesticide-contaminated food. Given the percentage of crops treated (%CT), the methodology calculates the distribution of animals that will be exposed. Determination of the frequency distribution can be used later in connection with the application of a Monte Carlo Analysis to the Exposure Assessment of humans to contaminated animal products. The flexibility of the method allows the construction of frequency distributions to multiple cases with different %CT. A non-agricultural example explains the process in a way that everyone can relate to the calculations. The ubiquitous spreadsheet is used as the preferred medium to obtain random numbers, recalculate probabilities, generate totals, and produce graphics. A detail explanation of how the spreadsheet is constructed ensures the audience the possibility of duplicating the exercise. The simplicity of the methodology makes the process easy to replicate and to extend to similar situations. It also allows the study of severe contamination by pointing out the percentage of animals which diet has been contaminated from different sources. In summary, the article indicates a way of calculating a realistic statistical distribution of animal contamination based on ingestion of contaminated food. Also, the procedure can be extended to non-agricultural situations. APPLYING EPIDEMIOLOGY TO STUDY THE PREVENTION OF MAJOR CHEMICAL ACCIDENTS Breeda Reilly Chemical Emergency Preparedness and Prevention Office, US Environmental Protection Agency Mandated by the Clean Air Act Amendments of 1990, accident histories from some 69,000 chemical facilities in the United States will become available in the fall of 1999. This presentation describes the challenges of using the tools of epidemiology with this data to investigate drivers of severity and frequency of accidents. This study was proposed by Center for Risk Management and Decision Processes at the Wharton School and is a major focus of an EPA cooperative agreement. The Major Accident Epidemiology Project aims to contribute to the process of determining which plants are most likely to incur major events, by ascertaining whether certain predictors (characteristics of manufacturing plants or of the companies that own them) are associated with increased probability of a major event. This knowledge can be helpful in two ways: (1) plants with such risk factors can be monitored more closely (by the companies themselves as well as by regulators and other stakeholders); and (2) these associations may provide clues about characteristics of companies' organizational systems that act as underlying causes of major events. 11 ------- STATISTICAL RESEARCH ISSUES IN QUALITY ASSURANCE (CHAIR: JOHN WARREN, (EPA/ORD) INTEGRATING DATA QUALITY INDICATORS (DQIS) INTO DATA QUALITY OBJECTIVES (DQOS) John Warren Quality Assurance Division, Office of Research and Development, US Environmental Protection Agency EPA Order 5360.1 CHG 1 (July 1998) requires all EPA organizations to use a systematic planning process to develop acceptance or performance criteria for the collection, evaluation, or use of environmental data. Systematic planning identifies the expected outcome of the project, the technical goals, the cost and schedule, and the acceptance criteria for the final result. The Data Quality Objectives (DQO) Process is the Agency's recommended planning process when data are being used to select between two opposing conditions, such as decision-making or determining compliance with a standard. The outputs of this planning process (the data quality objectives themselves) define the performance criteria. The DQO Process is a seven-step planning approach based on the scientific method that is used to prepare for data collection activities such as environmental monitoring efforts and research. It provides the criteria that a data collection design should satisfy, where to collect samples; tolerable decision error rates; and the number of samples to collect. Data Quality Indicators (DQIs) are the individual performance characteristics specified in the mandatory Quality Assurance Project Plan (QAPP) that accompanies any environmental data collection. Typical DQIs include precision, completeness, comparability, and sensitivity. This discussion centers on how the Agency can effectively make the link between DQOs and DQI A PERFORMANCE EVALUATION OF THE METHOD DETECTION LIMIT Charles White US Environmental Agency Performance criteria specified in the original (1981) publication are evaluated using EPA data. Data available for preliminary evaluation include over thirty combinations of pollutant by chemical analytical technique. 12 ------- 5:15-8:00 Tuesday, May 11,1999 POSTER AND COMPUTER SESSIONS INFORMATION VISUALIZATION - TURNING DATA INTO INFORMATION YOU CAN EASILY UNDERSTAND Stuart H. Kerzner US Environmental Protection Agency, Region III The poster shows "EnviroSnax", which are graphics showing tidbits of environmental information in ways that are easy to understand and highlight past or future environmental impacts on the Region. They are used for management briefings, public use, press releases and presentations. WHAT THE CEIS NATIONAL TELEPHONE SURVEY WILL BE ABLE TO TELL EPA'S INFORMATION PROVIDERS Heather Case EPA Customer Service, US Environmental Protection Agency This presentation will describe the potential uses of the results from a national telephone survey recently completed by the CEIS. The national telephone survey, which began in February 1999, was designed to: identify and describe environmental information customers within the U.S. population; identify the public's high interest environmental topics; and determine the public's access preferences for obtaining and using information. The survey results will be used to guide CEIS information product and service development. The survey results will be available for peer review in mid-August 1999. This presentation will highlight potential uses by information providers in the Programs and Regions. METHODS TO MINIMIZE HUMAN ERROR IN REPORTING ANALYSIS RESULTS Susannah Dillman US Environmental Protection Agency Using "Paste Special" multiple graphs and tables in Excel can be linked to the report in WordPerfect 8 and updated all at once. 13 ------- USING PERL SCRIPTS TO IMPORT DATA INTO CIS: AN EXAMPLE USING USGS GROUND WATER SITE INVENTORY DATA John S. Graves US Environmental Protection Agency, Region III One of the primary tools in EPA Region HI for evaluating environmental data is the Geographic Information System or GIS. A difficulty in using a GIS is that environmental data is not always readily available in a GIS format. The Perl computer language was used to translate U.S. Geological Survey ground water data into a format, which could then be imported into a GIS. This poster presents relevant portions of the Perl script used with explanations of the data processing steps undertaken as well as examples of GIS generated plots from the resulting data in EPA Region III. DEMONSTRATION OF ESS, S-PLUS, AND TRELLIS GRAPHICS Richard M. Heiberger Department of Statistics, Temple University ESS [Emacs Speaks Statistics] is a GNU Emacs interface for interactive statistical programming and dala analysis. Languages supported include S-Plus, XLispStat, and SAS. ESS provides a standard interface between statistical programs and statistical processes and has as one of its goals an increase in efficiency for statistical programming and data analysis, over the usual tools. ESS displays source code in these languages with syntactic indentation and highlighting of source code. ESS interacts "directly" with the statistical package. ESS allows intelligent interaction with the transcript of previous interactive session. Trellis is a graphical display system that uses multiple panels to simultaneously view relationships between differenl variables in your inuhivariate dataset through conditioning. Trellis was developed at Bell Labs as part of S-Plus. We will have a live demonstration of ESS, S-Plus, and trellis graphics. 1 will analyze and display several examples of continuous and discrete multivariate and time series data sets. CD TOXIC RELEASE INVENTORY (TRI) DATA EXPLORER William P. Smith Center for Environmental Information and Statistics, US Environmental Protection Agency The TRI Data Explorer is a web product designed to provide the user quick and easy queries to EPA's TRI Chemical release data for years 1988-1997. The Explorer's portal to TRI chemical release data is through multiple data views which provide detailed and comprehensive chemical reports at all geographic levels down to the facility level by year or across years. In addition for each chemical the explorer provides interesting information such as factoids and information on the top 100 releasing facilities and counties. 14 ------- The TRI Explorer will help our customers find information on topics such as: the chemicals released in their county during the year; the facilities that are releasing these chemicals in the county, state or the nation; the top chemicals released in their county, the state, or the nation; and, the top 100 ranking facilities and counties in the nation that release a given chemical, or all chemicals. And much more. The application runs on the web at httD://athena.was.eDa.gov:2002/~wsmith/tri2/explorer.htm. or on CD for running off-line without the Internet. The CD application will be demonstrated. 8:00-9:00 Wednesday, May 12 DATA INTEGRATION AND QUALITY: VISION FOR THE FUTURE (CHAIR: RUTH ALLEN, EPA) ATLAS OF CANCER MORTALITY IN THE UNITED STATES, 1950-94 Susan Devesa, D. Grauman, W. Blot, G. Pennello, R. Hoover, and J. Fraumeni Division of Cancer Epidemiology and Genetics, National Cancer Institute The geographic patterns of cancer around the world and within countries have provided important clues to the environmental determinants of cancer. In the mid-1970s the NCI prepared county-based maps of cancer mortality in the U.S. that identified distinctive variations and hot- spots for specific tumors, thus prompting a series of analytic studies of cancer in high-risk areas of the country. We have prepared an updated atlas of cancer mortality in the United States during 1950-94, based on mortality data from the National Center for Health Statistics and population estimates from the Census Bureau. Rates per 100.000 person-years, directly standardized using the 1970 US population, were calculated by race (whites, blacks) and gender for 40 forms of cancer. The new atlas includes more than 140 computerized color-coded maps showing variation in rates during 1970-94 at the county (more than 3000 counties) or State Economic Area (more than 500 units) level. Summary tables and figures are also presented. Selected maps for the 1950-69 period are also included. Accompanying text describes the observed variations and suggests explanations based in part on the findings of analytic studies stimulated by the previous atlases. The geographic patterns of cancer displayed in this atlas should help to target further research into the causes and control of cancer. 15 ------- ANALYSIS OF CLEANUPS (CHAIR: MIKE MESSNER, EPA/OGWDW) CRYPTOSPORIDIUM OCCURRENCE IN THE NATION'S DRINKING WATER SOURCES Michael J Messner, Ph.D. US Environmental Protection Agency Cryptosporidium is a microbial pathogen which occurs in most of the nations surface waters. Information on cryptosporidium occurrence will be used in estimating the costs and benefits of future drinking water regulations. A recently completed survey generated monthly estimates of cryptosporidium concentrations in the source waters of over 400 large drinking water utilities. With only two months of validated data in hand, it appears that 80 to 90 percent of the water volumes analyzed yielded zero oocysts. On its face, this sparsely of nonzero results appears to severely limit the data's usefulness. In this presentation, a Bayesian approach is outlined for estimating hierarchical model parameters and their uncertainties. Time permitting, the approach will be illustrated using a small simulated data set. SAMPLING AND DESIGN ISSUES IN ENVIRONMENTAL STUDIES (CHAIR: TONY OLSEN, EPA/NHEERL) SAMPLE DESIGNS FOR ENVIRONMENTAL DATA COLLECTION: RANKED SET SAMPLING AND COMPOSITE SAMPLING David Marker Westat Historically environmental statistics and survey sampling have had relatively limited interaction. Most environmental studies use pre-existing data collection locations, collect from known hot spots, and/or purposively select data collection locations. Efficient survey sampling that can support the evaluation of a wide range of hypotheses has been used to a lesser degree with environmental data than in health, education, or many other types of data. This talk will describe two NRCSE funded research activities that try to bridge this gap between survey sampling and environmental statistics. Ranked set sampling (RSS) is a method to potentially increase precision and reduce costs by using "rough but cheap" information to obtain a more representative sample before the real, more expensive sampling is done. We have explored under what conditions RSS becomes cost- effective for ecological and environmental field studies where the "rough but cheap" measurement has a cost. 16 ------- We are continuing to explore when alternative forms of two-phase sampling are preferable to RSS. Composite sampling has been proposed in environmental settings where the costs of measurement are high. It is hoped that by compositing data collected from multiple locations the cost savings will outweigh the loss of information on the individual locations. Unfortunately it is not clear how often this trade- off is successful. NRCSE has funded the collection of side-by-side individual and composite samples so that this trade-off can be explored with real data from a national survey of over 800 houses. The data collection protocol and types of planned analyses will be discussed for this ongoing activity. 9:15-10:45 Wednesday, May 12 SOME ANALYSES AND POTENTIAL ANALYSES AT EPA (CHAIR: DOREEN STERLING, EPA/CEIS) INTEGRATING DATA FOR PLANNING AND TARGETING Michael Barrette US Environmental Protection Agency For each major regulatory program implemented by EPA, the program office has designed databases to house the information critical to the program's needs. In a changing world, data users are now interested in looking at environmental information holistically, which means that databases must relate to each other. To plan its enforcement and compliance activities, EPA makes use of integrated data within the Integrated Data for Enforcement Analysis (IDEA) system. This system provides access to more than 15 databases maintained by EPA and other government agencies. When trying to compare across databases, of course many discrepancies and data errors are found. In this presentation several topics related to data quality and integration will be examined: What is the critical step needed in order to integrate information across databases at the facility level? Discussion will focus on EPA's data integration strategies. What are key methods that have used existing data to find high-priority sector and geographic issues? Discussion will focus on recent efforts to identify priority areas and sectors for inspection targeting. How can data integration be used to find violators? Discussion will focus on some concrete examples showing how comparison of databases can lead facilities that are improperly regulated. 17 ------- THE MID ATLANTIC INTEGRATED ASSESSMENT PROGRAM (MAIA) Tom DeMoss Environmental Services, U.S. Environmental Protection Agency, Region III Tom Pheiffer Atlantic Ecology Division, NHEERL, U.S. Environmental Protection Agency The MAIA program is an integrated environmental assessment program being conducted by USEPA, Region HI, and US EPA's Office ofjtesearch and Development, partnership with other Federal and State Agencies. Objectives of the MAIA program are to build partnerships and get ail stakeholders involved in helping to (1) identify questions needed for assessing major ecological resource area, such as ground water, surface water, forests, estuaries, wetlands, and landscapes; (2) characterize the health of each resource are, based upon exposure and effect information; (3) identify possible associations with stressors, including landscape attributes, that may explain impaired conditions for both specific resources and the overall ecosystem; (4) target geographic areas and critical resources for protection and restoration, and (5) monitor environmental management progress. Our experience with partners uncovered certain key principles of effective watershed management. They were (1) agreement on geologic boundaries and or units of assessment; (2) conduct an assessment of their biological condition of resources; (3) target management to real impairment based upon the biological assessments including TMDL, nutrients and habitat restoration; (4) have watershed approach be holistic or segment by segment bases upon nature of problem; (5) have five-year rotation to monitoring and to assessments to allow time for change of environment and for progress from management action; (6) buy-in stakeholders so assessment and monitoring plans use all available resources and innovative options; (7) success will be more cost-effective monitoring and management fixes. Successful State partnering involves early buy in well before products are developed. MAIA's emphasis on aquatic biology and habitat is a departure from the water quality standards/TMDL mentality and requires open dialogue with state biologists who must educate their managers on the importance of habitat preservation and restoration as the new wave of management of their aquatic resources. STRATEGY TO ADDRESS EVOLVING ENVIRONMENTAL INFORMATION NEEDS John Moses Center for Environmental Information and Statistics (CEIS), US Environmental Protection Agency While primarily a regulatory agency, the U.S. Environmental Protection Agency is devoting an increasing amount of its resources to responding to public requests for information about environmental quality, pollution sources, and human health and ecosystem concerns. ------- Additionally, the Agency must report annually to Congress on its progress in protecting human health and safeguarding the natural environment, as required under the Government Performance and Results Act (GPRA). Yet, in many cases, the data EPA needs to respond to public questions and to report on its progress are not readily available. The Evolving Information Needs Strategy addresses the gaps between the data the Agency needs and the data it currently has. Working with EPA Regional and Program Offices and external stakeholders, CEIS developed a two-phase strategy to identify and address some of the Agency's key environmental information gaps. Phase I is a general screening analysis for identifying major gaps in 26 key environmental problem areas and for setting priorities among these problem areas. Phase II is a methodology for performing a more in-depth analysis of and recommendations to address the gaps associated with each environmental problem area. This paper reports on the Phase I screening analysis, conducted from June through April 1999. MEASUREMENT ISSUES RELATED TO OUR WATER SUPPLY (CHAIR: BARNES JOHNSON (EPA/OSWER/OSW) FORAYS INTO THE UNFORGIVING- OCCURRENCE ESTIMATION IN THE REALM OF DATA WITH MULTIPLE CENSORING POINTS Andrew Schulman, Jennifer Wu, and Ben Smith US Environmental Protection Agency Under the Safe Drinking Water Act, the Agency is charged with establishing standards for allowable levels of contaminants in the Nation's public water systems. Central to the selection of the regulatory level is the determination of the relative benefits and costs likely to be achieved. Benefits and costs are directly proportional to the level of current occurrence. Consequently, sound decision making requires the best possible estimation of occurrence be utilized. In developing a new regulation for arsenic, the Agency has data from over twenty States covering a time span of up to twenty years. Because the current regulation is at a much higher concentration than new options under investigation, however, many State data sets are heavily censored by detection limits within the range of required estimation. This paper will discuss the data and the approaches EPA is considering for the assimilation of the data into national and mtra-system occurrence estimation. ESTIMATED WATER CONSUMPTION IN THE U.S. BASED ON THE CSFII Henry D. Kahn, Helen L. Jacobs, and Kathleen A. Stralka US Environmental Protection Agency Knowledge of drinking water intake is fundamental to the mission of the Office of Water and an important component of a number of programs at EPA. This presentation provides a summary of 19 ------- our recent efforts to generate up-to-date estimates of water intake by the population of the United States. To obtain current estimated water consumption distributions, we have analyzed the United States Department of Agriculture's (USDA's) Combined 1994-96 Continuing Survey of Food Intake by Individuals (CSFII) data set. Per capita water intake is estimated for three sources of water: municipal/tap, bottled, and other sources of water (i.e., private well, private cistern, or private or public well). For each source of water, distributions are generated for direct and indirect water consumption. The distributions by age, gender, race, socioeconomic status, and geographical region and separately for pregnant and lactating women are also estimated. Survey design and statistical methodology are discussed. We anticipate that the water consumption distributions will be used in a wide range of applications including: rules limiting amounts of microbes; disinfectant by-products (DBF) rules; radon and other drinking water contaminant rules; protection of sensitive populations and other exposure assessments. DEVELOPMENT OF A NEURAL NETWORK TOOL FOR EVALUATION OF WASTE MANAGEMENT UNIT DESIGNS Virginia Cohen-Bradley Economics, Methods, and Risk Analysis Division, Office of Solid Waste, US Environmental Protection Agency Samuel Figuli, Julia Lewis, and Katrin Arnold HyroGeoLogic, Inc. The Office of Solid Waste recently completed a neural network software tool designed for evaluating leachate concentrations in four different waste management units, with three different liner types. The purpose of the tool is to help non-hazardous industrial waste facilities determine the concentration for the constituent of concern that can be disposed of safely in a specific waste management unit design. The neural network software, EPA's Industrial Waste Management Evaluation Model (1WEM) is based upon EPA's ground-water fate-and-transport model, EPACMTP. EPACMTP was designed for national-level risk assessments. It is run in Monte Carlo mode, using hydrologic data representative of the United States. Seven parameters judged to be the most significant in EPACMTP were used to build four different neural network tools, one for each of the waste management units: landfill, surface impoundment, waste piles, and land application units. IWEM has a multi-layer perceptron architecture and was trained in back-propagation mode from target output generated by the Monte Carlo-style analyses with EPACMTP. Several different approaches to producing training- and test-data sets were used. In general, the comparison between the neural network and the EPACMTP results is good. The accuracy of the neural networks varies with the location of the EPACMTP response surface that is being simulated. 20 ------- ASSESSING RISK (CHAIR: ELIZABETH MARGOSCHES (EPA/OPPTS) PROPOSED EPA METHODOLOGY FOR ASSESSING RISKS FROM INDOOR RADON David Pawel, Ph.D. US Environmental Protection Agency Radon has been determined to be the second leading cause of lung cancer after cigarette smoking (NAS 1998). Based on methodology published by the National Academy of Sciences (NAS) in its BEIRIV report (NAS 1988) and in its "Comparative Dosimetry" report (NAS 1991), EPA has previously estimated that 13,600 lung cancer deaths in the U.S. each year are radon related (EPA 1992). Subsequently, the Agency sponsored a study by the NAS, which reviewed the large body of evidence about radon that has become available since their earlier reports. The new NAS study, BEIR VI (NAS 1998), confirmed that radon is a serious public health problem, and provided new estimates of radon risk and of radon-attributable lung cancer deaths, which were somewhat higher than EPA had projected previously, particularly for never smokers. The BEIR VI committee concluded, moreover, that about one-third of these cases are preventable if all homes above 4 pCi/L are remediated. We will discuss proposed revisions to EPA's methodology for calculating radon-related risk estimates in light of BEIR VI and the Agency's own previous analysis. These include estimates of attributable risk and risk per working level month (WLM). Attributable risk is the proportion of lung cancer deaths attributable to radon. Risk per WLM is the number of expected radon- induced cancer deaths for the current population divided by the corresponding total of past and future exposures. We will describe life table methods for calculating these quantities, and show how changes in smoking patterns might impact these estimates of risk. It is anticipated that this methodology would be used by EPA in a number of contexts, including: (1) updating its public information aimed at reducing residential radon exposures; (2) its assessment of risk from radon in drinking water; and (3) its assessment of risks associated with radium contaminated sites. HEALTH DATA: HOW DO WE USE IT TO PROTECT THE PUBLIC/ENVIRONMENT? Elizabeth H. Margosches, Ph.D., Jennifer Seed, Ph.D., and Khoan T. Dinh, Ph.D., US Environmental Protection Agency This talk will describe the types of data typically available for analysis by the EPA's Office of Pollution Prevention and Toxics, and how they are used. These data are submitted under various statutes or gathered from the open literature and are used to help decide to what degree the public or the environment may be at risk of incurring adverse effects if certain exposures occur. The decisions include such considerations as whether the available studies are experimental or observed in situ and how inferences may be made from various animal studies to the wild or to humans as well as inferences from one effect to another. Sampling and data collection issues, 21 ------- missing data, and data modeling are all critical statistical aspects of this activity. An example will be given those focuses on generalizing inferences from a dose-response model. 2:00-3:30 Wednesday, May 12 THE VISUAL PRESENTATION OF DATA (CHAIR: AL MORRIS, REGION III, EPA) ENVIROVIZ-TURNING NUMBERS INTO VISUAL RELATIONSHIPS Alvin R. Morris Director, Office of Environmental Data, US Environmental Protection Agency, Region HI We're drowning in data - ever hear that plaintive wail? While we may not be drowning, we are faced with under-utilizing data. Another more recent challenge facing us- in the spring of next year- is to prove to the congress, the public and others that we are using the funds they provide to actually improve the environment-how much, where and for what price. Data visualization can help solve both those challenges. This presentation will be the first presentation of outputs of a prototype program we named EnviroViz. A program that dynamically links air and water ambient and major point sources to: v\here they are located: in the Region, state, county, and watershed shows the 6-year trend for each of 7 air and 46 water parameters (stressors) the GPRA goals to the sub-objective level and shows for each sub-objective the associated FTE, contract $, state and tribal grants It's a new approach to more easily understanding the meanings embedded in environmental data and can be applied in many areasplease come see and comment. METHODS FOR DISPLAYING TEMPORAL AND SPATIAL TRENDS David Mintz Air Quality Trends Analysis Group, US Environmental Protection Agency EPA's Office of Air Quality Planning and Standards is tasked with developing an annual report on the nation's air quality. This report, entitled National Air Quality and Emissions Trends Report, uses various graphing techniques to present temporal and spatial trends in the data. This paper discusses the methods employed in the report, their strong points, and their limitations. Much of the graphical design is based on the principles of Edward Tufte and other leading authorities on the visual display of information. 22 ------- TWO TEMPLATES FOR VISUALIZING GEOREFERENCED STATISTICAL SUMMARIES Daniel B. Carr Center for Computational Statistics, George Mason University This paper presents two new templates for visualizing spatially-index statistical summaries. The first template called conditioned choropleth (CC) maps represents a powerful interactive extension of classed choropleth maps. The basic layout is a 3 x 3 matrix of panels containing nine juxtaposed maps. One conditioning variable corresponds to rows and the other to columns. The analyst controls the highlighting of map regions by manipulating row and column sliders that define acceptable intervals for the conditioning variables. A small tab in each panel shows a value summarizing the highlighted region values. The presence or absence of main effects and interaction are evident at a glance. Other analyst interactions including dynamic class interval selection and simultaneous pan and zoom for all panels. The examples emphasize study of human mortality rates for health service areas conditioned on environmental and demographic variables. The second template, called linked micromap (LM) plots, provides an alternative to traditional classed choropleth maps. The new design trades off region boundary resolution for more accurate or extensive statistical summaries. These summaries can be bar plots, dot plots, box plots, time series, line high plots for over a hundred variables and so on. Color provides a local link of each region's (or site's) statistical summary and it's spatial position in the micromap. Examples show numerous variations of this template. The discussion addresses pattern discovery and working in progress for drilling down from state to county to census tract. LISTENING TO OUR INFORMATION CUSTOMERS (PANEL SESSION) Margaret Morgan-Hubbard, Director, EPA Office of Communications Brendan Doyle, (Acting) Director, CEIS Customer Survey and Access Division Emma McNamara, (Acting) Director, EIMD, OIRM (invited), and Pat Bonner, EPA Customer Service US Environmental Protection Agency This session will give participants an overview of how EPA and CEIS are surveying the Agency's current and potential environmental information customers to better understand their needs and access preferences. Several examples of how customer feedback is helping to shape various EPA information products and services will be introduced. CEIS will re-cap lessons learned from the Center's customer surveys over the past two years and give an update on their national customer telephone survey (results due this fall). Session participants will have an opportunity to express their interests in using the Center's survey data for their own analyses and programs. The basics of using customer feedback on your products or services will also be covered. The panel agenda will include: 23 ------- Introductions: Brendan Doyle, CEIS Margaret Morgan-Hubbard, OCEMR: The importance of focussing your information product or service on your customers' needs and a vision for serving EPA's environmental information customers in the future. Brendan Doyle: Overview of what we've learned so far by implementing the CEIS customer survey plan and what we hope to learn from our national information customer telephone survey this fall. Emma McNamara: EPA's web sites- incorporating customer input and customer service principles into developing and maintaining a Web site. Pat Bonner: EPA Customer feedback 101- will discuss how "Hearing the Voice of the Customer" guidelines can help you to obtain useful customer feedback on your products, processes and services. APPLICATION OF SAMPLING IN AQUATIC RESOURCES (CHAIR: HENRY KAHN, EPA/OW/EAD) COMPOSITE SAMPLING ANALYSIS OF CONTAMINANT LEVELS IN FISH Henry D. Kahn and Silvestre Colon US Environmental Agency Samples offish formed by physically mixing, i.e., grinding together, a number offish into a combined, aggregate sample are referred to as "composite samples". Chemical analysis of composite samples is a cost-effective mechanism for estimating mean levels when the cost of analysis is high and the cost of obtaining sample units, such as individual fish, is relatively low. A possible concern in the analysis of composite sampling data is the absence of measurement results on individual units that comprise the composite. This presentation considers a set of data on contaminant levels in measured in composite samples offish and individual fish that constitute the composite samples. The results allow for comparison of composite and individual analyses. Additional topics discussed are: estimation of variance components associated with the composite samples, using measurements made on subsamples of the composites and effects of fish length and weight on contaminant levels. 24 ------- NATIONAL FISH TISSUE CONTAMINANT LAKE SURVEY: A NEW SPATIALLY-RESTRICTED SURVEY DESIGN Anthony R. Olsen NHEERL Western Ecology Division, U.S. Environmental Protection Agency In 1998, the U.S. Environmental Protection Agency initiated a national study offish tissue contaminants in lakes and reservoirs. The study requires the development of a survey design to meet the study objectives. For the national lake study, a list frame of waterbodies greater than 1 hectare is available. The frame provides information on the lake surface area and its geographic location, in the form of a geographic information system (GIS) coverage. However, the frame includes waterbodies that do not meet the definition of the target population. The frame includes 270,761 waterbodies. This paper develops the survey designs for the study and discusses how an underlying discrete global grid can be used to control the spatial distribution of the sample and to address the imperfection of the frame. The survey design does not use finite population sampling theory, but a continuous population in a bounded area theory that parallels it. The spatially-restricted design enables the concept of a systematic sample to be implemented while maintaining the ability to obtain design-based estimates and variance estimates. 8-30-10.30 Thursday, May 13 APPLYING MONTE CARLO SIMULATION TECHNIQUES WITH S- PLUS Steven P. Millard Probability Statistics and Information (PSI) Monte Carlo Simulation covers a broad range of topics, including simply generating random numbers, probabilistic risk assessment, bootstrapping to obtain the distribution of (and hence confidence intervals for) some statistic for which the distribution is unknown or not assumed, and permutation tests. This talk will discuss the concepts behind each of these main topics, then use examples to show you how to implement these methods using S-PLUS and ENVIRONMENTAL STATS for S-PLUS. 25 ------- 9:00-10:30 8:30-10:30 Thursday, May 13 THE DATA COME IN, THE DATA GO OUT REDUCING PAPERWORK BURDENS AT EPA Rick Westlund Office of Policy, US Environmental Protection Agency In the March 1995 Reinventing Environmental Regulation report, EPA established a long term commitment to identify and eliminate obsolete, duplicative, and unnecessary monitoring, reporting, and record keeping requirements. To date, EPA has removed more than 25 million baseline burden hours, and built an internal watchdog culture dedicated to avoiding unnecessary new paperwork burdens. Although total burden has continued to creep upward due to new statutory requirements and new right-to-know collections, EPA programs continue to develop creative approaches to chip away at burden without endangering environmental objectives. In addition, EPA is developing many enterprise-wide initiatives designed as strategic investments with the potential for much larger burden reductions three to five years from now. In the last several years the Agency has accelerated its efforts to improve information collection management, with a particular focus on reducing burdens associated with reporting and record keeping, while at the same time enhancing data quality, coordinating our data activities with States, improving our collection and display technologies, and compiling our data into a single Internet site. We have taken major steps, but there is still more to do. The public's right-to-know is now a fundamental cornerstone of our work at EPA, and we have all worked hard to put information into the hands of the American people in the belief that this is one of the best ways to protect public health and the environment. In the course of doing so, we have learned that the Agency's effective management of its data is central to the measurement of our progress in delivering the protections the American people expect. As we embark on a new era of information technology and enhanced public access to data, we are committed to minimizing our paperwork burden on the public while ensuring that our data are timely, accurate, useful to the public, and able to effectively inform our own decision making. The Agency has several initiatives underway to redesign or refocus the way we manage information collection with primary goals to reduce burden on the public while accomplishing our environmental protection mission. The most encompassing initiative is the recently launched reorganization plan involving the formation of a new information organization that will bring together all Agency information programs to better manage our information resources with an expressed goal of reducing burden on the public while enhancing the data quality and integrity as it is used within the Agency and made available to others outside the Agency. Another major initiative, started over a year ago after the 1997 Information Streamlining Plan, is the continued development of the Reinventing Environmental Information (REI) initiative. In its early stages, the plan focuses on data quality and building infrastructure, but burden reduction savings will become more apparent as the efficiencies in reporting options become available. 26 ------- The Agency has been very active working with the States on burden reduction especially through partnership workgroups with the Environmental Council of States (ECOS). The workgroup is identifying burden reduction opportunities by defining what information is and should be collected, how information is transmitted, and how information is used. The workgroup is also engaging industry, the public and others to help draft a tactical approach to burden reduction. Within the Agency, the program offices are developing a range of streamlining and reinvention initiatives to reduce burdens. They range from whole program streamlining as in the Office of Solid Waste's comprehensive review of the RCRA program to the Office of Air's reengineering of the pre-production certification program for new motor vehicles. 10:45-11:45 Thursday May 13 PERSPECTIVES ON DATA AND INFORMATION FROM THE DEPARTMENT OF HEALTH AND HUMAN SERVICES William F. Raub, Ph.D. Deputy Assistant Secretary for Science Policy Department of Health and Human Services The Department of Health and Human Services (DHHS) employs a wide variety of data and information systems as it seeks to enhance the well-being of Americans by providing for effective health and human services and by fostering strong, sustained advances in the sciences underlying medicine, public health, and social services. DHHS data-oriented efforts range from (a) collection of national vital and health statistics to (b) systematic surveillance focused on specific diseases and disorders to (c) special surveys oriented to particular public health issues and or particular population groups. A major contemporary challenge is to improve surveillance for nc\\ and reemerging infectious diseases in general while improving preparedness to detect and respond to potential acts of biological terrorism. 27 ------- Evaluation Form 1999 EPA Conference on Environmental Statistics and Information May 10-13,1999 Please help us improve future conferences by taking a few moments to provide us with your comments on this year's conference. 1 Overall Conference Evaluation Did you broaden your EPA contacts? Did you update your current knowledge? Did vou find exposure to new material1' Did you gain more agency-wide perspective? Were you able to exchange technical methods? Were you able to discuss problems and concerns? Please check one box Very Much Some Extent Limited Extent 2 Session Evaluation Workshop on Monte Carlo Methods in Environmental Statistics fWoollcott Smith and Peter Petraitis) EPA's Web Site and You (Joe Anderson) Information, Statistics and the Region (Al Morris) Keynote Address (Jay Hakes) Statistical Training Session (Videotapes) Statistics, Information, and GPRA (George Bonina and Judith Calem Lieberman) Local Applications of EPA Data (Ron Shafer, Henry Topper, Kimberly Nelson, N. Bouwes, and Steven M Hassur) Statistical Methods for Lab and Air Quality Data Analysis (Larry Cox, Mary Lou Thompson, Kerrie Nelson, Peter Craigmile, and Joel Reynolds) Databases The Manager's View (Phil Lindenstruth, Abraham Siegel, and Mike Mundell) Ensuring the Quality of Environmental Information (Nancy Wentworth and Malcolm Bertoni) Please check one box Highly Relevant Fairly Relevant Not Very Relevant ------- 2 Session Evaluation Continued Models and Model Assessment of Environmental Data (Mary ^ou Thompson, Rafael Ponce, Samantha Bates, A. C. Cullen, A E Raftery, Marianne Turley, E. David Ford, and Joel Reynolds) Use of the Internet for Sharing Statistics (Steve Hufford, Pat Garvey, Anne Frondorf, Chris Miller, and Bob Shepanek) Epidemiology and Risk Assessment Cumulative and/or Aggregate Risk Assessment (Ruth Allen, David Miller, Hans Allender, and Breeda Reilley) Statistical Research Issues in Quality Assurance (John Warren, Rob O'Brien, and Charles White) Data, Information and Statistics Putting it All Together for Decision-Making (Tom Curran ) Poster and Computer Sessions Data Integration and Quality. Vision for the Future (Ruth Allen, Susan Devesa, D Grauman, W Blot, G Pennello, R Hoover, and J Fraumeni ) Analysis of Cleanups (Michael J Messner and Bimal Sinha) Sampling and Design Issues in Environmental Studies ( Tony Olsen, David Marker, and Paul D Sampson) Some Analyses and Potential Analyses at EPA (Doreen Sterling, Mike Barrette, Tom DeMoss, Tom Pheiffer, and John Moses ) Measurement Issues Related to our Water Supply (Barnes Johnson, Andrew Schulman, Jennifer Wu, Benjamin Smith, Jennifer Wu, Henry Kahn, Helen L Jacobs, and Kathleen A. Stralka, and Virginia A. Cohen-Bradley) Assessing Risk (Elizabeth Margosches, Mary Marion, David Pawel, and Margaret Conomos) A Walk on the Wild Side of Statistical Communication (Woollcott Smith) Proposed Information Management Office (Robert English) The Visual Presentation of Data (Al Morris, David Mintz, and Daniel Carr) Listening to Our Information Customers (Brendan Doyle, Margaret Morgan-Hubbard, Pat Bonner, and Emma McNamara] Please check one box Highly Relevant Fairly Relevant Not Very Relevant ------- 2 Session Evaluation Continued Application of Sampling in Aquatic Resources (Henry Kahn, Silvestre Colon, Anthony R Olsen, and Barnes Johnson) Statistics and Information at EPA as we Start a New Century: Where Are We Going? (Phil Ross, Larry Cox, Karen Klima, Heather Case, and G.P. Patil) Applying Monte Carlo Simulation Techniques with S-PLUS (Steven P Millard) The Data Come In, the Data Go Out (Rick Westlund and Charlotte Cottrill ) WRAP-UP SESSION (Barry Nussbaum and William Raub) Please check one box Highly Relevant Fairly Relevant Not Very Relevant 3 What were the greatest strengths of the conference? What aspects did you like the most? 4 What were the greatest weaknesses of the conference? What aspects and sessions did you like the least? 5 Would you be interested in other training sessions that would introduce you to a new development in applied statistical methodology? Yes No Unsure Suggestions for topics 6 Are you planning to attend next year's conference on environmental statistics and Yes No Unsure 7 Other Comments ------- REGISTRANTS The 1999 EPA Conference on Environmental Statistics and Information SugarLoaf Conference Center Philadelphia, Pennsylvania May 10-13.1999 Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail RUTH ALLEN OPP/HED/CEB US EPA 703-305-7191 301-402-4279 Allen.ruth@epamail.epa.gov HANS ALLENDER US EPA 703-305-7883 703-605-0645 A llender.hansfSepamai 1 .eoa.gov JOSEPH ANDERSON US EPA 202-260-3016 LARA P. AUTRY OAR/OAQPS/EMAD US EPA 919-541-5544 919-541-1039 Aurrv.lara@eDa.gov MICHAEL BARRETTE US EPA 202-564-7019 Barrette.michael@epamail.epa.gov SAMANTHA BATES UNIV OF WASHINGTON Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail MALCOLM BERTONI RESEARCH TRIANGLE INSTITUTE 202-728-2067 202-728-2095 MJB@rti.org CINDY BETHELL US EPA GEORGE BONINA OIRM 202-260-6227 Bonma.george@epa.gov PATRICIA BONNER US EPA 202-260-0599 Bonner.patricia@epamail.epa.gov ED BRANDT CEIS/IA1AD US EPA 202-260-6217 Brandt.edward@epamail.eDa.gov LORI BRUNSMAN OPPTS/OPP/HED US EPA 703-308-2902 703-605-0645 Brunsman.lorif2).eDamail.eDa.gov ------- Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail DANIEL CARR GEORGE MASON UNIV 703-993-1671 703-993-1521 HEATHER ANNE CASE OP/CEIS US EPA 202-260-2360 202-260-4903 Case.heather@epamail.eDa.gov WENDY CLELAND-HAMNETT OP/CEIS US EPA 206-260-4030 202-260-0275 Cleland-Hamnett.wendv@epa.gov SILVESTRE COLON OFFICE OF WATER US EPA 202-260-3066 202-260-7185 Colon.silvestre@epamail.epa.gov VIRGINIA A.COLTEN-BRADLEY OSWER/EMRAD US EPA 703-308-8613 703-308-0509 Colten-bradlev virginia(5)epamail.epa.go\ MARGARET CONOMOS OPPE/CEIS US EPA 202-260-3958 202-260-4968 Conomos.margaret@epa.gov LAWRENCE COX ORD/NERL US EPA 919-541-2648 919-541-7588 Cox, larrv@epamail.epa. gov PETER CRAIGMILE UNIV OF WASHINGTON Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail DAVID CROSBY AMERICAN UNIVERISITY 202-885-3155 Dcrosbv@american .edu THOMAS CURRAN OAR/OAQPS US EPA 919-541-5694 919-541-4028 Curran.thomas@epamail.epa.gov THOMAS DEMOSS US EPA MAIA 410-305-2739 410-305-3095 Demoss.tom@epa.gov SUSAN DEVESA NATIONAL CANCER INSTITUTE NIH 301-496-8104 301-402-0081 Devesas@epndce.nci.nih.gov SUSAN DILLMAN OPPTS/OPPT/NPCD US EPA 202-260-5375 202-260-0001 Dillman.susan@epa.gov KHOAN TAN DINH US EPA 202-260-3891 202-260-1283 Dinh.khoan@epamail.epa.gov DONALD DOERFLER ORD/ERC/NHEERL US EPA 919-541-7741 Doerfler.donald@epamail.epa.pov BRENDAN DOYLE US EPA 202-260-2693 202-260-4968 Dovle.brendan@epamail.epa pnv ------- LEE ELLIS CEIS US EPA Phone 202-260-6123 Fax 202-260-4968 E-mail Ellis.lee@epamail.epa.gov ROBERT ENGLISH INFO TRANS/ORG PLANNING US EPA Phone 202-260-5995 Fax 202-260-3655 E-mail English.robert@epamail.epa.gov DAVID FARRAR OPP US EPA 703-305-5721 703-305-6309 Farrar.david@epamail.epa.gov TERENCE FITZ-SIMONS US EPA 919-541-0889 Phone Fax E-mail Phone Fax E-mail GEORGE T. FLATMAN ORD/NERL-CRD US EPA Phone 702-798-2528 Fax 702-798-2208 E-mail George flatman@epamail.epa.gov JOHN F. FOX OST US EPA Phone 202-260-9889 Fax 202-260-7185 E-mail Fox.iohn@epamail.epa.gov MARY FRANKENBERRY OPPTS/OPP/EFED US EPA Phone 703-305-5694 Fax 703-305-6309 E-mail Frankenberrv.mary@epamail.epa.gov ANNE FRONDORF US GEOLOGICAL SURVEY Phone 703-648-4205 Fax 703-648-4224 E-mail Anne_frondorf@usgs.gov Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail WILLIAM GARETZ OPPE/CEIS US EPA 202-260-2684 Garet2.william@epamail.epa.gov PATGARVEY OIRM/EIMD US EPA 202-260-3103 202-401-8390 Garvey.pat@epamail.epa.gqv SUSAN P. GEYER CEIS US EPA 202-260-6637 Gever.susan@epa.gov MELISSA GONZALES ORD/NHEERL US EPA 919-966-7549 919-966-7584 Gonzales.mel issa@epa.gov PETER GOODWIN DEAN GRADUATE SCHOOL TEMPLE UNIVERSITY BRIAN GREGORY OAR/ORIA/IED/CHB US EPA 0 202-564-9024 202-565-2038 Gregory.brian@epamail.epa.gov JAY HAKES ENERGY INFORMATION ADMINISTRATION ------- Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail STEVEN M. HASSUR OPPT US EPA 202-260-1735 202-260-0981 Hassur.steven@epamail.epa.gov RICHARD HEIBERGER TEMPLE UNIVERSITY KAREN KLIMA US EPA JAMES HEMBY OAQPS US EPA 919-541-5459 919-541-2464 Hembv.iames@epa.gov DAVID M. HOLLAND ORD/NHEERL US EPA 919-541-3126 919-541-1486 Holland.davidi@eDamail.epa.gov STEVE HUFFORD CE1S US EPA 202-260-9732 202-260-4968 Hufford.steve@epamail.epa.gov BARNES JOHNSON OSWER/OSW US EPA 703-308-8855 703-308-0511 Johnson. barnes@epamai 1 .epa. eov HENRY KAHN OW/EAD US EPA 202-260-5408 202-260-7185 Kartn.henrv@epamail.epa.gov R. CATHERINE KING US EPA OECEJ 215-814-0871 215-814-2905 King.catherine@epamail.epa.gov Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail ARTHUR T. KOINES OP/CEIS US EPA 202-260-4030 202-260-0275 Koines.arthur@epamail.epa.gov MELKOLLANDER INSTITUTE FOR SURVEY RESEARCH 202-537-6845 202-537-6873 LEE KYLE OGWOW US EPA 202-260-1154 202-401-3041 Kyle.lee@epamaii.epa.gov PEPI HERBERTLACAYO CEIS US EPA 202-260-2714 202-260-4968 Lacavo pepi@epamail.epa.gov RASHM1LAL OP/CEIS US EPA 202-260-3007 202-260-8550 Rashmi.lal@epamail.eDa.gov JADE LEE EPA OFFICE OF WATER 202-260-1996 202-260-7185 Lee.jade@epa.gov JUDY LEE WASTE/CHEM MGMT DIV 215-814-3401 215-814-3113 Lee.judy@epamail.epa.gov ------- Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail LAWRENCE LEHRMAN RMD/OIS US EPA Lehrman.lawrence@epamail.epa.gov ELEANOR LEONARD OP/CEIS US EPA 202-260-9753 703-525-3455 Elleonard@aol.com JUDITH C. LIEBERMAN OCEO US EPA 202-260-8638 202-401-1515 Lieberman.iudv@epamail.epa.gov PHILIP LINDENSTRUTH OFFICE OF WATER US EPA 0 202-260-6549 202-260-7024 Lindenstruth.phil@epamail.epa.gov CONNIE LORENZ OP/CEIS/CSAD US EPA 202-260-4660 202-260-4903 ARTHUR LUBIN OSEA US EPA 312-886-6226 312-353-0374 Lubin.arthur@epamail.epa.gov ALLAN MARCUS NCEA US EPA 919-541-0643 919-541-1818 lvlarcus.allan@epa.gov Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail ELIZABETH MARGOSCHES OPPTS/OPPT US EPA 202-260-1511 202-260-1279 Margosches@epamail.epa.gov MARY A. MARION OPPTS/OPP/HED US EPA 703-308-2854 Marion.marv@epamail.epa.gov DAVID MARKER UNTV OF WASHINGTON ETHAN MCMAHON OP/CEIS US EPA 202-260-8549 Mcmahon.ethan@epamail.epa.gov MICHAEL MESSNER OGWDW US EPA 202-260-8107 Messner.michael@epamail.epa.gov STEVEN P. MILLARD PSI 206-528-4877 206-528-4802 SmiHard@probstatinfo.com CHRISTOPHER MILLER NOAA Phone Fax E-mail DAVID MILLER OPPTS/OPP US EPA 703-305-5352 703-305-5147 Miller.davidJ@epamail.epa.gQv ------- DAVIDJVDNTZ OAR/OAQPS US EPA Phone 919-541-5224 Fax 919-541-1903 E-mail Mintz.david@eDa.gov MARGARET MORGAN -HUBBARD DIRECTOR, OFFICE OF COMMUNICATION US EPA 202-260-5965 Morgan-hubbard.margaret @epamail.epa.gov Phone Fax E-mail AL MORRIS OFFICE OF ENVIRON DATA Phone 215-814-5701 Fax 215-814-5718 E-mail morris.alvin@epa.gov REBECCA MOSER CEIS US EPA 202-260-6780 202-260-4903 Phone Fax E-mail JOHN MOSES OP/CEIS US EPA Phone 202-260-6380 Fax 202-401-7617 E-mail Moses.iohn@epamai I .epa.gov NICKNAPOLI US EPA Phone 215-816-2621 Fax 215-814-2783 E-mail Napoli.nick@epamail.epa.gov MALIHA S. NASH ORD/NERL-CRD US EPA Phone 702-798-2528 Fax 702-798-2208 E-mail Nash.maliha@epamail.epa.gov Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail KIMBERLY NELSON PA DEPT OF ENVIR PROTECTION 717-787-3534 717-783-8926 Nelson.kimberly@dep.state.pa.us BARRY NUSSBAUM OPPE/CEIS US EPA 202-260-1493 202-460-4968 Nussbaum.barrv@.epamail.epa.gov ROB O'BRIEN BATTELLE 509-375-6769 509-375-2604 Robert.obrien@pnl.gov ANTHONY R. OLSEN USEPANHEERL 541-754-4790 541-754-4716 Tolsen@mail.cor.epa.gov G. P. PATIL PENNSYLVANIA ST UNIV 814-865-9442 814-863-7114 Gpp@stat.psu.edu ROBERT M. PATTERSON COLLEGE OF ENGINEERING TEMPLE UNIVERITY 215-204-1665 215-204-6936 rpatterson@thunder.temple.edu DAVID PAWEL ORIA US EPA 202-564-9202 PETER PETRAITIS UNIVERISTY OF PENNSYLVANIA ------- Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail ANNE POLIT1S CEIS/IAIAD US EPA 202-260-5345 202-260-4903 Polids.anne@epamail.epa.gov RAFAEL PONCE UNIV OF WASHINGTON WILLIAM F. RAUB DEPUTY ASST SECTY SCI POLICY DEPT HEALTH & HUMAN SERV BREEDA REILLY CEPPO US EPA 202-260-0716 Reilly breeda@epamail.eDa.gov JOSEPH RETZER OP US EPA 202-260-2472 Retzer.ioseph@epamail.epa.gov JOEL REYNOLDS UNIV OF WASHINGTON EDNA RODRIGUEZ OP/CEIS/CSAD US EPA 202-260-3301 202-260-4903 Rodriguez.edna@epamail.epa.gov N. PHILLIP ROSS CEIS US EPA 202-260-5244 202-260-8550 Ross.Nphillip@eDamail.eDa.gov Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail KRISTEN RYDING OEA US EPA 206-553-6918 PAUL SAMPSON UNIV OF WASHINGTON DINA SCHREINEMACHERS EBB/HSD/NHEERL/ORD US EPA 919-966-5875 919-966-7584 Schreinemacners.dina@epamail.epa.pov ANDREW SCHULMAN OGWDW/SRMD/TAB US EPA 202-260-4197 202-260-3762 Schubnan.andrew@epamail.epa.gov RONALD SHAFER OP/CEIS US EPA 202-260-6766 202-260-4968 Shafer.ronald@epamail.epa.flov BOB SHEPANEK ORD/NCEA US EPA 202-564-3348 202-565-0061 Shepanek.robert@epaman epa p, CAROLYN SHETTLE INSTITUTE FOR SURVEY RESEARCH 202-537-6793 202-537-6873 cshettle@ioip.com ABRAHAM SIEGEL OW/OGWDW US EPA 202-260-2804 Siegel.abraham@epamail.epa.gov ------- Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail Phone Fax E-mail BIMAL SINHA OPPE/CEIS US EPA 202-260-2681 Sinha.bimal@eDaniail.epa.gov BENJAMIN SMITH US EPA 202-260-3026 202-260-3762 Smith.ben@epamail.eDa.eov WILLIAM P. SMITH OPPE/CEIS US EPA 202-260-2697 202-260-4968 Sm ith. wi I l@epamai I .epa.gov WOOLCOTT SMITH TEMPLE UNIVERSITY MINDI SNOPARSKY HYDROGEOLOGY EPA 215-814-3316 Snoparsky .m ind i@epam ial .epa.gov JOHN A. SORRENTINO TEMPLE UNIVERSITY 215-204-8164 Sorrento@astro.ocis.temple.edu DOREEN STERLING CEIS US EPA 202-260-2766 202-260-8550 Sterling.doreen@epamail.epa.gov WILLIAM TASK VICE-PROVOST TEMPLE UNIVERSITY Phone Fax E-mail MARY LOU THOMPSON UNIV OF WASHINGTON Phone 206-616-2723 Fax 206-616-2724 E-mail Mlt@biostat.washington.edu HENRY TOPPER OPPT US EPA Phone 202-260-6750 Fax 202-260-2217 E-mail topper.henrv@epa.gov MARIANNE TURLEY UNTV OF WASHINGTON Phone 206-616-9288 Fax 206-616-9443 E-mail Marianne@cqs.washington.edu DIANNE WALKER US EPA REGION III Phone 215-814-3297 Fax 215-814-2134 E-mail Walker.dianne@epamail.epa.gov JOHN WARREN ORD/NCERQA/QAD US EPA Phone 202-260-9464 Fax 202-401-7922 E-mail Warren.iohn@epamail.epa.eov NANCY WENTWORTH ORD/NCERQA/QAD US EPA Phone 202-564-6830 Fax 202-565-2441 E-mail Wentworth.nancy@epainail.epa.gov ELLEN WERNER INSTITUTE FOR SURVEY RESEARCH Phone 202-537-6735 Fax 202-537-6873 E-mail Ewerner@ioip.com RICK WESTLUND US EPA OFFICE OF POLICY Phone 202-260-2745 Fax 202-260-9322 E-mail Westlund.rick@epa.gov ------- CHARLES WHITE OW/OST/EAD US EPA 0 Phone 202-260-5411 Fax 202-260-7185 E-mail White.chuck@epamail.epa.gov NATHAN WILKES OFFICE OF POLICY US EPA Phone 202-260-4910 Fax 202-260-4903 E-mail Wilkes.nathan.epa.gov JENNIFER WU OW/OGWDW/SBMD US EPA Phone 202-260-0425 Fax 202-260-3762 E-mail Wu.iennifer@epamail.epa.gov ------- |