Next Generation Risk Assessment: Incorporation of Recent Advances in Molecular, Computational, and Systems Biology(External Peer Review Draft


                                EPA/600/R-13/214A | September 2013 | www.epa.go/ncea
United States
Environmental Protection
Agency
    Next Generation Risk Assessment:
    Incorporation of Recent Advances in Molecular, Computational, and Systems Biology
             External  Review Draft
   National Center for Environmental Assessment
   Office of Research and Development, Washington, DC 20460

-------
&EPA
United States
Environmental Protection
Agency
                                        September 2013
                                     EPA/600/R-13/214A
    Next Generation Risk Assessment:

   Incorporation of Recent Advances in
         Molecular, Computational,
            and Systems Biology
                [External Review Draft]
             National Center for Environmental Assessment

               Office of Research and Development

               U.S. Environmental Protection Agency

                  Washington, DC 20460

-------
      :•'" '  W

This document is an external review draft This information is distributed solely for the purpose of
pre-dissemination peer review under applicable information quality guidelines. It has not been
formally disseminated by EPA. It does not represent and should not be construed to represent any
Agency determination or policy. It is being circulated for review of its technical accuracy and
science policy implications. Mention of trade names or commercial products does not constitute
endorsement or recommendation for use.
       This document is a draft for review purposes only and does not constitute Agency policy. DRAFT - Do Not Cite or Quote.
September 2013                                  ii

-------
Next Generation Risk Assessment: i
Disclaimer ii
Contents iii
Acronyms and Abbreviations v
Acknowledgments vii
Executive Summary x
1. Introduction 1
2. Preparation for Prototype Development 6
2.1. Consideration of Decision Context 6
2.2. A Framework 8
2.3. Science Community and Stakeholder Engagement 10
2.3.1. Expert Workshop 10
2.3.2. Stakeholder Involvement 11
2.4. Recurring Issues in Risk Assessment 12
3. The Prototypes 13
3.1. Tier 3: Major Scope Assessments 14
3.1.1. Benzene-Induced Leukemia 16
3.1.2. Ozone-induced Lung Inflammation and Injury 26
3.1.3. Benzo[a]pyrene (a Polycyclic Aromatic Hydrocarbon), and Cancer 33
3.1.4. Risk Assessment Implications across the Tier 3 Prototypes 43
3.2. Tier 2: Limited Scope Assessments 45
3.2.1. Knowledge Mining - Diabetes/Obesity 47
3.2.2. Short-Term In Vivo Models - Alternative Species 54
3.2.3. Short-Term In Vivo Models - Mammalian Species 61
3.3. Tier 1: Screening and Prioritization 64
3.3.1. QSAR and High-Throughput Virtual Molecular Docking (HTVMD) 67
3.3.2. High-Throughput and High-Content Assays 68
3.3.3. Toxicokinetics 69
3.3.4. High-Throughput Exposure Estimation: ExpoCast Prioritizations 69
3.3.5. Virtual Tissue (VT) Modeling 70
3.3.6. Example: Thyroid Pathway Disrupting Chemicals and High-Throughput Systems...71
4. Advanced Approaches to Recurring Issues in Risk Assessment 73
4.1. Human Variability 73
4.1.1. Genomic Variability 75
This document is a draft for review purposes only and does not constitute Agency policy. DRAFT - Do Not Cite or Quote.
September 2013 iii

-------
    4.1.2.     Early-Life Exposures	78
    4.1.3.     Mixtures and Nonchemical Stressors	79
  4.2.      Inter-Species Extrapolation	80
  4.3.      Low Dose-Response Modeling	81
5.      Lessons Learned from Developing the Prototypes	84
  5.1.      New Methods	84
  5.2.      5.2. Implications for Risk Assessment Derived from Prototypes	86
  5.3.      Summary	90
6.      Conclusions	92
  6.1.      Challenges	92
  6.2.      Next Steps	93
7.      References	95
Appendix A. Technical Papers Supporting the NexGen Report	A-l
Appendix B. Glossary	B-l
       This document is a draft for review purposes only and does not constitute Agency policy. DRAFT - Do Not Cite or Quote.
September 2013                                 iv

-------
Acronyms and Abbreviations
 Acronym Abbreviation     Stands For
 AC50
 AER
 AhR
 AML
 AOP
 B[a]P
 BMD
 CDC
 Css
 CTD
 DEC
 DNA
 EPA
 EWAS
 GEO
 GWAS
 HCS
 HPT
 HT
 HTS
 HTVMD
 IC50
 ICio
 IVIVE
 KB
 LEG
 MIE
 MOA
 mRNA
 NCEA
 NexGen
 NHANES
 NRC
concentration at 50% of maximum activity
activity-to-exposure ratio
aryl hydrocarbon receptor
acute myeloid leukemia
adverse outcome pathway
benzo[a]pyrene
benchmark dose
Centers for Disease Control and Prevention
concentration, steady state (in blood)
Comparative Toxicogenomic Database
differentially expressed gene
deoxyribonucleic acid
U.S. Environmental Protection Agency
environment-wide association studies
Gene Expression Omnibus
genome-wide association studies
high-content screening
hypothalamus-pituitary-thyroid
high-throughput
high-throughput screening
high-throughput virtual molecular docking
concentration producing a 50% inhibition of response
concentration producing a 10% inhibition of response
in vitro to in vivo extrapolation
Knowledgebase
lowest  effective concentration
molecular initiating event
mode of action
messenger ribonucleic acid
National Center for Environmental Assessment (EPA)
Next Generation Risk Assessment
National Health and Nutrition Examination Survey
National Research  Council
        This document is a draft for review purposes only and does not constitute Agency policy. DRAFT - Do Not Cite or Quote.
September 2013                                    v

-------
 Acronym Abbreviation      Stands For
 NTP
 OECD
 PAH
 PK
 POD
 ppb
 ppm
 QSAR
 RNA
 ROS
 SNP
 SOAR
 Tox21
 VARIMED
 VT
National Toxicology Program
Organization of Economic Co-operation and Development
polycyclic aromatic hydrocarbon
pharmacokinetic
point of departure
part per billion
part per million
quantitative structure-activity relationship
ribonucleic acid
reactive oxygen species
single nucleotide polymorphism
Systematic Omics Analysis Review
Toxicology  in the 21st Century
VARiants Informing MEDicine
virtual tissue
        This document is a draft for review purposes only and does not constitute Agency policy. DRAFT - Do Not Cite or Quote.
September 2013                                     vi

-------
This document reflects contributions of many individuals whom we gratefully acknowledge.
Without the generous contributions and collaborations of these scientists this report would not
have been possible. Appendix A lists technical papers related to this report and provide an more
complete list of contributors. We would also like to acknowledge Ms. Rebecca Clark, Dr. Kenneth
Olden, and Ms. Debra Walsh for their continued support of this project
Ila Cote, Lyle Burgoon, Robert DeWoskin—EPA Office of Research and Development
Executive Summary
Elaine Cohen Hubal, Rob DeWoskin, Lyle Burgoon, Ila Cote

Introduction, Preparation for Prototype Development and Consideration of Decision Context
Ila Cote, Paul Anastas, Stan Barone, Linda Birnbaum, Becki Clark, Kathleen Deener, David Dix,
Stephen Edwards, and Peter Preuss

A Framework
Daniel Krewski, Margit Westphal, Greg Paoli, Maxine Croteau, Mustafa Al-Zoughool, Mel Andersen,
Weihsueh Chiu, Lyle Burgoon, and Ila Cote

Science Community and Stakeholder Engagement
Kim Osborn, Gerald Poje, Ron White

Recurring Issues in Risk Assessment
Daniel Krewski, Melvin Andersen, Kim Boekelheide, Frederic Bois, Lyle Burgoon, Weihsueh Chiu,
Michael DeVito, Hisham El-Masri, Lynn Flowers, Michael Goldsmith, Derek Knight,
Thomas Knudsen, William Lefew, Greg Paoli, Edward Perkins, Ivan Rusyn, Cecilia Tan,
Linda Teuschler, Russell Thomas, Maurice Whelan, Timothy Zacharewski, Lauren Zeise, and
Ila Cote

The Prototypes

Major Scope Assessments- Benzene-Induced Leukemia
Ila Cote, Reuben Thomas, Alan Hubbard, Cliona McHale, Luoping Zhang, Stephen Rappaport,
Qing Lan, Nathaniel Rothman, Jennifer Jinot, Babasaheb Sonawane, Martyn Smith and
Kathryn Guyton

Major Scope Assessments - Ozone-induced Lung Inflammation and Injury
Robert Devlin, Kelly Duncan, James Crooks, David Miller, Lyle Burgoon, Michael Schmitt,
Stephen Edwards, Shaun McCullough, and David Diaz-Sanchez

Major Scope Assessments Benzo[a]pyrene, Polycyclic Aromatic Hydrocarbons, and Cancer
Lyle Burgoon and Emma McConnell
       This document is a draft for review purposes only and does not constitute Agency policy. DRAFT - Do Not Cite or Quote.
September 2013                               vii

-------
Risk Assessment Implications across the Tier 3 Prototypes
Lyle Burgoon, Rob DeWoskin and Ila Cote
Tier 2: Limited Scope Assessments - Knowledge Mining - Diabetes/Obesity
Lyle Burgoon, Shannon Bell, Chirag Patel, Kristine Thayer, Stephen Edwards

Tier 2: Limited Scope Assessments Short-Term In Vivo Models - Alternative Species
Edward Perkins, Gerald Ankley, Stephanie Padilla,  Dan Peterson, Daniel Villeneuve
Tier 2: Limited Scope Assessments Short-Term In Vivo Models - Mammalian Species
Michael DeVito, Jason Lambert, Scott Wesselkamper, Russell Thomas
Tier 1 QSAR and High-Throughput Virtual Molecular Docking (HTVMD)
Rob DeWoskin, Nina Wang, Jay Zhao, Scott Wesselkamper, Jason Lambert, Dan Petersen,
Lyle Burgoon
Tier 1: High-Throughput and High-Content Assays, Toxicokinetics, High-Throughput
Exposure Estimation, Virtual Tissue (VT) Modeling, Thyroid Example
Kevin Crofton and Richard Judson
Advanced Approaches to Issues in Risk Assessment

Human Variability including Genomic Variability
Lauren Zeise, Frederic Bois, Weihsueh Chiu, Ila Cote, Dale Hattis, Ivan Rusyn, Kathryn Guyton, and
Lyle Burgoon

Early Life Exposures
Ila Cote, adapted from a paper by Boekelheide et al. 2012
Mixtures and Nonchemical Stressors
Timothy Zacharewski, Ila Cote, Linda Teuschler, and Lyle Burgoon
Inter-Species Extrapolation
Lyle Burgoon, Ila Cote, and Edward Perkins
Low Dose-Response Modeling
Weihsueh Chiu, Dan Krewski and Lyle Burgoon
Conclusions
Lessons Learned from Developing the Prototypes
Ila Cote, Rob DeWoskin, Lynn Flowers, John Vandenberg, Douglas-Crawford, Brown, Lyle Burgoon
Challenges and Next Steps
Elaine Cohn Hubel, Tina Bahadori, Ila Cote
ICF              Kim Osborn, Heather Dantzker, Jessica Wignall, William Mendez, Bruce Fowler,
Codi Sharp, Deshira Wallace, and Pam Ross
       This document is a draft for review purposes only and does not constitute Agency policy. DRAFT - Do Not Cite or Quote.
September 2013                              viii

-------
                                               and
U.S. Army Corps of Engineers: Edward Perkins and Anita Meyer
U.S. Department of Defense, Office of the Secretary of Defense: Robert Boyd
California EPA: George Alexeeff, Martha Sandy, Lauren Zeise
Centers for Disease Control, National Center for Environmental Health, and the Agency for Toxic
   Substances and Disease Registry: Chris Portier (retired), Thomas Sinks, and Bruce
   Fowler (retired)
European Chemicals Agency: Derek Knight
European Joint Research Commission: Maurice Whelan
FDA's National Center Toxicological Research: Donna Mendrick and William Slikker
Health Canada: Carol Yauk
National Institutes of Health, National Center for Advancing Translational Science:
   Menghang Xia
NIH National Institute of Environmental Health Sciences:
   Linda Birnbaum, Scott Auerbach, John Balbus, Michael DeVito, Elizabeth Maull, Kristine Thayer,
   and Ray Tice
National Institute  for Occupational Safety and Health: Christine Sofge, Paul Schulte, Ainsley Weston
Office of Research and Development:
   Michael Broder, David Bussard, Vincent Cogliano, Rory Conolly Dan Costa, Sally Darney,
   Elizabeth Erwin, Susan Euling. Annette Gatchett, Gary Hatch, Annie Jarabek, Robert Kavlock,
   Channa Keshava, Monica Linnenbrink, Matt Martin, Connie Meacham, Shaun McCullough.
   David Miller, Kenneth Olden, David Reif, James Samet, Rita Schoney, Imran Shah, Deborah Segal,
   Woodrow Setzer, Michael Slimak, Rong-Lin Wang, Debra Walsh, John Wambaugh, Paul White
Office of Air and Radiation: Souad Benromdhane, Bryan Hubbell, Kelly Rimer, Carl Mazza,
   Dierdre Murphy, Susan Stone and Lydia Wegman (retired)
•  Office of Chemical Safety and Pollution Prevention: Stan Barone, Vicki Dellarco, Steven Knott,
   Mary Manibusan, Jennifer McLain, Jeff Morris, Anita Pease, Laura Parsons, and Jennifer Seed
Office of Superfund and Emergency Response: Michele  Burgess, Rebecca Clark, Helen Dawson,
   Stiven Foster, and Kathleen Raffaele
Office of Water: Cynthia Dougherty, Elizabeth Doyle, and Elizabeth Southerland
Office of Children's Health Protection: Michael Firestone, Brenda Foos
Office of Environmental Justice: Charles Lee
Regional Liaisons: Carole Braverman, Bruce Duncan
Ken Ramos, University of Louisville
Peter McClure, Heather Carlson-Lynch, and Julie Stickney, SRC
Catherine Blake, University of Illinois
William Pennie and Karen Leach, Pfizer
       This document is a draft for review purposes only and does not constitute Agency policy. DRAFT - Do Not Cite or Quote.
September 2013                                ix

-------
       • =

 1    The Next Generation (NexGen) of Risk Assessment program was initiated in 2010 as a multiyear,
 2    multi-organization  effort  to  consider new  molecular,  computational,  and  systems  biology
 3    approaches for use in risk assessments. The goal is to enable faster, less expensive, and more robust
 4    assessments for chemicals and other stressors that might adversely affect public health and the
 5    environment. Although this report is focused on human disease, the approaches described here are
 6    equally applicable to environmental risks. Specific aims of this initial phase of the NexGen program
 7    are to (1) demonstrate proof-of-concept that the data and methods from recent advances in biology
 8    can better inform risk assessment; (2) identify which of the information resources and practices are
 9    most useful for particular purposes (value of information); (3) articulate decision considerations
10    for use of new types of data and methods to inform risk assessment; and (4) identify important data
11    gaps.

12    To  achieve  the  above, prototypes  or  case  studies  were  designed to  (1)  implement the
13    recommendations from workshops and experts on approaches to identifying and evaluating the
14    available data in molecular,  computational, and systems biology for  use in  risk assessment;
15    (2) provide  risk  assessors,  risk  managers,  and the general  public  with  clear  examples
16    demonstrating how  new  data and  advanced  methods might support  specific types  of risk
17    assessments; and (3) elicit interest,  discussion, and  participation from stakeholders to further
18    improve risk assessments.

19    The assessment prototypes are broadly categorized into three groups based on the  assessment's
20    "fitness for intended use" given the decision context. Primary drivers of the decision context are the
21    number of chemicals that must be addressed and the confidence needed in the scientific data to
22    support a specific type of decision. The three  categories or tiers have  been defined as follows:
23    Tier 3—major  scope decision-making (considerable data indicating high  hazard or widespread
24    exposures);  Tier  2—limited  decision-making  (limited exposure potential or limited hazard
25    potential or data); and Tier 1—prioritization and screening (very little or no traditional data for
26    chemicals known to be  in  commerce). Although the prototypes were  designed for illustrative
27    purposes to address these three types of decision context, the supporting data and methods can be
28    deployed across  all  categories as available and as needed, and are arrayed as a continuum of
29    approaches. Ideally, multiple data streams are brought to bear on consideration of potential risks.

30    The prototypes illustrate types of data and methods that are  likely to be used in the near future, but
31    are not intended to be exhaustive reviews. The primary intent of the first set of chemicals (Tier 3
32    prototypes) is  to verify if and how new data types and approaches could be used to inform risk
33    assessment by comparison to robust traditional "known" risk, thus verifying new approaches. The
34    intent of the Tier 2 prototypes is to (1) explore new  types of computational analyses  and short-
35    duration in vivo  bioassays  that are relatively  uncommon  in risk assessment  but  appear very
36    promising for the near future; and (2) develop an assessment approach well suited to limited scope
37    risk management decisions. In this  case, limited  generally means regional to  local exposure
38    potential, or limited hazard potential, or limited data to conduct more detailed assessments. The
39    Tier 2 efforts  fall between Tier 3 and Tier 1  in terms of resources  required and amount of
40    uncertainty in the assessment results. The Tier 1 prototypes  explore  entirely high-throughput
41    approaches that  could be applied to  thousands or tens of thousands of chemicals,  are the least
42    resource intensive, and are likely to have the greatest uncertainty.
             This document is a draft for review purposes only and does not constitute Agency policy. DRAFT - Do Not Cite or Quote.
      September 2013                                x

-------
 1    The following eight chemicals or chemical classes and their associated effects were chosen for
 2    prototype development:

 3      •   Tier 3:
 4               o  Benzene and leukemia (molecular epidemiology),
 5               o  Ozone and lung inflammation and injury (molecular clinical studies), and
 6               o  Benzo[a]pyrene (B[a]P)/polycyclic aromatic hydrocarbons (PAHs) and liver cancer
 7                  (molecular clinical studies meta-analyses and in vivo rodent bioassay).
 8      •   Tier 2:
 9               o  Chemicals associated with diabetes and obesity ("big data" knowledge mining),
10               o  Chemicals associated  with thyroid hormone disruption  (short duration  in vivo
11                  exposure bioassays-alternative species), and
12               o  Chemicals associated  with cancer  (short duration in vivo exposure bioassays-
13                  mammalian).
14      •   Tier 1:
15               o  Chemicals  associated  with   cancer   and  noncancer  disorders   especially
16                  developmental (QSAR) and
17               o  Chemicals associated with thyroid hormone  disruption (high throughput in vitro
18                  assays).
19    Highlighted  methods  include  molecular  clinical  and epidemiologic  studies,  in vivo  molecular
20    nonhuman  studies, high-content  in vivo  assays  (mammalian  and  nonmammalian species),
21    bioinformatics, data mining, high-throughput in vitro screening assays, and quantitative structure
22    activity modeling. NexGen methods and results  were compared to robust traditional data set
23    results.

24    Both bottom-up and top-down perspectives were used to evaluate the available data. The top-down
25    approach focuses on  higher  system-level indicators  of disease resulting from environmental
26    exposures to known chemicals based on data from human clinical and epidemiologic  studies. The
27    bottom-up approach focuses on information describing chemically induced alterations in molecular
28    and cellular components, as well as their network interactions. These data support the capability to
29    develop risk assessments for chemicals with little  or no traditional data.  Additionally, these data
30    can further inform assessments based on traditional data.

31    Data and insights from both bottom-up and top-down approaches are integrated to inform
32    understanding of potential health risks associated with chemical exposures. The following
33    summarizes lessons learned from development of the prototypes, as well as challenges for
34    incorporating novel data streams to inform risk assessment:

35      •   Advances  in genomics, epigenomics, transcriptomics, metabolomics, and  cell and systems
36         biology, together with advanced analytical  methods  in biostatistics, bioinformatics, and
37         computational biology, have the potential to increase dramatically understanding  of the
3 8         molecular basis of disease and environmental factors that alter disease risks.
39      •   Of particular importance are the many new tools that facilitate testing and evaluation, on an
40         unprecedented scale,  of chemicals with limited or no  traditional data. ToxCast™ and
41         Toxicology in the 21st Century (Tox21) Programs provide examples.

             This document is a draft for review purposes only and does not constitute Agency policy. DRAFT - Do Not Cite or Quote.
      September 2013                                 xi

-------
1 • New approaches can be used to identify biological patterns or signatures that are associated
2 with specific diseases, thus facilitating grouping and evaluating chemicals based on the
3 mechanistic underpinnings of specific diseases. The Comparative Toxicogenomic Database
4 provides a partial example.
5 • These signatures are best developed and understood as they relate to apical outcomes using
6 systems biology. Conceptualization of these relationships among early molecular events,
7 intermediate events, and apical outcomes are often termed mode of action or "adverse
8 outcome pathways."1
9 • Signatures appear exposure-dose dependent (i.e., the magnitude of response changes with
10 changes in exposure-dose) and hence, might be used to prioritize chemicals based on relative
11 potencies, to serve as biomarkers of exposure and effect, and to inform quantitative risk
12 assessment Biological processes also are often time-dependent, which can complicate
13 interpretation.
14 • The links between molecular perturbations and disease outcomes are influenced by a number
15 of variables, that is, metabolism, cell type, genomic variants, cell and tissue interactions, and
16 species. Thus, some test systems might better predict the potency of a chemical to disrupt
17 normal biology than predict the specific adverse outcome resulting from that disruption.
18 • Historically, many controversial risk assessment issues lack data for substantive progress in
19 understanding. NexGen approaches can provide new data types to improve the
20 characterization of human variability and susceptibility, cross-species relevance, and low
21 exposure-dose-response relationships via understanding mechanistic commonalities and
22 differences. These issues are discussed in this report.

23 The prototype results presented in this report demonstrate proof-of-concept for an integrated
24 approach to risk assessment based on molecular, computational, and systems biology. In addition,
25 they explore which types of information appear most valuable for specific purposes and articulate
26 some decision considerations for use of data. Based on lessons learned from this effort, near-term
2 7 and longer term implications for risk assessment are also discussed.

28 Further advances in methods and knowledge undoubtedly will occur over the near term. Logistical
29 and methodological challenges in interpreting and using newer data and methods in risk
30 assessment, however, remain significant Hence, incorporating new information into risk
31 assessment will remain an ongoing opportunity.
*An adverse outcome pathway has been defined as the mechanistic or predictive relationship between initial
chemical-biological interactions (i.e., molecular initiating event[s]; [MIE]) and subsequent perturbations to
cellular functions sufficient to elicit disruptions at higher levels of organization, culminating in an adverse
phenotypic outcome in an individual and population relevant to risk assessment (e.g., disease progression or
organ dysfunction in humans) (Ankley, G. T. et al. 2010). Although commonly used, the term is something of a
misnomer; pathways are not intrinsically adverse or nonadverse but rather pathways when perturbed in
specific ways can lead to adverse outcomes. The same can be said of the commonly use term "toxicity"
pathways.
This document is a draft for review purposes only and does not constitute Agency policy. DRAFT - Do Not Cite or Quote.
September 2013 xii

-------
                                        [page intentionally left blank]
         This document is a draft for review purposes only and does not constitute Agency policy. DRAFT - Do Not Cite or Quote.
September 2013                                        xiii

-------
1. Introduction
Box 1. Next Generation Risk Assessment (NexGen)
This report describes the NexGen program, a multiyear, multi-
organization effort to develop and evaluate new molecular,
computational, and systems biology informed approaches to
risk assessment. The goal of this effort is to advance risk
assessment by facilitating faster, less expensive, and more
robust assessments of public health risks by EPA's Office of
Research and Development. The specific aims of the program
demonstrate proof of concept that recent advances in
biology can better inform risk assessment;
understand what information is most useful for particular
purposes (value of information);
articulate decision considerations for use of new types of
data and methods to inform risk assessment; and
identify important data gaps.
1 In recent years, public concern has grown
2 about the number of chemicals in the
3 environment and the ability to assess the
4 risk to human health from potential
5 exposures. Efforts by government agencies,
6 including the U.S. Environmental
7 Protection Agency (EPA), to protect public
8 health from unreasonable chemical
9 exposure have been hindered by
10 limitations in current chemical testing
11 methods and data. The European
12 Commission has underscored this concern
13 with recent initiatives to identify the many
14 thousands of largely untested chemicals in
15 use today and to increase the available
16 toxicity information on those chemicals relative to the amount of chemical used and the potential
17 for exposure in the environment (ECHA 2013a). As a result, significant efforts are underway
18 throughout the world to redesign toxicity testing and understand how advances in biology,
19 biotechnology, and computational science during the past two decades can be used in risk
20 assessment. Specific goals are to increase dramatically our ability to test and assess chemicals more
21 rapidly, understand disease processes and relationships to environmental factors, and facilitate the
2 2 process from data acquisition to data analysis.

2 3 The technologies that have emerged from the sequencing of the human genome have ushered in a
24 new era in biology (Collins, FS 2010) that supports the above goals. Advances in genomics,
25 epigenomics, transcriptomics, metabolomics, proteomics, and cell and systems biology,2 together
26 with advanced analytical methods in biostatistics, bioinformatics, and computational biology, have
27 dramatically increased our understanding of the molecular basis of disease—what causes disease
28 and what exacerbates and ameliorates our risk of disease. Molecular signatures and other
29 biomarkers are helping identify and define disease states and responses, and thousands of
30 variations in previously unknown human health risk factors are being identified.

31 Researchers are generating massive amounts of biological data from the new "omics" technologies.
32 Approximately 1.8 zettabytes (1021) of new data are generated every year, roughly doubling the
33 world's information resource every 2 years (Dearry 2013). More than 50,000 "genomics" papers
34 are published each year (NCBI 2013). Large, publicly available data sets now support analyses of
35 environmental health data on an unprecedented scale, driving further discovery of new knowledge
36 (Dearry 2013, Abecasis etal. 2012, ENCODE Project Consortium 2012, Mechanic etal. 2012, Wang, I
37 et al. 2012, Collins, MA 2009, Thomas, RS et al. 2009, Ramasamy et al. 2008). Concomitantly,
38 powerful data mining, statistical, and bioinformatics methods have been developed to identify,
2Systems biology is defined as a "scientific approach that combines the principles of engineering,
mathematics, physics, and computer science with extensive experimental data to develop a quantitative as
well as a deep conceptual understanding of biological phenomena, permitting prediction and accurate
simulation of complex (emergent) biological behaviors" (Wanjek 2013). See Wanjek's (2013) Web article
Systems as Biology as Defined by NIH for more discussion of systems biology.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 1

-------
1
2
3
4
5
6
7
8
9

10
11
12
13
14
15
16
17
18
19

20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
prioritize, and classify biomarkers with high discriminatory ability (Fang et al. 2012), and to store
and manage the information in database libraries, including the Gene Expression Omnibus (GEO)
(NCBI 2012a), the Kyoto Encyclopedia of Genes and Genomes (KEGG) (Kanehisa Laboratories
2013), the Comparative Toxicogenomic Database (CTD) (NIEHS 2013), and the Epigenomics
Database (Chadwick 2012, NCBI 2009). As integration across differing types of data and levels of
biological organization occurs (Birney 2012, ENCODE Project Consortium 2012), the degrees to
which environmental risk assessment will be transformed and our understanding of disease at the
individual and population level will be advanced are anticipated to be significant (Bhattacharya et
al. 2011, Chiuetal. 2010).

Scientific discovery is now moving away from the traditional approach of individual scientists'
conducting experiments in their laboratories to pooling of data into publicly available databases
and broad collaborative participation in problem solving (Dearry 2013, Friend 2013, Derry et al.
2012). The magnitude of changes was highlighted in
remarks by Frances Collins (Director of National
Institutes of Health [NIH]) who stated that within the
near future, most people in the United States will have a
genome scan in their medical records as a tool for
diagnosis, prognosis, and treatment of disease (Collins,
FS2010).

The impact of recent scientific advances on our ability
to conduct risk assessments and protect public health
cannot be overestimated. Particularly relevant to
environmental risk assessment is that new data types
and methods will result in much more rapid evaluation
of chemicals, increase identification of causal
mechanisms of disease, and provide a more profound
understanding of the interrelated roles of genetics,
epigenetics, and environmental factors. Experiments
already can be conducted much more rapidly and
efficiently using robotics and in vitro assays to measure
molecular functions. Two examples are (1) Toxicology
in the 21st Century (Tox21) testing of 10,000 chemicals
Figure 1. Toxicology in the 21st Century (Tox21)
robot conducts bioassays on 10,000 chemicals.
A robot arm (foreground) retrieves assay plates
from incubators and places them at compound
transfer stations or hands them off to another
robot arm (background) that services liquid
dispensers or plate readers. Photo by Maggie
Bartlett (NHGRI 2012).
within 3 years using approximately 150 assays (Figure
1) (Tice et al. 2013), and (2) the study of gene- and environment-wide associations with disease in
tens of thousands of humans with multiple diseases—both unimaginable feats 15 years ago (Friend
2013, Mechanic et al. 2012). With the burgeoning amounts of data produced by high-volume testing
and discovery, an effort in the European Union called "Safety Evaluation Ultimately Replacing
Animal Testing," SEURAT-1 (http://www.seurat-1 .eu/) has begun to develop a conceptual
framework that can be used as a basis to combine information derived from predictive tools to
support a safety assessment process. The overarching SEURAT-1 research strategy is to adopt a
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 2

-------
 1    toxicological mode-of-action3 approach to describe how any substance might adversely affect
 2    human health, and to use this knowledge to develop complementary theoretical, computational,
 3    and experimental [in vitro] models.

 4    In collaboration with its partners (see Acknowledgments), EPA initiated the NexGen program in
 5    2010 to evaluate the use of these recent advances in biological and computational sciences for risk
 6    assessment (Text Box 1). We initially conducted workshops and solicited expert opinion to develop
 7    a framework and suggestions for prototype assessments that address the needs of the public and
 8    the risk assessment community. Federal, state, and other partners participated in the workshops
 9    and continue to provide advice, data, and review for NexGen reports. Text Box 2 lists related,
10    ongoing legislation and government research activities in Europe and the United States.
      3"Mode-of-action" is one term used to reference a mechanistic understanding of the impact of a chemical on
      human health. Other terms include "disease signature" and "network perturbations" from epidemiology for
      example, while lexicologists might reference the same concept using the terms "toxicity pathway," "mode-of-
      action," or "adverse outcome pathway." In general, this report uses  the term "mechanism of action," in
      accordance  with  the  National Research Council  (NRC) report, Science  and Decisions: Advancing  Risk
      Assessment (2009); however, the exact term used in a specific section of this report is based on the references
      used and the context of the discussion.
                 This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
      September 2013                                  3

-------
Box 2. Current Legislation and Governmental Research Activities in Europe and the United States

EUROPEAN LEGISLATION AND ACTIVITIES
In response to environmental concerns, a desire for increased assessment efficiencies, and a desire to reduce reliance on
in vivo animal testing, the European Union (EU) enacted an expansive new program called Registration, Evaluation,
Authorisation and Restriction of Chemicals (REACH) in June 2007. This legislation places greater responsibility on
industry to test and manage the risks posed by their chemicals. Under REACH, companies must develop detailed
technical dossiers and chemical safety reports and submit them to the European Chemicals Agency (ECHA).
Approximately 12,000 chemicals have been registered for consideration with ECHA. Many more chemicals are
anticipated in the near future. Additionally, the 7th Amendment to the EU Cosmetics Directive prohibits putting animal-
tested cosmetics on the market in Europe after 2013. Although current alternative methods more closely resemble
traditional methods, the EU has invested 50M Euros in a research program to further next-generation methods (OECD
2012). Current ECHA guidance is available on the use of quantitative structure-activity relationships (QSARs), in vitro
assays, and read-across (also known as near-analog structure-activity relationships) to support assessments.
REACH and the 7th amendment will significantly impact nearly all multinational companies and are important drivers for
the development and use of new molecular-based methodologies. Europe's chemical trade accounts for about 40% of
the global market, involving 27 countries and almost half a billion people.
The Joint Research Centre (JRC) is the scientific and technical arm of the European Commission. It provides scientific
advice and technical support to EU policies. The JRC has seven scientific institutes (featuring laboratories and research
facilities) located at five sites: Belgium, Germany, Italy, the Netherlands, and Spain. The JRC's Institute for Health and
f*r\r\ri Kv^^r Drr-» + ^o+irM"i'c KV^ -^ 11-1 r^fo-^ryl-i rolo» i-a K-I+- 4-r-» M o v ^" OKI I i-i y»l i i yl oc Ii-i-l-^MTr-^-l-oyJ riflx -^ i-i yl l^^M-iofi-l- -5ff e*rr KV^ OKI 4-c r\f y»ki£M-v^ ir"^ I
substances; fit-for-purpose analytical tools to help ensure the safety of food and consumer products; and optimization
and validation of methods that reduce the reliance on animal tests in the safety assessment of chemicals.
U.S. ACTIVITIES
Several documents have guided the NexGen effort, including the Strategic Plan for the Future of Toxicity Testing and Risk
Assessment at the U.S. Environmental Protection Agency (EPA 2009a), the Toxicology in the 21st Century (Tox21)
strategy, and the National Institutes of Health Strategic Plan (NIEHS 2012c). Ongoing research activities of several federal
agencies that have informed and continue to inform the NexGen effort are described below.
computational environmental health and occupational research. The National Center for Environmental Health (NCEH)
and Agency for Toxic Substances and Disease Registry (ATSDR) scientists in the Computational Toxicology Laboratory
have applied several new approaches for improving chemical risk assessments. They have mined the National Health
and Nutrition Examination Survey (NHANES) data set to obtain high-quality analytical and human health information,
which is representative of the general U.S. population, and used computer modeling to identify sensitive populations for
health outcomes at environmental exposure levels. A second project involved use of NHANES public health genomics
data to identify allelic differences in ALA dehydratase for susceptibility to lead-induced hypertension. Another
concerned the development and application of QSAR, physiologically based pharmacokinetic (PBPK), and molecular
docking approaches. These studies involved both data mining of the published scientific literature and collaborative
laboratory studies with scientists at the Food and Drug Administration (FDA).
contribute to the development and severity of occupational diseases using high-density and high-throughput (HT)
genotyping platforms. Understanding the genetic contribution to the development, progression, and outcomes of
complex occupational diseases will help improve the accuracy of risk assessment and improve safe exposure levels for
genetically susceptible groups in the workforce.
scientifically sound basis for regulatory decisions and reduce risks associated with FDA-regulated products. NCTR
research evaluates biological effects of potentially toxic chemicals, defines the complex mechanisms that govern their
toxicity, identifies the critical biological events in the expression of toxicity, discovers biomarkers, and develops new
scientific tools and methods to improve assessment of human exposure, susceptibility, and risk. Examples of tools
created by NCTR include ArrayTrack™, Decision Forest, Endocrine Disrupter Knowledge Base (EDKB), Gene Ontology for
Functional Analysis (GOFFA), and SNPTrack. Efforts include the MicroArray Quality Control (MAQC) consortia.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 4

-------
Box 2. Current Legislation and Governmental Research Activities in Europe and the United States (Continued)

U.S. ACTIVITIES (CONTINUED)
The National Institutes of Health (NIH) National Center for Advancing Translational Sciences (NCATS) conducts
research to resolve scientific and technical challenges that might cause barriers to the efficient development of new
treatments and tests to improve human health. The National Chemical Genomics Center (NCGC) at the National Center
for Advancing Translational Sciences applies high-throughput screening (HTS) assay guidance, informatics, and chemistry
resources for NCAT's Re-engineering Translational Sciences research projects. Specifically, NCGC research programs
include assay development and HTS, and participation in Tox21. NCGC Assay Biology Teams are researching optimization
of biochemical, cellular, and model organism-based assays submitted by the biomedical research community for HT
small molecule screening. The results of these screens (probes) can be used to further examine protein and cell
functions and biological processes relevant to physiology and disease (NIH 2012).
The National Human Genome Research Institute (NHGRI) was established by NIH in 1989 to implement the
International Human Genome Project to map the human genome. NHGRI has developed programs for a variety of
research projects including Encyclopedia of DNA Elements (ENCODE), Gene Expression Omnibus (GEO), and collaborative
projects, including the Comparative Toxicogenomic Database (CTD), HapMap, and Gene. Through the application of
these tools, NHGRI hopes to gain a greater understanding of human genetic disease, and develop better methods for the
detection, prevention, and treatment of genetic disorders.
The National Institute of Environmental Health Science (NIEHS) and the National Toxicology Program (NTP) have
played an integral role in the development and application of HTS data. Current research is focused on developing and
validating Tox21 approaches to improve hazard identification, characterization, and risk assessment (Birnbaum 2012,
Serafimova et al. 2007). The NTP HTS program has three specific goals: (1) prioritizing substances for in-depth
toxicological evaluation, (2) identifying mechanisms of action for further investigation (e.g., disease-associated
pathways), and (3) developing predictive models for in vivo biological response (i.e., predictive toxicology). NTP is
developing innovative and flexible approaches to data integration, both across research programs and across different
data types (e.g., HT, mechanistic, animal studies) (Bucher et al. 2011). These efforts seek to integrate results from new
techniques with traditional toxicology data to provide a public health context.
The Engineer Research and Development Center (ERDC). the research organization of the U.S. Armv Corns of
Engineers, conducts research and development in support of warfighters, military installations, and civil works projects
involving water resources and environmental missions. The ERDC Toxicogenomics research cluster focuses on using
genomics to develop tools to rapidly assess toxicity of military chemicals in a wide range of animals, identifying gene
biomarkers of exposure, understanding the mechanisms by which military chemicals cause toxicity, and extrapolatint
toxicity effects across multiple species. Capabilities of the team include advanced instrumentation to characterize
impacts of chemicals on gene expression with high-density gene arrays, DNA sequencing, and real-time polymerase
chain reaction (RT-PCR) assays. ERDC Toxicogenomic projects include development of rapid assays to assess whole
genome impacts of munitions-related compounds, including gene arrays with short exposure screening in daphnia, rat
cells, rat livers, and fish; comparison of genomicand behavioral responses of fathead minnows and zebrafish to chemical
exposures; conservation of response to nitroaromatics across species; and support for a toxicogenomic assessment
framework to integrate predictive toxicology of munitions-related compounds.
Several EPA Office of Research and Development laboratories and centers have been involved in NexGen. EPA's
National Center for Environmental Assessment (NCEA) has assumed a leadership and coordination role for the NexGen
effort. The National Center for Computational Toxicology (NCCT) is the largest component of EPA's Computational
Toxicology Research Program. The Center coordinates computational toxicology research on chemical screening and
prioritization, informatics, and systems modeling. NCCT research includes the (1) use of informatics, HTS technologies,
and systems biology to develop accurate and flexible computational tools that can screen the thousands of chemicals for
potential toxicity; and (2) application of mathematical and advanced computer models to help assess chemical hazards
and risks. EPA's National Center for Environmental Research (NCER) supports extramural computational toxicology
research. The National Health and Environmental Effects Research Laboratory (NHEERL) conducts toxicological, clinical,
and epidemiological research to improve the process of human health risk assessments, including development of
biological assays and toxicological assessment methods, predictive pharmacokinetic/pharmacodynamic models, and
advanced extrapolation methods.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 5

-------
1 The initial NexGen prototypes were designed to provide concrete examples that illustrate the
2 potential for various new methods and data to be used for specific risk assessments within a
3 decision context4 and to foster further discussion in the risk assessment and risk management
4 communities to promote continual improvement.

5 This report presents and discusses the results of this effort, and is organized as follows:

6 • Section 2: Preparation for Prototype Development - describes the preliminary work and
7 workshops conducted to characterize the decision context and conceptual framework and to
8 identify the stakeholders and key issues so that the prototypes provide examples relevant to
9 the needs of the risk assessment community.
10 • Section 3: The Prototypes - provides detailed examples of the use of various advanced
11 methods and data in each of the three tiers, starting with chemicals for Tier 3 "Major Scope
12 Assessments," which are data-rich chemicals, proceeding to Tier 2 and Tier 1 chemicals that
13 have increasingly limited or no in vivo data sufficient to conduct a traditional (e.g., IRIS) risk
14 assessment
15 • Section 4: Advanced Approaches to Recurring Issues in Risk Assessment - describes how
16 advanced methods are being used to address recurring and challenging issues, including
17 characterizing variability in deriving toxicity values and assessing potential hazards from
18 exposure to mixtures.
19 • Section 5: Lessons Learnedfrom Developing the Protypes - describes lessons learned in
20 developing the Tier 1, 2, and 3 prototype examples.
21 • Section 6: Conclusions- outlines the major challenges and future direction for the NexGen
22 program.

23 • Appendix A lists technical papers supporting this report

24 • Appendix B provides a glossary.

2.1.
25 One of the first tasks undertaken in planning the NexGen effort was consideration of the various
26 environmental situations of concern to EPA's Program Offices—in other words, the decision context
27 [termed in Cote et al. (2012) and the National Research Council (NRC 2009) and National Academy
28 of Science (NAS 2007) reports]. Decision context defines what environmental management decision
29 is being made and why, as well as its relationship to other decisions previously made or anticipated.
30 EPA Program Offices are generally organized around specific pieces of environmental legislation,
31 such as the Clean Air Act and the Clean Water Act, and are responsible for administering those laws.
32 Each major piece of legislation brings different responsibilities and nuances to problems faced by
33 risk managers. In Figure 2, the decision context is represented in three categories for ease of
34 description. The characteristics that define the three decision context categories and examples of
35 specific problems faced by the Program Offices are shown. This figure elaborates on the decision
4See Section 2.1 for a definition of "decision context" and a discussion of its use.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 6

-------
 1    context figure in the report, Science and Decisions: Advancing Risk Assessment, and is the result of
 2    discussions with EPA Program Offices (EPA 2011b).
             Screening and Prioritization
  National Scope Assessments
  Limited Scope Assessments
              Potential oractual exposures
              Unknown or limited data on
              hazards
              includes all unevaluated
              chemicals in the environment
              with actual or potential
              exposures
            Examples
              New assessment queuing
              Test rules support
              Urgent response
              Research priorities development
  Generally wide spread exposures
  Sometoextensivetraditional
  hazard data
  Includes 100s of chemicals of
  highest concern
Examples
  High profile, nationally important
  assessments j e.g., IRIS and ISAs)
  Community assessments
  Special issues evaluations
   - Susceptible/sensitive
     subpopulations
   - Mixtures and other stressors
   Research priorities development
  Generally limited orlocalized
  exposures
  Limitedtraditional hazard data
  Includes 1000s of chemicals with
  limited data suggestive of
  potential hazard
Examples
  Superfund remediatioiV
  hazardous waste disposal
  Potential water contaminant
  identification
  National AirToxics Assessment
  support
  Research priorities development
                                                                              Decision-making
      Figure 2. Description of decision context categories provided by EPA Program Offices. These decision context
      categories reflect the range of environmental problems to be addressed, from the need to screen many untested
      chemicals in the environment to national regulations for high profile chemicals. The flow from decision context
      through risk assessment to decision-making and the related roles of testing and research are also noted.


 3    Three factors integral to the decision context for risk managers are the potential exposure, the
 4    number of chemicals that should be considered, and the weight of scientific evidence for supporting
 5    decision-making. Both legislative language and the history of specific regulatory programs
 6    influence the numbers of chemicals considered and the uncertainty in supporting data that can be
 7    tolerated. Tier 3 decision context focuses on nationally relevant chemicals with widespread
 8    exposures and established hazards and for which major regulatory evaluations are likely in
 9    progress. An example would include the International Agency for Research on Cancer (IARC)
10    benzene assessment (2012) where molecular mechanistic information was used to support the
11    causal link between benzene and hematopoietic cancers, particularly when the epidemiology data
12    were somewhat limited. Tier 2 focuses on chemicals for which exposure or hazard appears limited
13    or available  data for detailed assessment are limited. An example includes evaluation of biological
14    activity and  cumulative risk potential of conazole fungicides (EPA 2011e) and potential endocrine
15    disrupters (EPA 2011c), both of which are based on molecular biology data in combination with
16    traditional data. Tier 1 decision context focuses on the tens of thousands of chemicals present in
                  This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
      September 2013                                    7

-------
1 commerce in significant amounts, but for which we have little knowledge of exposure levels or
2 potential health effects. An example is the high-throughput (HT)-based evaluations of Deep Water
3 Horizon Gulf oil spill dispersants (Judsonetal. 2010).
4 A second task that preceded finalizing plans for the NexGen prototypes was the development of a
5 guiding framework. The framework draws together several important elements of earlier risk
6 assessment frameworks and articulates guiding principles for risk assessment development
7 informed by new data types and methods. A draft version of this framework was presented and
8 discussed in October 2010 at a meeting with scientific experts (EPA 2010) and in February 2011 at
9 a public meeting with stakeholders (EPA 2011a). The framework is described in a report by
10 Krewskietal. (2013).

11 The NexGen framework is built on three cornerstones (as illustrated in Figure 3): (1) new risk
12 assessment methodologies to better inform risk management decision-making; (2) new data types
13 from advances in biology and toxicology on understanding perturbations of biological pathways;
14 and (3) a population health perspective that recognizes that most adverse health outcomes involve
15 multiple determinants. The NexGen framework integrates these three cornerstones into a
16 framework for risk science that progresses in three phases: (1) Objectives, (2) Risk Assessment,
17 and (3) Risk Management Phase 1 (Objectives) focuses on problem formulation and scoping, taking
18 into account the decision context and the range of available or admissible risk management
19 decision-making options. Phase 2 (Risk Assessment) seeks to identify disease or outcome pathways
20 using new toxicity testing tools and technologies and attempts to improve the characterization of
21 risks and uncertainties using advanced risk assessment methodologies. Phase 3 (Risk Management)
22 involves the development of evidence-based population health risk management strategies of a
23 regulatory, economic, advisory, community, or technological nature, based on sound principles of
24 risk management decision-making. Implementation of the NexGen framework is exemplified with a
25 series of case-study prototypes, illustrating how aspects of the framework have been put into
26 practice.

27 NRC provided a blueprint for pathway-based toxicity testing in its 2007 report, Toxicity Testing in
28 the 21st Century: A Vision and a Strategy (NRC 2007). Guidance on some of the new risk assessment
29 methods is provided by the 2009 report, Science and Decisions, Advancing Risk Assessment
30 [NRC (2009)]. The integration of a population health approach was drawn from the McLaughlin
31 Centre's integrated risk management and population health framework. Key elements of risk
32 science and population health are combined to offer a multidisciplinary approach to the assessment
33 and management of health risk issues (Krewskietal. 2007).
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 8

-------
                                                              Socio-
                                                             political
                                                          Considerations
                               Risk
                            Perception
                                                                 Life Stage  I   Mixtures
                                                  Dose-response Assessment
                          Chemical
                        Characterization
                                                                          Population-based Studies
                                                               Dose-response
                                                                Analysis for
                                                               Toxicity Pathway
                                                                Perturbation(s)
                           Calibrating In Vitro
                          and Human Dosimetry
  Assess
 Biological
Perturbation(s)
                                                                           Human Exposure Data
                             Hazard Identification
                                                             Exposure Assessment
                             Biological
                                 &
                              Genetic
Environmental
      &
 Occupational
                                     Social
                                       &
                                   Behavioral
                                                Decision-making
                                                    Options
                          Value-of-
                        information
Figure 3. The Next Generation Framework for Risk Science. This framework is divided into three phases:
(1) Objectives: Problem Formulation and Scoping takes into consideration Risk Context,5 Decision-Making Options,
and Value-of-lnformation; (2) Risk Assessment involves three sub-categories: (A) Health Determinants and
Interactions, (B) New Scientific Tools and Technologies, and (C) New Risk Assessment Methodologies; and (3) Risk
Management involves two categories: (A) Risk-Based Decision-Making that involves Risk Management Principles,
Economic Analysis, Socio-Political Consideration, and Risk Perception, and (B) Risk Management Interventions with
five possible categories: Regulatory, Economic, Advisory, Community, and Technical (Krewski et al. 2013).
5The term decision context is used elsewhere in this report.
             This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013                                        9

-------
2-3. S

1 Outreach to the science community and stakeholder groups was part of the NexGen strategy from
2 its inception. This document in its final form is viewed as an interim step to implementation of new
3 advances in risk assessment and is intended to promote further discussion with stakeholders
4 toward continual improvement of risk assessments and prototypes informed by new data types and
5 methods.

6 Given the technical complexity of the research, stakeholder engagement is a particular challenge
7 and will necessitate ongoing outreach and discussion throughout the process. Our initial efforts are
8 described below.

9 EPA convened a 3-day expert workshop on November 1-3, 2010, in Research Triangle Park, North
10 Carolina, to discuss the draft framework, early draft prototypes, research, and other project
11 elements. The workshop sought individual input, rather than consensus, in meeting its discussion
12 goals. Days 1 and 2 of the workshop focused on deliberative drafts of data-rich prototype health
13 assessments. The goals were to (1) refine health assessment case studies of data-rich chemicals
14 informed by molecular biology (i.e., "prototypes"); (2) enhance "reverse engineering" from
15 molecular system biology data, to "known" public health risk estimates based on in vivo human and
16 animal bioassay data to demonstrate proof of concept, elucidate value of information, and
17 characterize decision considerations; and (3) summarize options for expanded future work and
18 research needs.

19 Day 3 focused on approaches applicable to assessing the potential risks posed by chemicals with
20 limited or no traditional data. The goals were to (1) identify and discuss a wider variety of new data
21 types, methods, and knowledge to help characterize data-limited chemicals; (2) consider how this
22 information might augment, extend, or replace traditional data in health assessment; and
23 (3) summarize options for expanded future work and research needs. Approximately 40 federal
24 and nonfederal experts and 80 and partner organization staff members attended the workshop. A
25 workshop report with the agenda and list of participants is available (EPA 2010).

26 In 2012, both the Science Advisory Board (SAB) and the Board of Scientific Counselors (BOSC)
27 reviewed aspects of the NexGen program as part of their evaluations of EPA's computational
28 toxicology research (SAB 2013, BOSC 2010). Both the SAB and BOSC commended the exceptional
29 efforts of the Computational Toxicology Research Program to advance hazard/risk assessment and
30 provided recommendations for the continued success of the program. The reviews emphasized
31 further research on chemical exposure pathways resulting from human activity patterns (e.g.,
32 ExpoCast); engagement of the scientific community and stakeholders to foster future partnerships
33 and promote information exchange; broader outreach for dissemination of scientific findings;
34 gathering of user-feedback from the general public; improvements in data access through enhanced
35 website navigation; and development of guidelines for data usage.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 10

-------
      2.3.2.  Stakeholder Involvement
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30

31
32
33
34
35
36
      Stakeholder Public Dialogue Conference

      To engage stakeholders in the early stages of the NexGen program, EPA sponsored a public dialogue
      conference, "Advancing the Next Generation of Risk Assessment," on February 15 and 16, 2011, in
      Washington, DC. This conference presented stakeholders with an opportunity to learn about the
      NexGen program, and to provide their thoughts on the challenges the project faces and its path
      forward. Approximately 160 participants, representing 11 stakeholder groups (Figure 4), attended
      the conference. The conference report includes the agenda, list of participants, and
      recommendations of the group (EPA 2011a). In addition to this conference, "one-on-one"
      interviews (described below) were conducted with leaders of public-interest groups and the
      business community.
Public Interest Group
Perspectives

After the work shop, follow-up
informal one-on-one interviews
were conducted in mid-2010
with several Washington,
DC-based representatives of
national environmental, public
health, and animal welfare
public-interest organizations.
Ronald White, a faculty member
at Johns Hopkins Bloomberg
School of Public Health,
conducted these interviews and
informational meetings as a
component of his research on
public engagement regarding
emerging risk assessment
methods. He also developed a
web-based assessment in late 2010
organizations, their knowledge and
chemical/pollutant risk assessment
assessment.
                                                           State Agency    International Agency
                                                              1%     \  j       2%
                                                         Industry/Trade
                                                         Organization
                                                            17%
                                                          Environmental/Publ ic
                                                          Health Organization
                                                               17%
Animal Welfare
 Organization
     2%

 Other Non-Profit
     4%
                                      Figure 4. Categories of stakeholders that attended the February 2011
                                      NexGen public dialogue conference (EPA 2011a).
                                     to ascertain, from nongovernmental public-interest
                                     interest in emerging scientific approaches for
                                      Of the 24 organizations contacted, 8 (33%) responded to the
     A key question raised in these interviews and web-based assessment was how relevant the NexGen
     program is to near-term EPA pollutant/chemical risk assessment procedures and control policies.
     The public-interest stakeholders interviewed and those who responded to the online assessment
     questions generally supported the concept of integrating the results from emerging biological
     science and analytic techniques into EPA's approach to conducting chemical health-based risk
     assessment. Significant concerns emerged, however, regarding the following:
                This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
      September 2013                                11

-------
1 • The potential for overstating the utility of NexGen approaches.
2 • How NexGen prototypes will address key risk assessment methodological issues, such as low-
3 dose exposure assessment, population variability in response, and additivity to background
4 exposures and disease processes.
5 • The transparency of the NexGen assessment development process and opportunities for
6 early, meaningful engagement by public-interest organizations, and the application of NexGen
7 approaches in risk management

8 Industry or business perspectives on NexGen approaches also were of interest. Dr. Gerald Poje,
9 (Environmental Health Consultant, Former Board Member of the U.S. Chemical Safety and Hazard
10 Investigation Board) had follow-up discussions with nine individuals representing the high-
11 production volume and smaller specialty chemical manufacturing industries, the pharmaceutical
12 industry, the retail sector, and the energy sector. The participants were generally optimistic about
13 potential advances in risk assessment and identified two potential advantages: (1) better prioritize
14 the needs for more expensive and longer duration whole-animal testing, and (2) save time and
15 money while rationalizing decisions in a tier-based manner using HT and other Tier 1 and Tier 2
16 tests. They also suggested that the success of NexGen effort depends on EPA's ability to prove the
17 value of the newer, tiered approach within EPA's emerging risk assessment model, the level of
18 EPA's investment in the long-term iterative NexGen research effort, and the timely and effective
19 communication of the evidence to support science-based risk assessment.

20 Some in the business community expressed concern over whether EPA could match the
21 pharmaceutical industry's growing infrastructure (needed to support and sustain a NexGen-like
22 effort) such as EPA's ability to unite sufficient numbers of expert biologists, chemists, and
23 bioinformatics to guide the program to a successful conclusion. The technical complexity of the
24 NexGen program might also hinder its impact on current risk assessment, risk management, and
25 business development practices, given the many unknowns that remain. Cultural challenges in
26 winning over a larger community, who will welcome the use of more recent advances in risk
27 assessment methods; however, was thought to be surmountable if EPA could be effective at
28 capacity building and communicating how new data types and approaches could be used for risk
29 assessment

2.4. in

30 The fourth task that preceded the actual prototype development was identification of problematic
31 issues that might be substantively informed by new methods and data. The issues included problem
32 formation, adversity classifications and weight of evidence, dose-response modeling (especially at
3 3 the low-dose end), variability in human response (due to a variety of factors), interspecies
34 extrapolation, mixtures risk assessment, and characterization of uncertainty. These issues are
35 explored in the prototypes to the extent feasible, and some are discussed in more detail in papers
36 on human variability (Zeise et al. 2012), early-life exposure and later-life disease risks (Boekelheide
37 etal. 2012), and multifactorial interactions of environment and genes (Patel etal. 2012a, Patel etal.
38 2012b, Zhuoetal. 2012, Shen etal. 2011, Smith, MT etal. 2011).
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 12

-------
3. The Prototypes
i
2
3
4
5
6
7
8

9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27

28
29
30
31
32
33
34
35
36
37
38
39
40
41
EPA's Office of Research and Development, in conjunction with other federal, state, academic,
public, and private partners (see Acknowledgments), developed prototype assessments to provide
concrete illustrations of how new and emerging information could inform risk assessment The
prototypes used a variety of study types, methods, data, and risk assessment approaches, and are
intended to (1) engender movement in the field of risk assessment from strategy to practical
application of new approaches, and (2) foster discussion and refinement of approaches in the risk
assessment and risk management communities, as
well as with the public.
Box 3. Selection Criteria for Prototypes

Decision context applicability (i.e., methods
applicable to various types of risk management
situations)
Data availability (i.e., both NexGen and
traditional data existed to allow for validation of
new approaches)
Illustration of a variety of methods
Methods
•S Data quality
•S Multiple, high-quality studies
•S Consistent, coherent, and biologically
plausible data
Active collaborations with investigators to
benefit from their knowledge, modify
experiments, and conduct additional analyses as
needed
Cross-organizational collaborations fostered
The results presented in this report demonstrate
proof of concept, provide insight on what types of
information are valuable for specific purposes, and
provide examples of the decision considerations for
reasonable, consistent, and coherent use of the new
types of information for specific applications. The
prototypes also illustrate many of the challenges.
Text Box 3 lists selection criteria used in choosing
prototypes. Figure 5 broadly categorizes the types of
methods aligned to decision context and evaluated
in the prototypes. As noted earlier, the number of
chemicals that need to be evaluated and the level of
confidence required for decision-making are key
components of designing fit-for-purpose
assessments. The integration of knowledge from a
wide variety of methods is likely to be most
informative to risk assessment. Lessons learned
from each prototype and group of similar prototypes will be noted as they arise and, then
integrated and summarized in Section 5 "Lessons Learned from Developing the Prototypes."

Throughout this report, characterizing systems biology is greatly emphasized. Systems biology is a
critical field in modern biology aimed at understanding the larger picture by integration across
multiple levels of biology—for example, from the gene to the molecular intermediate phenotypes
(e.g., gene expression), to alterations in molecular pathways and networks, and the propagation of
effects from cells to tissues to organs and the whole body. Systems biology also can encompass
subpopulation and population dynamics. Thoroughly understanding modern biology is difficult
without understanding systems biology. Two basic approaches are used to develop systems
understanding: bottom up and top down. The bottom-up approach focuses on altered molecular
and cellular components, and seeks to understand how the altered components fit together. This
approach is addressed most extensively in Tiers 1 and 2. The top-down approach focuses on larger
scale network interactions and disease indicators based on human clinical and epidemiologic data,
and associations between disease states and environmental factors (Friend 2013). This approach is
addressed most extensively in Tiers 3 and 2. Both the bottom-up and top-down approaches can be
informative, and are best used when integrated to support development of a comprehensive model.
7Vi/s document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 13

-------
Types of Methods and
Types of Methods
Characteristics
Exposure Data:
Structure
Information:
Assay Types:
Extra
Information
Sources:
Time to
Conduct:
Cost:
Exposures:
Exposure
Duration:
Metabolism:
Endpoints:
•.: Screening and
rioritization
Surrogate
QSAR Models
High-throughput (HT)
Assays
Computer (In silica)
Toxicity Models
Hours-Days
$
In Vitro
Short
Little to None
Alterations in Key
Biological Process
B: Limited Scope
ssessments
Limited
QSAR Models and Read
Across
High Content
Database Mining
Hours-Weeks
$-$$
In Vitro and In Vivo
Short
Some
Alterations in Key
Biological Process to
Adverse Effects
^^^J
Extensive environmental
Mechanistic Understanding
All informative
Traditional Data
Biomarkers of Exposure
and Effect
Days-Years
$$-$$$
In Vivo
Longer
Substantial
Alterations in Key
Biological Process,
Intermediate Event to
Adverse Effects/Disease



      Figure 5. Shown are the types of methods used to generate data for the prototypes and characteristics of each
      method. Note that all methods can be used in each decision context as available.
      3.1.  Tier 3: Major Scope Assessments

 1    Tier 3 prototypes focused on chemicals with robust traditional data sets, known public health
 2    outcomes, and high-confidence risk estimates. The purpose of studying these already well-
 3    characterized chemicals was to better understand how new data types and methods can be most
 4    effectively used in risk assessment situations where traditional data are absent or limited. In other
 5    words, by "reverse engineering" from known public health risks to new types of data, it was
 6    thought that potential advances in risk assessment using new types of data could be verified.
 7    Molecular epidemiology, molecular clinical, and molecular in vivo animal data were evaluated in the
 8    context of traditional information (Table 1). The Tier 3 prototypes aimed to: (1) demonstrate proof
 9    of concept that new data and methods can help identify hazards and inform exposure-dose-
10    response relationships; (2) better characterize what information is most valuable for specific risk
11    assessment purposes; and (3) articulate decision considerations for identifying, analyzing, and
12    interpreting data, particularly for use in assessment of data-poor chemicals. Secondarily, this effort
13    explored how new data types can augment robust traditional data sets, and brings new insights to
14    the interpretation of traditional data.
                This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
      September 2013                                 14

-------
Table 1. Tier 3 Prototypes Approach, Including Weight of Evidence, Pros, and Cons
Tier 3: Major Scope Assessments
Environmentally-relevant In Vivo Exposure Studies
with Molecular Characterization
Approaches:
Focuses on human data from molecular epidemiology and molecular clinical studies.
Includes molecularly augmented, traditional in vivo animal bioassays.
Experimentally measures dose-dependent, chemically-induced alterations in biologic
functions linked to traditional intermediate events and disease outcomes.
Evaluates environmentally-relevant exposures.
Characterizes sensitive subpopulations.
Helps characterize impacts of various environmental factors.
Weight of evidence:
Determined by the quality and quantity of data, but can range from suggestive to known.
Pros:
Characterizes human population-associated or causal mechanisms.
Can inform low-dose, species-to-species and inter-individual variability, and uncertainty
with data.
Allows extrapolation of molecular patterns to predict outcomes for less well studied
chemicals.
Cons:
Are not faster or less expensive than traditional bioassays.
Need to control for experimental variability.
1 The Tier 3 prototypes are benzene (and leukemia); ozone (and inflammation and lung injury); and
2 benzo[a]pyrene (B[a]P), a polycyclic aromatic hydrocarbon (PAH) (and liver cancer). The
3 prototypes focused on human data, both molecular epidemiology and molecular clinical data.
4 Human environmental exposures for the benzene and ozone prototypes were very well
5 characterized using a urinary biomarker and 1802 dosimetry, respectively. For B[a]P, we evaluated
6 human environmental exposures and liver cancer omics data; this evaluation was qualitatively
7 successful, but exposures were relatively poorly characterized for quantitative exposure-response
8 assessment. Hence, experimental rodent data were evaluated in addition to the human data.

9 Overall, the prototypes evaluated the use of toxicogenomics to better characterize risks, including
10 DNA transcription (transcriptomics), protein expression (proteomics), and genome-wide analyses
11 of susceptibility genes (genomics analyses of human gene variants). Some limited discussion of
12 epigenetic modification (epigenomics) in human populations is also included. Bioinformatics
13 analyses were used to evaluate toxicogenomic profiles in the context of traditional knowledge of
14 phenotypic endpoints. Each prototype:

15 • Describes a systems biology model suitable for informing hazard identification;
16 • Characterizes molecular biomarkers of exposure and effects suitable for characterizing
17 exposure-response at environmental concentrations;
18 • Illustrates how multiple pathway alterations induced by environmental factors can lead to
19 and modify risks, and notes how this information might be used to characterize data-limited
20 chemicals and cumulative risks; and
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 15

-------
1 • Identifies some gene variants that influence human susceptibility and alter risks for selected
2 subpopulations, and notes how this information could be used to characterize population
3 variability.

4 The results presented here are not intended to be a comprehensive review of all available data that
5 might be used in a risk assessment, but rather provide examples of evaluation of new data types
6 and to illustrate potential uses in risk assessment In addition, toxicogenomics data must be
7 interpreted carefully in this context (see Text Box 4).
.. A Word of Caution in Interpreting Toxicogenomic Resi
oxicogenomic results can be a substantial source of data misinterpr
design and statistical techniques increase confidence in observed associations, and increase the power to detect
associations, between exposure and gene expressions particularly at low exposure levels. More generally, without
such considerations, variability may obscure actual outcomes or lead to specious associations. Studies without
rigorous design, data collection, and analyses are less likely to be considered appropriate for use in risk
assessment.
8 One caveat is that the studies used in the Tier 3 prototypes were chosen mainly because they had
9 some of the most robust, concomitantly collected, traditional and new data types available. These
10 data sets demonstrate partially what can be done with new data types; however, similar data are
11 not likely to be available for many chemicals. This exercise clearly revealed that care must be given
12 to the selection of studies for new types of risk assessment, as many are insufficient for the
13 applications discussed below. The B[a]P prototype, in particular, highlights some of the challenges
14 encountered. Additionally, the fields of molecular, computational, and systems biology are in their
15 infancy in terms of application to human health risk assessment Although results presented here
16 are promising, robust understanding and full implementation of new methods in general practice,
17 might take years, subject to the resources available for data generation and evaluation.

18 Implications for risk assessment identified by the Tier 3 prototypes are discussed at the end of this
19 section and integrated with other lessons learned in Section 5, "Lessons Learned from Developing
20 the Prototypes." It should be reiterated that the primary intention of the Tier 3 prototypes is to
21 "ground truth" approaches that could be used in more data-limited situations.

3.1.1. Benzene-Induced Leukemia

22 Benzene is among the 20 most widely used chemicals in the United States and is among the most
23 common environmental contaminants. A component of crude oil and gasoline, benzene is also used
24 as an intermediate in the manufacture of resins, dyes, chemical solvents, waxes, paints, glues,
25 plastics, and synthetic rubbers. The major sources of benzene exposure are anthropogenic and
26 include fixed industrial sources, fuel evaporation from gasoline filling stations, and automobile
27 exhaust. Benzene has been measured in outdoor air at various locations in the United States at
28 concentrations ranging from 0.02 ppb (0.06 [ig/m3] in a rural area to 112 ppb (356 [ig/m3] in an
29 urban area (IARC 2012). Personal monitoring of benzene exposure in Detroit, Michigan, reported a
30 mean of 1.72 ppb (5.5 [ig/m3) (George etal. 2011). The maximum contaminant level (MCL) in
31 drinking water is 5.0 [ig/L or 5 ppb (EPA 2012b). The OSHA permissible exposure limit (PEL) for
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 16

-------
1 benzene workers in the United States is 1 ppm
2 [https://www.osha.gov/dts/chemicalsampling/data/CH 220100.html).

3 Benzene is a known human carcinogen (IARC 2012, ATSDR 2007, EPA 2000, NIOSH 1992).
4 Epidemiologic studies have shown that benzene exposure leads to an increased risk of acute
5 myeloid leukemia (AML), myelodysplastic syndrome (MDS), hematotoxicity (toxicity to the blood),
6 and other blood disorders (IARC 2012, Schnatter et al. 2012, EPA 2000, Goldstein 1988). AML is
7 characterized by uncontrolled proliferation of clonal neoplastic cells and accumulation in the bone
8 marrow, with an impaired differentiation program. AML accounts for about 3 0% of all adult
9 leukemias and is the most common cause of leukemia death (Howlader etal. 2013). Studies also
10 indicate that benzene mightcause lymphoma and childhood leukemia (Smith, MT etal. 2011).

11 The extensive molecular epidemiologic and molecular clinical data sets available for both benzene
12 and leukemia are ideal to explore how new data types might be used to inform risk assessments.
13 The work described here focuses on studies where traditional and molecular data were collected
14 simultaneously using a variety of omic methods, including genome-wide analyses of susceptibility
15 genes (using genomic methods), protein expression (proteomics), and epigenetic modification
16 (epigenomics) (McHale et al., 2012). The studies also were conducted over a range of
17 environmental exposure levels (<0.1 ppm to < 10 ppm). The information was developed primarily
18 by Martyn Smith and colleagues (University of California, Berkeley). Systems biology of benzene-
19 induced leukemia is summarized in McHale et al. (2011) and Smith etal. (2011).

Systems Biology of Benzene-Induced Disease

20 Although benzene is among the most well-studied environmental chemicals, understanding the
21 molecular mechanisms underlying hematopoietic cancer is somewhat recent (see Text Box 5 for a
22 brief description). In 2009, McHale et al. (2012) identified exposure-dependent alterations in genes
23 and pathways (in peripheral blood mononuclear cells using transcriptomics), and hematotoxicity
24 associated with benzene exposure
25 (>10 ppm) in occupationally
26 exposed Chinese workers. McHale
27 etal. (2011) extended these
28 findings to lower exposure levels
29 (<1 ppm to < 10 ppm). (The
30 current U.S. occupational
31 standard is 1 ppm.) In subsequent
32 work, Thomas et al. demonstrated
33 changes in gene expression at
3 4 current U. S. urban levels in
35 Chinese workers exposed to levels
36 <0.1 ppm. The exposure-response
37 models used in these analyses
38 were not selected a priori, but
39 rather driven by the best fit of the
40 data. Results are consistent with
41 supralinear exposure-responses,
42 which have also been reported in
43 traditional epidemiology studies (Lan et al. 2004).
Box 5. Molecular Mechanism of Acute Myeloid Leukemia (AML)

The probable mechanism by which benzene induces leukemia involves
the "targeting of critical genes and pathways" (McHale et al. 2012).
Benzene has the potential to induce abnormalities in the genes,
chromosomes, or epigenetic mechanisms of hematopoietic stem cells
(HSC). It can also disrupt its normal cell cycle, leading to apoptosis,
increased cell proliferation, and altered differentiation of the HSCs.
Benzene causes these effects and ultimately leukemia through oxidative
stress, dysregulating proteins that control normal functioning of HSCs,
and reducing the ability of the body to detect and destroy cancerous
cells (McHale etal. 2012).

For AML specifically, two events that are important for leukemic
transformation have been identified. The first event is uncontrolled cell
;rowth, which is mediated by upregulation of cell survival genes. The
second event is alteration of transcription factors that control the HSC
differentiation. That is, the transcription factor proteins can be mutated
or can target certain genes in a way that interferes with the appropriate
differentiation of HSCs (Kanehisa Laboratories 2013, Wang, I et al.
2012).
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 17

-------
1 The systems biology of benzene-induced early effects have been articulated by McHale et al. (2012)
2 and others (Smith, MT etal. 2011, Zhang, L etal. 2010). Benzene-induced leukemia is thought to be
3 initiated when metabolites of benzene target genes or pathways that are critical to hematopoiesis
4 in hematopoietic stem cells. Interactions among various cell types within the bone marrow and
5 among various tissues also play a role in leukemia (e.g., immunosurveillance). The underlying
6 mechanisms of benzene-induced leukemia, shown in Figure 6, center on exposure-dependent
7 pathway alterations comprising 147 significantly altered genes (cross validated on two microarray
8 test platforms [Illumina and Affymetrix]). The gene expression profile changes with dose, with
9 some genes (and related biological processes) being expressed at all levels, while others are
10 expressed only at higher concentrations. Of the 147 genes, the expression of was
11 significantly altered at all exposure levels. These 16 signature genes are involved in immune
12 response, inflammatory response, cell adhesion, cell matrix adhesion, and blood coagulation, and
13 are most strongly associated with AML disease pathways (McHale et al., 2011). This set of 16 genes
14 forms a biomarker for exposure (and associated leukemia) for future work, particularly in
15 augmenting traditional epidemiology studies and enabling new types of molecular epidemiology
16 studies at lower concentrations. As will be discussed later in this section, understanding of the
17 systems biology and molecular initiating events (MIEs) in leukemia can also potentially enable
18 screening of relatively unstudied chemicals for similar signature events. Clinical studies of
19 chemotherapeutic agents, which alter gene expression in these same pathways and are used in the
20 treatment of leukemia add evidence to the causal relationships between specific gene/pathway
21 alterations and leukemia (Hatzimichael and Crook 2013).

22 In addition to leukemia, a lymphoma disease signature is evident with benzene exposure (McHale
23 etal. 2012, McHale etal. 2011, Smith, MT etal. 2011). The traditional epidemiology data on
24 lymphoma are not conclusive. Characterization of a benzene-induced molecular mechanism for
2 5 lymphoma adds considerably to the weight of evidence for benzene-induced lymphoma,
26 highlighting the use of molecular mechanism or mode-of-action information to strengthen weight-
27 of-evidence determinations (IARC 2012).
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 18

-------
Oxidative Stress
Benzene Exposure
Metabolism
Stem Cell Niche
Dysregulation
DYSREGULATED
IMMUNE
RESPONSE
AhR
Dysregulation
Induction of HSC
from quiescence to
cycling
T
Apoptosis
HEMATO-
TOXICITY
I Key Events

Q, J Stem cells
Modifying Factor

ffl Toxicological Effect
Figure 6. Multiple modes of action (MOAs) of benzene-induced leukemogenesis. Potential key events, modifying
factors, and toxicological effects are depicted in the legend. Stem cells can be either HSCs (hematopoietic stem
cells) or LSCs (leukemic stem cells) (Smith, MT et al. 2011), reproduced with permission from Elsevier.
De Novo and Other Chemical Leukemogen-lnduced Disease

1 Interestingly, molecular mechanisms for benzene-induced leukemia appear similar to de novo
2 (without an obvious cause) AML and AML induced by other environmental agents (e.g., alkylating
3 agents, topoisomerase II inhibitors) (IARC 2012, McHale etal. 2012, Pedersen-Bjergaard etal.
4 2008). Figure 7a6 shows a network of genes and pathways involved in de novo and chemically
5 induced leukemia [Kyoto Encyclopedia of Genes and Genomes (KEGG) (Kanehisa Laboratories
6 2013)]. The circles in the figure indicate some of the specific genes and pathways affected by
7 leukemogenic agents and environmental modifiers (Kanehisa Laboratories 2013, IARC 2012,
8 McHale etal. 2011, Pedersen-Bjergaard etal. 2008). Additional evidence for the causal role for
9 these genes and pathways in AML is provided by the study of human genetic variants associated
10 with altered risks and chemotherapeutics that reverse adverse alterations in some of these same
6The basic AML network figure used in Figures 7 a and 7b is from the Kyoto Encyclopedia of Genes and Genomes
(KEGG) (Kanehisa Laboratories 2013)). The added circles are the work of the report authors.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 19

-------
1 genes and pathways (discussed below). Although mechanistically similar, different agents can
2 display specific characteristics; including origins in cells at different stages of hematopoiesis,
3 distinct cytogenetic subtypes, and different latencies (Irons et al. 2013, McHale et al. 2012).
4 Figure 7a highlights how a disease network can be modified at different points but still lead to a
5 common disease outcome. These mechanistic commonalities and differences among de novo and
6 chemically-induced health effects can be used to characterize chemicals with limited data, which
7 have nevertheless been shown to induce mutations, chromosome changes, or specific changes in
8 gene expression. Data-limited chemicals would be of elevated concern if they are shown to alter
9 pathways similar to that observed in de novo disease or with well-studied leukemogens. For
10 example, see the work in Thomas R et al. (2012) where the authors used existing information on
11 gene and protein targets of 29 known leukemia-causing chemicals and 11 carcinogens that are not
12 known to cause leukemia, the authors were able to develop a classification scheme that could
13 distinguish a random leukemia-causing/nonleukemia-causing carcinogen pair with a 76%
14 probability. Provided later in this section (in the ozone and B[a]P prototypes) is similar evidence
15 for the importance of networks when considering chemical-related diseases, similarities of
16 chemical-related and de novo diseases, and the role of mechanisms in improved understanding of
17 cumulative risks.

18 Evidence suggests that, in addition to environmental exposures, genetic variations and lifestyle
19 factors such as smoking, obesity, diet, and alcohol use are risk factors for leukemia (Smith, MT et al.
20 2011, Pedersen-Bjergaard et al. 2008, Belson et al. 2007, Ilhan et al. 2006). Environmental
21 exposures of the developing organism could also be a risk factor for disease later in life, given the
22 potential of benzene and other environmental agents to alter epigenetics, the sensitivity of the
23 developing organism to epigenomic changes, and the association of environmental exposures and
24 childhood leukemias (Boekelheide et al. 2012). Figure 7a shows how multiple environmental
25 factors can alter several molecular events in a manner that alters risks, and how mechanistic
26 knowledge might be used to identify or exclude chemicals based on common mechanisms and
27 impacts on cumulative risks.

28 Individuals exposed to known environmental and lifestyle risk factors are estimated to account for
29 approximately 20% of acute leukemia incidences, indicating that host genetic susceptibility might
30 be instrumental in the development of leukemia (Smith, MT et al. 2011). By identifying mechanistic
31 commonalities, or the lack thereof, among chemicals, new omic approaches can provide tools for
32 characterizing roles that intrinsic and extrinsic risk factors might play in individual and
33 subpopulation risks. Below we discuss genetic variation more specifically and provide an example
34 of altered subpopulation risks based on genetic variations.7

and in the

35 Genetic susceptibility for developing AML, and how it relates to chemical risks, has been studied by
36 several investigators (Zhuo etal. 2012, North etal. 2011, Shenetal. 2011, Smith, MT etal. 2011,
37 Garte et al. 2008). Several genetic variations in individual genes appear to increase risks for
7Human genetic variation is the genetic differences among subpopulations. Multiple variants of any given
gene might occur in the population. These differing DNA codings determine distinct traits or polymorphisms
that can influence risks.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 20

-------
1 developing AML, while at least one decreases risks. Sille et al. (2012) reported 12 independent risk
2 loci (specific regions within the genome, which can be a single base, as in this case, or an entire
3 gene) with the potential to alter gene expression. A significant number of variants (single
4 nucleotide polymorphisms [SNPs] or single nucleotide variations) related to a tumor suppressor
5 gene, signaling pathways, or residing in putative regulatory elements,8 have been linked to various
6 types of multiple hematological cancers. Figure 7b highlights genes that vary in the human
7 population and are associated with altered leukemia risks (Hatzimichael and Crook 2013, Kanehisa
8 Laboratories 2013). Figure 8 provides an example of differential risks resulting from one human
9 variant.9 The overall data indicated a significant variation in risk (42%) relative to the CYP1A1
10 genotype (Zhuo et al. 2012). The shift in odds ratio is also shown in Figure 8.

11 When one considers that many genes are associated with benzene-induced leukemia, the potential
12 for variation in subpopulation risks via individual genes, combinations of genes, and gene variants
13 becomes apparent. Other risk factors (e.g., lifestyle) would add to the human variability in response.
14 As discussed in Section 4, NexGen approaches exist that can facilitate characterization of human
15 variability as never before.
8Putative regulatory elements are areas of the gene that do not code for proteins but rather regulate DNA
transcription into proteins.
9SNP leads to a base substitution of isoleucine with valine at codon 462 in exon? (Ile462Val or CYP1A1*2C
polymorphism, rs!048943). Thus, the exon? restriction site polymorphism results in three genotypes: a
predominant homozygous lie/lie, the heterozygote Ile/Val, and a rare homozygous Val/Val.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 21

-------
I ACUTE MVELQID LEUKEMIA |

Hematopmetie pngcnitoB
Ant-apoptotx gews
Acute ruytloblss&c leuktmia
with minima) differentiation (MO)
Acute nralrjbhstic leukemia
•.' u •• it maturation (Ml )
Acute rawloblastc leukemia
with maturation (M2)

Acute fffomjfebsytic leukemia
-------
I ACUTE MYELOID LEUKEMIA |

H*matopoietic pogewtors
H*matopoietic |
cell toage J
Acute myebblastic leukemia
with minimal difFerenfaikui (MD)

Acute mvsloblastic leukemia
without maturation (Ml)

Acute myebblastic leukemia
wifli maturation (M2)

Acute promyebcytic kufcemia (M3)

Acute myebmonocytic leukemia (M4)

Acute m^bmonocytic leukemia
with abnormal eosmophils (M4Eo)

Acute monocyoc leukemia (M5)

Eiythroleukenua (M6)

Acute megalaiyjcytic leakemw (M7)
AnlUpoptotic genes
synthesis [ mTORsigMjing |
(. pethmy J

Proliferate genes
Circled are genes which have naturally

occurring variations (different coding of
the DNA) in the human population. These

variations, individually or in
combination, appear to alter leukemia

incidence and characteristics.
DNA /
fe CV/-+- RAK« /
tujel genes
1 *• ProHeiatbn
!>• Praliferatiiiii
Figure 7b. This figure shows the same acute myeloid leukemia (AML) KEGG diagram (Kanehisa Laboratories 2013) as shown in Figure 7a, with circles added by

authors. In this version, circled are the locations of naturally occurring human genomic variants that increase the risk of AML (Hatzimichael and Crook 2013, Sille
et al. 2012). Characterizing genomic variant subpopulations and associated risks can help us to better describe human variability and susceptibility for specific
diseases.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 23
-------
Study
ID
Krajinovic(1999)
Gao (2003)
D'AIO (2004)
Gallegos-Arreola (2004)
Joseph (2004)
SeMn (2004)
Ma|Limdar(2008)
Lee (2009)
Yamaguti (2009)
Yamaguti (2010)
Razmkhah (Adult) (2011)
Razmkhah (Childhood) (2011)
Swinney(2011)
Kim (2012)
Overall (l-squared - 42.1%, p - 0.049)

NOTE Weights are from random effects analysis
OR(95%CI)
0.87(0.42,1.79)
1.35(0.73,250)
078(0.39,153)
1.83(1.13,297)
246(1 36,443)
089(0.55,1.43)
1.47(0.77,2.81)
086(052,142)
1.74(1.05,2.87)
136(076,244)
236(1.25,4.48)
1.03(0.45,2.34)
107(0.75,1.52)
109(088,136)
1.26(1.05,1.51)
.223 1 ' 4.48
Figure 8. Meta-analysis for the association of acute leukemia risk with CYP1A1. lle462Val polymorphism is shown
(OR = odds ratio). The overall risk was 42% greater (95% Cl = 1.11-1.98) for Val/Val+Val/lle versus lie/lie (Zhuo et
al. 2012). Reproduced with permission from PLoS One.
In Vitro Evaluation of Toxicogenomic Signatures

1 As has been previously noted, the primary function of the Tier 3 prototypes is to inform how we
2 evaluate data-limited chemicals. Hence, a comparison of in vivo and in vitro benzene results is
3 discussed here. Godderis et al. (2012) conducted an in vitro study in TK6 cells to detect gene
4 signatures and biological pathway perturbations, using global gene expression analysis, resulting
5 from exposure to 15 genotoxic carcinogens, including benzene and its metabolites. The goal was to
6 determine if well-characterized chemicals could be used to characterize data-limited chemicals by
7 comparing gene signatures. Although pathways altered by exposure to benzene and its metabolites
8 were in general agreement with previous in vivo studies, the authors pointed out that several
9 factors can complicate comparison of in vivo and in vitro data, for example, metabolism and cell
10 types. The authors concluded that use of toxicogenomic signatures hold great promise for
11 evaluation of data-limited chemicals. They noted that for the carcinogens in the study, some in vitro
12 processes mapped against known or likely carcinogenic processes, but determining discriminatory
13 mechanisms based on in vitro data alone was difficult This observation suggests that the approach
14 of developing putative mechanisms of action based on data-rich meta-analyses of human disease
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 24
-------
1 and mapping in vitro data against these models might prove more successful than attempting to
2 understand mechanisms of action based on in vitro data alone.

Risk Assessment Implications from the Benzene Prototype

3 The benzene prototype demonstrated how molecular biology data, particularly mechanistic
4 signatures, can be used in hazard identification and exposure-dose-response assessment
5 Hazard Identification - Specifically, genes and pathways altered by benzene exposures are
6 strongly associated with a network of pathways associated with known (AML) and likely
7 (lymphoma) outcomes. Additional evidence for a causal relationship between alterations in specific
8 genes and pathways and increased leukemia risk is provided by observed similarities in pathway
9 disruptions: (1) caused by other chemical leukemogens, (2) observed in leukemia of unknown
10 origins, and (3) reversed by certain leukemia chemotherapeutic agents. Hence, observations from
11 both molecular epidemiology and molecular clinical studies provide evidence that molecular
12 signatures can predict specific diseases with some confidence. These data suggest that well-defined
13 pathway and network disruptions strongly associated with a specific disease could be used to
14 screen chemicals with limited molecular data for their potential to increase risks for the specified
15 disease by causing similar mechanistic disruptions. Anchoring of the molecular patterns to apical
16 outcomes, considerable systems biology knowledge, and high-quality data; however, appear
17 necessary to define the disease signature against which data-limited chemicals could be compared.

18 Exposure-Dose-Response Assessment - A specific exposure-dose-dependent gene signature for
19 leukemia was observed at all environmental exposure concentrations measured (<0.1 to >10 ppm);
20 the magnitude of signature expression varied in a dose-dependent manner. This signature is a
21 biomarker of both exposure and effect. Such signatures or biomarkers can extend the exposure
22 range of traditional epidemiologic studies to lower exposures and reduce measurement error. This
23 type of data can measure low-dose-response relationships and, potentially, mitigate a source of
24 substantial controversy in chemical risk assessment, that is, low-dose extrapolation. In the future,
25 one can envision routine replacement of low-dose extrapolation with measurements of molecular
26 signatures. The established dose-response for specific gene signatures could be used to estimate
27 the potency or relative potency for data-limited chemicals. In particular, ranking of chemicals is
28 feasible when using similar protocols such as those characteristic of Toxicology in the 21st Century
29 (Tox21)orToxCast™.

30 The exposure-response models used in this prototype were not specified in advance, but the choice
31 relied on the best fit from among multiple models. Hence, the model was "agnostic" on the issues of
32 threshold/no threshold and the shape of the low-exposure-response relationship. Such an approach
3 3 would mitigate another source of controversy in risk assessment, that of model choice.

34 Cumulative Risk Assessments - Understanding of a common mechanism of action for multiple
35 environmental factors can allow for improved cumulative risk assessments. It should be noted that
36 overly simplified descriptions of mode of action (MOA) or adverse outcome pathways (AOPs) could
37 miss interactions among the environmental factors as shown in Figure 7a.

3 8 Variability and Susceptibility in Human Response - An example of risk characterization
39 associated with different genetic variations is provided. With additional research and data evolving
40 from personalized medicine, the understanding of population variation and distribution of
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 25
-------
1 responses in the human population could be improved. These data also could help improve
2 estimates of the size of sensitive subpopulations.

3.1.2. Ozone-induced Lung Inflammation and Injury

Use of Ozone as a Model Pollutant

3 Hundreds of controlled human exposure studies have described biological changes in volunteers
4 exposed acutely (usually for 2-6 hours) to concentrations of ozone ranging from 0.06 to 0.4 ppm
5 (EPA 201 Id).10 These studies show that exposure to ozone results in decrements in several indices
6 of lung function, increases in markers of pulmonary inflammation, and alterations in host defenses
7 against inhaled pathogens and lung injury. This database represents the single largest human
8 database of any pollutant EPA has studied. As a consequence and because the mechanisms are well
9 understood, the database provides an ideal opportunity to demonstrate proof of concept for use of
10 molecular biology data to inform assessment of human risks, to develop decision considerations for
11 use of such data, and to explore the value of various types of information.

12 The underpinning of an AOP-based paradigm in risk assessment methodology is the concept of
13 studying biological pathways. The perturbation of a biological pathway initiates a set of key events
14 that cause an adverse outcome associated with an environmental stressor. If such pathway
15 responses are known and represented by a set of quantitative in vitro assays, the results of these
16 assays can be used to build quantitative biological activity relationships. Coupling these results with
17 appropriate physiologically based pharmacokinetic (PBPK) modeling and exposure estimates for
18 estimating tissue doses can be useful for hazard identification and dose-response assessment For
19 in vitro pathway information to be used in risk assessment, the quantitative relationship between
20 perturbation of a pathway following in vitro exposure
21 and downstream endpoints (i.e., pathophysiological
22 changes at the tissue or organism level following in vivo
23 exposure of animals or preferably humans) must be
24 established. This framework is not likely to be possible,
25 however, as sufficient in vivo data are lacking for most of
26 the toxicants that EPA is responsible for regulating
27 (Crump etal. 2010). Therefore, using model systems in
28 which both in vitro and in vivo data are available is
29 necessary to validate how well pathway information
30 from the former can predict human responses to
31 toxicants. Ozone provides such a model system for lung
32 inflammation and injury (see Text Box 6 for a
33 description of inflammation). This model can be used
34 for less well-studied chemicals to identify and
35 characterize their potential to induce lung inflammation
36 and injury. Figure 9 outlines physiological and cellular pathways by which ozone causes
37 pathophysiological changes in humans via the lung response. This prototype focuses on pathways
38 that lead to inflammation, which are shown in the open boxes. Several human studies characterize
39 inflammation at multiple ozone concentrations during and after exposure, providing a rich data set
Box 6. Inflammation
Inflammation is the immune system's response
to damage to cells and organs by pathogens,
chemicals, or physical insult. Initially,
inflammation involves changes in local blood
flow and accumulation of various
inflammatory cells (e.g., neutrophils,
lymphocytes) at the site of injury. Pathogens
and cell debris caused by the inflammatory
response are then removed as tissues begin to
repair. If the delicate balance between
inflammation and resolution of the events
leading to the inflammation is dysregulated, or
tissue insult continues, inflammation can lead
to disease pathology (Wang, I et al. 2012,
Medzhitov 2008).
10The current ozone standard calls for limitation of the fourth highest daily maximal 8-hour ozone
concentration in a year to 0.075 ppm, based on a 3-year average.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 26
-------
1 of human in vivo responses. Additional pathways based on neurological responses to ozone
2 exposure that are not captured in this figure also might be possible.
Inhaled Ozone
Inducible
Antioxidants
Antioxidant
Capacity
C fibers
Epithelial Cells
Primary Molecular Events
]_ (e.g. Ca+2 influx, intracellular ROS
production)
Signaling pathways
Transcription Factor Activation
Macrophages
I Irritant Receptor
Activation
Epithelial Cell
Damage
Inflammatory
mediators
Decreased
Pulmonary
Function
Inflammation
Mucociliary Escalator
Impairment;
Increased Mucin Production
Decreased Host
Defense
Figure 9. Framework diagram of ozone key events and modes of action (MOAs) related to lung injury occurring
in vivo.
Challenges with Using an AOP Approach for Risk Assessment

3 Using model systems based on in vitro pathway information to predict human in vivo responses to
4 toxicants for risk assessment purposes presents certain challenges. A major hurdle relates to
5 extrapolation from in vitro to in vivo effects. Many in vitro approaches use animal cells or
6 transformed cell lines derived from humans, which might not accurately reflect cell interactions or
7 events in the pathway for human in vivo effects. For example, a parent toxicant might be biologically
8 transformed into a more active form by cells that are not represented in the in vitro system (e.g.,
9 liver cells) before interacting with the target cells represented in the assay. In the lungs, epithelial
10 cells that line the human airways are the first and primary targets of inhaled toxicants and are
11 believed to be the cells that initiate lung inflammation. Studies have shown that pathways in the
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 27
-------
1 cultured in vitro cells that have been activated by air pollutants are also altered in these same cells
2 following in vivo exposure to the same pollutant (Selgrade et al. 1995). This ability to show
3 concordance between in vitro and in vivo exposures thus is a major advantage of the modeled lung
4 system discussed here.

5 A second challenge associated with in vitro approaches is ensuring that the dose of toxicant
6 delivered to cultured cells is similar to that which these cells would encounter following an in vivo
7 exposure. Frequently, cultured cells are exposed to toxicant levels that are orders of magnitude
8 greater than they would be in vivo. There is no assurance that the same biological pathways are
9 adversely affected in both situations. Ozone, however, can be prepared using the heavy oxygen
10 isotope (1802), which can be separated from 1602 and quantified by mass spectroscopy. When ozone
11 attacks a target tissue, the 1802 tag is bound to that tissue. This approach has been used to
12 normalize the dose of ozone delivered to rats and humans (Hatch et al. 1994) and to support
13 estimates that target tissue doses in rats exposed to 2.0 ppm ozone are comparable to target tissue
14 doses in humans exposed to 0.4 ppm ozone. This same approach can be used to normalize the dose
15 of ozone delivered to cultured cells and humans.

16 Ozone is one of the few pollutants for which an extensive animal and human health effects database
17 is available. Coupled with in vitro pathway data, this prototype pollutant can be used to illustrate
18 both how a biologically based dose-response modeling approach can be used to provide this
19 framework and how a systems biology model and genomics data can be used for risk assessment

20 In - Young, healthy volunteers were exposed to filtered air and a relevant
21 concentration of ozone (0.30 ppm) previously shown to induce a measurable inflammatory
22 response. Bronchoscopy was used to obtain cells and lung fluid at 1 and 24 hours after exposure. To
23 ensure that pathophysiological effects observed in this study were comparable to those reported in
24 earlier studies, downstream biomarkers of inflammation such as the influx of neutrophils were
25 measured (Devlin et al. 2012), as were markers of cell injury (lactate dehydrogenase) and leakage
26 of plasma components across the damaged epithelial cell barrier (albumin) into the lung airways.
27 Bronchial airway epithelial cells were obtained by brush scraping, and the microarray technology
28 was used to define pathways affected by in vivo ozone exposure. In addition, quantitative
29 proteomics was used to correlate changes in messenger ribonucleic acid (mRNA) measured by
30 microarray with changes in their protein counterparts (see Figure 9, event 3).

31 In - A subset of airway epithelial cells was collected from volunteers following
32 exposure to filtered air and cultured at an air-liquid interface. These cells were exposed to
33 concentrations of ozone that had been shown (from the results of 180s experiments) to be
34 comparable to the dose of ozone encountered by airway epithelial cells following a specified in vivo
35 exposure. This approach allows comparison of an in vitro and in vivo response of cells from the
36 same person for comparable exposures. Similar to the in vivo studies, microarray and proteomics
37 were used to identify and define pathways affected by ozone in these cells.

38 - Upstream signaling events shown in Figure 9, event 2 (e.g., transcription
39 factor activation, MAP kinase pathways, production of reactive oxygen species [ROS]) was assessed
40 to determine the MOA by which ozone activates downstream batteries of pro-inflammatory genes.
41 Pathways that are altered by exposure of cultured airway epithelial cells to ozone can be compared
42 with those altered in airway epithelial cells of the same person exposed in vivo to ozone. A
43 comparison can be made to determine the accuracy of the in vitro system in mimicking events

This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 28
-------
1 following exposure in the in vivo system and to assess differences in the variability of the response.
2 Figure 10 illustrates potential upstream signaling pathways that could be induced by ozone and
3 lead to activation of downstream batteries of pro-inflammatory genes.
Figure 10. Potential pathways by which ozone causes production of pro-inflammatory mediators in
epithelial cells.
4 Microarray technology was used to determine which of these pathways is most likely to be altered
5 by ozone exposure. The two most highly scoring molecular networks following exposure of
6 cultured airway epithelial cells to ozone in vitro are presented in Figure 11. The networks involve
7 modulation of genes in NF-KB and extracellular signal-regulated kinase signaling pathways. The
8 gene list input to the Ingenuity Pathway Analysis was generated by combining all genes found to be
9 differentially expressed immediately following a 2-hour exposure of bronchial epithelial cells to
10 0.25, 0.50, 0.75, and 1.0 ppm ozone or clean air. Exposure-dose was normalized using 180s
11 dosimetry from in vitro and in vivo human studies. Networks are displayed with representative
12 symbols for the protein products of the mRNA transcripts. Red represents putative upregulated
13 transcripts induced by ozone, and green represents putative downregulated transcripts in response
14 to ozone. Additional molecules from the Ingenuity Knowledge Base, which were not present in the
15 differentially expressed gene (DEC) list, are uncolored in the networks. The same putative
16 networks were also identified in epithelial cells removed from human airways 1 hour after in vivo
17 exposure to ozone.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 29
-------
I.'... ...
P0&BE
•' • ; :• L • •
Figure 11. Molecular pathway analysis by Ingenuity Pathway Analysis.
September 2013
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
30
-------
Primary Molecular Events
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
PTP-SOH
(INACTIVE)
Many pollutants induce intracellular oxidative stress, which can affect signaling pathways and
ultimately lead to activation of batteries of pro-inflammatory genes. One pathway by which this
might occur (Figure 12) is activated in cultured human airway epithelial cells exposed to
particulate air pollution. Ozone is an inherently
potent oxidant and is known to cause oxidative
damage to lipids, proteins, and nucleic acids.
Until recently, whether ozone also induced
intracellular ROS was unknown. Figure 13
shows that ozone can induce a rapid dose- and
time-dependent increase in cytosolic
intracellular glutathione redox potential, a
measure of ROS (Gibbs-Flournoy et al. 2013).
Whether the ROS produced following ozone
exposure actually activates downstream
signaling pathways via the mechanism shown in
Figure 12 is unknown.
System Biology Modeling
Protein Tyrosines
PTP-S"
Kinases
Protein Phosphotyrosines
1
Signaling Pathways
\
Figure 12. Role of reactive oxygen species (ROS) in
mediating pollutant-induced inflammation.
Quantitative systems biology models are
translational, and their development is data
driven, with model structure and dynamics
parameterized using data on (1) basic biology,
(2) how the biology is perturbed by toxicants,
and (3) how and when adaptive and adverse responses develop. Sufficiently well-developed and
well-validated models can be used to predict dose-response and time course behaviors for the
perturbations, adaptive responses, and apical health effects, but the accuracy of these predictions
depends on the extent and quality of the data used as inputs and on the technical quality of the
model itself. Time-course and dose-response pathway data from in vitro exposure studies can be
paired with pathway data from in vivo exposure studies and assembled into a nodes-and-edges
graph encompassing mechanisms of action relevant to ozone toxicity, focusing on pathways most
relevant to lung inflammation. This pairing and assembly will provide a framework for modeling
ozone toxicity pathways to downstream pathophysiological changes (see Figure 9, event 3). At the
intracellular level, upstream signaling pathways (e.g., NF-KB) that have been shown to mediate
ozone-induced changes in gene expression will be represented, connecting the oxidative products
of ozone formed in the cell to time-dependent changes in protein activity and RNA expression. For
example, the canonical NF-KB signaling pathway shown in Figure 9 plays a role in ozone-induced
inflammation. Finally, data on ROS production resulting from ozone exposure (see Figure 9, event
1) will be represented in the model, both as an input to ozone's perturbation of the molecular-level
components and as drivers of downstream signaling pathways.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 31
-------
3-1
0) £
•'• 0 ISppm
n 0-25 ppm
<> 0 50 ppm
• Air control
T
10 20 30 40
Time (min)
SB
15
Figure 13. Exposure to ozone induces a rapid increase in 16
intracellular reactive oxygen species (ROS). Addition of 0.1 mM Hjf^
at the end of the ozone exposure produced a maximal response, -t o
which was fully reversible with the addition of 10 mM dithiothreitoj
(DTT), a strong reducing agent (Gibbs-Flournoy et al. 2013).
Reproduced with permission from Environmental Health
Perspectives.
20
21
22
Susceptibility

Not all individuals are equally
responsive to toxicants; some are
much more responsive because of
age, gender, disease, lifestyle (e.g.,
obesity), or genetic/epigenetic
factors. For example, the range of
response in lung function
decrements to ozone in young
healthy individuals (McDonnell et al.
2012) is 10-fold. Individuals exposed
to ozone a second time, many
months later, retain their hierarchy
on the response curve, implying that
a long lasting factor, perhaps genetic
or epigenetic, plays a role in ozone
responsiveness. Asthmatics are
known to have an enhanced
inflammatory response to ozone
(Bosson et al. 2003, Peden et al.
1997), as do individuals carrying the
GSTM1 null allele (Kim etal. 2011).
Understanding the MOA by which a
23 person is more responsive to a pollutant should be a component of a systems biology approach to
24 toxicity testing. Airway epithelial cells can be obtained from more-responsive and less-responsive
25 individuals, and the pathways altered by ozone can be compared for both groups. Recently,
26 cultured lung epithelial cells obtained from individuals carrying the GSTM1 null allele have been
27 shown to be more responsive to air pollutants than cells obtained from individuals carrying the
28 wild-type GSTM1 allele (Wu et al. 2011). Airway epithelial cells obtained from asthmatics appear to
29 retain an asthma phenotype in culture and are more responsive to pollutants than cells obtained
30 from nonasthmatics (Duncan et al. 2012). Readily obtaining bronchial airway cells can be difficult,
31 so knowing that the response of cultured nasal epithelial cells to toxicants has recently been shown
32 to be similar to that of bronchial cells (McDougall et al. 2008) can be instructive. These nasal cells
33 can be readily and noninvasively obtained from most individuals, including children.

Involvement of the Inflammatory Network in Multiple Diseases

34 Chronic inflammation is implicated in the etiology of several diseases, including atherosclerosis,
35 heart disease, obesity, diabetes, arthritis, cancer, and lung diseases (asthma, emphysema,
36 pulmonary fibrosis). Both common and disease-specific inflammatory molecular patterns have
37 been reported to underlie these diseases (Wang, I et al. 2012). Why a particular disease is
38 expressed in an individual or subpopulation as the result of inflammation is likely the result of the
39 site of injury, co-activation of other networks, genetic variation, or environmental factors. Such
40 complicating factors therefore highlight several issues that might arise when using molecular
41 patterns to predict disease risks: (1) observation of an inflammatory disease signature for a
42 chemical that has not been well studied would raise concerns for inflammatory disease risks; (2)
43 the specific inflammatory disease in question likely would be difficult to predict with a limited
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 32
-------
1 systems biology context; (3) a network might be involved in multiple diseases; and (4) the specific
2 disease expressed could involve multiple interactive pathways and networks.

3 Specifically, many air pollutants appear to induce cardiopulmonary inflammation, which likely
4 plays a role in risks for asthma, emphysema, and pulmonary fibrosis. Molecular biology is likely to
5 be a useful tool in sorting out the relative contributions of various air pollutant exposures to
6 cardiopulmonary disease via inflammatory mechanisms.

Risk Assessment Implications Based on the Ozone Prototype

7 Hazard Identification - The pathway information, coupled with data about ozone-induced changes
8 in upstream transcription factors, signaling pathways, and generation of ROS, can lead to the
9 development of molecularly based dose-response system models that are predictive of downstream
10 in vivo pathophysiological changes. These data suggest that ozone activates the NF-KB and ERK
11 pathways, both known to modulate inflammation, in vitro and in vivo. This suggests that the in vitro
12 airway epithelial cell model used here might be amenable to predicting in vivo inflammation. An
13 HTS assay based on this cell model might be able to provide rapid hazard identification in the
14 future.

15 Exposure-Dose-Response Assessment-We did not perform an analysis of transcriptional changes
16 across a range of doses.

17 Cumulative Risk Assessment - This in vitro model could be used to make comparisons of the
18 transcriptional response upstream of the inflammation process using complex mixtures of air
19 pollutants. The comparison, however, might require specialized equipment and monitoring to
20 ensure the mixture and dose of pollutants are proper and well controlled.

21 Variability and Susceptibility in Human Response - In the future, this and other similar models
22 might identify pathways and mechanisms by which susceptible human populations respond to
23 inhaled toxicants. Just as this in vitro model was derived from several young, healthy volunteers,
24 performing a larger study of variability and susceptibility would be possible by recruiting and
25 including specific populations. Such a study also would facilitate the creation of HTS assays for
26 rapidly studying susceptible populations and variability in response.

3.1.3. Benzo[a]pyrene (a Polycyclic Aromatic Hydrocarbon), and Cancer

2 7 PAHs are produced from combustion or pyrolysis of carbon-containing material, exist in the
28 environment almost exclusively as complex mixtures, are a major component of urban air pollution,
29 and are a drinking water contaminant. Several PAH-containing complex mixtures are known to be
30 carcinogenic in humans (e.g., coke oven emissions, diesel exhaust, and tobacco smoke). Many
31 individual PAHs and PAH-containing mixtures have been tested in traditional bioassays; many, but
32 not all, appear carcinogenic. Additionally, those that are carcinogenic vary in terms of potency.
3 3 Given the universe of PAHs and potential PAH-containing mixtures, testing them all is not feasible.
34 Hence, an alternative approach using molecular biology was explored in this prototype. See Text
35 Box 7 for some challenges related to this prototype.

36 This effort focused on one PAH—B[a]P)—and liver cancer. Repeated B[a]P exposure has been
3 7 associated with increased incidences of total tumors and of tumors at the site of exposure (dietary,
38 gavage, inhalation, intratracheal instillation, and dermal and subcutaneous, in studies of numerous

This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 33
-------
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23

24
25
26
27
28
29
strains and species of rodents and
several nonhuman primates).
Distant site tumors also have
been observed after B[a]P
administration by various routes,
and B[a]P is frequently used as a
positive control in
carcinogenicity bioassays.

Systems Biology Model

EPA (2013), Burgoon (2011),
have proposed a cellular systems
model and pathways based on a
systematic meta-analysis of
transcriptomics data for B[a]P-
mediated liver cancer (Figure 14
and Table 2). The core of the
model is focused on induction of
Box 7. Challenges Encountered With This Prototype

This prototype originally focused on identifying whether human
transcriptomics data from PAH mixtures found in cigarette smoke could be
associated with lung cancer. This prototype was envisioned as a real-world
example of how data mining of existing data could be informatively
performed. Unlike the other Tier 3 Prototypes, which were designed to have
the best combination of data available, however, this prototype
encountered numerous data access and experimental design challenges
that we expect to be seen when applying these methods in the future.
These challenges included:

• An inability to easily obtain the raw data required for re-analysis
of the transcriptomics data.
j Lack of clear descriptions of the study design or analysis method.
-> Different microarray platforms being used.
• Different analysis methods being employed within the same
platform.
• Lack of a quantitative exposure estimate (especially common with
human studies that lack a controlled exposure).

Together, these challenges make performing a quantitative meta-analysis
difficult. For new types of data to be useful, improvements to data
collection and concomitant exposure analyses are needed.
DNA adducts, mediation of p53 (a
tumor suppressor gene) signaling, alterations of translesion synthesis,11 and regulation of the Gl/S-
phase transition and the cell cycle. Based on this model, the DNA adducts are believed to be formed
by reactive B[a]P metabolites through cytochrome P450 (GYP) enzyme induction, secondary to
B[a]P activation of the aryl hydrocarbon receptor (AhR). Others have shown AhR-independent DNA
adduct formation, raising questions about other non-CYPlAl- and CYPlA2-mediated B[a]P
metabolism and adduct formation (Sagredo et al. 2006, Kondraganti et al. 2003).

The systematic meta-analysis started with a search for published, peer-reviewed transcriptomics
data sets using B[a]P as the test substance. The Gene Expression Omnibus (GEO) and ArrayExpress
databases were searched for microarray transcriptomic studies using the search terms in Table 3.
The search focused on GEO and ArrayExpress as these databases store submitted data as raw data.
The raw data are critical for performing meta-analyses, especially when different analysis methods
might be used.
^Translesion synthesis is a mechanism that the cell uses to continue DNA replication/synthesis in the
presence of a DNA lesion (e.g., DNA adduct).
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 34
-------
1
2
3
4
5
6
7
8
benzo[a]pyrene-1,2-diol ^^
Benzo[a]pyrene 11,12-oxide
/ .
^HR/ARNT complex
CYPTA2 %
/ ® l^HD^
|4> Cdknla C)
UbiquitinCR T
aselr,
G1/S PhaseTransition
Figure 14. Consensus Outcome Pathway. This consensus pathway was synthesized by combining multiple pathway
diagrams identified through analysis of the two data sets using GeneGo Metacore. The nodes (proteins or
outcomes) are connected by lines. The green lines represent activation, while the red lines represent inhibition or
repression. The thick red arrows near proteins represent increases in gene expression.

The search resulted in the identification of 26 peer-reviewed publications with 40 gene expression
data sets. The adult mouse liver was chosen as the focus system based on the number of studies
available across the species and tissues where B[a]P was used. Only 2 of the 26 publications
focused on in vivo transcriptomic studies of the liver in the mouse. Study GSE24907 is a dose-
response study where five male Muta mice (a LacZ transgenic mouse line) per group were gavaged
with an olive oil vehicle and 25, 50, or 75 mg/kg B[a]P. Study GSE18789 is a time-course study
where 27- to 30-day-old B6C3F1 mice were gavaged with 150 mg/kg B[a]P for 3 days and
sacrificed at 4 or 24 hours after the final dose.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 35
-------
Table 2. Altered Genes/Functions and Their Relationship to Cancer (in this Model)
Altered Gene or
Function
Ah R/ARNT Complex
CYPs
(e.g., CYP1A1, CYP1A2)
NRF2
Ubiquitin
CUL3
p53
MDM2

Cdknla/p21

Cyclin D
CDK4
Gl/S Phase Transition
^^^^^^^^H
Translesion Synthesis

DNAAdduct
Relationship to Cancer in this Model
AhR regulated expression of several CYPs, including CYP1A1 and CYP1A2

Upregulation leads to production of oxidative radicals and B[a]P metabolites

Regulates the expression of oxidative stress-protective genes
Protein that tags other proteins for destruction
Regulates the inhibition of NRF2 signaling with ubiquitin
Stops cell cycle by preventing Gl/S phase transition; activated by DNA damage
Regulates p53 through negative feedback mechanism with ubiquitin
Upregulated by p53 activation; inhibits Cyclin D activation and prevents Gl/S phase
transition
Activates Gl/S phase transition, works with CDK4
Activates Gl/S phase transition, works with Cyclin D
Starts cell cycle progression by allowing for DNA synthesis
DNA damage tolerance mechanism; allows DNA replication fork to progress beyond DNA
damage sites
A piece of DNA covalently bound to a chemical that can modify expression of DNA
19
20
21
22
Table 3. Search Terms and the Number of Studies Retrieved from
the Gene Expression Omnibus (GEO) and Array Express Microarra
Repositories
Search Term
Coal tar
Polycyclic aromatic hydrocarbons or PAHs
Diesel
Smoke (NOT cigarette smoke)
Benzo[a]pyrene or B[a]P
Fuel oil
Cigarette smoke
Tobacco smoke
Number of Microarray
Studies Retrieved
The Systematic Omics
Analysis Review (SOAR)
Tool was used to
document and facilitate
the evaluation of both
studies (McConnell and
Bell 2013). SOAR
consists of 35 objective
questions that help
users determine if a
study contains data of
sufficient quality for use
in a risk assessment
context SOAR was
developed by toxicology
and toxicogenomics
experts, and based, in
large part, on existing
and published data standards such as the Minimum Information About a Microarray Experiment
(MIAME) standard. Both studies (GSE24907 and GSE18789) metthe SOAR screening threshold.
Following a more in-depth scientific review, both studies were found to be of sufficient quality for
use.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 36
-------
1 That DEC lists reported in the peer-reviewed literature are not reproducible across similar studies
2 is well established (Shi et al. 2008, Chuang et al. 2007, Ein-Dor et al. 2005, Losses et al. 2004,
3 Fortunel et al. 2003). In one published example, three different studies aimed at identifying
4 "sternness" genes12 each yielded 230, 283, and 385 active genes, yetthe overlap between them was
5 only 1 gene (Fortunel et al. 2003). Therefore, a pathway-based meta-analysis approach was used,
6 whereby fold change-based ranking, or more formal meta-analyses relying on raw data, along with
7 a standardized analysis approach are considered to be more reproducible than published DEGs
8 (Ramasamy et al. 2 0 0 8, Shi et al. 2 0 0 8, Chuang et al. 2 0 0 7).

9 Both studies were reanalyzed independently at the feature level13 using the same pre-processing,
10 normalization, and analysis methods. GeneGo Metacore was used to identify pathways representing
11 a large number of genes from both data sets.

12 The consensus systems model (Figure 14) was synthesized based on the results from GeneGo
13 Metacore. The model conceptually describes the events that might occur when B[a]P enters the cell.
14 Briefly, B[a]P binds to AhR, leading to upregulation of xenobiotic metabolizing enzymes and Nrf2,
15 which might lead to additional B[a]P metabolism to epoxides and increased oxidative stress.
16 B[a]P-mediated genotoxicity, evidenced by DNA adducts, will occur and will activate p53. Although
17 Nrf2 is upregulated transcriptionally, p53 is expected to interfere with Nrf2 signaling, ensuring a
18 pro-oxidant environment, which might perpetuate further DNA adduct formation. Upregulation of
19 p21 (Cdknla) andMDM2 are most likely a result of p5 3. Upregulation of ubiquitin, while in the
20 presence of p53-mediated MDM2 upregulation, is expected to destabilize p53. Destabilization of
21 p53, in the presence of PCNA, is expected to allow translesion synthesis, which will allow mutations
22 and adducts to perpetuate through DNA synthesis. Upregulation of Cyclin D could be sufficient to
23 overcome p21 inhibitory competition, especially as p53 levels decrease, allowing for Gl/S phase
24 transition to occur. Thus, Gl/S phase transition, combined with translesion synthesis, is expected to
2 5 lead to propagation of mutations and DNA adducts into daughter cells. This loop might continue
2 6 into a feed-forward situation until p5 3 signaling can be reinitiated.
12"Stemness" genes are those genes that are hypothesized to confer stem cell characteristics.
13A common misconception about microarrays is that they measure gene expression at the level of a gene. In
reality, microarrays measure only a portion of a gene, typically anywhere from 20 to 100 nucleotide bases.
This portion of the gene that is actually measured is called a "feature." Typically, only one feature exists per
gene on a microarray. Some genes are represented more than once on a microarray, however, complicating
downstream analyses (e.g., deciding how much a gene is expressed when the two features representing
different parts of the same gene yield different numbers). Features could also be believed to map to a specific
gene at one time, and the feature is later discovered to map to a completely different gene (this happens more
frequently with lesser known or studied genes and lesser known or studied organisms where the genome
might not be available). Thus, the gene associated with a feature can change over time, and most analysts will
re-map their feature sequences against the genome periodically to ensure they have the latest annotation.
This might result in reproducibility issues when comparing to studies performed at different times. Generally,
when interpreting gene expression, analysts prefer to operate at the feature level for all analyses.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 37
-------
1 Using the gene expression changes and activating DNA adduct formation, the Boolean Network
2 systems model (Figure 15-17)14 predicts that cell cycle progression will be activated with
3 translesion synthesis15 (Figure 18). These data and the systems model support the notion that the
4 high doses and acute durations used in the two mouse liver studies might initiate liver tumor
5 progression through a genotoxic MOA, and promotion might occur through a cellular proliferation
6 MOA. Due to the lack of data, speculating whether this system could be activated at low doses in the
7 mouse is not possible. Due to genetic and epigenetic variability and potential species differences,
8 these types of effects might occur at lower doses in humans than in mice.

9 The proposed model, however, provides a testable hypothesis for effects at lower doses, with other
10 species, and other PAHs. For instance, transcriptomic studies with PAH mixtures, or other PAHs
11 individually, can be analyzed to see if they might also impinge on this pathway. Further, the gene
12 expression data from these other studies can be placed into this model, and an analysis can be
13 performed to see how the cell might react, compared to B[a]P. This will give an indication of
14 doses/exposures that could lead to DNA damage, activation of translesion synthesis, and Gl/S-
15 phase transition.

16 Variations in human genetics will alter the susceptibility and population variability with respect to
17 the tumorigenesis or carcinogenesis outcomes. For instance, SNPs are known to occur in p53, which
18 might impact its ability to stop Gl/S phase transition. In addition, the p53 gene has been shown to
19 be mutated in many cancers (Vogelstein et al. 2000). A data mining approach can be taken to
20 identify other relevant SNPs for the genes or proteins in the systems model.
14In a Boolean Network model, the system is represented as a series of connected nodes. Each node
represents a gene/protein, and a connection represents some type of action/inhibition relationship. The
connections are directed. For instance, p21 inhibits Cdk4, so the arrow originates at p21 and terminates at
Cdk4. Some of the relationships are not as direct. For instance, Cyclin D interacts with Cdk4 to activate Gl/S
phase transition; however, in the model, this is best represented as a positive interaction between Cyclin D
and Cdk4 given the relationship between Cdk4, Cyclin D, and p21. Each node has a state, either on (1) or off
(0). Based on the state and the relationship to the other nodes, the Boolean Network can cycle through a
series of states. To test the predicted outcomes (i.e., can the model sustain cell cycle progression and
translesion synthesis once initiated?), this model was further simplified into just the DNA adduct/cellular
proliferation part, and represented as a Boolean Network systems model. Specifically, we are looking for
stable states or attractors—cycles of states that recur and self-perpetuate. States that lead to attractors are
called the basin. The Boolean Network in Figure 15 has a single state attractor defined in Figure 16. This state
can be defined as a cell cycle progression state with translesion synthesis turned on. If the cell were to enter
this system state, it would be expected to self-perpetuate until a stimulus shuts it down. Important to note is
that the systems model does not predict that all cells will enter this state or that this state is the default.
Rather, the model is simply stating that if this state were entered, the cell would remain in this state until a
stimulus occurs that forces it out. Such stimuli might include changes in gene expression, alterations of
metabolic states, or a change in overall energy level. The Boolean Network model predicts that, with DNA
adducts alone, the cell will enter into a five-state attractor (Figure 17). In this cycle, the cell is not predicted to
enter into Gl/S phase transition—which is expected because p53 should effectively shut down that pathway.
Translesion synthesis is predicted to occur in this attractor cycle.
15Translesion synthesis is a mechanism for DNA damage tolerance that allows the DNA replication machinery
to move beyond a DNA lesion or abasic site (i.e., a site that lacks a DNA base).
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 38
-------
G1S Pli
Translesfon Svnlliesis
DNA aJlluct
Figure 15. Liver Carcinogenesis Systems Model. The nodes represent proteins, and the lines are directional
connections meaning activation or inhibition (activation and inhibition are not treated differently in the
graphical depiction of the model). For instance, the arrow from PCNA to translesion synthesis means that PCNA
activates translesion synthesis. The two major outcomes in this model are translesion synthesis and Gl/S
phase transition. The major external input is DNA adduct formation. DNA adducts cause structural damage to
the DNA, which could become or lead to mutations and ultimately tumorigenesis and cancer.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 39
-------
Default State
Single State Attractor
0:Translesion Synthesis
1:PCNA
2:p2l
3: Cdk4
4: Gl/S Phase Transition
5:p53
6:MDM2
7: Ubiquitin
8:DNAadducts
9:CyclinD
Figure 16. Default State, Single State Attractor. The systems model falls into a default state, single state attractor
system. This is the same as the network represented in Figure 15. The names have been replaced by numbers,
which are noted in the figure legend. Red nodes are those that are activated. Blue nodes are inactivated. The
system here has not been perturbed by external forces. Of particular interest is that the "default" state for the
system is one where the cell is actively proliferating and undergoing translesion synthesis.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 40
-------
External Stimuli: DNA Adduct Formation
Default State
Single State Attractor
0: Translesion Synthesis
1:PCNA
2:p21
3:Cdk4
A: Gl/S Phase Transition
S:pS3
6:MDM2
7: Ubiquitin
8: DNA adducts
9:Cyclin D
Figure 17. DNA Adduct Attractor System. When the systems model is perturbed through an external stimulus
(DNA adduct formation), it transitions from the default stable starting state and moves to a new attractor
(depicted in the inset). Once the system moves out of the basin for the default state attractor, it cannot return to
that state without another significant stimulus. This multistability (the fact that a system can have multiple stable
attractor states) is a characteristic of complex systems. Starting at the upper left of the inset, PCNA is activated,
DNA adducts are activated, and p53 is activated. This leads to translesion synthesis and activation of p21, MDM2,
and ubiquitin. Although Cyclin D gets activated, there is no activation of Gl/S phase transition. The system then
transitions to a state where translesion synthesis is primed and ready to go. If Gl/S phase transition were to
occur, p53 is activated, along with DNA adduct formation, MDM2, and ubiquitin. The next system state has
continued p21 activation, loss of p53 activity presumably through ubiquitin and MDM2 activation in the prior
system state, and DNA adduct formation. The system then transitions to only DNA adduct formation and
ubiquitin activation, followed by restarting of the cycle.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 41
-------
External Stimuli: B[a]P-mediated Gene Expression
Default State
Single State Attractor
0: Trjtiblesion Synlhtiis
1:PCNA
2:p21
3:Cdk4
4: Gl/S Phase Transition
5:p53
6:MDM2
7: Ubiquitin
8: DNAadducts
9:Cyclin D
Figure 18. Gene Expression Data Attractor System. This four-system attractor is based on the gene expression data
observed in both studies. This attractor system is notable as it shows DNA adduct formation, translesion synthesis,
and Gl/S phase transition occurring in all system states. This model predicts that DNA adducts and potential
mutations are being passed forward to daughter cells through translesion synthesis as the cell cycle progresses at
these doses and times in the mouse liver. This suggests that B[a]P at these doses and experimental time-points
post exposure in the mouse liver could be an initiator and promoter of tumorigenesis. This adverse outcome
pathway (AOP) might ultimately result in carcinogenesis.
Risk Assessment Implications Based on the B[a]P Prototype

1 Hazard Identification - These data suggest that B [a]P activates known human disease pathways
2 associated with genotoxicity and tumor promotion/cell cycle progression. Similar pathway-based
3 meta-analyses can be performed on transcriptomic data for other chemicals to screen for
4 genotoxicity and tumor promotion, prior to the observation of tumors. For instance, using this
5 specific Boolean systems model would inform risk assessors of the likelihood that other PAHs and
6 PAH mixtures share a similar AOP. This type of chemical screening would need to be further
7 validated with known or likely carcinogens and compared against chemicals that are believed not
8 to be carcinogens (to establish performance of the screening method).

9 Disease-focused system models could be developed for a larger set of complex human diseases to
10 expand the utility of this approach in the future. The pathway-based, diseased-focused, Boolean
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 42
-------
1 systems model approach could be expanded to include emerging data streams, including
2 metabolomics and proteomics, to create overall improvements in mechanistic understanding and
3 hazard identification screens.

4 The genes in these Boolean systems models can be considered as those that might be tested
5 together in a battery of assays to be used in Tox21 screening. HTS assay batteries based on these
6 models can be implemented easily using current multiplex quantitative PCR assay systems.

7 Exposure-Dose-Response Assessment - Analyzing changes in the systems model and potential
8 differences in adverse outcome across a range of doses was precluded due to the lack of sufficient
9 dose-response data. The Boolean systems models approach used here, however, would allow for
10 the prediction of adverse outcomes across a range of doses. In the B[a]P example, we examined the
11 impacts of different scenarios. This same approach would be used to analyze different doses.

12 Cumulative Risk Assessment - Boolean systems models can be used to compare and integrate
13 pathway-based results from multiple chemicals and nonchemical stressors. This approach would
14 enable prediction of hazards from exposure to mixtures or cumulative stressors.

15 Variability and Susceptibility in Human Response - Human susceptibility can be modeled by
16 using data from genome-wide association studies (GWAS), knock-out studies, or knock-down
17 studies. In this instance, modeling the impacts on the adverse outcome predicted by the Boolean
18 systems model is possible. For instance, the impacts of a gene knock-out generally can be modeled
19 in the Boolean systems model as a constant inactivation of the protein.

2 0 Population variability would be modeled using a Monte Carlo simulation to estimate the risk of
21 adverse outcomes across different genetic profiles. This would be accomplished by using the same
22 types of models as in the human susceptibility context. The population variability scenario can be
2 3 considered as creating a population of susceptibility Boolean systems models, where each model
24 has a chance of being included in the overall analysis equal to its occurrence in the human
2 5 population (or equal to its occurrence in a hypothesized human population if performing a what-if
26 type of scenario). For instance, if 15% of the population is expected to have a loss of function
27 polymorphism, the Monte Carlo model should have a 15% chance of choosing that type of Boolean
28 systems model on each random draw from the population.

3.1.4. Risk Assessment Implications across the Tier 3 Prototypes

29 Looking across the Tier 3 prototypes:

30 • Benzene, ozone, and B[a]P displayed human molecular signatures that are strongly associated
31 with specific human disorders and diseases.
32 • This type of molecular mechanistic understanding can be used to screen and predict an
33 association between a chemical and a disease, or to augment the existing weight of evidence
34 for an association between a chemical and a disease.
35 • With sufficient systems biology understanding and data, disease signatures also could be used
36 to screen chemicals with no or limited traditional data for specific disease hazards.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 43
-------
1 • Meta-analyses that integrate pathway-based data across multiple studies yield the greatest
2 evidence that associate chemical exposure to a disease, and are generally the most
3 appropriate method for using transcriptomics data in a risk assessment. A pathway analysis
4 from a single study will yield more evidence to associate a chemical exposure to a disease,
5 and assuming the study design is adequate, might be appropriate for a risk assessment. An
6 analysis built on a set of DEC lists is not reproducible or adequate for risk assessment
7 purposes.
8 • Dynamic disease-based systems models will facilitate the understanding and prediction of
9 chemical-disease associations in the near future. These models provide a nonbiased view of
10 the underlying biology, and can facilitate making pathway-based predictions of adverse
11 outcomes and disease when the interconnections within the pathway become complicated
12 (e.g., the B[a]P case study).
13 • On an individual level, molecular signatures involve dynamic relationships among adaptive
14 and nonadaptive processes that will require additional research to understand fully. At the
15 population level, environmental factors can be thought of as shifting the population or
16 subpopulation distributions toward (e.g., certain chemical exposures) or away from increased
17 levels of risk (e.g., beneficial nutrients).
18 • In vitro responses appear to have commonalities with in vivo responses but also are affected
19 by a number of variables, such as test system, metabolism, cell type, tissue type, time course
20 of events (ozone data only), individual characteristics (intrinsic and extrinsic), and species.16
21 These complexities make the identification of a specific disease hazard from in vitro only data
22 difficult. Systems biology understanding, derived from in vivo data, increases confidence in
23 the interpretation of in vitro data.
24 • For in vitro data, identifying hazards that occur at the organ or organismal level might be
25 difficult. Thus, in vitro studies might be more appropriate for assessing the relative potencies
26 of chemicals to alter biological processes (vs. induce disease) or to predict hazards that occur
27 or are initiated at the tissue level (e.g., generalized inflammatory response). This is
28 particularly true if relative potency is evaluated within a given protocol.

29 • Future research merging GWAS data and personalized medicine into organized data can help
30 better characterize both intrinsic and extrinsic factors that contribute to human variability
31 and susceptibility.

32 • The networks associated with a disease can apparently be disrupted in multiple places, all
33 leading to altered risks of the specific disease. This is shown by mechanistic commonalities
34 among diseases of unknown origins, other chemicals associated with the disease, and
35 chemotherapeutics that can reverse or block components of the disease processes. This type
36 of information can be a useful tool in characterizing cumulative risks. Overly narrow
37 descriptions of mechanisms can miss interactions among environmental factors.
"Although not evaluated here, lifestage is also an important variable (Boekelheide et al. 2012).
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 44
-------
1 • When searching for candidate Tier 3 prototypes, one important observation was that, even
2 among the most well studied chemicals, very few chemicals had the type and quality of data
3 needed for exploring the use of new data types in risk assessment There are needs for
4 systematic review criteria for new data types, adherence to standards of experimental and
5 statistical practices in data generation and analyses, and thoughtful consideration of
6 variability and uncertainty to improve the utility of new data types for risk assessment.

3.2. 2:

7 The intent of the Tier 2 prototypes is to (1) explore new types of computational analyses and
8 short-duration in vivo bioassay that are currently relatively uncommon in risk assessment but hold
9 great promise for the near future; and (2) develop an assessment approach well suited to limited
10 scope risk management decisions. In this case, "limited" generally means regional to local exposure
11 potential, or limited hazard potential, or limited data to conduct more detailed assessments. Tier 2
12 efforts fall between Tier 3 and Tier 1 in terms of resources required and amount of uncertainty in
13 the assessment results. The number of chemicals possibly identified in Tier 1 meriting further
14 testing could overwhelm traditional or Tier 3 type evaluation, thus the need for an intermediate
15 testing and assessment strategy as provided by Tier 2 (Thomas, RS et al. 2013a).

16 The hallmark of Tier 2 data in the NexGen program is integration across biological systems—
17 molecule-to-cell(s)-to-tissue(s) and, in some systems, to-outcome(s)—to inform associations
18 among environmental exposures, causal mechanisms, and outcomes, but generally using
19 evaluations over relatively short time periods (hours to weeks). Tier 2 considers all information
20 available from Tier 1 approaches, such as quantitative structure-activity relationship (QSAR) and
21 HTS, along with other data derived from more complicated test systems that use intact tissues or
22 organisms to provide a higher level of confidence in the assessment (Table 4). Limited scope
23 assessment could include combining HTS with limited traditional data. Tier 2 data are commonly
24 referred to as high-content data.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 45
-------
Table 4. Summary of Tier 2 NexGen Approaches, Including Weight of Evidence, Pros, and Cons
Tier 2: Limited Scope Assessments
Categories of Approaches Considered
Data Mining of
Existing Databases
Approach: Discovers or identifies
associations among
environmental exposures,
omic patterns, and human
disease.
Often uses meta-analyses of
large existing data sets.
Suggests potential adverse
outcomes based on existing
knowledge of other chemical-
induced molecular event and
disease relationships.
Weight of Determined by the quality and
evidence: amount of underlying evidence
(ranges from suggestive to
likely) or is known with
substantial complementary
experimental data.
Pros: Significantly faster and less
expensive than traditional
bioassays.
Can use combined data sets
that include tens of thousands
of humans.
Alternative Species
In Vivo Assays
Experimentally measure dose-
dependent, chemically-
induced alterations in
biological functions in intact
organisms using a range of
specific and sensitive assays.
Measures adverse outcomes
that range from omics to
phenotypic outcomes and
population effects.
Determined by the quality and
quantity of data, but generally
suggestive to likely. Cross-
species issues need
consideration.
Significantly faster and less
expensive than traditional
bioassays.
Can evaluate complex
outcome birth defects and
neurobehavioral outcomes.
Includes tissue, organism, and
including metabolism
Relationships generally
associative; might be causal in
certain circumstances
(depending on data quality
and amount of underlying
evidence).
Data on effects of early life
exposures and effects
generally lacking.
life span-level integration,

Species-to-species
extrapolation is an issue as is
the potential for parent
compound not to be
metabolized to toxicants that
are active in humans.
Omics information can be
derived from organs, tissues,
and multiple cell types versus
only human-based target cells.
Data on effects of early life
exposures and effects
generally lacking; an exception
is the embryonic fish models.
Mammalian Short-duration
In Vivo Assays
Experimentally measure dose-
dependent, chemically-
induced alterations in
biological functions in intact
animals using a range of
specific and sensitive assays.
Measures molecular or cellular
changes; infers potential
adverse outcomes based on
existing knowledge of other
chemical pathway or disease
relationships.

Determined by the quality and
amount of underlying
evidence, ranges from
suggestive to likely when
anchored to pathway and
traditional data and some
understanding of temporal
progression.
Significantly faster and less
expensive than traditional
bioassays.
Includes tissue and organism
integration, including
metabolism.
Measure events early in
disease initiation process;
early events could be
reversible; links to apical
outcome can be an issue.
Omic information is often
derived from multiple cell
types versus only target cells.
Data on effects of early life
exposures and effects
generally lacking.

This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 46
-------
1 Two general approaches to Tier 2 data are discussed here:

2 • High-content knowledge mining (i.e., computer-driven surveys of the literature and large
3 existing data libraries17) to retrieve data and conduct meta-analyses of existing systems
4 biology data to construct mechanism-of-action models and establish associations between
5 environmental exposure and disease. The diabetes/obesity prototype is provided as an
6 example.
7 • Short-term in vivo or in situ exposures of intact organisms to enable incorporation of the
8 intact metabolism in the toxicity evaluation and produce measures of biological change over a
9 short time frame (i.e., ranging from hours to a few months) that are thought to be relevant to
10 longer term outcomes. Two examples are provided using alternative and mammalian species.
11 Considerable work is ongoing at various U.S. Federal Government agencies and elsewhere to
12 refine assays where animals are exposed to chemicals in vivo for periods ranging from hours
13 to a few months.
14 Implications for risk assessment identified by the Tier 2 prototypes are discussed at the end of this
15 section and integrated with other lessons learned in Section 5 "Lessons Learned from Developing
16 the Prototypes."
17 Knowledge mining18 is explored in this prototype as a means to characterize the associative and
18 potentially causal relationships among disease and exposures to environmental factors and
19 intrinsic sources of human variability. The knowledge mining approach capitalizes on huge new
20 databases that are being supplemented with each publication in the field of omics (>50,000 per
21 year). These databases are generally oriented toward the omics of human disease but also include
22 omics information on other species, as well as surveys and clinical assays measuring human
23 exposure and health outcomes. The specific, related diseases explored here are diabetes and
24 obesity and relationships to multiple environmental factors. Diabetes results from environmental
25 and genetic factors and risk varies considerably in the population (Patel et al. 2013). Four
26 interrelated efforts focusing on diabetes/obesity are reported here: (1) Comparison of Knowledge
27 Mining Results and Expert Opinion; (2) Environment-wide Association Studies (EWAS); (3) Itemset
28 Associations between Prediabetes/Diabetes and Chemical Exposures; and (4) Characterizing
29 Human Susceptibility and Population Variability.

of and

30 Thayer et al. (2012) reported on a recent National Toxicology Program (NTP) workshop that
31 examined the possible causal relationships between environmental exposures and diabetes or
32 obesity. At the workshop, results from an extensive information survey were evaluated by experts
17For example, the National Library of Medicine's Gene Expression Omnibus (GEO): a public functional
genomics data repository supporting MIAME-compliant data submissions. Array- and sequence-based data
are accepted. Tools are provided to help users query and download experiments and curated gene expression
profiles.
18Knowledge mining is the computerized extraction of useful, often previously unknown, information from
large databases or data sets using sophisticated data search capabilities and statistical algorithms to discover
patterns and correlations and then interpret this new information in the context of systems biology to create
new knowledge.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 47
-------
1 on the strength of the associations identified. The effort integrated both traditional and new types
2 of data, including approximately 870 findings from more than 200 human studies; and the most
3 useful and relevant endpoints in experimental animals and in vitro assays (e.g., ToxCast™ and
4 Tox21). The environmental factors identified and discussed at the workshop included maternal
5 smoking and nicotine, arsenic, persistent organic pollutants, organotins, phthalates, bisphenol A
6 (BPA), and pesticides. Overall, the workshop results suggest that associations can be made between
7 environmental factors and type 2 diabetes or obesity, but causality was more difficult to assign
8 (Table 5).

Table 5. Summary of Literature Review Findings and Expert Judgments Concerning Causal
Relationships
Chemical/
Environmental
Factor
Outcome
Maternal smoking Childhood
and nicotine obesity
Diabetes
Arsenic
Organochlorine
persistent organic Diabetes
pollutants
Organotins
Bisphenol A (BPA) Diabetes
Association/
Causality
Association, likely
causal
Association
Association
Suggestive of an
Obesity association in animal
and in vitro models
Phthalates
Diabetes or
obesity
Suggestive of an
association
Insufficient data to
assess
Conclusions from
Breakout Group
Likely causal supported by epidemiology data and
animal studies (Behl et al. 2013).
Sufficient support for an association between arsenic
and diabetes in populations with relatively high
exposure levels (> 150 ng arsenic/L in drinking water)
(Maull et al. 2012).
Sufficient for a positive association of some
organochlorine persistent organic pollutants with
type 2 diabetes (Taylor et al. 2013).
Current data from human studies of exposure to
organotins are nonexistent regarding an association
with diabetes or obesity. Recent animal and
mechanistic studies report stimulatory effects of
tributyl tin on adipocyte differentiation (in vitro and
in vivo) and an increased amount of fat tissue (i.e.,
larger epididymal fat pads) in adult animals exposed
to TBT during fetal life. Although the organotin
"obesogen" literature is relatively new, with few
studies, the quality of the existing experimental
studies was considered high by the breakout group
(Thayeretal. 2012).
Overall, this breakout group concluded that the
existing data, primarily based on animal and in vitro
studies, are suggestive of an effect of BPA on glucose
homeostasis, insulin release, cellular signaling in
pancreatic p cells, and adipogenesis (Thayer et al.
2012).
Current data from human studies of exposure to
phthalates provide insufficient evidence of an
association with diabetes or obesity (Thayer et al.
2012).

This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 48
-------
1 Diabetes varies in the population due to both genetic and environmental factors but understanding
2 these interactions has been difficult. Using an Environment-wide Association Study approach, Patel
3 et al. (2012b) investigated the problem of many possible contributing factors by integrating
4 genomic and toxicological data to obtain a candidate list of interacting genes, genetic variants, and
5 environmental factors associated with type 2 diabetes. The method involved three steps. First,
6 genetic and environmental data were summarized from VARIMED (VARiants Informing MEDicine; a
7 genetic association database) and the National Health and Nutrition Examination Survey (NHANES,
8 an environmental exposure and effects database). VARIMED contains data on 11,977 gene variants,
9 9,752 genes, and 2,053 individuals; NHANES includes 261 genotyped loci, 266 environmental
10 factors measured in blood and urine, and clinical measures for the same individuals. They identified
11 several environmental factors that positively or negatively affected risks for type 2 diabetes,
12 including nutrients and persistent organic pollutants. They reported 18 human genetic variations
13 (SNPs) and 5 serum-based environmental factors that interacted in association with type 2
14 diabetes. Thus Patel et al. (2013, 2012b) successfully identified association linking diabetes, genes,
15 gene variants, and environmental factors.

16 This approach demonstrates a knowledge mining method that can be applied broadly to any
17 number of common diseases to identify possible interactions between genetic and environmental
18 factors and risks of disease. In Genetic Variability in Molecular Response to Chemical Exposure, Patel
19 and Cullen (2012) review what has been learned to date with these types of efforts and discuss a
2 0 more comprehensive representation of chemical exposures as the "envirome" and how we might
21 use it to examine the interplay of genetics and the environment.

and

22 We followed up efforts by Thayer etal. (2012) and Patel etal. (2013, 2012b), using two
23 independent frequent itemset mining analyses of the NHANES data. Frequent itemset mining is a
24 data mining approach commonly used in business intelligence to derive marketing and pricing
25 strategies or to identify credit risks. For example, grocery stores use frequent itemset mining to
26 uncover products that are typically purchased together to determine pricing strategies (e.g., a
27 grocer does not want to place items commonly purchased together on sale at the same time and
28 might raise the price of an item commonly purchased with a sale item). Similarly, this technique can
29 be used with the NHANES data to uncover a chemical or group of chemicals that tend to be
30 associated with specific diseases.

31 We focused our analyses on the 2003-2004 NHANES cohort and evaluated associations between
32 diabetes and individual chemicals. We also focused on the 2009-2010 NHANES cohort and
33 evaluated associations among diabetes and a more complex lists of chemicals.20 Both analyses
34 focused on metals.
"This section is adapted largely from Patel et al. (2012b) and (2013) with the assistance of Dr. Patel.
20Both analyses use the Apriori algorithm (Borgelt 2013) to generate "rules" where X > Y is read "X is
associated with outcome Y." Our first study constrained the rule to read "prediabetes/diabetes is associated
with chemical Y," or prediabetes/diabetes > chemical Y. Our second study constrained Apriori to return
prediabetes/diabetes as the outcome.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 49
-------
1 - These results suggest that type 2
2 prediabetes/diabetes is most often associated with lead and cadmium (blood or urine), with a
3 suggestive association with arsenicals. Type 2 prediabetes/diabetes is not associated with cesium
4 and uranium. Table 6 lists the resulting rules21 showing the association or lack of association
5 between diabetes and all of the metals monitored in NHANES.

6 When interpreting lift,22 support,23 and confidence,24 we believe lift is the most informative to start
7 with, followed by the other measures. If a rule has a lift value close to 1, the rule has little predictive
8 value, regardless of the support and confidence. Once an analyst has identified models that are
9 significantly different from random (lifts > 1), the analyst will typically then examine the support
10 and confidence.

11 Support provides an indication of the percentage of people surveyed by the NHANES program that
12 have both type 2 prediabetes/diabetes (the antecedent) and high blood lead, for instance (11% in
13 this case). The support indicates what proportion of the population might be expected to have type
14 2 prediabetes/diabetes and high blood lead, assuming the NHANES sample is a truly representative
15 sample of the U.S. population (in this case 11%).

16 The confidence tells the analyst how strong the rule is. In other words, confidence tells the analyst
17 the percentage of people with type 2 prediabetes/diabetes (the antecedent) that have high blood
18 lead, for instance (34% in this case). This is equivalent to the prevalence of Type 2
19 prediabetes/diabetes in individuals that have a high level of the particular metal, and is a potential
20 indicator of risk.
21Ruleset is a collection of one or more rules used, in this case, to predict association between diabetes and
chemical exposures (Oracle 2013a).
22Lift is a measure of how much better prediction results are using a model than could be obtained by chance
(Oracle 2013b). A lift of 1 means the rule is no better at predicting the outcome than random chance. Thus,
knowing that someone in this NHANES cohort is denned as prediabetic or diabetic provides a 1.44 times
better chance to predict that the person has high blood lead, compared to random. The lift close to 1 provides
no better indication of a person's urine uranium or cesium concentration compared to random guessing
knowing that they are prediabetic or diabetic.
23Support is the percentage of subjects in the entire data set/database that have both the
antecedent/condition and the predicted outcome. This can also be thought of as the number of subjects in the
entire data set/database where the rule is true.
24 Confidence is the percentage of the people who meet the antecedent/condition criteria that also meet the
outcome criteria. For instance, 34% of the people in this NHANES cohort defined as being either prediabetic
or diabetic also have high blood lead.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 50
-------
Table 6. Apriori Rule Associations between Type 2 Prediabetes/Diabetes and Chemical
Exposures.
Antecedent/Condition
Type 2 Prediabetes/Diabetes
Type 2 Prediabetes/Diabetes
Type 2 Prediabetes/Diabetes
Type 2 Prediabetes/Diabetes
Type 2 Prediabetes/Diabetes
Type 2 Prediabetes/Diabetes
Type 2 Prediabetes/Diabetes
Type 2 Prediabetes/Diabetes
Type 2 Prediabetes/Diabetes
Predicted Outcome
High blood lead
High urine cadmium
High blood cadmium
High urine arsenobetaine
High urine lead
High urine total arsenic
High blood total mercury
High urine cesium
High urine uranium
Lift
1.44
1.43
1.26
1.25
1.20
1.18
1.12
1.03
1.01
Support
0.11
0.13
0.09
0.10
0.09
0.09
0.09
0.08
0.07
Confidence
0.34
0.43
0.30
0.33
0.28
0.31
0.30
0.25
0.24
Conclusion
Association
Association
Association
Association
Association
Association
Association
No association
No association
1 Prediabetes/Diabetes and Multiple Chemical Exposures - Table 7 lists the results showing
2 associations between multiple chemicals and prediabetes/diabetes. The rule with the highest lift
3 (1.46 times better than random) is where NHANES subjects had high urine cadmium, high blood
4 lead, and high total urine arsenic. This rule is present in 11% of the 2009-2010 NHANES cohort,
5 suggesting it might be true for 11% of the U.S. population at the time of study, assuming NHANES is
6 a good random sample. Of all the subjects who had high urine cadmium, high blood lead, and high
7 total urine arsenic, 59% also were either prediabetic or diabetic. Not surprisingly, the rule with the
8 next highest lift is the same as the first, except these subjects also had high urine lead levels. This
9 rule had a support of 10% and a confidence of 58%. Overall, this analysis supports strong
10 associations between cadmium, lead, and total urine arsenic and type 2 prediabetes/diabetes due
11 to their frequent occurrence in the top ranked rules. Cesium and cobalt occurred less frequently
12 and would be expected to be less strongly associated.

13 Synthesis of Frequent Itemset Mining Results - Lead and cadmium exposure are highly likely to
14 be associated with type 2 prediabetes/diabetes. High lead levels occurred in 9 of 10 and cadmium
15 in 8 of 10 of the top-ranked rules in Burgeon's data set. Further evidence is provided by the results
16 where blood lead, blood cadmium, and urine cadmium were the highest rated outcomes based on
17 lift. Confirmatory evidence exists that these metals also might be elevated in other diabetic
18 populations (Afridi et al. 2008). Low-dose mixtures of lead, cadmium, and arsenic might induce
19 oxidative stress (Fowler et al. 2004), and evidence suggests that cadmium might induce
20 hyperglycemia in rats (Bell, RRetal. 1990).
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 51
-------
Table 7. Apriori Rule Associations between Type 2 Prediabetes/Diabetes and Exposure to
Multiple Chemicals
Antecedent/Condition
High urine cadmium
High blood lead
High total urine arsenic
High urine cadmium
High urine lead
High blood lead
High total urine arsenic
High urine cadmium
Low urine cobalt
High urine cadmium
High blood lead
High urine cadmium
High urine lead
High blood lead
High urine cadmium
High urine cesium
High blood lead
High urine cadmium
High blood cadmium
High blood lead
High urine lead
High blood lead
High total urine arsenic
High urine cesium
High blood lead
High total urine arsenic
High urine cadmium
High urine lead
High blood cadmium
High blood lead
Predicted Outcome
Support Confidence Conclusion
Type 2 Prediabetes/Diabetes
Type 2 Prediabetes/Diabetes
1.46 0.11
1.44 0.10
Type 2 Prediabetes/Diabetes
Type 2 Prediabetes/Diabetes
Type 2 Prediabetes/Diabetes
Type 2 Prediabetes/Diabetes
1.38 0.15
1.38 0.11
Type 2 Prediabetes/Diabetes
Type 2 Prediabetes/Diabetes
Type 2 Prediabetes/Diabetes
Type 2 Prediabetes/Diabetes
1.37 0.13
1.37 0.12
1.37 0.10
1.37 0.11
0.59 Association
0.58 Association
1.40 0.11 0.56 Association

1.38 0.17 0.56 Association

0.56 Association
0.56 Association
0.55 Association
0.55 Association
0.55 Association
0.55 Association
1 Taking these results together, uranium and cesium are not likely to be associated with type 2
2 prediabetes/diabetes. Whether mercury is likely to be associated with type 2 prediabetes/diabetes
3 remains unclear.

4 Extrapolating these results to the U.S. population suggests that a large proportion (>50%) of the
5 population with elevated lead, cadmium, and arsenic levels might have type 2
6 prediabetes/diabetes. Although these data are not sufficient to demonstrate causality, they do
7 suggest that mixtures of these metals are associated with type 2 prediabetes/diabetes. Possible
8 explanations include (1) the mixture of these chemicals might cause type 2 prediabetes/diabetes;
9 (2) prediabetic/diabetic phenotypes might alter the absorption, distribution, metabolism, and
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 52
-------
1 excretion of these metals; or (3) only one of these chemicals might be associated with type 2
2 prediabetes/diabetes, while the absorption, distribution, metabolism, and excretion properties of
3 the other chemicals are impacted by the first Evidence exists that the three metals work together to
4 induce oxidative stress, and cadmium itself might induce hyperglycemia in rats. These results
5 suggest that further studies should be conducted to ascertain potential causality.

6 Further, these results demonstrate that Frequent Itemset Mining yields fruitful results and
7 hypotheses that can be used to identify associations between chemical body burdens and potential
8 disease endpoints. The results also illustrate ways that data mining methods developed for other
9 fields can be implemented to identify predictive biomarkers of exposure and health outcomes.

10 Risk managers can begin to identify populations with genetic susceptibility to Type 2
11 prediabetes/diabetes in their communities by combining the frequent itemset mining results above
12 with data mining of human genetic variability data, health outcomes, and an understanding of
13 disease processes and chemical MOAs. Combining this information with census demographic data,
14 geographic information systems, and exposure models will further drive the possibilities of
15 pinpointing specific geographic susceptible populations. In this prototype, we identify a potentially
16 susceptible population to Type 2 prediabetes/diabetes by combining the cadmium-disease
17 association, known gene-disease associations, and knowledge of risk allele frequencies in human
18 ethnic groups.

19 Recently, a combination of EWAS and GWAS was performed that examined potential interactions
20 between SNPs (i.e., a mutation of a single nucleotide within the DNAof a gene sequence),
21 environmental chemical levels in blood and urine, and health outcomes—specifically type 2
22 diabetes—using data from NHANES (Patel et al. 2013). Although support for genotype and chemical
23 interactions was limited, interesting interactions were noted between the nonsynonymous coding
24 SNP rs!3266634 in the SLC30A8 gene and cis- and trans-beta-carotene and gamma-tocopherol.

25 The SNP rs!3266634 has been noted as being associated with type 2 diabetes previously (Rung et
26 al. 2009, Takeuchi et al. 2009, Timpson etal. 2009, Pare etal. 2008, Diabetes Genetics Initiative of
27 Broad Institute of Harvard etal. 2007, Scott etal. 2007, Sladeketal. 2007, Steinthorsdottir et al.
28 2007, Zeggini et al. 2007). SLC30A8 is a zinc transporter found in the pancreatic beta-cell secretory
29 vesicles. Zinc has been associated with insulin biosynthesis (Emdin et al. 1980), and chronic
30 decreased zinc intake has been associated with an increased risk of diabetes (Miao etal. 2013). The
31 risk allele in rs!3266634 is C (Sladek et al. 2007), while the minor allele is T (NCBI 2012b).

32 Risk managers might find the genotype and allele frequency data in dbSNP to be helpful in
33 understanding population variance and to help identify susceptible populations. For instance, from
34 a random sample of 100 individuals of Mexican descent in Los Angeles, 66% were homozygous for
35 the risk allele, 30% were heterozygous, and 4% were homozygous for the nonrisk allele (NCBI
36 2012b). If we assume the sampling is representative of the entire population of Mexican-descended
37 residents of Los Angeles, then approximately 66% of these individuals might be at an increased risk
38 of developing diabetes, independent of their body mass index (OMIM 2012). Heterozygous
39 individuals (30% of the population) might also carry some risk and might be more affected by their
40 zinc intake (i.e., increased zinc intake might be protective). Likewise, the heterozygous individuals
41 might be more sensitive to other metals, chemicals, or dietary factors that might compete with zinc
42 for absorption, or they might be more sensitive to chemicals that could interfere with zinc

This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 53
-------
1 metabolism, transport, and insulin biosynthesis. Given the high rate of zinc deficiency in Mexican
2 children that is not correlated with socioeconomic status, finding zinc deficiency in children of
3 Mexican descent living in Los Angeles might not be surprising, especially if diet plays a significant
4 role in the deficiency (Morales-Ruan Mdel et al. 2012).

5 Cadmium exposure will be of concern to individuals who are homozygous or heterozygous for the
6 risk allele. Cadmium has been shown to compete with zinc transporters and might lead to beta-cell
7 dysfunction, lack of insulin production, and ultimately hyperglycemia (El Muayed et al. 2012).
8 Individuals with the rs!3266634 risk allele could be more sensitive to cadmium exposures than the
9 rest of the population.

10 Through database mining and an understanding of the allele disease pathway and a chemical's
11 adverse outcome pathway, we can identify potentially susceptible populations more easily. This
12 example could be extended by examining cadmium exposure data for the Los Angeles area and
13 using a geographic information systems approach with census data to identify potentially
14 susceptible individuals, based on the allele probabilities. This type of predictive modeling could
15 help advance more targeted community-level responses in the future.

3.2.2. In -

16 Alternative species (i.e., nonmammalian species) provide in vivo models for identifying hazards,
17 integrating dose-response effects, and understanding pathways and apical effects useful for
18 assessing chemical risks to humans and to other species. The shorter life spans of alternative
19 species enable the evaluation of toxicity over the full life span of the intact organism, facilitating
20 study of the entire etiology of disease from the MIE to apical outcomes, including more complex
21 phenomena such as birth defects or neurobehavioral impairment

22 Alternative species studies are playing a progressively more integral role in chemical testing,
23 hazard identification, and dose-response assessment for both human and nonhuman species (ECHA
24 2013b, Perkins et al. 2013, EPA 2012d, EC 2011, Schug et al. 2011, OECD 2004). Both the European
25 Chemicals Agency (ECHA) and EPA use alternative species tests as part of required tests for
26 endocrine disrupters (EPA 2012e, 2009a). Alternative species studies can be used for prioritization
27 and screening or as the basis for Tier 2 type assessments.25

2 to

28 Endocrine disrupters are chemicals that interfere with the body's endocrine system and produce
29 adverse effects in both humans and wildlife. In a state-of-the-science review, the World Health
30 Organization (WHO) concluded that thyroid disruption-associated neurobehavioral disorders are
25The types of alternative or nonmammalian species can vary widely. Considerable work in toxicology has
been done with fish, but work in very simple organisms such as yeast also provides insight into cellular
regulation at multiple levels that control core biological processes and enable cells to respond to genetic and
environmental changes (Yeung et al. 2011). Zhu et al. developed a network reconstruction approach that
simultaneously integrates different types of data, and constructs a probabilistic causal network representing
complex cell regulation: endogenous metabolite concentration, RNA expression, DNA variation, DNA-protein
binding, protein-metabolite interaction, and protein-protein interaction data. Causal regulators of the
resulting network were identified and provide insight into the mechanisms by which variations in network
interactions affect gene expression and metabolite concentrations (Zhu et al. 2012).
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 54
-------
1 occurring in children, and the incidence of these disorders has increased in recent decades (WHO
2 2012). Normal thyroid function is essential for normal brain development, particularly during
3 pregnancy and after birth. Additionally, thyroid hormones are crucial to inner ear and bone
4 development, and bone remodeling and physiological functions such as metabolism (De Coster and
5 van Larebeke 2012). Internationally agreed-upon and validated test methods for identification of
6 endocrine disrupters capture a limited range of the known endocrine disrupting effects (Miller, MD
7 et al. 2009). In its state-of-the-science review, WHO advised that existing testing protocols do not
8 characterize completely all essential functions and that adverse effects "are being overlooked"
9 (WHO 2012). Identifying environmental factors that might disrupt normal thyroid function and
10 impact public environmental health is needed, given the key role that thyroid hormone plays for
11 normal development and physiologic functions in all vertebrates (Woodruff and Sutton 2011,
12 Miller, MDetal. 2009).

13 In regulating development, the role of the thyroid hormone is of particular toxicological interest
14 because thyroid hormone-dependent post-embryonic development is a common feature of
15 vertebrate ontogeny (Paris and Laudet 2008). This period of development is typically characterized
16 by transient elevations of thyroid hormone that elicit species-specific physiological and
17 morphogenetic responses with lasting developmental consequences. Transitions from tadpoles to
18 juvenile frogs and body plan reorganization in flatfish are two nonmammalian examples of thyroid
19 hormone-controlled events. Human and vertebrate post-embryonic neurodevelopment is thyroid
2 0 hormone-dependent and deviations from normal thyroid hormone concentrations at critical times
21 are associated with a variety of neurological defects and deficits (Zoeller et al. 2002). The timing (or
2 2 window) of exposure is critical as the impact of thyroid hormones changes as the brain develops
23 (Zoeller and Rovet 2004). Thyroid hormone regulation is generally essential for normal
24 development in vertebrates, thereby establishing the basis for cross-species extrapolation of
25 developmental risks. Several methods using alternative species have been proposed to measure
26 these outcomes for thyroid pathways (Makris et al. 2011, Nichols et al. 2011).

27 A key factor in thyroid hormone-related risk assessment is the ability to examine hormone
2 8 disruption and the resultant developmental disruption at higher levels of tissue organization.
29 Results from omics technologies and other thyroid hormone toxicity assessments, such as EPA's
30 ToxCast™ chemical screening efforts (EPA 2008), can be linked to adverse outcome data from
31 alternative species studies. Two examples are:

32 1. The construction of regulatory networks using time series data in genotyped populations
33 and integration of multiple data types (i.e., endogenous metabolite concentrations, RNA
34 expression, DNA variation, DNA-protein binding).
35 2. If a chemical is identified as a potential developmental disrupter in HTS, more information
36 on in vivo effects might be required to establish dose-response relationships, windows of
37 susceptibility, potential impacts of maternal exposure on progeny, and existence of subtle
38 impacts on behavior, learning, and memory.

and

39 As discussed throughout this document, understanding of systems biology and the events leading to
40 an adverse effect are central features for the use of molecular biology data in risk assessment
41 Pathway analyses are useful to inform extrapolation across species and to aid in characterizing the
42 variability within populations through identifying and describing both MIEs and key biological
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 55
-------
1 events leading to adverse outcomes. They can also help identify how human-focused screening data
2 can inform ecological risk assessment. Although making quantitative predictions of disease risks
3 based on today's system biology or adverse outcome models is often very difficult, pathway
4 assessments are critical to advancing dose-response assessment.

5 Figure 19 illustrates an example for thyroid hormone disruption. Disruption of the thyroid
6 pathways can occur by thyroid peroxidase inhibition, iodine uptake (sodium iodide symporter)
7 inhibition, enhanced phase II metabolism (glucuronosyltransferases or sulfotransferases) via
8 alterations in specific genes (CAR/PRX [constitutive androstane receptor/prename x receptor]) or
9 receptors (AhR), enhanced cellular transport, deiodinase inhibition, and interference with thyroid
10 receptor function. In humans, this leads to birth defects, decreased IQ, and metabolic disorders. In
11 rats, increased TSH leads to thyroid hyperplasia and cancer.
Organism
response
* T4 & T3
Synthesis

Catabolism

t Cellular
Transporter;

t Biliary .
Elimination
I
* »
4-Serum
T3
&
T4

In situ
detection
of T4 in
Zebrafish
embryo b
T40T3
Conversion
Xenopus trop/calis
plasma TSH c
TTSH
•»
Thyroid
Hyperplasia
»
Thyroid
Tumors
•>
Death
»
Population
reduction
Pathway 1
Xenopus laevis
developmental defects
Pathway 2
Zebrafish embryo
TH levels e
I
Xenopus and Zebrafish
TH/bZIP-eGFP a
Figure 19. Major pathways involved in thyroid disruption with example toxicants and alternative models applicable
to both human and ecological hazard assessment (Perkins et al. 2013). Reproduced with permission from
Environmental Health Perspectives.26
26 The thick black outlined box indicates the critical event of serum level concentrations of thyroid hormones.
Pathway 1: rat pathway leading to tumors via thyroid hyperplasia. Pathway 2: principle pathway of concern
affecting humans. Abbreviations: IQ, intelligence quotient; 4-MC, 4-methylbenzylidene camphor; OMC, octyl
methoxycinnamate; Ts, triiodothyronine; T4, thyroxine; TR, thyroid receptor. "Quantification of plasma TSH
levels in Xenopus tropicalis. ^Direct quantification of intrafollicular concentrations of T4 in zebrafish
embryos. cDetection of developmental defects with X. /oev/smetamorphosis assay. Detection
developmental defects using zebrafish embryos. eReporter gene (eGFP) detection of TR activity.
of
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 56
-------
1 Understanding causal mechanisms and their relationships to adverse outcomes provides insights
2 into both hazard identification and dose-response assessment. Although quantitatively predicting
3 human disease risks based on knowledge of causal mechanisms is currently difficult, several
4 approaches using alternative species data provide information on the potency of chemicals that
5 cause effects: biomarkers of exposure and effect, relative potency to induce adverse effects, and
6 understanding of the complexities of dose-response relationships.

7 Key events in the perturbed pathway can be represented with biomarkers of exposure and effect. In
8 situations where considerable systems biology information links the event to outcomes, a
9 biomarker might provide a measure of hazard for risk assessment In the thyroid disruption
10 example, upstream events converge on serum levels of the thyroid hormones, triiodothyronine (T3)
11 and thyroxine (T4), and downstream events occur in peripheral tissues where a significant degree
12 of species-specific effects are seen (Figure 20). As a result, serum T4 levels can be used as a
13 biomarker of thyroid function across species. In the laboratory, researchers use T4 and thyroid
14 stimulating hormone levels in fish and frogs to assess the thyroid disrupting potential of chemicals
15 (Tietge et al. 2013, Thienpont et al. 2011). To assess human exposures, the Centers for Disease
16 Control and Prevention (CDC) has used decreased serum levels of T4 (noted as key event in Figure
17 20) and increased levels of TSH measured in the U.S. population to infer increased potential risks
18 for thyroid dysfunction-related disorders at low levels of perchlorate exposures (Lau et al. 2013,
19 Blountetal. 2007).
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 57
-------
B
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
^BRAIN SUBMODEL^
' TRH,,
PREDICTED TSH VS. 7.S7/ MEASUREMENTS
TSH/t)
TH SUBMODELS
Figure 20. Dose-response relationships. Within species, significant advances are being made in quantitative
systems biology modeling (Eisenberg et al. 2008). Panel A: Overall feedback control system model of thyroid
hormone regulation with three source organ blocks (hypothalamus [HYP], anterior pituitary [ANT PIT], and thyroid
glands [THYROID]); three sink blocks—for TRH, TSH, and T3 and T4 distribution; and elimination (elimination =
metabolism and excretion) (D&E). TRH = thyrotropin-releasing hormone; TSH = thyroid-stimulating hormone; T3 =
triiodothyronine; T4 = thyroxine; SR = secretion rate; p = plasma or portal plasma for TRH-related components; DA
= dopamine; SRIH = somatastatin. Panel B: Feedback control system (FBCS) model validation study results.
Predicted normal circadian TSH versus independent TSH data (not used in fitting the FBCS model) (triangles and
diamonds represent data from Sarapura et al. (2002), circles represent data from Samuels et al. (1994). Also shown
(squares) are the mean TSH data from the larger database used to fit the FBCS model of Blakesley et al. (2004).
Reproduced with permission from Mary Ann Liebert, Inc.
Relative Potency

Identification of pathways and assays impacted by chemicals of known toxicity can be useful in
initial prioritization of many compounds. These can be identified through development of
predictive models built on relationships between in vitro ToxCast™ assay results and in vivo effects
caused by known developmental toxicants (Sipes et al. 2011). A chemical's potency can be further
refined using focused in vivo tests with alternative species to provide informative dose-response
data and exposure window relationships. Alternative species provide easily manipulated model
systems that can detect effects caused by mechanisms not assessed by in vitro HTS. For example,
zebrafish were used as a screening model to assess the 309 EPA ToxCast™ Phase I chemicals for
developmental toxicity to both humans and ecological species. In fish embryos or larvae, 191 (62%)
chemicals were toxic (death or malformations) to the developing zebrafish. Both toxicity incidence
and potency were correlated with chemical class and hydrophobicity. As an integrated model of the
developing vertebrate, the zebrafish embryo screen provides information relative to overt and
organismal toxicity. In 12 classes of chemicals, 100% of the chemicals induced developmental
toxicity, 4 classes of which induced developmental toxicity with an average concentration at 50% of
the maximum level (ACso)27 below 4 [J.M. Translating such results directly into a dose-response for
human risks is difficult, but results of Padilla et al. (2012) show that alternative species can be used
27 In high-throughput screening (HTS) assay, ACso is the concentration at which an assay is inhibited or
activated by 50% when compared to control values. This value is useful in comparing assay results.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 58
-------
1 to build relative rankings of chemicals based on their potency to cause adverse effect Such rankings
2 can be used for ranking and prioritizing chemicals or classes of chemicals for additional evaluation.

3 Chemical dose-response relationships characterized in one alternative species might be
4 extrapolated to other species or to humans, using a pathway-based approach (Perkins et al. 2013).
5 Because many biological functions and disease pathways are conserved across species, similarity of
6 genes encoding those pathways can support direct comparisons of pathway or genomic effects
7 between species. Where pathways are highly conserved between species, this information can be
8 used to extrapolate dose-response relationships in alternative species to analogous relationships in
9 mammals. For example, pathways in the hypothalamus-pituitary-gonad axis are highly conserved
10 among vertebrates, which been used to show that chemical effects in fathead minnows are
11 predictive of endocrine disrupting effects in rats (Ankley, G. T. and Gray 2013).

12 Pathway effects defined through gene expression changes can be used to define a benchmark dose
13 or sensitivity of an animal to a chemical (Thomas, RS etal. 2011). Benchmark concentrations
14 derived from aqueous exposures of alternative species can be translated to oral equivalents in
15 other species, such as humans, by applying a dose scaling factor composed of a simple reverse
16 toxicokinetics approach to estimate the blood dose and amount of metabolism in the target species
17 (Wetmore et al. 2012). Using this approach, chemical concentration effects can be translated from
18 alternative species to mammalian species. See Figure 20 for an example of how systems biology can
19 inform dose-response extrapolation within species.

2 0 However, this type of an approach has added uncertainty, and may generally increase uncertainty
21 to an unacceptable level, which precludes the calculation of a reference value, including a
22 provisional value. There is uncertainty with respect to defining a benchmark dose based on gene
23 expression changes and with respect to the pathway context and interpretation. For instance,
24 changes in gene expression do not directly translate into changes in protein expression or activity.
25 In addition, it is well known that signaling and metabolic pathways within the cell are intersecting
26 and inter-related. There is uncertainty with respect to the dose-response changes at particular key
27 events and how downstream key events may be altered by other intervening pathways. Thus,
2 8 calculating a benchmark dose for a pathway itself is fraught with challenges and additional
29 uncertainty that current reference value approaches do not take into account. In all likelihood,
30 accounting for these additional sources of uncertainty may require new uncertainty factors to be
31 developed, and increases the likelihood that an unacceptable level of uncertainty may be
32 encountered.

33 Thus, it is more likely that, until better, less uncertain methods and techniques are developed and
34 used, pathway-based effects based on gene expression are more suitable for screening level
3 5 decisions and less suitable for reference value derivation.

36 For most species, qualitative predictions are likely to be tenable based on hypothalamus-pituitary-
37 thyroid (HPT) dependent path ways, that is, iodine uptake. Altered iodine uptake hinders
38 development, but the most sensitive outcome indicator might be different. In rats, thyroid hormone
39 disruption can lead to thyroid tumor development (Hurley 1998), while in frogs, metamorphosis is
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 59
-------
1 disrupted (Degitz et al. 2005). Quantitative predictions might not be feasible for many species due
2 to limited data on downstream endpoints and key events (Perkins et al. 2013).

3 Normal thyroid hormone-dependent post-embryonic development requires coordinated spatial-
4 temporal control of thyroid hormone activity. Such activity is regulated not only through the
5 classical features of the HPT axis, but also through peripheral mechanisms external to the
6 hypothalamus, pituitary, and thyroid tissues, such as differential regulation of deiodinase activity,
7 hepatic metabolism and excretion of thyroid hormones, thyroid hormone receptor regulation, and
8 transmembrane thyroid hormone transport. Of these major controlling processes, the mechanisms
9 of the central HPT axis are considered to be generally conserved across vertebrate species and
10 useful for comparative efforts; however, those of the peripheral tissues are typically more divergent
11 and must be used with care in cross-species analysis.

12 Understanding the variation of an individual relative to population variation can be key to
13 identifying an adverse effect on a population. Polymorphisms affecting drug responses can vary
14 widely in populations. In humans, 20-25% of prescription drugs are metabolized in the liver by
15 cytochrome P450 CYP2D6 where variants confer widely different rates of drug metabolism, such
16 that some people might respond with an onset of toxicity while others fail to experience efficacy
17 (Ingelman-Sundberg 2005). Variants causing unanticipated results can comprise a significant
18 portion of a population and that distribution can vary widely across populations (Sistonen et al.
19 2007, Ingelman-Sundberg 2005, Andersen etal. 2002, Wooding etal. 2002). Understanding the
20 variation in adverse responses across a diverse testing population helps reduce the uncertainty of
21 extrapolating laboratory data to real populations. Differential response to chemicals is an important
2 2 consideration in ecological risk assessment where not only potentially sensitive subpopulations
2 3 might exist, but also sensitive species.

24 Approaches have been developed to incorporate population diversity into toxicity testing through
2 5 the use of large collections of different genetic lines of mice or cell cultures derived from them
26 (O'Shea etal. 2011, Rusynetal. 2010, Harrill etal. 2009). Alternative species could be especially
27 useful for incorporating population variability into toxicity testing. The diversity in laboratory lines
2 8 and outbred populations of fish can be high, especially if populations are collected from different
29 areas impacted by pollutants (Williams and Oleksiak 2011, Guryev etal. 2006). Divergent lines of
30 zebrafish can be used to examine variation in responses to chemicals in addition to determining
31 possible genetic factors influencing adverse effects. Using this approach, Waits and Nebert (2011)
3 2 crossed zebrafish lines displaying different levels of sensitivity to developmental cardiotoxicity
33 caused by 3,3',4,4',5-pentachlorobiphenyl. The crosses were used in genome-wide quantitative trait
34 loci mapping to identify several genes in addition to the AhR that contribute to the gene-gene and
3 5 gene-environment interactions that drive developmental toxicity of dioxins and dioxin-like
36 chemicals.

3 7 Although genetic diversity can be incorporated into testing using a panel of genetically inbred lines,
38 unexpected results can occur. In a study comparing the responses of 19 inbred to 20 outbred
39 zebrafish lines, Brown et al. (2011) found that effects of the endocrine disrupting chemical
40 clotrimazole were dramatically different Clotrimazole acts by inhibiting P450 activities involved in
41 steroidogenesis production in fish. In inbred fish lines, 11-ketotestosterone production via
42 steroidogenesis was significantly inhibited. In contrast, outbred lines responded with Leydig cell
43 proliferation in testes and normal plasma concentrations of 11-ketotestosterone indicating that the

This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 60
-------
1 outbred lines could compensate for inhibition by clotrimazole. Here, inbreeding had a strong
2 impact on the diversity and type of response to the endocrine disrupter. Ultimately, the
3 combination of in vivo and in vitro data should enable development of a weight-of-evidence case as
4 to the toxicity caused by the chemical and whether potential human health effects are likely.

Cumulative Risks

5 As has been described elsewhere in this document, correct identification of causal perturbations
6 that lead to adverse outcomes will enable determination of which environmental factors are likely
7 to contribute to the cumulative risk for specific outcomes and which are not Additionally, testing of
8 combinations of chemicals can perhaps be conducted most efficiently in alternative species. For
9 example, alterations in neurosensory functions and intrafollicular thyroxine content of zebrafish
10 exposed to potential disrupters have proven to be useful tools for evaluating multiple chemicals
11 (Raldua et al. 2012, Thienpont et al. 2011, Froehlicher et al. 2009), as has the zebrafish
12 developmental assay (Padilla etal. 2012).

3.2.3. Short-Term In Vivo Models - Mammalian Species

13 The use of new short-term in vivo exposure mammalian bioassays to support Tier 2 assessments
14 are described here. The prototype is drawn from research described in papers by Thomas R.S. et al.
15 (2011)and discussed further in Thomas R.S. etal (2013a, 2013b). The goal of this research was to
16 describe what would be required for the application of short-term in vivo transcriptomic assays in
17 predicting chemical toxicity.

Hazard Identification

18 Short-term in vivo transcriptomic assays provide the metabolic capability and systems-level
19 integration of whole animal studies with a more rapid assessment of response to chemical
20 treatment based on molecular-level data. As more data from short-term in vivo transcriptomic
21 studies become publicly available, as study designs become standardized, and as gene expression
22 patterns and network perturbations are identified, the ability to predict chemical toxicity
23 comparable to longer term assays is expected to
24 increase. See Text Box 8 for more about the
25 transcriptome.
Box 8. What is the Transcriptome?
26 For hazard identification, a host of previous studies
27 has demonstrated that transcriptomic signatures
28 from short-term in vivo studies can be used to predict
29 both subchronic and chronic toxic responses. A
30 transcriptomic "signature" is typically defined as a
31 subset of genes for which the qualitative or
3 2 quantitative expression pattern can be used to predict
33 an in vivo adverse response with a defined accuracy.
Ribonucleic acid (RNA) is the functional outcome
of deoxyribonucleic acid (DNA) transcription by
transcription factors. Researchers can study the
transcriptome—the set of all RNA molecules in a
;iven cell—and determine gene expression
patterns, or signatures. Specifically, short-term
transcriptomic assays in mammalian and
alternative species enable observations of the
effects of chemical exposure across multiple
tissues.
34 To develop a broad-based repertoire of gene expression signatures for hazard prediction, several
35 factors should be considered. First, the number of endpoints included should be sufficient to allow a
36 comprehensive prediction of toxicological hazard. Previous studies that have used gene expression
37 microarray analysis following short-term exposures of chemicals have been limited in the breadth
38 of endpoints examined. These endpoints include the prediction of rat liver tumors (Fielden et al.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 61
-------
1 2011, Uehara et al. 2011, Auerbach et al. 2010, Ellinger-Ziegelbauer et al. 2008, Fielden et al. 2008,
2 Fielden etal. 2007, Nie etal. 2006), mouse lung tumors (Thomas, RS etal. 2009), and rat renal
3 tubular toxicity (Fielden et al. 2005). One strategy that could be employed would be the selection of
4 a battery of tissues, which would include those most frequently positive in rodent cancer bioassays
5 (i.e., liver, lung, mammary gland, stomach, vascular system, kidney, hematopoietic system, and
6 urinary bladder) and tissues commonly affected by noncancer disease. In a previous analysis, these
7 eight tissues cover 92 and 82% of all mouse and rat carcinogens, respectively (Gold et al. 2001).
8 Additional tissues also would need to be added for developmental and reproductive effects, which
9 could include the developing fetus and gonadal tissue.

10 Second, the number of positive and negative chemicals for each endpoint in the studies would need
11 to be sufficient, and the chemical diversity must match the diversity in the chemicals that will
12 ultimately be predicted. For complex toxicological responses such as tumor formation, a previous
13 study estimated that at least 25 chemicals were necessary (Thomas, RS et al. 2009). Third, selection
14 of the time point to perform the gene expression analysis is also a consideration. The time point
15 selection is a balance between cost (i.e., the shorter the time point, the less expensive the study)
16 and a more stable gene expression signature. Among the previous efforts, certain studies relied on
17 much shorter time points (e.g., 5 days), but tended to increase the dose beyond that which would be
18 tolerated in a chronic bioassay (Fielden et al. 2007). Other studies used the same doses as those in
19 the chronic bioassay, butused exposures longer than 5 days (Thomas, RS et al. 2009). In one study
2 0 that examined the effect of exposure duration, the overall conclusion was that increasing exposure
21 duration increased the predictive performance of the gene expression signatures for genotoxicants
22 (Auerbachetal. 2010).

23 As described by Thomas R.S. etal. (2013b, 2012, 2011, 2007), short-term in vivo transcriptomic
24 assays have also been applied to dose-response assessment In a NexGen collaborative effort
25 between EPA and the Hamner Institute, female B6C3F1 mice were exposed to multiple
26 concentrations of five chemicals that were positive for lung or liver tumor formation in a two-year
27 rodent cancer bioassay (Thomas, RS etal. 2012, Thomas, RS etal. 2011). Histological and organ
28 weight changes were evaluated and gene expression microarray analysis was performed on the
29 liver or lung tissues. The histological changes, organ weight changes, and tumor incidences in
30 traditional bioassays were analyzed using standard dose-response modeling methods to identify
31 noncancer and cancer points-of-departure. The dose-related changes in gene expression were also
32 analyzed using a modification of EPA's benchmark dose (BMD) approach (EPA 1995). The gene
33 expression changes were grouped based on both biological processes and canonical signaling
34 pathways. A comparison of the transcriptional BMD values with those for the traditional noncancer
35 and cancer apical endpoints showed a high degree of correlation for specific biological processes
36 (Thomas, RS etal. 2011) and signaling pathways (Thomas, RS etal. 2012). In addition,
37 transcriptional changes in the most sensitive pathway were also highly correlated with the apical
38 responses (see Figure 21). Further studies have also demonstrated the stability of the correlation
39 between transcriptional changes and apical responses across different exposure periods (5 days to
40 13 weeks) (Thomas, RS et al. 2013b). Understanding of the effect of exposure duration on outcomes
41 is a key issue in the design and use of new types of bioassays.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 62
-------
10000
B
I
•o
£ 1000
15
u

If
3 ° 100
10
• Cancer BMD
• Lowest Noticancer BMD
1DCCQ
1C
100
1DCC
•DM0
10
1CC
woo
1CCCC
Median Transcription.il BMD for Most Sensitive Gene
Ontology Category {mg/kg-d or mg/m3)
Median Transcrlptlonal BMDL for Most Sensitive Gene
Ontology Category (mg/kg-d or my.'in..1
Figure 21. Scatter plot of the relationship between (A) benchmark dose (BMD) and (B) benchmark dose lower limit
(BMDL) values for the cancer and noncancer apical endpoints and the transcriptional BMD and BMDL values for
the most sensitive GO category. For each chemical and tissue, the BMD and BMDL values for tumor incidence and
the lowest noncancer BMD and BMDL values were plotted. For MECL in the lung, no noncancer BMD or BMDL
values were plotted because of the absence of histological changes (Thomas, RS et al. 2011). Reproduced with
permission from Oxford Journals.
1 With the advent of quantitative high-throughput screening (qHTS), the potential to screen
2 thousands of chemicals for biological activity presents as many challenges as promises. If qHTS can
3 decrease the number of chemicals of interest by 90% (a 10% hit rate across chemicals and assays),
4 this would still overwhelm the throughput of the traditional toxicity testing paradigm. Clearly, a
5 multi-tiered approach to prioritization can lead to more effective applications of animal toxicity
6 testing. As part of this tiered approach, short-term in vivo transcriptomic assays provide a tool that
7 incorporates both metabolism and systems-level integration in response to chemical treatment. See
8 also a description of cost savings in Text Box 9. The development of predictive gene expression
9 signatures and dose-response studies would provide a relatively efficient and cost-effective method
10 for both identifying chemicals of concern and estimating a point-of-departure for adverse
11 responses. This information would help support large-scale prioritization and regulatory efforts in
12 the United States and Europe. The gene expression data combined with other data types (e.g.,
13 toxicity data from similar chemicals, PK data) could provide sufficient information to replace the
14 present chronic toxicity and carcinogenicity studies. It should be noted that expression changes can
15 vary depending on dose, time, species, tissue life stage, and individual genetic profile; thus,
16 increasing the complexity of identifying causal relationships between exposure, specific signatures,
17 and outcomes.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 63
-------
Box 9. Short-Duration In Vivo Models Potential Cost Savings

We concluded that, assuming no overlap among chemicals across the battery of 10 tissues tested, a proof of
concept for predicting hazard using short-term in vivo transcriptomic mammalian assays could be developed using
approximately 250 chemicals at a single dose across 10 tissues. Additionally, they calculated that the costs
associated with the proof of concept would be approximately $90,000 per chemical for a 28-day exposure for a
total cost of $22.5 million. In comparison, each chronic exposure assay costs approximately $1.5 million resulting
in a total of approximately $375 million.
Costs for similar efforts using alternative animals (fathead minnows, zebrafish, invertebrates) would be
approximately $10,000 per chemical for definitive reproductive assays. Short-term, pathway-based benchmark
dose (BMD) assays would be $10,000 per chemical for invertebrate or fish embryo assays and $48,000 per
chemical for 21-day fish reproductive assays (5 tissues) or $2.5 million and $12 million for 250 chemicals.
3.3. Tier 1: Screening and Prioritization

1 This section summarizes new approaches that are available to develop data for screening and
2 prioritizing large numbers of chemicals (i.e., greater than tens of thousands of chemicals) for more
3 focused testing. The increasing maturity of these new approaches has led EPA and other
4 organizations to plan on using these Tier 1 data to prioritize and screen chemicals for immediate
5 regulatory decision, for further testing in Tiers 2 and 3, or in some cases, to add to the weight of
6 evidence in Tier 2 and 3 assessments, especially with respect to identifying pathway or molecular
7 signatures associated with chemical-induced disease.

8 Tier 1 risk assessments are based on in vitro assays (including use of human cells), statistical and
9 systems models that focus on molecular molecular targets, QSAR models, and pathways considered
10 relevant to adverse effects or clinical disease. One scientific rationale for using in vitro assays is that
11 they probe key events or MIEs that can lead to adverse outcomes. Assay endpoints are designed to
12 represent MIEs and predict subsequent adverse outcomes based on previous studies, both in vitro
13 and in vivo. The analyses provide the anchoring information critical to characterizing relevance of a
14 "hit" in an in vitro assay. Documenting the linkage from assay endpoint to MIE to potential for
15 adversity is thus key to evaluating the relevance of each assay that might be used as part of a Tier 1
16 risk assessment. The evidence for this linkage can come from statistical modeling using in vivo and
17 in vitro data on the same chemicals, or from detailed biological modeling (e.g., virtual tissue (VT)
18 models or other types of systems biology models).

19 The modeling techniques used in Tier 1 (e.g., QSAR and HTS methods) are designed to assess
20 hundreds to thousands of chemicals in parallel (see Figure 23). In addition to using high-
21 throughput (HT) assays to generate hazard information, moderate- to HT toxicokinetics approaches
22 (here called reverse toxicokinetics or RTK (Rotroff et al. 2010)) are also developed and applied.
23 New approaches can now estimate doses that can activate particular relevant pathways in humans,
24 using data from in vitro assays (Wetmore et al. 2012). Bayesian-based exposure models can also be
25 used to generate exposure estimates for chemicals based on production volume and use patterns
26 (Wambaugh and Shah 2010).

27 The data generated from the Tier 1 assays can be used to prioritize chemicals for further study or
28 can simply augment the weight of evidence for chemicals that are already being considered in Tiers
29 2 or 3. For prioritization, from a large set of chemicals for which HTS data are available, one might
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 64
-------
1 identify the subset that is likely to interact with known relevant pathways, or which demonstrate
2 pathway disruptions similar to known diseases. Doses at which pathways may be activated or
3 perturbed may be estimated. These estimates may be combined with exposure, occurrence, and
4 other information to select chemicals which may advance into Tiers 2 or 3. Tier 1 data might also
5 directly augment Tiers 2 and 3 weight-of-evidence determinations helping to identify or further
6 characterize pathways alterations associated with disease for sensitive endpoints observed in
7 higher level in vivo testing, providing good examples of the integration of the bottom-up and top-
8 down perspectives advocated in the NexGen framework strategy. In vitro and modeling data can
9 also be used to guide a next round of in vivo data generation.

10 Table 8 provides a brief description and critical review of the tools, methods, and models that could
11 be used in Tier 1.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 65
-------
Table 8. Summary of Tier 1 NexGen Approaches, Including Weight of Evidence, Pros, and Cons
Tier 1: Screening and Prioritization
Categories of Approaches Considered
Approach:
Weight of
evidence:
Pros:
Cons:
New QSAR Models
Uses structural characteristics and
experimental data from chemical
analogues to predict modes of action,
metabolism, hazard, and fate and
potency for data-poor chemicals.
Determined by quality and amount of
existing data, but generally suggestive.

Data are readily and inexpensively
available for all chemicals. If the basis
for the QSAR model(s) matches the
physical chemistry of the evaluated
chemicals, the model(s) generally
predicted potency within a factor of
100. Harmonized international
approaches are available.

If models do not match the physical
chemistry of evaluated chemicals,
results are unreliable. Models do not
predict active metabolites.
Validated High-Throughput
In Vitro Assays
Experimentally measures dose-
dependent, chemically-induced
alterations in biological functions using a
range of specific and sensitive in vitro
assays. Infer potential adverse outcomes
based on existing knowledge of other
chemical and potential importance of
selected biological processes.

Determined by supporting traditional
data and systems biology knowledge, but
generally suggestive to likely.

Rapid, inexpensive, multiple bioassay
options are available. False negatives and
positives for ToxCast™ evaluated assays
are low (when testing directly acting
chemicals, not toxic metabolites).

Assay coverage of all important biological
processes currently incomplete resulting
in false negatives for chemicals that
perturbed those processes. Similarly,
disorders related to interactions among
cell types or tissues cannot be evaluated,
that is, reproductive/developmental
effects. Limited ability to test for active
metabolites or volatiles. False negative
rates are of concern. Links to disease
outcomes are variable.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 66
-------
and

1 (Q)SAR28 models are regression or pattern recognition models that are used in risk assessment to
2 classify or predict the potency of chemicals for toxicological activity, exposure potential, and the
3 like as a function of one or more chemical descriptors. The descriptors are generally the inherent
4 physiochemical properties of the chemical such as atomic composition, structure, substructures,
5 hydrophobicity, surface area charge, and molecular volume. QSAR models require only the inherent
6 properties of the 2-D or 3-D chemical structure as input parameters, and are thus considerably less
7 costly and faster than hazard animal test results. With a variety of QSAR models to choose from
8 (Hansenetal. 2011), and each model having a set of assumptions and a chemical domain of
9 applicability, interpreting QSAR results for use in hazard and dose-response assessment requires
10 expertise.

11 QSAR models have been most commonly used in classification of chemicals with unknown hazard
12 or exposure potential by comparing the "query" chemical's inherent properties with similar
13 properties for a set of chemicals that have known toxicological or exposure potential called the
14 "training set" (Venkatapathy and Wang 2013, EPA2012c, Goldsmith et al. 2012, OECD 2012, Wang,
15 N et al. 2012b, EC 2010). SAR models provide a qualitative identification of specific hazards (e.g.,
16 suspected carcinogens, mutagens, and reprotoxicants). The commercially available TOPKAT model
17 (TOPKAT, 2013) provides quantitative estimates that can be used to rank chemicals for potency
18 (Venkatapathy and Wang 2013, Venkatapathy et al. 2004). In the European Community, QSAR
19 results are used to prioritize chemicals for additional toxicity testing.

20 At EPA, (Q)SAR models are being used to screen, rank, and categorize chemicals for level of concern
21 in a variety of EPA programs, including Superfund mitigation; the Office of Chemical Safety and
22 Pollution Prevention (OCSPP) High Production Volume Challenge Program and Pre-Manufacture
23 Notice review process; the OCSPP/Office of Water Endocrine Disrupters Screening Program (Weiss
24 et al. 2012); and the Office of Water Candidate Contaminant List The QSAR models used by EPA
25 include the Sustainable Futures Initiative suite of models, the Organization of Economic Co-
26 operation and Development (OECD) QSAR toolbox models (OECD 2012, 2004), High-throughput
27 Virtual Molecular Docking (HTVMD) (Rabinowitz etal. 2008), MetaCore (Teschendorff and
28 Widschwendter 2012, van Leeuwen et al. 2011), and the TOPKAT model (Rakyan et al. 2011,
29 Venkatapathy et al. 2004).

3 0 HTVMD models use a ligand-based chemoinformatics strategy to predict relationships between
31 various attributes of ligands and their binding to known targets. These models are increasingly
3 2 being used in risk assessment and can screen thousands of chemicals for the potential affinity of
33 their 3D structures to bind to active protein binding sites. HTVMD models have been used in the
34 pharmaceutical industry for many years to identify candidate drugs. These models can also be used
35 to estimate the likelihood that a chemical of toxicological interest would bind to a target protein, for
36 example, the potential affinity as a direct agonist of the estrogen receptor.
28The parentheses around the "Q" in (Q)SAR indicate that the term refers to both qualitative predictive tools,
i.e., structure-activity relationships (SARs) and quantitative predictive methods, i.e., quantitative structure-
activity relationships (QSARs). Although the term (Q)SAR is often used to refer to predictive models,
especially computer-based models, (Q)SAR actually includes a wide variety of computerized and
noncomputerized tools and approaches (Hansen et al. 2011).
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 67
-------
1 Recent advances in high-performance computing support simultaneous runs of QSAR and HTVMD
2 models, dramatically decreasing the time to discovery. The U.S. Army Medical Research and
3 Materiel Command, for example, has recently published their version of a Docking-based Virtual
4 Screening pipeline that facilitates the usage of the AutoDock molecular docking software on
5 high-performance computing systems (Jiang et al. 2008).

6 The OECD provides a free downloadable QSAR software package, the QSAR Toolbox, that is
7 intended for use by governments, the chemical industry, and other stakeholders to assess potential
8 chemical human and ecological toxicities for data-poor chemicals (OECD 2012). The QSAR Toolbox
9 estimates the potential toxicity of a compound of interest based on the available information (e.g.,
10 mechanism, MOA, or toxicological effects) for structurally similar analogs, and uses read-across or
11 trend analysis to construct categories of chemicals for screening purposes even if only a few of the
12 members in the category have available test data. The popularity of the read-across method is
13 driven by its relative simplicity and the availability of the QSAR Toolbox online. OECD has also
14 developed guidance on the validation of QSAR models when used for regulatory purposes (OECD
15 2004). Assessments informed by new data types and methods will incorporate the results from
16 data sources that can be automated (e.g., QSAR and molecular docking models, and HTS data), with
17 the more traditional data (when available) to advance the speed and accuracy of chemical screening
18 and to support the weight-of-evidence approach to toxicity prediction (Golbraikh et al. 2012, Lock
19 etal. 2012, Rusynetal. 2012, Wignall etal. 2012, Sedykh etal. 2011). Use of the above models and
2 0 approaches will advance the ranking of chemicals currently being produced, as well as support the
21 design of new products and chemical processes that increasingly minimize harm to health and the
22 environment.

and

23 HTS and high-content screening (HCS) assays are major tools used for early evaluation of chemicals
24 and their ability to perturb molecular pathways (Judson et al. 2013, Sipes et al. 2013, Tice et al.
25 2013, Kavlocketal. 2012, Judson etal. 2011). Much of the HTS/HCS (for the remainder of this
26 section use of the term HTS includes both HTS and HCS) methodology was developed to aid the
27 pharmaceutical and biotechnology industries in the drug discovery process where one has a drug
28 target of interest (e.g., a receptor or enzyme) and a need to screen up to millions of candidate
29 compounds for leads (Mayr and Bojanic 2009, Bleicher et al. 2003). The technology has been used
30 more broadly in approaches often called chemical genetics (or sometimes chemical biology) where
31 small molecule screening is used to identify probes for biological signaling networks and cellular
32 phenotypes (Schreiber 2003). These assays became of interest in toxicology because many targets
33 of pharmaceutical and chemical biology interest could also be postulated to be involved in disease
34 processes driven by unintentional exposures to environmental chemicals (Houck and Kavlock
35 2008). Generating a large data matrix of toxic chemicals and HTS assays against critical proteins
36 and cellular phenotypic effects provides toxicologists an opportunity to discover novel MOAs that
3 7 have long eluded the field.

38 The underlying technologies for HTS assays are well known, so a detailed discussion is not
39 presented here. Instead, the discussion focuses on a broad description of the types of assays and
40 some of the key issues to be considered when designing in vitro Tier 1 approaches. HTS assays are
41 broadly divided into two types: cell-free/biochemical or cell-based. Cell-free assays typically test
42 for the direct interaction of a test chemical with a specific protein such as a receptor or enzyme.
43 Measures of interaction include binding or inhibition of enzyme activity. In cell-based assays, a
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 68
-------
1 cellular readout can be molecular-based (e.g., changes in gene or protein expression) or phenotypic
2 (cytotoxicity, changes in cell morphology). In a cell-based assay, the selection of the cell system is
3 critical. Assays have been developed using a variety of primary cell types from various organs and
4 species, immortalized cell lines, and stem cell types (Dick et al. 2010). The choices reflect the
5 strengths and weaknesses of the different approaches. For example, immortalized cell lines
6 generally produce very reproducible screening results over long periods of time due to the
7 continuous growth and stability of the cell lines; however, this occurs at the cost of having
8 significant differences from the in vivo physiology of the cell type from which the line was derived.
9 The converse holds true for most primary cells, that is, better representation of true physiology but
10 more challenging to work with in producing consistent, reproducible screening results. Co-culture
11 systems combine different cells in an attempt to mimic in vivo systems requiring complex cell-cell
12 signaling networks (Berg et al. 2010). Certain whole organisms, including Caenorhabditis elegans
13 and zebrafish embryos, can also be used in HTS assays (Smith, MV et al. 2009, Parng et al. 2002).

3,3.3,

14 HTS assays provide toxicologists with an efficient and cost-effective tool to broadly screen
15 chemicals for potential proximal biochemical and cellular interactions. As previously mentioned,
16 the HTS assays are run in concentration-response format. The potency of each chemical in each
17 assay can be summarized using ACso or LEC (lowest effective concentration) values, depending on
18 the type of dose-response data collected. The potency values among the in vitro assays, along with
19 other chemical information, have been proposed for use in hazard identification (Martin etal. 2011,
20 Sipes etal. 2011) and prioritization of chemicals for further testing (Reif etal. 2010). The
21 relationship between the in vitro concentration of the chemical in the well to the concentration of
22 the chemical in the blood or target tissue [in vivo], however, can be complex and dependent on
23 variables that are not captured in the HTS assays. These variables include bioavailability, clearance,
24 and protein binding (Wetmore et al. 2012).

25 In vitro to in vivo extrapolation (IVIVE) is a process that uses data generated within in vitro assays
26 to estimate in vivo drug or chemical fate. In the past, IVIVE has been predominantly developed and
27 applied in the pharmaceutical industry to estimate therapeutic blood concentrations for specific
28 candidate drugs, and to identify potential drug-drug interactions (Chen, Y et al. 2012, Shaffer et al.
29 2012, Gibson and Rostami-Hodjegan 2007). Due to both legislative mandates and public pressure
30 for increased toxicity testing, IVIVE is increasingly being used to predict the in vivo PK behavior of
31 environmental and industrial chemicals (Basketter et al. 2012).

32 A combination of IVIVE and reverse dosimetry can be used to estimate the daily human oral dose
33 (called the oral equivalent dose) necessary to produce steady-state in vivo blood concentrations
34 (Css) that are considered equivalent (with respect to chemical concentration at potential targets) to
35 the dose delivered in vitro at the ACso or LEC values, and can be used for those values across the
36 more than 600 in vitro assays (Wetmore etal. 2012, Rotroff etal. 2010).

3,3.4,

37 The use of HT assays to characterize biological activity in vitro enables prioritization of potential
38 environmental hazards once the results of in vitro assays have been anchored to, and found to be
39 predictive of, in vivo effects. Without capabilities for HT assessment of potential for exposure,
40 prioritization (with respect to potential risk) cannot be completed, as most chemicals have little or
41 no exposure data (Wetmore etal. 2012, Arnotetal. 2010b, Arnotetal. 2010a, Cohen Hubal etal.

This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 69
-------
1 2010, Rotroffetal. 2010, Hubal 2009, Sheldon and Cohen Hubal 2009, Rosenbaum et al. 2008,
2 Arnot and Mackay 2007, NRC 2006). Currently, few, if any, inexpensive in vitro assays are widely
3 available to characterize those properties of chemicals relevant to exposure. Furthermore, the
4 studies for assessing both the presence of environmental chemicals in the immediate vicinity of
5 individuals (exposure potential) and any known biomarkers of actual exposure are expensive, labor
6 intensive, and, with the notable exception of CDC's NHANES, typically difficult to extrapolate to the
7 general population (Rudel et al. 2008, Angerer et al. 2006, Eskenazi et al. 2003). For these reasons,
8 exposure prioritization must be drawn from mathematical models which, when parameterized by
9 chemical-specific properties, provide a structured, consistent way to approach large numbers of
10 unknown chemicals.

11 Physicochemical properties (e.g., water solubility, preference for binding in lipids) inherent to a
12 given compound have been used to predict potential bioaccumulation, and even toxicity, within
13 ecological species to make HT prioritizations of potential chemical exposure (Gangwal et al. 2012,
14 Reuschenbach et al. 2008, Walker et al. 2002, Walker and Carlsen 2002). Beyond inherency,
15 environmental fate and transport models have been developed to account for the accumulation of
16 compounds in various environmental media (i.e., air, soil, water) and the degradation rates of those
17 compounds in those media. These fate and transport models enable predictions of human exposure
18 based on assumptions of human interaction with environmental media and derivation of food from
19 the environment (Arnot et al. 2010b, Arnot et al. 2010a, Rosenbaum et al. 2008, Arnot and Mackay
20 2007). Parameterized using chemical structure and production volumes alone, these models can be
21 used to make HT exposure prioritizations (Arnot and Mackay 2007).

22 EPA is developing the ExpoCast exposure model prioritization framework, which is flexible and
23 expandable to incorporate new HT exposure models as they become available. Currently the
24 framework relies on two quantitative fate and transport models amenable to HT operation: USEtox
25 (Rosenbaum et al. 2008) and RAIDAR (Arnot and Mackay 2007). These models have been
26 empirically assessed for their ability to predict exposures inferred from the NHANES data set.
27 These "ground truth" biomonitoring data are used to calibrate the model predictions and estimate
28 de facto uncertainty of the predictions for 41 chemicals where intake per unit emission, total
29 production volume or volume applied, and actual exposures inferred from biomonitoring data were
30 available. The calibration and uncertainty are then extrapolated to ~1,600 chemicals to make rank
31 order predictions on a per unit emission basis, as well as a rank order prediction for ~600
32 chemicals adjusted using production volume (Wambaugh and Shah 2010).

33 NexGen efforts to incorporate exposure prioritization information could proceed along three fronts.
34 First, efforts to evaluate the utility of the predictions must be undertaken to determine if the
3 5 chemicals of highest priority are indeed present in the environment. Next, new models must be
3 6 developed to address aspects of exposure currently underrepresented by fate and transport
37 models—namely exposure from personal contact sources (i.e., consumer use). Finally, using the full
38 uncertainty range of the absolute exposure predictions (mg/kg body weight/day), risk potentials
39 could be calculated for risk-based prioritization.

(¥11

40 VT models provide an experimental and theoretical framework for the systematic and integrative
41 analysis of complex multicellular systems. These models capture the flow of molecular information
42 across cellular and biological networks, and process this information computationally into higher
43 order responses that ideally simulate a potential adverse outcome(s). Responses to perturbation

This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 70
-------
1 depend on network topology, system state dynamics, and collective cellular behavior. A unique
2 aspect is that these simulations are enabled from individual cellular behaviors in a multicellular
3 field that can result in emergent properties, which are behaviors that arise from interactions of
4 parts at the next level of a system (e.g., functions, phenotypes) that are not apparent from
5 knowledge about the behavior of the parts alone.

6 The field of VT models is in the early stages of development but will become more prominent as the
7 state of science develops. Jacket al. (2011) and Knudsenetal. (2010) provide examples of VT utility
8 and the state of science. VT models are practical solutions for translating between biological data
9 and individual and population-level health outcomes. They combine data and knowledge into
10 computer models that predict behavior of a complex system, leading to adverse outcomes in
11 hepatic toxicity, developmental toxicity, reproductive toxicity, cardiopulmonary toxicity, and more
12 (EPA2009b).

13 Virtual models are also briefly discussed in Section 4.4 as one of the new approaches that can
14 address recurring issues in risk assessment, in this case, dose-response characterization.

3.3.6. and

15 For EPA to base regulatory decisions on data from mechanistic-based evaluations, several issues
16 must be addressed. EPA will need to develop criteria and approaches for translating data across the
17 various types of testing and to identify the types of data and information to support the use of these
18 data in a regulatory context. To this end, EPA's NexGen Thyroid Disrupting Chemical Workgroup
19 (EPA 2012a) conducted a thyroid prototype case study that reviewed existing ToxCast™ assays and
20 provided recommendations for how the data could be used to predict thyroid disruption-induced
21 developmental neurotoxicity.

22 A major reason the workgroup selected the thyroid hormone system as its prototype is that the
23 underlying biology of thyroid hormone homeostasis is well established, thus enabling the
24 elucidation of the pathway(s) for thyroid hormone disruption (Zoeller and Crofton 2005). The
25 workgroup identified three issues that should be addressed to use HT assays to predict which
26 environmental chemicals would likely cause developmental neurotoxicity via disruption of thyroid
27 hormone homeostasis. These issues are Assay Identification and Refinement; Algorithm
28 Development for Toxicity and Hazard Prediction; and Standards Development for Assay Conduct,
29 Data Analysis, and Data Reporting for Risk Assessment Needs. The following is a brief summary of
30 the case study.
31 As a first step, the workgroup identified the HT assays in the ToxCast™ database that assess
32 endpoints known to be relevant to disruption of thyroid function. The workgroup found that
33 ToxCast™ contains multiple assays relevant to assessing the potential for a chemical to disrupt
34 thyroid hormone homeostasis. Coverage of the effects of concern, however, is quite variable.
35 Although five of the identified assays evaluate endpoints that directly affect the thyroid hormone
36 pathway (e.g., thyroid hormone receptor binding and TRH receptor binding), the rest evaluate
37 endpoints not specific to the thyroid hormone pathway. For example, of the 90 assays identified as
38 thyroid-relevant, 85 are related to hepatic stimulation, metabolism, and clearance of thyroid
39 hormones. Alteration of these pathways influences thyroid hormone homeostasis indirectly, and
40 neurodevelopmental effects tied to thyroid disruption by this mechanism are thus secondary effects
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 71
-------
1 of a chemical (inadequate hormone availability due to increased elimination). These secondary
2 effects are in contrast to a primary effect, whereby a chemical interferes directly with the function
3 of the thyroid gland itself or interacts at the site of thyroid hormone receptor in the brain of a
4 developing organism.

5 Adequately assessing the potential of an environmental chemical to disrupt thyroid hormone
6 homeostasis requires that appropriate endpoints be identified and assays be developed and
7 incorporated into testing schemes. This process will involve identifying the specific endpoints in
8 the pathways that need to be tested, additional assays that could be available but which are not
9 currently part of ToxCast™, and additional assays that need to be developed. A recent workshop
10 review by Murk et al. (2013) provides a state-of-the-science assessment of important MIEs for
11 thyroid disrupters, potential and currently used assays for these MIEs, and recommendations for
12 research priorities.
13 The workgroup's second recommendation was to develop algorithms or decision logic flows that
14 balance the potential adversity of the outcome with the uncertainty of the available data. Should
15 assays evaluating endpoints directly affecting the thyroid-related brain changes be weighted more
16 heavily in algorithms than those measuring upstream hepatic enzyme induction? How will
17 algorithms incorporate the fact that multiple chemicals might interact with the same key event, and
18 one chemical might interact with various MIEs, and thus lead to multiple adverse outcomes?
19 Biological plausibility should be the driver in algorithm development

20 Another aspect to consider is the methods used to incorporate assay results into analyses. Clearly,
21 incorporating many sets of dose-response information into combinatorial analysis requires some
2 2 simplification of assay results. Many current HT assay results are simplified via classification as
23 either a positive or negative ("hit" or "no hit"), or are assigned a summary statistic such as an ICso
24 (the concentration producing a 50% inhibition of response) or lowest effective dose. Obviously,
25 binary decisions such as hit/no hit determinations depend on the criteria chosen to define a hit
26 These criteria could be derived from statistical significance, biological significance, or an arbitrary,
27 nominal level of change. Depending on the data set, the basis for the classification criteria might be
28 difficult to determine, and might not be consistent across assays. Similarly, summary statistics
29 depend on the model used to generate them or on the specific value chosen (such as ICso versus
30 ICio). Relative potency ranks also might vary depending on the shape of the dose-response curve,
31 such that within a given set of chemicals, Chemical A could have the lowest ICso while Chemical B
32 had the lowest ICio value. Lack of such information will lead to greater uncertainty in its use.

and for

33 Understanding the characteristics of the individual assays that will serve as the basis of these
34 predictions is critical when using HTS data. Individual assay characteristics are key regardless of
35 the ways in which the data are ultimately used, which might span the spectrum from combinatorial
36 use in predictive algorithms, test batteries for hazard identification and prioritization, to
37 supporting data for individual chemical risk assessments. Although these uses are potentially
38 diverse, several common assay characteristics will be needed. Some of the specific types of
39 information needs might vary depending on the type of risk assessment to be performed.
40 Minimally, the data reporting should include sufficient information to document assay conduct and
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 72
-------
1 reliability, the rationale for selection of exposure levels, data analysis techniques, and underlying
2 assumptions regarding assay analysis, conduct, or conclusions.

3 Some advantages of the ToxCast™ data sets are the (1) availability of dose-response information
4 for all assays, (2) availability of assay method details, and (3) availability of the source code for all
5 computational models used in the data analyses. Reliable dose-response information is critical for
6 these types of assays to be useful in risk assessments. Dose-response information is fundamental to
7 understanding the many aspects of chemical toxicity, as it provides a means to evaluate the potency
8 of the chemical and whether a threshold exists.

9 In conclusion, the current case study was complicated by the multitude of target sites at which the
10 thyroid axis can be disrupted (Murk et al. 2013, Crofton and Zoeller 2005); the secondary, indirect
11 nature of the insult produced; and the complexity of the endpoint of concern—neurodevelopment
12 By conducting this case study, however, the workgroup could identify not only the nodes in the
13 thyroid toxicity pathway that still need coverage, but also the algorithm development and assay
14 conduct issues that should be addressed if HTS assays are to be used in risk assessments.

15 In addition to informing chemical specific assessments as discussed above, new data types and
16 advanced approaches also can inform important, recurrent, cross-cutting risk assessment issues.
17 These issues are often sources of controversy due to limited data specific to the issue. A number of
18 these issues are discussed below: variability in human response (e.g., genetic variability, early life
19 exposures; exposure to mixtures and nonchemical stressors); inter-species differences; and
20 characterization of low-level chemical exposures likely to be encountered in the environment This
21 section discussed how new data types and approaches can inform these difficult issues, thus
2 2 improving our understanding of public health risks.

4.1.

23 Human response to environmental chemicals is influenced by both intrinsic (e.g., genetics, life
24 stage) and extrinsic (e.g., chemical exposure, stress, nutrition) factors. New methods to examine
25 gene-gene, gene-environment, and epigenome-gene-environment interactions are available (Patel
26 etal. 2013, Lvovs etal. 2012, Meissner 2012, Patel etal. 2012a, Patel etal. 2012b, Baker 2010,
27 Thomas D 2010, Cordell 2009). Zeise etal. (2012) explored how these factors can influence each of
28 the series of biological and physiological steps (known as the source-to-outcome continuum) that
29 ultimately manifests in variability with respect to adverse health outcomes (see Figure 22). The
30 Zeise et al. (2012) review was informed by a National Research Council (NRC) workshop,
31 "Biological Factors that Underlie Individual Susceptibility to Environmental Stressors and Their
32 Implications for Decision-Making." The authors considered current and emerging data streams that
3 3 are providing new types of information and models relevant for assessing interindividual
34 variability.

3 5 Currently, human variability is usually accounted for by including an uncertainty factor of 1, 3, or
36 10 in the calculation of a reference dose for noncancer health effects. Variability is not explicitly
37 accounted for in cancer health assessment with the exception of the incorporation of an age-specific
38 adjustment factor of < 10 for childhood exposures to genotoxic carcinogens. In a few cases, data on
39 sensitive populations (e.g., asthmatics and those sensitive to air pollutants) might be specifically
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 73
-------
1 incorporated into risk assessments. Figure 23 from Zeise et al. (2012) illustrates how different
2 types of variability can influence dose-response relationships.

3 Several strategies have been developed to characterize variability in pharmacokinetics (PKs): (1)
4 for data-rich chemicals (such as Pharmaceuticals), a "population PK" approach is used to measure
5 variability and discover the determinants; (2) "predictive PK" uses mechanistic models, assigns
6 a priori distributions to specific parameters that can be measured experimentally, and uses Monte
7 Carlo simulations to propagate distributions from model parameters to model predictions; and (3)
8 reproduced "Bayesian PBPK" employs a synthesis of the two previous approaches (EPA 2008).
Types of
biological
variability
Horedltv
(genetic and
epigenetic)
Sex,
life stag a, and
aging
Existing health
condrtions
Composures
I sources outside
decision context]
Food/nutrition
Psychosocial
stressors
Modifying how
changes in
source/media
concentrations are
propagated to
changes in
outcome.
Source-to-outcome continuum

Source/media concentrations
Multiple sources leading to
chemicals in multiple media

Exposure
parameters
Background and
coexposure doses
Multiple chemicals via
m LI tip's routes
ntarna concentrations
Endogenous
concentrations
Multiple chemicals {including metabolites)
at multiple target sites
For fixed
source/media
concentrations,
modifying the
background or
baseline
conditions.
Baseline biomark
values
Multiple biological responses in
multiple tissues/biological media
Systems
parameters
Physiological/hearth status
Outcome latency,
likelihood, and
severity
Susceptibility
indicators
Figure 22. Framework illustration of how susceptibility arises from variability. Multiple types of biological variability
intersect with the source-to-outcome continuum, either by modifying how changes to source/media
concentrations propagate through to health outcomes, or by modifying the baseline conditions along the
continuum. The aggregate result of these modifications is variability in how a risk management decision affects
individual health outcomes. The parameters and initial conditions along the source-to-outcome continuum serve
as indicators of differential susceptibility, some of which are more or less influential to the overall outcome (see
Figure 25) (Zeise et al. 2012). Reproduced with permission from Environmental Health Perspectives.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 74
-------
4.1.1. Genomic Variability
1 An estimated 20%-50% of phenotypic variation is captured when all single nucleotide
2 polymorphisms (SNPs) are considered simultaneously for several complex diseases and traits. The
3 proportion of total variation explained by individual genome-wide-significant variants has reached
4 10%-20% for a number of diseases (Visscher et al. 2013). Environmental factors are thoughtto
5 contribute the remaining variability. The interaction between genetic and environmental factors is
6 a key concern in the description of public health risks.
CO

I
Si
s>
1.0
09
03
07
Oft
OE>
0.4
03
0.2
0.1
0
Different
internal ilusi;
depending on
PK parameters
ID 15 20
External dose
25
30
Fixed change in
external dose
due to source
Different change
in internal dose
depending an
background/
coexposure doses
ID 15 20
Total external dose
25
3D
0.25

0.15

0.1
.2 0.05
Different biological
response depending
on PD parameters
02S
0.2
£ 0.15
0.5 1.D

Internal dose

Fixed change in
internal dose
due to source
0.5 1,0
Total internal dose
DiHerant change
in response
depending on
endogenous
internal dose
1.5
Figure 23. Effects of variability in pharmacokinetics (PK) (A), pharmacodynamics (PD) (B),
background/exposures (C), and endogenous concentrations (D). In (A) and (B), individuals differ in PK or
PD parameters. In (C) and (D), individuals have different initial baseline conditions (e.g., exposure to
sources outside of the risk management decisions context; endogenously produced compounds) (Zeise et
al. 2012). Reproduced with permission from Environmental Health Perspectives.
7 Several approaches to generating and evaluating genomic data are now emerging that can provide
8 new insights into human variability (both PK and pharmacodynamic [PD]) including (1) in silica
9 modeling approaches in which variability in parameter values is simulated, and differences among
10 subpopulations explored (Shahetal. submitted, Knudsenetal. 2011, KnudsenandDeWoskin2011,
11 Shah and Wambaugh 2010); (2) high-throughput (HT) in vitro data generation using cells lines with
12 different genetic backgrounds (Abdo etal. 2012, Locketal. 2012, O'Sheaetal. 2011); (3) in vivo
13 studies in genetically diverse strains of rodents to identify genetic determinants of susceptibility
14 (Harrill etal. 2012, NIEHS 2012a); (4) comprehensive scanning of gene coding regions in panels of
15 diverse individuals to examine the relationships between environmental exposures, interindividual
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 75
-------
1 sequence variation in human genes, and population disease risks (Mortensen and Euling 2013,
2 NIEHS 2012b); (5) genome-wide association studies (GWAS) to uncover genomic loci that might
3 contribute to human risk of disease (NHGRI 2013, Abecasis etal. 2012, Bush and Moore 2012); and
4 (6) association studies that correlate measures of phenotypic differences among diverse
5 populations with expression patterns for groupings of genes based on co-expression (Friend 2013,
6 Patel etal. 2013, Patel etal. 2012a, Weiss etal. 2012). New understanding of the contribution of
7 epigenomics to disease is rapidly advancing with evaluation of changes such as differential
8 methylation of DNA (Teschendorff and Widschwendter 2012, Hansenetal. 2011, Rakyan etal.
9 2011). Risk assessments of the future will begin to incorporate these types of data as they become
10 available.

11 Panel a) in Figure 24 illustrates one example of how new types of genetic variation data can be used
12 in risk assessment, in this case, how a population concentration-response curve can be estimated
13 for cycloheximide based on HT in vitro data using cell lines with different genetic backgrounds. The
14 approach reported by Lock etal. 2012 is being used in Tox21 Phase II, (in collaboration with Rusyn
15 and colleagues at the University of North Carolina) to expand the study of interindividual
16 differential sensitivity to evaluate approximately 1,100 different human lymphoblastoid cell lines,
17 with densely sequenced genomes representing 9 races of humankind, to 180 toxicants. Data will be
18 collected on more chemicals in the future. The numbers of chemicals evaluated in the future in this
19 manner will expand. The large number of human cell lines used allows for an analysis of genetic
20 determinants associated with differential cytotoxicity in vitro. This approach will provide
21 significant new insights into human variability in response and can better inform current and
22 future risk assessments. Other examples of human variability data are discussed in the benzene
23 prototype and in Text Box 10 using GWAS data.29
290ne caveat: The differential risks conferred by human genetic variability are complex and might not be
captured by analyses of small-scale gene variability alone. Hundreds to thousands of genes are likely to be
involved in any disease, and multiple variations in genetic makeup might confer similar increased or
decreased risk for the same disease. The occurrence of disease also could be influenced by emergent system
properties that require analysis not only of how gene variations affect cellular components, but how effects
on critical network interactions propagate up through higher levels of the biological system (Torkamani et al.
2008). Consequently, although incorporation of new types of data can better characterize human variability,
the characterizations are likely to be incomplete.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 76
-------
_ o
o ^

§0

Jo
. '
'g
'x o
2 °?
"o
M" °
U o
5th percentile^ ^ Mean
0.0003 0.0023 0.02 0.19 1.71
[Cycloheximide], [iM
15.35 46.08
Figure 24. Panel a: A population concentration-response was modeled using in vitro
quantitative high-throughput screening (qHTS) data using cycloheximide data
(cytotoxicity assay) as an example. Logistic dose-response modeling was performed for
each individual to the values shown in gray, providing individual 10% effect
concentration values (EC10). The EC10 values obtained by performing the modeling on
average assay values for each concentration (see frequency distribution) are shown in
the inset. Panel b: A heat map of clustered FDRs (q values, see color bar) for
associations of the data from caspase-3/7 assay with publicly available RNA-Seq
expression data on a subset of cell lines. A sample subcluster is shown (Lock et al.
2012). Reproduced with permission from Oxford Journals.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 77
-------
• . , "!.

1 Early-life chemical exposures can invoke molecular effects that appear to result in increased
2 susceptibility to disease or other morbidity later in life, often via epigenetic modifications
3 (Boekelheide et al. 2012). Evidence from both humans and animals helped establish the influence of
4 early-life exposure on later-life outcomes. For example, human observational data and animal
5 studies report that arsenic exposure during prenatal and early postnatal life increase the risk of
6 cancer, respiratory, and cardiovascular diseases, and neurobehavioral disorders, as supported by
7 human observational data and animal models (Cronican et al. 2013, Boekelheide et al. 2012, Tokar
8 etal. 2012, NRC 2011, Tokar etal. 2011). Later-in-life outcomes can be influenced by time of
9 exposure, species' predisposition to a particular disease, an individual's genetic predilection to
10 disease, or gender. Improved ability to predict disease risk associated with in utero or early
11 postnatal exposures results from advances in identifying the targeted genomic region of
12 chemicals/chemical mixtures, epigenetic alteration of gene expression, and the causal links
13 between early-life chemical exposure and later-life outcomes (Boekelheide et al. 2012, NRC 2011).

14 Epigenetic biomarkers for early-life exposures (e.g., placental epigenetic biomarkers, plasma
15 biomarkers) have the potential for use as early indicators of adverse health effects later in life.
16 Development and interpretation of epigenomic biomarkers is in the early stages of development
17 (Hansen et al. 2011, Rakyan et al. 2011); however, as understanding of the underlying epigenetic
18 mechanisms (e.g., DNA methylation, histone modification, microRNA) advances, more will be
19 known about the relationship between biomarkers of early-life exposure and later-life disease risk.
20 A good example is the work that associated early-life exposure to arsenic and DNA
21 hypomethylation with the development of arsenic-induced skin lesions (Boekelheide et al. 2012).
22 The roles of environmental factors that positively and negatively influence health outcomes require
23 study.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 78
-------
Box 10. Combining Genetics and Bioinformatics to Improve Estimates of Variability in Human Response

Variability in human response to chemical exposures is partly due to genetic influences. The National Center for
Biotechnology Information at the National Institutes of Health National Library of Medicine has a vast array of
databases devoted to human variability, especially genotype-to-phenotype associations. These resources include dbSNP
(database of single nucleotide polymorphisms and estimates of their occurrence within the population), dbGaP
(database of Genotypes and Phenotypes), GTEx database (Genotype-Tissue Expression), OMIM (Online Mendelian
Inheritance in Man), and PheGenl (Phenotype-Genotype Integrator; aggregates information from many of the
aforementioned resources).

In this example, genome-wide association study (GWAS) data were reviewed to examine the relationship between
genotype and white blood cell count in benzene-exposed and non-benzene-exposed workers in China. This work has
been used, in part, to describe a possible mode of action for benzene hematotoxicity. Lan et al. (2009) identified single
nucleotide polymorphisms (SNPs) associated with four DNA repair and genomic maintenance genes that could be
involved in carcinogenesis. These SNPs confer significant odds ratios from 1.4 to 5.7 of having a white blood cell count <
4000 cells/ul blood. This observation demonstrates a quantitative increased risk of hematotoxicity in people with any
of these SNPs. Hematotoxicity is highly correlated with leukemia resulting from benzene exposure. Hence, these SNPs
also might confer susceptibility to leukemia.

PheGenl provides links to dbSNP to view genetic diversity of SNPs within reported populations. For instance,
rs!2951053's A/C genotype is reported to occur in 51.1 % of Chinese and 31.1% of Japanese; and among Europeans and
those of European decent, the A/C genotype occurs in approximately 9-17 % of the population (NCBI 2012a).

Overall, the minor allele (C), has a relatively low penetration within the global population at just 18.7% ± 2.2% (mean ±
standard error of the mean), and an average heterozygosity of 30.0% ± 24.5 % (average ± standard error of the mean).

Using the global minor allele rate of 18.7% ± 2.2 %, we can construct a probability function and model that any given
member of the population has the minor allele A for rs!2951053 SNP. Using this probability function, we can estimate
the number of people who might have a white blood cell count < 4,000, thus the potential for hematotoxicity, as well as
the model uncertainty. This gives us a quantitative estimate of human health hazard.

In addition, this approach can help with environmental justice issues. For instance, by using census demographic data
and the SNP occurrence data for people descended from specific groups, creating probabilistic models that might more
accurately reflect the SNP pool of a population, and thus, human variability, is possible. With respect to at-risk
populations, regulatory agencies could use this type of information to inform their site-specific risk assessments, such as
a Superfund Site Risk Assessment in the United States.
4.1.3. Mixtures and Nonchemical Stressors

1 Cumulative risk is a function of the exposure to the combined threats from all intrinsic and extrinsic
2 stressors (e.g., chemical exposure, pharmaceutical use, underlying susceptibility, socioeconomic
3 status, work-life stress) and factors that improve health (e.g., good diet, exercise). The assessment
4 of cumulative risk remains a challenging area for human health risk assessment Only a few studies
5 have examined the potential impact of exposure to environmental chemical mixtures, or to
6 mixtures and nonchemical stressors; and innumerable combinations of chemical mixtures and
7 nonchemical stressors occur in the environment Conventional methods for risk assessment have
8 made little progress in scaling this particularly mountainous cumulative risk challenge. New
9 methodologies in systems biology, computational models, and data mining provide promise by
10 taking a more comprehensive disease-oriented approach to identification and management of
11 cumulative risk for chemical classes or structures. HTS and omics assay data can be combined with
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 79
-------
1 bioinformatics data mining and computational cellular signaling simulations to predict possible
2 disease outcomes (for screening-level assessments) that, combined with higher level systems data,
3 can identify common patterns of significant pathway or network alterations associated with disease
4 (for more quantitative risk assessments). As our molecular understanding of how nonchemical
5 stressors modulate disease continues to evolve, we will also be able to leverage data from systems
6 biology and network analyses to obtain a better understanding of potential cumulative chemical
7 and nonchemical stressor interactions in biological systems and the resulting health impacts.
8 Because epigenomic networks are more easily modulated by environmental factors than the
9 genome, epigenomics should be considered an area of focus for identifying mechanisms that
10 mediate cumulative risks imposed by exposures to environmental factors (Cortessis et al. 2012,
11 Koturbash et al. 2011, Bollati and Baccarelli 2010).

4,2.

12 The traditional use of animal models in hazard identification and characterization of dose-response
13 employs chemical testing in mammalian species, and application of an interspecies (animal-to-
14 human) uncertainty factor (< 10) or body-weight conversion factor to derive an EPA reference
15 value. Increased understanding of the toxicological or biological pathways and their similarity (or
16 lack thereof) among species will improve the extrapolation of chemical effects across species, and
17 the related challenge of selecting model organisms for testing, in contrast to solely comparing apical
18 responses. As knowledge increases on the extent of pathway conservation among species,
19 alternative test species, including nonmammalian vertebrates (adult and embryonic zebrafish) and
2 0 invertebrate models, will be of greater use in chemical risk assessment Regulatory toxicology as a
21 whole will move toward increasing reliance on predictive approaches to assessing chemical risk,
22 with a greater emphasis placed on understanding chemical perturbation(s) of conserved biological
23 pathways at key junctures, including molecular initiating events (MIEs) (e.g., activation or
24 inactivation of specific receptors, enzymes, or transport proteins).

25 Data from alternative mammalian species and in vitro models are valuable for both ecological and
26 human health risk assessment when used in a pathway-based framework (Ankley, G. T. et al. 2010).
27 The extrapolation between species can occur at different levels of biological organization, such as
28 the MIE, the pathway, and the organ or individual levels. Based on the similarity of pathway-based
29 values to standard toxicological values, this appears to be a useful approach for extrapolating
30 hazard values across species, especially if a known pathway is involved.

31 That gene sequences are conserved—even between distantly related species—is well known and
32 conservation across species is indicative of an essential function. DNA sequence similarity can, but
33 does not always, reflect a functionally conserved role for the genes in question. Investigations of
34 gene function homology can be approached through interspecies comparisons of various
35 components that affect the phenotype in question. The implicated genes, their sequence variation,
36 and the relevant signaling pathways and tissues (cells, organs, circuits) are all informative. Thus,
37 new approaches to understanding the underlying molecular mechanism can improve our cross-
38 species extrapolation (e.g., see Chen etal. (2007), Jubeaux etal. (2012), and Reaume and
39 Sokolowski(2011)).
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 80
-------
4.3.

1 Empirical dose-response models (e.g., benchmark dose [BMD] models) are widely used in
2 environmental health risk assessment for screening and categorization of toxic substances;
3 determination of toxic potency; determination of a point of departure (POD) for low-dose
4 extrapolation; determination of human exposure guidelines; estimation of risk under specific
5 exposure circumstances; and interpretation of human data. Models that are based on a robust
6 understanding of biological processes, in contrast, are not common. Dose-response models could
7 incorporate data from in vitro studies, human or test animal in vivo studies, or human epidemiology
8 studies. For public and ecosystem health risk assessment, characterizing population-level
9 responses is the goal.

10 Many risk assessments require models that can extrapolate beyond the data set used in developing
11 the model to derive the toxicity values of interest. Such models are called biologically based models.
12 To date, the main biologically based models used in risk assessment are physiologically based
13 toxicokinetic (PBTK) models that simulate the toxicokinetic behavior of a chemical (i.e., the internal
14 disposition of the chemical in the body following a given dosing regimen). Only a few examples of
15 physiologically based toxicodynamic (PBTD) models are available to characterize the "response"
16 side of the dose-response curve. Well-developed and adequately tested PBTK models are currently
17 used in risk assessment to simulate the toxicokinetics of a chemical or chemicals across dosing
18 regimens (duration, amounts, delivery rate, routes) and species, or from in vitro regimens to in vivo
19 doses (IVIVE).

20 The establishment of human exposure guidelines for environmental agents involves determining a
21 POD on the dose-response curve, such as a particular response level on a BMD model estimate of
22 the dose-response, corresponding to a specified increase in risk usually in the 5% to 10% range, or
23 signal-to-noise-crossover dose introduced by Sand etal. (2011). This POD is then further reduced
24 by adjustment factors to derive a level of exposure that is considered to be protective of human
25 health and the environment. The National Research Council (2009) suggests an integrated
26 approach to the establishment of human exposure guidelines using adjustment factors applied to
27 the POD, where the magnitude of the factor depends on the "expected" behavior of the exposure-
28 response curve at low levels of exposure. The NRC also examined the influence of background
29 exposures and background disease rates on the shape of the exposure-response curve at low levels
30 of exposure.

31 Characterizing the expected response at low exposure levels (i.e., those that the public is most likely
32 to encounter) is another of the great challenges to previous methods used in risk assessment,
33 specifically the use of relatively high-dose in vivo animal assays as the source of data for apical
34 endpoints because the spectrum of adverse effects might be quite different at lower doses. The NRC
35 (2007) recommended developing new approaches and models to generate the data needed for
36 characterizing dose-response curves and to improve estimates especially at doses applicable to
37 likely human exposures. Examples of some new approaches to dose-response modeling are
38 described in Burgoon and Zacharewski (2008), Parham et al. (2009), and Zhang et al. (2010). The
39 application of sensitive HTS assays for pathway perturbations that directly measure biological
40 effects at environmental exposure levels are described in Rotroff etal. (2010) and Wetmore etal.
41 (2012). The reduced cost of HTS assays relative to mammalian toxicity tests might also permit the
42 use of a much broader range of exposure levels, leading to a more detailed description of dose-
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 81
-------
1 response relationships throughout the exposure range of interest. Figure 25 summarizes the
2 automated dose-response modeling approach proposed by Burgoon and Zacharewski (2008).

3 A new class of biologically based models called "virtual models" is also being developed to simulate
4 normal biology and to predict how chemical perturbations might lead to adverse effects (i.e., to
5 predict a chemical's toxicodynamics) based on knowledge of potential mechanisms. Examples of
6 virtual models being developed at various levels of biological organization or function include
7 (1) the PhyMomePjo^SSt (Physiome Project 2013), a major resource and model repository of
8 hundreds of physiology models (Hunter et al. 2002); (2) the European Virtual Physiological Human
9 (VPH) project (Hunter et al. 2010); (3) HumMod, a whole-body integrated human physiology model
10 (Hester et al. 2011); (4) Virtual Cell (V-Cell), a spatially realistic quantitative model of intracellular
11 dynamics (Moraru etal. 2008); (5) EPA's Virtual Embryo™ (v-Embryo) project, a suite of models
12 that simulate normal development leading to the formation of blood vessels, limb-buds,
13 reproductive systems, and eye and neural differentiation (Knudsen etal. 2011, Knudsen and
14 DeWoskin 2011); (6) EPA's Virtual Liver™ (v-Liver) model that simulates the dynamic interactions
15 in the liver used to translate in vitro endpoints into predictions of low-dose chronic in vivo effects in
16 humans (Shah and Wambaugh 2010); and (7) the yjiMsLlMMLEs^loi^ (German Federal Ministry
17 for Education and Research 2013), a German initiative to develop a dynamic model of human liver
18 physiology, morphology, and function integrating quantitative data from all levels of organization
19 (Holzhutter etal. 2012).
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 82
-------
C©
Model Fitting —
• Identify closest model in clique fQ
• Move the models towards X-
closest member in clique
Best
in Class Model
Identification
Identify closest model across
all ofthe cliques
Save the model
Initialization
•Randomize model parameters
•Assign cliques
Dose Response Data
• From any large-scale study
^^^^ ^^^Puaussia
Feature Initialization
1 Feed algorithm data forone
feature at a time
1 Data validity testing
Iterative
Algorithm
(iterated per
feature)
iss = (Linear, Quadratic,
aussian. Exponential. Sigmoidafr
Best Overall Model
•Weighted vote identifies best
model across all classes
1 Save the best overall model
Dose Response Models
• EDn calculation
• POD calculation
• Putative biomarker identification

Mechanistic Insight
1 Functional annotation
1 Potency and point of departure
• Phenotypic anchoring
• Model-based clustering
(y ) Complementary Studies
^^ "Other ligands

Figure 25. Overview of automated dose-response modeling from Burgoon and Zacharewski (2008). Step 1—Dose-
response data from a large-scale study are loaded. Step 2—The application feeds dose-response data for one
feature into the algorithm. Examples of feature data include mRNA, protein, or metabolite levels and enzyme or
binding activities at each dose within a study. Step 3—The application initializes the particle swarm optimization
(PSO) algorithm by randomizing model parameters and assigning cliques. Step 4—The PSO identifies the closest
model in each clique at the end of an iteration, and moves the members of each clique toward that model.
Step 5—This iterative process ends once a best-fit model has been identified, or when all of the iterations have
been used. Steps 3 through 5 are repeated for each model class for the same feature, thus generating best-fit
models for the exponential, Gaussian, quadratic, linear, and sigmoidal classes. Step 6—The best exponential,
Gaussian, quadratic, sigmoidal, and linear models are compared with the best overall model using a weighted vote
method. The model with the smallest Euclidean distance compared with the dose-response data receives the most
votes. Step 7—The application uses the best overall model to calculate EDn and point of departure (POD) values,
used to rank and prioritize putative biomarkers or chemical activities. Step 8—Model-based clusters can provide
additional mechanistic insight by integrating potency and POD data with functional annotation and phenotypic
anchoring. For example, EDn and POD data might generate model-based clusters for lipid metabolism and
transport gene expression that could be associated with the occurrence of hepatic vacuolization and lipid
accumulation. Step 9—Through complementary comparative studies using toxic and nontoxic congeners in
responsive and nonresponsive species across time, data could emerge that differentiate biomarkers of exposure
from toxicity-related responses that can support mechanistically based quantitative risk assessments. Reproduced
with permission from Oxford Journals.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 83
-------
5.

1 The NexGen prototypes presented in this report illustrate new data types and approaches applied
2 to risk assessments for various decision contexts and provide concrete examples for discussion of
3 new approaches in the risk assessment community. The prototypes show how large amounts of
4 data can be synthesized in useful ways, and how the data can increase our understanding of the
5 potential risk posed by chemical exposures, including hazard identification, and dose-response
6 assessment. In addition, the outcomes of the prototype assessments have implications for current
7 challenges in risk assessment, such as evaluating human variability and susceptibility, mixtures,
8 and low-dose responses characterization. Overall, these new data types and approaches appear
9 promising in terms of improved risk assessment and decision support These new approaches are
10 faster and less expensive than traditional approaches provide new insights. Each prototype
11 considered hazard identification, exposure-dose-response, and mechanisms of action or adverse
12 outcome pathways.

5.1.

13 Thousands of chemicals to which humans are exposed have inadequate data for predicting their
14 potential for toxicological effects. Dramatic technological advances in molecular and systems
15 biology, computational toxicology, and bioinformatics, however, have provided researchers and
16 regulators with powerful new public health tools (NRC 2007, 2006). "High-throughput screening
17 techniques are now routinely used in conjunction with computational methods and information
18 technology to probe how chemicals interact with biological systems, both in vitro and in vivo.
19 Progress is being made in recognizing the patterns of response in genes and pathways induced by
20 certain chemicals or chemical classes that might be predictive of adverse health outcomes in
21 humans. However, as with any new technology, both the reliability and the relevance of the
22 approach need to be demonstrated in the context of current knowledge and practice" (Tice et al.
23 2013).

24 In general, two basic approaches are being taken to advance our understanding of the causes and
25 modifiers of human disease risks: top-down and bottom-up (Friend 2013). In general, the top-down
26 approach focuses on developing large-scale network models of disease by sifting through the
27 substantial body of new human molecular clinical and epidemiologic data (e.g., >50,000 omics
2 8 papers per year; zettabytes (1021) of new data), looking for patterns associated with various disease
29 states and environmental or genetic risk factors. In general, the bottom-up approach focuses on
30 using in vitro high- and medium-throughput bioassays to understand alteration in molecular and
31 cellular processes caused by chemical exposures. The top-down approach provides the human
3 2 population biology context, while the bottom-up approach provides experimental support for
33 associations identified in the top-down approach. The approaches are mutually supportive and,
34 when integrated, provide a powerful means to advance risk assessment The various prototypes
35 presented in this document sought to illustrate both approaches. Due to the greater current
3 6 availability of genomic data, the prototypes were heavily biased toward use of gene expression and
37 transcriptomic data (i.e., gene expression levels and factors influencing transcription into proteins).
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 84
-------
1 The results from large and essential areas of research, including epigenomics, are rapidly adding to
2 our knowledge and will be incorporated in more detail in future efforts.30

3 The three sets or tiers of prototypes were intended to explore different aspects of decision context.
4 The primary intent of the first set of chemicals (Tier 3 prototypes) was to verify if and how new
5 data and approaches could be used to inform risk assessment by comparison to robust traditional
6 assessments where risks are generally considered "known." In essence, we attempted to reverse
7 engineer from known answers to verify new approaches, explore value information, and begin to
8 characterize decision rules that could be reasonably applied to chemicals with limited or no
9 traditional data. Secondarily, the Tier 3 prototypes explored how new types of data could expand
10 our understanding of well-studied chemicals. The intent of the Tier 2 prototypes was to explore
11 new types of computational analyses and short-duration in vivo bioassays that are intermediate in
12 terms of required resources and confidence in the data between Tiers 3 and 1, and are suitable for
13 evaluating hundreds to thousands of chemicals. These approaches are relatively uncommon in risk
14 assessment to date but hold much promise. The intent of the Tier 1 prototypes was to explore
15 entirely HT approaches that could be applied to tens of thousands of chemicals, might have the
16 greatest uncertainties, but are the least resource intensive to use.

17 The following eight chemicals or chemical classes and their associated effects were chosen for
18 prototype development:

19 • Tier 3:
20 o Benzene and leukemia (molecular epidemiology),
21 o Ozone and lung inflammation and injury (molecular clinical studies), and
22 o Benzo[a]pyrene (B[a]P, a polycyclic aromatic hydrocarbon (PAH) and liver cancer
23 (molecular clinical studies meta-analyses and in vivo rodent bioassay).
30In terms of top-down approaches, molecular, computational, and systems biology data have
grown phenomenally in recent years, and have informed mechanisms of disease and factors that
alter risks of disease. These data are generally stored in large databases such as ENCODE. Gene
Expression Omnibus fGEOI. and the Comparative Toxicogenomic Database fCTDI and are publicly
available for further analyses. Analyses and meta-analyses of these data are providing new insights
into environmental public health risks. Bioinformatics (computer-assisted approaches) are
necessary to use these new data effectively due to the size of the relevant databases. The polycyclic
aromatic hydrocarbon (PAH) and diabetes prototypes, in particular, illustrate bioinformatic
"knowledge mining" to understand environmentally related disease.

In terms of bottom-up approaches, many new high- and medium-throughput methods have been
and are being developed that facilitate testing and evaluation of chemicals on an unprecedented
scale. In particular, the in vitro evaluations of chemicals with limited or no traditional data are being
enabled. ToxCast™ and Toxicology in the 21st Century [Tox21] provide examples (see Section 3.3).
Tox21 will test ~10,000 chemicals in a few years. New in vivo short-duration (hours to weeks)
exposure paradigms also are emerging that provide new types of data to be used in health
assessments. These paradigms use both nonmammalian (see Section 3.2.2) and mammalian species
(see Section 3.2.3).
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 85
-------
1 • Tier 2:
2 o Chemicals associated with diabetes and obesity ("big data" knowledge mining),
3 o Chemicals associated with thyroid hormone disruption (short-duration in vivo
4 exposure bioassays - alternative species), and
5 o Chemicals associated with cancer (short-duration in vivo exposure bioassays -
6 mammalian).
7 • Tier 1:
8 o Chemicals associated with cancer and noncancer disorders, especially
9 developmental (QSAR) and
10 o Chemicals associated with thyroid hormone disruption (high-throughput in vitro
11 assays).
12 Table 9 provides more information on methods explored in each prototype (Krewski et al. 2013).

5.2.

13 Based on the prototypes provided here and the work of others, new molecular, computational, and
14 systems biology tools likely can better inform risk assessment. Substantial caution in interpretation
15 and use of new information is warranted, however, in large part because our understanding of the
16 science is still evolving, and appropriate data are still scarce. We propose initially to use new
17 methods discussed in this document to: (1) generate hypotheses; (2) screen and rank chemicals for
18 additional research and assessment; and (3) augment understanding of traditional data. Areas of
19 particular promise include improved understanding of relative potency of chemicals to disrupt
20 biologic processes, hazard identification, and mechanisms of disease and disorders; human
21 variability and susceptibility; human relevancy of animal models; and low-dose-response
22 relationships. These future risk assessments ideally would rely on the integration of a variety of
23 new types of data and traditional data, as available. Additional discussion of the lessons learned
24 from the prototypes follows.

25 Systems biology context is key to understanding these new data types and the relationship among
26 various types of data. Network-level understanding is typically more informative than pathway-
27 level understanding, which is usually more informative than individual genes. In general,
28 information on individual genes, in the absence of systems biology-level of understanding, is likely
29 to be inadequate for risk assessment purposes. Information that links molecular events to apical
30 outcomes need not be chemical specific, but can be derived from mechanistic information on
31 disease or from related chemicals. As with any risk assessment, the studies used should be well
32 designed, conducted, and reported; systematic review criteria are necessary in study selection.
33 Characterization of multisource variability is a substantial challenge with new data types because of
34 the sheer amount of data being analyzed and, thus, must be carefully considered. Also, traditional
35 weight-of-evidenee criteria continue to be useful in considering new data types, for example, data
36 from multiple, similar studies are preferred (Krauth et al. 2013). That environment-induced
37 changes in biology are dynamic in nature also should be noted, and these dynamic changes are not
38 well understood.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 86
-------
1 Highlights of the prototypes include:

2 • The effects of human chemical exposures at environmental levels on molecular events were
3 linked to intermediate biological events and apical adverse outcomes using molecular
4 epidemiology, molecular clinical, and environment-wide association studies (e.g., evaluation
5 of NHANES) (EPA 2013, Patel etal. 2013, Devlin 2012, McHale etal. 2012, Patel etal. 2012a,
6 Thayeretal. 2012, Burgoon 2Oil, McHale etal. 2011, Smith, MT etal. 2011). Chemicals
7 evaluated included benzene, ozone, B[a]P (a PAH), metals, and persistent organic pollutants.
8 • The B[a]P and diabetes prototypes illustrated the use of "big data" knowledge mining to
9 identify associations between environmental chemical exposures and disease (Patel et al.
10 2013, Patel etal. 2012a, Burgoon 2011). The chemicals of concern for diabetes identified
11 using knowledge mining also were identified in a review of traditional literature by experts
12 (Thayer et al. 2012). This powerful, relatively new technique has not been used extensively in
13 environmental risk assessment, although it is commonly used in other areas of biology.
14 Knowledge mining is particularly useful in developing a broad understanding of potential
15 mechanisms of action, factors that may cause or modify disease risks, and human variability
16 and susceptibility.
17 • Short-duration exposures coupled with new molecular and computational approaches appear
18 to provide additional insights into potential environmental risks. Use of both alternative
19 species and mammalian species in these new experimental models is explored. These models
20 are faster and less expensive than the molecular epidemiology and molecular clinical studies
21 noted above. Furthermore, unlike the QSAR and high-throughput (HT) models noted below,
22 these models address intact metabolism and cell, and tissue interactions and can be used to
23 study more complex outcomes such as developmental and neurobehavioral outcomes. In the
24 case of alternative species, these models can detect effects over the entire lifespan of the
25 organism and to population dynamics. These models have been used successfully to describe
26 mechanisms, explore complex mechanistic behaviors, describe hazards, and evaluate
27 chemical potency. Confidence in the data also generally lies between Tier 3 and Tier 2
28 approaches (Perkins et al. (2013), Thomas RS et al. (2013a), and Padilla et al. (2012).
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 87
-------
Table 9. Prototype Use of New Scientific Tools and Techniques (Krewski et al. 2013)
Scientific Tools
Used in Specific Prototypes
ler 1: Screening and Pnontization
Endocrine Disrupter
Cancer &
Hydrocarbon
Tier 2: Limited Scope Assessments

Cancer &
Reproduct1'"
Developme
Tier 3: Major Scope
Assessments
Lung Injury
Leukemia
& Benzene

^^£^^1 ^^^^^^ffl
W^^^^^^^^^^^M
tiazaras (^non-
Duration In Vivo
Exposure Bioassays)
^/iv7'7fcf3y7ffJJ?;/i7ff^/B
•oiecuiar
smiology)
Hazard Identification and Dose-Response Estimation Methods
Quantitative structure-activity
relationships
High-throughput in vitro assays
High-content omic assays
Molecular and genetic
population-based studies
Biomarkers of effect
Pathway/network analyses
•

•
•
•

•

m
m
m
m
m

•
•
•
•
•

•
•
•
•
•
Dosimetry and Exposure Assessment Methods
In-vitro-to-in-vivo extrapolation
Pharmacokinetic models and
dosimetry
Biomarkers of exposure
•

•
•

m
m
m

•
•

•
•
Cross-cutting Disciplines
Bioinformatics/
computational biology
Functional genomics
Systems biology
•

•

m
m
m
m
m
m
•
•
•
•
•
•
September 2013
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
-------
1 • QSAR models (Venkatapathy and Wang 2013, Goldsmith et al. 2012, 2012a, Wang, N et al.
2 2012b) and HT in vitro bioassays are being used to rapidly evaluate a wide array of chemicals
3 (Judsonetal. 2013, 2011, Sipes etal. 2013,Tice etal. 2013, Kavlocketal. 2012, Rusynetal.
4 2012). "These tools can probe chemical-biological interactions at fundamental levels,
5 focusing on the molecular and cellular pathways that are targets of chemical disruption"
6 (Kavlocketal. 2012). Thousands of chemicals are currently being evaluated, particularly in
7 the ToxCast and Tox21 programs. Both estimates of potency and insights into potential
8 hazards are being generated. Additionally, tools exist to relate in vitro concentration to
9 potential human exposure levels (reverse dosimetry) (Wetmore et al. 2013, Wetmore et al.
10 2012, Rotroff et al. 2010, Hubal 2009). Although directly correlating in vitro findings to risks
11 of human disease is difficult, these QSAR and HT methods provide powerful new tools for
12 screening and ranking large numbers of chemicals for further evaluation and assessment, as
13 well as exploring underlying mechanisms of toxicity, and evaluating human variability in
14 response to chemical exposures (Lock et al. 2012).

15 Thomas RS et al. (2013a) propose a framework for incorporating these new technologies into
16 toxicity testing and risk assessment in an integrated fashion. The first steps proposed are to use
17 in vitro assays to separate chemicals based on their relative selectivity in interacting with biological
18 targets and to identify the concentration at which these interactions occur. Dosimetry modeling
19 converts in vitro concentrations into external dose for calculation of the point-of-departure (POD)
20 and comparisons to human exposure estimates to yield a margin of exposure (MOE). The second
21 step involves short-term in vivo studies, expanded pharmacokinetic evaluations, and refined human
22 exposure estimates, thus increasing confidence in the evaluation. The third step represents the
23 traditional animal studies currently used to assess chemical risks. A significant percentage of
24 chemicals evaluated in the first two tiers could be eliminated from further testing based on their
25 MOE. Additionally, at each step, information might be suitable for supporting some types of Agency
26 decision-making. The framework provides a risk-based and animal-sparing approach for evaluating
27 chemicals using technological advances to increase efficiency.

28 In addition to informing hazard identification and dose-response, new data types and methods have
29 the potential to inform recurrent, challenging risk assessment issues.

30 Experimental Low Dose Data vs. Low Dose Extrapolation - Dose-dependent molecular changes
31 associated with adverse outcomes can be observed at environmental concentrations. Thus, these
32 new approaches can provide experimental data to help characterize dose-response relationships at
33 concentrations where responses, to date, have often been only inferred. Both assay methods and
34 statistical analyses must demonstrate sufficient sensitivity to be considered informative. Observed
35 molecular changes include changes in both magnitude and character, reflecting underlying
36 alterations in biology with increasing dose and time. Biological processes that are consistently
37 observed across the exposure range of interest are likely to be the most useful as biomarkers of
38 exposure and effect. Elucidating the meaning of these dynamic changes in terms of risk will be
39 challenging.

40 Variability and Susceptibility - New data and methods can enhance our ability to understand
41 variability in response and the identification of potentially susceptible populations. Human cells
42 from various individuals (e.g., 1000 Genome Project) evaluated in in vitro high-throughput models
43 provide an avenue for understanding responses across subsets of the human population (Lock et al.
44 2012). Data mining and bioinformatics analyses will facilitate the identification of susceptible
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 89
-------
1 populations and underlying sources of variability by combing existing molecular epidemiology and
2 clinical databases. In all, this work can provide quantitative data, which to date have been generally
3 lacking, to support more accurate estimates of human variability and identification of susceptible
4 populations.

5 Evaluation of the Effects of Multiple Stressors - The ability to map mechanism of disease and
6 adverse outcome pathways disrupted by various environmental agents gives us new tools for
7 understanding the interactions of multiple environmental stressors, including chemical mixtures
8 and lifestyle factors.

9 Certain caveats that apply generally to use of new data types in risk assessment deserve mention.

10 • Cell type, tissue, individual, subpopulation, species, and test system can alter how specific
11 omics are expressed as traditional intermediate and apical outcomes, even when the
12 molecular signature is the same. This is likely due, at least in part, to epigenomic differences
13 and genomic plasticity. This issue should be considered, as feasible, in data interpretation.
14 • The metabolism of many chemicals often plays an important role in toxicity. That most HT
15 test systems are not metabolically competent is important to consider. Various approaches to
16 the issue of in vitro metabolism are being evaluated; however, this currently remains a
17 complicating factor in most in vitro testing.
18 • Molecular profiles appear time-dependent, that is, they evolve over time with continued
19 exposure and post-exposure. This can confound prediction of outcomes or disease outcomes
20 based on "snapshots" in time of biological events. Fortunately, however, at least some
21 signatures appear to stabilize over time and can serve as reliable indicators of chronic
22 outcomes.
23 • Currently, studying multiple molecular processes (i.e., genomics, transcriptomics, proteomics,
24 and epigenomics) in a single study is relatively rare, primarily due to expense. This lack of
25 biological integration limits our understanding.
26 • Due primarily to experimental design and reporting issues (see B[a]P [a PAH] prototype),
27 adequate data from the open literature to support risk assessment activities currently are
28 available for few chemicals. This underscores the importance of high-quality research and
29 testing programs like ToxCast™ and Tox21 and systematic review of data.
30 • Data reproducibility and false negative rates may remain a potential limitation of high
31 throughput screening and high content assays (e.g., toxicogenomics). The false negative rate
32 (i.e., calling a chemical non-toxic when it is) tends to decrease as an increasing number of
33 independent replicates are used. Successful screening programs need low false negative rates,
34 while balancing their efficiencies (i.e., cost, time, throughput).
35 The challenge is to use what we know today wisely, with the understanding that biological
36 knowledge is evolving very rapidly, and likewise, risk assessment also will need to evolve.

53.

37 Throughout this report, examples are provided that illustrate how new types of data might be used
38 to improve risk assessment. Table 10 summarizes (1) various decision context examples common
39 at EPA; (2) a "toolbox" of various NexGen methodologies that could provide data to support each
40 decision context; (3) types of "fit for purpose" toxicity values that might be derived from new data
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 90
-------
1 types or traditional data; and (4) assessment products in which molecularly or computationally
2 informed toxicity values could be used. Although all approaches can be used in any type of
3 assessment, any one of the health data approaches listed in Tiers 1 and 2 could provide a minimum
4 data set. In this scheme, Tier 1 is primarily QSAR or HT data-driven. Tier 2 is high-content or
5 traditional data-driven (in addition to Tier 1 data, if available). Tier 3 will continue to be traditional
6 data driven but could be augmented by molecular, computational, and systems biology data if the
7 data are available, of sufficient quality, and substantively useful.

Table 10. Problem Formulation Table
Examples
Toolbox of
Possible
Approaches
Possible Types
of Toxicity
Values
Health
Assessment
Categories
Exposure
Assessment
Tierl: Screening and
Prioritization
• Emergency response
• Unregulated drinking water
chemicals identification
• Potential emerging chemical
problems or opportunities
• Research directions
• QSAR
• High-throughput (HT)
Screening Assays
• Computational Toxicology
Models
• No Traditional Data
• Automated Data Integration
High-Throughput Toxicity Values
Prioritized Chemicals of Concern
List; Screening Values
Physical-Chemical Surrogates
Tier 2: Limited Scope
Assessments
• National Air Toxics Assessment
* Superfund listing/removal
actions
• Drinking Water Health
Advisories

• High-content Assays
>• Knowledge Mining
> Short Duration In Vivo
Exposure Paradigms3
• Limited Traditional Datab
• Automated Data Integration

High-content Toxicity Values
Provisional Toxicity Values
Limited Exposure Data
Tier 3: Major Scope
Assessments
National Regulatory Decisions
International, Tribal, State, &
Local Technical Support
Molecular Biology Data
Systems Biology Data
All Policy Relevant Data
Hand-Curated Data
Integration
Molecularly Informed
Traditional Type Values

IRIS or ISA
Extensive Exposure Data
' Both alternative and mammalian species paradigms.
' Potentially not chemical-specific data but rather disease or chemical class data.
8 Integration of information from multiple data types is preferred, but all types of data shown for any
9 tier might not be available or of sufficient quality for inclusion in an assessment. Systematic review
10 criteria are being established and are discussed in section 3.1.3 (McConnell and Bell 2013).
11 Stakeholder input and external peer review will be solicited for new approaches to risk assessment

12 Systems biology understanding is a fundamental aspect of the weight-of-evidence evaluation. As
13 one progresses from Tier 1 to Tier 3 assessments, the weight of evidence increases; however, the
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 91
-------
1 resources to generate the assessments also increases. For example, in Tier 1, toxicity values can be
2 generated solely from extant QSAR data, a process that can be fully automated to be very quick and
3 cost-efficient for a large number of chemicals. Wignall et al. (2013)(SOT poster abstract; manuscript
4 in progress) describe an approach to generate toxicity values for chemicals with limited
5 experimental data using a combination of QSAR, regression, and hybrid modeling (Rusyn et al.
6 2012), and incorporating Organization of Economic Co-operation and Development (OECD)
7 principles for model building and external cross-validation. Tier 2 type assessments, ideally, enable
8 the use of more types of data to inform our understanding of data-limited chemicals. For Tier 2, EPA
9 is beginning to develop high-content toxicity values on a trial basis. High-content toxicity values can
10 be developed using bioinformatic approaches based on data that can be machine read and, hence,
11 readily mined and analyzed. Additional data integration and hand curation might be needed to use
12 available data resources, such as high-throughput screening (HTS)/high-content screening (HCS)
13 assays, alternative species testing, and study data compiled the European Union's Registration,
14 Evaluation, Authorisation and Restriction of Chemicals (REACH) regulation. Tier 3 assessments will
15 continue to be driven by traditional data, but new data types could provide new insights into
16 difficult issues such as low dose-response, human variability and susceptibility, and the effects of
17 multiple environmental stressors. These various "fit for purpose" assessment types can be used to
18 develop hypotheses, screen chemicals, mechanistically fingerprint toxicants, set priorities, and
19 inform hazards, relative potencies, and risks.

i. , ••

6.1. ;

20 Novel data streams and approaches are rapidly emerging that present opportunities for informing
21 and supporting human health risk assessment, but challenges remain. Four key challenges are the
22 need for (1) the ability to predict metabolism of test compounds, (2) improved understanding of
23 the biology from a systems perspective; (3) evaluated methods to measure key aspects of biological
24 space across multiple levels of organization; and (4) the knowledge infrastructure to ensure
25 availability of relevant data. Future directions include filling these scientific gaps and continuing to
26 build the framework for incorporating new information fit-for-purpose into assessments to support
27 a range of decisions to promote health, protect the environment, and manage risks.

28 Arguably, the greatest challenge is posed by the need to consider and evaluate complex interactions
29 of chemical and biological systems to predict potential for health risks. Systems biology provides an
3 0 approach for investigating emergent properties in complex chemical-biological systems by probing
31 how changes in one part affect the others, and the behavior of the whole. New data types are
32 providing required information to develop these predictive models.

3 3 There is an imbalance, however, in the sophistication of methods available and the resolution of
34 data being developed to evaluate impacts of chemical perturbations and to discover mechanistic
3 5 commonalities. Large amounts of network or high-throughput screening/high-content data can be
36 collected to measure effects at the molecular level. Substantial information is also available on
37 disease outcomes, yet only very sparse data are being generated on intermediate events. A similar
3 8 lack of exposure information commensurate with hazard data is also evident Even given the rich
39 data coming from implementation of the high-throughput (HT) toxicity testing schemes, gaps in
40 coverage for key endpoints occur, and thus, developing and incorporating assays are needed to fill
41 gaps in the biology required to assess potential for the full range of adverse outcomes required by
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 92
-------
1 risk assessors. This discrepancy in available data across levels of biological organization should
2 narrow over time, as methods continue to advance and as more metabolomics data for biomarkers
3 of effects and exposure are made available. This will lead to development of models that predict
4 disease outcomes with greater certainty from initiating events in individuals and populations
5 relative to exposures likely to be experienced in the real environment.

6.2.

6 Future plans for facilitating use of new data types and tools to support the full range of risk-based
7 assessments and decisions include addressing needs for validated testing schemes and clearly
8 articulating decision considerations for incorporating results of these analyses. In addition, further
9 prototypes or case examples for incorporating HT toxicity data and other novel data types to inform
10 risk assessment are required to demonstrate the added value of these advanced tools and to
11 identify further the most significant scientific gaps.

12 Validation of HT toxicity testing schemes will be necessary if the data developed using these
13 methods are to be used to inform risk-based decisions and to support efficient chemical risk
14 assessments. The key to moving the wealth of information being generated through research efforts
15 such as ToxCast™ and Toxicology in the 21st Century (Tox21) is to develop a framework for
16 validating HT toxicity testing schemes to support specific chemical evaluation objectives.
17 Traditional "validation" schemes designed to evaluate conventional assay and testing structures do
18 not adequately address this gap and would take years to implement. As the technology for
19 providing rapid, efficient, robust hazard and effects data continues to advance, the validation
20 process for evaluating these new methods is also expected to undergo a transformation to provide
21 fit-for-purpose confidence in results. Future incorporation of new types information to improve the
22 scientific basis and efficiency of risk assessment requires clear articulation of decision
23 considerations for using new types of data and methods. Some of these decision considerations
24 might have standard principles supported by a broad range of risk managers and stakeholders,
25 while others will need to be fit-for-purpose. Early consideration of these decision considerations
2 6 has been initiated and plans are in place to develop criteria for systematic review of new types of
2 7 data, disease signatures, adequate weight of evidence for use in risk assessment, and new
28 approaches for risk assessment

29 Demonstrating approaches for incorporating new molecular biology data and evaluating advanced
30 methods might be facilitated by additional case examples and prototypes. Conducting a variety of
31 case studies focused on using the HT toxicity data from ToxCast™ and Tox21, in combination with
32 other chemical-specific information to improve efficiency of risk-based decisions where little
33 traditional toxicity data are available, will be important for assessing the value added of these new
34 datatypes.

35 Examples also will be identified where molecular biology data can be considered for Tier 3
3 6 assessments to augment traditional assessment methodologies. These will provide opportunities to
3 7 solicit public comment and peer review.

3 8 Opportunities also exist for using new data types to guide development of NexGen approaches by
39 considering prototypes for how this information could support some of the most challenging
40 questions faced by risk managers. Population-level risks could be considered using both traditional
41 and molecular biology data, with an additional emphasis on epigenomics and influences of broadly
42 defined environmental factors. Additional insights for risk managers can be found in Crawford-

This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 93
-------
1 Brown (2013). Application of new methods might better inform our understanding of the combined
2 effects of multiple stressors, such as multiple chemical exposures, diet, stress, and pre-existing
3 disease. In recognition of the tremendous potential for these new methods and data types to
4 support risk assessment, the EPA Office of Research and Development will continue to elaborate the
5 NexGen framework, and begin to develop toxicity values informed by new biology for specific risk
6 assessment purposes.

7 • EPA's Office of Research and Development will work with EPA's Program Offices using Tier 1
8 screening and prioritization approaches to queue up new assessments. Results from this work
9 will be used to feed back into the testing paradigm for its refinement.
10 • Toxicity values informed by new types of knowledge will be developed in each tier to address
11 needs from screening chemicals for future testing to assessment for potency or category of
12 adverse effect.
13 • Levels of confidence in those values will be characterized depending on the types and quality
14 of the supporting data.

15 EPA's Office of Research and Development will expand stakeholder discussion and the community
16 of practice with regard to the use of new data types and methods in risk assessment, and the peer
17 review of new methods. New assessments will receive public comment and peer review.

18 Finally, EPA's Office of Research and Development will continue working with other national and
19 international agencies involved in assessment, testing, and research to coordinate and harmonize
20 activities, and improve data collection, analyses, curation, sharing, and warehousing.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 94
-------
7. References

Abdo N, Xia M, Kosyk 0, Huang R, Sakamuru S, Austin CP, et al. (2012). The 1000 genomes toxicity screening project:
Utilizing the power of human genome variation for population-scale in vitro testing. 51st Annual Meeting of the
Society of Toxicology Poster Abstract #765. http://www.toxicology.org/AI/Pub/Tox/2012Tox.pdf.

Abecasis GR, Auton A, Brooks LD, DePristo MA, Durbin RM, Handsaker RE, et al. (2012). An integrated map of genetic
variation from 1,092 human genomes. Nature 491: 56-65. http://www.ncbi.nlm.nih.gov/pubmed/23128226.

Afridi HI, Kazi TG, Kazi N, Jamali MK, Arain MB, Jalbani N, et al. (2008). Evaluation of status of toxic metals in biological
samples of diabetes mellitus patients. Diabetes Research & Clinical Practice 80: 280-288.
http://www.ncbi.nlm.nih.gov/pubmed/18276029.

Andersen S, Pedersen KM, Bruun NH, Laurberg P. (2002). Narrow individual variations in serum T(4) and T(3) in
normal subjects: A clue to the understanding of subclinical thyroid disease. Journal of Clinical Endocrinology &
Metabolism 87:1068-1072. http://www.ncbi.nlm.nih.gov/pubmed/11889165.

Angerer J, Bird MG, Burke TA, Doerrer NG, Needham L, Robison SH, et al. (2006). Strategic biomonitoring initiatives:
moving the science forward. Toxicological Sciences 93: 3-10. http://www.ncbi.nlm.nih.gov/pubmed/16785253.

Ankley GT, Bennett RS, Erickson RJ, Hoff DJ, Hornung MW, Johnson RD, et al. (2010). Adverse outcome pathways: A
conceptual framework to support ecotoxicology research and risk assessment. Environmental Toxicology and
Chemistry 29: 730-741.
http://service004.hpc.ncsu.edu/toxicology/websites/iournalclub/linked files/FalllO/Environ%20Toxicol%20Ch
em%202010%20Ankley.pdf.

Ankley GT, Gray LE. (2013). Cross-species conservation of endocrine pathways: A critical analysis of tier 1 fish and rat
screening assays with 12 model chemicals. Environmental Toxicology and Chemistry.
http://www.ncbi.nlm.nih.gov/pubmed/23401061.

Arnot JA, Mackay D TU Canadian Environmental Modelling Centre. (2007). Risk Prioritizationfor a Subset of Domestic
Substance List Chemicals Using the RAIDAR Model.

Arnot JA, Mackay D, Parkerton TF, Zaleski RT, Warren CS. (2010a). Multimedia modeling of human exposure to
chemical substances: The roles of food web biomagnification and biotransformation. Environmental Toxicology
and Chemistry 29: 45-55.

Arnot JA, Mackay D, Sutcliffe R, Lo B. (2010b). Estimating farfield organic chemical exposures, intake rates and intake
fractions to human age classes. Environmental Modelling & Software 25:1166-1175.

ATSDR (Agency for Toxic Substances and Disease Registry). (2007). Toxicological profile for benzene. Retrieved from
http://www.atsdr.cdc.gov/toxprofiles/tp3.pdf (accessed March 25,2013).

Auerbach SS, Shah RR, Mav D, Smith CS, Walker NJ, Valiant MK, et al. (2010). Predictingthe hepatocarcinogenic
potential ofalkenylbenzene flavoring agents using toxicogenomics and machine learning. Toxicology and Applied
Pharmacology 243: 300-314. http://dx.doi.0rg/10.1016/i.taap.2009.ll.021.

Baker M. (2010). Epigenome: Mapping in motion. Nature Methods 7:181-186.

Basketter DA, Clewell H, Kimber I, Rossi A, Blaauboer B, Burrier R, et al. (2012). A roadmap for the development of
alternative (non-animal) methods for systemic toxicity testing -14 report*. Alternatives to Animal Experiments 29:
3-91. http://www.ncbi.nlm.nih.gov/pubmed/22307314.

Behl M, Rao D, Aagaard K, Davidson TL, Levin ED, Slotkin TA, et al. (2013). Evaluation of the association between
maternal smoking, childhood obesity, and metabolic disorders: a national toxicology program workshop review.
Environmental Health Perspectives 121:170-180. http://www.ncbi.nlm.nih.gov/pubmed/23232494.

Bell RR, Early JL, Nonavinakere VK, Mallory Z. (1990). Effect of cadmium on blood glucose level in the rat. Toxicology
Letters 54:199-205. http://www.ncbi.nlm.nih.gov/pubmed/2260118.

Belson M, Kingsley B, Holmes A. (2007). Risk factors for acute leukemia in children: a review. Environmental Health
Perspectives 115:138-145. http://www.ncbi.nlm.nih.gov/pubmed/17366834.

Berg EL, Yang J, Melrose J, Nguyen D, Privat S, Rosier E, et al. (2010). Chemical target and pathway toxicity
mechanisms defined in primary human cell systems. Journal of Pharmacological and Toxicological Methods 61: 3-
15. http://www.ncbi.nlm.nih.gov/Dubmed/19879948.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 95
-------
Bhattacharya S, Zhang Q, Carmichael PL, Boekelheide K, Andersen ME. (2011). Toxicity testing in the 21 century:
defining new risk assessment approaches based on perturbation of intracellular toxicity pathways. Public Library
of Science One 6: e20887. http://www.ncbi.nlm.nih.gov/pubmed/21701582.

Birnbaum LS. (2012). NIEHS and the future of toxicology. Presentation. FutureTox Meeting of the Society of Toxicology
October 18-19. Arlington, VA.

Birney E. (2012). The making of ENCODE: Lessons for big-data projects. Nature 489: 49-51.
http://www.ncbi.nlm.nih.gov/pubmed/22955613.

Blakesley V, Awni W, Locke C, Ludden T, Granneman GR, Braverman LE. (2004). Are bioequivalence studies of
levothyroxine sodium formulations in euthyroid volunteers reliable? Thyroid 14:191-200.
http://www.ncbi.nlm.nih.gov/pubmed/15072701.

Bleicher KH, Bohm HJ, Muller K, Alanine AI. (2003). Hit and lead generation: beyond high-throughput screening.
Nature Reviews Drug Discovery 2: 369-378.
http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list uids= 12750740.

Blount BC, Valentin-Blasini L, Osterloh JD, Mauldin JP, Pirkle JL. (2007). Perchlorate exposure of the US Population,
2001-2002. Journal of Exposure Science and Environmental Epidemiology 17: 400-407.
http://www.ncbi.nlm.nih.gov/pubmed/17051137.

Boekelheide K, Blumberg B, Chapin RE, Cote I, Graziano JH, Janesick A, et al. (2012). Predicting later-life outcomes of
early-life exposures. Environmental Health Perspectives 120:1353-1361. http://dx.doi.org/10.1289/ehp.1204934.

Bollati V, Baccarelli A. (2010). Environmental epigenetics. Heredity (Edinb) 105:105-112.
http://www.ncbi.nlm.nih.gov/pubmed/20179736.

Borgelt C. (2013). Software for frequent pattern mining. Retrieved from http://www.borgelt.net/fpm.html (accessed
March 20,2013).

BOSC (Board of Scientific Counselors). (2010). Review of the EPA Office of Research and Development's (ORD)
Computational Toxicology Research Program [CTRP). Washington, DC. Retrieved from
http://www.epa.gov/osp/bosc/pdf/ctoxl004rpt.pdf (accessed February 26,2013).

Bosson J, Stenfors N, Bucht A, Helleday R, Pourazar J, Holgate ST, et al. (2003). Ozone-induced bronchial epithelial
cytokine expression differs between healthy and asthmatic subjects. Clinical and Experimental Allergy 33: 777-
782. http://www.ncbi.nlm.nih.gov/pubmed/12801312.

Brown AR, Bickley LK, Le Page G, Hosken DJ, Paull GC, Hamilton PB, et al. (2011). Are toxicological responses in
laboratory (inbred) zebrafish representative of those in outbred (wild) populations? - A case study with an
endocrine disrupting chemical. Environmental Science and Technology 45: 4166-4172.
http://www.ncbi.nlm.nih.gov/pubmed/21469706.

Bucher JR, Thayer K, Birnbaum LS. (2011). The Office of Health assessment and translation: A problem-solving
resource for the National Toxicology Program. Environmental Health Perspectives 119: A196-197.
http://www.ncbi.nlm.nih.gov/pubmed/21531652.

Burgoon LD. (2011). Potential of Genomic Data on PAHs to Inform Cumulative Assessment. National Academy of
Sciences Meeting: Mixtures and Cumulative Risk Assessment. http://nas-
sites.org/emergingscience/meetings/mixtures/workshop-presentations-mixtures/.

Burgoon LD, Zacharewski TR. (2008). Automated quantitative dose-response modeling and point of departure
determination for large toxicogenomic and high-throughput screening data sets. Toxicological Sciences 104: 412-
418. http://www.ncbi.nlm.nih.gov/pubmed/18441342.

Bush WS, Moore JH. (2012). Chapter 11: Genome-wide association studies. Public Library of Science Computational
Biology 8.

Chadwick LH. (2012). The NIH roadmap epigenomics program data resource. Epigenomics 4: 317-324.
http://www.ncbi.nlm.nih.gov/pubmed/22690667.

Chen J, Blackwell TW, Fermin D, Menon R, Chen Y, Gao J, et al. (2007). Evolutionary-conserved gene expression
response profiles across mammalian tissues. OMICS 11: 96-115.
http://www.ncbi.nlm.nih.gov/pubmed/17411398.

Chen Y, Jin JY, Mukadam S, Malhi V, Kenny JR. (2012). Application of IVIVE and PBPK modeling in prospective
prediction of clinical pharmacokinetics: Strategy and approach during the drug discovery phase with four case
studies. Biopharmaceutics & Drug Disposition 33: 85-98. http://www.ncbi.nlm.nih.gov/pubmed/22228214.

This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 96
-------
Chiu WA, Euling SY, Scott CS, Subramaniam RP. (2010). Approaches to advancing quantitative human health risk
assessment of environmental chemicals in the post-genomic era. Toxicology and Applied Pharmacology.
http://www.ncbi.nlm.nih.gov/pubmed/20353796.

Chuang HY, Lee E, Liu YT, Lee D, Ideker T. (2007). Network-based classification of breast cancer metastasis. Molecular
Systems Biology 3:140. http://www.ncbi.nlm.nih.gov/pubmed/17940530.

Cohen Hubal EA, Richard A, Aylward L, Edwards S, Gallagher J, Goldsmith MR, et al. (2010). Advancing exposure
characterization for chemical evaluation and risk assessment. Journal of Toxicology and Environmental Health Part
B: Critical Reviews 13: 299-313. http://www.ncbi.nlm.nih.gov/pubmed/20574904.

Collins FS. (2010). Transcript from Newshour PBS interview with Frances Collins (June 24,2010).
http://www.pbs.org/newshour/bb/science/ian-iunelO/genome 06-24.html.

Collins MA. (2009). Generating 'omic knowledge': The role of informatics in high content screening. Combinatorial
Chemistry & High Throughput Screening 12: 917-925. http://www.ncbi.nlm.nih.gov/pubmed/19531005.

Cordell HJ. (2009). Detecting gene-gene interactions that underlie human diseases. Nature Reviews Genetics 10: 392-
404. http://www.ncbi.nlm.nih.gov/pubmed/19434077.

Cortessis VK, Thomas DC, Levine AJ, Breton CV, Mack TM, Siegmund KD, et al. (2012). Environmental epigenetics:
Prospects for studying epigenetic mediation of exposure-response relationships. Human genetics 131: 1565-1589.
http://www.ncbi.nlm.nih.gov/pubmed/22740325.

Cote I, Anastas PT, Birnbaum LS, Clark RM, Dix DJ, Edwards SW, et al. (2012). Advancing the next generation of health
risk assessment. Environmental Health Perspectives 120:1499-1502.
http://www.ncbi.nlm.nih.gov/pubmed/22875311.

Crawford-Brown D. (2013). The role of advanced biological data in the rationality of risk-based regulatory decisions.
Journal of Environmental Protection 4: 238-249.
http://www.scirp.org/journal/PaperInformation.aspx?PaperID=28638.

Crofton KM, Zoeller RT. (2005). Mode of action: neurotoxicity induced by thyroid hormone disruption during
development-hearing loss resulting from exposure to PHAHs. Critical Reviews in Toxicology 35: 757-769.
http://www.ncbi.nlm.nih.gov/pubmed/16417043.

Cronican AA, Fitz NF, Carter A, Saleem M, Shiva S, BarchowskyA, etal. (2013). Genome-wide alteration of histone
H3K9 acetylation pattern in mouse offspring prenatally exposed to arsenic. Public Library of Science One 8:
e53478. http://www.ncbi.nlm.nih.gov/pubmed/23405071.

Crump KS, Chen C, Louis TA. (2010). The future use of in vitro data in risk assessment to set human exposure
standards: challenging problems and familiar solutions. Environmental Health Perspectives 118: 1350-1354.
http://www.ncbi.nlm.nih.gov/pubmed/20562051.

De Coster S, van Larebeke N. (2012). Endocrine-disrupting chemicals: Associated disorders and mechanisms of action.
Journal of Environmental Public Health 2012: 713696. http://www.ncbi.nlm.nih.gov/pubmed/22991565.

Dearry A. (2013). Integrating environmental health data to advance discovery. Presentation. National Academy of
Sciences Meeting: Integrating Environmental Health Data to Advance Discovery January 10-11.
http://www.youtube.com/watch?v=5TIVa2x4x7c&list=PLzsdEyVNFvgyizsegxlcIbLzlglyOIHxl&index=2.

Degitz SJ, Holcombe GW, Flynn KM, Kosian PA, Korte JJ, Tietge JE. (2005). Progress towards development of an
amphibian-based thyroid screening assay using Xenopus laevis. Organismal and thyroidal responses to the model
compounds 6-propylthiouracil, methimazole, and thyroxine. Toxicological Sciences 87: 353-364.
http://www.ncbi.nlm.nih.gov/pubmed/16002479.

Derry JM, Mangravite LM, Suver C, Furia MD, Henderson D, Schildwachter X, et al. (2012). Developing predictive
molecular maps of human disease through community-based modeling. Nature Genetics44:127-130.
http://www.ncbi.nlm.nih.gov/pubmed/22281773.

Devlin RB. (2012). Using Ozone to Validate a Systems Biology Approach to Toxicity Testing. National Academy of
Sciences Meeting: Systems Biology-Informed Risk Assessment.
http://www.youtube.com/watch?v=5TIVa2x4x7c&list=PLzsdEyVNFvgyizsegxlcIbLzlglyOIHxJ&index=2.

Devlin RB, Duncan KE, Jardim M, Schmitt MT, Rappold AG, Diaz-Sanchez D. (2012). Controlled exposure of healthy
young volunteers to ozone causes cardiovascular effects. Circulation 126: 104-111.
http://www.ncbi.nlm.nih.gov/Dubmed/22732313.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 97
-------
Diabetes Genetics Initiative of Broad Institute of Harvard, MIT LU, Novartis Institutes of BioMedical Research, Saxena
R, Voight BF, Lyssenko V, et al. (2007). Genome-wide association analysis identifies loci for type 2 diabetes and
triglyceride levels. Science 316:1331-1336. http://www.sciencemag.org/content/316/5829/1331.abstract.

Dick E, Rajamohan D, Ronksley J, Denning C. (2010). Evaluating the utility of cardiomyocytes from human pluripotent
stem cells for drug screening. Biochemical Society Transactions 38:1037-1045.
http://www.ncbi.nlm.nih.gov/pubmed/20659000.

Duncan KE, Crooks J, Miller DJ, Burgoon L, Schmitt MT, Edwards S, et al. (2013). Temporal profile Of gene expression
alterations in primary human bronchial epithelial cells following in vivo exposure to ozone American Thoracic
Society International Conference: D105 Genetics and Epigenetics of Lung Disease Meeting Abstract A5866.
http://www.atsiournals.0rg/doi/abs/10.1164/airccm-conference.2013.187.l MeetingAbstracts.A5866.

Duncan KE, Dailey LA, Carson JL, Hernandez ML, Peden DB, Devlin RB. (2012). Cultured basal airway epithelial cells
from asthmatics display baseline gene expression profiles that differ from normal healthy cells and exhibit
differential responses to ambient air pollution particles American Journal of Respiratory and Critical Care Medicine
185:A4291.

EC (European Commission). (2010). Main Findings of the Report: Alternative (Non-animal) Methods for Cosmetics
Testing: Current Status and Future Prospects. Institute for Health and Consumer Protection. Retrieved from
http://www.iivs.org/workspace/assets/news-assets/findings cosmetics 2011.pdf (accessed February 22,2013).

EC (European Commission). (2011). 4th Report on the Implementation of the "Community Strategy for Endocrine
Disrupters" a Range of Substances Suspected of Interfering with the Hormone Systems of Humans and Wildlife. (COM
(1999) 706). Brussels. Retrieved from
http://ec.europa.eu/environment/endocrine/documents/sec 2011 1001 en.pdf (accessed March 4,2013).

ECHA (European Chemicals Agency). (2013a). Evaluation Under REACH: Progress Report 2012. Helsinki. Retrieved
from http://echa.europa.eu/documents/10162/13628/evaluation report 2012 en.pdf (accessed March 29,
2013).

ECHA (European Chemicals Agency). (2013b). Proposal for Identification of a Substance as a CMR1A or IB, Pbt, vPvB or
a Substance of an Equivalent Level of Concern. Retrieved from
http://echa.europa.eu/documents/10162/13638/annex xv svhc 4nonylphenol en.pdf (accessed March 28,
2013).

Ein-Dor L, Kela I, Getz G, Givol D, Domany E. (2005). Outcome signature genes in breast cancer: Is there a unique set?
Bioinformatics21:171-178. http://www.ncbi.nlm.nih.gov/pubmed/15308542.

Eisenberg M, Samuels M, DiStefano JJ, 3rd. (2008). Extensions, validation, and clinical applications of a feedback
control system simulator of the hypothalamo-pituitary-thyroid axis. Thyroid 18:1071-1085.
http://www.ncbi.nlm.nih.gov/pubmed/18844475.

El Muayed M, Raja MR, Zhang X, MacRenaris KW, Bhatt S, Chen X, et al. (2012). Accumulation of cadmium in insulin-
producing (3 cells. Islets 4: 405-416.

Ellinger-Ziegelbauer H, Gmuender H, Bandenburg A, Ahr HJ. (2008). Prediction of a carcinogenic potential of rat
hepatocarcinogens using toxicogenomics analysis of short-term in vivo studies. Mutation Research 637: 23-39.
http://dx.doi.0rg/10.1016/i.mrfmmm.2007.06.010.

Emdin SO, Dodson GG, Cutfield JM, Cutfield SM. (1980). Role of zinc in insulin biosynthesis. Some possible zinc-insulin
interactions in the pancreatic B-cell. Diabetologia 19:174-182. http://www.ncbi.nlm.nih.gov/pubmed/6997118.

ENCODE Project Consortium. (2012). An integrated encyclopedia of DNA elements in the human genome. Nature 489:
57-74. http://www.ncbi.nlm.nih.gov/pubmed/22955616.

EPA (U.S. Environmental Protection Agency). (1995). The Use of the Benchmark Dose Approach in Health Risk
Assessment. (EPA/630/R-94/007). Washington, DC. Retrieved from
http://www.epa.gov/raf/publications/pdfs/BENCHMARK.PDF (accessed March 8,2013).

EPA (U.S. Environmental Protection Agency). (2000). Benzene (CASRN 71-43-2) IRIS Summary. II. Carcinogenicity
Assessment for Lifetime Exposure. January 19. Retrieved from http://www.epa.gov/iris/subst/0276.htm
(accessed March 25,2013).

EPA (U.S. Environmental Protection Agency). (2008). Uncertainty and Variability in Physiologically Based
Pharmacokinetic Models: Key Issues and Case Studies. Washington, DC. Retrieved from
httD://ofmpub.epa.gov/eims/eimscomm.getfile?p download id=477286.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 98
-------
EPA (U.S. Environmental Protection Agency). (2009a). Strategic Plan for the Future ofToxicity Testing and Risk
Assessment at the U.S. EPA. Washington, DC: Office of the Science Advisor, Science Policy Council. Retrieved from
http://www.epa.gov/spc/toxicitytesting/docs/toxtest strategy 032309.pdf (accessed February 22,2013).

EPA (U.S. Environmental Protection Agency). (2009b). Virtual Tissues Project. April 21-22. Retrieved from
http://www.epa.gov/ncct/virtual tissues/ (accessed April 2,2013).

EPA (U.S. Environmental Protection Agency). (2010). Advancing the Next Generation (NexGen) of Risk Assessment: The
Prototypes Workshop. Research Triangle Park, NC. Retrieved from
http://www.epa.gov/risk/nexgen/docs/NexGen-Prototypes-Workshop-Summary.pdf (accessed February 22,
2013).

EPA (U.S. Environmental Protection Agency). (2011a). Advancing the Next Generation [NexGen) of Risk Assessment:
Public Dialogue Conference. Washington, DC. Retrieved from http://www.epa.gov/risk/nexgen/docs/NexGen-
Public-Conf-Summary.pdf (accessed February 22.2013).

EPA (U.S. Environmental Protection Agency). (2011b). Framework for an EPA Chemical Safety for Sustainability
Research Program. Washington, DC: Office of Research and Development. Retrieved from
http://www.epa.gov/ord/priorities/docs/CSSFramework.pdf (accessed February 22,2013).

EPA (U.S. Environmental Protection Agency). (2011c). The Incorporation of In Silica Models and In Vitro High
Throughput Assays in the Endocrine Disruptor Screening Program (EDSP)forPrioritization and Screening.
Washington, DC. Retrieved from
http://www.epa.gov/endo/pubs/edsp21 work plan summary%20 overview final.pdf (accessed February 22,
2013).

EPA (U.S. Environmental Protection Agency). (2011d). Integrated Science Assessment of Ozone and Related
Photochemical Oxidants (Final Report). Washington, DC. Retrieved from
http://cfpub.epa.gov/ncea/isa/recordisplay.cfm?deid=247492 (accessed March 28,2013).

EPA (U.S. Environmental Protection Agency). (2011e). Use of'Omic" Technology to Inform the Risk Assessment, Support
Document for Case Study: Propiconazole. Appendices A and B. Washington, DC: Federal Insecticide, Fungicide, and
Rodenticide Act Scientific Advisory Panel (FIFRA SAP). Retrieved from
http://www.regulations.gov/#!documentDetail:D=EPA-HO-OPP-2011-0284-0003 (accessed February 22,2013).

EPA (U.S. Environmental Protection Agency). (2012a). Advancing the Next Generation (NexGen) of Risk Assessment.
July 31. Retrieved from http://www.epa.gov/risk/nexgen/ (accessed March 21,2013).

EPA (U.S. Environmental Protection Agency). (2012b). Basic Information About Benzene in Drinking Water. Retrieved
from http://water.epa.gov/drink/contaminants/basicinformation/benzene.cfm (accessed March 25,2013).

EPA (U.S. Environmental Protection Agency). (2012c). Computational Toxicology Research. Retrieved from
http://www.epa.gov/ncct/ (accessed February 22,2013).

EPA (U.S. Environmental Protection Agency). (2012d). Computational Toxicology Research: Overview Materials for EPA
Science Advisory Board Exposure & Human Health Committee. Washington, DC. Retrieved from
http://yosemite.epa.gov/sab/sabproduct.nsf/AEACDCDADEDFAAF3852579F2005A4C5A/$File/EPA+CompTox+
SAB+Materials+4-26-2012+v4.pdf (accessed February 22,2013).

EPA (U.S. Environmental Protection Agency). (2012e). Endocrine Disruptor Screening Program (EDSP). November 27.
Retrieved from http://www.epa.gov/endo/ (accessed March 4,2013).

EPA (U.S. Environmental Protection Agency). (2013). Sustainable futures: Models & methods. February 6. Retrieved
from http://www.epa.gov/oppt/sf/tools/methods.htm (accessed March 22, 2013).

Eskenazi B, Bradman A, Gladstone EA, Jaramillo S, Birch K, Holland N. (2003). CHAMACOS, A longitudinal birth cohort
study: Lessons from the fields. Journal of Children's Health 1: 3-27.
http://ehs.sph.berkeley.edu/holland/publications/files/Eskenazi2003.pdf

FangX, Bai C, WangX. (2012). Bioinformatics insights into acute lung injury/acute respiratory distress syndrome.
Clinical and Translational Medicine 1: 9. http://www.ncbi.nlm.nih.gov/pubmed/23369517.

Fielden MR, Adai A, Dunn RT, Olaharski A, Searfoss G, Sina J, et al. (2011). Development and evaluation of a genomic
signature for the prediction and mechanistic assessment of nongenotoxic hepatocarcinogens in the rat.
Toxicological Sciences 124: 54-74. http://dx.doi.org/10.1093/toxsci/kfr202.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 99
-------
Fielden MR, Brennan R, Gollub J. (2007). A gene expression biomarker provides early prediction and mechanistic
assessment of hepatic tumor induction by nongenotoxic chemicals. Toxicological Sciences 99: 90-100.
http://dx.doi.org/10.1093/toxsci/kfml56.

Fielden MR, Eynon BP, Natsoulis G, Jarnagin K, Banas D, Kolaja KL. (2005). A gene expression signature that predicts
the future onset of drug-induced renal tubular toxicity. Toxicologic Pathology 33: 675-683.
http://dx.doi.org/10.1080/01926230500321213.

Fielden MR, Nie A, McMillian M, Elangbam CS, Trela BA, Yang Y, et al. (2008). Interlaboratory evaluation of genomic
signatures for predicting carcinogenicity in the rat. Toxicological Sciences 103: 28-34.
http://dx.doi.org/10.1093/toxsci/kfn022.

Fortunel NO, Otu HH, Ng HH, Chen J, Mu X, Chevassut T, et al. (2003). Comment on "'Sternness': Transcriptional
profiling of embryonic and adult stem cells" and "a stem cell molecular signature". Science 302: 393; author reply
393. http://www.ncbi.nlm.nih.gov/pubmed/14563990.

Fowler BA, Whittaker MH, Lipsky M, Wang G, Chen XQ. (2004). Oxidative stress induced by lead, cadmium and arsenic
mixtures: 30-day, 90-day, and 180-day drinking water studies in rats: an overview. Biometals 17: 567-568.
http://www.ncbi.nlm.nih.gov/pubmed/15688865.

Friend S. (2013). Scientific opportunities from heterogeneous biological data analysis: Overcoming complexity.
Presentation. National Academy of Sciences Meeting: Integrating Environmental Health Data to Advance Discovery
January 10-11. http://nas-sites.org/emergingscience/files/2013/01/FRIEND-lanlO-final.pdf.

Froehlicher M, Liedtke A, Groh KJ, Neuhauss SC, Segner H, Eggen RI. (2009). Zebrafish (Danio rerio) neuromast:
Promising biological endpoint linking developmental and toxicological studies. Aquatic Toxicology 95: 307-319.
http://www.ncbi.nlm.nih.gov/pubmed/19467721.

Gangwal S, Reif DM, Mosher S, Egeghy PP, Wambaugh JF, Judson RS, et al. (2012). Incorporating exposure information
into the toxicological prioritization index decision support framework. Science of the Total Environment 435-436:
316-325. http://www.ncbi.nlm.nih.gov/pubmed/22863807.

Garte S, Taioli E, Popov T, Bolognesi C, Farmer P, Merlo F. (2008). Genetic susceptibility to benzene toxicity in humans.
Journal of Toxicology and Environmental Health, Part A 71:1482-1489.
http://www.ncbi.nlm.nih.gov/pubmed/18836923.

George BJ, Schultz BD, Palma T, Vette AF, Whitaker DA, Williams RW. (2011).^n Evaluation of EPA's National-Scale Air
Toxics Assessment (NATA): Comparison with Benzene Measurements in Detroit, Michigan. U.S. Environmental
Protection Agency. Retrieved from
http://www.google.com/url?sa=t&rct=i&q=an%20evaluation%20of%20epa's%20national-
scale%20air%20toxics%20assessment&source=web&cd=2&ved=OCEEOFiAB&url=http%3A%2F%2Fcfpub.epa.go
v%2Fsi%2Fsi public file download.cfm%3Fp download id%3D500973&ei=zYEmUfi7L62r2AWMuoC4BO&usg=
AFOiCNGLT3m4187I8hvEnaYfF6c24-uMA&bvm=bv.42661473.d.b2I (accessed February 22,2013).

German Federal Ministry for Education and Research. (2013). Virtual Liver Network. Retrieved from
http://www.virtual-liver.de/wordpress/en (accessed March 28,2013).

Gibbs-Flournoy EA, Simmons SO, Bromberg PA, Dick TP, Samet JM. (2013). Monitoring intracellular redox changes in
ozone-exposed airway epithelial cells. Environmental Health Perspectives 121: 312-317.
http://www.ncbi.nlm.nih.gov/pubmed/23249900.

Gibson GG, Rostami-Hodjegan A. (2007). Modelling and simulation in prediction of human xenobiotic absorption,
distribution, metabolism and excretion (ADME): In vitro-in vivo extrapolations (IVIVE). Xenobiotica 37: 1013-
1014. http://www.ncbi.nlm.nih.gov/pubmed/17968734.

Godderis L, Thomas R, Hubbard AE, Tabish AM, Hoet P, Zhang L, et al. (2012). Effect of chemical mutagens and
carcinogens on gene expression profiles in human TK6 cells. Public Library of Science One 7: e39205.
http://europepmc.org/abstract/MED/22723965.

Golbraikh A, WangXS, Zhu H, Tropsha A. (2012). Predictive QSAR modeling: Methods and applications in drug
discovery and chemical risk assessment. Handbook of Computational Cftem/strj/1309-1342.
http://link.springer.com/referenceworkentry/10.1007%2F978-94-007-0711-5 37#.

Gold LS, Manley NB, SloneTH, Ward JM. (2001). Compendium of chemical carcinogens by target organ: Results of
chronic bioassays in rats, mice, hamsters, dogs, and monkeys. Toxicologic Pathology 29: 639-652.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 100
-------
Goldsmith MR, Peterson SD, Chang DT, Transue TR, Tornero-Velez R, Tan YM, et al. (2012). Informing mechanistic
toxicology with computational molecular models. Methods in Molecular Biology 929:139-165.
http://www.ncbi.nlm.nih.gov/pubmed/23007429.

Goldstein BD. (1988). Benzene toxicity. Occupational Medicine 3: 541-554.
http://www.ncbi.nlm.nih.gov/pubmed/3043738.

Greenawalt DM, Sieberts SK, Cornells MC, Girman CJ, Zhong H, Yang X, et al. (2012). Integrating genetic association,
genetics of gene expression, and single nucleotide polymorphism set analysis to identify susceptibility Loci for
type 2 diabetes mellitus. American Journal of Epidemiology 176: 423-430.
http://www.ncbi.nlm.nih.gov/pubmed/22865700.

Guryev V, Koudijs MJ, Berezikov E, Johnson SL, Plasterk RH, van Eeden FJ, et al. (2006). Genetic variation in the
zebrafish. Genome Research 16: 491-497. http://www.ncbi.nlm.nih.gov/pubmed/16533913.

Hansen KD, Timp W, Bravo HC, Sabunciyan S, Langmead B, McDonald OG, etal. (2011). Increased methylation
variation in epigenetic domains across cancer types. Nature Genetics 43: 768-775.
http://www.ncbi.nlm.nih.gov/pubmed/21706001.

Harrill AH, Desmet KD, Wolf KK, Bridges AS, Eaddy JS, Kurtz CL, et al. (2012). A mouse diversity panel approach
reveals the potential for clinical kidney injury due to DB289 not predicted by classical rodent models.
Toxicological Sciences 130: 416-426. http://www.ncbi.nlm.nih.gov/pubmed/22940726.

Harrill AH, Ross PK, Gatti DM, Threadgill DW, Rusyn I. (2009). Population-based discovery of toxicogenomics
biomarkers for hepatotoxicity using a laboratory strain diversity panel. Toxicological Sciences 110: 235-243.
http://www.ncbi.nlm.nih.gov/pubmed/19420014.

Hatch GE, Slade R, Harris LP, McDonnell WF, Devlin RB, Koren HS, et al. (1994). Ozone dose and effect in humans and
rats. A comparison using oxygen-18 labeling and bronchoalveolar lavage. American Journal of Respiratory and
Critical Care Medicine 150: 676-683. http://www.ncbi.nlm.nih.gov/pubmed/8087337.

Hatzimichael E, Crook T. (2013). Cancer epigenetics: new therapies and new challenges. Journal of Drug Delivery 2013:
529312. http://www.ncbi.nlm.nih.gov/pubmed/23533770.

Hester RL, Brown AJ, Husband L, Illiescu R, Pruett D, Summers R, et al. (2011). HumMod: A modeling environment for
the simulation of integrative human physiology. Frontiers in Physiology 2.

Holzhutter HG, Drasdo D, Preusser T, Lippert J, Henney AM. (2012). The virtual liver: A multidisciplinary, multilevel
challenge for systems biology. Wiley Interdisciplinary Reviews: Systems Biology and Medicine 4: 221-235.
http://www.ncbi.nlm.nih.gov/pubmed/22246674.

Houck KA, Kavlock RJ. (2008). Understanding mechanisms of toxicity: Insights from drug discovery research.
Toxicology and Applied Pharmacology 227: 163-178. http://www.ncbi.nlm.nih.gov/pubmed/18063003.

Howlader N, Noone AM, Krapcho M, Garshell J, Neyman N, Altekruse SF, et al. (2013). SEER Cancer Statistics Review,
1975-2010. Bethesda, MD: U.S. National Institutes of Health, National Cancer Institute. Retrieved from
http://seer.cancer.gov/csr/1975 2010/. based on November 2012 SEER data submission, posted to the SEER web
site, April 2013.

Hubal EA. (2009). Biologically relevant exposure science for 21st century toxicity testing. Toxicological Sciences 111:
226-232. http://www.ncbi.nlm.nih.gov/pubmed/19602574.

Hunter P, Coveney PV, de Bono B, Diaz V, Fenner J, Frangi AF, et al. (2010). A vision and strategy for the virtual
physiological human in 2010 and beyond. Philosophical Transactions Series A: Mathematical, Physical, and
Engineering Sciences 368: 2595-2614. http://www.ncbi.nlm.nih.gov/pubmed/20439264.

Hunter P, Robbins P, Noble D. (2002). The IUPS human Physiome Project. PflugersArchiv 445:1-9.
http://www.ncbi.nlm.nih.gov/pubmed/12397380.

Hurley PM. (1998). Mode of carcinogenic action of pesticides inducing thyroid follicular cell tumors in rodents.
Environmental Health Perspectives 106: 437-445. http://www.ncbi.nlm.nih.gov/pubmed/9681970.

IARC lAfRo Cancer). (2012). Monograph 100F: Benzene. Retrieved from
http://monographs.iarc.fr/ENG/Monographs/vollOOF/monolOOF-24.pdf (accessed March 25,2013).

Ilhan G, Karakus S, Andic N. (2006). Risk factors and primary prevention of acute leukemia. Asian Pacific Journal of
Cancer Prevention 7: 515-517. http://www.ncbi.nlm.nih.gov/Dubmed/17250419.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 101
-------
Ingelman-Sundberg M. (2005). Genetic polymorphisms of cytochrome P450 2D6 (CYP2D6): clinical consequences,
evolutionary aspects and functional diversity. Pharmacogenomics 5: 6-13.
http://www.ncbi.nlm.nih.gov/pubmed/15492763.

Irons RD, Chen Y, WangX, Ryder J, Kerzic PJ. (2013). Acute myeloid leukemia following exposure to benzene more
closely resembles de novo than therapy related-disease. Genes Chromosomes Cancer 52: 887-894.
http://www.ncbi.nlm.nih.gov/pubmed/23840003.

Jack J, Wambaugh JF, Shah I. (2011). Simulating quantitative cellular responses using asynchronous threshold Boolean
network ensembles. BMC Systems Biology 5:109. http://www.ncbi.nlm.nih.gov/pubmed/21745399.

Jiang X, Kumar K, Hu X, Wallqvist A, Reifman J. (2008). DOVIS 2.0: An efficient and easy to use parallel virtual
screening tool based on AutoDock 4.0. Chemical Central Journal 2:18.
http://www.ncbi.nlm.nih.gov/pubmed/18778471.

Jubeaux G, Audouard-Combe F, Simon R, Tutundjian R, Salvador A, Geffard 0, et al. (2012). Vitellogenin-like proteins
among invertebrate species diversity: potential of proteomic mass spectrometry for biomarker development.
Environmental Science and Technology 46: 6315-6323. http://www.ncbi.nlm.nih.gov/pubmed/22578134.

Judson RS, Kavlock R, Martin M, Reif D, Houck K, Knudsen T, et al. (2013). Perspectives on validation of high-
throughput assays supporting 21st century toxicity testing. Alternatives to Animal Experiments 30: 51-56.
http://www.ncbi.nlm.nih.gov/pubmed/23338806.

Judson RS, Kavlock RJ, Setzer RW, Hubal EA, Martin MT, Knudsen TB, et al. (2011). Estimating toxicity-related
biological pathway altering doses for high-throughput chemical risk assessment. Chemical Research in Toxicology
24: 451-462. http://dx.doi.org/10.1021/txl00428e.

Judson RS, Martin MT, Reif DM, Houck KA, Knudsen TB, Rotroff DM, et al. (2010). Analysis of eight oil spill dispersants
using rapid, in vitro tests for endocrine and other biological activity. Environmental Science and Technology 44:
5979-5985. http://dx.doi.org/10.1021/esl02150z.

Kanehisa Laboratories. (2013). KEGG: Kyoto encyclopedia of genes and genomes. Retrieved from
http://www.genome.jp/kegg/ (accessed February 22,2013).

Kavlock R, Chandler K, Houck K, Hunter S, Judson R, Kleinstreuer N, et al. (2012). Update on EPA's ToxCast program:
providing high throughput decision support tools for chemical risk management. Chemical Research in Toxicology
25: 1287-1302. http://www.ncbi.nlm.nih.gov/pubmed/22519603.

Kim CS, Alexis NE, Rappold AG, Kehrl H, Hazucha MJ, Lay JC, et al. (2011). Lung function and inflammatory responses
in healthy young adults exposed to 0.06 ppm ozone for 6.6 hours. American Journal of Respiratory and Critical Care
Medicine 183:1215-1221. http://www.ncbi.nlm.nih.gov/pubmed/21216881.

Knudsen T, Daston GP. (2010). Virtual tissues and developmental systems biology (Chapter 23). In T Knudsen, G
Daston (Eds.), Second Edition Comprehensive Toxicology (pp. 347-358). Oxford, UK: Elsevier Ltd.

Knudsen T, DeWoskin RS. (2011). Systems modeling in development toxicity. In Handbook of Systems Toxicology: John
Wiley & Sons, Ltd.

Knudsen T, Kavlock RJ, Daston GP, Stedman D, Hixon M, Kim JH. (2011). Developmental toxicity testing for safety
assessment: New approaches and technologies. Birth Defects Research: PartB, Developmental and Reproductive
Toxicology 92: 413-420. http://www.ncbi.nlm.nih.gov/pubmed/21770025.

Kondraganti SR, Fernandez-Salguero P, Gonzalez FJ, Ramos KS, Jiang W, Moorthy B. (2003). Polycyclic aromatic
hydrocarbon-inducible DNA adducts: Evidence by 32P-postlabeling and use of knockout mice for Ah receptor-
independent mechanisms of metabolic activation in vivo. International Journal of Cancer 103: 5-11.
http://www.ncbi.nlm.nih.gov/pubmed/12455047.

Koturbash I, Beland FA, Pogribny IP. (2011). Role of epigenetic events in chemical carcinogenesis-a justification for
incorporating epigenetic evaluations in cancer risk assessment. Toxicology Mechanisms and Methods 21: 289-297.
http://www.ncbi.nlm.nih.gov/pubmed/21495867.

Krauth D, Woodruff TJ, Bero L. (2013). Instruments for assessing risk of bias and other methodological criteria of
published animal studies: a systematic review. Environmental Health Perspectives 121: 985-992.
http://www.ncbi.nlm.nih.gov/pubmed/23771496.

Krewski D, Hogan V, Turner MC, Zeman PL, McDowell I, Edwards N, et al. (2007). An integrated framework for risk
management and population health. Human and Ecological Risk Assessment 13: 1288-1312.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 102
-------
Krewski D, Westphal M, Paoli G, Croteau MC, Al-Zoughool M, Andersen ME, et al. (2013). A framework for the next
generation of risk science.

Lan Q, Zhang L, Li G, Vermeulen R, Weinberg RS, Dosemeci M, et al. (2004). Hematotoxicity in workers exposed to low
levels of benzene. Science 306:1774-1776. http://www.ncbi.nlm.nih.gov/pubmed/15576619.

Lau FK, Decastro BR, Mills-Herring L, Tao L, Valentin-Blasini L, Alwis KU, et al. (2013). Urinary perchlorate as a
measure of dietary and drinking water exposure in a representative sample of the United States population 2001-
2008. Journal of 'Exposure Science and Environmental Epidemiology 23: 207-214.
http://www.ncbi.nlm.nih.gov/pubmed/23188482.

Lock EF, Abdo N, Huang R, Xia M, Kosyk 0,0'Shea SH, et al. (2012). Quantitative high-throughput screening for
chemical toxicity in a population-based in vitro model. Toxicological Sciences 126: 578-588.
http://dx.doi.org/10.1093/toxsci/kfs023.

Lossos IS, Czerwinski DK, Alizadeh AA, Wechser MA, Tibshirani R, Botstein D, et al. (2004). Prediction of survival in
diffuse large-B-cell lymphoma based on the expression of six genes. New England Journal of Medicine 350:1828-
1837. http://www.ncbi.nlm.nih.gov/pubmed/15115829.

Lvovs D, Favorova 00, Favorov AV. (2012). A polygenic approach to the study of polygenic diseases. Acta Naturae 4:
59-71. http://www.ncbi.nlm.nih.gov/pubmed/23150804.

Makris SL, Kim JH, Ellis A, Faber W, Harrouk W, Lewis JM, etal. (2011). Current and future needs for developmental
toxicity testing. Birth Defects Research: Part B, Developmental and Reproductive Toxicology 92: 384-394.
http://www.ncbi.nlm.nih.gov/pubmed/21922641.

Martin MT, Knudsen TB, Reif DM, Houck KA, Judson RS, Kavlock RJ, et al. (2011). Predictive Model of Rat Reproductive
Toxicity from ToxCast High Throughput Screening. Biology of Reproduction 85: 327-339.
http://www.ncbi.nlm.nih.gov/pubmed/21565999.

Maull EA, Ahsan H, Edwards J, Longnecker MP, Navas-Acien A, Pi J, et al. (2012). Evaluation of the association between
arsenic and diabetes: a National Toxicology Program workshop review. Environmental Health Perspectives 120:
1658-1670. http://www.ncbi.nlm.nih.gov/pubmed/22889723.

Mayr LM, Bojanic D. (2009). Novel trends in high-throughput screening. Current Opinion in Pharmacology 9: 580-588.
http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list uids= 19775937.

McConnell ER, Bell SM. (2013). Systematic Omics Analysis Review (SOAR) tool to support risk assessment. Submitted.

McDonnell WF, Stewart PW, Smith MV, Kim CS, Schelegle ES. (2012). Prediction of lung function response for
populations exposed to a wide range of ozone conditions. Inhalation Toxicology 24: 619-633.
http://www.ncbi.nlm.nih.gov/pubmed/22906168.

McDougall CM, Blaylock MG, Douglas JG, Brooker RJ, Helms PJ, Walsh GM. (2008). Nasal epithelial cells as surrogates
for bronchial epithelial cells in airway inflammation studies. American Journal of Respiratory Cell and Molecular
Biology 39: 560-568. http://www.ncbi.nlm.nih.gov/pubmed/18483420.

McHale CM, Zhang L, Lan Q, Vermeulen R, Li G, Hubbard AE, et al. (2011). Global gene expression pro filing of a
population exposed to a range of benzene levels. Environmental Health Perspectives 119: 628-634.
http://dx.doi.org/10.1289/ehp.1002546.

McHale CM, Zhang L, Smith MT. (2012). Current understanding of the mechanism of benzene-induced leukemia in
humans: Implications for risk assessment. Carcinogenesis 33: 240-252.

Mechanic LE, Chen HS, Amos CI, Chatterjee N, Cox NJ, Divi RL, etal. (2012). Next generation analytic tools for large
scale genetic epidemiology studies of complex diseases. Genetic epidemiology 36: 22-35.
http://www.ncbi.nlm.nih.gov/pubmed/22147673.

Medzhitov R. (2008). Origin and physiological roles of inflammation. Nature 454: 428-435.
http://www.ncbi.nlm.nih.gov/pubmed/18650913.

Meissner A. (2012). What can epigenomics do for you? Genome Biology 13: 420.
http://www.ncbi.nlm.nih.gov/pubmed/23095436.

Miao X, Sun W, Fu Y, Miao L, Cai L. (2013). Zinc homeostasis in the metabolic syndrome and diabetes. Frontiers of
Medicine 7: 31-52. http://www.ncbi.nlm.nih.gov/Dubmed/23385610.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 103
-------
Miller MD, Crofton KM, Rice DC, Zoeller RT. (2009). Thyroid-disrupting chemicals: Interpreting upstream biomarkers
of adverse outcomes. Environmental Health Perspectives 117:1033-1041.
http://www.ncbi.nlm.nih.gov/pubmed/19654909.

Morales-Ruan Mdel C, Villalpando S, Garcia-Guerra A, Shaman-Levy T, Robledo-Perez R, Avila-Arcos MA, et al. (2012).
Iron, zinc, copper and magnesium nutritional status in Mexican children aged 1 to 11 years. Salud Publica de
Mexico 54:125-134. http://www.ncbi.nlm.nih.gov/pubmed/22535171.

Moraru, II, Schaff JC, Slepchenko BM, Blinov ML, Morgan F, Lakshminarayana A, et al. (2008). Virtual cell modelling
and simulation software environment. IETSystems Biology 2: 352-362.
http://www.ncbi.nlm.nih.gov/pubmed/19045830.

Mortensen HM, Euling SY. (2013). Integrating mechanistic and polymorphism data to characterize human genetic
susceptibility for environmental chemical risk assessment in the 21st century. Toxicology and Applied
Pharmacology 271: 395-404. http://dx.doi.0rg/10.1016/i.taap.2011.01.015.

MurkAJ, Rijntjes E, Blaauboer BJ, Clewell R, Crofton KM, Dingemans MM, etal. (2013). Mechanism-based testing
strategy using in vitro approaches for identification of thyroid hormone disrupting chemicals. Toxicology In Vitro.
http://www.ncbi.nlm.nih.gov/pubmed/23453986.

NAS (National Academy of Sciences). (2007). Scientific Review of the Proposed Risk Assessment Bulletin from the Office
of Management and Budget. (9780309104777). Washington, DC: The National Academies Press. Retrieved from
http://www.nap.edu/openbook.php7record id=11811.

NCBI (National Center for Biotechnology Information). (2009). Epigenomics (database). Retrieved from
http://www.ncbi.nlm.nih.gov/epigenomics (accessed March 4,2013).

NCBI (National Center for Biotechnology Information). (2012a). Gene Expression Omnibus. Retrieved from
http://www.ncbi.nlm.nih.gov/geo/ (accessed February 22, 2013).

NCBI (National Center for Biotechnology Information). (2012b). Reference SNP(refSNP) Cluster Report: rs!3266634.
dbSNP: Short Genetic Variations. Retrieved from
http://www.ncbi.nlm.nih.gov/projects/SNP/snp ref.cgi?rs= 13266634 (accessed March 20,2013).

NCBI (National Center for Biotechnology Information). (2013). PubMed Website. Retrieved from
http://www.ncbi.nlm.nih.gov/pubmed (accessed April 2,2013).

NHGRI (National Human Genome Research Institute). (2012). Home Page. Retrieved from http://www.genome.gov/
(accessed February 22,2013).

NHGRI (National Human Genome Research Institute). (2013). A Catalog of Published Genome-Wide Association
Studies. Retrieved from http://www.genome.gov/gwastudies/ (accessed March 25,2013).

Nichols JW, Breen M, Denver RJ, Distefano JJ, 3rd, Edwards JS, Hoke RA, et al. (2011). Predicting chemical impacts on
vertebrate endocrine systems. Environmental Toxicology and Chemistry 30: 39-51.
http://www.ncbi.nlm.nih.gov/pubmed/20963851.

Nie AY, Mcmillian M, Parker JB, Leone A, Bryant S, Yieh L, et al. (2006). Predictive toxicogenomics approaches reveal
underlying molecular mechanisms of nongenotoxic carcinogenicity. Molecular Carcinogenesis 45: 914-933.

NIEHS (National Institute of Environmental Health Sciences). (2012a). Host Susceptibility Program. September 7.
Retrieved from http://ntp.niehs.nih.gov/?obiectid=B76D131B-FlF6-975E-71657BC3DC88C299 (accessed April 2,
2013).

NIEHS (National Institute of Environmental Health Sciences). (2012b). NIEHS SNPs Environmental Genome Project.
Retrieved from http://egp.gs.washington.edu/ (accessed March 25,2013).

NIEHS (National Institute of Environmental Health Sciences). (2012c). NIEHS Strategic Plan. Retrieved from
http://niehs.nih.gov/about/strategicplan/index.cfm (accessed February 22,2013).

NIEHS (National Institute of Environmental Health Sciences). (2013). Comparative Toxicogenomic Database (CTD)™.
February 6. Retrieved from http://ctdbase.org/ (accessed March 4,2013).

NIH (National Institutes of Health). (2012). NIH Chemical Genomics Center. Retrieved from
http://www.ncats.nih.gov/research/reengineering/ncgc/ncgc.html (accessed February 22,2013).

NIOSH (National Institute of Occupational Safety and Health). (1992). Recommendations for Occupational Safety and
Health: Compendium of Policy Documents and Statements. Retrieved from http://www.cdc.gov/niosh/pdfs/92-
100.pdf (accessed March 25,2013).
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 104
-------
North M, Tandon VJ, Thomas R, Loguinov A, Gerlovina I, Hubbard AE, et al. (2011). Genome-wide functional profiling
reveals genes required for tolerance to benzene metabolites in yeast. Public Library of Science One 6: e24205.
http://www.ncbi.nlm.nih.gov/pubmed/21912624.

NRC (National Research Council). (2006). Human Biomonitoring for Environmental Chemicals. The National Academies
Press. Retrieved from http://www.nap.edu/openbook.php7record id= 11700.

NRC (National Research Council). (2007). Toxicity Testing in the 21st Century: A Vision and a Strategy. Washington DC.
Retrieved from http://dels.nas.edu/resources/static-assets/materials-based-on-reports/reports-in-
brief/Toxicity Testing final.pdf (accessed February 22,2013).

NRC (National Research Council). (2009). Science and Decisions: Advancing Risk Assessment. Washington, DC.
Retrieved from http://www.nap.edu/catalog/12209.html (accessed February 22,2013).

NRC (National Research Council). (2011). Predicting Later-Life Outcomes of Early-Life Exposures. Retrieved from
http://nas-sites.org/emergingscience/files/2011/05/inutero final April2011.pdf (accessed April 2,2013).

O'Shea SH, Schwarz J, Kosyk 0, Ross PK, Ha MJ, Wright FA, et al. (2011). In vitro screening for population variability in
chemical toxicity. ToxicologicalSciences 119: 398-407. http://www.ncbi.nlm.nih.gov/pubmed/20952501.

OECD (Organization for Economic Cooperation and Development). (2004). OECD Principles for the Validation, for
Regulatory Purposes, of Quantitative Structure Activity Relationship Models. Paris, France. Retrieved from
http://www.oecd.org/chemicalsafety/assessmentofchemicals/37849783.pdf (accessed February 22,2013).

OECD (Organization for Economic Cooperation and Development). (2012). The OECD QSAR Toolbox. Version 3.0.
Retrieved from
http://www.oecd.org/env/chemicalsafetyandbiosafety/assessmentofchemicals/theoecdqsartoolbox.htm
(accessed February 22,2013).

OMIM (Online Mendelian Inheritance in Man). (2012). Solute Carrier Family 30 (Zinc Transporter), Member 8,
SLC30A8 November 13. Retrieved from http://omim.org/entry/611145tfOQ01 (accessed March 21,2013).

Oracle. (2013a). 5 Configuring Rule Sets. Retrieved from
http://docs.oracle.com/cd/B28359 01/server.lll/b31222/cfrulset.htm#DVADM70150 (accessed March 20,
2013).

Oracle. (2013b). Glossary: "Lift". Retrieved from
http://docs.oracle.com/cd/B28359 01/datamine.lll/b28129/glossary.htm (accessed March 20,2013).

Padilla S, Corum D, Padnos B, Hunter DL, Beam A, Houck KA, et al. (2012). Zebrafish developmental screening of the
ToxCast Phase I chemical library. Reproductive Toxicology 33:174-187.
http://www.ncbi.nlm.nih.gov/pubmed/22182468.

Pare G, Chasman DI, Parker AN, Nathan DM, Miletich JP, Zee RY, et al. (2008). Novel association of HK1 with glycated
hemoglobin in a non-diabetic population: A genome-wide evaluation of 14,618 participants in the Women's
Genome Health Study. Public Library of Science Genetics 4: e!000312.
http://www.ncbi.nlm.nih.gov/pubmed/19096518.

Parham F, Austin C, Southall N, Huang R, Tice R, Portier C. (2009). Dose-response modeling of high-throughput
screening data. Journal of Biomolecular Screening 14:1216-1227.
http://www.ncbi.nlm.nih.gov/pubmed/19828774.

Paris M, LaudetV. (2008). The history of a developmental stage: Metamorphosis inchordates. Genesis 46: 657-672.
http://www.ncbi.nlm.nih.gov/pubmed/18932261.

Parng C, Seng WL, Semino C, McGrath P. (2002). Zebrafish: A preclinical model for drug screening. Assay & Drug
Development Technologies 1: 41-48. http://www.ncbi.nlm.nih.gov/pubmed/15090155.

Patel CJ, Cullen MR. (2012). Genetic variability in molecular responses to chemical exposure. Experientia 101: 437-
457. http://www.ncbi.nlm.nih.gov/pubmed/22945578.

Patel CJ, Chen R, Butte AJ. (2012a). Data-driven integration of epidemiological and toxicological data to select
candidate interacting genes and environmental factors in association with disease. Bioinformatics 28:1121-126.
http://www.ncbi.nlm.nih.gov/pubmed/22689751.

Patel CJ, Cullen MR, Loannidis JP, Butte AJ. (2012b). Systematic evaluation of environmental factors: Persistent
pollutants and nutrients correlated with serum lipid levels. International Journal of Epidemiology 4-1: 828-843.
httD://dx.doi.org/10.1093/iie/dvs003.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 105
-------
Patel CJ, Chen R, Kodama K, Loannidis JP, Butte AJ. (2013). Systematic identification of interaction effects between
genome-and environment-wide associations in type 2 diabetes mellitus. Human genetics.

Peden DB, Boehlecke B, Horstman D, Devlin R. (1997). Prolonged acute exposure to 0.16 ppm ozone induces
eosinophilic airway inflammation in asthmatic subjects with allergies. Journal of Allergy and Clinical Immunology
100: 802-808. http://www.ncbi.nlm.nih.gov/pubmed/9438490.

Pedersen-Bjergaard J, Andersen MK, Andersen MT, Christiansen DH. (2008). Genetics of therapy-related
myelodysplasia and acute myeloid leukemia. Leukemia 22: 240-248.
http://www.ncbi.nlm.nih.gov/pubmed/18200041.

Perkins EJ, Ankley GT, Crofton KM, Garcia-Reyero N, Lalone CA, Johnson MS, et al. (2013). Current perspectives on the
use of alternative species in human health and ecological risk assessments. Environmental Health Perspectives.
http://www.ncbi.nlm.nih.gov/pubmed/23771518.

Physiome Project. (2013) Retrieved from http://physiomeproiect.org/ (accessed March 28,2013).

Rabinowitz JR, Goldsmith MR, Little SB, Pasquinelli MA. (2008). Computational molecular modeling for evaluating the
toxicity of environmental chemicals: Prioritizing bioassay requirements. Environmental Health Perspectives 116:
573-577. http://www.ncbi.nlm.nih.gov/pubmed/18470285.

Rakyan VK, Down TA, Balding DJ, Beck S. (2011). Epigenome-wide association studies for common human diseases.
Nature Reviews Genetics 12: 529-541. http://www.ncbi.nlm.nih.gov/pubmed/21747404.

Raldua D, Thienpont B, Babin PJ. (2012). Zebrafish eleutheroembryos as an alternative system for screening chemicals
disrupting the mammalian thyroid gland morphogenesis and function. Reproductive Toxicology 33:188-197.
http://www.ncbi.nlm.nih.gov/pubmed/21978863.

RamasamyA, MondryA, Holmes CC, Altman DG. (2008). Key issues in conducting a meta-analysis of gene expression
microarray datasets. Public Library of Science Medicine 5: e!84. http://www.ncbi.nlm.nih.gov/pubmed/18767902.

Reaume CJ, Sokolowski MB. (2011). Conservation of gene function in behaviour. Philosophical Transactions of the
Royal Society London B: Biological Science 366: 2100-2110. http://www.ncbi.nlm.nih.gov/pubmed/21690128.

Reif DM, Martin MT, Tan SW, Houck KA, Judson RS, Richard AM, et al. (2010). Endocrine profiling and prioritization of
environmental chemicals using ToxCast data. Environmental Health Perspectives 118: 1714-1720.
http://www.ncbi.nlm.nih.gov/pubmed/20826373.

Reuschenbach P, Silvani M, Dammann M, Warnecke D, Knacker T. (2008). ECOSAR model performance with a large
test set of industrial chemicals. Chemosphere 71:1986-1995. http://www.ncbi.nlm.nih.gov/pubmed/18262586.

Rosenbaum RK, Bachmann TM, Swirsky Gold L, Huijbregts MAJ, Jolliet 0, Juraske R. (2008). USEtox - the UNEP-SETAC
toxicity model: Recommended characterization factors for human toxicity and freshwater ecotoxicicty in life cycle
impact assessment. International Journal of Life Cycle Assessment 13: 532-546.

Rotroff DM, Wetmore BA, Dix DJ, Ferguson SS, Clewell HJ, Houck KA, et al. (2010). Incorporating human dosimetry and
exposure into high-throughput in vitro toxicity screening. Toxicological Sciences 117: 348-358.
http://www.ncbi.nlm.nih.gov/pubmed/20639261.

Rudel RA, Dodson RW, E. N, A.R. Z, J.G. B. (2008). Correlations between urinary phthalate metabolites and phthalates,
estrogenic compounds 4-butyl phenol and o-phenyl phenol, and some pesticides in home indoor air and house
dust. Epidemiology 19: S332.

Rung J, Cauchi S, Albrechtsen A, Shen L, Rocheleau G, Cavalcanti-Proenca C, et al. (2009). Genetic variant near IRS1 is
associated with type 2 diabetes, insulin resistance and hyperinsulinemia. Nature Genetics 41:1110-1115.
http://www.ncbi.nlm.nih.gov/pubmed/19734900.

Rusyn I, Gatti DM, Wiltshire T, Kleeberger SR, Threadgill DW. (2010). Toxicogenetics: Population-based testing of drug
and chemical safety in mouse models. Pharmacogenomics 11:1127-1136.
http://www.ncbi.nlm.nih.gov/pubmed/20704464.

Rusyn I, Sedykh A, Low Y, Guyton KZ, Tropsha A. (2012). Predictive modeling of chemical hazard by integrating
numerical descriptors of chemical structures and short-term toxicity assay data. Toxicological Sciences 127:1-9.
http://dx.doi.org/10.1093/toxsci/kfs095.

SAB (Science Advisory Board). (2013). Draft SAB advice on advancing the application of computational toxicology
research for human health risk assessment. Washington, DC. Retrieved from
http://yosemite.epa.gov/sab/sabproduct.nsf/46963ceebabd621905256cae0053d5c6/F3315DOEE2EDC1128525
7BOA005F5B7E/$File/CompTox-edited+l-29-13.pdf (accessed February 26,2013).

This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 106
-------
Sagredo C, Ovrebo S, Haugen A, Fujii-Kuriyama Y, Baera R, Botnen IV, et al. (2006). Quantitative analysis of
benzo[a]pyrene biotransformation and adduct formation in Ahr knockout mice. Toxicology Letters 167:173-182.
http://www.ncbi.nlm.nih.gov/pubmed/17049425.

Samuels MH, Luther M, Henry P, Ridgway EC. (1994). Effects of hydrocortisone on pulsatile pituitary glycoprotein
secretion. Journal of Clinical Endocrinology & Metabolism 78: 211-215.
http://www.ncbi.nlm.nih.gov/pubmed/8288706.

Sand S, Portier CJ, Krewski D. (2011). A signal-to-noise crossover dose as the point of departure for health risk
assessment. Environmental Health Perspectives 119: 1766-1774.
http://www.ncbi.nlm.nih.gov/pubmed/21813365.

Sarapura VD, Samuels MH, Ridgway EC. (2002). Thyroid-Stimulating Hormone. In S Melmed (Ed.), The Pitutary
(Second ed., pp. 187-229). Maiden, MA: Blackwell Science.

Schnatter AR, Glass DC, Tang G, Irons RD, Rushton L. (2012). Myelodysplastic syndrome and benzene exposure among
petroleum workers: an international pooled analysis. Journal of the National Cancer Institute 104:1724-1737.
http://www.ncbi.nlm.nih.gov/pubmed/23111193.

Schreiber SL. (2003). The small-molecule approach to biology: Chemical genetics and diversity-oriented organic
synthesis make possible the systematic exploration of biology. Chemical & Engineering News 81: 51-61.

Schug TT, Janesick A, Blumberg B, Heindel JJ. (2011). Endocrine disrupting chemicals and disease susceptibility.
Journal of Steroid Biochemistry and Molecular Biology 127: 204-215.
http://www.ncbi.nlm.nih.gov/pubmed/21899826.

Scott LJ, Mohlke KL, Bonnycastle LL, Wilier CJ, Li Y, Duren WL, et al. (2007). A genome-wide association study of type 2
diabetes in Finns detects multiple susceptibility variants. Science 316:1341-1345.
http://www.ncbi.nlm.nih.gov/pubmed/17463248.

Sedykh A, Zhu H, Tang H, Zhang L, Richard A, Rusyn I, et al. (2011). Use of in vitro HTS-derived concentration-
response data as biological descriptors improves the accuracy of QSAR models of in vivo toxicity. Environmental
Health Perspectives 119: 364-370. http://www.ncbi.nlm.nih.gov/pubmed/20980217.

Selgrade MK, Cooper KD, Devlin RB, van Loveren H, Biagini RE, Luster MI. (1995). Immunotoxicity-bridging the gap
between animal research and human health effects. Fundamental and Applied Toxicology 24:13-21.
http://www.ncbi.nlm.nih.gov/pubmed/7713335.

Serafimova R, Todorov M, Nedelcheva D, Pavlov T, Akahori Y, Nakai M, et al. (2007). QSAR and mechanistic
interpretation of estrogen receptor binding. SAR and QSAR in Environmental Research 18: 389-421.
http://www.ncbi.nlm.nih.gov/pubmed/17514577.

Shaffer CL, Scialis RJ, Rong H, Obach RS. (2012). Using Simcyp to project human oral pharmacokinetic variability in
early drug research to mitigate mechanism-based adverse events. Biopharmaceutics & Drug Disposition 33: 72-84.
http://www.ncbi.nlm.nih.gov/pubmed/22213407.

Shah et al. (submitted). OnToP: An ontology for toxicity pathways.

Shah I, Wambaugh J. (2010). Virtual tissues in toxicology. Journal of Toxicology and Environmental Health PartB:
Critical Reviews 13: 314-328. http://www.ncbi.nlm.nih.gov/pubmed/20574905.

Sheldon LS, Cohen Hubal EA. (2009). Exposure as part of a systems approach for assessing risk. Environmental Health
Perspectives 117:119-1194. http://www.ncbi.nlm.nih.gov/pubmed/19672394.

Shen M, Zhang L, Lee KM, Vermeulen R, Hosgood HD, Li G, et al. (2011). Polymorphisms in genes involved in innate
immunity and susceptibility to benzene-induced hematotoxicity. Experimental & Molecular Medicine 43: 374-378.
http://www.ncbi.nlm.nih.gov/pubmed/21540635.

Shi L, Jones WD, Jensen RV, Harris SC, Perkins RG, Goodsaid FM, et al. (2008). The balance of reproducibility,
sensitivity, and specificity of lists of differentially expressed genes in microarray studies. BMC Bioinformatics 9
Suppl 9: S10. http://www.ncbi.nlm.nih.gov/pubmed/18793455.

Sille FC, Thomas R, Smith MT, Conde L, Skibola CF. (2012). Post-GWAS functional characterization of susceptibility
variants for chronic lymphocytic leukemia. Public Library of Science One 7: e29632.
http://www.ncbi.nlm.nih.gov/pubmed/22235315.

Sipes NS, Martin MT, Kothiya P, Reif DM, Judson RS, Richard AM, et al. (2013). Profiling 976 ToxCast chemicals across
331 enzymatic and receptor signaling assays. Chemical Research in Toxicology 26: 878-895.
http://www.ncbi.nlm.nih.gov/pubmed/23611293.

This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 107
-------
Sipes NS, Padilla S, Knudsen TB. (2011). Zebrafish: As an integrative model for twenty-first century toxicity testing.
Birth Defects Research: Part C, Embryo Today 93: 256-267. http://www.ncbi.nlm.nih.gov/pubmed/21932434.

Sistonen J, Sajantila A, Lao 0, Corander J, Barbujani G, Fuselli S. (2007). CYP2D6 worldwide genetic variation shows
high frequency of altered activity variants and no continental structure. Pharmacogenetics and Genomics 17: 93-
101. http://www.ncbi.nlm.nih.gov/pubmed/17301689.

Skolness SY, Blanksma CA, Cavallin JE, Churchill JJ, Durhan EJ, Jensen KM, et al. (2013). Propiconazole inhibits
steroidogenesis and reproduction in the fathead minnow (Pimephales promelas). Toxicological Sciences.
http://www.ncbi.nlm.nih.gov/pubmed/23339182.

Sladek R, Rocheleau G, Rung J, Dina C, Shen L, Serre D, et al. (2007). A genome-wide association study identifies novel
risk loci for type 2 diabetes. Nature 445: 881-885. http://www.ncbi.nlm.nih.gov/pubmed/17293876.

Smith MT, Zhang L, McHale CM, Skibola CF, Rappaport SM. (2011). Benzene, the exposome and future investigations of
leukemia etiology. Chemico-Biological Interactions 192:155-159.
http://www.ncbi.nlm.nih.gov/pubmed/21333640.

Smith MV, Boyd WA, Kissling GE, Rice JR, Snyder DW, Portier CJ, et al. (2009). A discrete time model for the analysis of
medium-throughput C. elegans growth data. Public Library of Science One 4: e7018.
http://www.ncbi.nlm.nih.gov/pubmed/19753303.

Steinthorsdottir V, Thorleifsson G, Reynisdottir I, Benediktsson R, Jonsdottir T, Walters GB, et al. (2007). A variant in
CDKAL1 influences insulin response and risk of type 2 diabetes. Nature Genetics 39: 770-775.
http://www.ncbi.nlm.nih.gov/pubmed/17460697.

Takeuchi F, Serizawa M, Yamamoto K, Fujisawa T, Nakashima E, Ohnaka K, et al. (2009). Confirmation of multiple risk
Loci and genetic impacts by a genome-wide association study of type 2 diabetes in the Japanese population.
Diabetes 58:1690-1699. http://www.ncbi.nlm.nih.gov/pubmed/19401414.

Taylor KW, Novak RF, Anderson HA, Birnbaum LS, Blystone C, Devito M, et al. (2013). Evaluation of the Association
between Persistent Organic Pollutants (POPs) and Diabetes in Epidemiological Studies: A National Toxicology
Program Workshop Review. Environmental Health Perspectives 121: 774-783.
http://www.ncbi.nlm.nih.gov/pubmed/23651634.

Teschendorff AE, Widschwendter M. (2012). Differential variability improves the identification of cancer risk markers
in DNA methylation studies profiling precursor cancer lesions. Bioinformatics 28:1487-1494.
http://www.ncbi.nlm.nih.gov/pubmed/22492641.

Thayer KA, Heindel JJ, Bucher JR, Gallo MA. (2012). Role of environmental chemicals in diabetes and obesity: A
National Toxicology Program Workshop report. Environmental Health Perspectives 120: 779-789.
http://dx.doi.org/10.1289/ehp.1104597.

Thienpont B, Tingaud-Sequeira A, Prats E, Barata C, Babin PJ, Raldua D. (2011). Zebrafish eleutheroembryos provide a
suitable vertebrate model for screening chemicals that impair thyroid hormone synthesis. Environmental Science
and Technology 45: 7525-7532. http://www.ncbi.nlm.nih.gov/pubmed/21800831.

Thomas D. (2010). Gene-environment-wide association studies: Emerging approaches. Nature Reviews Genetics 11:
259-272. http://www.ncbi.nlm.nih.gov/pubmed/20212493.

Thomas R, PhuongJ, McHale CM, Zhang L. (2012). Using bioinformatic approaches to identify pathways targeted by
human leukemogens. International Journal of Environmental Research and Public Health 9: 2479-2503.
http://www.ncbi.nlm.nih.gov/pubmed/22851955.

Thomas RS, Allen BC, Nong A, Yang L, Bermudez E, Clewell HJ, III, et al. (2007). A method to integrate benchmark dose
estimates with genomic data to assess the functional effects of chemical exposure. Toxicological Sciences 98: 240-
248. http://dx.doi.org/10.1093/toxsci/kfm092.

Thomas RS, Bao W, Chu TM, Bessarabova M, Nikolskaya T, Nikolsky Y, et al. (2009). Use of short-term transcriptional
profiles to assess the long-term cancer-related safety of environmental and industrial chemicals. Toxicological
Sciences 112: 311-321. http://dx.doi.org/10.1093/toxsci/kfp233.

Thomas RS, Clewell HJ, 3rd, Allen BC, Yang L, Healy E, Andersen ME. (2012). Integrating pathway-based
transcriptomic data into quantitative chemical risk assessment: A five chemical case study. Mutation Research 746:
135-143. httD://www.ncbi.nlm.nih.gov/Dubmed/22305970.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 108
-------
Thomas RS, Clewell HJ, Allen BC, Wesselkamper SC, Wang NC, Lambert JC, et al. (2011). Application of transcriptional
benchmark dose values in quantitative cancer and noncancer risk assessment. Toxicological Sciences 120: 194-
205. http://dx.doi.org/10.1093/toxsci/kfq355.

Thomas RS, Philbert MA, Auerbach SS, Wetmore BA, Devito MJ, Cote I, et al. (2013a). Incorporating new technologies
into toxicity testing and risk assessment: Moving from 21st century vision to a data-driven framework.
Toxicological Sciences, http://www.ncbi.nlm.nih.gov/pubmed/23958734.

Thomas RS, Wesselkamper SC, Wang NC, Zhao QJ, Petersen DD, Lambert JC, et al. (2013b). Temporal concordance
between apical and transcriptional points of departure for chemical risk assessment. Toxicological Sciences 134:
180-194. http://www.ncbi.nlm.nih.gov/pubmed/23596260.

Tice RR, Austin CP, Kavlock RJ, Bucher JR. (2013). Improving the human hazard characterization of chemicals: atox21
update. Environmental Health Perspectives 121: 756-765. http://www.ncbi.nlm.nih.gov/pubmed/23603828.

Tietge JE, Degitz SJ, Haselman JT, Butterworth BC, Korte JJ, Kosian PA, et al. (2013). Inhibition of the thyroid hormone
pathway in Xenopus laevis by 2-mercaptobenzothiazole. Aquatic Toxicology 126:128-136.
http://www.ncbi.nlm.nih.gov/pubmed/23178179.

Timpson NJ, Lindgren CM, Weedon MN, Randall J, Ouwehand WH, Strachan DP, et al. (2009). Adiposity-related
heterogeneity in patterns of type 2 diabetes susceptibility observed in genome-wide association data. Diabetes 58:
505-510. http://www.ncbi.nlm.nih.gov/pubmed/19056611.

Tokar EJ, Diwan BA, Thomas DJ, Waalkes MP. (2012). Tumors and proliferative lesions in adult offspring after
maternal exposure to methylarsonous acid during gestation in GDI mice. Archives of Toxicology 86: 975-982.
http://www.ncbi.nlm.nih.gov/pubmed/22398986.

Tokar EJ, Qu W, Waalkes MP. (2011). Arsenic, stem cells, and the developmental basis of adult cancer. Toxicological
Sciences 120 Suppl 1: S192-203. http://www.ncbi.nlm.nih.gov/pubmed/21071725.

Torkamani A, Topol EJ, Schork NJ. (2008). Pathway analysis of seven common diseases assessed by genome-wide
association. Genomics 92: 265-272. http://www.ncbi.nlm.nih.gov/pubmed/18722519.

Uehara T, Minowa Y, Morikawa Y, Kondo C, Maruyama T, Kato I, et al. (2011). Prediction model of potential
hepatocarcinogenicity of rat hepatocarcinogens using a large-scale toxicogenomics database. Toxicology and
Applied Pharmacology 255: 297-306. http://dx.doi.0rg/10.1016/i.taap.2011.07.001.

van Leeuwen DM, Pedersen M, Knudsen LE, Bonassi S, Fenech M, Kleinjans JC, et al. (2011). Transcriptomic network
analysis of micronuclei-related genes: A case study. Mutagenesis 26: 27-32.
http://www.ncbi.nlm.nih.gov/pubmed/21164179.

Venkatapathy R, Moudgal CJ, Bruce RM. (2004). Assessment of the oral rat chronic lowest observed adverse effect
level model in TOPKAT, a QSAR software package for toxicity prediction. Journal of Chemical Information and
Computer Sciences 44:1623-1629. http://www.ncbi.nlm.nih.gov/pubmed/15446819.

Venkatapathy R, Wang NC. (2013). Developmental toxicity prediction. Methods in Molecular Biology 930: 305-340.
http://www.ncbi.nlm.nih.gov/pubmed/23086848.

Visscher H, Ross CJ, Rassekh SR, Sandor GS, Caron HN, van Dalen EC, et al. (2013). Validation of variants in SLC28A3
and UGT1A6 as genetic markers predictive of anthracycline-induced cardiotoxicity in children. Pediatr Blood
Cancer 60:1375-1381. http://www.ncbi.nlm.nih.gov/pubmed/23441093.

Vogelstein B, Lane D, Levine AJ. (2000). Surfing the p53 network. Nature 408: 307-310.
http://www.ncbi.nlm.nih.gov/pubmed/11099028.

Waits ER, Nebert DW. (2011). Genetic architecture of susceptibility to PCB126-induced developmental cardiotoxicity
in zebrafish. Toxicological Sciences 122: 466-475. http://www.ncbi.nlm.nih.gov/pubmed/21613231.

Walker JD, Carlsen L. (2002). QSARs for identifying and prioritizing substances with persistence and bioconcentration
potential. SAR and QSAR in Environmental Research 13: 713-725.
http://www.ncbi.nlm.nih.gov/pubmed/12570048.

Walker JD, Carlsen L, Hulzebos E, Simon-Hettich B. (2002). Global government applications of analogues, SARs and
QSARs to predict aquatic toxicity, chemical or physical properties, environmental fate parameters and health
effects of organic chemicals. SAR and QSAR in Environmental Research 13: 607-616.
http://www.ncbi.nlm.nih.gov/pubmed/12479375.

Wambaugh J, Shah I. (2010). Simulating microdosimetry in a virtual hepatic lobule. Public Library of Science
Computational Biology 6: e!000756. http://www.ncbi.nlm.nih.gov/pubmed/20421935.

This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 109
-------
Wang I, Zhang B, Yang X, Stepaniants S, Zhang C, Meng Q, et al. (2012). Systems analysis of eleven rodent disease
models reveals an inflammatome signature and key drivers. Molecular Systems Biology 8: 594.
http://www.ncbi.nlm.nih.gov/pubmed/22806142.

Wang N, Jay Zhao Q, Wesselkamper SC, Lambert JC, Petersen D, Hess-Wilson JK. (2012a). Application of computational
toxicological approaches in human health risk assessment I. A tiered surrogate approach. Regulatory Toxicology
and Pharmacology 63:10-19. http://www.ncbi.nlm.nih.gov/pubmed/22369873.

Wang N, Rice GE, Teuschler LK, Colman J, Yang RS. (2012b). An in silico approach for evaluating a fraction-based, risk
assessment method for total petroleum hydrocarbon mixtures. Journal of Toxicology 2012: 410143.
http://www.ncbi.nlm.nih.gov/pubmed/22496687.

Wang N, Venkatapathy R, Bruce RM, Moudgal C. (2011). Development of quantitative structure-activity relationship
(QSAR) models to predict the carcinogenic potency of chemicals. II. Using oral slope factor as a measure of
carcinogenic potency. Regulatory Toxicology and Pharmacology 59: 215-226.
http://www.ncbi.nlm.nih.gov/pubmed/20951756.

WanjekC. (2013). Systems Biology as Defined by National Institute of Health (NIH). Retrieved from
http://irp.nih.gov/catalyst/vl9i6/systems-biology-as-defined-by-nih (accessed April 2,2013).

Warner CM, Gust KA, Stanley JK, Habib T, Wilbanks MS, Garcia-Reyero N, et al. (2012). A systems toxicology approach
to elucidate the mechanisms involved in RDX species-specific sensitivity. Environmental Science and Technology
46: 7790-7798. http://www.ncbi.nlm.nih.gov/pubmed/22697906.

Weiss JN, Karma A, MacLellan WR, Deng M, Rau CD, Rees CM, et al. (2012). "Good enough solutions" and the genetics
of complex diseases. Circulation Research 111: 493-504. http://www.ncbi.nlm.nih.gov/pubmed/22859671.

Wetmore BA, Wambaugh JF, Ferguson SS, Li L, Clewell HJ, 3rd, Judson RS, et al. (2013). Relative impact of
incorporating pharmacokinetics on predicting in vivo hazard and mode of action from high-throughput in vitro
toxicity assays. Toxicological Sciences 132: 327-346. http://www.ncbi.nlm.nih.gOV/pubmed/23358191.

Wetmore BA, Wambaugh JF, Ferguson SS, Sochaski MA, Rotroff DM, Freeman K, etal. (2012). Integration of dosimetry,
exposure, and high-throughput screening data in chemical toxicity assessment. Toxicological Sciences 125:157-
174. http://www.ncbi.nlm.nih.gov/pubmed/21948869.

WHO (World Health Organization). (2012). State of the Science of Endocrine Disrupting Chemicals. Retrieved from
http://apps.who.int/iris/bitstream/10665/78102/l/WHO HSE PHE IHE 2013.1 eng.pdf (accessed March 4,
2013).

Wignall J, Muratov E, Fourches D, Sedykh A, Tropsha A, Woodruff TJ, et al. (2012). Modeling toxicity values using
chemical structure, in vitro screening, and in vivo toxicity data 51st Annual Meetign of the Society of Toxicology
Poster Abstract # 299. http://www.toxicology.org/AI/Pub/Tox/2012Tox.pdf.

Wignall J, Muratov E, Fourches D, Tropsha A, Woodruff TJ, Zeise L, et al. (2013). Conditional toxicity value (CTV)
predictor for generating toxicity values for data sparse chemicals. 52nd Annual Meeting of the Society of Toxicology
Poster Abstract #2454. http://www.toxicology.org/AI/PUB/Tox/2013Tox.pdf.

Williams LM, Oleksiak MF. (2011). Ecologically and evolutionarily important SNPs identified in natural populations.
Molecular Biology and Evolution 28: 1817-1826. http://www.ncbi.nlm.nih.gov/pubmed/21220761.

Wooding SP, Watkins WS, Bamshad MJ, Dunn DM, Weiss RB, Jorde LB. (2002). DNA sequence variation in a 3.7-kb
noncoding sequence 5' of the CYP1A2 gene: Implications for human population history and natural selection.
American Journal of Human Genetics 71: 528-542. http://www.ncbi.nlm.nih.gov/pubmed/12181774.

Woodruff TJ, Sutton P. (2011). An evidence-based medicine methodology to bridge the gap between clinical and
environmental health sciences. Health Affairs [Millwood) 30: 931-937.
http://www.ncbi.nlm.nih.gov/pubmed/21555477.

Wu W, Doreswamy V, Diaz-Sanchez D, Samet JM, Kesic M, Dailey L, et al. (2011). GSTM1 modulation of IL-8 expression
in human bronchial epithelial cells exposed to ozone. Free Radical Biology and Medicine 51: 522-529.
http://www.ncbi.nlm.nih.gov/pubmed/21621609.

Yeung KY, Dombek KM, Lo K, Mittler JE, Zhu J, Schadt EE, et al. (2011). Construction of regulatory networks using
expression time-series data of a genotyped population. Proceedings of the National Academy of Sciences USA 108:
19436-19441. http://www.ncbi.nlm.nih.gov/pubmed/22084118.

Zacharewski T, Teuschler L, Cote I, Burgoon L. (submitted). Improving cumulative risk assessment through systems
and network biology driven data mining.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 110
-------
Zeggini E, Weedon MN, Lindgren CM, Frayling TM, Elliott KS, Lango H, et al. (2007). Replication of genome-wide
association signals in UK samples reveals risk loci for type 2 diabetes. Science 316:1336-1341.
http://www.ncbi.nlm.nih.gov/pubmed/17463249.

Zeise L, Bois FY, Chiu WA, Hattis D, Rusyn I, Guyton KZ. (2012). Addressing human variability in next-generation
human health risk assessments of environmental chemicals. Environmental Health Perspectives Epub doi:
10.1289/ehp.l205687.

Zhang L, McHale CM, Rothman N, Li G, Ji Z, Vermeulen R, et al. (2010). Systems biology of human benzene exposure.
Chemico-Biological Interactions 184: 86-93. http://www.ncbi.nlm.nih.gov/pubmed/20026094.

Zhang Q, Bhattacharya S, Andersen ME, Conolly RB. (2010). Computational systems biology and dose-response
modeling in relation to new directions in toxicity testing. Journal of Toxicology and Environmental Health PartB:
Critical Reviews 13: 253-276. http://www.ncbi.nlm.nih.gov/pubmed/20574901.

Zhu H, Rao RS, ZengT, Chen L. (2012). Reconstructing dynamic gene regulatory networks from sample-based
transcriptional data. Nucleic Acids Research 40:10657-10667. http://www.ncbi.nlm.nih.gov/pubmed/23002138.

Zhuo W, Zhang L, Zhu B, Qiu Z, Chen Z. (2012). Association between CYP1A1 Ile462Val variation and acute leukemia
risk: Meta-analyses including 2164 cases and 4160 controls. Public Library of Science One 7: e46974.
http://www.ncbi.nlm.nih.gov/pubmed/23056546.

Zoeller RT, Crofton KM. (2005). Mode of action: Developmental thyroid hormone insufficiency-neurological
abnormalities resulting from exposure to propylthiouracil. Critical Reviews in Toxicology 35: 771-781.
http://www.ncbi.nlm.nih.gov/pubmed/16417044.

Zoeller RT, Dowling AL, Herzig CT, lannacone EA, Gauger KJ, Bansal R. (2002). Thyroid hormone, brain development,
and the environment. Environmental Health Perspectives 110 Suppl 3: 355-361.
http://www.ncbi.nlm.nih.gov/pubmed/12060829.

Zoeller RT, RovetJ. (2004). Timing of thyroid hormone action in the developing brain: Clinical observations and
experimental findings. Journal of Neuroendocrinology 16: 809-818.
httD://www.ncbi.nlm.nih.gov/Dubmed/15500540.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 111
-------
[page intentionally left blank]
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 112
-------
Appendix A. Technical Papers Supporting the NexGen Report
Technical Papers Supporting the Report
Advancing the Next Generation of Risk Assessment by Ha Cote, Paul Anastas, Linda Birnbaum, Becki
Clark, David Dix, Stephen Edwards, and Peter Preuss (2012)
Advancing the Next Generation (NexGen) of Risk Assessment: The Prototypes Workshop by EPA
(2010)
Summary Report of Advancing the Next Generation of Risk Assessment Public Dialogue Conference
by EPA(2011a)
A Framework for the Next Generation of Risk Assessment by Daniel Krewski, Margit Westphal, Greg
Paoli, Maxine Croteau, Mustafa Al-Zoughool, Mel Andersen, Weihsueh Chiu, Lyle Burgoon, and Ha
Cote (2013)
Reconsideration of Important Risk Assessment Issues Informed by Molecular Systems Biology by
Daniel Krewski, Melvin Andersen, Kim Boekelheide, Frederic Bois, Lyle Burgoon, Weihsueh Chiu,
Michael DeVito, Hisham EI-Masri, Lynn Flowers, Michael Goldsmith, Derek Knight, Thomas Knudsen,
William Lefew, Greg Paoli, Edward Perkins, Ivan Rusyn, Cecilia Tan, Linda Teuschler, Russell Thomas,
Maurice Whelan, Timothy Zacharewski, Lauren Zeise, and Ha Cote (in preparation)
Characterization of Changes in Gene Expression and Biochemical Pathways at Low Levels of Benzene
Exposure by Reuben Thomas, Alan Hubbard, Cliona McHale, Luoping Zhang, Stephen Rappaport,
Qing Lan, Nathaniel Rothman, Kathryn Guyton, Jennifer Jinot, BabasahebSonawane, and Martyn
Smith (in preparation)
Current Understanding of the Mechanism of Benzene-Induced Leukemia in Humans: Implications for
Risk Assessment by Cliona McHale, Luoping Zhang, and Martyn Smith (2012)
Benzene, the Exposome and Future Investigations of Leukemia Etiology by Martyn Smith, Luoping
Zhang, Cliona McHale, Christine Skibola, and Stephen Rappaport (2011)
Global Gene Expression Profiling of a Population Exposed to a Range of Benzene Levels by Cliona
McHale, Luoping Zhang, Qing Lan, Roel Vermeulen, Guilan Li, Alan Hubbard, Kristin Porter, Reuben
Thomas, Christopher Portier, Min Shen, Stephen Rappaport, Songnian Yin, Martyn Smith, and
Nathaniel Rothman (2011)
Temporal Profile of Gene Expression Alterations In Primary Human Bronchial Epithelial Cells
Following In Vivo Exposure to 0.3 ppm Ozone (meeting abstract) by Kelly Duncan, James Crooks,
David Miller, Lyle Burgoon, Michael Schmitt, Stephen Edwards, David Diaz-Sanchez, and Robert
Devlin (2013)
Transcriptional Profiling of Ozone-Induced Stress Responses in Primary Human Bronchial Epithelial
Cells Cultured at an Air-Liquid Interface by Kelly Duncan et al. (in preparation)
Ozone-induced Inflammation Is not Mediated via the Canonical NF-KB Pathway in Humans by David
Miller, Stephen Edwards, Lyle Burgoon, Rory Conolly, William Lefew, Kelly Duncan, Robert Devlin,
and James Samet (in preparation)
Systems Biology Informed Assessment of Benzo[a]pyrene/Polycyclic Aromatic Hydrocarbons and Liver
Cancer by Lyle Burgoon and Emma McConnell (in preparation)
IRIS Toxicological Review of Benzofalpyrene (Public Comment Draft). U.S. Environmental Protection
Agency, Washington, DC, EPA/635/R-13/138a-b (2013).
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 A-l
-------
Technical Papers Supporting the Report
Data Mining Informed Risk Analysis of Environmental and Genetic Factors Associated with Type 2
Diabetes Mellitus by Lyle Burgoon (in preparation)
Data Mining NHANES to Identify Environmental Chemical and Disease Associations by Shannon Bell
and Stephen Edwards (in preparation)
Systematic Identification of Interaction Effects Between Genome- and Environment-Wide
Associations in Type 2 Diabetes Mellitus by Chirag Patel, Rong Chen, Keiichi Kodama, John loannidis,
and Atul Butte (2013)
Data-Driven Integration ofEpidemiological and Toxicological Data to Select Candidate Interacting
Genes and Environmental Factors in Association with Disease by Chirag Patel, Rong Chen, and Atul
Butte (2012a)
Genetic Variability in Molecular Responses to Chemical Exposure by Chirag Patel and Mark Cullen
(2012)
Role of Environmental Chemicals in Diabetes and Obesity: An NTP Workshop Review by Kristina
Thayer, Jerrold Heindel, John Bucher, and Michael Gallo (2012)
Current Perspectives on the Use of Alternative Species in Human Health and Ecological Hazard
Assessments by Edward Perkins, Gerald Ankley, Kevin Crofton, Natalia Garcia-Reyero, Carlie LaLone,
Mark Johnson, Joseph Tietge, and Daniel Villeneuve (2013)
Propiconazole Inhibits Steroidogenesis and Reproduction in the Fathead Minnow (Pimephales
promelas) by Sarah Skolness, Chad Blanksma, Jenna Cavallin, Jessica Churchill, Elizabeth Durhan,
Kathleen Jensen, Rodney Johnson, Michael Kahl, Elizabeth Makynen, Daniel Villeneuve, and Gerald
Ankley (2013)
Zebrafish Developmental Screening of the ToxCast™ Phase I Chemical Library by Stephanie Padilla,
Daniel Corum, Beth Padnos, Deborah Hunter, Andrew Beam, Keith Houck, Nisha Sipes, Nicole
Kleinstreuer, Thomas Knudsen, David Dix, and David Reif (2012)
A Systems Toxicology Approach to Elucidate the Mechanisms Involved in RDX Species-Specific
Sensitivity by Christopher Warner, Kurt Gust, Jacob Stanley, Tanwir Habib, Mitchell Wilbanks, Natalia
Garcia-Reyero, and Edward Perkins (2012)
Development of a Paradigm for the Next Generation of Chemical Risk Assessment: Short-term In Vivo
Models for Tier 2 Assessments by Michael DeVito, Russell Thomas, and Jason Lambert (in
preparation)
Incorporating New Technologies into Toxicity Testing and Risk Assessment: Moving from 21st Century
Vision to a Data-Driven Framework by Russell Thomas, Martin Philbert Scott Auerbach, Barbara
Wetmore, Michael Devito, lla Cote, Craig Rowlands, Maurice Whelan, Sean Hays, Melvin Andersen,
Bette Meek, Lawrence Reiter, Jason Lambert, Harvey Clewell III, Martin Stephens, Jay Zhao, Scott
Wesselkamper, Lynn Flowers, Edward Carney, Timothy Pastoora, Dan Petersen, Carole Yauk, and
Andy Nong(2013a)
Temporal Concordance Between Apical and Transcriptional Points of Departure for Chemical Risk
Assessment by Russell Thomas, Scott Wesselkamper, Nina Wang, Jay Zhao, Dan Peterson, Jason
Lambert, lla Cote, Yang Longlong, Eric Healy, Michael Black, Harvey Clewell, Bruce Allen, and Melvin
Andersen (2013b)

Integrating Pathway-Based Transcriptomic Data into Quantitative Chemical Risk Assessment: A Five
Chemical Case Study by Russell Thomas, Harvey Clewell III, Bruce Allen, Longlong Yang, Eric Healy,
and Melvin Andersen (2012)

Application of Transcriptional Benchmark Dose Values in Quantitative Cancer and Noncancer Risk
Assessment by Russell Thomas, Harvey Clewell III, Bruce Allen, Scott Wesselkamper, Nina Ching
Wang, Jason Lambert, Janet Hess-Wilson, Jay Zhao, and Melvin Andersen (2011)
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 A-2
-------
Technical Papers Supporting the Report
Predictive QSAR Modeling: Methods and Applications in Drug Discovery and Chemical Risk
Assessment by Alexander Golbraikh, Xiang Simon Wang, Hao Zhu, and Alexander Tropsha (2012)
Developmental Toxicity Prediction by Raghuraman Venkatapathy and Nina Wang (2013)
Predictive Modeling of Chemical Hazard by Integrating Numerical Descriptors of Chemical Structures
and Short-term Toxicity Assay Data by Ivan Rusyn, Alexander Sedykh, Yen Low, KZ Guyton, and
Alexander Tropsha (2012)
An In Silica Approach for Evaluating a Fraction-Based, Risk Assessment Method for Total Petroleum
Hydrocarbon Mixtures by Nina Ching Wang, Glenn Rice, Linda Teuschler, Joan Colman, and Raymond
Yang (2012b)
Application of Computational Toxicological Approaches in Human Health Risk Assessment I. A Tiered
Surrogate Approach by Nina Ching Yi Wang, Jay Zhao, Scott Wesselkamper, Jason Lambert, Dan
Petersen, and Janet Hess-Wilson (2012a)
Development of Quantitative Structure-Activity Relationship (QSAR) Models to Predict the
Carcinogenic Potency of Chemicals. II. Using Oral Slope Factor as a Measure of Carcinogenic Potency
by Nina Ching Yi Wang, Raghuraman Venkatapathy, Robert Mark Bruce, and Chandrika Moudgal
(2011)
Perspectives on Validation of High-Throughput Assays Supporting 21st Century Toxicity Testing by
Richard Judson, Robert Kavlock, Matthew Martin, David Reif, Keith Houck, Thomas Knudsen, Ann
Richard, Raymond Tice, Maurice Whelan, Menghang Xia, Ruili Huang, Christopher Austin, George
Daston, Thomas Hartung, John Fowle III, William Wooge, Weida Tong, and David Dix (2013)
Estimating Toxicity-Related Biological Pathway Altering Doses for High-Throughput Chemical Risk
Assessment by Richard Judson, Robert Kavlock, WoodrowSetzer, Elaine Cohen Hubal, Matthew
Martin, Thomas Knudsen, Keith Houck, Russell Thomas, Barbara Wetmore, and David Dix (2011)
Addressing Human Variability in Next Generation Health Assessments of Environmental Chemicals by
Lauren Zeise, Frederic Bois, Weihsueh Chiu, Dale Hattis, Ivan Rusyn, and Kathryn Guyton (2012)
Quantitative High-Throughput Screening for Chemical Toxicity in a Population-Based In Vitro Model
by Eric Lock, Nour Abdo, Ruili Huang, Menghang Xia, Oksana Kosyk, Shannon O'Shea, Yi-Hui Zhou,
Alexander Sedykh, Alexander Tropsha, Christopher Austin, Raymond Tice, Fred Wright, and Ivan
Rusyn (2012)
Predicting Later-Life Outcomes of Early-Life Exposures by Kim Boekelheide, Bruce Blumberg, Robert
Chapin, Ha Cote, Joseph Graziano, Amanda Janesick, Robert Lane, Karen Lillycrop, Leslie Myatt,
Christopher States, Kristina Thayer, Michael Waalkes, and John Rogers (2012)
In Vitro Screening for Population Variability in Chemical Toxicity by Shannon O'Shea, John Schwarz,
Oksana Kosyk, Pamela Ross, Min Jin Ha, Fred Wright, and Ivan Rusyn (2011)
Improving Cumulative Risk Assessment Through Systems and Network Biology Driven Data Mining by
Timothy Zacharewski, lla Cote, Linda Teuschler, and Lyle Burgoon (submitted)
The Role of Advanced Biological Methods and Data in Regulatory Rationality by Douglas Crawford-
Brown (2013)
Incorporating New Technologies into Toxicity Testing and Risk Assessment: Moving from 21st Century
Vision to a Data-Driven Framework T by Russell S. Thomas, Martin Philbert, Scott Auerbach, Barbara
Wetmore, Michael Devito, lla Cote, et al. (2013)

Note: EPA also thanks Christine Sofge, Paul Schulte, and Ainsley Weston for sharing their
pre-publication draft manuscript
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 A-3
-------

Glossary Term
adverse outcome pathway
(AOP)
ArrayTrack™
assay
Bayesian Network
Description

An adverse outcome pathway is the mechanistic or predictive relationship
between an initial chemical-biological interaction (i.e., molecular initiating
event[s]) and subsequent perturbation to cellular functions sufficient to elicit
disruptions at higher levels of organization, culminating in an adverse
phenotypic outcome in an individual and population relevant to risk assessment
(i.e., disease progression or organ dysfunction in humans).
Ankley GT; Bennett RS; Erickson RJ; Hoff DJ; Hornung MW; Johnson RD; Mount
DR; Nichols JW; Russom CL; Schmieder PK; Serrrano JA; Tietge JE; Villeneuve DL
(2010). Adverse outcome pathways: A conceptual framework to support
ecotoxicology research and risk assessment. Environmental Toxicology and
Chemistry 29 (3): 730-741.
http://service004.hpc.ncsu.edu/toxicology/websites/iournalclub/linked files/Fal
llO/Environ0/o20Toxicol0/o20Chem0/o202010%20Anklev.pdf.

Publicly available toxicogenomics software for DNA microarrays. It contains
three integrated components: (1) a database (MicroarrayDB) that stores
microarray data and associated toxicological information; (2) tools (TOOL) for
data visualization and analysis; and (3) libraries (LIB) that provide curated
functional data from public databases for data interpretation. Using
ArrayTrack™, an analysis method can be selected from TOOL and applied to
selected microarray data stored in the MicroarrayDB. Analysis results can be
linked directly to pathways, gene ontology, and other functional information
stored in LIB.
Food and Drug Administration (FDA). (2012). ArrayTrack™ FAQs. Available online
at
http://www.fda.gov/ScienceResearch/BioinformaticsTools/Arraytrack/ucml350
70.htm (accessed September 27, 2012).
1. The process of quantitative or qualitative analysis of a component of a
sample; or
2. Results of a quantitative or qualitative analysis of a component of a sample.
National Library of Medicine. (2012). IUPAC Glossary of Terms Used in
Toxicology, 2nd Ed. Available online at
http://sis.nlm.nih.gov/enviro/iupacglossary/frontmatter.html (accessed
September 27, 2012).
A graph-based model of joint multivariate probability distributions that captures
properties of conditional independence between variables.
Friedman N; Linial M; Nachman I; Pe'er D. (2000). Using Bayesian networks to
analyze expression data. Journal of Computational Biology 7 (3-4): 601-620.
September 2013
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
B-l
-------
Glossary Term
Bayesian Network
reconstruction
bioinformatics
biological assay (bioassay)
biological pathway altering
dose (BPAD)
biomarkers
Description

The process of integrating Bayesian Network data using a software program to
generate gene causal networks predictive of complex phenotypes.
Wang I.; Zhang B; Yang X; Stepaniants S; Zhang C; Meng Q; Peters M; He Y; Ni C;
Slipetz D; Crackower MA; Houshyar H; Tan CM; Asante-Appiah E; O'Neill G; Luo
MJ; Theiringer R; Yuan J; Chiu C; Lum PY; Lamb J; Boie Y; Wilkinson HA; Schadt E;
Dai H; Roberts C. (2012). Systems analysis of eleven rodent disease models
reveals an inflammatome signature and key drivers. Molecular Systems Biology 8
594.
A field of biology in which complex multivariable data from high-throughput
screening and genomic assays are interpreted in relation to target identification
and effects of sustained perturbations on organs and tissues to make biological
discoveries or predictions. This field encompasses all computational methods
and theories applicable to molecular biology and areas of computer-based
techniques for solving biological problems, including manipulation of models and
data sets.
National Center for Biotechnology Information (NCBI). (2012). Bioinformatics.
Available online at http://www.ncbi.nlm.nih.gov/mesh?term=bioinformatics
(accessed September 27, 2012^
A method of measuring the effects of a biologically active substance using an
intermediate in vivo or in vitro tissue or cell model under controlled conditions.
It includes virulence studies in animal fetuses in utero, mouse convulsion
bioassay of insulin, quantitation of tumor-initiator systems in mouse skin,
calculation of potentiating effects of a hormonal factor in an isolated strip of
contracting stomach muscle, etc.
National Center for Biotechnology Information (NCBI). (2012). Biological Assay.
Available online at http://www.ncbi.nlm.nih.gov/mesh?term=bioassay (accessed
September 27, 2012).
The provisional acceptable exposure level at the low end of the distribution of
the external dose required to perturb a biological pathway, accounting for
uncertainty and variability.
Judson RS; Kavlock RJ; Setzer RW; Hubal EA; Martin MT; Knudsen TB; Houck KA;
Thomas RS; Wetmore BA; Dix DJ. (2011). Estimating toxicity-related biological
pathway altering doses for high-throughput chemical risk assessment. Chem Res
Toxicol 24 (4): 451-462. http://dx.doi.org/10.1021/txl00428e
Measurable and quantifiable biological parameters (e.g., specific enzyme
concentrations, specific hormone concentrations, a specific gene phenotype
distribution in a population, presence of biological substances) that serve as
indices for health- and physiology-related assessments, such as disease risk,
psychiatric disorders, environmental exposure and its effects, disease diagnosis,
metabolic processes, substance abuse, pregnancy, cell line development,
epidemiologic studies.
National Center for Biotechnology Information (NCBI). (2012). Biological
Markers. Available online at
http://www.ncbi.nlm.nih.gov/mesh?term=biological%20markers (accessed
September 27, 2012).
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 B-2
-------
Glossary Term Description

cell biology The study of the structure, behavior, growth, reproduction, and pathology of
cells; and the function and chemistry of cellular components.
National Center for Biotechnology Information (NCBI). (2012). Cell Biology
Available online at http://www.ncbi.nlm.nih.gov/mesh?term=cell%20biology
(accessed September 27, 2012^
Chemical Effects in Biological An NIH/NIEHS publicly available toxicogenomic database that houses data of
Systems (CEBS) database interest to environmental health scientists. CEBS has received depositions of
data from academic, industrial, and governmental laboratories. CEBS is designed
to display data in the context of biology and study design, and to permit data
integration across studies for novel meta-analysis.
National Institute for Environmental Health Sciences (NIEHS). (2012). Chemical
Effects in Biological Systems (CEBS). Available online at
http://www.niehs.nih.gov/research/resources/databases/cebs/index.cfm
(accessed September 27, 2012).
Comparative Toxicogenomic
Database (CTD)™
computational models
A publicly available toxicogenomic database on the National Library of
Medicine's (NLM) Toxicology Data Network (TOXNET®). The CTD™ elucidates
molecular mechanisms by which environmental chemicals affect human disease.
It contains manually curated data describing cross-species chemical-
gene/protein interactions and chemical- and gene-disease relationships. The
results provide insight into the molecular mechanisms underlying variable
susceptibility and environmentally influenced diseases. These data also will
provide insights into complex chemical-gene and protein interaction networks.
National Library of Medicine (NLM). (2012). Fact Sheet. Comparative
Toxicogenomics Database (CTD)™. Available online at
http://www.nlm.nih.gov/pubs/factsheets/ctdfs.html (accessed September 27,
2012).
Computerized predictive tools. Sometimes referred to as "in silico" models.
U.S. Environmental Protection Agency (EPA). (2012). Glossary of Terms: Methods
of Toxicity Testing and Risk Assessment. Available online at
http://www.epa.gov/opp00001/science/comptox-glossary.html (accessed April
2, 2013).
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 B-3
-------
Glossary Term
decision context
DNA microarray
Enzyme-Linked
Immunosorbent Assay
(ELISA)
Description

Decision context seeks to understand and describe what management decisions
are being made, why these decisions are made, and the relationship of these
decisions to previous and anticipated decisions. For example, decision context
tries to answer some of the following questions: Are risks being ranked; if so,
why? How will risk information be used in future decisions? Is a change in policy
or management under consideration; and if so, what is driving the change and
what are the underlying policy objectives? What is the general scope of
alternatives under consideration and why?
Decision context defines the roles and responsibilities of the ultimate decision
maker, stakeholders, and key technical experts in relation to the decision
process. Decision context also identifies the constraints within which a decision
must be made and outputs that will result from the decision.
Structured Decision Making (SDM). (2008). Steps in the Decision Process:
Introduction. Available online at
http://www.structureddecisionmaking.org/DecisionContext.htm (accessed
March 19, 2013).
A grid of nucleic acid molecules of known sequence linked to a solid substrate,
which can be probed with a sample containing either messenger RNA or
complementary DNA from a cell or tissue to reveal changes in gene expression
relative to a control sample. Microarray technology, also known as "DNA gene
chip" technology, enables the expression of many thousands of genes to be
assessed in a single experiment. DNA microarrays exploit the ability of
complementary strands of nucleic acids to base-pair with each other and bind.
For example, ATATGCGC will bind to its complement (TATACGCG) with a certain
affinity. DNA copies (cDNAs) are melted, or denatured, to single strands, which
then can be used to bind to, or hybridize with, fluorescently labeled nucleic acid
samples from cancerous or normal cells. After washing away the unbound
molecules, bound fluorescent nucleic acid samples can be identified by laser
microscopy. Fluorescent dots indicate expressed genes, and differences in
microarray patterns between normal and cancerous cells can be quickly
identified.
National Library of Medicine. (2012). IUPAC Glossary of Terms Used in
Toxicology, 2nd Ed. Available online at
http://sis.nlm.nih.gov/enviro/iupacglossary/frontmatter.html (accessed
September 28, 2012).
An immunoassay utilizing an antibody labeled with an enzyme marker such as
horseradish peroxidase. Although either the enzyme or the antibody is bound to
an immunosorbent substrate, they both retain their biologic activity; the change
in enzyme activity as a result of the enzyme-antibody-antigen reaction is
proportional to the concentration of the antigen and can be measured
spectrophotometrically or with the naked eye. Many variations of the method
have been developed.
National Center for Biotechnology Information (NCBI). (2012). Enzyme-Linked
Immunosorbent Assay. Available online at
http://www.ncbi.nlm.nih.gov/mesh?term=elisa (accessed September 27, 2012).
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 B-4
-------
Glossary Term
epigenetics
functional genomics
gene-environment
interaction
gene expression
Gene Expression Omnibus
(GEO)
Description

An emerging field of science that studies heritable changes caused by the
activation and deactivation of genes with no change in the underlying DNA
sequence of the organism. The word is Greek in origin and literally means over
and above (epi) the genome.
National Human Genome Research Institute (NHGRI). (2012). Talking Glossary of
Genetic Terms. Available online at
http://www.genome.gov/glossary/index.cfm?id=528&textonly=true (accessed
September 27, 2012).
The study of dynamic cellular processes such as gene transcription, translation,
and gene product interactions that define an organism.
The National Institutes of Health (NIH). (2009). Genomics and Advanced
Technologies. Available online at
http://www.niaid.nih.gov/topics/pathogengenomics/Pages/definitions.aspx
(accessed September 28, 2012).
The combined effects of genotypes and environmental factors on phenotypic
characteristics.
National Center for Biotechnology Information (NCBI). (2012). Gene-
Environment Interaction. Available online at
http://www.ncbi. nlm.nih.gov/mesh?term=gene%20environment%20interaction
(accessed September 28, 2012).
The phenotypic manifestation of a gene or genes by the processes of genetic
transcription and genetic translation.
National Center for Biotechnology Information (NCBI). (2012). Gene Expression.
Available online at http://www.ncbi.nlm.nih.gov/mesh/68015870 (accessed
September 28, 2012).
A public repository that archives and freely distributes microarray, next-
generation sequencing, and other forms of high-throughput functional genomic
data submitted by the scientific community. In addition to data storage, a
collection of Web-based interfaces and applications is available to help users
query and download the studies and gene expression patterns stored in GEO.
National Center for Biotechnology Information (NCBI). (2012). Gene Expression
Omnibus. Frequently Asked Questions. Available online at
http://www.ncbi.nlm.nih.gov/geo/info/faq.html (accessed September 27, 2012).
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 B-5
-------
Glossary Term
Gene Ontology (GO)
database
genetics
genome-wide association
study (GWAS)
green chemistry
Description

A product of the Gene Ontology (GO) project. The GO project provides
structured, controlled vocabularies and classifications that cover several
domains of molecular and cellular biology and are freely available for community
use in the annotation of genes, gene products, and sequences. Many model
organism databases and genome annotation groups use the GO database and
contribute their annotation sets to the GO resource. The GO database integrates
the vocabularies and contributed annotations and provides full access to this
information in several formats. Members of the GO Consortium continuously
work collectively, involving outside experts as needed, to expand and update the
GO vocabularies. The GO Web resource also provides access to extensive
documentation about the GO project and links to applications that use GO data
for functional analyses.
Gene Ontology Consortium. (2004). The Gene Ontology (GO) database and
informatics resource. Nucleic Acids Research 32: Database issue D258-261.
The branch of science concerned with the means and consequences of
transmission and generation of the components of biological inheritance. Used
for mechanisms of heredity and the genetics of organisms, for the genetic basis
of normal and pathologic states, and for the genetic aspects of endogenous
chemicals. It includes biochemical and molecular influence on genetic material.
National Center for Biotechnology Information (NCBI). (2012). Genetics.
Available online at http://www.ncbi.nlm.nih.gov/mesh?term=genetics (accessed
September 27, 2012).
An approach used in genetics research to associate specific genetic variations
with particular diseases. The method involves scanning the genomes from many
different people and looking for genetic markers that can be used to predict the
presence of a disease. Once such genetic markers are identified, they can be
used to understand how genes contribute to the disease and develop better
prevention and treatment strategies.
National Institutes of Health (NIH). (2012). Talking Glossary of Genetic Terms:
Genome-wide Association Studies (GWAS). National Human Genome Research
Institute. Available online at
http://www.genome.gov/glossary/index.cfm?id=91&textonly=true (accessed
September 27, 2012).
The design of chemical products and processes to reduce or eliminate the use
and generation of hazardous substances. Green Chemistry framework includes
three main principles: (1) to incorporate sustainable designs across all stages of
the chemical lifecycle, (2) to reduce the hazard of chemical products and
processes by design, and (3) to work as a cohesive set of design criteria. Twelve
design criteria have been developed to fulfill these three principles (prevention,
atom economy, less hazardous chemical synthesis, designing safer chemicals,
safer solvents and auxiliaries, design for energy efficiency, use of renewable
feedstocks, reduce derivatives, catalysis, design for degradation, real-time
analysis for pollution prevention, and inherently safer chemistry for accident
prevention).
Anastas, P, Eghbali, N. (2010). Green chemistry: Principles and practice. Chem
Soc Rev 39(1): 301-312.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 B-6
-------
Glossary Term

high-throughput screening
(HTS)
in silico
IVIV extrapolation (IVIVE)
Description

A rapid method of measuring the effect of an agent in a biological or chemical
assay. The assay usually involves some form of automation or a way to conduct
multiple assays at the same time using sample arrays.
National Center for Biotechnology Information (NCBI). (2012). High-Throughput
Screening Assays. Available online at
http://www.ncbi. nlm.nih.gov/mesh?term=high%20throughput%20screening%2
Omethod (accessed September 27, 2012).
Referring to or describing data generated and analyzed using computer
modeling and information technology.
National Library of Medicine. (2012). IUPAC Glossary of Terms Used in
Toxicology, 2nd Ed. Available online at
http://sis.nlm.nih.gov/enviro/iupacglossary/frontmatter.html (accessed
September 27, 2012).
A method that uses determinations of protein binding, liver/kidney clearance,
and oral uptake to estimate ranges of oral human exposures leading to
tissue/plasma concentrations similar to in vitro point-of-departure
concentrations.
Krewski D; Westphal M; Paoli G; Croteau M; Al-Zoughool M; Andersen M; Chiu
W; Cote I. (in preparation). A framework for the next generation of risk science.
Provide an alternative approach for storing and searching the complete
networks of highly interconnected information produced by linking bioassays
and pathways. Developed decades ago to codify human knowledge so that they
could be used to efficiently support decisions, knowledgebases are finding
practical applications in meaningfully organizing vast amounts of linked
biological data using ontologies.
Kyoto Encyclopedia of Genes A database resource that integrates genomic, chemical, and systemic functional
and Genomes (KEGG) information. In particular, gene catalogs from completely sequenced genomes
are linked to higher level systemic functions of the cell, the organism, and the
ecosystem. KEGG is a reference knowledgebase for integration and
interpretation of large-scale data sets generated by genome sequencing and
other high-throughput experimental technologies.
Kanehisa Laboratories. (2012). KEGG: Kyoto encyclopedia of genes and genomes.
Available online at http://www.genome.jp/kegg/ (accessed February 22, 2013).
knowledgebases
lift
Lift is a measure of how much better prediction results are using a model than
could be obtained by chance. For example, say 2% of customers who receive a
catalog in the mail make a purchase, and when a model is used to select catalog
recipients, 10% make a purchase. The lift for the model would be 10/2 or 5.
Oracle. (2013). Glossary: "Lift". Available online at
http://docs.oracle.com/cd/B28359 01/datamine.lll/b28129/glossary.htm
(accessed March 20, 2013).
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 B-7
-------
Glossary Term
Meta Data Viewer
microarray analysis
microarray technology
mode of action
mode-of-action-based
in vitro toxicity pathway
assays
Description

A publicly available graphical display software program that can be used to
graph animal and human data. Meta Data Viewer can display up to 15 text
columns and to graph 1-5 numerical values. Users can sort, group, and filter
data and examine patterns of findings across studies. Users can use the program
and any associated National Toxicology Program (NTP) data files for their own
purposes, including for use in publications.
National Toxicology Program (NTP). (2012). Meta Data Viewer. Available online
athttp://ntp.niehs.nih.gov/?obiectid=lDF7D40E-A957-9727-
733C9B89E243634B (accessed September 27, 2012).
The simultaneous analysis, on a microchip, of multiple samples or targets
arranged in an array format.
National Center for Biotechnology Information (NCBI). (2012). Microarray
Analysis. Available online at
http://www.ncbi.nlm.nih.gov/mesh/?term=microarrav%20analysis (accessed
September 27, 2012).
A developing technology used to study the expression of many genes at once. It
involves placing thousands of gene sequences in known locations on a glass slide
called a gene chip. A sample containing DNA or RNA is placed in contact with the
gene chip. Complementary base pairing between the sample and the gene
sequences on the chip produces light that is measured. Areas on the chip
producing light identify genes that are expressed in the sample.
National Human Genome Research Institute (NHGRI). (2012). Talking Glossary of
Genetic Terms. Available online at
http://www.genome.gov/glossarv/index.cfm?id=125&textonly=true (accessed
September 27, 2012).
The key steps in the toxic response after chemical interaction at the target site
that is responsible for the physiological outcome or pathology of the chemical;
how chemicals perturb normal biological function.
U.S. Environmental Protection Agency (EPA). (2012). Glossary of Terms: Methods
of Toxicity Testing and Risk Assessment. Available online at
http://www.epa.gov/opp00001/science/comptox-glossary.html (accessed April
2, 2013).
Fit-for-purpose assays using human cells to assess biological pathway
perturbations based on specific or generic modes of action. The suite of these
assays would form the test battery for safety assessment.
Krewski D; Westphal M; Paoli G; Croteau M; Al-Zoughool M; Andersen M;
Chiu W; Cote I. (in preparation). A framework for the next generation of risk
science.
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 B-8
-------
Glossary Term
molecular epidemiology
omics
ontology
phenotype
polymerase chain reaction
(PCR)
Description

Referring to the application of molecular biology to answer epidemiological
questions. The examination of patterns of changes in DNA to implicate particular
carcinogens and the use of molecular markers to predict which individuals are at
highest risk for a disease are common examples. Molecular epidemiology
incorporates molecular markers of exposure and biological change into
population-based studies; integrates knowledge of the human genome into
epidemiological studies to understand genetic susceptibility and gene-
environment interaction in disease causation.
National Center for Biotechnology Information (NCBI). (2012). Molecular
Epidemiology. Available online at
http://www.ncbi. nlm.nih.gov/mesh?term=molecular%20epidemiology
(accessed September 27, 2012); Krewski D; Westphal M; Paoli G; Croteau M;
Al-Zoughool M; Andersen M; Chiu W; Cote I. (in preparation). A framework for
the next generation of risk science.
Refers to a broad field of study in biology, ending in the suffix "-omics" such as
genomics, proteomics, transcriptomics.
U.S. Environmental Protection Agency (EPA). (2012). Glossary of Terms: Methods
of Toxicity Testing and Risk Assessment. Available online at
http://www.epa.gov/opp00001/science/comptox-glossary.html (accessed April
2, 2013).
Defines types of data (e.g., chemicals, genes, assays, interactions, pathways,
cells, species) and their interrelationships (chemicals "activate" proteins; assays
"measure" changes in proteins; genes are "part of" pathways, etc.).
An individual's observable traits, such as height, eye color, and blood type. The
genetic contribution to the phenotype is called the genotype. Some traits are
largely determined by the genotype, while other traits are largely determined by
environmental factors.
National Human Genome Research Institute (NHGRI). (2012). Talking Glossary of
Genetic Terms. Available online at
http://www.genome.gov/glossarv/index.cfm?id=152&textonly=true (accessed
September 27, 2012).
A method for amplifying a DNA base sequence using a heat-stable polymerase
and two 20-base primers, one complementary to the (+) strand at one end of the
sequence to be amplified and one complementary to the (-) strand at the other
end. Because the newly synthesized DNA strands can subsequently serve as
additional templates for the same primer sequences, successive rounds of
primer annealing, strand elongation, and dissociation produce rapid and highly
specific amplification of the desired sequence. PCR also can be used to detect
the existence of the defined sequence in a DNA sample.
Department of Energy (DOE). (2010). Human Genome Project Information:
Genome Glossary. Available online at
http://www.ornl.gov/sci/techresources/Human Genome/glossary/glossary p.sh
tml (accessed September 27, 2012).
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 B-9
-------
Glossary Term
principal components
analysis (PCA)
probe
proteomics
quantitative structure
activity relationship (QSAR)
QSAR Toolbox
Description

A mathematical procedure that transforms several possibly correlated variables
into a smaller number of uncorrelated variables called principal components.
National Center for Biotechnology Information (NCBI). (2012). Principal
Components Analysis. Available online at
http://www.ncbi.nlm.nih.gov/mesh?term=principal%20component%20analysis
(accessed September 27, 2012).
Single-stranded DNA or RNA molecules of specific base sequence, labeled either
radioactively or immunologically, that are used to detect the complementary
base sequence by hybridization.
Department of Energy (DOE). (2010). Human Genome Project Information:
Genome Glossary. Available online at
http://www.ornl.gov/sci/techresources/Human Genome/glossary/glossary p.sh
tml (accessed September 27, 2012).
The study of the function of all expressed proteins.
U.S. Environmental Protection Agency (EPA). (2012). Glossary of Terms: Methods
of Toxicity Testing and Risk Assessment. Available online at
http://www.epa.gov/opp00001/science/comptox-glossary.html (accessed April
2, 2013).
A mathematical relationship between a quantifiable aspect of chemical structure
and a chemical property or reactivity or a well-defined biological activity, such as
toxicity. Using a sample set of chemicals, a relationship is established between
one or many physical-chemical properties a chemical possesses due to its
structure and a chemical property or biological activity of concern. This
mathematical expression is then used to predict the chemical property or
biological response expected from other chemicals with similar structures. It is
based on the presumption that similar molecules or chemical structures have
similar properties or biological activities or toxicity potential.
U.S. Environmental Protection Agency (EPA). (2012). Glossary of Terms: Methods
of Toxicity Testing and Risk Assessment. Available online at
http://www.epa.gov/opp00001/science/comptox-glossary.html (accessed April
2, 2013).
A software application intended for use by government, the chemical industry,
and other stakeholders in filling gaps in (eco)toxicity data needed for assessing
the hazards of chemicals. The Toolbox incorporates information and tools from
various sources into a logical workflow. Crucial to this workflow is grouping
chemicals into chemical categories. The seminal features of the Toolbox are
identification of relevant structural characteristics and the potential mechanism
or mode of action of a target chemical, identification of other chemicals that
have the same structural characteristics or mechanism/mode of action (or both),
and use of existing experimental data to fill the data gap(s).
QSAR Toolbox. (2012). About: What does the QSAR Toolbox do? Available online
at http://www.qsartoolbox.org/ (accessed September 28, 2012).
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 B-10
-------
Glossary Term
reverse toxicokinetics (RTK)
rule
ruleset
SNPs
stem cell biology
Description

Also known as reverse dosimetry, refers to the use of a pharmacokinetic model
to estimate external dose (exposure) from a known internal concentration. The
method uses a one-compartment model and makes default assumptions such as
chemicals are eliminated wholly through metabolism and renal excretion; renal
excretion is a function of the glomerular filtration rate and the fraction of
unbound chemical in the blood (i.e., no active transport); and oral absorption is
100%. Using these assumptions, the plasma concentration of the chemical at
steady state per unit dose then can be estimated. The two experimental
chemical-specific parameters required to generate an estimate are the rate of
disappearance of parent via hepatic metabolism (intrinsic clearance) and
fraction bound (or conversely unbound) to plasma proteins. Both parameters
can be measured experimentally in a relatively high-throughput manner.
Judson RS; Kavlock RJ; Setzer RW; Hubal EA; Martin MT; Knudsen TB; Houck KA;
Thomas RS; Wetmore BA; Dix DJ. (2011). Estimating toxicity-related biological
pathway altering doses for high-throughput chemical risk assessment. Chem Res
Toxicol Chem Res Toxicol 24 (4): 451-462. http://dx.doi.org/10.1021/txl00428e
A rule describes an association between elements on the left-hand side of the
rule and items on the right-hand side of the rule. For instance, the rule [diapers,
cola] => [milk] in a supermarket database might mean that when customers
bought diapers and cola, they also purchased milk.
A ruleset is a collection of one or more rules that can be associated with a realm
authorization, factor assignment, command rule, or secure application role. The
ruleset will be "true" or "false" based on evaluation of each rule in the ruleset
and the evaluation type for the ruleset, which can be "all true" or "any true."
Oracle. (2013). 5 Configuring Rule Sets. Available online at
http://docs.oracle.com/cd/B28359 01/server.lll/b31222/cfrulset.htm#DVAD
M70150 (accessed March 20, 2013).
Refers to single nucleotide polymorphisms, which are single nucleotide
variations in a genetic sequence that occur at appreciable frequency in the
population.
National Center for Biotechnology Information (NCBI). (2012). SNPs. Available
online at http://www.ncbi.nlm.nih.gov/mesh?term=SNPS (accessed September
28, 2012).
A branch of biology that studies and develops stem cells, which are cells with the
ability to divide for indefinite periods in culture and to give rise to specialized
cells.
The National Institutes of Health (NIH). (2009). Stem Cell Basics. Available online
at http://irp.nih.gov/catalvst/vl9i6/svstems-biology-as-defined-bv-nih (accessed
September 28, 2012).
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 B-ll
-------
Glossary Term
systems biology
TOM (topological overlap
matrix) heat map
toxicity pathways
toxicogenomics
transcription
Description

A scientific approach that combines the principles of engineering, mathematics,
physics, and computer science with extensive experimental data to develop a
quantitative as well as a deep conceptual understanding of biological
phenomena, permitting prediction and accurate simulation of complex
(emergent) biological behaviors.
Wanjek, C. (2011). Systems biology as defined by NIH. The NIH Catalyst 19 (6):
November-December, http://irp.nih.gov/catalvst/vl9i6/svstems-biology-as-
defined-by-nih.
A graphical representation in which the rows and columns represent genes in a
symmetric manner; the color intensity represents the interaction strength
between genes.
Wang I.; Zhang B; Yang X; Stepaniants S; Zhang C; Meng Q; Peters M; He Y; Ni C;
Slipetz D; Crackower MA; Houshyar H; Tan CM; Asante-Appiah E; O'Neill G; Luo
MJ; Theiringer R; Yuan J; Chiu C; Lum PY; Lamb J; Boie Y; Wilkinson HA; Schadt E;
Dai H; Roberts C. (2012). Systems analysis of eleven rodent disease models
reveals an inflammatome signature and key drivers. Molecular Systems Biology 8
594.
The 2007 NRC report on Toxicity Testing in the 21st Century envisioned that new
technologies will help us better understand how chemicals perturb normal
biological function, and thus identify toxicity pathways. Potential toxic effects of
chemicals would be predicted based on in vitro bioactivity profiles derived from
a chemical's effects on cellular molecules and processes. The interpretation of
chemically induced perturbations in toxicity pathways depends on linking in vitro
effects with adverse outcomes in vivo, and on computer modeling that
extrapolates to predicted responses in whole tissues, organisms, and
populations based on realistic human or environmental exposures.
U.S. Environmental Protection Agency (EPA). (2012). Glossary of Terms: Methods
of Toxicity Testing and Risk Assessment. Available online at
http://www.epa.gov/opp00001/science/comptox-glossary.html (accessed April
2, 2013).
Study of the roles that genes play in the biological responses to environmental
toxicants and stressors by the collection, interpretation, and storage of
information about gene and protein activity.
U.S. Environmental Protection Agency (EPA). (2012). Glossary of Terms: Methods
of Toxicity Testing and Risk Assessment. Available online at
http://www.epa.gov/opp00001/science/comptox-glossary.html (accessed April
2, 2013).
The biosynthesis of RNA carried out on a template of DNA. The biosynthesis of
DNA from an RNA template is called reverse transcription.
National Center for Biotechnology Information (NCBI). (2012). Transcription.
Available online at http://www.ncbi.nlm.nih.gov/mesh/68014158 (accessed
September 27, 2012).
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 B-12
-------
Glossary Term
transcriptome
transcriptomics
transgenic
translation
translesion synthesis
Virtual Tissue (v-Tissues™)
Models
Description

The pattern of gene expression, at the level of genetic transcription, in a specific
organism or under specific circumstances in specific cells.
National Center for Biotechnology Information (NCBI). (2012). Transcriptome.
Available online at http://www.ncbi.nlm.nih.gov/mesh/68059467 (accessed
September 27, 2012).
The study of gene expression at the RNA level.
U.S. Environmental Protection Agency (EPA). (2012). Glossary of Terms: Methods
of Toxicity Testing and Risk Assessment. Available online at
http://www.epa.gov/opp00001/science/comptox-glossary.html (accessed April
2, 2013).
Produced from a genetically manipulated egg or embryo; containing genes from
another species.
National Center for Biotechnology Information (NCBI). (2012). Transgenic.
Available online at http://www.ncbi.nlm.nih.gov/mesh/?term=transgenic
(accessed September 27, 2012).
The process of translating the sequence of a messenger RNA (mRNA) molecule
to a sequence of amino acids during protein synthesis. The genetic code
describes the relationship between the sequence of base pairs in a gene and the
corresponding amino acid sequence that it encodes. In the cell cytoplasm, the
ribosome reads the sequence of the mRNA in groups of three bases to assemble
the protein.
National Human Genome Research Institute (NHGRI). 2012. Talking Glossary of
Genetic Terms. Available online at
http://www.genome.gov/glossary/index.cfm?id=200&textonly=true (accessed
September 28, 2012).
A mechanism for DNA damage tolerance that allows the DNA replication
machinery to move beyond a DNA lesion or abasic site (i.e., a site that lacks a
DNA base).
In silico cross-scale models of cellular organization and emergent functions used
to better understand disease progression. Tissues are the clinically relevant level
for diagnosing and treating the transition from normal to adverse states in
chemical-induced toxicities leading to cancer, immune dysfunction,
developmental defects, and more. Currently, in vivo rodent experiments are
used to evaluate tissue-level effects of altered molecular and cellular function;
however, the extrapolation of animal models to humans is often uncertain.
v-Tissues™ aim to simulate key molecular and cellular processes computationally
in the context of normal tissue biology to: (1) help understand complex
physiological relationships, and (2) predict adverse effects due to chemicals. As
the number of chemicals in consumer products, the workplace, and the
environment continues to rise, v-Tissues™ offers the promise of a more efficient,
effective, and humane approach for evaluating their impact on human health.
U.S. Environmental Protection Agency (EPA), Computational Toxicology Research
Program. What are Virtual Tissues? (2012). Available online at
http://www.epa.gov/ncct/virtual tissues/what.html (accessed September 27,
2012).
This document is a draft for review purposes only and does not constitute Agency policy. Do not cite or quote.
September 2013 B-13
-------